
Table 1. U12-type introns identified in T. spiralis.

From: U12 type introns were lost at multiple occasions during evolution

| EST | Genomic sequence | Intron length | B | R | Relationship to C. elegans | C. elegans orthologue | Gene function |
|---|---|---|---|---|---|---|---|
| ATAC introns | | | | | | | |
| EX501652.1 | AGA\|ATATCCTTTC...TTGGGCATTGTTAT ATTTCCTTAACG GGTATGGTTTAC\|GTT | 2095 | + | + | U12->U2 | EEED8.7 | SR (splicing factor) |
| |    TyrIleGl       nPheGlnGl       nLeuLysAspAla | | | | | | |
| | Ts TACATAGA AT...ACGTTCGAAGA       GCTGAAAGATGCT | | | | | | |
| |     PheValAr      gPheTyrGl       nArgArgAspAla | | | | | | |
| | Ce  TTCGTTAG      ATTTTATGA GT...AGACGTCGTGCTGCT | | | | | | |
| EX500683.1 | ATT\|ATATCCTTTC...AATTTC ATTTCCTTAACG TTAGATTTTTGTTGTTTTAC\|TGA | 94 | + | + | U12 lost | E04D5.1a | NA |
| ES570647.1 | ATC\|ATATCCTTTC...GTATTGTTTGTA TTTTCCTTAACT TCATATGTTTTTAC\|GTA | 182 | + | + | U12 lost | Y37D8A.10 | Signal peptidase complex, subunit SPC25 |
| GTAG introns | | | | | | | |
| EX499999.1 | Ts GAG\|GTATCCTTTG...TTTTGTTTT TCTCTCTTTTTACA ATTATTATACAG\|GCC | 90 | - | + | U12->U2 | F10F2.1 | PH, BEACH and WD40 domain-containing protein |
| | Ce GAG\|GTTTGAAACA...TTTTAATATTGAACTAAAATTTTTGAATTTTCCAG\|GCG | 64 | | | | | |
| ES570692.1 | Ts TAG\|GTATTGTTTT...TGCTACAAGGAATTTTTTT ATTGCTTTGATT TTAG\|AGT | 617 | - | - | U12->U2 | F40F8.10 | Small ribosomal subunit S9 protein |
| | Ce CCG\|GTTTAGTTTT...AAGATTAGTATCGACTTCAAATTCTTCTCTTTCAG\|TGT | 291 | | | | | |
| ES561213.1 | Ts TCG\|GTATTATTTT...CATATTAATCGTTT CATTTCTTAATG TATTTTTAG\|TGG | 54 | - | - | U12->U2 | ZC395.10 | NA |
| | Ce CCA\|GTACGTTTCG...ACATAGAATGAGTCGTAATTCGTAAATTTTCAGAG\|GAA | 150 | | | | | |
| EX500486.1 | Ts TCG\|GTATTCTTTC...TAATATGTTTTTCT TTTTTTTCAACT TATTTTAAG\|ATT | 87 | - | + | U12->U2 | ZC328.3a | NA |
| | Ce CAT\|GTGAGTTTCA...TCCTGAATTTATTCAAGTTTCAACCACATTTCCAG\|CAT | 758 | | | | | |
| ES569928.1 | ATG\|GTATTCTTTT... ATTTCCATTACA AAATTACAACCGCGTTGTTCTTTCAG\|TGC | 107 | + | + | Not known | Y82E9BR.15 | Transcription elongation factor B |
| ES565768.1 | CAG\|GTATTCTTTT...CAAATTTTGGAAAAATTCT TTTTTTTTAATC CGAACAG\|GTA | 94 | - | + | Not known | C34D4.4a | NA |
| ES562099.1 | AAT\|GTATCCTTAA...TGTATGAGGTTTGGTATTT CTGATTTTAATC ATTTTAG\|TGT | 50 | - | + | Not known | R07E5.14 | (RRM RNA binding domain) containing |
| ES563059.1 | GCG\|GTATCTTTTC...TATTTATAACTGAATCG TTTTTATTAATA ATTTTTTAG\|AGT | 54 | - | + | Not known | M04F3.4 | NA |
| BQ738460.1 | ACG\|GTATCGTTCA...TCAATTTTTTTAAAAGTA ATTTTCTTCATA TATTTTAG\|AAC | 72 | - | - | U12 lost/Not known | Y56A3A.36 | NA |
| ES566079.1 | TGG\|GTATCGTTCG...ATTAACTAACACT TTGAAGTTGACA AGTGAATGTTTAG\|GAT | 140 | - | - | Not known | M02B7.4 | beta-1,4-N-acetylglucosaminyl transferase |
| EX500543.1 | TCG\|GTATTCTTTG... TTATTATTAATT TCTGTTTTTTTTGGTTTTCTAAACAG\|AGA | 86 | - | + | | None | |
| ES561535.1 | GGG\|GTATTATTTT... TTTTCTGTGATT TAATTGCATTTTAATGTTCTATCTAG\|TGA | 71 | - | - | | None | |
| BQ738918.1 | GAA\|GTATCTTTTA...TGAATTTTGCTAAA TTGTACTTAACA GGTTGTTTTTAG\|AAA | 153 | - | + | | None | |
 
  1. The table shows all introns identified by the method of Sheth et al. [11]. Regions with the best match to the branch site PWM are underlined. For one of the ATAC introns (EST EX501652.1), the shift in the 5' splice site observed between T. spiralis and C. elegans is shown. The 5' splice site rule (R) is that the 5' splice site sequence is RTATCCTT, where one of the Cs in positions +5 and +6 may be converted into a T. The Burge et al. method (B) is described in [5]. Ts = T. spiralis; Ce = C. elegans; NA = not available. For the sequences of the U2 and U12 introns listed in the table, see additional file 2.
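
The 5' splice site rule (R) in the footnote can be checked mechanically. The following is a minimal sketch, not the authors' implementation. It assumes that R is the IUPAC purine code (A or G), that positions are counted from the first intron nucleotide, and that "one of the Cs" means at most one of positions +5 and +6 is a T (so CC, TC and CT pass, but TT does not). The names `RULE_R`, `intron_start` and `satisfies_rule_r` are hypothetical and chosen only for illustration.

```python
import re

# Assumed reading of the rule: R = A or G, then TAT, then positions +5/+6
# as CC, TC or CT (at most one C replaced by T), then TT.
RULE_R = re.compile(r"[AG]TAT(?:CC|TC|CT)TT")


def intron_start(table_entry: str) -> str:
    """Return the first 8 intron nucleotides of an 'EXON|INTRON...' entry."""
    # Everything after the first '|' is intron sequence; an optional
    # species prefix such as 'Ts ' before the exon bases is ignored.
    _, _, rest = table_entry.partition("|")
    return rest[:8].upper()


def satisfies_rule_r(table_entry: str) -> bool:
    """True if the 5' splice site of the entry matches the assumed rule."""
    return RULE_R.fullmatch(intron_start(table_entry)) is not None


# 5' ends copied from the Genomic sequence column of the table.
examples = {
    "EX501652.1": "AGA|ATATCCTTTC",
    "ES570692.1": "Ts TAG|GTATTGTTTT",
    "EX500486.1": "Ts TCG|GTATTCTTTC",
    "ES563059.1": "GCG|GTATCTTTTC",
}

for est, entry in examples.items():
    print(est, intron_start(entry), "+" if satisfies_rule_r(entry) else "-")
```

Under these assumptions the check agrees with the R column for the four entries shown: the sites ATATCCTT, GTATTCTT and GTATCTTT pass, while GTATTGTT does not.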