From: U12 type introns were lost at multiple occasions during evolution
EST | Genomic sequence | Intron length | B | R | Relationship to C. elegans | C. elegans orthologue | Gene function |
---|---|---|---|---|---|---|---|
ATAC introns | |||||||
EX501652.1 | AGA|ATATCCTTTC...TTGGGCATTGTTAT ATTTCCTTAACG GGTATGGTTTAC|GTT | 2095 | + | + | U12->U2 | EEED8.7 | SR (splicing factor) |
TyrIleGl nPheGlnGl nLeuLysAspAla | |||||||
Ts TACATAGA AT...ACGTTCGAAGA GCTGAAAGATGCT | |||||||
PheValAr gPheTyrGl nArgArgAspAla | |||||||
Ce TTCGTTAG ATTTTATGA GT...AGACGTCGTGCTGCT | |||||||
EX500683.1 | ATT|ATATCCTTTC...AATTTC ATTTCCTTAACG TTAGATTTTTGTTGTTTTAC|TGA | 94 | + | + | U12 lost | E04D5.1a | NA |
ES570647.1 | ATC|ATATCCTTTC...GTATTGTTTGTA TTTTCCTTAACT TCATATGTTTTTAC|GTA | 182 | + | + | U12 lost | Y37D8A.10 | Signal peptidase complex, subunit SPC25 |
GTAG introns | |||||||
EX499999.1 | Ts GAG|GTATCCTTTG...TTTTGTTTT TCTCTCTTTTTACA ATTATTATACAG|GCC | 90 | - | + | U12->U2 | F10F2.1 | PH, BEACH and WD40 domain-containing protein |
Ce GAG|GTTTGAAACA...TTTTAATATTGAACTAAAATTTTTGAATTTTCCAG|GCG | 64 | ||||||
ES570692.1 | Ts TAG|GTATTGTTTT...TGCTACAAGGAATTTTTTT ATTGCTTTGATT TTAG|AGT | 617 | - | - | U12->U2 | F40F8.10 | Small ribosomal subunit S9 protein |
Ce CCG|GTTTAGTTTT...AAGATTAGTATCGACTTCAAATTCTTCTCTTTCAG|TGT | 291 | ||||||
ES561213.1 | Ts TCG|GTATTATTTT...CATATTAATCGTTT CATTTCTTAATG TATTTTTAG|TGG | 54 | - | - | U12->U2 | ZC395.10 | NA |
Ce CCA|GTACGTTTCG...ACATAGAATGAGTCGTAATTCGTAAATTTTCAGAG|GAA | 150 | ||||||
EX500486.1 | Ts TCG|GTATTCTTTC...TAATATGTTTTTCT TTTTTTTCAACT TATTTTAAG|ATT | 87 | - | + | U12->U2 | ZC328.3a | NA |
Ce CAT|GTGAGTTTCA...TCCTGAATTTATTCAAGTTTCAACCACATTTCCAG|CAT | 758 | ||||||
ES569928.1 | ATG|GTATTCTTTT... ATTTCCATTACA AAATTACAACCGCGTTGTTCTTTCAG|TGC | 107 | + | + | Not known | Y82E9BR.15 | Transcription elongation factor B |
ES565768.1 | CAG|GTATTCTTTT...CAAATTTTGGAAAAATTCT TTTTTTTTAATC CGAACAG|GTA | 94 | - | + | Not known | C34D4.4a | NA |
ES562099.1 | AAT|GTATCCTTAA...TGTATGAGGTTTGGTATTT CTGATTTTAATC ATTTTAG|TGT | 50 | - | + | Not known | R07E5.14 | RRM RNA binding domain) containing |
ES563059.1 | GCG|GTATCTTTTC...TATTTATAACTGAATCG TTTTTATTAATA ATTTTTTAG|AGT | 54 | - | + | Not known | M04F3.4 | NA |
BQ738460.1 | ACG|GTATCGTTCA...TCAATTTTTTTAAAAGTA ATTTTCTTCATA TATTTTAG|AAC | 72 | - | - | U12 lost/Not known | Y56A3A.36 | NA |
ES566079.1 | TGG|GTATCGTTCG...ATTAACTAACACT TTGAAGTTGACA AGTGAATGTTTAG|GAT | 140 | - | - | Not known | M02B7.4 | beta-1,4-N-acetylglucosaminyl transferase |
EX500543.1 | TCG|GTATTCTTTG... TTATTATTAATT TCTGTTTTTTTTGGTTTTCTAAACAG|AGA | 86 | - | + | None | ||
ES561535.1 | GGG|GTATTATTTT... TTTTCTGTGATT TAATTGCATTTTAATGTTCTATCTAG|TGA | 71 | - | - | None | ||
BQ738918.1 | GAA|GTATCTTTTA...TGAATTTTGCTAAA TTGTACTTAACA GGTTGTTTTTAG|AAA | 153 | - | + | None |