Characterization of RNase MRP RNA and novel snoRNAs from Giardia intestinalis and Trichomonas vaginalis
© Chen et al; licensee BioMed Central Ltd. 2011
Received: 22 June 2011
Accepted: 6 November 2011
Published: 6 November 2011
Eukaryotic cells possess a complex network of RNA machineries which function in RNA-processing and cellular regulation which includes transcription, translation, silencing, editing and epigenetic control. Studies of model organisms have shown that many ncRNAs of the RNA-infrastructure are highly conserved, but little is known from non-model protists. In this study we have conducted a genome-scale survey of medium-length ncRNAs from the protozoan parasites Giardia intestinalis and Trichomonas vaginalis.
We have identified the previously 'missing' Giardia RNase MRP RNA, which is a key ribozyme involved in pre-rRNA processing. We have also uncovered 18 new H/ACA box snoRNAs, expanding our knowledge of the H/ACA family of snoRNAs.
Results indicate that Giardia intestinalis and Trichomonas vaginalis, like their distant multicellular relatives, contain a rich infrastructure of RNA-based processing. From here we can investigate the evolution of RNA processing networks in eukaryotes.
The current view of cellular RNA organization indicates an RNA infrastructure [1, 2], which describes the spatial and temporal network of the many different RNAs. The interconnection of the RNA-processing pathways is crucial for cellular processes because different RNA processing mechanisms are tightly linked. For example, transcription and splicing happen in close proximity [3, 4], and splicing is tightly connected with downstream mRNA processes including localization, translational regulation, and nonsense-mediated decay [5–7]. With the advancement of large-scale RNA analysis and high-throughput sequencing, conserved features of the eukaryotic RNA infrastructure have come to light from studies in animals and plants. In contrast to the genome-wide transcriptional information known in other eukaryotic models, only limited information on RNA biology is available from G. intestinalis and T. vaginalis. We can compare the ncRNAs from these two protists with other model eukaryotes (e.g. animals, plants, yeasts), to understand more about the general nature of the RNA infrastructure within eukaryotes.
We have previously sequenced ncRNAs sized from 10 to 200 nucleotides (nt) from G. intestinalis and T. vaginalis, and identified from this data small ncRNAs such as microRNAs in the two protists . Using this same sequencing data we are also able to analyze longer ncRNAs due to the wide size range. In this study we identify medium-length ncRNAs (50-250 nt) from small RNA based sequencing data, characterize the RNase MRP of G. intestinalis, and clarify annotations of RNase P and MRP from both G. intestinalis and T. vaginalis. We also identify new H/ACA box snoRNAs from these species. Our study clearly demonstrates that high-throughput sequencing cannot only screen for small regulatory RNAs, but can also be used for the characterization of longer ncRNAs from diverse organisms. Results from our work support that G. intestinalis and T. vaginalis possess a rich network of RNA processing components expected in the consensus eukaryotic RNA infrastructure.
Construction of RNA contigs using consensus mapping
Summary of RNA consensus contig results
Number of Contigs
Mean length (nt)
Median length (nt)
Max length (nt)
With the length cutoff of the new contig datasets set above 50nt we discarded mature miRNAs and siRNAs. De novo assembly tools such as Velvet  and Abyss  produced very few contigs and therefore were not used in this study. Hence, we recommend that for medium length ncRNA assembly that a reference genome is used for the initial assembly until tools are developed to permit the de novo assembly of small contigs. Our study was carried out on G. intestinalis WB strain (Genome Assemblage A), and the genome assemblages of the other two strains (Isolate GS/Assemblage B, and Isolate P15/Assemblage E) were used for comparison in some of the subsequent analysis.
Our RNA contigs and trimmed sequences were compared to the available G. intestinalis and T. vaginalis genome annotation to check that rRNAs and tRNAs were represented as expected. From the G. intestinalis sequences all 80 annotated rRNAs and tRNAs from the WB isolate were covered by contigs generated with Bowtie mapping. However, many of these contigs contain Illumina short reads mapped to multiple sites in the genome and therefore were assigned equally to each of the possibilities. Contigs were trimmed in length from the 5' end, to a minimum length of 20 nt, 36 nt and 50 nt to use as sequence tags and mapped against the annotated ncRNAs. Our original trimmed sequence datasets were also mapped against the annotated ncRNAs. All G. intestinalis and T. vaginalis rRNAs were found by sequence and contigs. In G. intestinalis all tRNAs and rRNAs were found from Assemblage A with 3 tRNAs not found by contig but found by sequence in Assemblage B, (P15) (Trp, Met, Phe) and four not found by sequence or contig (Cys, Tyr, His, Asp). A comparison against Assemblage E (Isolate P15) had 4 tRNAs (Trp, Met, Cys, Asp) not found by sequence or contig. Given that our sequences came from the same strain as Assemblage A, not finding some sequences in the other strains is not surprising. T. vaginalis had many annotations for its rRNAs and tRNAs and the majority were found by both contig and sequence (results are summarized in Additional file 1, Tables s1-s3).
Results from comparing our contigs and sequences with known tRNAs and rRNAs indicate that our method is assembling ncRNAs effectively, and that such contigs can be used as sequence tags if trimmed to 20 or 36 nucleotides.
Identification of the MRP RNA from G. intestinalis
RNase MRP is a ribonucleoprotein complex, consisting of one ncRNA and (in humans) ~10 proteins. It catalyzes the nucleotide cleavage reaction at the A3 site (at the internal transcribed spacer region between small-subunit and 5.8S rRNAs) on the pre-rRNA transcript. MRP RNA is evolutionarily related to the eukaryotic RNase P RNA which processes the 3'- end of pre-tRNA transcripts . The RNase P and MRP complexes share a number of common protein subunits [24, 25], and the secondary structures of RNase P and MRP RNAs share a common backbone. The RNase P RNA has been identified in all eukaryotes and prokaryotes studied to date and shown to be one of the few RNAs that have retained catalytic features [26, 27]. The RNase MRP is thought to have evolved before the ancestor of modern eukaryotes , but previously missing evidence of the MRP RNA from G. intestinalis [8, 29] imposed a question on the origin of MRP and rRNA processing. However the conserved A3 site on rRNA-gene sequence has been identified in G. intestinalis, strongly suggesting the presence of G. intestinalis MRP RNA .
Genomic location of G. intestinalis and T. vaginalis RNase MRP and RNase P RNA genes
RNase MRP RNA
RNase P RNA
Another important characteristic of the MRP RNA is the P3 helix, which is a structure common between RNase MRP and P. The P3 helix associates with the protein POP1 , and for MRP it is essential for the cleavage of pre-rRNAs at the A3 site on the pre-ribosomal RNA transcript . Previous computational studies have already identified both the RNase P and RNase MRP RNAs from T. vaginalis . The P3 helix of G. intestinalis RNase P and RNase MRP are both short and lack the intra-helical loop, however they do share a 3 nt sequence at the terminal loop, consistent with previous studies showing the same relation of P and MRP in a number of eukaryotes [8, 28]. Comparison of the P3 helices of T. vaginalis RNase P and MRP also show a consensus sequence located at the intra-helical loop and the terminal loop regions (Figure 3C). The structure of helix P19 between G. intestinalis MRP and P is also conserved, although no sequence similarity is observed.
Although we are concentrating on the RNase P and RNase MRP ncRNA molecules from G. intestinalis and T. vaginalis, it is worth a quick mention of the important proteins associated with their ribonucleoprotein macromolecules. RNase P and RNase MRP macromolecules share many of their proteins [24, 25, 34], which have been characterized to small extent in G. intestinalis . Some proteins such as the scaffolding protein POP1 can be hard to identify in protists, due to the large amount of evolutionary distance between them and species from which these proteins are known, but other proteins such as POP4 are much more conserved across eukaryotes . For a more detailed analysis of the changes in the secondary structure of the RNA components we will require further domain and protein structural analysis to understand how the RNA and Protein components have evolved together. However, even without this detailed protein-RNA analysis, we can see that our results strongly support the conservation of the structure-function relationship between RNase MRP and P within G. intestinalis and T. vaginalis.
Our study also sought to clarify the annotation of the RNase MRP RNA especially in the Assemblage B from version 2.3 of the genome. In this annotation within contig ACGJ00100236, the MRP RNA is overlapping with a snoRNA which is in turn overlapping an area annotated as RNase P. The RNase MRP RNA from G. intestinalis we have identified does not match this region, but instead this region corresponds to its close relative the RNase P RNA. Corrected contigs and co-ordinates for the Assemblage B genes are given in Table 2.
Although we have characterized the G. intestinalis MRP RNA computationally, further molecular biology experimentation will be required before it can be functionally verified. Until then, we classify our sequences in G. intestinalis as computationally predicted.
New H/ACA box snoRNAs
Small nucleolar (sno)RNAs are a group of ncRNAs of variable length (from 60 up to 1000 nt in yeast), which are involved in processing of several types of transcripts . Most of the known snoRNAs belong to two classes, which are determined by evolutionarily conserved sequence elements: the C/D box and H/ACA box . SnoRNAs exist in large numbers in eukaryotes. In humans, there have been more than 400 snoRNAs identified , and in general have been shown to have a diverse range of locations and expression patterns . In our previous studies, we have characterized novel C/D box snoRNAs in both G. intestinalis and T. vaginalis [9, 14]. C/D box snoRNAs direct 2'-O methylation, and are relatively easy to identify based on conserved sequence elements and complementary binding to the target RNAs. H/ACA box snoRNAs direct pseudouridylation, and often exhibit more variable features due to their shorter length of conserved elements and discontinuous complementary target binding regions.
Scoring rules and scores of snoRNA candidates
Number of candidates*
Average of candidates'
The new G. intestinalis snoRNAs all have different target pseudouridylation sites on rRNAs, whereas two of the T. vaginalis new snoRNAs (Tv/ACA.3 and Tv/ACA.10) share the same target (LSU rRNA U2214). In addition, Gi/ACA.5, Tv/ACA.3 and Tv/ACA.10 target a conserved site on the large subunit rRNA, the same as Gi/ACA.6 and Tv/ACA.2. The target regions are highly conserved between G. intestinalis and T. vaginalis, but the sequences of the snoRNAs targeting these sites do not show substantial sequence similarity. A BLAST  search of the newly identified G. intestinalis H/ACA box snoRNAs was performed against the genomes of the other two isolates of Giardia strains (Assemblage B and E) to look for homologous sequences. Three G. intestinalis new snoRNAs have homologous sequences in either Assemblage B or E, and 2 have homologous sequences in both Assemblage B and E. Overall the sequences are highly conserved and the nucleotides complementary to the rRNA target sequences show minimal changes across the three Giardia strains.
Recent studies on ncRNAs throughout eukaryotes have expanded our understanding of the RNA-processing infrastructure  with the discovery that key components of the RNA-processing machinery occur throughout eukaryotes. It is now clear that the general ncRNA infrastructure has been conserved in excavates, which is an extended but less studied group of eukaryotes.
Characterization of the G. intestinalis RNase MRP RNA is an important achievement in searching for conserved key ncRNAs of the central RNA-processing pathway. Sequence and structural analysis of the G. intestinalis MRP RNA has shown all the conserved characteristics of eukaryotic MRP, as referred to in Figure 3. The conserved structural relationship between G. intestinalis P and MRP RNAs indicates that the protein-RNA relation in G. intestinalis P and MRP does not differ significantly from other eukaryotes. Identification of the MRP RNA from G. intestinalis has filled the gap left from previous studies of MRPs. In looking at the structure of the G. intestinalis MRP we can see how consensus models based on the eukaryotic P3 region could not have detected either the sequence or the structure. The typical P3 region of G. intestinalis resembles more the bacterial model of RNase P RNA than the eukaryotic P3 model for both RNase P and RNase MRP RNAs. However, the rest of the structure fits the eukaryotic RNase MRP RNA model. This demonstrates that with protists in particular, the standard eukaryotic models for ncRNAs may not necessarily apply.
The novel H/ACA box snoRNAs identified from G. intestinalis and T. vaginalis all have only one predicted target-binding site regardless the number of stem-loops. Having one target-binding site is also seen in the H/ACA box snoRNAs found in Trypanosomes . Identification of the new snoRNAs is usually reliant on predicted conserved target sites, which however appear only partially conserved across distant organisms. ncRNAs in protists may exhibit characteristics not typically seen in more commonly studied model species, and thus these methods may not reveal all the ncRNAs within non-model genomes. For instance the number of pseudouridylation sites in Trypanosomes is estimated to be 70-80 , but to date only around 50% of snoRNAs involved in the modification of these sites have been found . Therefore we can expect that the total number of H/ACA box snoRNAs will actually be much larger in both G. intestinalis and T. vaginalis. Understanding more about general structural and sequence motifs, will aid us in further searches for H/ACA box snoRNAs.
We demonstrate in our study that high-throughput sequencing of ncRNAs larger than ~21-25nt (typical for miRNA and siRNA studies) is possible, and that the assembly of short read sequences can lead to the characterization of medium-length ncRNAs. We have found this technology to be a boost to the study of ncRNAs from non-model eukaryotes and especially those distantly related to well characterized species (e.g. human, mosquito, nematode, yeasts and some plants). A limitation of this study is that the sequencing length of 36nt (standard in ncRNA sequencing) is short and that longer sequences would enable the identification of ncRNAs ~70-150nt in single reads, thus not requiring assembly.
In conclusion, we constructed new RNA contigs with updated genomes and identified the RNase MRP of G. intestinalis, which answers positively the previously open question as whether MRP exists in all extant eukaryotes with the A3 site. In addition, a number of new H/ACA box snoRNAs have been identified in G. intestinalis and T. vaginalis, with a reduced structure compared to model species, but still possessing the characteristic target-binding pattern and sequence motifs. It is becoming evident that not only are the components of RNA-processing network highly conserved within eukaryotes, but also the pattern of transcription across the genome appears to be shared among distant lineages. Overall, our results imply that it is increasingly likely that the main classes of RNA processing and regulation were present in the last common ancestor of eukaryotes . We demonstrate that high-throughput sequencing can be used for the characterization of ncRNAs longer than 21-22nt small regulatory RNAs for which, in ncRNA studies, this technology is typically applied.
Our general strategy has been to search for the major classes of RNA in all major groups of eukaryotes and investigate the evolution of their mechanisms [14, 43, 44]. Increasingly it appears that the major groups are universal in eukaryotes, even though there is continued evolution of individual subgroups of regulatory RNA. Discovering how RNA systems work in protists, which are distantly remote in an evolutionary sense from other eukaryotes, may hold the key in uncovering how RNA mechanisms evolved from our early ancestors.
Cultures and RNA sequencing
Note that the sample preparation and sequencing was prepared as for previous publications from this data . We have summarised this procedure here.
G. intestinalis trophozoites (WB strain) were grown in TY1-S-33 media, and T. vaginalis was maintained in T. vaginalis broth (Fort Richard) at 37°C. Cells were harvested by centrifugation (2,500 rpm, 10 min, 4°C). Growth media was removed and cells were washed once in PBS buffer. Total RNA was extracted using Trizol (Invitrogen) according to the protocol provided by the manufacturer, and further purified by phenol: chloroform extraction. The purified RNA was dissolved in double distilled water. For sequencing, 10 μg of DNase treated, 5'- de-capped total RNAs were separated on a 15% denaturing acrylamide 8 M urea gel and RNAs ranging from 10 to 200 nt were cut out from the gel and prepared according to Illumina's small RNA preparation protocol. This effectively includes an RT-PCR step that converts RNA to DNA for further sequencing. 8 and 12 pmol (in each lane) of G. intestinalis and T. vaginalis cDNA were sent for sequencing on an Illumina Genome Analyzer for 36 cycles. Pipeline analysis was performed with the Illumina Pipeline version 1.4.
RNA Consensus Contig construction
During our previous study  full length (36nt) trimmed (22nt) and unique read datasets were mapped to the genomes  of G. intestinalis (version 1.2) and T. vaginalis (version 1.1) and short consensus 'RNA contigs' were generated using Maq version 0.7.1 . We initially used these RNA contigs, then constructed new RNA contigs generated when updated genomes became available for downloading from GiardiaDB and TrichDB  (G. intestinalis version 2.3 and T. vaginalis version 1.2). For these new contigs we used Bowtie  for mapping, and SAMTools  for conversion and consensus sequence generation. Contigs less than 50nt were discarded for this study. A covariance model of MRP RNA was built from the seed alignment of 89 MRP RNA sequences from the Rfam database , and then used to search for MRP candidates in G. intestinalis RNA contigs using Infernal 1.0 . Secondary structures were drawn using VARNA  and the potential RNase MRP was compared to similar regions in the other Giardia strains using BLAST.
snoRNA search and characterisation
The search for new H/ACA box snoRNAs used SnoGPS 2.0  with default parameters and predicted pseudouridylation sites. The negative control sequences for SnoGPS were generated using uShuffle , and additional analysis was carried out using RNAMotif  to look for H/ACA-like RNA structures. Potential rRNA modification sites needed to be clarified before searching for new H/ACA-box snoRNAs from G. intestinalis and T. vaginalis. rRNAs of G. intestinalis and T. vaginalis were first aligned with human rRNAs, then conserved sites in rRNAs with known pseudouridylation in human were selected as possible target sites. These sites were then used for snoRNA prediction using SnoGPS . Initially, SnoGPS control runs were performed with randomized sequences in order to determine the distribution of total scores for later analysis runs. To construct these random sequences, a selection of G. intestinalis and T. vaginalis contigs were shuffled with the same nucleotide frequency 100 times and the resulting sequences were used as control input for SnoGPS program. First, the standard two stem-loop model was used, and this search permitted G-U base pairs in snoRNA-target binding. However, previous studies have shown that snoRNAs in G. intestinalis could also adopt an archaeal one-stem structure , therefore, a one-stem search was also tested. The total score cutoff for searching the original contigs was set to above 95% of the randomized control sequences.
All new sequences are in preparation to be included into Rfam .
The G. intestinalis culture was kindly provided by Errol Kwan, Protozoa Research Unit, Hopkirk Institute, Massey University; and the T. vaginalis samples were collected by Lynn Rogers at Medlab Central, Palmerston North. LJC was partially supported by an Emerging Researcher Grant from the Health Research Council of New Zealand.
- Collins LJ, Penny D: The RNA infrastructure: dark matter of the eukaryotic cell?. Trends Genet. 2009, 25 (3): 120-128. 10.1016/j.tig.2008.12.003.PubMedView Article
- Collins LJ: The RNA Infrastructure: An Introduction to ncRNA Networks 1-16. RNA Infrastructure and Networks. Edited by: Lesley J Collins. © 2011, Landes Bioscience and Springer Science+Business Media
- Kornblihtt AR, de la Mata M, Fededa JP, Munoz MJ, Nogues G: Multiple links between transcription and splicing. RNA. 2004, 10 (10): 1489-1498. 10.1261/rna.7100104.PubMed CentralPubMedView Article
- Collins LJ: Spliceosomal RNA Infrastructure: The Network of Splicing Components and their Regulation by miRNAs 86-102. RNA Infrastructure and Networks. Edited by: Lesley J Collins. © 2011 Landes Bioscience and Springer Science+Business Media
- Lin CL, Evans V, Shen S, Xing Y, Richter JD: The nuclear experience of CPEB: Implications for RNA processing and translational control. RNA. 2009, 16 (2): 338-348.PubMedView Article
- Millevoi S, Vagner S: Molecular mechanisms of eukaryotic pre-mRNA 3' end processing regulation. Nucleic Acids Res. 2009, gkp1176-
- Kawashima T, Pellegrini M, Chanfreau GF: Nonsense-mediated mRNA decay mutes the splicing defects of spliceosome component mutations. RNA. 2009, 15 (12): 2236-2247. 10.1261/rna.1736809.PubMed CentralPubMedView Article
- Piccinelli P, Rosenblad MA, Samuelsson T: Identification and analysis of ribonuclease P and MRP RNA in a broad range of eukaryotes. Nucleic Acids Res. 2005, 33 (14): 4485-4495. 10.1093/nar/gki756.PubMed CentralPubMedView Article
- Chen XS, Rozhdestvensky TS, Collins LJ, Schmitz J, Penny D: Combined experimental and computational approach to identify non-protein-coding RNAs in the deep-branching eukaryote Giardia intestinalis. Nucleic Acids Res. 2007, 35 (14): 4619-4628. 10.1093/nar/gkm474.PubMedView Article
- Yang CY, Zhou H, Luo J, Qu LH: Identification of 20 snoRNA-like RNAs from the primitive eukaryote, Giardia lamblia. Biochem Biophys Res Commun. 2005, 328 (4): 1224-1231. 10.1016/j.bbrc.2005.01.077.PubMedView Article
- Chen XS, White WT, Collins LJ, Penny D: Computational identification of four spliceosomal snRNAs from the deep-branching eukaryote Giardia intestinalis. PLoS One. 2008, 3 (8): e3106-10.1371/journal.pone.0003106.PubMed CentralPubMedView Article
- Saraiya AA, Wang CC: snoRNA, a novel precursor of microRNA in Giardia lamblia. PLoS Pathog. 2008, 4 (11): e1000224-10.1371/journal.ppat.1000224.PubMed CentralPubMedView Article
- Zhang YQ, Chen DL, Tian HF, Zhang BH, Wen JF: Genome-wide computational identification of microRNAs and their targets in the deep-branching eukaryote Giardia lamblia. Comput Biol Chem. 2009, 33 (5): 391-396. 10.1016/j.compbiolchem.2009.07.013.PubMedView Article
- Chen XS, Collins LJ, Biggs PJ, Penny D: High Throughput Genome-Wide Survey of Small RNAs from the Parasitic Protists Giardia intestinalis and Trichomonas vaginalis. Genome Biol Evol. 2009, 1: 165-175.PubMedView Article
- Teodorovic S, Walls CD, Elmendorf HG: Bidirectional transcription is an inherent feature of Giardia lamblia promoters and contributes to an abundance of sterile antisense transcripts throughout the genome. Nucleic Acids Res. 2007, 35 (8): 2544-2553. 10.1093/nar/gkm105.PubMed CentralPubMedView Article
- Lin WC, Li SC, Lin WC, Shin JW, Hu SN, Yu XM, Huang TY, Chen SC, Chen HC, Chen SJ, et al: Identification of microRNA in the protist Trichomonas vaginalis. Genomics. 2009, 93 (5): 487-493. 10.1016/j.ygeno.2009.01.004.PubMedView Article
- Simoes-Barbosa A, Meloni D, Wohlschlegel JA, Konarska MM, Johnson PJ: Spliceosomal snRNAs in the unicellular eukaryote Trichomonas vaginalis are structurally conserved but lack a 5'-cap structure. RNA. 2008, 14 (8): 1617-1631. 10.1261/rna.1045408.PubMed CentralPubMedView Article
- Smith A, Johnson P: Gene expression in the unicellular eukaryote Trichomonas vaginalis. Research in Microbiology. 2011, 162 (6): 1-9.View Article
- Langmead B, Trapnell C, Pop M, Salzberg SL: Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009, 10 (3): R25-10.1186/gb-2009-10-3-r25.PubMed CentralPubMedView Article
- Li H, Ruan J, Durbin R: Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res. 2008, 18 (11): 1851-1858. 10.1101/gr.078212.108.PubMed CentralPubMedView Article
- Zerbino DR, Birney E: Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008, 18 (5): 821-829. 10.1101/gr.074492.107.PubMed CentralPubMedView Article
- Simpson JT, Wong K, Jackman SD, Schein JE, Jones SJ, Birol I: ABySS: a parallel assembler for short read sequence data. Genome Res. 2009, 19 (6): 1117-1123. 10.1101/gr.089532.108.PubMed CentralPubMedView Article
- Ellis JC, Brown JW: The RNase P family. RNA Biol. 2009, 6 (4): 362-369. 10.4161/rna.6.4.9241.PubMedView Article
- Chamberlain JR, Lee Y, Lane WS, Engelke DR: Purification and characterization of the nuclear RNase P holoenzyme complex reveals extensive subunit overlap with RNase MRP. Genes Dev. 1998, 12 (11): 1678-1690. 10.1101/gad.12.11.1678.PubMed CentralPubMedView Article
- van Eenennaam H, Jarrous N, van Venrooij WJ, Pruijn GJ: Architecture and function of the human endonucleases RNase P and RNase MRP. IUBMB Life. 2000, 49 (4): 265-272. 10.1080/15216540050033113.PubMedView Article
- Kikovska E, Svard SG, Kirsebom LA: Eukaryotic RNase P RNA mediates cleavage in the absence of protein. Proc Natl Acad Sci USA. 2007, 104 (7): 2062-2067. 10.1073/pnas.0607326104.PubMed CentralPubMedView Article
- Pannucci JA, Haas ES, Hall TA, Harris JK, Brown JW: RNase P RNAs from some Archaea are catalytically active. Proc Natl Acad Sci USA. 1999, 96 (14): 7803-7808. 10.1073/pnas.96.14.7803.PubMed CentralPubMedView Article
- Munroe SH, Zhu J: Overlapping transcripts, double-stranded RNA and antisense regulation: a genomic perspective. Cell Mol Life Sci. 2006, 63 (18): 2102-2118. 10.1007/s00018-006-6070-2.PubMedView Article
- Gardner PP, Daub J, Tate JG, Nawrocki EP, Kolbe DL, Lindgreen S, Wilkinson AC, Finn RD, Griffiths-Jones S, Eddy SR: Rfam: updates to the RNA families database. Nucleic Acids Res. 2009, D136-140. 37 Database
- Woodhams MD, Stadler PF, Penny D, Collins LJ: RNase MRP and the RNA processing cascade in the eukaryotic ancestor. BMC Evol Biol. 2007, 7 (Suppl 1): S13-10.1186/1471-2148-7-S1-S13.PubMed CentralPubMedView Article
- Nawrocki EP, Kolbe DL, Eddy SR: Infernal 1.0: inference of RNA alignments. Bioinformatics. 2009, 25 (10): 1335-1337. 10.1093/bioinformatics/btp157.PubMed CentralPubMedView Article
- Davila Lopez M, Rosenblad MA, Samuelsson T: Conserved and variable domains of RNase MRP RNA. RNA Biol. 2009, 6 (3): 208-220. 10.4161/rna.6.3.8584.PubMedView Article
- Lygerou Z, Mitchell P, Petfalski E, Seraphin B, Tollervey D: The POP1 gene encodes a protein component common to the RNase MRP and RNase P ribonucleoproteins. Genes Dev. 1994, 8 (12): 1423-1433. 10.1101/gad.8.12.1423.PubMedView Article
- Welting TJ, Kikkert BJ, Van Venrooij WJ, Pruijn GJ: Differential association of protein subunits with the human RNase MRP and RNase P complexes. RNA. 2006, 12 (7): 1373-1382. 10.1261/rna.2293906.PubMed CentralPubMedView Article
- Collins LJ, Poole AM, Penny D: Using ancestral sequences to uncover potential gene homologues. Appl Bioinformatics. 2003, 2 (3 Suppl): S85-95.PubMed
- Bachellerie JP, Cavaille J, Huttenhofer A: The expanding snoRNA world. Biochimie. 2002, 84 (8): 775-790. 10.1016/S0300-9084(02)01402-5.PubMedView Article
- Dieci G, Preti M, Montanini B: Eukaryotic snoRNAs: a paradigm for gene expression flexibility. Genomics. 2009, 94 (2): 83-88. 10.1016/j.ygeno.2009.05.002.PubMedView Article
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25 (17): 3389-3402. 10.1093/nar/25.17.3389.PubMed CentralPubMedView Article
- Myslyuk I, Doniger T, Horesh Y, Hury A, Hoffer R, Ziporen Y, Michaeli S, Unger R: Psiscan: a computational approach to identify H/ACA-like and AGA-like non-coding RNA in trypanosomatid genomes. BMC Bioinformatics. 2008, 9: 471-10.1186/1471-2105-9-471.PubMed CentralPubMedView Article
- Doniger T, Michaeli S, Unger R: Families of H/ACA ncRNA molecules in trypanosomatids. RNA Biol. 2009, 6 (4): 370-374. 10.4161/rna.6.4.9270.PubMedView Article
- Macke TJ, Ecker DJ, Gutell RR, Gautheret D, Case DA, Sampath R: RNAMotif, an RNA secondary structure definition and search algorithm. Nucleic Acids Res. 2001, 29 (22): 4724-4735. 10.1093/nar/29.22.4724.PubMed CentralPubMedView Article
- Collins LJ, Chen XS: Ancestral RNA: the RNA biology of the eukaryotic ancestor. RNA Biol. 2009, 6 (5): 495-502. 10.4161/rna.6.5.9551.PubMedView Article
- Collins L, Penny D: Complex spliceosomal organization ancestral to extant eukaryotes. Mol Biol Evol. 2005, 22 (4): 1053-1066. 10.1093/molbev/msi091.PubMedView Article
- Collins L, Penny D: Investigating the intron recognition mechanism in eukaryotes. Mol Biol Evol. 2006, 23 (5): 901-910. 10.1093/molbev/msj084.PubMedView Article
- Aurrecoechea C, Brestelli J, Brunk BP, Carlton JM, Dommer J, Fischer S, Gajria B, Gao X, Gingle A, Grant G: GiardiaDB and TrichDB: integrated genomic resources for the eukaryotic protist pathogens Giardia lamblia and Trichomonas vaginalis. Nucleic Acids Res. 2009, D526-530. 37 Database
- Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R: The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009, 25 (16): 2078-2079. 10.1093/bioinformatics/btp352.PubMed CentralPubMedView Article
- Darty K, Denise A, Ponty Y: VARNA: Interactive drawing and editing of the RNA secondary structure. Bioinformatics. 2009, 25 (15): 1974-1975. 10.1093/bioinformatics/btp250.PubMed CentralPubMedView Article
- Schattner P, Brooks AN, Lowe TM: The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs. Nucleic Acids Res. 2005, W686-689. 33 Web Server
- Jiang M, Anderson J, Gillespie J, Mayne M: uShuffle: a useful tool for shuffling biological sequences while preserving the k-let counts. BMC Bioinformatics. 2008, 9: 192-10.1186/1471-2105-9-192.PubMed CentralPubMedView Article
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.