Volume 11 Supplement 4
Discovery and characterization of medaka miRNA genes by next generation sequencing platform
© Li et al; licensee BioMed Central Ltd. 2010
Published: 2 December 2010
MicroRNAs (miRNAs) are endogenous non-protein-coding RNA genes which exist in a wide variety of organisms, including animals, plants, viruses and even unicellular organisms. Medaka (Oryzias latipes) is a useful model organism among vertebrate animals. However, no medaka miRNAs have been investigated systematically. It is therefore beneficial to conduct a genome-wide miRNA discovery study using next generation sequencing (NGS) technology, which has emerged as a powerful tool for high-throughput analysis.
In this study, we adopted the ABI SOLiD platform to generate small RNA sequence reads from medaka tissues and mapped these reads back to the medaka genome. The mapped genomic loci were considered candidate miRNAs and were further processed by a support vector machine (SVM) classifier. As a result, we identified 599 novel medaka pre-miRNAs, many of which were found to encode more than one isomiR. In addition, minor miRNAs (also called miRNA star) could be detected thanks to the improved sequencing depth. These quantifiable isomiRs and minor miRNAs enabled us to characterize medaka miRNA genes in several respects. First, many medaka candidate pre-miRNAs lie close to each other and form miRNA clusters, some of which are conserved across other vertebrates. Second, during miRNA maturation there is an arm selection preference of mature miRNAs within precursors. We observed differences in arm selection preference between our candidate pre-miRNAs and their orthologs, and classified these differences into three categories based on the distribution of NGS reads. Finally, we investigated the relationship between the conservation status and the expression level of miRNA genes, and found that evolutionarily conserved miRNAs were usually the most abundant.
Medaka is a widely used model animal in biomedical studies, including studies of developmental biology. Identifying and characterizing medaka miRNA genes would benefit studies that use medaka as a model organism.
MicroRNAs (miRNAs) are endogenous non-protein-coding RNAs of ~22 nucleotides in length. They down-regulate target genes, either by translation inhibition or by mRNA degradation, through complementary binding to the 3'-UTR regions of those genes. A growing number of studies have discovered critical modulatory functions of miRNAs in many physiological activities. miRNA genes have also been found in a wide variety of organisms, including animals, plants, viruses and even unicellular organisms [1, 2], which suggests the evolutionary conservation of miRNA genes and miRNA regulation mechanisms. Accordingly, several reports have investigated the conservation of miRNAs across metazoans or bilaterians and discovered distinctive patterns of miRNA evolutionary conservation [4–9].
Establishing a comprehensive miRNA resource in additional organisms would benefit subsequent research on miRNA evolution and function. To do so, miRNA genes must first be identified in model organisms. However, discovery of miRNAs by traditional experimental approaches, including direct cloning, northern blot assay and stem-loop RT-PCR, is not an easy task because of their relatively small size and distinct tissue expression patterns. A systems biology approach would be preferable for large-scale validation.
Recently, next generation sequencing (NGS) technologies, including the Roche 454, Illumina GA (Genome Analyzer) and ABI SOLiD platforms, have emerged as powerful platforms for genomic and transcriptomic studies. According to an evaluation study, all of these NGS platforms have good detection sensitivity. NGS technology has therefore been adopted in transcriptome profiling [11–13], SNP identification [14, 15], genome sequencing [16, 17], biomarker detection, and other applications. More recently, NGS technology has also been applied in miRNA identification and profiling studies. Morin et al. identified 104 novel human miRNA genes and compiled a list of miRNAs differentially expressed between two embryo cell libraries. Glazov and colleagues discovered 449 new chicken miRNAs and 39 mirtrons. In addition, Wheeler et al. not only sequenced miRNAs from metazoan genomes but also examined the evolutionary status of the discovered miRNA genes.
Medaka (Oryzias latipes) is a useful model organism among vertebrate animals [21, 22]. Although it is widely used in research, little information on medaka gene annotation is available thus far. Up to now, there are only 474 medaka protein-coding reference genes reported in RefSeq release 37. In addition, only a small number of fish miRNAs have been reported compared with other animal groups. There are only 360, 131, and 132 miRNA entries for Danio rerio, Fugu rubripes, and Tetraodon nigroviridis, respectively. To date, no medaka miRNAs have been discovered and reported, even though the miRNA collection in miRBase 14.0 has reached 10,883 entries. Since medaka is a model animal widely used for different research purposes and miRNA genes play critical biological roles, identifying miRNA genes in the medaka genome would greatly benefit subsequent studies. In this study, we adopted the ABI SOLiD platform for medaka miRNA gene identification. In summary, we identified 599 novel medaka pre-miRNAs, many of which were found to encode more than one isomiR. Discovery of miRNA genes in the medaka genome would further the understanding of miRNA evolution and function in fish and vertebrates.
Initial NGS read analysis
Table: Statistics of mappable sequence reads (rows: number of mappable reads, number of mappable unique reads, number of mappable genomic loci)
Identified miRNAs by SVM pipeline
Statistics of candidates in different sets
Comparing all six isomiRs in Mh40, most variation occurs at the 3' end. Although other types of RNA editing have been reported, e.g. A-to-G transition catalyzed by adenosine deaminase or C-to-U transition catalyzed by cytidine deaminase, we believe that they are not prevalent relative to the sequencing errors introduced by the technology [4, 19]. Therefore, we did not consider these RNA editing modifications in this study: we employed a perfect-match mapping criterion, and reads with sequence variations were discarded. In total, 29.2% (175/599) of candidate pre-miRNAs encode isomiRs. Information on all candidates of the Mh and Mn sets can be found in Additional files 1 and 2.
miRNA cluster in medaka genome
It is known that many miRNAs are located close to each other and can form a gene cluster [26, 27]. miRNA genes in the same cluster might be transcribed as a polycistronic transcript if they are located within a short distance of each other. Based on miRBase’s definition of a miRNA cluster (10,000 bp range), we discovered a total of 63 miRNA clusters among all candidate pre-miRNAs in the medaka genome. The clusters and the candidate pre-miRNAs within them are listed in Additional file 3. As shown in Additional file 3, most clusters have two (41 clusters) or three (11 clusters) pre-miRNAs. These clustered pre-miRNAs account for 32.4% (194/599) of all medaka pre-miRNAs. Some clusters contain only homologous pre-miRNAs, such as cluster 37; some contain only novel pre-miRNAs, such as cluster 17; and some contain both homologous and novel pre-miRNAs, such as cluster 51. In cluster 51, two homologous pre-miRNAs lie at the two ends and five novel pre-miRNAs lie between them. It is likely that the clustered candidate pre-miRNAs belong to the same transcription unit and are therefore more likely to be authentic miRNAs. They deserve more attention for this reason.
There are six candidate pre-miRNAs, Mh179, Mh180, Mh181, Mh182, Mh183 and Mh184, located in cluster 37 on the minus strand of chromosome 21. Their orthologous pre-miRNAs are eca-mir-92a, eca-mir-19b, eca-mir-20a, eca-mir-19a, eca-mir-18a and eca-mir-17, respectively (see Additional file 3). Querying miRBase, we found that these six horse pre-miRNAs also form a cluster, in reverse order, on the plus strand of chromosome 17. Moreover, this miRNA cluster is commonly shared by vertebrate animals. This observation indicates that these clustered miRNAs could act as a conserved transcription unit and have evolved together.
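The distance-based grouping described above can be sketched as follows. This is a minimal illustration, not the pipeline used in the study: it assumes that loci are grouped per chromosome and strand and that the 10,000 bp range is measured between start coordinates of neighbouring loci. The function name `cluster_loci` and the example coordinates are hypothetical.

```python
from collections import defaultdict

def cluster_loci(loci, max_gap=10_000):
    """Chain neighbouring loci into clusters when they lie within max_gap bp.

    loci: iterable of (name, chromosome, strand, start) tuples.
    Assumption: clustering is done per chromosome and strand, and distance
    is measured between start coordinates of adjacent loci.
    Only groups with at least two members are reported as clusters.
    """
    by_key = defaultdict(list)
    for name, chrom, strand, start in loci:
        by_key[(chrom, strand)].append((start, name))

    clusters = []
    for key in sorted(by_key):
        members = sorted(by_key[key])
        current = [members[0]]
        for prev, item in zip(members, members[1:]):
            if item[0] - prev[0] <= max_gap:
                current.append(item)
            else:
                if len(current) > 1:
                    clusters.append([name for _, name in current])
                current = [item]
        if len(current) > 1:
            clusters.append([name for _, name in current])
    return clusters

# Hypothetical coordinates, for illustration only.
example_loci = [
    ("a", "chr1", "+", 1_000),
    ("b", "chr1", "+", 5_000),
    ("c", "chr1", "+", 30_000),
    ("d", "chr1", "-", 2_000),
    ("e", "chr2", "+", 100),
    ("f", "chr2", "+", 9_000),
]
example_clusters = cluster_loci(example_loci)
```

Chaining neighbours in this way means a cluster can span more than 10,000 bp in total as long as each consecutive pair is within range; a stricter window-based definition would give smaller clusters.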
Arm selection preference of mature miRNAs within precursor
During miRNA maturation, there is an arm selection preference of mature miRNAs within precursors. As a result, most mature miRNAs are generated from either the 5’ or the 3’ arm of the pre-miRNA hairpin. However, some pre-miRNAs encode mature miRNAs at both arms. By miRBase definition, miRNAs from both arms can be of equal abundance, named with a -5p or -3p suffix, or of unequal abundance, in which case the minor one is named with an asterisk (*) suffix. In this study, the large number of NGS reads allowed us to investigate the arm selection preference of mature miRNAs comprehensively. Based on the arm from which mature miRNAs were generated, we classified our candidate pre-miRNAs into five categories: 5P only, 3P only, 5P dominant, 3P dominant, and equal abundance. According to Additional file 1, the five categories account for 43.3%, 41.4%, 6.7%, 5.2% and 3.4% of candidates, respectively. A similar distribution was also observed in known zebrafish miRNAs (data not shown). This distribution shows that most pre-miRNAs encode mature miRNAs mainly at only one side of the precursor hairpin.
In a previous report, Wheeler et al. described a difference in arm selection preference in which mir-33* in Haliotis was expressed 1.56-fold higher than miR-33, which calls into question the annotation of the major and minor forms of mir-33 in Haliotis. In this study, we conducted an in-depth analysis of the major and minor forms of miRNAs in medaka. Similar findings were observed when comparing the copy number of mature miRNAs at both arms of the same precursor (Additional file 1). In summary, we classified these differences in arm selection preference into three classes and used the candidates in the Mh set (Additional file 1) for illustration.
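The five-way classification above can be illustrated with a short sketch that works from per-arm read counts. The text does not state the cut-off separating "dominant" from "equal abundance", so the two-fold threshold used here (`equal_fold`) is purely an assumption, as are the function name and the example counts.

```python
def classify_arm(count_5p, count_3p, equal_fold=2.0):
    """Assign a pre-miRNA to one of the five arm-preference categories.

    count_5p, count_3p: read counts supporting the 5' and 3' arm miRNAs.
    equal_fold: hypothetical threshold; arms whose counts differ by less
    than this fold are treated as 'equal abundance'.
    """
    if count_5p == 0 and count_3p == 0:
        raise ValueError("no reads on either arm")
    if count_3p == 0:
        return "5P only"
    if count_5p == 0:
        return "3P only"
    ratio = count_5p / count_3p
    if ratio >= equal_fold:
        return "5P dominant"
    if ratio <= 1 / equal_fold:
        return "3P dominant"
    return "equal abundance"

# Hypothetical read counts, for illustration only.
example_counts = {"cand1": (120, 0), "cand2": (0, 8), "cand3": (100, 10),
                  "cand4": (10, 100), "cand5": (10, 12)}
example_classes = {name: classify_arm(*c) for name, c in example_counts.items()}
```

Tallying the resulting labels over all candidates would give a distribution comparable in form to the percentages reported above, although the exact figures depend on the chosen threshold.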
The second class of difference in arm selection preference can be illustrated by Mh64 and Mh65 in Figure 2b. In contrast to the first class, Mh64 and Mh65 encode mature miRNA only at their 5’ arm and 3’ arm, respectively; nevertheless, their orthologous pre-miRNAs, eca-mir-199a and gga-mir-33-2, encode mature miRNAs at both the 5’ arm and the 3’ arm of the precursor. One possible explanation is simply that the expression level of the minor form of Mh64 is too low to be detected at this sequencing depth. However, the 5’ arm and 3’ arm of Mh65 are homologous to gga-miR-33 (major) and gga-miR-33* (minor), respectively, and we detected mature miRNA only from the 3’ arm of Mh65. The low-expression explanation therefore fits Mh64 well but does not fit Mh65. A better explanation for Mh65 could be that the sequence difference in the loop of the hairpin makes the structure of Mh65 differ from that of gga-mir-33-2. The altered structure would change the released miRNA/miRNA* duplex and, consequently, which strand of the duplex RISC preferentially selects.
Another interesting example of the second class is Mh62 (Figure 2b), whose orthologous pre-miRNA is hsa-mir-205. hsa-mir-205 encodes hsa-miR-205 and hsa-miR-205* at its 5’ and 3’ arm, respectively. Sequence comparison showed that the 5’ arm of Mh62 is homologous to the 3’ arm of hsa-mir-205. As a result, the miRNA encoded by Mh62 is homologous to the 3’ arm miRNA of hsa-mir-205 (hsa-miR-205*). Other candidates of this class include Mh50, Mh56, Mh39, and Mh47.
The third class of difference in arm selection preference can be illustrated by Mh37 and Mh72 in Figure 2c. Mh37 and Mh72 each encode mature miRNAs only at their 5’ arm or 3’ arm; however, their orthologous pre-miRNAs, cfa-let-7b and dre-mir-150, encode mature miRNAs at exactly the opposite arms according to miRBase annotation. This phenomenon is also observed among known miRNAs in miRBase. For example, ptr-let-7b, mml-let-7b and bta-let-7b encode let-7b only at their 5’ arms; cfa-let-7b encodes let-7b only at its 3’ arm; and oan-let-7b encodes let-7b and let-7b* at both arms. This observation is similar to that for Mh62 in the second class and shows that conservation between mature miRNA sequences does not guarantee conservation of the whole precursor. Other candidates in this class include Mh8, Mh18 and Mh12.
miRNA expression level relation to conservation level
For a more detailed comparison, we then performed pairwise t-tests on each adjacent pair of sets to test the null hypothesis that the expression levels of the two adjacent sets are the same. As shown in Figure 3, except for the Q3Q4 pair, the p-values of all comparison pairs are smaller than 0.05, which rejects the null hypothesis and indicates that the expression levels of the different sets are significantly different. In short, the result demonstrates that more conserved miRNA families tend to have higher expression levels, which is consistent with a previous report.
When identifying miRNA genes from sequence reads, the strategy depends on whether the target species has known miRNA genes. For species with known miRNAs registered in miRBase, mappable reads identical to known miRNAs of that species can simply be regarded as miRNAs, while mappable reads without known miRNA matches should be carefully evaluated as novel miRNAs [19, 20]. In contrast, for species without known miRNAs registered in miRBase, all mappable reads should be carefully evaluated [4, 25]. Since no known medaka miRNAs are reported in miRBase, we applied an SVM pipeline to identify authentic candidate miRNAs.
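The adjacent-pair comparison can be sketched as below. The text does not say which form of t-test was used, so this example assumes Welch's unequal-variance t-test and reports only the test statistic and degrees of freedom (a p-value would additionally require the t distribution, e.g. from a statistics library). The set labels follow the Q-pair naming in the text; the expression values are hypothetical.

```python
from math import sqrt
from statistics import mean, variance

def welch_t(a, b):
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom.

    Assumption: the two sets are independent samples with possibly
    unequal variances; the source does not specify the test variant.
    """
    va = variance(a) / len(a)
    vb = variance(b) / len(b)
    t = (mean(a) - mean(b)) / sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df

def adjacent_pairs(groups):
    """Run welch_t on each adjacent pair of sets, in insertion order."""
    names = list(groups)
    return {
        names[i] + names[i + 1]: welch_t(groups[names[i]], groups[names[i + 1]])
        for i in range(len(names) - 1)
    }

# Hypothetical expression values for three sets, for illustration only.
example_groups = {
    "Q1": [1, 2, 3, 4, 5],
    "Q2": [2, 3, 4, 5, 6],
    "Q3": [10, 11, 12, 13, 14],
}
example_results = adjacent_pairs(example_groups)
```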
In our analysis pipeline, many reads mapped back to multiple medaka genomic loci even though the repeat-masked genome was used. For example, read_1182637 (AACACGAAGCACACACGACGCC), read_7361491 (CCCCCTGCTACATCTACTCCCAGTG) and read_263914823 (TCCGAAAATCCTAAAACGCGC) have 45, 50 and 48 occurrences in the medaka genome, respectively, and were not recognized by RepeatMasker. We therefore wonder whether they are genuine repeat elements that RepeatMasker failed to recognize simply because they are not included in Repbase. Another possible explanation is that such high-frequency elements result from the unsatisfactory quality of the medaka scaffold assembly. The total size of the medaka genome is about 889 Mb, of which 24 chromosomes contribute 717 Mb and 7,164 scaffolds contribute the remaining ~20%. However, half of the occurrences of read_1182637, read_7361491 and read_263914823 are located on chromosomes and the other half on scaffolds. With a better scaffold assembly, this multiple-loci problem might be resolved.
In this study, we found that highly conserved miRNA families seem to have higher expression levels. Many genes, such as GAPDH, perform basal and critical functions that cannot be replaced, which results in their conservation in the course of evolution. A similar argument can be made for these conserved miRNA families and implies that they play critical and irreplaceable roles in medaka. Performing such an examination on miRNAs sampled from a single tissue, organ, developmental stage or cell line might introduce bias, because miRNA expression profiles in different tissues or stages are usually distinct. Our RNA samples were collected from the whole bodies of one male and one female medaka, so our result should be less affected by such bias.
During miRNA maturation, there is an arm selection preference of mature miRNAs within precursors. In this study, we investigated differences in arm selection preference. The results from the third class of difference demonstrate that conservation between mature miRNA sequences does not guarantee conservation of the whole precursor. In miRNA gene identification studies, sequence conservation is a commonly used criterion in bioinformatics pipelines that identify homologous miRNAs in genomes [32–34]. It might be too stringent to demand conservation of the whole hairpin in such studies; it is more suitable to demand conservation of the mature miRNA.
Medaka is a widely used model animal in biomedical studies, including studies of developmental biology. miRNAs have also been reported to play important regulatory roles during animal embryo development. Therefore, identifying medaka miRNA genes may contribute to studies of animal development and provide insight into its regulation.
Materials and methods
Raw reads from SOLiD platform
Two healthy adult medaka fish (Oryzias latipes), one male and one female, were provided by Dr. Pung-Pung Hwang (Institute of Cellular and Organismic Biology, Academia Sinica). They were lysed with a tissue lyser (TissueLyser, QIAGEN), followed by RNA extraction with Trizol reagent (Invitrogen) according to the manufacturer’s protocol. Total RNA from the whole bodies of the male and female medaka fish was pooled and used for small RNA direct sequencing analysis on the ABI SOLiD system.
The unique reads with copy number equal to or greater than three were processed for adaptor trimming and then mapped back to the genome with the RazerS program. Because of the concern about repeat elements, we used the repeat-masked genome for mapping; the medaka genome release used was oryLat2, downloaded from UCSC. In this study, we required a mappable read to be completely identical to the genomic locus, without mismatch or gap, as in a previous report. In addition, a mappable read had to be 18 to 25 nt in length.
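The copy-number and length criteria above can be sketched as a small filter. This is an illustration only: it assumes the input reads are already adaptor-trimmed base-space sequences, and the function name `collapse_and_filter` and the example reads are hypothetical. The actual mapping step in the study was performed with RazerS and is not reproduced here.

```python
from collections import Counter

def collapse_and_filter(reads, min_copies=3, min_len=18, max_len=25):
    """Collapse identical reads and keep those passing the stated criteria.

    reads: iterable of adaptor-trimmed read sequences (assumption).
    Returns a dict mapping each retained unique read to its copy number.
    A read is retained if it occurs at least min_copies times and its
    length is between min_len and max_len nt, inclusive.
    """
    counts = Counter(reads)
    return {
        seq: n
        for seq, n in counts.items()
        if n >= min_copies and min_len <= len(seq) <= max_len
    }

# Hypothetical reads, for illustration only.
example_reads = (
    ["ACGTACGTACGTACGTACGT"] * 3    # 20 nt, 3 copies: kept
    + ["ACGT"] * 5                  # too short: dropped
    + ["TTTTTTTTTTTTTTTTTTTT"] * 2  # only 2 copies: dropped
)
example_kept = collapse_and_filter(example_reads)
```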
Excluding sequence reads from protein-coding genes and repeat elements
Because of the concern about contamination by protein-coding genes or other ncRNAs, candidate miRNAs highly homologous to RefSeq sequences are usually removed in miRNA gene identification studies. Up to now, there are only 474 medaka protein-coding reference genes reported in RefSeq release 37. All mappable sequence reads with more than 90% identity to any reference sequence (from RefSeq 37) were therefore discarded. We also processed these mappable reads with RepeatMasker to exclude repeat elements. However, we still observed many reads with multiple mappable genomic loci (see Discussion). To address this problem, sequence reads that mapped back to more than ten genomic loci were also discarded in this study.
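The multiple-loci filter can be illustrated with a naive exact-match search over both strands. This sketch stands in for the real mapper only for the purpose of counting loci; the function names and the toy genome are hypothetical, and a read that equals its own reverse complement would be counted once per strand at the same position.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]

def exact_loci(read, genome):
    """List (strand, position) for every perfect match of read in genome.

    Overlapping occurrences are reported. Positions are 0-based offsets
    of the match on the forward strand of the genome string.
    """
    hits = []
    for strand, query in (("+", read), ("-", reverse_complement(read))):
        pos = genome.find(query)
        while pos != -1:
            hits.append((strand, pos))
            pos = genome.find(query, pos + 1)
    return hits

def keep_read(read, genome, max_loci=10):
    """Keep reads that map perfectly to between 1 and max_loci loci."""
    n = len(exact_loci(read, genome))
    return 1 <= n <= max_loci

# Toy genome, for illustration only.
example_genome = "TTACCGTTTTACGGTTT"
example_hits = exact_loci("ACCGT", example_genome)
```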
miRNA identification by SVM pipeline
The sequences of the genomic loci to which reads mapped were considered candidate miRNAs. For each candidate miRNA, we extended the locus by 60 nt both upstream and downstream, and the resulting ~140-nt fragment was considered the candidate pre-miRNA. Each pair of candidate miRNA and pre-miRNA was subjected to folding and calculation of the ten classification features, and then classified by our SVM pipeline to identify authentic miRNAs. The SVM pipeline classifies input cases into a positive or a negative set according to the trained model.
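The flank extension step can be sketched as a simple window extraction. This is a minimal illustration that assumes 0-based, end-exclusive coordinates on a forward-strand sequence held in memory, with the window clipped at sequence boundaries; the function name and the toy sequence are hypothetical. Folding, feature calculation and SVM classification are not shown.

```python
def candidate_precursor(sequence, start, end, flank=60):
    """Extract a candidate pre-miRNA around a mapped read locus.

    sequence: chromosome or scaffold sequence as a string.
    start, end: 0-based, end-exclusive coordinates of the mapped read
    (coordinate convention is an assumption).
    flank: number of nt added on each side; clipped at sequence ends.
    """
    if not 0 <= start < end <= len(sequence):
        raise ValueError("locus outside sequence")
    left = max(0, start - flank)
    right = min(len(sequence), end + flank)
    return sequence[left:right]

# Toy sequence: a 22-nt locus of C's flanked by 100 nt on each side.
example_sequence = "A" * 100 + "C" * 22 + "G" * 100
example_precursor = candidate_precursor(example_sequence, 100, 122)
```

With a 22-nt read the extracted fragment is 142 nt long, in line with the ~140 nt stated above; reads of 18 to 25 nt give fragments of 138 to 145 nt.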
This work was supported by grants from Academia Sinica and National Science Council of Taiwan.
This article has been published as part of BMC Genomics Volume 11 Supplement 4, 2010: Ninth International Conference on Bioinformatics (InCoB2010): Computational Biology. The full contents of the supplement are available online at http://www.biomedcentral.com/1471-2164/11?issue=S4.
- Bartel DP: MicroRNAs: genomics, biogenesis, mechanism, and function. Cell. 2004, 116 (2): 281-297. 10.1016/S0092-8674(04)00045-5.
- Lin WC, Li SC, Shin JW, Hu SN, Yu XM, Huang TY, Chen SC, Chen HC, Chen SJ, Huang PJ: Identification of microRNA in the protist Trichomonas vaginalis. Genomics. 2009, 93 (5): 487-493. 10.1016/j.ygeno.2009.01.004.
- Griffiths-Jones S, Grocock RJ, van Dongen S, Bateman A, Enright AJ: miRBase: microRNA sequences, targets and gene nomenclature. Nucleic Acids Res. 2006, 34 (Database issue): D140-144. 10.1093/nar/gkj112.
- Wheeler BM, Heimberg AM, Moy VN, Sperling EA, Holstein TW, Heber S, Peterson KJ: The deep evolution of metazoan microRNAs. Evol Dev. 2009, 11 (1): 50-68. 10.1111/j.1525-142X.2008.00302.x.
- Prochnik SE, Rokhsar DS, Aboobaker AA: Evidence for a microRNA expansion in the bilaterian ancestor. Dev Genes Evol. 2007, 217 (1): 73-77. 10.1007/s00427-006-0116-1.
- Niwa R, Slack FJ: The evolution of animal microRNA function. Curr Opin Genet Dev. 2007, 17 (2): 145-150. 10.1016/j.gde.2007.02.004.
- Gerlach D, Kriventseva EV, Rahman N, Vejnar CE, Zdobnov EM: miROrtho: computational survey of microRNA genes. Nucleic Acids Res. 2009, 37 (Database issue): D111-117. 10.1093/nar/gkn707.
- Stark A, Kheradpour P, Parts L, Brennecke J, Hodges E, Hannon GJ, Kellis M: Systematic discovery and characterization of fly microRNAs using 12 Drosophila genomes. Genome Res. 2007, 17 (12): 1865-1879. 10.1101/gr.6593807.
- Grimson A, Srivastava M, Fahey B, Woodcroft BJ, Chiang HR, King N, Degnan BM, Rokhsar DS, Bartel DP: Early origins and evolution of microRNAs and Piwi-interacting RNAs in animals. Nature. 2008, 455 (7217): 1193-1197. 10.1038/nature07415.
- Harismendy O, Ng PC, Strausberg RL, Wang X, Stockwell TB, Beeson KY, Schork NJ, Murray SS, Topol EJ, Levy S: Evaluation of next generation sequencing platforms for population targeted sequencing studies. Genome Biol. 2009, 10 (3): R32. 10.1186/gb-2009-10-3-r32.
- Peters LM, Belyantseva IA, Lagziel A, Battey JF, Friedman TB, Morell RJ: Signatures from tissue-specific MPSS libraries identify transcripts preferentially expressed in the mouse inner ear. Genomics. 2007, 89 (2): 197-206. 10.1016/j.ygeno.2006.09.006.
- Wang X, Sun Q, McGrath SD, Mardis ER, Soloway PD, Clark AG: Transcriptome-wide identification of novel imprinted genes in neonatal mouse brain. PLoS One. 2008, 3 (12): e3839. 10.1371/journal.pone.0003839.
- Yassour M, Kaplan T, Fraser HB, Levin JZ, Pfiffner J, Adiconis X, Schroth G, Luo S, Khrebtukova I, Gnirke A: Ab initio construction of a eukaryotic transcriptome by massively parallel mRNA sequencing. Proc Natl Acad Sci U S A. 2009, 106 (9): 3264-3269. 10.1073/pnas.0812841106.
- Qi W, Kaser M, Roltgen K, Yeboah-Manu D, Pluschke G: Genomic diversity and evolution of Mycobacterium ulcerans revealed by next-generation sequencing. PLoS Pathog. 2009, 5 (9): e1000580. 10.1371/journal.ppat.1000580.
- Trick M, Long Y, Meng J, Bancroft I: Single nucleotide polymorphism (SNP) discovery in the polyploid Brassica napus using Solexa transcriptome sequencing. Plant Biotechnol J. 2009, 7 (4): 334-346. 10.1111/j.1467-7652.2008.00396.x.
- Hillier LW, Marth GT, Quinlan AR, Dooling D, Fewell G, Barnett D, Fox P, Glasscock JI, Hickenbotham M, Huang W: Whole-genome sequencing and variant discovery in C. elegans. Nat Methods. 2008, 5 (2): 183-188. 10.1038/nmeth.1179.
- Shen Y, Sarin S, Liu Y, Hobert O, Pe'er I: Comparing platforms for C. elegans mutant identification using high-throughput whole-genome sequencing. PLoS One. 2008, 3 (12): e4012. 10.1371/journal.pone.0004012.
- Holt KE, Parkhill J, Mazzoni CJ, Roumagnac P, Weill FX, Goodhead I, Rance R, Baker S, Maskell DJ, Wain J: High-throughput sequencing provides insights into genome variation and evolution in Salmonella Typhi. Nat Genet. 2008, 40 (8): 987-993. 10.1038/ng.195.
- Morin RD, O'Connor MD, Griffith M, Kuchenbauer F, Delaney A, Prabhu AL, Zhao Y, McDonald H, Zeng T, Hirst M: Application of massively parallel sequencing to microRNA profiling and discovery in human embryonic stem cells. Genome Res. 2008, 18 (4): 610-621. 10.1101/gr.7179508.
- Glazov EA, Cottee PA, Barris WC, Moore RJ, Dalrymple BP, Tizard ML: A microRNA catalog of the developing chicken embryo identified by a deep sequencing approach. Genome Res. 2008, 18 (6): 957-964. 10.1101/gr.074740.107.
- Flynn K, Haasch M, Shadwick DS, Johnson R: Real-time PCR-based prediction of gonad phenotype in medaka. Ecotoxicol Environ Saf.
- Taneda Y, Konno S, Makino S, Morioka M, Fukuda K, Imai Y, Kudo A, Kawakami A: Epigenetic control of cardiomyocyte production in response to a stress during the medaka heart development. Dev Biol.
- Griffiths-Jones S, Saini HK, van Dongen S, Enright AJ: miRBase: tools for microRNA genomics. Nucleic Acids Res. 2008, 36 (Database issue): D154-158.
- Li SC, Chan WC, Hu LY, Lai CH, Hsu CN, Lin WC: Identification of homologous microRNAs in 56 animal genomes. Genomics.
- Chen X, Li Q, Wang J, Guo X, Jiang X, Ren Z, Weng C, Sun G, Wang X, Liu Y: Identification and characterization of novel amphioxus microRNAs by Solexa sequencing. Genome Biol. 2009, 10 (7): R78. 10.1186/gb-2009-10-7-r78.
- Du T, Zamore PD: microPrimer: the biogenesis and function of microRNA. Development. 2005, 132 (21): 4645-4652. 10.1242/dev.02070.
- Sewer A, Paul N, Landgraf P, Aravin A, Pfeffer S, Brownstein MJ, Tuschl T, van Nimwegen E, Zavolan M: Identification of clustered microRNAs using an ab initio prediction method. BMC Bioinformatics. 2005, 6: 267. 10.1186/1471-2105-6-267.
- Wang M, Zhang X, Zhao H, Wang Q, Pan Y: FoxO gene family evolution in vertebrates. BMC Evol Biol. 2009, 9: 222. 10.1186/1471-2148-9-222.
- Berezikov E, Thuemmler F, van Laake LW, Kondova I, Bontrop R, Cuppen E, Plasterk RH: Diversity of microRNAs in human and chimpanzee brain. Nat Genet. 2006, 38 (12): 1375-1377. 10.1038/ng1914.
- Ruby JG, Stark A, Johnston WK, Kellis M, Bartel DP, Lai EC: Evolution, biogenesis, expression, and target predictions of a substantially expanded set of Drosophila microRNAs. Genome Res. 2007, 17 (12): 1850-1864. 10.1101/gr.6597907.
- Landgraf P, Rusu M, Sheridan R, Sewer A, Iovino N, Aravin A, Pfeffer S, Rice A, Kamphorst AO, Landthaler M: A mammalian microRNA expression atlas based on small RNA library sequencing. Cell. 2007, 129 (7): 1401-1414. 10.1016/j.cell.2007.04.040.
- Li SC, Pan CY, Lin WC: Bioinformatic discovery of microRNA precursors from human ESTs and introns. BMC Genomics. 2006, 7: 164. 10.1186/1471-2164-7-164.
- Grad Y, Aach J, Hayes GD, Reinhart BJ, Church GM, Ruvkun G, Kim J: Computational and experimental identification of C. elegans microRNAs. Mol Cell. 2003, 11 (5): 1253-1263. 10.1016/S1097-2765(03)00153-9.
- Wang X, Zhang J, Li F, Gu J, He T, Zhang X, Li Y: MicroRNA identification based on sequence and structure alignment. Bioinformatics. 2005, 21 (18): 3610-3614. 10.1093/bioinformatics/bti562.
- Artzi S, Kiezun A, Shomron N: miRNAminer: a tool for homologous microRNA gene search. BMC Bioinformatics. 2008, 9 (1): 39. 10.1186/1471-2105-9-39.
- Lynn Lamoreux M, Kelsh RN, Wakamatsu Y, Ozato K: Pigment pattern formation in the medaka embryo. Pigment Cell Res. 2005, 18 (2): 64-73. 10.1111/j.1600-0749.2005.00216.x.
- Liu C, Zhao X: MicroRNAs in adult and embryonic neurogenesis. Neuromolecular Med. 2009, 11 (3): 141-152. 10.1007/s12017-009-8077-y.
- Weese D, Emde AK, Rausch T, Doring A, Reinert K: RazerS--fast read mapping with sensitivity control. Genome Res. 2009, 19 (9): 1646-1654. 10.1101/gr.088823.108.
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.