- Research article
- Open Access
Identification of common carp (Cyprinus carpio) microRNAs and microRNA-related SNPs
BMC Genomics volume 13, Article number: 413 (2012)
MicroRNAs (miRNAs) exist pervasively across viruses, plants and animals and play important roles in the post-transcriptional regulation of genes. In the common carp, miRNA targets have not been investigated. In model species, single-nucleotide polymorphisms (SNPs) have been reported to impair or enhance miRNA regulation as well as to alter miRNA biogenesis. SNPs are often associated with diseases or traits. To date, no studies into the effects of SNPs on miRNA biogenesis and regulation in the common carp have been reported.
Using homology-based prediction combined with small RNA sequencing, we have identified 113 common carp mature miRNAs, including 92 conserved miRNAs and 21 common carp specific miRNAs. The conserved miRNAs had significantly higher expression levels than the specific miRNAs. The miRNAs were clustered into three phylogenetic groups. Totally 394 potential miRNA binding sites in 206 target mRNAs were predicted for 83 miRNAs. We identified 13 SNPs in the miRNA precursors. Among them, nine SNPs had the potential to either increase or decrease the energy of the predicted secondary structures of the precursors. Further, two SNPs in the 3’ untranslated regions of target genes were predicted to either disturb or create miRNA-target interactions.
The common carp miRNAs and their target genes reported here will help further our understanding of the role of miRNAs in gene regulation. The analysis of the miRNA-related SNPs and their effects provided insights into the effects of SNPs on miRNA biogenesis and function. The resource data generated in this study will help advance the study of miRNA function and phenotype-associated miRNA identification.
MicroRNAs (miRNAs) are endogenous small non-coding RNA molecules that are an average of 22 bp long . They exist widely in metazoa, viridiplantae and viruses [2–7] and play essential roles in gene expression regulation by binding to their target genes, leading to translational repression or transcript degradation [8, 9]. The role of miRNAs in the regulation of genes leads to their involvement in diverse biological processes that include animal organ development and growth , cell differentiation and proliferation , innate and adaptive immunity , and signal transduction . Many studies have identified miRNAs in fish species [14–18]. After miRNAs have been identified in a species, one of the main aims is to identify the miRNA target genes. Computational predictions have been widely applied to miRNA target identification . Common carp is one of the main commercial fishes captured and cultured worldwide. Its agricultural production accounts for nearly 30% of fresh water fish cultured in China. However, miRNAs targets have not been investigated in common carp.
MiRNAs bind to their target genes based on sequence complementarity. Mutations in miRNAs or in their target sites have been found to either create or disturb miRNA-target interactions. Many studies have reported the effect of SNPs in the 3’untranslated regions (3’UTRs) of the target genes. For example, in Texel sheep, a SNP in the 3’UTR of GDF8 created a binding site targeted by two miRNAs, miR-1 and miR-206, resulting in GDF8 inhibition and increased muscular hypertrophy . In contrast, a SNP in the 3’UTR of the SPL14 gene in rice perturbed osa-miR156-associated translational regulation which led to an improved rice plant with reduced tiller number, better gain yield and enhanced lodging resistance . In addition, SNPs in miRNAs have been shown to affect miRNA regulation resulting in phenotypic changes [20, 22, 23]. For instance, in Chinese Holstein cattle, a SNP in bta-miR-484 disrupted miRNA binding which relieved the transcriptional repression and increased the expression of the target gene, the heat-shock transcription factor 1 . Moreover, SNPs in miRNAs can also affect miRNA biogenesis. Two mutations in the seed region of hsa-miR-96 impaired the biogenesis of the miRNA and result in a significant reduction of mRNA targeting . Because of the recently recognized significance of SNPs in miRNA biogenesis and regulation, many reports have concentrated on collecting miRNA-related SNPs and investigating their influence on miRNA function [20, 26–28]. However, all these studies have focused on model species mainly because a substantial amount of information on miRNAs and SNPs is available for these species. For common carp, no reports of miRNA-related SNPs and their effects have been published so far.
In this study, we used a combinational strategy to identify miRNAs in common carp and characterized them based on their conservation and expression profiles. Next, we used target prediction software to predict the miRNA targets. After identifying the potential targets we scanned SNPs in the miRNAs and studied their effects on miRNA biogenesis and target alteration. Finally, we identified SNPs in targets’ 3’UTRs and predicted the influence of the identified SNPs on miRNA regulation of the target genes.
Identification of common carp miRNAs with computational prediction and small RNA-sequencing
The previously published high-quality (Q20) BAC end sequences (BES)  that were assembled into 38,883 genomic contigs, together with 49,669 common carp transcriptome contigs , were used as reference sequences for miRNA identification.
To identify common carp miRNAs, we firstly performed a homology-based prediction. The prediction pipeline is shown schematically in Additional file 1: Figure S1. We downloaded 16,822 animal miRNAs from miRBase 17.0 . The analysis of the hairpin structures of animal miRNA precursors using UNAfold  showed that over 96.53% of miRNA precursors satisfied the criteria for hairpin structures previously described by Fu et al.  [see Additional file 2: Figure S2. We identified 9,656 non-redundant miRNAs after removing identical miRNAs. After aligning the unique miRNAs to the reference sequences and carefully evaluating the hairpin structures, 81 conserved miRNAs were identified from common carp.
Next, a small RNA (sRNA) library was constructed from multiple tissues of 17 adult common carp and sequenced following the illumina protocol . A total of 11,665,437 raw reads were generated and 7,327,921 cleaned reads (62.82%) were obtained and used in the analysis (Figure 1A). The length of the cleaned reads peaked at 22 bp (Figure 1B). Using BLASTN searches, a total of 125,827 cleaned reads (1.72% out of clean reads) were identified as fragments of other sRNAs (rRNA, tRNA and snRNA) and another 18,994 reads (0.26% out of clean reads) aligned to common carp repeats (Figure 1C). These were removed from the dataset and the remaining 7,183,100 reads (98% out of clean reads) were used for miRNA identification with MIREAP. A total of 68 sRNAs were identified as reliable miRNAs. Among them, 36 miRNAs were homologous to known animal miRNAs and the precursors of an additional 11 miRNAs could be aligned to the NCBI non-redundant nucleotide database using BLASTN with an e-value of 1e-10. The remaining 21 miRNA had no hits, indicating that they might be common carp specific.
Finally, the results from the two methods were integrated into a non-redundant dataset that included 92 conserved miRNAs and 21 specific miRNAs. The length distribution of the miRNAs was between 20 and 26 bp. Detailed information about the predicted miRNAs, including the prediction method, conservation, reference sequences, location in the reference sequences, precursor sequences, hairpin structures, minimum folding free energies, mature sequences and A + U content is available in Additional file 3: Table S1, Additional file 4: Table S2, Additional file 5: Table S3, and Additional file 6: Table S4.
To validate the reliability of the predicted miRNAs, their expression in the RNAs from the pooled-tissues was examined by PCR. Ten miRNAs, including conserved and specific miRNAs [see Additional files 7: Table S5], were selected randomly from the dataset. The PCR results showed that all the selected miRNAs could be amplified [see Additional file 8: Figure S3], indicating that these miRNAs were correctly identified and truly expressed.
Characterization of common carp miRNAs
Many miRNAs are often located close to each other, forming gene clusters that have a common transcription promoter . We discovered five miRNA clusters (Table 1) among the common carp miRNAs. Four clusters consisted of conserved miRNAs while one cluster had one conserved miRNA and one specific miRNA. Interestingly, three of the clusters contained miRNAs from the mir-430 family. In the zebrafish genome (Zv_9), these three clusters are located close together, suggesting that they might also be part of one bigger miRNA cluster in the common carp genome. We looked for members of the mir-430 family in other animals and found that this family existed only in zebrafish, medaka and sea lamprey. This result indicates that the mir-430 family might be fish specific. Further experiments are warranted to study the function of this family of miRNAs.
The sRNA sequencing allowed us to identify miRNAs and also to determine the expression levels of the miRNAs [15, 28]. The number of reads that could be aligned to each of the miRNAs was assumed to represent the expression level of the miRNA. We found a lot of divergence in the abundance of the different miRNAs (Figure 2). In general, the conserved miRNAs had higher expression than the specific miRNAs (Mann Whitney test, P value = 7.91e-3). This result is consistent with previous observations that non-conserved miRNAs are often expressed at lower levels than miRNAs with tissue-specific or developmental-specific expression patterns [34–36]. The seven most abundant miRNAs, each with over 8,000 reads, were conserved miRNAs. These results agree with the conclusion that evolutionarily conserved miRNAs are often the most abundant . In contrast, miR-204*, miR-406b* and let-7a* had extremely low frequency in our library (frequency = 2), consistent with the observation that most miRNA*s showed weak expression and that their expression levels were much lower than their corresponding miRNAs . This is because miRNA*s are rapidly degraded during the biogenesis of mature miRNAs .
The conservation of miRNAs has been used to study miRNA phylogenetic evolution . Twenty-one conserved miRNA families were clustered into three groups based on their phylogenetic distributions (Figure 3). Six miRNA families (mir-124, mir-9, mir-137, mir-7, mir-306 and let-7) were shared by both protostomes and deuterostomes; 17 miRNAs families were present only in vertebrates; and the remaining two miRNA families (mir-430 and mir-727) were identified only in fish and were possibly fish-specific miRNAs. Six miRNA families (mir-3529, mir-467, mir-297, mir-28, mir-3065 and mir-541) were first detected in fish, but have been identified in other species.
Expression profile of miRNAs among the developmental stages
The expression of homologous miRNAs in other species may help us infer the expression of the common carp miRNAs. To study the expression pattern of miRNAs during the embryo development, we selected eight miRNAs of which homologous miRNAs were related to embryo development [38–42]. Another four common carp specific miRNAs were also randomly selected. We found that the expression patterns of some of the miRNA families were associated with certain development stages. Hierarchical clustering of the RT-qPCR products showed three major expression patterns: a) fertilized oocytes expression (0 h) which might be related to maternal miRNAs; b) embryonic expression (72 hpf) during hatching; and c) larvae expression (1 dph, 5 dph and 10 dph) when the larvae undertake exogenous feeding (Figure 4).
The expression levels of all the miRNAs varied among the developmental stages. The expression levels decreased from the oocytes stage to embryonic stage and then increased again in the larvae stage. At the embryonic stage, the expression of the majority of miRNAs was lower than at any of the other stages. The most dominant miRNAs at the embryonic stage were from the miRNA-204 family. The miR-204 family has been reported to be associated with mouse eye development , indicating that this miRNA family may play roles in organogenesis in the common carp embryo.
Hierarchical clustering also indicated that the miRNAs selected for RT-qPCR were classified into two groups based on their expression patterns. In general, the expression of miRNAs in the first group (s0007-5p, miR-124-5p, miR-460b-5p, miR-150, miR-140-3p, miR-204, and miR-22a) was higher than the expression of the miRNAs in the second group (s0013-5p, s0011-5p, s0010-5p, miR-541, miR-727-5p, and miR-3065-3p). The similar expression patterns of the miRNAs in each of the groups may imply that they share similar functions, which would help us understand the function of the specific miRNAs based on the known function of other miRNAs in the same group.
Prediction of miRNA targets
To predict miRNA target genes in common carp, we downloaded 506 mRNAs containing 3’UTR information from GenBank  and used a combination of TargetScan  and PITA  to predict miRNA targets. We identified 394 miRNA-binding sites in 206 mRNAs targeted by 83 miRNAs; an average of 2.5 mRNAs per miRNA. The predicted target genes were found to be involved in a broad range of biological functions; for instance, transferase activity, hydrolase activity, nucleotide binding, protein binding and signal transducer activity [see Additional file 9: Table S6. Strikingly, IGF-IRb [GenBank: AY144592] had the most miRNA binding sites; 13 sites targeted by 12 miRNAs.
Furthermore, we found 18 miRNAs that were generated from the antisense strands of 19 mRNAs. The miRNAs were perfectly complementary to the sense mRNAs [see Additional file 10: Table S7. MiRNAs in such arrangements might function to depress sense mRNA expression or to disrupt mRNA splicing .
To investigate the regulation of predicted targets by miRNAs, the expression of four miRNA-target pairs was examined by reverse transcribed-quantitative real-time PCR (RT-qPCR) analyses during five developmental stages. All four pairs showed significantly reciprocal expression patterns (Figure 5 A-D), consistent with the observation that miRNAs in mammalian predominantly function to decrease target gene levels .
Influence of SNPs in miRNAs on the energy of the miRNA secondary structure
SNPs in precursors have been reported to enhance or interrupt miRNA biogenesis [49, 50]. We identified 13 SNPs in seven precursors by mapping the cleaned sRNA sequencing reads to the precursors. These SNPs were classified into four types: 1) two SNPs in the mature miRNA; 2) six SNPs in the stem regions; 3) three SNPs in the loops; and 4) two SNPs in the anti-stem regions (Table 2).
In addition, we PCR-amplified and resequenced five miRNA precursors surrounding nine putative SNPs by conventional Sanger sequencing. All the regions were successfully covered. When the Sanger reads were compared with the reference precursors, we found that seven SNPs were successfully detected with Sanger sequencing [see Additional file 11: Figure S4].
Next, we investigated the effect of the SNPs on the energy change (ΔΔG) of the secondary structures of the precursors. We found that SNPs in the loop regions did not change the energy of the structure while the other 10 SNPs (types 1, 2 and 4) did change the energy of the predicted secondary structures. For nine of the SNPs, the absolute energy change values were ≥ 0.3 kcal/mol, the minimum energy change reported to be required to change the production of mature miRNAs . Gong et al. summarized the rule that if a SNP decreases the hairpin structure energy, the production of the mature miRNA will be reduced; and if the SNP increases the energy, the production of the mature miRNA will increase . The SNPs reported here might therefore enhance or reduce the production of the mature miRNAs.
MiRNA-mRNA interaction alteration by SNPs in targets
SNPs in the miRNA binding site can alter the miRNA-mRNA interaction . To identify SNPs in common carp mRNA 3’UTRs, we aligned the 298,817 trimmed 454 transcriptome reads to 506 mRNAs containing 3’UTR annotation and identified 464 SNPs in 95 mRNAs. A total of 57 SNPs were located in the 3’UTRs of 33 mRNAs. These mRNAs were called SNP-type mRNAs. In addition, we PCR-amplified and resequenced ten randomly selected SNPs in nine mRNA 3’UTRs. A comparison between the Sanger sequences and the reference 3’UTRs showed that nine SNPs were successfully detected by conventional sequencing [see Additional file 12: Figure S5.
Fifteen SNP-type mRNAs were predicted to have 35 sites targeted by 27 miRNAs. Most of these binding sites existed in both the wild type mRNAs and SNP-type mRNAs. However, one SNP in the 3’UTR of C1rs-A [GenBank: AB042609] was predicted to result in target gain and one SNP in the 3’UTR of cyp c1.02 [GenBank: AJ292212] was predicted to lead to target loss (Table 3). These two SNPs were also confirmed by Sanger sequencing.
The aim of this study was to predict miRNA targets in common carp and to in silico examine the effects of SNPs on miRNA biogenesis and target binding. Recently Yan et al.  identified 188 common carp miRNAs by sequencing sRNA library built from the muscle tissue. Our dataset only covered 34 of the miRNAs in their dataset. The difference in the two datasets is mainly because Yan et al. aligned the sequencing reads to the zebrafish genome to predict the miRNAs, while we used common carp sequences to identify the common carp miRNAs. The researchers did not systematically characterize the common carp miRNAs nor did they attempt to predict the miRNA targets. In the present study, we identified miRNAs and predicted the miRNA targets which will make it into a useful resource for studying miRNA function.
In our study, we classified the conserved miRNAs shared by other animals into different groups to help us investigate the evolution of these miRNAs during the divergence of the animals. We found that many of the miRNAs were conserved among several animals, suggesting that they may have essential biological functions. The conservation of these miRNAs may help us infer the functions of these miRNAs in common carp based on their known functions in other species. We discovered six conserved miRNA families in fish that are being reported here for the first time. Two of the miRNA families detected in the common carp were conserved only in fish, indicating that these miRNAs might be involved in the adaption to the aquatic environment. Strikingly, 21 miRNAs are found only in common carp. It is possible that these miRNAs are involved in regulatory interactions during adaptation to the common carp specific environment.
In this study, we focused on the identification and characterization of miRNAs in common carp. Recently, many studies identified miRNA-offset RNAs (moRNAs) generated from sequences immediately adjacent to the miRNA and miRNA* [53, 54], although their functional role remains to be elucidated and these sRNAs had no obvious sequence or structural features. Indeed, after aligning cleaned sRNA sequencing reads to miRNA precursors using BLASTN with an identity value of 100%, we found that the non-miRNA regions in five pre-miRNA loci had matched sRNA reads adjacent to miRNAs, indicating that these sRNAs might be moRNAs Additional file 13: Figure S6. Another class of miRNA variant is isomiRs, which have variations with respect to the reference miRNAs . These isomiRs are mainly generated by either a shift of Drosha and Dicer in the cleavage site or nucleotide additions at the 3’ end . In our sRNA dataset, we also found that 1,053 sRNAs were identical to the identified miRNAs with either longer or shorter sequences, suggesting that these sRNAs are putative isomiRs.
MiRNA target identification is important to predict the functions of the miRNAs. Although computational approaches have been widely used to predict miRNA targets, most of these methods suffer from high false positive rates . In the present study, we combined the results of two popular methods to predict miRNA targets. Using a set of 59 negative examples from TarBase , we estimated that the false positive rates of TargetScan and PITA were 28.9% and 35.5%, respectively. Significantly, a combination of these two methods gave a false positive rate of 13.5%, much lower than by either of the methods alone Additional file 14: Figure S7. We assume, therefore, that the strategy we have applied here should have reduced the number of false positive miRNA targets to a low error rate.
Because of the limited number of publicly available common carp genes with 3’UTR information, the number of targets per miRNA that we could identify was lower than the numbers reported previously [59–62]. Even so, our prediction revealed that many of the genes were regulated by multiple miRNAs. By targeting a gene with multiple miRNAs the expression of the gene can be subtly and precisely regulated. As further information is obtained, we can expect further improvements in common carp genome annotation which will provide a better overview of its miRNAs and targets.
No common carp SNPs are currently publicly available, making the identification of SNPs in miRNAs difficult. A number of challenges including sequencing errors and post-transcriptional editing, would also lead to the sequence variability between sRNAs and reference miRNAs. To avoid artificial variability by sequencing errors, we have filtered out the low-quality reads before aligning sRNAs to reference sequences. Re-sequencing genomic regions of randomly selected miRNAs and 3’UTRs with SNPs using Sanger sequencing revealed that most SNPs were successfully detected, indicating that the sequence variability was mainly from SNPs in genomic DNA. The left undetected sequences might be from miRNA post-transcriptional editing. Many studies reported that miRNA post-transcriptional editing could alter processing of some miRNAs  or modulate the target specificity of the mature miRNA . Therefore, it is necessary to further analyze miRNA post-transcriptional editing in common carp in the future.
We focused on the prediction of the potential effects of the SNPs in the miRNA genes on miRNA production and target selection. Two SNPs in stem or anti-stem regions of the precursors led to energy changes of over 2 kcal/mol (Table 2) might greatly change the miRNA product. Two SNPs were located in the mature miRNAs. Because of the limited number of SNPs and target 3’UTRs that were obtained, we did not find any potential SNPs that change miRNA targets. The targets of miRNAs can be altered by variations in the target sequences . MiRNA target loss may increase the expression of the mRNA or protein, while target gain may repress protein expression or degrade the transcript. Here, using our pipeline we identified many SNPs in potential miRNA target sites. These SNPs will be important candidates for causal variants of common carp phenotypes.
This study provides data for the identification and characterization of common carp miRNAs and their potential targets. These results will help further our understanding of common carp miRNA function in gene regulation. The study further identified SNPs in miRNAs and their target genes and the effects of these SNPs on miRNA biogenesis and function was discussed. The resource data described here will be a useful resource for the scientific community to study miRNAs function and to find SNP-associated phenotypes.
The wild common carp that were used in the experiments were bred in the Heilongjiang Fishery Institute of the Chinese Academy of Fishery Science, or obtained from the Freshwater Fisheries Research Center of the Chinese Academy of Fishery Sciences.
For sRNA sequencing, a total of 17 common carp were maintained in out-door tanks with running fresh water at 23°C and fed a commercial diet twice daily. The fish were anaesthetized with eugenol and brain, skin, liver, muscle, spleen, head kidney, body kidney, intestine, gill and heart tissue were carefully separated and snap stored at −80°C until required for RNA extraction. Equivalent concentrations of the RNAs from the tissues of the 17 fish were pooled for sequencing.
Eggs from female adult common carp were fertilized following the procedure described by Yan et al. . Common carp embryos were collected at the following developmental stages: fertilized oocytes (0 h), 72 hours post-fertilization (hpf) embryos, 1 day post-hatching (dph) larva, 5 dph larva and 10 dph larva.
For SNP validation, 20 adult common carp were randomly selected and blood DNA was extracted. Blood samples (0.5 to 1 ml) were collected in 1 ml syringes primed with EDTA anticoagulation agent. The DNA was isolated with a QIAamp DNA Blood Midi Kit (QIAGEN, Hilden, Germany) and quantified using a spectrophotometer.
All animal experiments reported in this study conformed to the Chinese Academy of Fishery Science, Beijing, guidelines for the care and use of laboratory animals, and to the National Institutes of Health Guide for Care and Use of Laboratory Animals.
We used common carp BESs and transcriptome data as reference sequences to identify miRNAs because the common carp genome is still unfinished. Previously, we published 75,744 high-quality common carp BESs with a minimum length of 50 bp that were generated after base calling and trimming for E. coli and vector sequences . The BESs were assembled into genomic contigs using the Celera assembler  with default parameters. An additional 49,669 transcriptome contigs that we have reported  were also used as reference sequences.
Homology-based prediction of common carp miRNAs
To identify potentially conserved miRNAs from the known mature miRNAs, we downloaded all the animal miRNAs from the miRBase release 17.0  and removed identical mature miRNAs. The remaining non-redundant miRNAs were aligned against the reference sequences using NCBI’s BLASTN program. To select an appropriate identity threshold for BLASTN, we aligned the miRNAs in the same families using BLASTN and found that the minimum sequence identity among the miRNAs was 90% [see Additional file 15: Figure S8. Therefore, we set the identity threshold to predict common carp miRNAs at 90% . We extracted the mapped regions and 60 bp bilateral flanking sequences from the reference sequences and then ran UNAfold  to predict the hairpin structures of the extracted sequences. A sequence was considered to be a candidate miRNA precursor if the hairpin structure satisfied the criteria previously described by Fu et al. : (1) the minimum free energy (ΔG) is ≤ −15 kcal/mol; (2) the stem region includes at least 80% of the mature miRNA; (3) the number of allowed errors in one bulge is ≤ 18 bp; (4) the hairpin is > 53 bp long; (5) the loop region is < 22 bp; and (6) the number of mismatches between the miRNA and the anti-stem sequence are ≤ 6 bp.
Identification of common carp miRNAs from small RNA sequencing
A sRNA library was constructed and sequenced on an illumina Genome Analyzer following the manufacturer’s protocol. The entire set of reads that was used for miRNA identification was submitted to NCBI’s Gene Expression Omnibus [GEO:GSE35131].
We used MIREAP  to clean the initial sequencing reads by removing poor quality reads, 5’ adapter, 3’ adapter, reads containing poly(A) stretches, and reads less than 18 bp long. To annotate the sRNAs as rRNA, tRNA, snRNA and repeats, the cleaned reads were searched against the Rfam database , common carp ribosomal RNAs collected from GenBank, and common carp repeats  using BLASTN with an e-value of 0.01. All unaligned reads were then mapped to the reference sequences for miRNA identification using SOAP . The secondary structures of the matched reads were generated using RNAfold  with default parameters and analyzed using MIREAP.
RT-PCR and RT-qPCR
To detect miRNA expression, total RNA was extracted from the brain, muscle and liver of three young common carp (average weight: 200 g) with Trizol Reagent (Invitrogen, Carlsbad, USA). The RNAs from these tissues were mixed together in equivalent concentrations. The total RNA was polyadenylated with poly (A) polymerase (GeneCopoeia, Rockville, USA) to add poly(A) tails at the 3’ ends of the miRNAs. Then the poly(A) miRNAs were reverse transcribed using M-MLVRTase and a unique Oligo-dT adaptor primer in an All-in-OneTM miRNA qRT-PCR Detection Kit (GeneCopoeia, Rockville, USA). The forward primers used in the PCR that was run on an ABI 9700 thermal cycler (Applied Biosystems, Foster City, USA), were specific to the miRNAs. The PCR products were separated on 3% agarose gel and stained with ethidium bromide.
RT-qPCR was used to validate the expression profiles of the selected miRNAs. The total RNAs from the five developmental stages were reverse transcribed as described above for the miRNAs. The RT-qPCR was performed using the ABI 7500 sequence detection system (Applied Biosystems, Foster City, USA). U6 small nuclear RNA was used as an endogenous control for the miRNAs. All reactions were run in triplicate for each gene. The relative amount of miRNA to U6 RNA was calculated using the 2 − ΔΔCT method. To differentiate the expression of the selected miRNAs and to categorize them according to their expression patterns, a heatmap chart was drawn by transforming the normalized data to a log 2 scale for visualization purpose. Hierarchical clustering was performed using the R program version 2.10.1 . All primers for RT-qPCR were listed in the Additional file 16: Table S8.
Prediction of common carp miRNA target genes
To identify the putative miRNA target genes, we downloaded 506 common carp mRNAs containing 3’UTR annotation from NCBI’s dbEST  and extracted the 3’UTRs. We combined the results of two popular programs, TargetScan  and PITA , to identify candidate target genes. Briefly, we ran TargetScan with the default parameters and used PITA with the following parameters: 1) a seed of 6–8 bases; 2) no mismatches; 3) up to one G:U wobble in 7- or 8-mers; and 4) ΔG low than −9 Kcal/mol. When a miRNA-target pair was predicted by both TargetScan and PITA, it was considered as a miRNA target.
RT-qPCR was also used to detect the expression pattern of four selected miRNA-transcript pairs. The total RNAs from five developmental stages were treated with RNase-free DNase (Ambion, Austin, USA) according to the manufacturer’s instructions. Reverse transcription was performed with oligo (dT) primers using the First Strand cDNA Synthesis Kit (Fermentas, Burlington, Canada). RT-qPCR was performed using an ABI 7500 Sequence Detection System (Applied Biosystems, Foster City, CA, USA) with SYBR Premix Ex Taq II (Takara, Shiga, Japan) and the 2 − ΔΔCT method. The expression of each target gene was normalized to that of beta-actin. The primers used were listed in Additional file 17: Table S9.
SNPs in the miRNAs
Cleaned reads were aligned against the common carp miRNA precursors using MAQ software (parameters set as: -N 17; -E 0; -q 20; -e 2; -D 1000)  to call the SNPs. The SNPs in the pre-miRNAs were classified into four types: 1) SNPs in the mature miRNAs; 2) SNPs in the stem region but not the mature region; 3) SNPs in the loops; and 4) SNPs in the anti-stem regions. To study the effect of SNPs on miRNA biogenesis, we calculated the second structure energy of the different SNP-type precursors using RNAfold  and compared the energy changes between SNP-type pre-miRNAs and wild type pre-miRNAs. In addition, both TargetScan and PITA were used to scan the binding sites for type 1 miRNAs using the same parameters as we used to predict target sites in the wild-type miRNAs.
SNPs in the mRNA 3’UTRs
The previously published common carp 454 transcriptome reads [SRA:SRA009366] were cleaned with SolexaQA package  and the low-quality reads including poly(A/T) sequences were filtered out . The cleaned 454 reads were aligned to the 506 common carp mRNAs containing 3’UTR information using BWA . Reads aligned to multiple mRNAs were discarded to avoid ambiguity and only the uniquely mapped reads were selected for further analysis. We identified SNPs in the selected mRNAs using SAMtools  (Q value = 20). We focused on SNPs in the 3’UTRs of the mRNAs and searched for miRNA-transcript pairs in the SNP-type 3’UTRs with all the common carp miRNAs. When a miRNA-target pair was predicted in the wild-type target by both TargetScan and PITA but was not predicted in the SNP-type 3’UTR by either TargetScan or PITA, we considered this as a target loss. Conversely, if a miRNA-target pair was predicted in the SNP-type 3’UTR target but not in the wild-type 3’UTR by either TargetScan or PITA, this was considered as a target gain.
PCR-based validation of SNPs
Candidate SNPs including those which might interfere with miRNA biogenesis or target alteration were validated by PCR analysis using the DNA from 20 wild common carp. A total of 14 primer pairs were designed based on the flanking sequences of the SNPs. All the primer sequences are listed in Additional file 18: Table S10. The PCR products were sequenced by an ABI 3730xl genetic analyzer using standard protocols. The Sanger sequences that were obtained were aligned with the miRNA precursors or mRNA sequences using CLUSTALW  to identify any differences.
Bartel DP: MicroRNAs: genomics, biogenesis, mechanism, and function. Cell. 2004, 116 (2): 281-297.
Berezikov E, Guryev V, van de Belt J, Wienholds E, Plasterk RH, Cuppen E: Phylogenetic shadowing and computational identification of human microRNA genes. Cell. 2005, 120 (1): 21-24.
Colaiacovo M, Subacchi A, Bagnaresi P, Lamontanara A, Cattivelli L, Faccioli p: A computational-based update on microRNAs and their targets in barley (Hordeum vulgare L.). BMC Genomics. 2010, 11: 595-
Lagos-Quintana M, Rauhut R, Meyer J, Borkhardt A, Tuschl T: New microRNAs from mouse and human. RNA. 2003, 9 (2): 175-179.
Reese TA, Xia J, Johnson LS, Zhou X, Zhang W, Virgin HW: Identification of novel microRNA-like molecules generated from herpesvirus and host tRNA transcripts. J Virol. 2010, 84 (19): 10344-10353.
Sunkar R, Zhou X, Zheng Y, Zhang W, Zhu JK: Identification of novel and candidate miRNAs in rice by high throughput sequencing. BMC Plant Biol. 2008, 8: 25-
Te JL, Dozmorov IM, Guthridge JM, Nguyen KL, Cavett JW, Kelly JA, Bruner GR, Harley JB, Ojwang JO: Identification of unique microRNA signature associated with lupus nephritis. PLoS One. 2010, 5 (5): e10344-
Filipowicz W, Jaskiewicz L, Kolb FA, Pillai RS: Post-transcriptional gene silencing by siRNAs and miRNAs. Curr Opin Struct Biol. 2005, 15 (3): 331-341.
Sontheimer EJ, Carthew RW: Silence from within: endogenous siRNAs and miRNAs. Cell. 2005, 122 (1): 9-12.
Kumar MS, Erkeland SJ, Pester RE, Chen CY, Ebert MS, Sharp PA, Jacks T: Suppression of non-small cell lung tumor development by the let-7 microRNA family. Proc Natl Acad Sci U S A. 2008, 105 (10): 3903-3908.
Xiao C, Calado DP, Galler G, Thai TH, Patterson HC, Wang J, Rajewsky N, Bender TP, Rajewsky K: MiR-150 controls B cell differentiation by targeting the transcription factor c-Myb. Cell. 2007, 131 (1): 146-159.
Ou J, Meng Q, Li Y, Xiu Y, Du J, Gu W, Wu T, Li W, Ding Z, Wang W: Identification and comparative analysis of the Eriocheir sinensis microRNA transcriptome response to Spiroplasma eriocheiris infection using a deep sequencing approach. Fish Shellfish Immunol. 2012, 32 (2): 345-352.
Inui M, Martello G, Piccolo S: MicroRNA control of signal transduction. Nat Rev Mol Cell Biol. 2010, 11 (4): 252-263.
Bizuayehu TT, Lanes CF, Furmanek T, Karlsen BO, Fernandes JM, Johansen SD, Babiak I: Differential expression patterns of conserved miRNAs and isomiRs during Atlantic halibut development. BMC Genomics. 2012, 13: 11-
Li SC, Chan WC, Ho MR, Tsai KW, Hu LY, Lai CH, Hsu CN, Hwang PP, Lin WC: Discovery and characterization of medaka miRNA genes by next generation sequencing platform. BMC Genomics. 2010, 11 (Suppl 4): S8-
Salem M, Xiao C, Womack J, Rexroad CE, Yao J: A microRNA repertoire for functional genome research in rainbow trout (Oncorhynchus mykiss). Mar Biotechnol (NY). 2010, 12 (4): 410-429.
Fu Y, Shi Z, Wu M, Zhang J, Jia L, Chen X: Identification and differential expression of microRNAs during metamorphosis of the Japanese flounder (Paralichthys olivaceus). PLoS One. 2011, 6 (7): e22957-
Xia JH, He XP, Bai ZY, Yue GH: Identification and characterization of 63 MicroRNAs in the Asian seabass Lates calcarifer. PLoS One. 2011, 6 (3): e17537-
Wang X, Zhang J, Li F, Gu J, He T, Zhang X, Li Y: MicroRNA identification based on sequence and structure alignment. Bioinformatics. 2005, 21 (18): 3610-3614.
Clop A, Marcq F, Takeda H, Pirottin D, Tordoir X, Bibe B, Bouix J, Caiment F, Elsen JM, Eychenne F, et al: A mutation creating a potential illegitimate microRNA target site in the myostatin gene affects muscularity in sheep. Nat Genet. 2006, 38 (7): 813-818.
Jiao Y, Wang Y, Xue D, Wang J, Yan M, Liu G, Dong G, Zeng D, Lu Z, Zhu X, et al: Regulation of OsSPL14 by OsmiR156 defines ideal plant architecture in rice. Nat Genet. 2010, 42 (6): 541-544.
Chen K, Rajewsky N: Natural selection on human microRNA binding sites inferred from SNP data. Nat Genet. 2006, 38 (12): 1452-1456.
Ryan BM, Robles AI, Harris CC: Genetic variation in microRNA networks: the implications for cancer research. Nat Rev Cancer. 2010, 10 (6): 389-402.
Li QL, Ju ZH, Huang JM, Li JB, Li RL, Hou MH, Wang CF, Zhong JF: Two novel SNPs in HSF1 gene are associated with thermal tolerance traits in Chinese Holstein cattle. DNA Cell Biol. 2011, 30 (4): 247-254.
Mencia A, Modamio-Hoybjor S, Redshaw N, Morin M, Mayo-Merino F, Olavarrieta L, Aguirre LA, del Castillo I, Steel KP, Dalmay T, et al: Mutations in the seed region of human miR-96 are responsible for nonsyndromic progressive hearing loss. Nat Genet. 2009, 41 (5): 609-613.
Sethupathy P, Collins FS: MicroRNA target site polymorphisms and human disease. Trends Genet. 2008, 24 (10): 489-497.
Ehrenreich IM, Purugganan MD: Sequence variation of MicroRNAs and their binding sites in Arabidopsis. Plant Physiol. 2008, 146 (4): 1974-1982.
Chen X, Li Q, Wang J, Guo X, Jiang X, Ren Z, Weng C, Sun G, Wang X, Liu Y, et al: Identification and characterization of novel amphioxus microRNAs by Solexa sequencing. Genome Biol. 2009, 10 (7): R78-
Xu P, Li J, Li Y, Cui R, Wang J, Zhang Y, Zhao Z, Sun X: Genomic insight into the common carp (Cyprinus carpio) genome by sequencing analysis of BAC-end sequences. BMC Genomics. 2011, 12: 188-
Wang JT, Li JT, Zhang XF, Sun XW: Transcriptome analysis reveals the time of the fourth round of genome duplication in common carp (Cyprinus carpio). BMC Genomics. 2012, 13 (1): 96-
Griffiths-Jones S, Saini HK, van Dongen S, Enright AJ: miRBase: tools for microRNA genomics. Nucleic Acids Res. 2008, 36 (Database issue): D154-D158.
Markham NR, Zuker M: UNAFold: software for nucleic acid folding and hybridization. Methods Mol Biol. 2008, 453: 3-31.
Chen K, Rajewsky N: The evolution of gene regulation by transcription factors and microRNAs. Nat Rev Genet. 2007, 8 (2): 93-103.
Zhao CZ, Xia H, Frazier TP, Yao YY, Bi YP, Li AQ, Li MJ, Li CS, Zhang BH, Wang XJ: Deep sequencing identifies novel and conserved microRNAs in peanuts (Arachis hypogaea L.). BMC Plant Biol. 2010, 10: 3-
Kwak PB, Wang QQ, Chen XS, Qiu CX, Yang ZM: Enrichment of a set of microRNAs during the cotton fiber development. BMC Genomics. 2009, 10: 457-
Zhang J, Xu Y, Huan Q, Chong K: Deep sequencing of Brachypodium small RNAs at the global genome level identifies microRNAs involved in cold stress response. BMC Genomics. 2009, 10: 449-
Glazov EA, Cottee PA, Barris WC, Moore RJ, Dalrymple BP, Tizard ML: A microRNA catalog of the developing chicken embryo identified by a deep sequencing approach. Genome Res. 2008, 18 (6): 957-964.
de Faria O, Cui QL, Bin JM, Bull SJ, Kennedy TE, Bar-Or A, Antel JP, Colman DR, Dhaunchak AS: Regulation of miRNA 219 and miRNA Clusters 338 and 17–92 in Oligodendrocytes. Front Genet. 2012, 3: 46-
Kloosterman WP, Steiner FA, Berezikov E, de Bruijn E, van de Belt J, Verheul M, Cuppen E, Plasterk RH: Cloning and expression of new microRNAs from zebrafish. Nucleic Acids Res. 2006, 34 (9): 2558-2569.
Visvanathan J, Lee S, Lee B, Lee JW, Lee SK: The microRNA miR-124 antagonizes the anti-neural REST/SCP1 pathway during embryonic CNS development. Genes Dev. 2007, 21 (7): 744-749.
Sonkoly E, Stahle M, Pivarcsi A: MicroRNAs and immunity: novel players in the regulation of normal immune function and inflammation. Semin Cancer Biol. 2008, 18 (2): 131-140.
Lynn FC, Skewes-Cox P, Kosaka Y, McManus MT, Harfe BD, German MS: MicroRNA expression is required for pancreatic islet cell genesis in the mouse. Diabetes. 2007, 56 (12): 2938-2945.
Karali M, Peluso I, Marigo V, Banfi S: Identification and characterization of microRNAs expressed in the mouse eye. Invest Ophthalmol Vis Sci. 2007, 48 (2): 509-515.
Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Sayers EW: GenBank. Nucleic Acids Res. 2011, 39 (Database issue): D32-37.
Lewis BP, Shih IH, Jones-Rhoades MW, Bartel DP, Burge CB: Prediction of mammalian microRNA targets. Cell. 2003, 115 (7): 787-798.
Kertesz M, Iovino N, Unnerstall U, Gaul U, Segal E: The role of site accessibility in microRNA target recognition. Nat Genet. 2007, 39 (10): 1278-1284.
Berezikov E, Robine N, Samsonova A, Westholm JO, Naqvi A, Hung JH, Okamura K, Dai Q, Bortolamiol-Becet D, Martin R, et al: Deep annotation of Drosophila melanogaster microRNAs yields insights into their processing, modification, and emergence. Genome Res. 2011, 21 (2): 203-215.
Guo H, Ingolia NT, Weissman JS, Bartel DP: Mammalian microRNAs predominantly act to decrease target mRNA levels. Nature. 2010, 466 (7308): 835-840.
Sun G, Yan J, Noltner K, Feng J, Li H, Sarkis DA, Sommer SS, Rossi JJ: SNPs in human miRNA genes affect biogenesis and function. RNA. 2009, 15 (9): 1640-1651.
Duan R, Pak C, Jin P: Single nucleotide polymorphism associated with mature miR-125a alters the processing of pri-miRNA. Hum Mol Genet. 2007, 16 (9): 1124-1131.
Gong J, Tong Y, Zhang HM, Wang K, Hu T, Shan G, Sun J, Guo AY: Genome-wide identification of SNPs in microRNA genes and the SNP effects on microRNA target binding and biogenesis. Hum Mutat. 2012, 33 (1): 254-263.
Yan X, Ding L, Li Y, Zhang X, Liang Y, Sun X, Teng CB: Identification and profiling of microRNAs from skeletal muscle of the common carp. PLoS One. 2012, 7 (1): e30925-
Shi W, Hendrix D, Levine M, Haley B: A distinct class of small RNAs arises from pre-miRNA-proximal regions in a simple chordate. Nat Struct Mol Biol. 2009, 16 (2): 183-189.
Bortoluzzi S, Bisognin A, Biasiolo M, Guglielmelli P, Biamonte F, Norfo R, Manfredini R, Vannucchi AM: Characterization and discovery of novel miRNAs and moRNAs in JAK2V617F-mutated SET2 cells. Blood. 2012, 119 (13): e120-e130.
Morin RD, O’Connor MD, Griffith M, Kuchenbauer F, Delaney A, Prabhu AL, Zhao Y, McDonald H, Zeng T, Hirst M, et al: Application of massively parallel sequencing to microRNA profiling and discovery in human embryonic stem cells. Genome Res. 2008, 18 (4): 610-621.
Lu S, Sun YH, Chiang VL: Adenylation of plant miRNAs. Nucleic Acids Res. 2009, 37 (6): 1878-1885.
Bandyopadhyay S, Mitra R: TargetMiner: microRNA target prediction with systematic identification of tissue-specific negative examples. Bioinformatics. 2009, 25 (20): 2625-2631.
Sethupathy P, Corda B, Hatzigeorgiou AG: TarBase: A comprehensive database of experimentally supported animal microRNA targets. RNA. 2006, 12 (2): 192-197.
Su WL, Kleinhanz RR, Schadt EE: Characterizing the role of miRNAs within gene regulatory networks using integrative genomics techniques. Mol Syst Biol. 2011, 7: 490-
Laczny C, Leidinger P, Haas J, Ludwig N, Backes C, Gerasch A, Kaufmann M, Vogel B, Katus HA, Meder B, et al: miRTrail - a comprehensive webserver for analyzing gene and miRNA patterns to enhance the understanding of regulatory mechanisms in diseases. BMC Bioinforma. 2012, 13 (1): 36-
Jovanovic M, Reiter L, Picotti P, Lange V, Bogan E, Hurschler BA, Blenkiron C, Lehrbach NJ, Ding XC, Weiss M, et al: A quantitative targeted proteomics approach to validate predicted microRNA targets in C. elegans. Nat Methods. 2010, 7 (10): 837-842.
Stark A, Brennecke J, Russell RB, Cohen SM: Identification of Drosophila MicroRNA targets. PLoS Biol. 2003, 1 (3): E60-
Yang W, Chendrimada TP, Wang Q, Higuchi M, Seeburg PH, Shiekhattar R, Nishikura K: Modulation of microRNA processing and expression through RNA editing by ADAR deaminases. Nat Struct Mol Biol. 2006, 13 (1): 13-21.
Kawahara Y, Zinshteyn B, Sethupathy P, Iizasa H, Hatzigeorgiou AG, Nishikura K: Redirection of silencing targets by adenosine-to-inosine editing of miRNAs. Science. 2007, 315 (5815): 1137-1140.
Huson DH, Reinert K, Kravitz SA, Remington KA, Delcher AL, Dew IM, Flanigan M, Halpern AL, Lai Z, Mobarry CM, et al: Design of a compartmentalized shotgun assembler for the human genome. Bioinformatics. 2001, 17 (Suppl 1): S132-S139.
Kozomara A, Griffiths-Jones S: miRBase: integrating microRNA annotation and deep-sequencing data. Nucleic Acids Res. 2011, 39 (Database issue): D152-D157.
Luo Y, Zhang S: Computational prediction of amphioxus microRNA genes and their targets. Gene. 2009, 428 (1–2): 41-46.
Griffiths-Jones S, Bateman A, Marshall M, Khanna A, Eddy SR: Rfam: an RNA family database. Nucleic Acids Res. 2003, 31 (1): 439-441.
Common carp repeats library.http://genomics.cafs.ac.cn/repeatdb,
Li R, Li Y, Kristiansen K, Wang J: SOAP: short oligonucleotide alignment program. Bioinformatics. 2008, 24 (5): 713-714.
Hofacker IL: Vienna RNA secondary structure server. Nucleic Acids Res. 2003, 31 (13): 3429-3431.
The R Project for Statistical Computing.http://www.r-project.org/,
Li H, Ruan J, Durbin R: Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res. 2008, 18 (11): 1851-1858.
Cox MP, Peterson DA, Biggs PJ: SolexaQA: At-a-glance quality assessment of Illumina second-generation sequencing data. BMC Bioinforma. 2010, 11: 485-
Li H, Durbin R: Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009, 25 (14): 1754-1760.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R: The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009, 25 (16): 2078-2079.
Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22 (22): 4673-4680.
We thank Edanz group for manuscript editing. We are grateful to Anda Cheng, Baosen Wang, Jing Wu and Haixia Deng for technical support. This study was supported by the grants from Research Foundation of Chinese Academy of Fishery Sciences (No. 2011C016), China Ministry of Science and Technology 863 Hi-Tech Research and Development Program (No. 2011AA100401), Key Project of Chinese National Programs for Fundamental Research and Development (No. 2010CB126305) and National Department Public Benefit Research Foundation (No. 200903045). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
The authors declare that they have no competing interests.
YPZ, WX, JTW and JTL conducted the bioinformatic analysis. YMW, XP and ZY prepared the samples. SLW took charge of small RNAs. YPZ and JTL drafted the manuscript. JTL and XWS supervised the study and assisted in manuscript preparation. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 3: Table S1. Information of common carp miRNAs only identified by our homology-based prediction. (XLS 43 KB)
Additional file 4: Table S2. Information of common carp miRNAs identified by both our homology-based prediction and small RNA sequencing. (XLS 36 KB)
Addition file 13:Figure S6. The non-miRNA regions in five pre-miRNA loci had matched sRNA reads adjacent to miRNAs. (DOC 36 KB)
Additional file 18: Table S10. Primers designed for PCR-based validation SNPsof miRNA precursors or mRNA sequences. (DOC 36 KB)
About this article
Cite this article
Zhu, YP., Xue, W., Wang, JT. et al. Identification of common carp (Cyprinus carpio) microRNAs and microRNA-related SNPs. BMC Genomics 13, 413 (2012). https://doi.org/10.1186/1471-2164-13-413
- miRNA biogenesis
- Common carp