Identification and characterization of maize microRNAs involved in the very early stage of seed germination
BMC Genomics volume 12, Article number: 154 (2011)
MicroRNAs (miRNAs) are a new class of endogenous small RNAs that play essential regulatory roles in plant growth, development and stress response. Extensive studies of miRNAs have been performed in model plants such as rice, Arabidopsis thaliana and other plants. However, the number of miRNAs discovered in maize is relatively low and little is known about miRNAs involved in the very early stage during seed germination.
In this study, a small RNA library from maize seed 24 hours after imbibition was sequenced by the Solexa technology. A total of 11,338,273 reads were obtained. 1,047,447 total reads representing 431 unique sRNAs matched to known maize miRNAs. Further analysis confirmed the authenticity of 115 known miRNAs belonging to 24 miRNA families and the discovery of 167 novel miRNAs in maize. Both the known and the novel miRNAs were confirmed by sequencing of a second small RNA library constructed the same way as the one used in the first sequencing. We also found 10 miRNAs that had not been reported in maize, but had been reported in other plant species. All novel sequences had not been earlier described in other plant species. In addition, seven miRNA* sequences were also obtained. Putative targets for 106 novel miRNAs were successfully predicted. Our results indicated that miRNA-mediated gene expression regulation is present in maize imbibed seed.
This study led to the confirmation of the authenticity of 115 known miRNAs and the discovery of 167 novel miRNAs in maize. Identification of novel miRNAs resulted in significant enrichment of the repertoire of maize miRNAs and provided insights into miRNA regulation of genes expressed in imbibed seed.
In recent years, the discovery of numerous small RNAs has a great deal of interest in post-transcriptional gene expression regulation during development and other biological processes. Small RNAs (sRNA) include several kinds of short non-coding RNAs, such as microRNA (miRNA), small interfering RNA (siRNA), and Piwi-associated RNA (piRNA), which all regulate gene expression at the post-transcriptional level. The sRNA content of plant cells is surprisingly complex, suggesting an extensive regulatory role for these molecules . The best-characterized class of plant sRNAs is miRNAs . Typically, miRNAs are approximately 22 nucleotide small-RNA sequences that play key roles in many diverse biological processes, including development, viral defense, metabolism and apoptosis .
MicroRNAs (miRNAs) are generated from precursor RNA (pre-miRNA) with hairpin structures by DICER-LIKE 1 (DCL1) . DCL1 trims the hairpin structure (pre-miRNA), and then a further cleavage by the same enzyme releases the miRNA/miRNA* duplex . This duplex has a 2-nt 3-overhang at each side and contains a few mismatches . One of the strands of the generated miRNA/miRNA* duplex is incorporated into the RNA-induced silencing complex (RISC). This strand is usually the mature miRNA strand and the miRNA* strand gets degraded, although in some cases the miRNA* strand also accumulates at a lower level . The incorporated mature miRNA guides RISC to mRNAs containing a target site and RISC down-regulates the expression of the mRNA. The 'seed' region located at miRNA nucleotides 2-8 is the most important sequence for interaction with mRNA targets . In plants the target site shows near perfect complementarity to the miRNA sequence, and as a consequence most target mRNAs are cleaved by RISC, although there are examples where the translation of the mRNA is suppressed without a cleavage .
MicroRNAs' regulatory role has been exemplified by the critical regulatory behavior of miRNAs at key positions in a variety of pathways, such as root , shoot , leaf , flower development  and cell fate . Additionally, they also include responses to phytohormones , nutrient  and other environmental stresses [16–18]. Furthermore, the targets of several miRNAs are genes that play important roles in stress tolerance, including the gene encoding Cu/Zn SOD . MiR393 targets auxin receptor genes, such as TIR1, AFB2 and AFB3, which lower auxin signals and inhibit the pathogen P. syringae. MiRNAs are also induced by pathogens, which suggests miRNAs are involved in plant-microorganism interactions such as symbiosis events with legumes and rhizobia bacteria [21, 22]. Increasing evidence demonstrates that miRNAs might provide a novel platform to better understand plant development and resistance to biotic as well as abiotic stresses.
Currently, 14,197 mature miRNAs have been discovered and deposited in the public available miRNA database miRBase (Release 15.0, April 2010) . These miRNAs include 2,566 miRNAs from 37 plant species. The study of small RNAs in maize has been reported but compared with other crops such as rice the number of maize miRNAs identified so far is relatively small . The number of maize miRNAs is even smaller than that of Arabidopsis though maize genome size is much larger than that of Arabidopsis. To date, there are 170 maize miRNAs, 447 rice miRNAs and 199 Arabidopsis miRNAs in miRBase. The identification of a near complete set of small RNAs in any organism is of fundamental importance to understanding small RNA-mediated gene regulations and the diversity of small RNAs. It lays the necessary foundation for unavailing the complex small RNA-mediated regulatory networks. Maize is an obvious choice for high-throughput small RNA sequencing, because of its worldwide agricultural importance, besides being a C4 plant with a sequenced genome.
The most challenging problem in understanding plant miRNAs is to identify more novel miRNAs. Three major approaches have been used for miRNA discovery in plants: forward genetics, bioinformatics prediction as well as direct cloning and sequencing. Only a few miRNAs were identified by forward genetic studies and predicting species-specific miRNAs using bioinformatics method was difficult [25, 26]. Thus, direct cloning and sequencing is the most effective method for plant miRNA discovery. Recently, the deep sequencing technology has revolutionized small RNA discovery and more and more miRNAs have been identified. This study leads to the confirmation of the authenticity of 115 known miRNAs and the discovery of 167 novel ones in maize. Identification of novel miRNAs results in significant enrichment of the repertoire of maize miRNAs and provides insights into miRNA regulation of genes expressed in imbibed seed.
Results and Discussion
Deep sequencing of maize short RNAs
In order to identify the miRNAs involved in the very early stage of seed germination, a small RNA library from maize seed 24 hours after imbibition was sequenced by the Solexa technology. A total of 11,338,273 reads were obtained. After removing the low quality sequences and adapter sequences, 9,731,557 reads were obtained with 18-30nt in length (Additional file 1). We then further removed sequences that were read only once and 6,870,535 reads remained. Next, all Solexa reads were aligned against the Maize B73 RefGen_v2 (release 5a.57 in May, 2010) using SOAP and 5,791,874 reads were perfectly matched to the maize genome , representing 84.3% of the total reads. Around 5.08% of the distinct reads matched noncoding RNAs including rRNAs (2.81%), tRNAs (0.34%), snRNAs (0.06%), siRNAs (1.82%) and snoRNAs (0.05%), which accounted for 16.11% of the total reads (Table 1). All the sequences excluding noncoding RNAs were then regarded as miRNAs for further analysis.
In maize, the size of the small RNAs was not evenly distributed. Among these sequences shown in Figure 1, the number of 24nt sequences was significantly greater than other sequences and accounted for 33.43% of the total reads. This result was consistent with that of Medicago truncatula, rice , peanut  and Arabidopsis. However, the size distribution differed from wheat and conifer miRNAs obtained through 454 high-throughput sequencing [29, 32] and Chinese yew sequences obtained through Solexa technology . In conifer, the fraction of 24nt microRNAs was very small (2.6%) due to the lack of DCL3, the enzyme that matured 24nt RNAs in angiosperms [29, 34]. The next larger fractions were the 22nt (22.20%), 21nt (14.29%) and 20nt (7.87%) fractions, representing the typical length of plant mature miRNAs. Intriguingly, the fraction of 23nt miRNAs was very small (3.5%) compared to those of 21, 22 and 24nt fractions. The same phenomenon was also observed in peanut , cotton  and Medicago truncatula. As shown in figure 2, the abundance of miRNAs in our dataset varied drastically. Some were sequenced only a few times, whereas others were present thousands of times, indicating many physiological and biochemical processes are being carried out and maize seed contains a large and diverse small RNA population at the very early stage of seed germination.
Identification of conserved maize miRNAs
Conserved miRNA families are found in many plant species and have important functions in plant development. To identify conserved miRNAs in our dataset, all small RNA sequences were Blastn searched against the known maize mature miRNAs and their precursors in the miRNA database miRBase. There are currently 28 families containing 170 known miRNAs in miRBase. Blastn searches showed that 1,047,447 total reads representing 431 unique sRNAs matched to known maize miRNAs. Further analysis identified a total of 115 conserved miRNAs that belong to 24 miRNA families (Additional file 2). The identified miRNA families have been shown conserved in a variety of plant species. For example, miR156/157, miR159/319, miR166, miR169, and miR394 have been found in 51, 45, 41, 40 and 40 plant species, respectively [36–38]. Maize miRNA families displayed significantly varied abundance from each other. This varied abundance of the miRNA families suggested that miRNA genes would be differentially transcribed at this very early stage of seed germination. For example, the majority of maize miRNAs were only sequenced less than 1,000 times, and some rare miRNAs were detected less than 10 times, whereas zma-miR167a, zma-miR166a, and zma-miR156a were detected 27,634, 300,503 and 374,492 times respectively (Additional file 2). The abundance of zma-miR172 was extremely low compared to that of zma-miR156 in our dataset, which was consistent with previous finding that these two miRNAs are conversely regulated. In comparison to other plant species, tae-miR169b in wheat and osa-miR169 in rice are the most frequently sequenced miRNAs while miR156 in rice and wheat exhibits low abundance . This may suggest a species-specific expression profile for miRNAs. MiR156a was also found to be highly expressed in Medicago truncatula. In Arabidopsis, miR156a, located on chromosome 2, targets 10 mRNAs that code for the squamosa promoter-binding protein (SBP) box, which is involved in leaf morphogenesis [39, 40]. However, mechanisms causing the differential expression profile of a same miRNA in different plant species is unknown. Diversity of maize miRNAs also could be found in the aspect of the amount of members they contained (figure 3). The largest miRNA family size identified was miR166 that consisted of 14 members and miR156, miR169 and miR167 possessed 12, 12 and 10 members, respectively; whereas other miRNA families such as miR162, miR529, miR827 and miR1432 had only one member detected in this period. The size of miRNA families may be indicative of their function. Different family members also displayed drastically different expression levels (Additional file 2). For example, the abundance of miR156 family varied from 261 reads (zma-miR156j) to 409,637 reads (zma-miR156d) in the deep sequencing. This was also the case for some other miRNA families, such as zma-miR164 (from 14 read to 25,253 reads) and zma-miR166 (from 931 reads to 300,478 reads). Two members, zma-miR528a and zma-miR528b in zma-miR528 family, however, their expression levels were similar and were detected 147,619 and 158,200 times, respectively. The existence of a dominant member in a miRNA family may suggest that the regulatory role of this family was performed by the dominant member at the developmental time when the samples were collected for RNA extraction. Abundance comparisons of different members in one miRNA family may provide valuable information on the role that miRNAs play in that plant specific developmental stage. Four known miRNA families, miR395, miR482, miR2118 and miR2275 were not successfully detected in our datasets suggesting that miRNAs expression maybe developmental and/or tissue-specific. After our dataset was Blastn searched against known maize mature miRNAs, the same dataset was used to compare with 2392 known miRNAs from other diverse plant species. We found 10 additional conserved miRNAs that have not been reported in maize (Additional file 3). The 'seed' region, located at miRNA nucleotides 2-8, is the most important sequence for interaction with mRNA targets . The seed regions of the newly identified maize miRNAs t0002967, t0511822, t0207061, t0448353 and t0053880 were identical to those of ctr-miR171 and ctr-miR166, respectively, indicating that they may share the same targets.
Identification of novel maize miRNAs
Although the characteristic hairpin structure of miRNA precursor could be used to predict novel miRNA, it is very challenging to define novel miRNAs. We developed a prediction software Mireap to predict novel miRNA by exploring the secondary structure, the Dicer cleavage site and the minimum free energy of the unannotated small RNA tags which could be mapped to the maize genome. A small RNA is considered as a potential miRNA candidate only if it meets all of the following strict criterias: 1) the sequence could fold into an appropriate stemloop hairpin secondary structure, 2) the small RNA sits in one arm of the hairpin structure, 3) no more than 6 mismatches between the predicted mature miRNA sequence and its opposite miRNA* sequence in the secondary structure, 4) no loop or break in the miRNA or miRNA* sequences, and 5) predicted secondary structure has higher minimal folding free energy index and negative minimal folding free energy. 1068 sequences were obtained based on the above criteria. Although forming specific hairpin stem loop structures is one of the most important characteristics of pre-miRNAs, it is not unique to pre-miRNAs; lots of other coding or non-coding RNAs, such as rRNAs, tRNAs and mRNAs, also have the similar hairpin structures . Several studies observed that miRNA precursors have low folding free energy, and considered that low free energy is one important characteristic of miRNAs . However, minimal folding free energy depends on the lengths of RNAs  and the length of miRNA precursors significantly varies, for example, the lengths of plant miRNA precursors range from 60 to more than 400 nucleotides . To avoid the effect of using minimal folding free energy as a criterion to identify genuine miRNAs, the length of RNAs must be considered. To better distinguish miRNAs from other RNAs, Zhang et al.  combined several parameters to form a new criterion called minimal folding free energy index (MFEI) . Pre-miRNAs have high minimal folding free energy index (MFEI) . They found that the average MFEI of miRNA precursors is 0.97 in previously known plant pre-miRNAs, and this value is significantly higher than that for tRNAs (0.64), rRNAs (0.59), and mRNAs (0.62-0.66). More importantly, more than 90% of miRNA precursors have an MFEI greater than 0.85, and no other RNAs have MFEI higher than 0.85. This suggests that MFEI is useful to distinguish miRNAs from other non-coding and coding RNAs. Their results suggest that RNA sequences with MFEI larger than 0.85 are most likely to be miRNAs . This finding provids a more precision criterion to predict miRNAs using computational and/or experimental approaches. Out of the 1068 miRNA candidates, 386 had MFEI greater than 0.85.
In order to make certain that the 386 miRNA candidates we identified are true miRNAs, we constructed and sequenced a second small RNA library from the same tissue. Following the same analysis approach as that used for the first library, 362 sequences with MFEI greater than 0.85 were identified. By comparing these 386 and 362 distinct reads, we found that 167 of them were identical between the two small RNA libraries in terms of their precursor sequences, mature miRNA sequences, and their chromosomal locations. Further more, no significant differences of their expression profiles existed between the two experiments. We believed that these 167 sequences were most likely true novel miRNAs. The stringent criteria used to predict novel miRNAs could potentially reduce false positive rates at the cost of missing authentic miRNAs.
The 167 novel miRNAs can be classified into 77 families (Additional file 4) and their pre-miRNAs, secondary structures, and chromosomal locations were listed in Additional file 5. These novel miRNA precursors had negative folding free energies (20.8-210 kcal mol-1 with an average of about -68.8 kcal mol-1) according to Mfold3.2 (Additional file 4) ; this was similar to the computational values of Arabidopsis thaliana miRNA precursors (-57 kcal mol-1) and much lower than folding free energies of tRNA (-27.5 kcal mol-1) or rRNA (-33 kcal mol-1) . Previous study indicated that animal miRNA precursors typically have 70-80 nucleotides, but plant miRNA precursors are more diverse in structure and size. They vary in size from 60 to 509 nucleotides, with an average of 144.6 ± 56.9 (n = 513); most (73.5%) of the detected miRNAs have 81-160 nucleotides. Only 1.6% of plant miRNAs are less than 81 nucleotides in length, a stark contrast to animal miRNAs . In our research as shown in figure 4, the foldback precursors of 167 novel miRNAs were about 67-356 nucleotides in length, and about 71.9% with 81-160 nucleotides. The novel pre-miRNAs were also evaluated for their A+U content, which ranged from 34% to 69.01% (Additional file 4), in agreement with previous studies. We also looked for sequenced miRNA* sequences, only seven complementary sequences were found in our combined data sets (Additional file 4). Most miRNA* shows weak expression (sequencing frequency <10) and their expression levels are much lower than their corresponding miRNAs, consistent with the idea that miRNA* strands are degraded rapidly during the biogenesis of mature miRNAs . It may also be the fact that the expression levels of the majority of the novel miRNAs identified were low (the majority of them were sequenced less than 50 times).
Target prediction of maize miRNAs
As dry seeds imbibe water, the resumption of energy metabolism and cellular repair occur. Later, events such as the activation of genes encoding enzymes involved starch degradation and protein and DNA/RNA synthesis play critical roles in the decision as to whether a seed would germinate or not. The shift from the seed development/maturation mode to the germination mode is a critical change in the developmental program of seed. Regulation of transcription factors targeted by miRNAs is involved at this critical stage in plant development . Our target prediction criteria and methods were also stringent, but still allowed us to capture most miRNA targets that are conserved across several plant species, including Arabidopsis, poplar , rice , wheat , soybean , mustard  and grape . The majority of conserved miRNA targets are various transcriptional factors including SBP, MYB, ARF, NAM, CBF, TCP and GRF that are known to regulate plant development. Other conserved miRNA targets includes F-box protein (miRNA393, miRNA394), ATP sulfurylase (miRNA395), CCHC type zinc finger protein (miRNA482), NAD(P)-binding protein (miRNA827), and Poly(ADP-ribose) polymerase (miRNA1432), all of them are known to play roles in the expression control of genes involved in regulation of metabolic processes. In our datasets, miRNA166 showed the highest abundance followed by miRNA156 and miRNA528, respectively, during the very early stage of seed germination. Previous studies indicated that MiRNA166 targets HD-ZIP transcription factors that are involved in plant leaf morphogenesis. HD-ZIP proteins also regulates vascular development as well as lateral organ polarity and meristem formation. ATHB15, a member of the HD-ZIP family, is predominantly expressed in vascular tissues, suggesting that it may play some roles in plant vascular development [54, 55]. Overexpression of miR166a results in decreasing ATHB15 mRNA levels and causes accelerated vascular cell differentiation from cambial/procambial cells and consequently produces an altered vascular system with expanded xylem tissue and an interfascicular region . This regulation mechanism may exist in all vascular plant species [55, 56]. MiRNA156 has been shown to be involved in floral development and phase change by targeting members of squamosa promoter binding protein like (SPL) plant-specific transcription factors. The SPL family has 16 members; some (such as SPL3) are involved in floral transition and regulating plant flowering . Recent results indicated that overexpression of miR156 affects phase transition from vegetative growth to reproductive growth, including the quickly initiation of rosette leaves, a severe decrease in apical dominance, and a moderate delay in flowering . MiRNA528 targets copper proteins, cupredoxin, multicopper oxidase and laccase genes and thus might play a critical role in regulating physiological processes and stress responses. Not only the miRNA166 and miRNA156 families were abundant during this stage of seed germination, but also they had more family members than other miRNA families, suggesting the importance of these two miRNA families at this very early stage of seed germination. In Arobidopsis, MiRNA159 has been shown to be involved in the regulation of seed dormancy and germination by targeting MYB33 and MYB101, two positive regulators of ABA responses during germination. ABA is a key regulator of seed maturation and dormancy . Many ABA signal transduction proteins are involved in seed development and germination [59–62]. The sensitivity of seeds to ABA that is vital to the termination of seed maturation program, an essential change to increase the competence of seeds for germination, is regulated by conserved miRNA160. Since there is no dormancy in maize seed, the abundance of both miRNA159 and miRNA160 is extremely low compared to that of miRNA166 and miRNA156 families in our datasets. AUXIN RESPONSIVE FACTORs (ARFs) are a class of targets of miRNA160 families. ARFs are important components of auxin signal transduction . Therefore, there is cross-talk between ABA and auxin in imbibed mature seeds. Studies has indicated that ABA-responsive genes that are typical of seed maturation stages and have ABA response elements (ABREs) in their promoter regions are specifically up-regulated in the miRNA-resistant mARF10 seeds. The down-regulation of a component important for auxin signal transduction by miRNA may be a regulatory step to decrease ABA sensitivity in mature seeds and to switch to the germination mode. The mechanisms involved in ABA-auxin cross-talk during seed germination are unknown.
To better understand the functions of the newly identified novel miRNAs in maize, putative targets of the 167 novel miRNAs were predicted. The target genes for 106 novel miRNAs were successfully predicted (Additional file 6). Analysis and annotation of the predicted target genes showed that they were with diverse functions, ranging from genes encoding transcription factors involved in transcription regulation to genes encoding enzymes involved in metabolism, genes regulating transport, genes encoding various kinases, genes regulating oxidative reduction and genes encoding isomarase and helicase (Additional file 6). When dry seeds absorb water, many cellular processes resume. Isomarase and helicase are important enzymes for DNA replication, transcription, translation, recombination, DNA repair, ribosome biogenesis. Most miRNA families have multiple target sites, suggesting that these miRNAs are functionally divergent. With 61 newly identified miRNAs, we failed to discover any targets for them in Maize. This could have resulted from incomplete coverage of the mRNA in the database. It is likely that a number of mRNAs could not be identified because they are poorly expressed or highly unstable, or because their expression is restricted to times and locations such that isolation of sufficient amounts of RNA for cloning is impractical or has not been done yet. Further analysis for their targets is needed and would help us gain insight into the roles these newly identified miRNAs play during maize seed imbibition.
We had sequenced two independent small RNA libraries from maize imbibed seed. Our data confirmed the authenticity of 115 known miRNAs in maize. We found 10 miRNAs that had not been reported in maize, but had been reported in other plant species. We also found 167 novel miRNAs that had not been reported elsewhere. Putative targets for 106 novel miRNAs were predicted. Dry seeds imbibe water and re-initiate active physiology. An important decision as to whether a seed would germinate or not is made following the reactivation events during imbibition. Regulation of genes targeted by miRNAs is involved at this critical stage in plant development. Identification of novel miRNAs resulted in significant enrichment of the repertoire of maize miRNAs and provided insights into miRNA regulation of genes expressed in imbibed seed.
RNA isolation and cloning of maize small RNAs
Maize (Zea mays) inbred line 87-1 was used in this study. Seeds were sterilized, wrapped in paper towels and incubated at 25°C for 24 hours. Embryos were then cut out and used for RNA extraction. Briefly, total RNA was isolated using Trizol kit (Invitrogen, USA). Small RNAs were enriched by poly-ethylene glycol precipitation, separated on 15% denaturing PAGE, and visualized by SYBR-gold staining. Small RNAs of 16-28nt were gel-purified. Small RNAs were ligated to a 5'adaptor and a 3'adaptor sequentially, reverse-transcription polymerase chain reaction (RT-PCR) amplified, and used for sequencing directly . Sequencing was performed on a Solexa machine (Beijing Genomics Institute, China).
Identification of conserved and novel miRNAs
The raw sequences were processed using PHRED and CROSS MATCH programs as previously reported [18, 65]. After removing the vector sequences, trimmed sequences longer than 18 nt were used for further analyses. First, rRNA, tRNA, snRNA, and snoRNA, as well as those containing the polyA tail, were removed from the small RNA sequences and the remaining sequences were compared against maize ncRNAs deposited in the NCBI Genbank database and Rfam database. Then, the unique small RNA sequences were used to do a Blastn search against the miRNA database, miRBase 15.0, in order to identify conserved miRNAs in maize. Only those small RNAs whose mature and precursor sequences perfectly matched known maize miRNAs in miRBase 15.0 were considered to be conserved miRNAs.
To discover potential novel miRNA precursor sequences in our dataset, we used the identified mature miRNA sequences to do Blastn searches against maize genomic sequence. Sequences that met previously described criteria were then considered to be miRNA precursors . Specifically, dominant, mature sequences residing in the stem region of the stem-loop structure and ranging between 20-22 nt with a maximum free-folding energy of -20 kcal mol-1were considered. A maximum of six unpaired nucleotides between the miRNA and miRNA* was allowed. The distance between the miRNA and miRNA* ranged between 5 and 240-nt. The selected sequences were then folded into a secondary structure using an RNA-folding program mFold3.2. If a perfect stem-loop structure was formed, the small RNA sequence was sit at one arm of the stem as well as other criteria were followed, this small RNA was consisted as one potential novel maize miRNA candidate. Previous study indicated that more than 90% of miRNA precursors had an MFEI greater than 0.85, and no other RNAs had MFEI higher than 0.85 (MFEI = [(MFE/length of the RNA sequence) × 100]/(G+C)% ). This suggested that MFEI is useful to distinguish miRNAs from other non-coding and coding sRNAs. The MFEI was calculated and potential novel maize miRNA candidates were further screened.
Target gene prediction
In order to predict the target genes of novel miRNAs, we used the Mireap method for target prediction [68, 69]. Briefly, the criteria were as follows: 1) No more than four mismatches between sRNA & target (G-U bases count as 0.5 mismatches), 2) No more than two adjacent mismatches in the miRNA/target duplex, 3) No adjacent mismatches in positions 2-12 of the miRNA/target duplex (5' of miRNA), 4) No mismatches in positions 10-11 of miRNA/target duplex, 5) No more than 2.5 mismatches in positions 1-12 of the of the miRNA/target duplex (5' of miRNA), and 6) Minimum free energy (MFE) of the miRNA/target duplex should be > = 75% of the MFE of the miRNA bound to it's perfect complement. Target mRNA sequences were predicted follow the criteria for the identified novel miRNAs (see Additional file 4). More strictly, at most three mismatches between miRNA sequences and potential mRNA targets were allowed in this study and grouped by the biological function of the proteins they encode for, as described by UniProt (http://www.uniprot.org/).
Analysis of sequencing data
Raw sequence reads were produced by the Illumina 1G Genome Analyzer at BGI-Shenzhen, China and processed into clean full length reads by the BGI small RNA pipeline. During this procedure all low quality reads, including 3' adapter reads and 5' adapter contaminants were removed. The remaining high quality sequences were trimmed of their adapter sequences and sequences larger than 30nt and smaller than 18nt were discarded. All high quality sequences, even those with only a single unique read, were considered as significant and further analyzed. Unique small RNA sequences were mapped to maize genome (B73 RefGen_v2 (release 5a.57 in May, 2010)) reference sequences by SOAP . Small RNAs derived from rRNAs, tRNAs, snRNAs and snoRNAs deposited at the Rfam and NCBI GenBank databases http://www.ncbi.nlm.nih.gov/Ftp/ were identified by NCBI blast. In order to determine conserved miRNAs, unique sequences were aligned with known maize miRNAs from miRBase (Release15.0, Apil, 2010) with a maximum of two mismatches, where gaps count as mismatches. Potential novel miRNAs were identified by folding the flanking genome sequence of unique small RNAs using MIREAP (https://sourceforge.net/projects/mireap/), followed by the prediction of the secondary structure by mFold 3.2 . The essential criteria were used for selecting the miRNA candidates, e.g. sequences of miRNA precursors can fold into a hairpin secondary structure that contains the ~21nt mature miRNA sequence from one arm and miRNA*derived from the opposite arm, both of which form a duplex with two nucleotide, 3' overhangs . The filtered small RNA sequencing data were deposited in the National Center for Biotechnology Information Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/projects/geo/) under accession number GSE27664.
For prediction of miRNA targets, the procedure and criteria were followed as described previously [73, 74]. More strictly, at most three mismatches between miRNA sequences and potential mRNA targets were allowed in this study. The biological function of the predicted targets was retrieved from the UniProt.
Lu C, Tej SS, Luo S, Haudenschild CD, Meyers BC, Green PJ: Elucidation of the small RNA component of the transcriptome. Science. 2005, 309: 1567-1569. 10.1126/science.1114112.
Jones-Rhoades MW, Bartel DP, Bartel B: MicroRNAS and their regulatory roles in plants. Annu Rev Plant Biol. 2006, 57: 19-53. 10.1146/annurev.arplant.57.032905.105218.
Bartel DP: MicroRNAs: genomics, biogenesis, mechanism, and function. Cell. 2004, 116: 281-297. 10.1016/S0092-8674(04)00045-5.
Reinhart BJ, Weinstein EG, Rhoades MW, Bartel B, Bartel DP: MicroRNAs in plants. Genes & Dev. 2002, 16: 1616-1626.
Kurihara Y, Watanabe Y: Arabidopsis micro-RNA biogenesis through Dicer-like 1 protein functions. Proc Natl Acad Sci. 2004, 101: 12753-12758. 10.1073/pnas.0403115101.
Jones-Rhoades MW, Bartel DP, Bartel B: MicroRNAS and their regulatory roles in plants. Annu Rev Plant Biol. 2006, 57: 19-53. 10.1146/annurev.arplant.57.032905.105218.
Brennecke J, Stark A, Russell RB, Cohen SM: Principles of microRNA-target recognition. PLoS Biol. 2005, 3: e85-10.1371/journal.pbio.0030085.
Chen X: A microRNA as a translational repressor of APETALA2 in Arabidopsis flower development. Science. 2004, 303: 2022-2025. 10.1126/science.1088060.
Wang JW, Wang LJ, Mao YB, Cai WJ, Xue HW, Cen XY: Control of root cap formation by MicroRNA-targeted auxin response factors in Arabidopsis. Plant Cell. 2005, 17: 2204-2216. 10.1105/tpc.105.033076.
Golz JF: Signalling between the shoot apical meristem and developing lateral organs. Plant Mol Biol. 2006, 60: 889-903. 10.1007/s11103-005-1270-y.
Kidner CA, Martienssen RA: Spatially restricted microRNA directs leaf polarity through ARGONAUTE1. Nature. 2004, 428: 81-84. 10.1038/nature02366.
Mallory AC, Dugas DV, Bartel DP, Bartel B: MicroRNA regulation of NAC-domain targets is required for proper formation and separation of adjacent embryonic, vegetative, and floral organs. Curr Biol. 2004, 14: 1035-1046. 10.1016/j.cub.2004.06.022.
Carraro N, Peaucelle A, Laufs P, Traas J: Cell differentiation and organ initiation at the shoot apical meristem. Plant Mol Biol. 2006, 60: 811-826. 10.1007/s11103-005-2761-6.
Liu Q, Zhang YC, Wang CY, Luo YC, Huang QJ: Expression analysis of phytohormone-regulated microRNAs in rice, implying their regulation roles in plant hormone signaling. FEBS Lett. 2009, 583: 723-728. 10.1016/j.febslet.2009.01.020.
Fujii H, Chiou TJ, Lin SI, Aung K, Zhu JK: A miRNA involved in phosphate-starvation response in Arabidopsis. Curr Biol. 2005, 15: 2038-2043. 10.1016/j.cub.2005.10.016.
Zhao B, Liang R, Ge L, Li W, Xiao H, Lin H, Ruan K, Jin Y: Identification of drought-induced microRNAs in rice. Biochem Biophys Res Commun. 2007, 354: 585-590. 10.1016/j.bbrc.2007.01.022.
Lu S, Sun YH, Chiang VL: Stress-responsive microRNAs in Populus. Plant J. 2008, 55: 131-151. 10.1111/j.1365-313X.2008.03497.x.
Sunkar R, Zhu JK: Novel and stress-regulated microRNAs and other small RNAs from Arabidopsis. Plant Cell. 2004, 16 (8): 2001-2019. 10.1105/tpc.104.022830.
Jones-Rhoades MW, Bartel DP: Computational identification of plant microRNAs and their targets, including a stress-induced miRNA. Molecular Cell. 2004, 14 (6): 787-799. 10.1016/j.molcel.2004.05.027.
Sunkar R, Kapoor A, Zhu JK: Posttranscriptional induction of two Cu/Zn superoxide dismutase genes in Arabidopsis is mediated by downregulation of miR398 and important for oxidative stress tolerance. Plant Cell. 2006, 18 (8): 2051-2065. 10.1105/tpc.106.041673.
Navarro L, Dunoyer P, Jay F, Arnold B, Dharmasiri N, Estelle M, Voinnet O, Jones JDG: A plant miRNA contributes to antibacterial resistance by repressing auxin signaling. Science. 2006, 312 (5772): 436-439. 10.1126/science.1126088.
Katiyar-Agarwal S, Gao S, Vivian-Smith A, Jin H: A novel class of bacteria-induced small RNAs in Arabidopsis. Genes & Development. 2007, 21 (23): 3123-3134.
Griffiths-Jones S, Saini HK, van Dongen S, Enright AJ: miRBase: tools for microRNA genomics. Nucleic Acids Research. 2008, 36: D154-D158. 10.1093/nar/gkm952.
Zhang L, Chia J, Kumari S, Stein JC, Liu Z, Narechania A, Maher CA, Guill K, McMullen MD, Ware D: A Genome-Wide Characterization of MicroRNA Genes in Maize. PLoS Genet. 2009, 5 (11): e1000716-10.1371/journal.pgen.1000716.
Aukerman MJ, Sakai H: Regulation of flowering time and floral organ identity by a microRNA and its APETALA2-like target genes. Plant Cell. 2003, 15: 2730-2741. 10.1105/tpc.016238.
Palatnik JF, Allen E, Wu XL, Schommer C, Schwab R, Carrington JC, Weigel D: Control of leaf morphogenesis by microRNAs. Nature. 2003, 425: 257-263. 10.1038/nature01958.
Li R, Li Y, Kristiansen K, Wang J: SOAP: Short Oligonucleotide Alignment Program. Bioinformatics. 2008, 24: 713-714. 10.1093/bioinformatics/btn025.
Szittya G, Moxon S, Santos DM, Jing R, Fevereiro MPS, Moulton V, Dalmay T: High-throughput sequencing of Medicago truncatula short RNAs identifies eight new miRNA families. Bmc Genomics. 2008
Morin RD, Aksay G, Dolgosheina E, Ebhardt HA, Magrini V, Mardis ER, Sahinalp SC, Unrau PJ: Comparative analysis of the small RNA transcriptomes of Pinus contorta and Oryza sativa. Genome Research. 2008, 18 (4): 571-584. 10.1101/gr.6897308.
Zhao CZ, Xiao H, Frazier TP, Yao YY, Bi YP, Li AQ, Li MG, Li CS, Zhang BH, Wang XJ: Deep sequencing identifies novel and conserved microRNAs in peanuts (Arachis hypogaea L.). BMC Plant Biology. 2010, 10: 3-10.1186/1471-2229-10-3.
Rajagopalan R, Vaucheret H, Trejo J, Bartel DP: A diverse and evolutionarily fluid set of microRNAs in Arabidopsis thaliana. Genes & Development. 2006, 20 (24): 3407-3425.
Yao YY, Guo GG, Ni ZF, Sunkar R, Du JK, Zhu JK, Sun QX: Cloning and characterization of microRNAs from wheat (Triticum aestivum L.). Genome Biology. 2007, 8 (6): 10.1186/gb-2007-8-6-r96.
Qiu D, Pan X, Wilson IW, Ketchum REB, Li F, Liu M, Teng W, Zhang BH: High throughput sequencing technology reveals that the taxoid elicitor methyl jasmonate regulates microRNA expression in Chinese yew (Taxus chinensis). Gene. 2009, 436 (1-2): 37-44. 10.1016/j.gene.2009.01.006.
Dolgosheina EV, Morin RD, Aksay G, Sahinalp SC, Magrini V, Mardis ER, Mattsson J, Unrau PJ: Conifers have a unique small RNA silencing signature. RNA. 2008, 14 (8): 1508-1515. 10.1261/rna.1052008.
Pieter BK, Wang QQ, Chen XS, Qiu ChX, Yang ZM: Enrichment of a set of microRNAs during the cotton fiber development. BMC Genomics. 2009, 10: 457-10.1186/1471-2164-10-457.
Zhang BH, Pan XP, Cannon CH, Cobb GP, Anderson TA: Conservation and divergence of plant microRNA genes. Plant Journal. 2006, 46 (2): 243-259. 10.1111/j.1365-313X.2006.02697.x.
Zhang BH, Pan XP, Wang QL, Cobb GP, Anderson TA: Identification and characterization of new plant microRNAs using EST analysis. Cell Research. 2005, 15 (5): 336-360. 10.1038/sj.cr.7290302.
Sunkar R, Jagadeeswaran G: In silico identification of conserved microRNAs in large number of diverse plant species. BMC Plant Biology. 2008, 8: 13-10.1186/1471-2229-8-13.
Reinhart BJ, Weinstein EG, Rhoades MW, Bartel B, Bartel DP: MicroRNAs in plants. Genes & Development. 2002, 16 (13): 1616-1626.
Rhoades MW, Reinhart BJ, Lim LP, Burge CB, Bartel B, Bartel DP: Prediction of plant microRNA targets. Cell. 2002, 110 (4): 513-520. 10.1016/S0092-8674(02)00863-2.
Zhang BH, Pan XP, Cox SB, Cobb GP, Anderson TA: Evidence that miRNAs are different from other RNAs. Cell Mol Life Sci. 2006, 63: 246-254. 10.1007/s00018-005-5467-7.
Bonnet E, Wuyts J, Rouze P, Van de Peer Y: Evidence that microRNA precursors, unlike other non-coding RNAs, have lower folding free energies than random sequences. Bioinformatics. 2004, 20: 2911-2917. 10.1093/bioinformatics/bth374.
Seffens W, Digby D: mRNAs have greater negative folding free energies than shuffied or codon choice randomized sequences. Nucleic Acids Res. 1999, 27: 1578-1584. 10.1093/nar/27.7.1578.
Mathews DH, Sabina J, Zuker M, Turner DH: Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. J Mol Biol. 1999, 288: 911-940. 10.1006/jmbi.1999.2700.
Zhang B, Pan X, Wang Q, Cobb GP, Anderson TA: Computational identification of microRNAs and their targets. Computational Biology and Chemistry. 2006, 30: 395-407. 10.1016/j.compbiolchem.2006.08.006.
Rajagopalan R, Vaucheret H, Trejo J, Bartel DP: A diverse and evolutionarily fluid set of microRNAs in Arabidopsis thaliana. Genes & Dev. 2006, 20: 3407-3425.
Martin RC, Liu PP, Goloviznina NA, Nonogaki H: MicroRNA, seeds, and Darwin: diverse function of miRNA in seed biology and plant responses to stress. J Exp Bot. 2010, 61: 2229-2234. 10.1093/jxb/erq063.
Adai A, Johnson C, Mlotshwa S, Archer-Evans S, Manocha V, Archer-Evans S, Vance V, Sundaresan V: Computational prediction of miRNAs in Arabidopsis thaliana. Genome Res. 2005, 15: 78-91. 10.1101/gr.2908205.
Sunkar R, Girke T, Zhu JK: Identification and characterization of endogenous small interfering RNAs from rice. Nucleic Acids Res. 2005, 33: 4443-4454. 10.1093/nar/gki758.
Jin W, Li N, Zhang B, Wu F, Li W: Identification and verification of microRNA in wheat (Triticum aestivum). J Plant Res. 2008, 121: 351-355. 10.1007/s10265-007-0139-3.
Subramanian S, Fu Y, Sunkar R, Barbazuk WB, Zhu JK: Novel and nodulation-regulated microRNAs in soybean roots. BMC Genomics. 2008, 9: 160-10.1186/1471-2164-9-160.
Xie FL, Huang SQ, Guo K, Xiang AL, Zhu YY: Computational identification of novel microRNAs and targets in Brassica napus. FEBS Lett. 2007, 581: 1464-1474. 10.1016/j.febslet.2007.02.074.
Carra A, Mica E, Gambino G, Pindo M, Moser C: Cloning and characterization of small non-coding RNAs from grape. Plant J. 2009
Ohashi-Ito K, Fukuda H: HD-Zip III homeobox genes that Include a novel member, ZeHB-13 (Zinnia)/ATHB-15 (Arabidopsis), are involved in procambium and xylem cell differentiation. Plant Cell Physiol. 2003, 44: 1350-1358. 10.1093/pcp/pcg164.
Kim J, Jung JH, Reyes JL, Kim YS, Kim SY, Chung KS, Kim JA, Lee M, Lee Y, Narry Kim V, Chua NH, Park CM: MicroRNA-directed cleavage of ATHB15 mRNA regulates vascular development in Arabidopsis inflorescence stems. Plant J. 2005, 42 (1): 84-94. 10.1111/j.1365-313X.2005.02354.x.
Floyd SK, Bowman JL: Gene regulation: ancient microRNA target sequences in plants. Nature. 2004, 428 (6982): 485-6. 10.1038/428485a.
Cardon GH, Höhmann S, Nettesheim K, Saedler H, Huijser P: Functional analysis of the Arabidopsis thaliana SBP-box gene SPL3: a novel gene involved in the floral transition. Plant J. 1997, 12 (2): 367-77. 10.1046/j.1365-313X.1997.12020367.x.
Schwab R, Palatnik JF, Riester M, Schommer C, Schmid M, Weigel D: Specific effects of microRNAs on the plant transcriptome. Dev Cell. 2005, 8 (4): 517-27. 10.1016/j.devcel.2005.01.018.
Finkelstein R, Reeves W, Ariizumi T, Steber C: Molecular aspects of seed dormancy. Annu Rev Plant Biol. 2008, 59: 387-415. 10.1146/annurev.arplant.59.032607.092740.
Nakashima K, Fujita Y, Kanamori N, Katagiri T, Umezawa T, Kidokoro S: Three Arabidopsis SnRK2 protein kinases, SRK2D/SnRK2.2, SRK2E/SnRK2.6/OST1 and SRK2I/SnRK2.3, involved in ABA signaling are essential for the control of seed development and dormancy. Plant Cell Physiol. 2009, 50: 1345-1363. 10.1093/pcp/pcp083.
Yamagishi K, Tatematsu K, Yano R, Preston J, Kitamura S, Takahashi H: CHOTTO1, a double AP2 domain protein of Arabidopsis thaliana, regulates germination and seedling growth under excess supply of glucose and nitrate. Plant Cell Physiol. 2009, 50: 330-340. 10.1093/pcp/pcn201.
Kinoshita N, Berr A, Belin C, Chappuis R, Nishizawa NK, Lopez-Molina L: Identification of growth insensitive to ABA3 (gia3), a recessive mutation affecting ABA signaling for the control of early post-germination growth in Arabidopsis thaliana. Plant Cell Physiol. 2010, 51: 239-251. 10.1093/pcp/pcp183.
Guilfoyle TJ, Hagen G: Auxin response factors. Curr Opin Plant Biol. 2007, 10: 453-460. 10.1016/j.pbi.2007.08.014.
Lau : MicroRNA and siRNA Cloning Protocol. Science. 2001, 294: 858-6. 10.1126/science.1065062.
Sunkar R, Girke T, Zhu JK: Identification and characterization of endogenous small interfering RNAs from rice. Nucleic Acids Research. 2005, 33 (14): 4443-4454. 10.1093/nar/gki758.
Ambros V, Bartel B, Bartel DP, Burge CB, Carrington JC, Chen XM, Dreyfuss G, Eddy SR, Griffiths-Jones S, Marshall M: A uniform system for microRNA annotation. RNA. 2003, 9 (3): 277-279. 10.1261/rna.2183803.
Yin Z, Li C, Han X, Shen F: Identification of conserved microRNAs and their target genes in tomato (Lycopersicon esculentum). Gene. 2008, 414: 60-66. 10.1016/j.gene.2008.02.007.
Allen E, Xie ZX, Gustafson AM, Carrington JC: microRNA-Directed Phasing during Trans-Acting siRNA Biogenesis in Plants. Cell. 2005, 121: 207-221. 10.1016/j.cell.2005.04.004.
Schwab R, Palatnik JF, Riester M, Schommer C, Schmid M, Weigel D: Specific Effects of MicroRNAs on the Plant Transcriptome. Developmental Cell. 2005, 8: 517-527. 10.1016/j.devcel.2005.01.018.
Li R, Li Y, Kristiansen K, Wang J: SOAP: short oligonucleotide alignment program. Bioinformatics. 2008, 24: 713-714. 10.1093/bioinformatics/btn025.
Mathews DH, Sabina J, Zuker M, Turner DH: Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. J Mol Biol. 1999, 288: 911-940. 10.1006/jmbi.1999.2700.
Meyers BC, Axtell MJ, Bartel B, Bartel DP, Baulcombe D, Bowman JL, Cao X, Carrington JC, Chen X, Green PJ, Griffiths-Jones S, Jacobsen SE, Mallory AC, Martienssen RA, Poethig RS, Qi Y, Vaucheret H, Voinnet O, Watanabe Y, Weigel D, Zhu JK: Criteria for annotation of plant microRNAs. Plant Cell. 2008, 20: 3186-90. 10.1105/tpc.108.064311.
Qiu CX, Xie FL, Zhu YY, Guo K, Huang SQ, Nie L, Yang ZM: Computational identification of microRNAs and their targets in Gossypium hirsutum expressed sequence tags. Gene. 2007, 395: 49-61. 10.1016/j.gene.2007.01.034.
Schwab R, Palatnik JF, Riester M, Schommer C, Schmid M, Weigel D: Specific effects of microRNAs on the plant transcriptome. Dev Cell. 2005, 8: 517-527. 10.1016/j.devcel.2005.01.018.
The authors thank Dr. Jihua Tang for kindly providing maize inbred line 87-1. This work was supported by the National Science Foundation of China (31071424) and by the Shandong Pivotal Project for Elite Agricultural Variety Development.
LW participated in the data analysis and drafted the manuscript. HL participated in the design of the study and performed the miRNA targets prediction. DL carried out RNA extraction and sequence data analysis. HC conceived of the study, participated in its design and coordination, revised the manuscript and gave final approval of the version to be published. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 2:Conserved miRNAs in maize. These miRNAs are conserved in maize and have been reported in miRBase. "+"present in our dataset, "-"absent in our dataset. L length. **: miRNA sequences of maize were identical to those in other species; *: maize miRNA sequences were conserved in other species but have variations in some nucleotide positions. (XLS 68 KB)
Additional file 5:Secondary structures of novel miRNA in maize. Red colored letter: mature miRNA sequence; blue colored letter: miRNA* sequence. (DOC 308 KB)
Additional file 6:Predicted targets for novel miRNAs. MiRNAs targeting the same gene and site are grouped together. The first listed miRNA has the least mismatches. "Mm" stands for the total amount of mismatches between the first mentioned miRNA and the predicted target. "Alignment" visually represents miRNA/mRNA complementary base-pairs and mismatches for the first listed miRNA, with vertical bars and spaces as Watson-Crick base-pairs and mismatches, respectively (G:U wobbles count as mismatches). (XLS 86 KB)
About this article
Cite this article
Wang, L., Liu, H., Li, D. et al. Identification and characterization of maize microRNAs involved in the very early stage of seed germination. BMC Genomics 12, 154 (2011). https://doi.org/10.1186/1471-2164-12-154