- Research article
- Open Access
Small RNA and PARE sequencing in flower bud reveal the involvement of sRNAs in endodormancy release of Japanese pear (Pyrus pyrifolia 'Kosui')
BMC Genomicsvolume 17, Article number: 230 (2016)
In woody perennial plants, including deciduous fruit trees, such as pear, endodormancy is a strategy for surviving the cold winter. A better understanding of the mechanism underlying the endodormancy phase transition is necessary for developing countermeasures against the effects of global warming. In this study, we analyzed the sRNAome of Japanese pear flower buds in endodormant and ecodormant stages over two seasons by implementing of RNA-seq and degradome-sequencing.
We identified 137 conserved or less conserved miRNAs and 50 pear-specific miRNAs. However, none of the conserved microRNAs or pear-specific miRNAs was differentially expressed between endodormancy and ecodormancy stages. On the contrast, 1540 of 218,050 loci that produced sRNAs were differentially expressed between endodormancy and ecodormancy, suggesting their potential roles on the phase transition from endodormancy to ecodomancy. We also characterized a multifunctional miRNA precursor MIR168, which produces two functional miR168 transcripts, namely miR168.1 and miR168.2; cleavage events were predominantly mediated by the non-conserved variant miR168.2 rather than the conserved variant miR168.1. Finally, we showed that a TAS3 trans-acting siRNA triggered phased siRNA within the ORF of one of its target genes, AUXIN RESPONSE FACTOR 4, via the analysis of phased siRNA loci, indicating that siRNAs are able to trigger phased siRNAs in pear.
We analyzed the sRNAome of pear flower bud during dormant phase transition. Our work described the sRNA profiles of pear winter buds during dormant phase transition, showing that dormancy release is a highly coordinated physiological process involving the regulation of sRNAs.
Small RNAs (sRNAs) are ubiquitous regulatory molelcules produced by many thousands of endogenous genes. Silencing pathways based on sRNAs that function at the transcriptional or post-transcriptional levels to negatively regulate the expression of protein-encoding genes have been identified in most eukaryotes. These pathways are also considered to play a role in the silencing of viral RNA and the suppression of mobile elements [1, 2]. Plant sRNAs can be categorized into several major classes, including microRNA (miRNA), heterochromatic small interfering RNA (hc-siRNA), and phased/secondary siRNA (pha-siRNA), according to their origin and biosynthesis [3, 4]. miRNAs are transcribed from miRNA (MIR) gene families, which occur mainly in the intergenic genomic region and sometimes inside protein-coding genes; transcripts eventually become mature miRNAs (mainly 20–22 nt long) via processing by DICER-LIKE1 (DCL1) protein . siRNA is processed from double stranded RNA (dsRNA) or from long, perfectly complementary hairpin RNA molecules. dsRNA is recognized and cleaved by DCL2/3/4 to generate different classes of siRNA. hc-siRNA is 24 nt long and results from DCL3 processing of dsRNA transcribed from intergenic or repetitive regions of the genome by the plant-specific RNA polymerase Pol IV, and possibly also Pol V [6–8]. hc-siRNA functions to maintain genome integrity by suppressing transposable elements via an RNA-dependent DNA methylation (RdDM) pathway . In contrast, pha-siRNA is processed by DCL4 from dsRNA dependent on RNA-dependent RNA polymerase 6 (RDR6). A well-described category of pha-siRNA includes trans-acting siRNA (ta-siRNA) of Arabidopsis . While hc-siRNA plays a crucial role in chromatin modification, miRNA and pha-siRNA function mainly at the posttranscriptional level, via either cleavage or translational suppression of target transcripts, and in a few instances also directing the methylation of DNA [10, 11].
miRNA has been implicated in the control of diverse cellular, physiological and developmental processes in plants. Among plant species, there are several miRNA species with well-conserved sequences and functions. For example, miR156 targets a series of SQUAMOSA-PROMOTER BINDING PROTEIN-LIKE (SPL) genes, regulating the phase transition from juvenile to adult during shoot development . miR172 acts downstream of miR156 and mediates regulation of APETALA2 (AP2) and AP2-like genes, which are needed for proper specification of flower organs . The relative balance of miR156 and miR172 is essential for the regulation of phase change and flowering . miR390 directs cleavage of trans-acting siRNA 3 gene (TAS3), leading to production of ta-siRNAs , which, together with miR160 and miR167, targets the AUXIN RESPONSE FACTOR (ARF) gene family, regulating the response of auxin in plant cells [15, 16].
Recent bioinformatics and high-throughput sequencing studies have uncovered a large number of non-conserved miRNAs. These are mostly expressed at low levels with divergent target genes; thereby, they may have specialized functions. Several non-conserved miRNAs indeed function in fine-tuning roles in the target regulatory networks of different plants [17–20]. Discoveries of specific mechanisms and functions of non-conserved plant miRNAs over a wide range of conditions are an expanding topic of investigation . In pome fruits, some progresses in the identification of miRNAs have been achieved. Xia et al.  identified apple miRNAs and pha-siRNAs, describing novel regulatory networks targeting a multitude of genes inside and outside the MYB family. Visser et al.  further extended the apple sRNAome by characterizing miRNAs and siRNAs in apple leaves. Ma et al.  suggested that an apple-specific miRNA may affect the disease resistance pathway by targeting a group of NUCLEOTIDE BINDING SITE-LEUCINE-RICH REPEATS (NBS-LRR) resistance genes. In addition, Niu et al.  in silico predicted miRNAs using the genome sequence of Chinese pear (Pyrus pyrifolia ‘white pear’ group), providing a basic resource to support future research in this specie.
In woody perennial plants, including deciduous fruit trees, such as pear, endodormancy is a strategy for surviving the cold winter. To complete endodormancy and resume growth, low temperatures are required, but the recent global warming trend has sometimes interrupted normal endodormancy establishment and release . Therefore, elucidation of the mechanism underlying the endodormancy phase transition is necessary for developing countermeasures against the effects of global warming. In a previous study, we carried out transcriptome analysis of Japanese pear flower buds during endodormancy phase transition and demonstrated changes in transcript abundance for genes involved in phytohormone metabolism, antioxidant response and methylation changes . We also focused on the function of dormancy-associated MADS-box genes. However, whether sRNAs were involved in the phase transition during dormancy has thus far not been well examined.
In this study, we analyze the sRNAome of Japanese pear flower buds in endodormant and ecodormant stages over two seasons through next-generation sequencing. By implementing of RNA-seq and parallel analysis of RNA end sequencing (PARE-seq), we found that several sRNA loci differential expressed between endodormancy and ecodormancy stages, demonstrating possible involvement in the regulation of endodormancy release. Moreover, we observed a multifunctional miRNA precursor MIR168 and the pha-siRNA production within the ORF of ARF4 triggered by a ta-siRNA. This work constitutes the study to characterize sRNAs in pear winter buds and provides a platform for further investigation of specific sRNAs in various biological processes in pear.
High-throughput sequencing of sRNAs in winter flower buds
The plant samples used in the present study have been described in previous reports . sRNA libraries were constructed from flower buds collected on December 6 (endodormancy) and December 31 (ecodormancy) of the 2009/2010 winter season and on December 2 (endodormancy) and December 31 (ecodormancy) of the 2010/2011 season. As endodormancy released in the late December in both years according to the DVI model [27, 28], our samples thus designated as endo2009, eco2009, endo2010 and eco2010 for respective buds from December 6 and 31 of the 2009/2010 season and from December 2 and 31 of the 2010/2011 season . High-throughput sequencing generated at least eighteen million clean reads per sample (Table 1). The total numbers of reads are similar in all the samples, allowing a meaningful comparison between endodormant and ecodormant stages. After collapsing into unique reads, 5.2–7.7 million unique sequences were obtained in each library.
Reads of 19–25 nt accounted for over 95 % of the total reads, among which about half were 24 nt long in all four libraries (44.2–51.9 %, Fig. 1). The second most abundant size was 23 nt (~20 %), followed by 21 nt (~10 %) and 22 nt (~10 %). The sRNA sequences were then annotated with repeat-associated sequences and the non-coding RNA database (Rfam 10.0) . Sequence reads from rRNA, tRNA and repeat regions made up about 10 % of the unique sequences and 20 % of the redundant sequences in all four samples (Table 1). For further analysis, reads of length 19-25 nt that were sequenced at least twice were mapped to the Chinese pear genome (Pbr1.0, downloaded from http://peargenome.njau.edu.cn; ). About 80 % of the sequences mapped to at least one scaffold and were further analyzed.
De novo annotation of sRNA loci in the pear genome
After filtering the reads derived from rRNA and tRNA, clean reads were aligned to the pear genome using Bowtie (v1.0.0, ) and were analyzed with ShortStack  using the default “plant mode.” In total, 218,050 loci were identified as producing sRNA (Additional file 1: Table S1), among which 164 miRNA loci (MIR genes) and 37,212 other hairpin (HP) RNA loci were identified. The sRNA derived from most of these loci is putatively processed by dicers, but 3218 loci were annotated as non-dicer processed. The sRNA loci were grouped according to mapped sRNA length. As shown in Fig. 2, 24 nt loci were the most abundant (87.4 %), while 20-21 nt loci occurred least frequently. In all the loci groups, non-HP loci accounted for over 75 % of all loci. In addition, most of the miRNA loci were categorized as 21 or 22 nt.
Summary of PARE-seq
PARE-seq of winter flower buds produced 16,986,045 clean reads with sizes of 20 and 21 nt, corresponding to 4,531,717 non-redundant tags, among which 2,055,412 tags were successfully mapped to 32,944 pear transcripts . To describe transcriptome-wide cleavage events, we performed the analysis using the well-established pipeline PAREsnip . In total, 7477 cleavage events (Additional file 2: Table S2) were found in 4819 genes with statistical significance. These cleavage events fell into categories 0 to 3 (category 0 represented the cleavage position with the maximum depth (>1 read) on the transcript and there is just 1 position at the maximum value on the transcript; category 1 represented the position with the depth equal to the maximum value (>1 read) on the transcript, and there is >1 position at maximum value; category 2 represented the position with the depth (>1 read) above the average depth, but not the maximum on the transcript; category 3 represented the position with the depth (>1 read) below or equal to the average depth of coverage on the transcript) defined in PAREsnip (to avoid false positive results, we did not use category 4, which represented cleavage events supported by only one read mapped). As shown in Fig. 3a, 22.4 % of cleavage events belonged to category 0, which was defined as the most abundant cleavage event on a transcript. The cleaved genes were then subjected to GO enrichment analysis. The results showed that “responds to stimulus” was enriched with the lowest false discovery rate (1.2e-09), suggesting a role for miRNA in regulation of responses to biotic and/or abiotic stimuli. In addition, metabolic process and other 19 categories were significantly enriched (Fig. 3b).
Identification of known miRNAs in pear winter flower buds
MIR loci identified by ShortStack were compared to digitally predicted miRNAs . Of 164 annotated MIR loci, 47 overlapped with previously predicted MIR loci, which generated 68 mature miRNAs (including both the 5p and 3p miRNAs from the same precursor). These MIR loci produce miRNAs in 22 miRNA families, including 15 conserved miRNAs and seven less-conserved miRNAs (data not shown).
As the relatively high stringent ShortStack algorithm tended to produce false negative results , BLAST-based strategies were also used to identify known miRNAs with miRProf . The clean reads were compared to a publicly available miRNA database . When only allowing perfect matches, a total of 137 unique sequences were annotated as known miRNAs in the four sRNA libraries. They clustered into 32 miRNA families, including 20 conserved families and 12 less-conserved families (Fig. 4). The abundance of individual conserved miRNAs ranges from several reads to a few million reads in the libraries. These miRNAs showed similar expression patterns in the four libraries: miR166 was the largest represented miRNA family, followed by miR167, both of which have over 1000 normalized reads in all four libraries. Six miRNAs (miR156, miR164, miR168, miR319, miR390 and miR482) have normalized read counts between 100 and 1000 in at least one library. Compared to apple, which contains 206 mature miRNAs belonging to 43 families, the pear winter buds lacked several sRNA families, such as miR828, probably due to their temporal- or spatial- specific expression that was also observed in apple .
Targets of miRNAs were identified using PARE-seq data. A total 131 genes were identified as targets of conserved miRNAs. Representative targets of conserved miRNAs are shown in Additional file 3: Table S3. As previously reported, most conserved miRNAs targeted gene families. Although most (122) of the 131 identified genes were conserved targets for these miRNAs across a wide range of plant species, nine had not been reported in other species. For example, miR166, which is known to target homeobox proteins in plant, was found to target genes encoding fructose-bispospate aldolase and glutathione peroxidase. Similarly, miR167, which targeted a series of ARF genes, also targeted a gene similar to the Arabidopsis RESPONSIVE TO DEHYDRATION 19 (RD19) gene. About two-third of the cleavage events (85 of 131) mediated by conserved miRNAs were in category 0, with the maximum depth on the transcripts.
Identification of pear-specific miRNAs and their targets
To identify pear-specific miRNAs, we used two well-established pipelines, ShortStack and miRCAT . As mentioned above, ShortStack identified 164 miRNA loci, among which 47 were annotated as known miRNAs. The remaining 117 MIR loci were considered specific to pear. Among these 117 loci, 12 loci produced mature miRNAs of 21–22 nt (Table 2), while others generated 23–24 nt miRNAs. Meanwhile, after removing known miRNAs, the remaining 20–22 nt sRNA reads were subjected to the miRCAT pipeline using default parameters for plant miRNA identification. A total of 50 miRNA candidates were identified from the four libraries, of which 7 had miRNA* (star strand) sequences identified from the same libraries, while the other 43 had no identified miRNA* sequences (Table 2). In total, we identified 16 candidates with miRNA* (3 candidates were identified by both miRCAT and ShortStack), which were considered as pear-specific miRNAs, and 43 candidates without miRNA* sequences were considered as pear-specific miRNA candidates. Stem-loop structures of the predicted precursors of each pear specific miRNAs were shown in Additional file 3: Figure S1.
Of the 59 pear-specific miRNAs, 37 belonged to the 21 nt class of miRNAs and 19 belonged to the 22 nt class, while the remaining three were 20 nt long (Table 2). In general, the pear-specific miRNAs were much less abundant than the conserved miRNAs in our libraries. For example, in the endo2009 dataset, only 4 of the 59 novel miRNAs yielded levels over 10 RPM, while 37 were below 1 RPM (Additional file 5: Table S4).
Target genes were also identified for the 59 pear-specific miRNAs. Twenty-nine genes were identified as targets of 16 pear-specific miRNAs (Table 3). Of the 29 target genes, two belonged to category 0, three to category 1 and eight to category 2, while all others were classified into category 3. Among these 29 genes, eight genes were successfully annotated by TAIR 10. The target of miRn20 encoded GAMMA CARBONIC ANHYDRASE-LIKE 2, while miRn35 targeted a NITRATE TRANSPORTER gene. In addition, miRn42 targeted POLYUBIQUITIN 10 (UBQ10) and miRn52 targeted an ACTIN-RELATED PROTEIN 5 (ARP5). Hence, these pear specific miRNAs may be involved in the regulation of an array of metabolic and biological processes and signaling pathways. Similar to previous reports, however, most of the pear-specific miRNA targets fell into categories 2 or 3, which represents a relatively low-confidence group and necessitates further experimental validation.
Differential expression of sRNAs during endodormancy release
Differential expression analysis of sRNAs was performed on two levels: individual sequences of miRNAs and clusters annotated by ShortStack. Differential expression was defined as at least a 2-fold (upregulation) or 0.5-fold (downregulation) change, with statistically significant difference between ecodormancy and endodormancy samples.
The differential expression analysis between endodormancy and ecodormancy stages of miRNAs was performed using edgeR . Two-year samples of similar dormancy stages, judged by DVI values, were used as biological replicates. In our datasets, we did not observe any differentially expressive conserved miRNA (Fig. 4). Furhtermore, we carried out qRT-PCR to confirm their expression patterns, and most of the miRNA expression patterns were similar as that determined by the next-generation sequencing (Fig. 5). In our work, the samples collected from two successive seasons allowed us to identify whether the miRNAs were stably differential expressive through different years. As shown in Fig. 5, although some miRNAs, e.g. miR164, miR167, significantly differential expressed between DVI 0.5 and DVI 1.15 in the 2009/2010 season, their expression patterns in the samples within the similar dormant stages in 2010/2011 season were totally different, suggesting that such differential expressions were not related to the dormant stage transition but to other reasons such as environmental changes and so on. In addition, the pear-specific miRNAs were also analyzed, but none of them showed differential expression between endodormancy and ecodormancy buds (data not shown).
Differential expression analysis of sRNA loci was also performed with edgeR. Among the 218,050 sRNA loci, a total of 1540 differential expression loci were identified, of which 1287 were upregulated and 253 were downregulated in the ecodormancy stage (Fig. 6). The imbalance between up- and down-regulated loci might suggest increased production of sRNAs in ecodormancy pear winter buds. As shown in Fig. 7a, ~70 % of the differentially expressed loci belonged to the 23–24 nt group, while 30 % were categorized as 20–22 nt. In terms of structure, 14 differentially expressed loci were MIR loci and 299 were HP loci, while the rest were non-HP loci (Fig. 7b). Among these differential expressive loci, 571 loci were annotated as transposons and 66 loci overlapped with annotated transcripts, including 46 located in introns and 20 in exons (Fig. 7c). All 20 exon-overlapped loci were upregulated at ecodormancy. As the presence of exon-originated siRNA suggested DCL-mediated cleavage of transcripts, the corresponding transcript abundances were analyzed using RNA-seq data (Fig. 7d and e). As a result, the 20 exon-overlapped loci were found located in 19 genes, two of which showed significant upregulation at ecodormancy (Fig. 7e): Pbr009262.1, similar to AtbZIP9 (AT5G24800) and an un-annotatable gene Pbr041593.1. The remaining genes were stably expressed near endodormancy release in both ‘Kosui’ and ‘Suli’ libraries (Fig. 7d and e).
Identification of the multi-functional precursor of miR168
It has been shown that miR168 is involved in the post-transcriptional regulation of AGO1 in Arabidopsis and other plant species . In pear winter flower buds, we observed a similar cleavage event via PARE-seq. However, this cleavage event was in category 2, while second cleavage event, located 5 bp upstream and in category 0, was mediated by an unknown 21 nt sRNA with two variants, between which there was a SNP near the 3′ end. Alignment with the known miR168 sequence showed that the 16 nt sequence at the 5′ end of the unknown sRNAs perfectly matched 16 nt at the 3′ end of miR168. Analysis of their precursors showed that the unknown sRNAs and miR168 derived from overlapping position within the same precursor, while differing in abundance (Fig. 8a and b). These results suggested that in pear, the miR168 precursor was able to produce multiple functional mature miRNAs. We thus named the two miRNAs miR168.1 and miR168.2. However, miR168.1 was the major miR168 species in all four libraries, e.g., in the endo2009 library, miR168.1 and its miRNA* account for over 80 % of sRNA produced from the precursor, while miR168.2 accounts for only ~5 % (Fig. 8b). We found, though, that miR168.2 dominantly mediated AGO1 cleavage in pear winter buds (Fig. 8c and d). This was further confirmed by 5′-RACE (Fig. 8e); in all nine independently sequenced colonies, the 5′ end corresponded to the cleavage site of position 512, which was mediated by miR168.2.
Identification of TAS3 locus and the pha-siRNA locus triggered by TAS3 ta-siRNA
In the present study, identification of pha-siRNA was performed using ShortStack, by implementing the pear genome sequences. In total, 133 phased clusters were predicted to be statistically significant. For further analysis, 32 loci with hairpin structure and 18 loci annotated as miRNA precursors were excluded. Among the remaining loci, we identified a TAS3 locus (Fig. 9a) with two miR390 target sites with Allan scores of 4.5 and 5.5 (Fig. 9b). Alignment of the TAS3 locus with those of other plant species revealed conservation of the two target sites as well as a region for the production of ARF-targeting ta-siRNAs (Fig. 9a and b). The conserved TAS3-derived pha-siRNAs D7(+) and D8(+) and the PARE data were subsequently submitted to the CleaveLand pipeline. Seven ARF genes, and an unannotated gene (Pbr024661.1), were identified as targets of D7(+) and D8(+) (Fig. 9c). Interestingly, the upstream region of the D8(+) target site in Pbr041836.1 (ARF4) was predicted as a phased locus (Fig. 9d). The fact that the primary cleavage was mediated by ta-siRNA D8(+) suggested that this was not a case of miRNA-mediated phasing process. Therefore, the phased locus observed here may produce secondary siRNAs through the RDR6-DCL4 pathway triggered by D8(+) pha-siRNA.
Next-generation sequencing has tremendously increased the ability for analysis of sRNA. To date, much research has focused on the identification and annotation of miRNAs. However, the majority of expressed sRNAs are not miRNAs. In Arabidopsis, miRNAs account for approximately 10 % of the genome. In pear winter buds, only 169 loci (fewer than 0.1 %) were annotated as miRNAs among a total of 218,050 sRNA loci. Because we used ShortStack, which employs a relatively high stringent algorithm, to annotate MIR loci, the actual number of miRNA loci may be a bit larger; nonetheless, it was clear that a large number of sRNA loci were not miRNAs. Among these unannotated loci, HP loci represented a small portion. HP loci have long been used as a tool to manipulate plant mRNA expression levels. Unlike the loci that produce miRNAs, sRNAs from HP loci are relatively evenly distributed among the sequences. To date, although a few endogenous HP loci have been characterized in detail, including Arabidopsis IR71, IR2039  and maize Mu killer , these loci require further studies. As shown in Fig. 2, the majority of sRNA loci in the pear genome cannot form the hairpin structure (i.e. they are non-HP); these non-HP loci include pha-siRNA loci and hc-siRNA loci, among others. Several phased loci, such as miR390-targeted TAS3 loci and miR482-targeted NBS-LRR loci, are conserved among species. This work identified 113 phased loci, among which a conserved TAS3 locus was identified. Interestingly, ta-siRNA generated from the TAS3 locus further triggered the production of pha-siRNAs in the upstream region of the target gene ARF4. Recently, similar events have also been observed in soybean: transcripts of ARF3 and ARF4 were not only cleaved by ta-siARFs 7D(+) and 8D(+), but the ARF targets produced pha-siRNAs . Thus, our work confirmed that siRNAs can function as pha-siRNA triggers, not only in Solanaceae, but also in Rosaceae.
The expression of conserved miRNA differed among tissue and organs. Previous reports identified apple miRNA expression in various tissues , comparison with which suggests distinct profile of miRNA in pear winter buds. Although Xia et al.  and the present work both used RPM to normalize raw reads for expression level, they used genome-mapped reads while we used total clean reads. It is therefore impossible to directly compare the absolute abundance of each miRNA family. We alternatively compared the five most abundant miRNA families in apple organs and pear winter buds. In pear, miR166 and miR167 were the most abundant sRNAs, followed by miR168, miR319 and miR390. In apple organs, miR166 were relatively highly expressed in leaf and root and miR167 also showed relatively high expression in all tested organs except fruit. However, the miR319 family was not abundantly expressed in apple and miR390 showed similarly low expression, except in flower. Such results suggested that miR319 and miR390 play specific roles in the development of winter buds. miR319 targets class II TCP family transcription factors, which function in the coordination of cell proliferation, the differentiation of several morphological traits, the biosynthesis of phytohormons, as well as the regulation of circadian clock rhythms . However, the roles of TCP genes in winter buds have not been well described. Compared to other tissue/organs, the high abundance of miR319 suggested relatively low TCP expression, which is probably correlated with the active cell division observed in dormant buds, comparing to mature leaves in which cell division is limited to the area near the petiole. In Arabidopsis, miR160 and miR167 are involved in auxin signaling via regulation of ARF genes . In Japanese pear, we also identified seven and six ARF encoding genes regulated by miR160 and miR167, respectively. In addition, a class of ta-siRNAs, produced from the TAS3 gene and triggered by miR390, were also involved in ARF gene regulation via mRNA cleavage. In Arabidopsis and wheat, ARF maintains seed dormancy by stimulating the ABA signal . Although the functions of auxin and ARF on endodormancy have not yet been described in detail, the relatively high expression of ARF-target miRNAs, such as miR167 and miR390, suggests their possible roles in the winter bud dormancy.
Many of the previous reports on sRNA uses a single biological sample for differential expression analysis, potentially increasing the risk of false positive results. In our work, we analyzed the sRNAome of samples from two successive seasons as biological replicates, allowing us greater confidence in the results of our differential expression analysis. Differential expression analyses usually employ one of three strategies: i) individual sequences of interest, such as miRNAs; ii) genomic elements, such as genes or transposons; or iii) discrete sRNA-generated genomic loci (bin or cluster) . In this work, we employed two separate strategies: strategy i was applied to miRNA analysis and strategy ii was applied to analyses of other sRNA loci.
During the preparation of our manuscript, Niu et al.  reported the analysis of the miRNAs of Chinese white pear during the endodormancy release. They found several miRNAs that were differentially expressed during endodormancy release. Specifically, they identified a pear-specific miRNA miR6390 that directly cleaved the transcript of PpDAM1 gene, suggesting the post-transcriptional regulation of PpDAM1. However, neither the conserved miRNAs nor the pear-specific miRNAs were differentially expressed using samples of two successive seasons in our study. In addition, we did not found the miR6390 in all our four datasets even we sequenced the sRNAome in a relatively high depth. The reason for such differences is probably due to the different cultivars used in these two studies, suggesting that the functions of miRNAs during endodormancy release may vary among pear cultivars.
In the 1540 differentially expressed loci, over 80 % were upregulated after endodormancy release. Considering that a similar phenomenon was observed for the RNA-seq analysis using the same samples, these results might suggest universal transcriptional upregulation following endodormancy release. In total, 20 loci, all of which were upregulated in ecodormancy, were annotated as exons of 19 genes, suggesting their potential regulation of those genes. However, only two genes were identified as differentially expressed in endodormancy vs. ecodormancy. One of them encoded a bZIP protein, while the other was not able to be annotated. The bZIP protein is similar to Arabidopsis AtbZIP9, belonging to group C, which has an extended leucine zipper with up to nine heptad repeats. The function of AtbZIP9 has not yet been well characterized, but other members of group C were involved in regulation of seed storage protein production and responses to environmental or pathogen challenges. In the ‘Kosui’ RNA-seq libraries, the bZIP9 gene was upregulated at ecodormancy, while in the Chinese pear datasets high expression levels were observed in December and January. These results suggest a potential function of bZIP9 in the regulation of endodormancy release.
There are several cases for which multiple miRNAs accumulate from the same precursor, including cases where miRNA and miRNA* species accumulate to approximately equal levels, cases in which overlapping but distinct miRNA species are produced from the same arm of a stem-loop, and cases where multiple miRNA/miRNA* duplexes are sequentially excised. In this work, we identified a novel miR168 miRNA sequence which overlapped with the well-described miR168. In accordance with the nomenclature proposed in Meyers et al. , we designated the original miR168 and novel miR168 in Japanese pear as PpmiR168.1 and PpmiR168.2, respectively. A similar case was observed for Arabidopsis miR161, which targets the PPR gene family; the miR161 locus encodes two overlapping miRNAs, miR161.1 and miR161.2, from a contiguous 29 nt sequence. Considering the evolution of this type of MIR locus, Allen et al.  proposed that the MIR161 gene evolved relatively recently via an inverted duplication event associated with the active expansion of the target gene. The similar structure of pear miR168 suggests the same mechanisms of accumulation and evolution as those of Arabidopsis miR161. In pear, miR168.1 was the dominant mature miRNA, accounting for over 70 % of sRNA, while miR168.2 accounted for only ~5 %. However, PARE-seq showed that most cleavage events in the target gene AGO1 were mediated by miR168.2 but not by miR168.1, possibly due to differences in minimum free energy required for forming a duplex between the miRNA and the target mRNA.
As mentioned above, miR390 triggered the production of ta-siRNAs, which post-transcriptionally regulate ARF genes. Previous reports show reveal at least five TAS3 loci in the apple genome . Despite pear belonging to the same genus as apple, our phased loci analysis found only one TAS3 locus sharing high homology with apple TAS3-1 genes. Further Blasting with core D7(+) and D8(+) sequences to pear genome sequences also matched only a single locus (data not shown). We do not consider only one TAS3 locus to be present in pear genome and attribute our result to the incomplete pear genome used in this study. In addition, the lack of pear EST information also limited us to identified TAS genes and other non-coding RNAs. Compared to apple, for which 339,058 EST fragments exist in the Genbank EST database, only 4413 EST fragments exist for pear (as of December, 2014), none of which correspond to the conserved TAS locus. In contrast, four of five apple TAS loci were identified from the EST database .
Much work has reported identification of miRNAs in new plant species. In this study, we used three pipelines (miRProf and miRCAT, which are parts of UEA sRNA workbench, and ShortStack) to annotate miRNAs. miRProf uses similarity to search for matches while ShortStack and miRCAT identify miRNA sequences de novo. Utilizing annotations of known miRNAs, miRProf identified 137 miRNAs belonging to 32 known miRNA families, while ShortStack identified only 68 known miRNAs belonging to 22 families. Although BLAST-based strategies are more sensitive for identification of known miRNAs, they are unable to obtain information on miRNA location and copy number in the genome and their sensitivity depends largely on the BLAST parameters chosen (with or without mismatches), especially for species with no information in miRBase. In contrast, the de novo strategies use a series of criteria to determine the probability of each sRNA and their accuracy depends on the stringency of the criteria used. In recent years, the criteria for miRNA annotation have largely been changed. For example, most miRNA precursors produce more than a single product (as in the example of miR168), which are not able to be properly annotated using all three pipelines. Therefore, it is necessary to develop a more robust algorithm for plant miRNA annotation.
We identified 137 conserved or less conserved miRNAs and 50 pear-specific miRNAs. However, none of the conserved microRNAs or pear-specific miRNAs was differentially expressed between endodormancy and ecodormancy stages at least in this study. On the contrast, 1540 of 218,050 loci that produced sRNAs were differentially expressed between endodormancy and ecodormancy. We also characterized a multifunctional miRNA precursor MIR168 and we showed that siRNAs are able to trigger phased siRNAs in pear like reported in other plant species. Our work showed that dormancy release is a highly coordinated physiological process involving the regulation of sRNAs.
All plant materials used in this study were described in Bai et al. . The total RNA used for transcriptome analysis in Bai et al.  was also used for sRNA-seq in this work. For PARE-seq, buds sampled on Dec. 6 (under endodormancy) and on Dec. 31 (after endodormancy release) in the 2009/2010 seasons and on Dec. 2 (under endodormancy) and on Dec. 31 (after endodormancy release) in the 2010/2011 season were mixed in equal quantities and used for RNA preparation.
RNA extraction and deep sequencing
Bioinformatics analysis of small RNAs
All the raw reads were first processed to removing the 3′ adaptors with the free script cutadapt (https://cutadapt.readthedocs.org/en/stable/). Any sequences without the adaptor matched were excluded from further analysis. The reads were then filtered to remove the reads <18 or >25 nt, with low complexity or matching to tRNAs and rRNAs by the filter function integrated in the UEA sRNA workbench . The resulting clean reads were subjected to the analysis using two well-established pipelines, UEA sRNA workbench and ShortStack. For the UEA sRNA workbench, conserved miRNA analysis was performed with miRProf with no mismatch allowed using miRbase v21 . The conserved miRNAs were then removed from the libraries and the remaining sRNAs were subjected to the pear-specific miRNAs identification using miRCat using the default plant parameters. The pear genome sequences were retrieved from the Pear Genome Project . The total number of the reads in a given library was used for the normalization of read abundance, denoted as reads per million reads (RPM). In addition, the clean reads were also mapped to pear genome sequences with bowtie (v1.0.0, ) and subjected to the pipeline ShortStack (v1.2.4, ) with the default parameters to de novo identify the miRNAs loci, HP sRNA loci and other sRNA loci. The miRNA loci were further annotated with miRBase to identify the conserved miRNAs and pear-specific miRNAs. To identify the phased siRNA loci, the p-value (Phase pval) calculated by ShortStack were adjusted to false discovery rate (FDR) using Benjamini-Hochberg method and the sRNA loci with the FDR < 0.05 were determined as phased loci. The raw counts for each sRNA loci calculated by ShortStack were then used for differential expression analysis.
Differential expression analysis of miRNAs and sRNA loci
Differential expression of conserved, less conserved and pear-specific miRNAs were carried out using edgeR . The raw counts of all individual sRNA tags, including all types of sRNAs, were calculated and used as the input of edgeR. The edgeR output was then manually adjusted to summarize the counts of miRNAs within the same family. Differential expression of the sRNA loci were calculated by edgeR using the output of ShortStack as the input.
Bioinformatic analysis of PARE data
Mixed RNA with equal amounts of the four samples used for sRNA-seq was used for PARE sequencing. After adapter-trimming and genomic mapping as done for the sRNA data. The pipeline PAREsnip  within the UEA sRNA workbench was used for the PARE analysis. The threshold for the alignment score was set to 4.5 for all miRNAs. Only category 0–3 were analysed to minimize the false positive results. The pear gene set was retrieved from Pear Genome Project and the annotation of the genes was carried out by BLASTx alignment to Arabidopsis genes (TAIR10). GO analysis was carried out with BiNGO , a plugin of Cytoscape.
qRT-PCR of miRNAs
Total RNAs were extracted using the method previously described . cDNAs synthesis and qRT-PCR were performed with Mir-X miRNA qRT-PCR SYBR kit (Takara, Tokyo, Japan) according to the manufacturer’s instruction. Relative expression was determined with the 2-ΔΔT algorithm by normalizing to the plant U6 non-coding RNAs. The miRNA-specific primers used for real-time RT-PCR are the sequences of each mature miRNAs.
Availability of supporting data
All the raw sequencing data used in this work have been submitted to the NIH Short Read Archive (SRA) under accession number PRJNA310384.
Bartel DP. MicroRNAs: genomics, biogenesis, mechanism, and function. Cell. 2004;116(2):281–97.
Bartel DP. MicroRNAs: target recognition and regulatory functions. Cell. 2009;136(2):215–33.
Axtell M, Merchant S. Classification and comparison of small RNAs from plants. Annu Rev Plant Biol. 2013;64:137–59.
Fei Q, Xia R, Meyers B. Phased, secondary, small interfering RNAs in posttranscriptional regulatory networks. Plant Cell. 2013;25(7):2400–15.
Voinnet O. Origin, biogenesis, and activity of plant microRNAs. Cell. 2009;136(4):669–87.
Matzke M, Kanno T, Claxinger L, Huettel B, Matzke A. RNA-mediated chromatin-based silencing in plants. Curr Opin Cell Biol. 2009;21(3):367–76.
Law J, Jacobsen S. Establishing, maintaining and modifying DNA methylation patterns in plants and animals. Nat Rev Genet. 2010;11(3):204–20.
Lee T, Gurazada S, Zhai J, Li S, Simon S, Matzke M, Chen X, Meyers B. RNA polymerase V-dependent small RNAs in Arabidopsis originate from small, intergenic loci including most SINE repeats. Epigenetics. 2012;7(7):781–95.
Vazquez F, Vaucheret H, Rajagopalan R, Lepers C, Gasciolli V, Mallory A, Hilbert J, Bartel D, Crete P. Endogenous trans-acting siRNAs regulate the accumulation of Arabidopsis mRNAs. Mol Cell. 2004;16(1):69–79.
Wu L, Mao L, Qi Y. Roles of DICER-LIKE and ARGONAUTE proteins in TAS-derived small interfering RNA-triggered DNA methylation. Plant Physiol. 2012;160(2):990–9.
Wu L, Zhou H, Zhang Q, Zhang J, Ni F, Liu C, Qi Y. DNA methylation mediated by a microRNA pathway. Mol Cell. 2010;38(3):465–75.
Wu G, Poethig R. Temporal regulation of shoot development in Arabidopsis thaliana by miR156 and its target SPL3. Development. 2006;133(18):3539–47.
Aukerman M, Sakai H. Regulation of flowering time and floral organ identity by a microRNA and its APETALA2-like target genes. Plant Cell. 2003;15(11):2730–41.
Wu G, Park M, Conway S, Wang J, Weigel D, Poethig R. The sequential action of miR156 and miR172 regulates developmental timing in Arabidopsis. Cell. 2009;138(4):750–9.
Mallory A, Bartel D, Bartel B. MicroRNA-directed regulation of Arabidopsis AUXIN RESPONSE FACTOR17 is essential for proper development and modulates expression of early auxin response genes. Plant Cell. 2005;17(5):1360–75.
Wu M, Tian Q, Reed J. Arabidopsis microRNA167 controls patterns of ARF6 and ARF8 expression, and regulates both female and male reproduction. Development. 2006;133(21):4211–8.
Kutter C, Schob H, Stadler M, Meins F, Si-Ammour A. MicroRNA-mediated regulation of stomatal development in Arabidopsis. Plant Cell. 2007;19(8):2417–29.
Wang Y, Itaya A, Zhong X, Wu Y, Zhang J, van der Knaap E, Olmstead R, Qi Y, Ding B. Function and evolution of a microRNA that regulates a Ca2 + −ATPase and triggers the formation of phased small interfering RNAs in tomato reproductive growth. Plant Cell. 2011;23(9):3185–203.
Wu L, Liu D, Wu J, Zhang R, Qin Z, Liu D, Li A, Fu D, Zhai W, Mao L. Regulation of FLOWERING LOCUS T by a microRNA in Brachypodium distachyon. Plant Cell. 2013;25(11):4363–77.
Hu J, Zhou Y, He F, Dong X, Liu L, Coupland G, Turck F, de Meaux J. miR824-regulated AGAMOUS-LIKE16 contributes to flowering time repression in Arabidopsis. Plant Cell. 2014;26(5):2024–37.
Qin Z, Li C, Mao L, Wu L. Novel insights from non-conserved microRNAs in plants. Front Plant Sci. 2014;5:586.
Xia R, Zhu H, An Y, Beers E, Liu Z. Apple miRNAs and tasiRNAs with novel regulatory networks. Genome Biol. 2012;13(6):R47.
Visser M, van der Walt A, Maree H, Rees D, Burger J. Extending the sRNAome of apple by next-generation sequencing. PLoS One. 2014;9(4):e95782.
Ma C, Lu Y, Bai S, Zhang W, Duan X, Meng D, Wang Z, Wang A, Zhou Z, Li T. Cloning and characterization of miRNAs and their targets, including a novel miRNA-targeted NBS-LRR protein class gene in apple (Golden Delicious). Mol Plant. 2014;7(1):218–30.
Niu Q, Qian M, Liu G, Yang F, Teng Y. A genome-wide identification and characterization of mircoRNAs and their targets in ‘Suli’ pear (Pyrus pyrifolia white pear group). Planta. 2013;238(6):1095–112.
Tamura F, Tanabe K, Itai A. Effect of interruption of chilling on bud break in Japanese pear. Acta Horticult. 1995;395:135–40.
Bai S, Saito T, Sakamoto D, Ito A, Fujii H, Moriguchi T. Transcriptome analysis of Japanese pear (Pyrus pyrifolia Nakai) flower buds trransitioning through endodormancy. Plant Cell Physiol. 2013;54(7):1132–51.
Sugiura T, Honjo H. A dynamic model for predicting the flowering date developed using an endodormancy break model and a flower bud development model in Japanese pear. J Agric Meteorol. 1997;54(5):897–900.
Griffiths-Jones S, Moxon S, Marshall M, Khanna A, Eddy SR, Bateman A. Rfam: annotating non-coding RNAs in complete genomes. Nucleic Acids Res. 2005;33:D121–4.
Wu J, Wang Z, Shi Z, Zhang S, Ming R, Zhu S, Khan M, Tao S, Korban S, Wang H, et al. The genome of the pear (Pyrus bretschneideri Rehd.). Genome Res. 2013;23(2):396–408.
Langmead B, Trapnell C, Pop M, Salzberg S. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009;10(3):R25.
Axtell M. ShortStack: Comprehensive annotation and quantification of small RNA genes. RNA. 2013;19(6):740–51.
Folkes L, Moxon S, Woolfenden H, Stocks M, Szittya G, Dalmay T, Moulton V. PAREsnip: a tool for rapid genome-wide discovery of small RNA/target interactions evidenced through degradome sequencing. Nucleic Acids Res. 2012;40(13):e103.
Stocks M, Moxon S, Mapleson D, Woolfenden H, Mohorianu I, Folkes L, Schwach F, Dalmay T, Moulton V. The UEA sRNA workbench: a suite of tools for analysing and visualizing next generation sequencing microRNA and small RNA datasets. Bioinformatics. 2012;28(15):2059–61.
Kozomara A, Griffiths-Jones S. miRBase: annotating high confidence microRNAs using deep sequencing data. Nucleic Acids Res. 2014;42(D1):D68–73.
Robinson M, McCarthy D, Smyth G. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–40.
Vaucheret H, Vazquez F, Crete P, Bartel DP. The action of ARGONAUTE1 in the miRNA pathway and its regulation by the miRNA pathway are crucial for plant development. Genes Dev. 2004;18(10):1187–97.
Dunoyer P, Brosnan C, Schott G, Wang Y, Jay F, Alioua A, Himber C, Voinnet O. An endogenous, systemic RNAi pathway in plants. EMBO J. 2010;29(10):1699–712.
Slotkin R, Freeling M, Lisch D. Mu killer causes the heritable inactivation of the Mutator family of transposable elements in Zea mays. Genetics. 2003;165(2):781–97.
Arikit S, Xia R, Kakrana A, Huang K, Zhai J, Yan Z, Valdés-López O, Prince S, Musket TA, Nguyen HT, et al. An atlas of soybean small RNAs identifies phased siRNAs from hundreds of coding genes. Plant Cell. 2014;26(12):4584–601.
Lopez JA, Sun Y, Blair PB, Mukhtar MS. TCP three-way handshake: linking developmental processes with plant immunity. Trends Plant Sci. 2015;20(4):238–45.
Jones-Rhoades M, Bartel D, Bartel B. MicroRNAs and their regulatory roles in plants. Annu Rev Plant Biol. 2006;57:19–53.
Liu G, Li W, Zheng P, Xu T, Chen L, Liu D, Hussain S, Teng Y. Transcriptomic analysis of ‘Suli’ pear (Pyrus pyrifolia white pear group) buds during the dormancy by RNA-Seq. BMC Genomics. 2013;13(1):700.
McCormick KP, Willmann MR, Meyers BC. Experimental design, preprocessing, normalization and differential expression analysis of small RNA sequencing experiments. Silence. 2011;2(1):2.
Niu Q, Li J, Cai D, Qian M, Jia H, Bai S, Hussain S, Liu G, Teng Y, Zheng X. Dormancy-associated MADS-box genes and microRNAs jointly control dormancy transition in pear (Pyrus pyrifolia white pear group) flower bud. J Exp Bot. 2015;67(1):239–57.
Meyers B, Axtell M, Bartel B, Bartel D, Baulcombe D, Bowman J, Cao X, Carrington J, Chen X, Green P, et al. Criteria for annotation of plant microRNAs. Plant Cell. 2008;20(12):3186–90.
Allen E, Xie Z, Gustafson A, Sung G, Spatafora J, Carrington J. Evolution of microRNA genes by inverted duplication of target gene sequences in Arabidopsis thaliana. Nat Genet. 2004;36(12):1282–90.
German M, Luo S, Schroth G, Meyers B, Green P. Construction of Parallel Analysis of RNA Ends (PARE) libraries for the study of cleaved miRNA targets and the RNA degradome. Nat Protoc. 2009;4(3):356–62.
Maere S, Heymans K, Kuiper M. BiNGO: a Cytoscape plugin to assess overrepresentation of Gene Ontology categories in Biological Networks. Bioinformatics. 2005;21(16):3448–9.
We are thankful to Daisuke Sakamoto for providing us with flower bud materials.
This work was partly supported by the Japan Society for the Promotion of Science (JSPS) (grant no. 25292027 to T.M.).
The authors declare that they have no competing interests.
SB and TM conceived and planned the study. SB collected and extracted total RNAs for next-generation sequencing. SB, TS, AI and YT analyzed the sequencing data, SB, PAT and YX performed the qRT-PCR. SB and TM wrote the manuscript. TM supervised the research. All authors read and approved the final manuscript.
Information of sRNA clusers annotated by ShortStack. (XLSX 28366 kb)
PAREsnip identified the target of small RNAs. (XLSX 672 kb)
Target of known miRNAs in pear identified by PARE-seq. (XLSX 38 kb)
Step-loop structures of pear-specific miRNAs predicted by RNAfold. (PDF 4170 kb)
Abundance of novel miRNAs in each libary. (XLSX 57 kb)