A comparison of performance of plant miRNA target prediction tools and the characterization of features for genome-wide target prediction
© Srivastava et al.; licensee BioMed Central Ltd. 2014
Received: 29 January 2014
Accepted: 1 May 2014
Published: 8 May 2014
Deep-sequencing has enabled the identification of large numbers of miRNAs and siRNAs, making the high-throughput target identification a main limiting factor in defining their function. In plants, several tools have been developed to predict targets, majority of them being trained on Arabidopsis datasets. An extensive and systematic evaluation has not been made for their suitability for predicting targets in species other than Arabidopsis. Nor, these have not been evaluated for their suitability for high-throughput target prediction at genome level.
We evaluated the performance of 11 computational tools in identifying genome-wide targets in Arabidopsis and other plants with procedures that optimized score-cutoffs for estimating targets. Targetfinder was most efficient [89% ‘precision’ (accuracy of prediction), 97% ‘recall’ (sensitivity)] in predicting ‘true-positive’ targets in Arabidopsis miRNA-mRNA interactions. In contrast, only 46% of true positive interactions from non-Arabidopsis species were detected, indicating low ‘recall’ values. Score optimizations increased the ‘recall’ to only 70% (corresponding ‘precision’: 65%) for datasets of true miRNA-mRNA interactions in species other than Arabidopsis. Combining the results of Targetfinder and psRNATarget delivers high true positive coverage, whereas the intersection of psRNATarget and Tapirhybrid outputs deliver highly ‘precise’ predictions. The large number of ‘false negative’ predictions delivered from non-Arabidopsis datasets by all the available tools indicate the diversity in miRNAs-mRNA interaction features between Arabidopsis and other species. A subset of miRNA-mRNA interactions differed significantly for features in seed regions as well as the total number of matches/mismatches.
Although, many plant miRNA target prediction tools may be optimized to predict targets with high specificity in Arabidopsis, such optimized thresholds may not be suitable for many targets in non-Arabidopsis species. More importantly, non-conventional features of miRNA-mRNA interaction may exist in plants indicating alternate mode of miRNA target recognition. Incorporation of these divergent features would enable next-generation of algorithms to better identify target interactions.
KeywordsmiRNA Target prediction Plants Deep-sequencing Non-model plants
Regulatory non-coding small RNAs (smRNAs, 18–30 nucleotides (nt)) play important roles in the regulation of cellular, physiological and ecological processes in plants [1–5]. smRNAs recognize target mRNA molecules by directing effector Argonaute (AGO) protein complexes via base-pairing interactions with nucleic acid molecules, which usually leads to the inhibition of gene expression. In plants, thousands of smRNAs are expressed at any given condition [6, 7]. Elucidation of the function of these smRNAs would largely depend on recognition of their target molecules. Rapid advances in ‘deep-sequencing’ (or ‘next-generation sequencing’; NGS) technology have enabled genome-wide identification of large numbers of smRNAs (including the rare molecules) with greater efficiency [8, 9]. Thus, the current bottleneck in understanding RNA mediated interaction is the correct identification of genes that may be targeted by the numerous smRNAs in the cell.
Plant smRNAs are broadly classified into microRNAs (miRNAs) and small-interfering RNAs (siRNAs). miRNAs are endogenous and originate from specific locations in genomes. The primary transcripts of miRNAs are transcribed in the nucleus in an RNA polymerase II-dependent manner, and transported to the cytoplasm, where these stem-loop structures generate mature miRNAs . siRNAs may have exogenous as well as endogenous origins from viruses, inverted repeats, transposons, transgenes, convergent mRNAs, natural sense-antisense pairs, hairpin RNAs as well as phased siRNAs. Independent of their origin, both the miRNAs and the siRNAs mechanistically depend on the same two families of proteins, the Dicer-like (DCLs) and the AGOs. smRNAs regulate gene expression by binding to the target mRNAs through complementary base-pairing .
Three modes of repression of targets have been proposed in plants . First, a large number of plant miRNA targets undergo cleavage . The PIWI domain of the AGO proteins have endonuclease activity that cleave target mRNAs that are complementary to the guide smRNA strand. Plant miRNAs display complementarity to their targets throughout their length and thus help AGOs ‘slice’ targets. This feature of complementarity of miRNA:mRNAs has been (nearly universally) used by the tools that computationally predict miRNA-target interactions in plants . Secondly, translational inhibition of targets, in which the regulation of protein levels occurs without changes in the target’s mRNA levels, has also been suggested in plants [14–17]. Translational repression has generally been associated with the limited complementarity between the miRNAs and the targets in animals. However, the degree of complementarity between the miRNA and mRNA necessary to support translational repression in plants remains unknown . The third mode of action is the ‘destabilization of targets’, in which a minority of plant targets in the degradome do not accumulate slicer cleavage products . Such targets may involve mRNA destabilization instead of slicing. Moreover, the smRNA-target interactions are complex, as one smRNA may regulate the expression of more than one target and one mRNA can be regulated by many smRNAs [14, 19–21]. The role of smRNAs (including miRNAs) in inducible adaptive responses of plants to quick changes in its environment is rapidly being recognized [2, 3, 5, 22, 23]. It is conceivable that such rapid smRNA-mediated adaptive responses may involve mRNA destabilization and the reversible repression of targets in plants.
Although, initial studies in Arabidopsis proposed near-perfect complementarity between the smRNAs and their targets as a general rule, deviations from this rule were soon evident, indicating that pairing at some sites may be less perfect than others [1, 24, 25]. For instance, position 19 of miR319 and position 16 in the target region in the mRNAs were shown to be critical for pairing in Arabidopsis . Similarly, a mismatch at the 10th and 11th positions could lead to inhibition of translation instead of cleavage of target . Furthermore, complementarities at the 3’ end of the miRNA and 5’ end of target has been shown to be more crucial for tasiRNA formation than are the complementarities in the 5’ end of miRNA in Arabidopsis . Such studies indicate that the criteria of perfect/near perfect complementarities between miRNA/mRNAs need to be relaxed and additional features should be included for accurate target prediction. Therefore, features such as the conservation of targets in related species, the location of target sites [in the coding sequence or in the untranslated region (UTR)], cleavage or repression of targets, presence of multiple target sites, target site accessibility, and the integration of expression profiles of both, miRNA and targets have been utilized for predicting targets (as reviewed by [29, 30] and references therein).
Patscan  was one of the first tools for predicting targets in Arabidopsis and rice  and several new tools have been developed for miRNA target predictions in Arabidopsis. miRU , the first tool for the plant-specific miRNA target prediction, which was later upgraded to psRNATarget , uses a dynamic programming approach, aligning sequences using a modified Smith-Waterman algorithm and applying the ‘RNAup’ algorithm  for target site accessibility. Targetfinder  implements a ‘FASTA’ program along with a penalty scoring scheme for mismatches, bulges, or gaps for aligning the sequences. In 2010, two web-servers, TAPIR  and Target-align , were introduced. TAPIR is imbedded with two search options, the ‘FASTA’ search engine (for ‘fast’ searches), and the ‘RNA hybrid’ search engine (for ‘precise’ results). Target-align also employ the Smith-Waterman based scoring method for predicting the complementarities between miRNAs and mRNAs. Target-align is implemented both as a web server and as a standalone tool, but its utility for genome-wide target predictions for smRNAs has not been tested. Target_Prediction  is based on ‘scanning’ targets for miRNA-patterns followed by the calculation of the minimum free energy (with the help of ‘RNAhybrid’) for predicting miRNA-mRNA duplexes. miRTour , a web server based program, implements a variety of resources such as BLASTX, RNAfold and ClustalW for the prediction of targets (and thus also involves energy minimizations). imiRTP  is an integrated miRNA target interaction prediction tool kit only for Arabidopsis thaliana miRNAs. Further, machine learning has been implemented for predicting the plant miRNA targets, for instance, p-TAREF  implements support vector regression (SVR) and uses a feature of information of ‘dinucleotide density variation’ around the target site from datasets of A. thaliana, Oryza sativa, Medicago truncatula and Solanum lycopersicum. psRobot  is a server hosting a toolbox for analyzing plant smRNAs: it has two modules of stem-loop prediction and smRNA target prediction. psRobot uses a modified Smith-Waterman algorithm and target site conservation to predict targets in A. thaliana, Brachypodium distachyon, Carica papaya, O. sativa, Populus trichocarpa, Sorghum bicolor, Vitis vinifera and Zea mays. psRobot have implemented parallel programming to reduce the run-time during analysis of large datasets such as transcriptomes and genomes.
Although large numbers of tools are available for identifying smRNA-targets in-silico, a comprehensive evaluation of these tools for large-scale, genome-wide target identification has been lacking. In the post-genomics era where microarrays and deep-sequencing technologies have enabled unparalleled data production, genome-wide target prediction with high accuracy is becoming critical for the elucidation of functions of smRNAs. Here we have examined the performance of 18 publically available target prediction tools for plants, including the three miRNA-target prediction tools that are extensively used but not explicitly developed for plants (Targetscan [44, 45], miRanda , RNAhybrid [47, 48]). We have chosen an experimentally validated dataset from Arabidopsis and other plant species comprising of 330 and 115 unique experimentally validated miRNA:mRNA interactions, respectively, to evaluate the tools.
While testing the performance of 18 published smRNA target prediction tools (Additional file 1), we found that some of these tools were either discontinued, or their source codes or web servers were not available (for instance miRNAassist ). Web miRNA designer, WMD3,  is used to custom design artificial miRNAs to silence expression of specific targets, this tool was also not considered fit for genome-wide target prediction, Similarly, the slice detector module of SoMART  was not considered because it uses the degradome data for mapping targets of miRNAs. Thus, a total of eleven tools (8 tools specifically published for plant targets and 3 others - Targetscan, miRanda, RNAhybrid) were selected for further evaluation in this study. All the selected tools were implemented either as stand-alone tools, web server or both (Additional file 1). These tools predicted targets for plant miRNAs (from miRBase version 18; [52, 53]), against the A thaliana’s trancriptome (Phytozome V8.0; ) at their default settings. Initial evaluation of the selected tools was based on a) execution time and b), the average number of targets predicted per miRNA.
Performance of the algorithm on experimentally validated plant miRNA targets
A. thaliana is one of the best studied plant species including for its smRNAs. For the purpose of this evaluation we broadly classified experimentally validated datasets into two categories, one that originated from Arabidopsis (‘Arabidopsis dataset’) and the other that were obtained from species other than Arabidopsis (‘non-Arabidopsis dataset’).
Evaluation of the tools on Arabidopsis dataset
p-TAREF, an algorithm that implements ‘machine learning’ , gave a genome-wide prediction of 15,082 miRNA target interactions with very low ‘precision’ of only 2% (even though it had an increase in sensitivity from 12.7 to 96.3%). Due to extremely low ‘precision’ percentage, this tool was not considered further. In addition, RNAhybrid, miRanda, Target-align and Targetscan were also not further considered as they returned low ‘precision’ and ‘recall’ rates throughout the analysis (Figure 4). Targetfinder performed the best among the selected tools with a ‘recall’ rate of 88% and a ‘precision’ rate of 97% (Figure 4).
Evaluation of the tools on datasets from non-Arabidopsis species
In addition to the precision and recall analysis we have also performed the ‘receiver operating characteristic (ROC) analysis (Additional file 5) for evaluation of the tools for both, Arabidopsis and non-Arabidopsis datasets. Results of ROC analysis were consistent with the precision and recall analysis. A clear difference in the performance of tools was observed between Arabidopsis and non-Arabidopsis species (Additional file 5). Targetfinder was confirmed as the best performer in both Arabidopsis (area under curve (AUC) = 0.88) as well in non-Arabidopsis species (AUC = 0.78).
Factors affecting prediction efficiencies
Effect of free energy
Features of the false negative predictions
The differences between the TP and FN at the alignment level are summarized as a heatmap (Additional file 7). The heatmap clustering is based on the alignment of first 20 nt of miRNA-mRNA complementarity of the respective interactions that were plotted with respect to miRNA 5’-3’ direction (Additional file 7). miRNA-mRNA seed interaction positions (2–12 position) can be represented either as (i) less than two G:U wobbles, (ii) two mismatches and one G:U wobble, (iii) only three mismatches and no G:U wobble, with or without mismatch/gap/G:U wobble at the 3’ end of the miRNA target interaction. This indicates that continuous pairing of miRNAs and mRNAs in TP dataset is different from FN datasets in non-Arabidopsis species. This observation is further confirmed by evaluating the entropy values for each miRNA-mRNA positions in TP and FN datasets (Figure 8D). Since, the numbers of FNs in Arabidopsis dataset were too few to deduce any meaningful statistics, we did not use Arabidopsis FNs dataset for further comparisons. Wilcoxon rank-test between TP datasets (Arabidopsis and non-Arabidopsis) were not observed to be significant while Wilcoxon rank test between non-Arabidopsis FN and TP datasets were observed to be highly significant (Wilcoxon rank test P = 4.8e-07 for Arabidopsis and for non-Arabidopsis datasets). This suggests that the entropy values in FN datasets are significantly higher when compared to the TP datasets (Arabidopsis and non-Arabidopsis).
The integration of smRNAs into the existing functional genomics datasets is essential for a better understanding of the cellular, physiological and ecological processes [1, 4, 5, 7]. In the post-genomics era where information on identity, sequence and expression of smRNA is readily attainable, the high-throughput characterization of their targets is proving to be the limiting factor in understanding smRNA-functions. Although several models and computational tools have been proposed for in-silico identification of smRNA targets, predicting targets with significant statistical confidence in high-throughput experiments still remains a challenging task. Successful prediction of targets depend on the fine-tuning of several factors effecting the miRNA-mRNA complex formation, such as complementarity of smRNA and target sequence, continuous stretches of matches and mismatches, local structural properties, target site accessibility, free energy of interaction, GC content, G:U pairs, etc. Previously proposed models/tools have been theoretically shown to have optimal performance but a critical comparative evaluation of these algorithms based on experimentally validated dataset has not been performed. In this study we attempted to compare all the available tools for high-throughput smRNA target prediction in Arabidopsis as well as non-Arabidopsis species at optimized scores. In addition, we have tried to understand the possible features that significantly reduce the ‘precision’ and ‘recall’ of tools when they are used for non-Arabidopsis species.
The initial evaluation of the selected eleven plant and animal specific tools were based on the true positive datasets obtained for the A. thaliana species (Figure 1). Based on the execution time and the number of FP target predictions, p-TAREF, RNAhybrid, miRanda, Target-align and Targetscan were not considered suitable for genome-wide prediction in plants and therefore these tools were not evaluated further (Figures 2 and 3).
In comparison to their default settings, the optimization of cut-off scores led to an increase in the ‘precision’ of Target_Prediction by 2% (Additional file 4, Figure 4), and to an increase in ‘recall’ by 3% for psRobot. After score optimization, Targetfinder was found to be the most useful tool for predicting targets for Arabidopsis datasets (Figure 5).
We carried out the same set of evaluations for interactions known in non-Arabidopsis species. A maximum of 43% ‘recall’ rate could be obtained when the tools used Arabidopsis ‘optimal score’. By recalculating the cut-off scores for non-Arabidopsis datasets independently, the ‘recall’ rates were increased to the range from 56-69% (Figure 6 and Additional file 4). For the non-Arabidopsis species, we observed that psRNATarget and Tapirhybrid displayed the best trade-off between ‘precision’ and ‘recall’ rates. Depending upon the context of usage (i.e. rapid sensitive scans or highly specific predictions) a combination of union or intersection for these tools could be recommended for miRNA target prediction in non-Arabidopsis species (see Results for details). This observation of different ‘optimal score’ and low ‘recall’ rate might suggest an alternate mechanism of target identification for the smRNAs in non-Arabidopsis species.
It has been shown earlier that in the absence of ‘real’ TN dataset precision and recall might prove to be a better measure for the tool evaluation [58–61]. Indeed, a ‘real’ TN dataset for miRNA-mRNA interaction has largely been missing. Still, we have performed ROC analysis with the negative dataset for miRNAs downloaded from a previously published study . ROC analysis clearly suggested that there was a stark difference in the performance of tools between Arabidopsis and non-Arabidopsis species, which is consistent with the precision and recall analysis (Additional file 5, Figures 4 and 6).
We investigated the possible reason for differences in the TP rates observed for the Arabidopsis and non-Arabidopsis species by considering (a) commonality in the prediction programs, and (b) characterizing features of interactions for the miRNAs that were falsely predicted to be negative. Several prediction programs use the hypothesis that low free energy is required for the formation of stable RNA-RNA duplex [33–35, 37, 40, 43, 47]. Due to the limited numbers of solved secondary structures of RNA duplexes, calculations of free energy mainly rely on modeling efforts . We observed that there was a negative association of free-energy with transcript length (Figure 7). This association could introduce bias in the analysis and indicates that universal cut-off scores might not work for all the transcripts of different lengths.
To characterize the attributes for the targets that were falsely predicted to be negative, four features for miRNA-mRNA targets in TPs and FN for Arabidopsis and non-Arabidopsis species were determined (Additional file 6, Figure 8A-C). With an exception of miRNA GC content, we observed that all the other features were significantly different between TP and FNs of the non-Arabidopsis species while no significant differences were observed between TP of Arabidopsis and non-Arabidopsis species. These observations, in addition to the relatively high values for the entropy (Figure 8D) in the FN datasets further suggest that for a sub-set of the miRNAs belonging to the non-Arabidopsis species, additional components of mechanism of the target recognition are likely to exist.
In this study we have evaluated several miRNA target prediction tools. We observed that the majority of the plant specific tools may be made to predict targets with a high specificity in the model organism, A. thaliana if the parameters of predictions are optimized. We further conclude that such optimized ‘scores’ of Arabidopsis may not be used as a threshold while analyzing non-model organism (i.e. non-Arabidopsis datasets); in addition, we have optimized the scores for non-Arabidopsis species. Based on our results of the evaluation of known interactions, Targetfinder alone or in combination with psRNATarget or Tapirhybrid for the miRNA target predictions provided the most satisfactory results. While analyzing the FN datasets, we noted that additional features of target recognition likely exists, which indicates towards possible novel modes of miRNA-mRNA target recognition in non-Arabidopsis plants.
SmRNA target prediction tools
Comparison of parameters used in the different miRNA target prediction algorithms
Target site accessibility
Only source code
Web server and source code
Web server and source code
Scan for matches and RNA hybrid
Only source code
Only web server
Support Vector Regression (SVR)
Web server and source code
Web server and source code
Web server and source code
Web server and source code
Targetscan 6.2 (Non- conserved)
Only source code
SmRNA-target datasets and genome-wide target prediction
A. thaliana is a widely used model flowering plant, for which, the majority of the tools have been developed. As expected, most miRNA-mRNA interactions have also been experimentally validated in A. thaliana. These experimentally validated interactions of miRNAs and mRNAs in A. thaliana were obtained from the Arabidopsis small RNA project (ASRP) database [63, 64], and by performing a literature search [3, 14, 36, 65]. A majority of these miRNAs were conserved in other species as reported in miRBase V18 [52, 53]. miRNA targets for non-conserved miRNAs (those not reported in A. thaliana) included interactions from O. sativa, Glycine max and V. vinifera; these were curated from the literature [66–71]. Non-redundant miRNA-mRNA interactions add up to 330 for Arabidopsis and 134 for the other plant species (Additional file 8). We have removed miRNAs miR414 and miR413 since they are marked as ambiguous in miRBase; the total number of interactions for other plant species is now 115 (Additional file 8). Thus, our study provides a useful resource by curating a list of experimentally validated miRNA-mRNA interactions in Arabidopsis as well as those only in species other than Arabidopsis. Mature miRNA sequences and target sequences for the Arabidopsis dataset were extracted from miRBase V18 [52, 53] and Phytozome V8.0 . We then evaluated how various computational tools performed in predicting miRNA targets when subjected to large-scale datasets (at genome scales) from Phytozome V8.0 . These datasets were used as inputs to various tools (at their default parameters) for in-silico miRNA target identification. The following metrics were evaluated: 1) execution time, and 2) the average of number of targets predicted per miRNA. Ubuntu version 12.04 (64 bit) was used as an operating system for the evaluation of selected tools on Intel® Core™ i7-2600 CPU with a clock rate of 3.40GHz and 16 GB RAM.
Performance of the tools was evaluated by estimating the parameters of ‘precision’ (TP/(TP + FP)) and ‘recall’ (TP/(TP + FN)) [72, 73]. True positives (TP) were defined as experimentally validated in-silico predictions; false negatives (FN) are the experimentally validated interactions but not predicted in-silico, and the false positives (FP) are defined as in-silico predictions that were not experimentally validated. Maximum TPR values were obtained at the following threshold: psRNATarget: 5 (Maximum expectation), Tapirfasta: 10 (Score), Tapirhybrid: 10 (Score), psRobot: 5 (Penalty score threshold), Target_Prediction: 4 (Penalty score), Targetfinder: 6.5 (Prediction Score), Target-align: 110 (Maximum Score), p-TAREF: 4 (SVR score), RNAhybrid: 1 (p-value). ‘Precision’ and ‘recall’ are inversely related to each other: choosing a score cut-off based on a high ‘precision’ value would provide accurate results with low sensitivity and vice-versa. Therefore, an optimal balance between these two parameters is required. In this study, we have defined an optimal score where the ‘precision’ and ‘recall’ values intersect.
ROC analysis was performed by estimating the parameters of ‘specificity’ (TN/(TN + FP)) and ‘sensitivity’ (TP/(TP + FN)) [72, 73]. True positives (TP) were defined as experimentally validated in-silico predictions; false negatives (FN) are the experimentally validated interactions but not predicted in-silico, and the false positives (FP) are defined as in-silico predictions that were not experimentally validated; true negative (TN) dataset (119 sequences) was obtained from a previous publication .
Combinations of prediction algorithms
To further improve the ‘precision’ and ‘recall’ rates, predictions obtained from the selected tools at their optimal scores were combined. Combinations of the tools were assessed by either taking a union or an intersection of the predictions.
Complementarity and free energy
Evaluation of the ‘FN’ dataset
Four features were evaluated to characterize FN predictions. (a) The GC content of miRNAs (percentage of GC in the miRNA sequence) was calculated for the TP and FN datasets in both Arabidopsis and non-Arabidopsis datasets, (b) the length of the miRNA seed region, in which, the miRNA seed region was defined as the longest stretch of complementary bases between miRNA and mRNA, (c) the first stretch of the stem region, where this feature was defined as length of the continuous matches from first matched base until a mismatched base was observed, and (d) the ratio of the total number matches and the total number of mismatches. Differences in alignments are summarized as a heatmap. The heatmaps are based on the alignment between TP miRNA-mRNA pairs using ‘fasta’ sequence alignment tool version 35 . It is possible that different miRNAs may have different lengths. To cope with the variability in the alignment lengths, we have considered the first 20 nt for clustering. Match, mismatch/gap and G:U wobble were given a score +1, −1, and 0.5 respectively. Clustering is based on Euclidean distance and dendrogram was computed using the complete clustering method. All analyses were performed using R 2.15.2 .
Funding from the Max Planck Society, Germany and the Department of Science and Technology, India in form of the MPG-India Partner Group Program, the CRP Wheat Program of CIMMYT and CGIAR and the startup grant of IISER-Kolkata supported the work in SPP lab. We also thank the Indo-German Science and Technology Center for help in execution of the MPG-India Partner Program. TRM is supported by a fellowship of Integrated MS-PhD program of IISER-Kolkata. We thank Avinash Sethi, Ravi Singh and other SPP lab members for technical help to TRM with the analysis, and Santosh Atanur and Claire Morgan of MRC clinical sciences centre, London for their critical comments on the manuscript. We are thankful to the anonymous reviewers for their comments and suggestions; this has considerably improved the manuscript.
- Jones-Rhoades MW, Bartel DP, Bartel B: MicroRNAS and their regulatory roles in plants. Annu Rev Plant Biol. 2006, 57: 19-53. 10.1146/annurev.arplant.57.032905.105218.PubMedView ArticleGoogle Scholar
- Pandey SP, Baldwin IT: RNA-directed RNA polymerase 1 (RdR1) mediates the resistance of Nicotiana attenuata to herbivore attack in nature. Plant J. 2007, 50 (1): 40-53. 10.1111/j.1365-313X.2007.03030.x.PubMedView ArticleGoogle Scholar
- Pandey SP, Gaquerel E, Gase K, Baldwin IT: RNA-directed RNA polymerase3 from Nicotiana attenuata is required for competitive growth in natural environments. Plant Physiol. 2008, 147 (3): 1212-1224. 10.1104/pp.108.121319.PubMed CentralPubMedView ArticleGoogle Scholar
- Pandey SP, Somssich IE: The role of WRKY transcription factors in plant immunity. Plant Physiol. 2009, 150 (4): 1648-1655. 10.1104/pp.109.138990.PubMed CentralPubMedView ArticleGoogle Scholar
- Pandey SP, Moturu TR, Pandey P: Roles of Small RNAs in Regulation of Signaling and Adaptive Responses in Plants. Recent Trends in Gene Expression. Edited by: Mandal SS. 2013, Hauppauge, NY: Nova Publishers, 107-132.Google Scholar
- Lu C, Tej SS, Luo S, Haudenschild CD, Meyers BC, Green PJ: Elucidation of the small RNA component of the transcriptome. Science. 2005, 309 (5740): 1567-1569. 10.1126/science.1114112.PubMedView ArticleGoogle Scholar
- Pandey SP, Shahi P, Gase K, Baldwin IT: Herbivory-induced changes in the small-RNA transcriptome and phytohormone signaling in Nicotiana attenuata. Proc Natl Acad Sci U S A. 2008, 105 (12): 4559-4564. 10.1073/pnas.0711363105.PubMed CentralPubMedView ArticleGoogle Scholar
- Eldem V, Celikkol Akcay U, Ozhuner E, Bakir Y, Uranbey S, Unver T: Genome-wide identification of miRNAs responsive to drought in peach (Prunus persica) by high-throughput deep sequencing. PLoS One. 2012, 7 (12): e50298-10.1371/journal.pone.0050298.PubMed CentralPubMedView ArticleGoogle Scholar
- Sun LM, Ai XY, Li WY, Guo WW, Deng XX, Hu CG, Zhang JZ: Identification and comparative profiling of miRNAs in an early flowering mutant of trifoliate orange and its wild type by genome-wide deep sequencing. PLoS One. 2012, 7 (8): e43760-10.1371/journal.pone.0043760.PubMed CentralPubMedView ArticleGoogle Scholar
- He L, Hannon GJ: MicroRNAs: small RNAs with a big role in gene regulation. Nat Rev Genet. 2004, 5 (7): 522-531. 10.1038/nrg1379.PubMedView ArticleGoogle Scholar
- Carthew RW, Sontheimer EJ: Origins and Mechanisms of miRNAs and siRNAs. Cell. 2009, 136 (4): 642-655. 10.1016/j.cell.2009.01.035.PubMed CentralPubMedView ArticleGoogle Scholar
- Rogers K, Chen X: Biogenesis, turnover, and mode of action of plant microRNAs. Plant Cell. 2013, 25 (7): 2383-2399. 10.1105/tpc.113.113159.PubMed CentralPubMedView ArticleGoogle Scholar
- Ameres SL, Zamore PD: Diversifying microRNA sequence and function. Nat Rev Mol Cell Biol. 2013, 14 (8): 475-488. 10.1038/nrm3611.PubMedView ArticleGoogle Scholar
- Beauclair L, Yu A, Bouche N: microRNA-directed cleavage and translational repression of the copper chaperone for superoxide dismutase mRNA in Arabidopsis. Plant J. 2010, 62 (3): 454-462. 10.1111/j.1365-313X.2010.04162.x.PubMedView ArticleGoogle Scholar
- Dugas DV, Bartel B: Sucrose induction of Arabidopsis miR398 represses two Cu/Zn superoxide dismutases. Plant Mol Biol. 2008, 67 (4): 403-417. 10.1007/s11103-008-9329-1.PubMedView ArticleGoogle Scholar
- Gandikota M, Birkenbihl RP, Hohmann S, Cardon GH, Saedler H, Huijser P: The miRNA156/157 recognition element in the 3’ UTR of the Arabidopsis SBP box gene SPL3 prevents early flowering by translational inhibition in seedlings. Plant J. 2007, 49 (4): 683-693. 10.1111/j.1365-313X.2006.02983.x.PubMedView ArticleGoogle Scholar
- Li S, Liu L, Zhuang X, Yu Y, Liu X, Cui X, Ji L, Pan Z, Cao X, Mo B, Zhang F, Raikhel N, Jiang L, Chen X: MicroRNAs inhibit the translation of target mRNAs on the endoplasmic reticulum in Arabidopsis. Cell. 2013, 153 (3): 562-574. 10.1016/j.cell.2013.04.005.PubMed CentralPubMedView ArticleGoogle Scholar
- German MA, Pillay M, Jeong DH, Hetawal A, Luo S, Janardhanan P, Kannan V, Rymarquis LA, Nobuta K, German R, De Paoli E, Lu C, Schroth G, Meyers BC, Green PJ: Global identification of microRNA-target RNA pairs by parallel analysis of RNA ends. Nat Biotechnol. 2008, 26 (8): 941-946. 10.1038/nbt1417.PubMedView ArticleGoogle Scholar
- Allen E, Xie Z, Gustafson AM, Carrington JC: microRNA-directed phasing during trans-acting siRNA biogenesis in plants. Cell. 2005, 121 (2): 207-221. 10.1016/j.cell.2005.04.004.PubMedView ArticleGoogle Scholar
- Allen RS, Millar AA: MicroRNAs in Plant Development and Stress Responses. Edited by: Sunkar R. 2012, Berlin, Heidelberg: Springer Berlin Heidelberg, 123-148. Genetic and Molecular Approaches to Assess MicroRNA Function, vol. 15View ArticleGoogle Scholar
- Llave C, Xie Z, Kasschau KD, Carrington JC: Cleavage of Scarecrow-like mRNA targets directed by a class of Arabidopsis miRNA. Science. 2002, 297 (5589): 2053-2056. 10.1126/science.1076311.PubMedView ArticleGoogle Scholar
- Reynoso MA, Blanco FA, Zanetti ME: Insights into post-transcriptional regulation during legume-rhizobia symbiosis. Plant Signal Behav. 2012, 8 (2): e23102-PubMed CentralPubMedView ArticleGoogle Scholar
- Voinnet O: Origin, biogenesis, and activity of plant microRNAs. Cell. 2009, 136 (4): 669-687. 10.1016/j.cell.2009.01.046.PubMedView ArticleGoogle Scholar
- Jones-Rhoades MW, Bartel DP: Computational identification of plant microRNAs and their targets, including a stress-induced miRNA. Mol Cell. 2004, 14 (6): 787-799. 10.1016/j.molcel.2004.05.027.PubMedView ArticleGoogle Scholar
- Wang XJ, Reyes JL, Chua NH, Gaasterland T: Prediction and identification of Arabidopsis thaliana microRNAs and their mRNA targets. Genome Biol. 2004, 5 (9): R65-10.1186/gb-2004-5-9-r65.PubMed CentralPubMedView ArticleGoogle Scholar
- Palatnik JF, Wollmann H, Schommer C, Schwab R, Boisbouvier J, Rodriguez R, Warthmann N, Allen E, Dezulian T, Huson D, Carrington JC, Weigel D: Sequence and expression differences underlie functional specialization of Arabidopsis microRNAs miR159 and miR319. Dev Cell. 2007, 13 (1): 115-125. 10.1016/j.devcel.2007.04.012.PubMedView ArticleGoogle Scholar
- Brodersen P, Sakvarelidze-Achard L, Bruun-Rasmussen M, Dunoyer P, Yamamoto YY, Sieburth L, Voinnet O: Widespread translational inhibition by plant miRNAs and siRNAs. Science. 2008, 320 (5880): 1185-1190. 10.1126/science.1159151.PubMedView ArticleGoogle Scholar
- Zhang C, Ng DW, Lu J, Chen ZJ: Roles of target site location and sequence complementarity in trans-acting siRNA formation in Arabidopsis. Plant J. 2012, 69 (2): 217-226. 10.1111/j.1365-313X.2011.04783.x.PubMedView ArticleGoogle Scholar
- Dai X, Zhuang Z, Zhao PX: Computational analysis of miRNA targets in plants: current status and challenges. Brief Bioinform. 2011, 12 (2): 115-121. 10.1093/bib/bbq065.PubMedView ArticleGoogle Scholar
- Ding J, Zhou S, Guan J: Finding microRNA targets in plants: current status and perspectives. Genomics Proteomics Bioinform. 2012, 10 (5): 264-275. 10.1016/j.gpb.2012.09.003.View ArticleGoogle Scholar
- Dsouza M, Larsen N, Overbeek R: Searching for patterns in genomic data. Trends Genet. 1997, 13 (12): 497-498.PubMedView ArticleGoogle Scholar
- Bonnet E, Wuyts J, Rouze P, Van de Peer Y: Detection of 91 potential conserved plant microRNAs in Arabidopsis thaliana and Oryza sativa identifies important target genes. Proc Natl Acad Sci U S A. 2004, 101 (31): 11511-11516. 10.1073/pnas.0404025101.PubMed CentralPubMedView ArticleGoogle Scholar
- Zhang Y: miRU: an automated plant miRNA target prediction server. Nucleic Acids Res. 2005, 33 (Web Server issue): W701-W704.PubMed CentralPubMedView ArticleGoogle Scholar
- Dai X, Zhao PX: psRNATarget: a plant small RNA target analysis server. Nucleic Acids Res. 2011, 39 (Web Server issue): W155-W159.PubMed CentralPubMedView ArticleGoogle Scholar
- Lorenz R, Bernhart SH, Honer Zu Siederdissen C, Tafer H, Flamm C, Stadler PF, Hofacker IL: ViennaRNA Package 2.0. Algorithms Mol Biol. 2011, 6 (1): 26-10.1186/1748-7188-6-26.PubMed CentralPubMedView ArticleGoogle Scholar
- Fahlgren N, Howell MD, Kasschau KD, Chapman EJ, Sullivan CM, Cumbie JS, Givan SA, Law TF, Grant SR, Dangl JL, Carrington JC: High-throughput sequencing of Arabidopsis microRNAs: evidence for frequent birth and death of MIRNA genes. PLoS One. 2007, 2 (2): e219-10.1371/journal.pone.0000219.PubMed CentralPubMedView ArticleGoogle Scholar
- Bonnet E, He Y, Billiau K, Van de Peer Y: TAPIR, a web server for the prediction of plant microRNA targets, including target mimics. Bioinformatics. 2010, 26 (12): 1566-1568. 10.1093/bioinformatics/btq233.PubMedView ArticleGoogle Scholar
- Xie F, Zhang B: Target-align: a tool for plant microRNA target identification. Bioinformatics. 2010, 26 (23): 3002-3003. 10.1093/bioinformatics/btq568.PubMedView ArticleGoogle Scholar
- Sun Y-H, Lu S, Shi R, Chiang V: Computational Prediction of Plant miRNA Targets. RNAi and Plant Gene Function Analysis. Edited by: Kodama H, Komamine A. 2011, Heidelberg: Springer Protocols, 744: 175-186.View ArticleGoogle Scholar
- Milev I, Yahubyan G, Minkov I, Baev V: miRTour: plant miRNA and target prediction tool. Bioinformation. 2011, 6 (6): 248-249. 10.6026/97320630006248.PubMed CentralPubMedView ArticleGoogle Scholar
- Ding J, Yu S, Ohler U, Guan J, Zhou S: imiRTP: An Integrated Method to Identifying miRNA-target Interactions in Arabidopsis thaliana. IEEE International Conference on Bioinformatics and Biomedicine: 2011. 2011, 100-104.View ArticleGoogle Scholar
- Jha A, Shankar R: Employing machine learning for reliable miRNA target identification in plants. BMC Genomics. 2011, 12 (1): 636-10.1186/1471-2164-12-636.PubMed CentralPubMedView ArticleGoogle Scholar
- Wu HJ, Ma YK, Chen T, Wang M, Wang XJ: PsRobot: a web-based plant small RNA meta-analysis toolbox. Nucleic Acids Res. 2012, 40 (Web Server issue): W22-W28.PubMed CentralPubMedView ArticleGoogle Scholar
- Garcia DM, Baek D, Shin C, Bell GW, Grimson A, Bartel DP: Weak seed-pairing stability and high target-site abundance decrease the proficiency of lsy-6 and other microRNAs. Nat Struct Mol Biol. 2011, 18 (10): 1139-1146. 10.1038/nsmb.2115.PubMed CentralPubMedView ArticleGoogle Scholar
- Grimson A, Farh KK, Johnston WK, Garrett-Engele P, Lim LP, Bartel DP: MicroRNA targeting specificity in mammals: determinants beyond seed pairing. Mol Cell. 2007, 27 (1): 91-105. 10.1016/j.molcel.2007.06.017.PubMed CentralPubMedView ArticleGoogle Scholar
- Enright AJ, John B, Gaul U, Tuschl T, Sander C, Marks DS: MicroRNA targets in Drosophila. Genome Biol. 2003, 5 (1): R1-10.1186/gb-2003-5-1-r1.PubMed CentralPubMedView ArticleGoogle Scholar
- Kruger J, Rehmsmeier M: RNAhybrid: microRNA target prediction easy, fast and flexible. Nucleic Acids Res. 2006, 34 (Web Server issue): W451-W454.PubMed CentralPubMedView ArticleGoogle Scholar
- Rehmsmeier M, Steffen P, Hochsmann M, Giegerich R: Fast and effective prediction of microRNA/target duplexes. RNA. 2004, 10 (10): 1507-1517. 10.1261/rna.5248604.PubMed CentralPubMedView ArticleGoogle Scholar
- Xie FL, Huang SQ, Guo K, Xiang AL, Zhu YY, Nie L, Yang ZM: Computational identification of novel microRNAs and targets in Brassica napus. FEBS Lett. 2007, 581 (7): 1464-1474. 10.1016/j.febslet.2007.02.074.PubMedView ArticleGoogle Scholar
- Ossowski S, Schwab R, Weigel D: Gene silencing in plants using artificial microRNAs and other small RNAs. Plant J. 2008, 53 (4): 674-690. 10.1111/j.1365-313X.2007.03328.x.PubMedView ArticleGoogle Scholar
- Li F, Orban R, Baker B: SoMART: a web server for plant miRNA, tasiRNA and target gene analysis. Plant J. 2012, 70 (5): 891-901. 10.1111/j.1365-313X.2012.04922.x.PubMedView ArticleGoogle Scholar
- Griffiths-Jones S, Saini HK, Van Dongen S, Enright AJ: miRBase: tools for microRNA genomics. Nucleic Acids Res. 2008, 36 (Database issue): D154-D158.PubMed CentralPubMedGoogle Scholar
- Kozomara A, Griffiths-Jones S: miRBase: integrating microRNA annotation and deep-sequencing data. Nucleic Acids Res. 2011, 39 (Database issue): D152-D157.PubMed CentralPubMedView ArticleGoogle Scholar
- Goodstein DM, Shu S, Howson R, Neupane R, Hayes RD, Fazo J, Mitros T, Dirks W, Hellsten U, Putnam N, Rokhsar DS: Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 2012, 40 (Database issue): D1178-D1186.PubMed CentralPubMedView ArticleGoogle Scholar
- Rajewsky N: microRNA target predictions in animals. Nat Genet. 2006, 38 (Suppl): S8-S13.PubMedView ArticleGoogle Scholar
- Adai A, Johnson C, Mlotshwa S, Archer-Evans S, Manocha V, Vance V, Sundaresan V: Computational prediction of miRNAs in Arabidopsis thaliana. Genome Res. 2005, 15 (1): 78-91. 10.1101/gr.2908205.PubMed CentralPubMedView ArticleGoogle Scholar
- Bartel DP: MicroRNAs: target recognition and regulatory functions. Cell. 2009, 136 (2): 215-233. 10.1016/j.cell.2009.01.002.PubMed CentralPubMedView ArticleGoogle Scholar
- Sohn S, Comeau DC, Kim W, Wilbur WJ: Abbreviation definition identification based on automatic precision estimates. BMC Bioinforma. 2008, 9: 402-10.1186/1471-2105-9-402.View ArticleGoogle Scholar
- Monastyrskyy B, D’Andrea D, Fidelis K, Tramontano A, Kryshtafovych A: Evaluation of residue-residue contact prediction in CASP10. Proteins. 2014, 82 (Suppl 2): 138-153.PubMed CentralPubMedView ArticleGoogle Scholar
- Krzyzanowski PM, Andrade-Navarro MA: Identification of novel stem cell markers using gap analysis of gene expression data. Genome Biol. 2007, 8 (9): R193-10.1186/gb-2007-8-9-r193.PubMed CentralPubMedView ArticleGoogle Scholar
- Huang YJ, Powers R, Montelione GT: Protein NMR recall, precision, and F-measure scores (RPF scores): structure quality assessment measures based on information retrieval statistics. J Am Chem Soc. 2005, 127 (6): 1665-1674. 10.1021/ja047109h.PubMedView ArticleGoogle Scholar
- Eddy SR: How do RNA folding algorithms work?. Nat Biotechnol. 2004, 22 (11): 1457-1458. 10.1038/nbt1104-1457.PubMedView ArticleGoogle Scholar
- Backman TW, Sullivan CM, Cumbie JS, Miller ZA, Chapman EJ, Fahlgren N, Givan SA, Carrington JC, Kasschau KD: Update of ASRP: the Arabidopsis Small RNA Project database. Nucleic Acids Res. 2008, 36 (Database issue): D982-D985.PubMed CentralPubMedGoogle Scholar
- Gustafson AM, Allen E, Givan S, Smith D, Carrington JC, Kasschau KD: ASRP: the Arabidopsis Small RNA Project Database. Nucleic Acids Res. 2005, 33 (Database issue): D637-D640.PubMed CentralPubMedView ArticleGoogle Scholar
- Hewezi T, Maier TR, Nettleton D, Baum TJ: The Arabidopsis microRNA396-GRF1/GRF3 regulatory module acts as a developmental regulator in the reprogramming of root cells during cyst nematode infection. Plant Physiol. 2012, 159 (1): 321-335. 10.1104/pp.112.193649.PubMed CentralPubMedView ArticleGoogle Scholar
- Li YF, Zheng Y, Addo-Quaye C, Zhang L, Saini A, Jagadeeswaran G, Axtell MJ, Zhang W, Sunkar R: Transcriptome-wide identification of microRNA targets in rice. Plant J. 2010, 62 (5): 742-759. 10.1111/j.1365-313X.2010.04187.x.PubMedView ArticleGoogle Scholar
- Macovei A, Tuteja N: microRNAs targeting DEAD-box helicases are involved in salinity stress response in rice (Oryza sativa L.). BMC Plant Biol. 2012, 12: 183-10.1186/1471-2229-12-183.PubMed CentralPubMedView ArticleGoogle Scholar
- Pantaleo V, Szittya G, Moxon S, Miozzi L, Moulton V, Dalmay T, Burgyan J: Identification of grapevine microRNAs and their targets using high-throughput sequencing and degradome analysis. Plant J. 2010, 62 (6): 960-976.PubMedGoogle Scholar
- Shamimuzzaman M, Vodkin L: Identification of soybean seed developmental stage-specific and tissue-specific miRNA targets by degradome sequencing. BMC Genomics. 2012, 13 (1): 310-10.1186/1471-2164-13-310.PubMed CentralPubMedView ArticleGoogle Scholar
- Zhang B, Pan X, Stellwag EJ: Identification of soybean microRNAs and their targets. Planta. 2008, 229 (1): 161-182. 10.1007/s00425-008-0818-x.PubMedView ArticleGoogle Scholar
- Zhou M, Gu L, Li P, Song X, Wei L, Chen Z, Cao X: Degradome sequencing reveals endogenous small RNA targets in rice (Oryza sativa L. ssp. indica). Front Biol. 2010, 5 (1): 67-90. 10.1007/s11515-010-0007-8.View ArticleGoogle Scholar
- Hu LL, Huang Y, Wang QC, Zou Q, Jiang Y: Benchmark comparison of ab initio microRNA identification methods and software. Genet Mol Res. 2012, 11 (4): 4525-4538. 10.4238/2012.October.17.4.PubMedView ArticleGoogle Scholar
- Radivojac P, Clark WT, Oron TR, Schnoes AM, Wittkop T, Sokolov A, Graim K, Funk C, Verspoor K, Ben-Hur A, Pandey G, Yunes JM, Talwalkar AS, Repo S, Souza ML, Piovesan D, Casadio R, Wang Z, Cheng J, Fang H, Gough J, Koskinen P, Törönen P, Nokso-Koivisto J, Holm L, Cozzetto D, Buchan DW, Bryson K, Jones DT, Limaye B, et al: A large-scale evaluation of computational protein function prediction. Nat Methods. 2013, 10 (3): 221-227. 10.1038/nmeth.2340.PubMed CentralPubMedView ArticleGoogle Scholar
- Lipman DJ, Pearson WR: Rapid and sensitive protein similarity searches. Science. 1985, 227 (4693): 1435-1441. 10.1126/science.2983426.PubMedView ArticleGoogle Scholar
- Team RC: R: A Language and Environment for Statistical Computing. 2013, Vienna, Austria: R Foundation for Statistical ComputingGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.