Insight into microRNA regulation by analyzing the characteristics of their targets in humans
© Hu; licensee BioMed Central Ltd. 2009
Received: 14 September 2009
Accepted: 10 December 2009
Published: 10 December 2009
microRNAs (miRNAs) are believed to regulate their targets through posttranscriptional gene regulation and have the potential to silence gene expression via multiple mechanisms. Despite previous advances on miRNA regulation of gene expression, little has been investigated from a genome scale.
To gain new insight into miRNA regulation in humans, we used large scale data and carried out a series of studies to compare various features of miRNA target genes to that of non-miRNA target genes. We observed significant differences between miRNA and non-miRNA target genes for a number of characteristics, including higher and broader mRNA expression, faster mRNA decay rate, longer protein half-life, and longer gene structures. Based on these features and by analyzing their relationships we found that miRNA target genes, other than having miRNA repression, were most likely under more complex regulation than non-miRNA target genes, which was evidenced by their higher and broader gene expression but longer gene structures. Our results of higher and broader gene expression but fast mRNA decay rates also provide evidence that miRNA dampening of the output of preexisting transcripts facilitates a more rapid and robust transition to new expression programs. This could be achieved by enhancing mRNA degradation through an additive effect from multiple miRNA targeting.
Genome-scale analysis on the nature of miRNA target genes has revealed a general mechanism for miRNA regulation of human gene expression. The results of this study also indicate that miRNA target genes, other than having miRNA repression, are under more complex gene regulation than non-miRNA target genes. These findings provide novel insight into miRNA regulation of human gene expression.
miRNAs, which were first discovered in Caenorhabditis elegans as post-transcriptional regulators of genes involved in developmental timing [1, 2], are small non-coding RNAs of ~23 nucleotides. They are now recognized as one of the major regulatory gene families, playing important roles in almost every cellular process in animals, plants and viruses [3–5]. In animals, this includes regulation of developmental timing and signaling pathways, apoptosis, metabolism, myogenesis and cardiogenesis, brain development , and human pathologies [6–8].
Although the mechanism by which miRNAs regulate gene expression remains under debate, miRNAs mainly mediate gene regulation post-transcriptionally via translational repression and reduction of mRNA stability by forming miRNA-mRNA pairs to their target genes. In vertebrates, most miRNAs pair imperfectly with the 3' untranslated regions (3'UTRs) of their targets, with a contiguous and perfect base pairing of the miRNA nucleotides 2-7 at the 'seed' region, providing pairing specificity [9–11]. Whereas miRNAs repress translation of target mRNAs by inhibiting translation initiation [12–14], blocking translation elongation [15, 16], or promoting premature dissociation of ribosomes , they induce significant degradation of mRNA targets by mRNA deadenylation, decapping, and 5' → 3' exonucleolytic degradation. The latter have been widely demonstrated from animal studies [17, 18], cultured cells [19–21], and microarray analysis [19, 20, 22, 23]. Contradicting these mechanisms, it has been recently discovered that miRNA have the potential to activate translation under certain conditions [24–26] and the ability to switch from translational repression to translational activation in cell-cycle-arrested cells [27–29].
It has been found that individual miRNA can mildly down-regulate hundreds of targets by direct or indirect effects, providing a mechanism of fine-tuning for gene expression. The first demonstration of this came from a microarray study in which introduction of an exogenous miRNA into human HeLa cells downregulated a large number of target mRNAs . Consistently, depletion of the miRNA machinery proteins destabilized around 20% of transcripts expressed in Drosophila. In Zebrafish, miR-430 was found to facilitate the deadenylation and clearance of several hundred target mRNAs, most of which were maternally expressed and accumulated in the absence of miR-430 . In another study that employed a proteomic approach to measure changes in the synthesis of proteins in response to miRNA transfection or endogenous miRNA knockdown, a single miRNA could repress the production of hundreds of proteins, but the repression was relatively mild . It has also been demonstrated that some miRNAs dampen the output of preexisting but unwanted transcripts to facilitate a more rapid and robust transition to new expression programs, which help maintain and define cell types [32–34]
Recent studies indicate that there exists more than 800 known mammalian miRNA genes [35–37] that are conserved throughout evolution with constitutive or spatially and temporally regulated expression. Computational analyses suggest that a single transcript may be regulated by multiple miRNAs  and that each miRNA can target tens to hundreds of transcripts [22, 31], leading to the conclusion that miRNAs as a whole regulate the expression of at least 30% of human gene transcripts . These discoveries show that miRNAs and their targets are part of complex regulatory networks, for which a number of studies have been performed to reveal the effects of specific miRNAs on temporal and spacial expression of their target genes, and hence the functions of miRNAs. The strategy employed in these studies is to identify the relationship between the expression of a miRNA and its targets by either overexpressing or silencing the miRNA [19, 40, 41].
Despite all these studies, little is known about miRNA regulation from a genome scale. Accordingly, to gain new insight into miRNA mediated gene regulation in humans, we carried out a series of studies to compare various features of miRNA target genes to that of non-miRNA target genes. These included the difference of gene expression from 79 human tissues , mRNA  and protein  stability, the influence of miRNA binding sites on mRNA degradation, and gene structures. Based on these characteristics of miRNA target genes and by analyzing the relationships between these features, principles of miRNA mediated gene regulation were investigated. To the best of our knowledge, this is the first use of large scale data from different levels to study the mechanism of miRNA regulation in humans, and the findings will therefore help provide new insight into miRNA regulation of human gene expression.
We have found significant differences between miRNA and non-miRNA target genes for a number of characteristics, including higher and broader mRNA expression, faster mRNA decay rate, longer protein half-life, and longer gene structures. We also found that miRNA target genes, other than having miRNA repression, were most likely under more complex regulation than non-miRNA target genes. The results also suggest that higher gene expression and longer length of miRNA target genes may be the consequence of genomic design for regulatory complexity, but it is not the result of lacking "selection for economy", for which highly and broadly expressed genes are compact [45, 46]. The higher and broader gene expression but fast mRNA decay rates also suggests that miRNA dampening of the output of preexisting but unwanted transcripts to facilitate transition to new expression programs [32–34] is a general mechanism for miRNA regulation of human gene expression. This can be achieved by enhancing mRNA degradation through an additive effect from multiple miRNA targeting.
miRNA target genes have higher and broader expression in human tissues
As a first step to investigate miRNA regulation, we used published gene expression data from the GNF Atlas2 gene expression database from 79 human tissues (gnfAtlas2)  to explore both the absolute expression and breadth of expression (See Materials and Methods) differences between non-miRNA and miRNA target genes predicted from the most popular algorithms of TargetScanS , PicTar , and RNA22 . While both TargetScanS and PicTar predict mammalian miRNA targets based on sequence complementarity, evolutionary conservation, and binding energy, they were reported to have high fidelity for target prediction from biological and informatic validation [11, 47]. On the other hand, RNA22, rather than using cross-species conservation as the major component for prediction, employs a pattern-based approach for the identification of miRNA binding sites in the sequence of interest. Therefore, the use of miRNA target genes predicted from RNA22 allows one to avoid potential source of bias due to genes with preferential 3'UTR cross-species conservation.
For gene expression breadth, we compared the fraction of miRNA target genes restrictively expressed in certain number of tissues to that of non-miRNA target genes. In this analysis, a gene was considered as expressed if its signal exceeded a threshold of 200 arbitrary units according to Su et al . The results indicate that miRNA target genes have wider gene expression breadth compared to that of non-miRNA target genes in 8 ten-tissue bins as shown in Figure 1b, where the fraction of miRNA target genes are, in general, less than the fraction of non-miRNA target genes for genes restrictively expressed in the first 5 tissue bins. However, this is reversed in the last 2 tissue bins where the fraction of miRNA target genes is greater than the fraction of non-miRNA target genes. Comparing the median fraction distribution in each tissue bin, we observed statistically significant differences for the first 3 (one-side Wilcoxon signed rank test p < 0.03) and the last 2 (one-side Wilcoxon signed rank test p < 0.005) bins, further confirming the characteristics of wide expression breadth for miRNA target genes. Similar results were also obtained from two other miRNA target gene sets (Additional file 2).
We next asked whether any other factors might contribute to these observed gene expression differences between miRNA and non-miRNA target genes. Accordingly, since transcription factors and promoter structures are involved in transcriptional regulation of gene expression, we compared promoters of these two gene groups for GC contents and transcription factor binding sites (TFBSs) (See Materials and Methods). We found that the promoter sequences of miRNA target genes were more GC- and TFBS-enriched when compared to the promoter sequences of non-miRNA target genes. The median GC percentage for promoter sequences of miRNA target genes is ~56%, which is significantly larger (one-side Wilcoxon rank sum test p < 10-200) than the median percentage of ~52% from non-miRNA target genes. Furthermore, the median TFBS number per promoter is 43 for miRNA target genes, which is also significantly larger (one-side Wilcoxon rank sum test p < 10-200) than the median TFBS number of 41 from non-miRNA target genes. These results indicate that the gene expression differences between miRNA and non-miRNA target genes may be partially due to transcriptional regulation, a topic that is worthy of further investigation.
miRNA target genes are less compact
The above findings raise an interesting question why miRNA target genes are highly and widely expressed, as miRNA target genes are down-regulated by miRNAs. Previous reports have shown that highly and broadly expressed genes are shorter in both their intronic and coding sequences than genes expressed at low level or in a few tissues as a results of selection for economy [46, 50–52], since transcription and translation are costly. We therefore performed analysis to see if the structures of miRNA target genes were more compact than non-miRNA target genes.
miRNA target genes are less compact
miRNA target genes
Non-miRNA target genes
n = 6444
n = 12428
Coding sequences length
2009 ± 26
1575 ± 14
1730 ± 19
1251 ± 11
274 ± 3
197 ± 2
3989 ± 33
2705 ± 19
Total intron length
83699 ± 1890
42275 ± 859
Number of introns
11.6 ± 0.14
8.5 ± 0.01
n = 6545
n = 12327
Coding sequences length
1771 ± 22
1264 ± 13
1688 ± 18
965 ± 12
271 ± 3
198 ± 2
3871 ± 32
2757 ± 20
Total intron length
80741 ± 1792
43507 ± 910
Number of introns
11.1 ± 0.14
8.5 ± 0.09
n = 9055
n = 9817
Coding sequences length
1833 ± 20
1622 ± 16
1415 ± 14
1032 ± 14
231 ± 2
216 ± 3
3455 ± 26
2856 ± 23
Total intron length
61231 ± 1201
51983 ± 1250
Number of introns
11.6 ± 0.1
8.8 ± 0.1
The result that miRNA target genes have longer 3'UTR sequences is not surprising, as miRNAs perform their regulation roles mainly by pairing with 3'UTR of target genes. However, the finding that other gene structures of miRNA target genes are also longer is contradicted to the "selection for economy", for which highly and broadly expressed genes are compact [45, 46]. Previous studies indicated that longer gene structures were most likely linked to gene regulation, such as splicing regulation and chromatin-mediated gene suppression from introns [46, 54]. The result therefore indicates that miRNA target genes, other than having miRNA repression, are subject to more complex regulation [32–34, 52, 54].
miRNA target transcripts are less stable but not their protein products
It is widely accepted that miRNAs mediate gene regulation by reducing the stability of their target transcripts. Previous studies from manipulating individual miRNAs revealed that introducing an alien miRNA into human HeLa cells down-regulated a large number of mRNAs  and that inhibiting miR-122 function in mouse liver led to the increase of stability for hundreds of mRNAs . To investigate if mRNA stability of miRNA target genes differs from that of non-miRNA target genes on the genome scale, we compared the mRNA decay rates between these two gene groups using data from Yang et al. , which contains mRNA decay rates for ~5550 human transcripts, representing ~4250 unique genes. Out of the 4250 genes, 2148, 2020, and 2248 were mapped to miRNA target genes predicted from TargetScanS, PicTar, and RNA22, respectively.
As an extension of selection for economy for genes, one would expect that genes with a lower mRNA decay rate should be more stable for their protein products. Therefore, we asked if a similar difference existed for their protein products, although protein stability is not likely under miRNA regulation. To address this question, we compared protein stability of miRNA target genes to that of non-miRNA target genes using data from Yen et al. , which contains half-life protein stability measures represented by protein stability indices for ~6500 unique genes. Out of the 6500 genes, we were able to map 2417, 2502, and 3366 to miRNA target genes predicted from TargetScanS, PicTar, and RNA22, respectively. Contrary to the selection for economy for genes, the results indicate that proteins of miRNA target genes are more stable than those of non-miRNA target genes as shown in Figure 2b, where the median protein stability indices from miRNA target genes are larger than those from non-miRNA target genes. Further statistical analyses using one-side Wilcoxon rank sum tests revealed significant differences between miRNA and non-miRNA target genes, with p-values of 6.4 × 10-10, 6.3 × 10-6, and 9 × 10-4 for the comparisons using miRNA target genes predicted from TargetScanS, and PicTar, and RNA22, respectively.
miRNA target gene expression is correlated with mRNA and protein stability
The above discovered characteristics of higher and broader mRNA expression, faster mRNA decay rate, and longer protein half life for miRNA target genes raise an important question as to whether or not they are related to each other. If these characteristics associate with miRNA target genes and are dependent on each other, then they are expected to correlate from the same set of miRNA target genes. This should be especially true for mRNA expression and decay rate, as both are related to miRNA targets. To address this question, we first mapped miRNA target genes from mRNA expression, mRNA decay rate, and protein stability index datasets to each other. We then computed the correlation using the paired values from the overlapping genes.
Although we observed negative correlations between mRNA decay rate and protein stability (R: between -0.027 and -0.09), these correlations are not statistically significant across all three miRNA target gene sets (Spearman's rank correlation p: between 0.4 and 7 × 10-4), suggesting that mRNA decay rate and protein stability are most likely independent. Similar correlation results between mRNA decay rate, mRNA expression, and protein stability were obtained from TargetScanS and RNA22 predicted miRNA target genes (Additional files 3, 4, 5, and 6).
Taken together, the significant and inverse correlation between mRNA expression and mRNA stability but positive correlation to protein stability and the higher and broader gene expression but fast mRNA decay rates for miRNA target genes suggest that miRNA dampening the output of preexisting but unwanted transcripts to facilitate transition to new expression programs [32–34] is a general mechanism for miRNA regulation of gene expression.
miRNA binding sites have an additive effect on mRNA stability
As a first step to understand how miRNAs facilitate transition of miRNA target genes to new expression programs, we sought to determine if the number of miRNA binding sites had any effect on the level of mRNA degradation. Accordingly, we performed analyses to investigate the influence of the number of miRNA binding sites on mRNA stability by examining the relationship between the number of miRNA binding sites and the level of mRNA decay rate in miRNA target genes.
We performed the following 4 analyses to verify that the observed additive effect of miRNA binding sites on mRNA decay rates was indeed a property for miRNA regulation. In the first analysis, we assessed the robustness of the changing trends for the additive effect of miRNA binding sites on mRNA decay rate by permutation tests (See Methods). No changing trend was obtained from any of 10,000 random datasets as shown in Figure 5b and Figure 5d, where the mRNA decay rate or miRNA binding sites from the 10,000 random datasets were displayed along with the corresponding miRNA binding site or mRNA decay rate groups.
We next performed analysis to reveal whether the additive effect was contributed from biologically relevant miRNA binding sites, or rather if it came from other factors. Accordingly, we first created 3 random datasets by shuffling the 3'UTR sequences of the 2248 RNA22 predicted miRNA target genes which had mRNA decay rate data. We then used the RNA22 algorithm to predict miRNA binding sites for individual genes in these random datasets (See Materials and Methods). We subsequently employed the rank test to assess the relationship between the number of miRNA binding sites from the 3'UTR sequences of these random datasets and the level of mRNA decay rate in corresponding genes. Again, no positive correlations were observed between miRNA decay rate and the number of miRNA binding sites from any of the three random datasets as also shown in Figure 5b and Figure 5d, where the results from the complete nucleotide mixing dataset are depicted.
miRNAs regulate gene expression post-transcriptionally by base-pairing to target mRNAs mainly in 3'UTRs. It has been demonstrated that genes preferentially coexpressed with miRNAs have evolved to avoid targeting by that miRNA, a phenomenon referred to as antitargets. Genes of this kind are able to escape from miRNA targeting by site depletion , for which genes have either short UTRs or low target site densities. One of the examples to avoid miRNA regulation is the housekeeping genes which have not only shorter 3'UTRs than other transcripts but also lower target site density. Further evidence for antitargets came from correlation study between "seed" sequences and the expression of several highly tissue-specific human miRNAs, which specifically down-regulate a large number of mRNA targets. The down-regulation was correlated to the presence of miRNA seed sequence in 3'UTRs of target mRNAs .
Since miRNAs can down-regulate target mRNA, one would expect that miRNA target genes should be expressed at lower levels than non-miRNA target genes. We however found significantly higher mRNA expression for miRNA target genes from different human tissues, when compared to non-miRNA target genes. The high mRNA expression of miRNA target genes might be partly due to their promoter sequences, which generally have more TFBSs and are enriched in GC nucleotides, a topic that is worthy of further investigating. On the other hand, we observed that miRNA target genes had high mRNA decay rates but were more stable for their protein products. Further analyses indicated that mRNA expression of miRNA target genes was significantly and inversely correlated with mRNA stability but positively correlated with protein stability. The highly expressed mRNA from miRNA target genes is subjected to more rapid degradation but the protein products are more stable. These observations raise an interesting question as to why mRNA of miRNA target genes are highly transcripted and then degraded before reaching the final protein products, as the transcriptional process is costly.
One explanation for the above phenomenon is that mRNA of miRNA target genes are generally subjected to tuning, which is one way that miRNAs regulate their targets based on a micromanager model . In this model, the miRNA targets were classified into a few categories, with one class comprising of switch targets with multiple target sites presenting in the 3'UTR. With the onset of new developmental stage or other cellular changes these miRNA targets take advantage of the miRNAs to dampen gene products to inconsequential levels so that new expression programs can be achieved. This post-transcriptional control could be more responsive than transcriptional control in terms of both speed and reversibility. One of the examples came from the study for zebrafish, in which miR-430 directly regulated several hundred targets by promoting deadenylation and clearance of mRNAs. Most of these mRNAs were maternally expressed and accumulated in the absence of miR-430 but were degraded during early embryogenesis . In agreement with others' discoveries from individual genes, our findings from genome scale analyses on the characteristics of miRNA target genes also suggest that in human miRNA dampening of the preexisting transcript outputs could be a general mechanism for miRNA regulation. Furthermore, miRNA target genes are most likely under more complex regulation than non-miRNA target genes.
Supporting the complex regulation notion, miRNA target genes in humans are significantly longer than non-miRNA target genes in various parts of gene structures. The longer gene structures but higher gene expression for miRNA target genes seems wasteful based on "selection for economy" for genes [45, 46, 51], for which the highly and broadly expressed genes are shorter both in their intronic and coding sequences than genes expressed in a narrow and low fashion, as transcription and translation are energetically costly. Considering the enhanced mRNA decay rate for miRNA target genes, the paradox against the selection for economy is even more apparent. The results from this study therefore suggest that evolution may not necessarily optimize efficiency but impose specific roles for miRNA target genes such as more subtle and complex gene regulation based on a "genome design" hypothesis . This hypothesis suggests that the length of gene structures is determined by their function and that longer gene structures are involved in more complex regulation such as chromatin-mediated suppression of gene expression.
In this study, we observed significant correlations between mRNA decay rate and the number of miRNA binding sites for miRNA target genes. We found that miRNA binding sites had an additive effect on mRNA decay rate, for which genes with more miRNA binding sites displayed higher mRNA decay rate. These additive effects come mainly from different miRNAs, since only a small fraction of all predicted targets contain more than one miRNA binding site for any single miRNA . While the number of miRNA binding sites for miRNA target genes used in this study ranges from 1 to 173, with average number > 8, more than 82% of genes have 2 or more miRNA binding sites, indicating that majority of miRNA target genes are under multiple miRNA regulation. It is suggested that the main reason for having multiple miRNA targeting on individual genes is that these genes can be regulated in a variety of conditions such as developmental timing and within various tissues. Results from this study revealed that multiple targeting on a gene could also act together to provide enhanced mRNA decay rate, and hence stronger gene repression, which could contribute to facilitate the transition of miRNA target genes to new expression programs.
To gain new insight into miRNA regulation in humans, we have used large scale data and carried out a series of studies to compare various features of miRNA target genes to that of non-miRNA target genes. We have demonstrated that miRNA and non-miRNA target genes are significantly different for a number of characteristics, including higher and broader mRNA expression, faster mRNA decay rate, longer protein half-life, and longer gene structures. We have found that miRNA target genes, other than having miRNA repression, are most likely under more complex regulation than non-miRNA target genes. We have also revealed that miRNA dampening the output of preexisting but unwanted transcripts to facilitate transition to new expression programs is a general mechanism for miRNA regulation of human gene expression. This could be achieved by enhancing mRNA degradation through an additive effect from multiple miRNA targeting.
Compilation of human and miRNA target genes
Human genes (hg18 March 2006 assembly) with Reference Sequences (RefSeqs) and their corresponding structure parameters, including the number of introns, the length of coding sequences (CDSs), introns, 3'UTRs, and 5'UTRs, were extracted from UCSC Genome browser http://genome.ucsc.edu/. To allow comparisons between different data sources, the RefSeqs IDs were mapped to the corresponding Gene IDs from NCBI http://www.ncbi.nlm.nih.gov/ using gene2refseq. This process resulted in ~27600 RefSeqs, which represent ~18870 unique genes, with structure parameters.
Three miRNA target gene datasets predicted from algorithms of PicTar , TargetScanS , and RNA22  were used. miRNA target genes from PicTar and TargetScanS were obtained from miRGen database , which contains 7034 and 6995 unique target genes for PicTar and TargetScanS, respectively. For PicTar, the result based on conservation in human, chimp, mouse, and rat was chosen. For TargetScanS, the predictions using positions on UTRs (without gaps) and 4-way UTR multiple sequence alignments were used. miRNA target genes predicted from RNA22 were obtained from RNA22 web server http://cbcsrv.watson.ibm.com/rna22.html, which contains ~21650 Ensemble transcripts of miRNA targets. To make them comparable to other data sources, these Ensemble transcripts, which represent ~9240 unique genes, were mapped to Gene IDs based on Biomart http://www.biomart.org/. The number of miRNA target sites from multiple miRNAs for individual genes was also computed for each dataset. For a given gene with multiple predicted transcripts from RNA22, the transcript with the most miRNA binding sites was chosen. We also employed miRNA target genes common between TargetScanS and PicTar for all analyses performed in this study. Results (data not shown) similar to the miRNA target genes predicted from the three individual algorithms were obtained.
For each miRNA target gene dataset, the ~18870 unique human genes were first separated into two groups of miRNA and non-miRNA target genes and their structure parameters were then compared. In the case of a gene with alternative-splicing variants the one having the largest gene structure parameter was used. Significant differences for the median values between miRNA and non-miRNA target genes were determined by Wilcoxon rank sum test.
Human gene expression data and expression comparison
The GNF Atlas2 gene expression database (gnfAtlas2)  was used for the gene expression comparisons between miRNA and non-miRNA target genes. This dataset, which contains ~16920 unique genes whose mRNA expression was determined in 79 human tissues, cell types, and cancer lines has been widely used for various gene expression studies. The expression signals for a given gene with different RefSeq IDs in the same tissue were first averaged. miRNA target genes derived from TargetScanS, PicTar, and RNA22 were then mapped to the ~16920 unique genes based on their gene IDs. Out of 5166 to 9235 miRNA target genes, 6245 from TargetScanS, 6300 from PicTar, and 8276 from RNA22 were in the expression data gene list. The median expression levels between miRNA and non-miRNA target genes were compared, and the statistical significances were evaluated using Wilcoxon rank sum test.
For the comparison of gene expression breadth, expressed miRNA and non-miRNA target genes were first extracted from the dataset. A gene is considered to be expressed if its expression intensity exceeds a threshold of 200 arbitrary units according to Su et al . The resulting miRNA and non-miRNA target genes were then separated into different bins based on their expression breadth across 79 tissues. A total of 8 bins were defined with increment of 10 tissues for the first 7 bins and increment of 9 tissues for the 8th bin. Therefore, each bin contains genes expressed in its specified tissue numbers. For example, the first bin contains genes expressed in 1 to 10 tissues and the 8th bin contains genes expressed in 71 to 79 tissues. Comparisons between miRNA and non-miRNA target genes were performed within each bin, in which each of the 10 (9) fractions of miRNA target genes out of all expressed miRNA target genes was compared to that of non-miRNA target genes. The statistical significance between miRNA and non-miRNA target genes in each bin was determined by Wilcoxon signed rank test, which evaluates the difference for the 10 paired fractions of miRNA and non-miRNA target genes.
GC content and TFBSs comparisons
Promoter sequences within 1-kb upstream of transcriptional starting sites for both miRNA and non-miRNA target genes were extracted from the UCSC Genome browser (hg18 March 2006 assembly). The percentage of GC nucleotides in each promoter sequence was computed using an in-house developed PERL script, and the differences of the median values between miRNA and non-miRNA target genes were evaluated by Wilcoxon rank sum test. For TFBS comparison, the Match® program, for which the profile parameter was set to "minimize the false positives", was employed to conduct searches for TFBSs using 214 non-redundant vertebrate position weight matrices from the professional TRANSFAC11.4 database . The number of TFBSs on each promoter sequence was computed using an in-house developed PERL script, and the differences of the median values between miRNA and non-miRNA target genes were evaluated by Wilcoxon rank sum test.
Compilation of mRNA decay rate and protein stability data
mRNA decay rate data were obtained from Yang et al. , which contains mRNA decay rates for ~5550 human transcripts. To allow comparisons between different data sources, the Accession IDs were first mapped to the corresponding Gene IDs from NCBI http://www.ncbi.nlm.nih.gov/ using gene2refseq. The mRNA decay rate for a given gene with different Accession IDs was then averaged, resulting in ~4250 unique genes with mRNA decay rate. Protein stability data were obtained from Yen et al. , which contains half-life protein stability measures represented as protein stability index for ~6500 unique genes. For genes with multiple tests, the values of protein stability indices were averaged.
Rank tests were employed for a few analyses, including miRNA binding sites vs. mRNA decay rates, miRNA binding sites vs. protein stability, miRNA density vs. mRNA decay rates, miRNA density vs. miRNA binding sites, mRNA decay rates vs. mRNA expression, and protein stability vs. mRNA expression. Taking the miRNA binding sites vs. mRNA decay rates as an example, genes were first ranked by either the number of miRNA binding sites or mRNA decay rates, resulting in 2 datasets with either ordered number of miRNA binding sites or ordered mRNA decay rates. Genes in each of the two resulting datasets were then separated into 5 equal number groups, which were represented as [1,20], (21,40], (41,60], (61,80], and (81,100] in standard interval notation, with increasing values. Comparison analyses for the paired values of miRNA binding sites and mRNA decay rates were performed within each of the 5 gene groups.
To assess the robustness of the trends for the additive effect of miRNA binding sites on mRNA decay rate, permutation test was first performed to create 10,000 datasets by randomizing samples of miRNA binding sites and mRNA decay rate from genes of the test datasets. Each of the random datasets was then subjected to rank test as described above. Wilcoxon rank sum tests were subsequently employed to determine the statistical difference between [1,20] and (41,60] groups as well as between (41,60] and (81,100] groups in each random dataset. The resulting 10,000 p-values were compared to that from test dataset to estimate the robustness of the trends for the observed additive effect of miRNA binding sites on mRNA decay rate.
We also performed analysis to test whether the additive effects of miRNA binding sites on mRNA decay rate were contributed from biologically relevant miRNA binding sites. For this analysis, we first created 3 random 3'UTR sequence datasets, each having the same number of genes and nucleotide contents as the test dataset. Specifically, we shuffled the 3'UTR sequences within each of the 2248 RNA22 predicted miRNA target genes which have mRNA decay rate data by either keeping dinucleotide together or mixing completely. The resulting datasets (2 from keeping dinucleotide together and 1 from complete mixing) were then subjected to the prediction of miRNA binding sites through the RNA22 web server http://cbcsrv.watson.ibm.com/rna22.html. To make the predictions comparable to those of the test dataset, we compiled and used the same list of miRNAs based on the test dataset and set up the parameters for RNA22 algorithm to be 0 for "maximum number of allowed UN-paired bases", 6 for "in seed/nucleus", 14 for "minimum number of paired-up bases in heteroduplex", and -25 for "maximum folding energy for heteroduplex".
The author thanks Michael Buck for helpful discussions, Thomas Furlani for critical reading of the manuscript, and Andrew Bruno for providing programming support to get predicted miRNA binding sites from RNA22 web server. This work was partially supported by NIH grant AI28304.
- Lee RC, Feinbaum RL, Ambros V: The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14. Cell. 1993, 75 (5): 843-854. 10.1016/0092-8674(93)90529-Y.View ArticlePubMedGoogle Scholar
- Wightman B, Ha I, Ruvkun G: Posttranscriptional regulation of the heterochronic gene lin-14 by lin-4 mediates temporal pattern formation in C. elegans. Cell. 1993, 75 (5): 855-862. 10.1016/0092-8674(93)90530-4.View ArticlePubMedGoogle Scholar
- Kloosterman WP, Plasterk RH: The diverse functions of microRNAs in animal development and disease. Developmental cell. 2006, 11 (4): 441-450. 10.1016/j.devcel.2006.09.009.View ArticlePubMedGoogle Scholar
- Voinnet O: Origin, biogenesis, and activity of plant microRNAs. Cell. 2009, 136 (4): 669-687. 10.1016/j.cell.2009.01.046.View ArticlePubMedGoogle Scholar
- Sarnow P, Jopling CL, Norman KL, Schutz S, Wehner KA: MicroRNAs: expression, avoidance and subversion by vertebrate viruses. Nat Rev Microbiol. 2006, 4 (9): 651-659. 10.1038/nrmicro1473.View ArticlePubMedGoogle Scholar
- Esquela-Kerscher A, Slack FJ: Oncomirs - microRNAs with a role in cancer. Nature reviews. 2006, 6 (4): 259-269. 10.1038/nrc1840.PubMedGoogle Scholar
- Chang TC, Mendell JT: microRNAs in vertebrate physiology and human disease. Annual review of genomics and human genetics. 2007, 8: 215-239. 10.1146/annurev.genom.8.080706.092351.View ArticlePubMedGoogle Scholar
- Krutzfeldt J, Stoffel M: MicroRNAs: a new class of regulatory genes affecting metabolism. Cell metabolism. 2006, 4 (1): 9-12. 10.1016/j.cmet.2006.05.009.View ArticlePubMedGoogle Scholar
- Doench JG, Sharp PA: Specificity of microRNA target selection in translational repression. Genes & development. 2004, 18 (5): 504-511. 10.1101/gad.1184404.View ArticleGoogle Scholar
- Brennecke J, Stark A, Russell RB, Cohen SM: Principles of microRNA-target recognition. PLoS biology. 2005, 3 (3): e85-10.1371/journal.pbio.0030085.PubMed CentralView ArticlePubMedGoogle Scholar
- Lewis BP, Burge CB, Bartel DP: Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell. 2005, 120 (1): 15-20. 10.1016/j.cell.2004.12.035.View ArticlePubMedGoogle Scholar
- Pillai RS, Bhattacharyya SN, Filipowicz W: Repression of protein synthesis by miRNAs: how many mechanisms?. Trends in cell biology. 2007, 17 (3): 118-126. 10.1016/j.tcb.2006.12.007.View ArticlePubMedGoogle Scholar
- Pillai RS, Bhattacharyya SN, Artus CG, Zoller T, Cougot N, Basyuk E, Bertrand E, Filipowicz W: Inhibition of translational initiation by Let-7 MicroRNA in human cells. Science (New York, NY). 2005, 309 (5740): 1573-1576.View ArticleGoogle Scholar
- Mathonnet G, Fabian MR, Svitkin YV, Parsyan A, Huck L, Murata T, Biffo S, Merrick WC, Darzynkiewicz E, Pillai RS, et al: MicroRNA inhibition of translation initiation in vitro by targeting the cap-binding complex eIF4F. Science (New York, NY). 2007, 317 (5845): 1764-1767.View ArticleGoogle Scholar
- Seggerson K, Tang L, Moss EG: Two genetic circuits repress the Caenorhabditis elegans heterochronic gene lin-28 after translation initiation. Developmental biology. 2002, 243 (2): 215-225. 10.1006/dbio.2001.0563.View ArticlePubMedGoogle Scholar
- Petersen CP, Bordeleau ME, Pelletier J, Sharp PA: Short RNAs repress translation after initiation in mammalian cells. Molecular cell. 2006, 21 (4): 533-542. 10.1016/j.molcel.2006.01.031.View ArticlePubMedGoogle Scholar
- Bagga S, Bracht J, Hunter S, Massirer K, Holtz J, Eachus R, Pasquinelli AE: Regulation by let-7 and lin-4 miRNAs results in target mRNA degradation. Cell. 2005, 122 (4): 553-563. 10.1016/j.cell.2005.07.031.View ArticlePubMedGoogle Scholar
- Giraldez AJ, Mishima Y, Rihel J, Grocock RJ, Van Dongen S, Inoue K, Enright AJ, Schier AF: Zebrafish MiR-430 promotes deadenylation and clearance of maternal mRNAs. Science (New York, NY). 2006, 312 (5770): 75-79.View ArticleGoogle Scholar
- Lim LP, Lau NC, Garrett-Engele P, Grimson A, Schelter JM, Castle J, Bartel DP, Linsley PS, Johnson JM: Microarray analysis shows that some microRNAs downregulate large numbers of target mRNAs. Nature. 2005, 433 (7027): 769-773. 10.1038/nature03315.View ArticlePubMedGoogle Scholar
- Wu L, Fan J, Belasco JG: MicroRNAs direct rapid deadenylation of mRNA. Proceedings of the National Academy of Sciences of the United States of America. 2006, 103 (11): 4034-4039. 10.1073/pnas.0510928103.PubMed CentralView ArticlePubMedGoogle Scholar
- Eulalio A, Huntzinger E, Nishihara T, Rehwinkel J, Fauser M, Izaurralde E: Deadenylation is a widespread effect of miRNA regulation. RNA (New York, NY). 2009, 15 (1): 21-32.View ArticleGoogle Scholar
- Baek D, Villen J, Shin C, Camargo FD, Gygi SP, Bartel DP: The impact of microRNAs on protein output. Nature. 2008, 455 (7209): 64-71. 10.1038/nature07242.PubMed CentralView ArticlePubMedGoogle Scholar
- Huang JC, Babak T, Corson TW, Chua G, Khan S, Gallie BL, Hughes TR, Blencowe BJ, Frey BJ, Morris QD: Using expression profiling data to identify human microRNA targets. Nature methods. 2007, 4 (12): 1045-1049. 10.1038/nmeth1130.View ArticlePubMedGoogle Scholar
- Orom UA, Nielsen FC, Lund AH: MicroRNA-10a binds the 5'UTR of ribosomal protein mRNAs and enhances their translation. Molecular cell. 2008, 30 (4): 460-471. 10.1016/j.molcel.2008.05.001.View ArticlePubMedGoogle Scholar
- Henke JI, Goergen D, Zheng J, Song Y, Schuttler CG, Fehr C, Junemann C, Niepmann M: microRNA-122 stimulates translation of hepatitis C virus RNA. The EMBO journal. 2008, 27 (24): 3300-3310. 10.1038/emboj.2008.244.PubMed CentralView ArticlePubMedGoogle Scholar
- Jopling CL, Schutz S, Sarnow P: Position-dependent function for a tandem microRNA miR-122-binding site located in the hepatitis C virus RNA genome. Cell host & microbe. 2008, 4 (1): 77-85. 10.1016/j.chom.2008.05.013.View ArticleGoogle Scholar
- Vasudevan S, Tong Y, Steitz JA: Switching from repression to activation: microRNAs can up-regulate translation. Science (New York, NY). 2007, 318 (5858): 1931-1934.View ArticleGoogle Scholar
- Vasudevan S, Steitz JA: AU-rich-element-mediated upregulation of translation by FXR1 and Argonaute 2. Cell. 2007, 128 (6): 1105-1118. 10.1016/j.cell.2007.01.038.PubMed CentralView ArticlePubMedGoogle Scholar
- Vasudevan S, Tong Y, Steitz JA: Cell-cycle control of microRNA-mediated translation regulation. Cell cycle (Georgetown, Tex). 2008, 7 (11): 1545-1549.View ArticleGoogle Scholar
- Rehwinkel J, Natalin P, Stark A, Brennecke J, Cohen SM, Izaurralde E: Genome-wide analysis of mRNAs regulated by Drosha and Argonaute proteins in Drosophila melanogaster. Molecular and cellular biology. 2006, 26 (8): 2965-2975. 10.1128/MCB.26.8.2965-2975.2006.PubMed CentralView ArticlePubMedGoogle Scholar
- Selbach M, Schwanhausser B, Thierfelder N, Fang Z, Khanin R, Rajewsky N: Widespread changes in protein synthesis induced by microRNAs. Nature. 2008, 455 (7209): 58-63. 10.1038/nature07228.View ArticlePubMedGoogle Scholar
- Rhoades MW, Reinhart BJ, Lim LP, Burge CB, Bartel B, Bartel DP: Prediction of plant microRNA targets. Cell. 2002, 110 (4): 513-520. 10.1016/S0092-8674(02)00863-2.View ArticlePubMedGoogle Scholar
- Farh KK, Grimson A, Jan C, Lewis BP, Johnston WK, Lim LP, Burge CB, Bartel DP: The widespread impact of mammalian MicroRNAs on mRNA repression and evolution. Science (New York, NY). 2005, 310 (5755): 1817-1821.View ArticleGoogle Scholar
- Stark A, Brennecke J, Bushati N, Russell RB, Cohen SM: Animal MicroRNAs confer robustness to gene expression and have a significant impact on 3'UTR evolution. Cell. 2005, 123 (6): 1133-1146. 10.1016/j.cell.2005.11.023.View ArticlePubMedGoogle Scholar
- Bentwich I, Avniel A, Karov Y, Aharonov R, Gilad S, Barad O, Barzilai A, Einat P, Einav U, Meiri E, et al: Identification of hundreds of conserved and nonconserved human microRNAs. Nature genetics. 2005, 37 (7): 766-770. 10.1038/ng1590.View ArticlePubMedGoogle Scholar
- Berezikov E, Guryev V, Belt van de J, Wienholds E, Plasterk RH, Cuppen E: Phylogenetic shadowing and computational identification of human microRNA genes. Cell. 2005, 120 (1): 21-24. 10.1016/j.cell.2004.12.031.View ArticlePubMedGoogle Scholar
- Xie X, Lu J, Kulbokas EJ, Golub TR, Mootha V, Lindblad-Toh K, Lander ES, Kellis M: Systematic discovery of regulatory motifs in human promoters and 3' UTRs by comparison of several mammals. Nature. 2005, 434 (7031): 338-345. 10.1038/nature03441.PubMed CentralView ArticlePubMedGoogle Scholar
- Lindow M, Gorodkin J: Principles and limitations of computational microRNA gene and target finding. DNA and cell biology. 2007, 26 (5): 339-351. 10.1089/dna.2006.0551.View ArticlePubMedGoogle Scholar
- Rajewsky N: microRNA target predictions in animals. Nature genetics. 2006, 38 (Suppl): S8-13. 10.1038/ng1798.View ArticlePubMedGoogle Scholar
- Poy MN, Eliasson L, Krutzfeldt J, Kuwajima S, Ma X, Macdonald PE, Pfeffer S, Tuschl T, Rajewsky N, Rorsman P, et al: A pancreatic islet-specific microRNA regulates insulin secretion. Nature. 2004, 432 (7014): 226-230. 10.1038/nature03076.View ArticlePubMedGoogle Scholar
- Krutzfeldt J, Rajewsky N, Braich R, Rajeev KG, Tuschl T, Manoharan M, Stoffel M: Silencing of microRNAs in vivo with 'antagomirs'. Nature. 2005, 438 (7068): 685-689. 10.1038/nature04303.View ArticlePubMedGoogle Scholar
- Su AI, Wiltshire T, Batalov S, Lapp H, Ching KA, Block D, Zhang J, Soden R, Hayakawa M, Kreiman G, et al: A gene atlas of the mouse and human protein-encoding transcriptomes. Proceedings of the National Academy of Sciences of the United States of America. 2004, 101 (16): 6062-6067. 10.1073/pnas.0400782101.PubMed CentralView ArticlePubMedGoogle Scholar
- Yang E, van Nimwegen E, Zavolan M, Rajewsky N, Schroeder M, Magnasco M, Darnell JE: Decay rates of human mRNAs: correlation with functional characteristics and sequence attributes. Genome research. 2003, 13 (8): 1863-1872.PubMed CentralPubMedGoogle Scholar
- Yen HC, Xu Q, Chou DM, Zhao Z, Elledge SJ: Global protein stability profiling in mammalian cells. Science (New York, NY). 2008, 322 (5903): 918-923.View ArticleGoogle Scholar
- Castillo-Davis CI, Mekhedov SL, Hartl DL, Koonin EV, Kondrashov FA: Selection for short introns in highly expressed genes. Nature genetics. 2002, 31 (4): 415-418.PubMedGoogle Scholar
- Eisenberg E, Levanon EY: Human housekeeping genes are compact. Trends Genet. 2003, 19 (7): 362-365. 10.1016/S0168-9525(03)00140-9.View ArticlePubMedGoogle Scholar
- Krek A, Grun D, Poy MN, Wolf R, Rosenberg L, Epstein EJ, MacMenamin P, da Piedade I, Gunsalus KC, Stoffel M, et al: Combinatorial microRNA target predictions. Nature genetics. 2005, 37 (5): 495-500. 10.1038/ng1536.View ArticlePubMedGoogle Scholar
- Miranda KC, Huynh T, Tay Y, Ang YS, Tam WL, Thomson AM, Lim B, Rigoutsos I: A pattern-based method for the identification of MicroRNA binding sites and their corresponding heteroduplexes. Cell. 2006, 126 (6): 1203-1217. 10.1016/j.cell.2006.07.031.View ArticlePubMedGoogle Scholar
- Su AI, Cooke MP, Ching KA, Hakak Y, Walker JR, Wiltshire T, Orth AP, Vega RG, Sapinoso LM, Moqrich A, et al: Large-scale analysis of the human and mouse transcriptomes. Proceedings of the National Academy of Sciences of the United States of America. 2002, 99 (7): 4465-4470. 10.1073/pnas.012025199.PubMed CentralView ArticlePubMedGoogle Scholar
- Hurst LD, McVean G, Moore T: Imprinted genes have few and small introns. Nature genetics. 1996, 12 (3): 234-237. 10.1038/ng0396-234.View ArticlePubMedGoogle Scholar
- Urrutia AO, Hurst LD: The signature of selection mediated by expression on human genes. Genome research. 2003, 13 (10): 2260-2264. 10.1101/gr.641103.PubMed CentralView ArticlePubMedGoogle Scholar
- Vinogradov AE: Compactness of human housekeeping genes: selection for economy or genomic design?. Trends Genet. 2004, 20 (5): 248-253. 10.1016/j.tig.2004.03.006.View ArticlePubMedGoogle Scholar
- Shabalina SA, Spiridonov NA: The mammalian transcriptome and the function of non-coding DNA sequences. Genome biology. 2004, 5 (4): 105-10.1186/gb-2004-5-4-105.PubMed CentralView ArticlePubMedGoogle Scholar
- Pozzoli U, Menozzi G, Comi GP, Cagliani R, Bresolin N, Sironi M: Intron size in mammals: complexity comes to terms with economy. Trends Genet. 2007, 23 (1): 20-24. 10.1016/j.tig.2006.10.003.View ArticlePubMedGoogle Scholar
- Sood P, Krek A, Zavolan M, Macino G, Rajewsky N: Cell-type-specific signatures of microRNAs on target mRNA expression. Proceedings of the National Academy of Sciences of the United States of America. 2006, 103 (8): 2746-2751. 10.1073/pnas.0511045103.PubMed CentralView ArticlePubMedGoogle Scholar
- Bartel DP, Chen CZ: Micromanagers of gene expression: the potentially widespread influence of metazoan microRNAs. Nat Rev Genet. 2004, 5 (5): 396-400. 10.1038/nrg1328.View ArticlePubMedGoogle Scholar
- Megraw M, Sethupathy P, Corda B, Hatzigeorgiou AG: miRGen: a database for the study of animal microRNA genomic organization and function. Nucleic acids research. 2007, D149-155. 10.1093/nar/gkl904. 35 DatabaseGoogle Scholar
- Matys V, Kel-Margoulis OV, Fricke E, Liebich I, Land S, Barre-Dirrie A, Reuter I, Chekmenev D, Krull M, Hornischer K: TRANSFAC and its module TRANSCompel: transcriptional gene regulation in eukaryotes. Nucleic acids research. 2006, D108-110. 10.1093/nar/gkj143. 34 DatabaseGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.