Analysis of the association between codon optimality and mRNA stability in Schizosaccharomyces pombe

Harigaya, Yuriko; Parker, Roy

doi:10.1186/s12864-016-3237-6

Research article
Open access
Published: 08 November 2016

Analysis of the association between codon optimality and mRNA stability in Schizosaccharomyces pombe

Yuriko Harigaya^1,2 &
Roy Parker^1,2

BMC Genomics volume 17, Article number: 895 (2016) Cite this article

3204 Accesses
45 Citations
3 Altmetric
Metrics details

Abstract

Background

Recent experiments have shown that codon optimality is a major determinant of mRNA stability in Saccharomyces cerevisiae and that this phenomenon may be conserved in Escherichia coli and some metazoans, although work in Neurospora crassa is not consistent with this model.

Results

We examined the association between codon optimality and mRNA stability in the fission yeast Schizosaccharomyces pombe. Our analysis revealed the following points. First, we observe a genome-wide association between codon optimality and mRNA stability also in S. pombe, suggesting evolutionary conservation of the phenomenon. Second, in both S. pombe and S. cerevisiae, mRNA synthesis rates are also correlated at the genome-wide analysis with codon optimality, suggesting that the long-appreciated association between codon optimality and mRNA abundance is due to regulation of both mRNA synthesis and degradation. However, when we examined correlation of codon optimality and either mRNA half-lives or synthesis rates controlling for mRNA abundance, codon optimality was still positively correlated with mRNA half-lives in S. cerevisiae, but the association was no longer significant for mRNA half-lives in S. pombe or for synthesis rates in either organism. This illustrates how only the pairwise analysis of multiple correlating variables may limit these types of analyses. Finally, in S. pombe, codon optimality is associated with known DNA/RNA sequence motifs that are associated with mRNA production/stability, suggesting these two features have been under similar selective pressures for optimal gene expression.

Conclusions

Consistent with the emerging body of studies, this study suggests that the association between codon optimality and mRNA stability may be a broadly conserved phenomenon. It also suggests that the association can be explained at least in part by independent adaptations of codon optimality and other transcript features for elevated expression during evolution.

Background

The regulation of mRNA degradation is a critical step in the gene expression process. Previous studies have identified multiple gene sequence features that contribute to the dramatic differences in stability of cellular mRNAs. For example, RNA sequence elements [1, 2], RNA secondary structures within transcripts [3, 4], and DNA elements in promoter regions [5, 6], which can be specifically recognized by regulatory factors, play key roles in the modulation of mRNA stability [7]. Lengths of the transcripts can also affect mRNA stability [8, 9]. Moreover, a recent study by Coller and colleagues has identified codon optimality as another critical determinant of mRNA stability in Saccharomyces cerevisiae [10].

The concept of codon optimality has been developed through the study of the codon usage bias, which refers to the unequal frequency of synonymous codons within a gene, a genome or between genomes [11, 12]. The codon usage bias within a genome has been thought to be at least in part a consequence of selective pressure based on the observation that in multiple organisms highly expressed genes tend to have strong codon usage bias [13, 14]. Since the codons that are preferably used in highly expressed genes correspond to relatively abundant tRNA species in at least several organisms [15–17], it has been postulated that the preferred codons promote translation elongation rates and thereby increase the protein output. However, this hypothesis has been a matter of debate. A body of studies in unicellular organisms suggest that under physiological conditions rates of translation initiation rather than those of translation elongation mainly determine the protein output [18–21], which is not consistent with the hypothesis. On the other hand, some studies suggest that translation elongation rates can affect protein output by modulating translation initiation rates, which supports the hypothesis [22, 23].

The assumption underlying the hypothesis is that tRNA concentrations affect translation elongation rates at cognate codons. Because tRNA selection is a major rate-limiting step in the translation elongation cycle [24, 25], in theory, cellular concentrations of aminoacyl tRNAs affect rates of translation elongation when the cognate codons are in the ribosomal A-site. That is, the optimal codons, which are decoded by high abundance tRNA species, are translated more efficiently whereas the non-optimal codons, which are decoded by low abundance tRNA species, are translated more slowly. Despite the fundamental importance of this notion, it was only recently that convincing evidence emerged to support the idea that codon identify affects translation elongation rates [26–30].

In the recent study by Coller and colleagues, codon optimality was shown to be significantly associated with mRNA stability on a genomic scale in S. cerevisiae [10]. Moreover, experiments using several model transcripts have established a causal relationship between the two variables [10]. These results, along with earlier studies [31–33], have led to a model where non-optimal codons slow ribosomal translation and thereby trigger mRNA decay [10], which can offer a powerful explanation for the codon usage bias within a genome. More recent studies in Escherichia coli, zebrafish, Drosophila, Xenopus, and mouse are consistent with the codon-mediated regulation of mRNA stability being a conserved phenomenon [34–36]. Furthermore, the codon-mediated decay appears to play roles in human disease. For example, in human breast cancer cells, modulation of tRNA concentrations preferentially alters stability of mRNAs with high content of codons that are decoded by the tRNAs [37].

The generality of non-optimal codons triggering mRNA decay is not yet entirely clear. For example, a study in Neurospora crassa found that local accumulations of ribosomes on mRNAs due to synonymous codon substitutions, which indicate reduced rates of translation elongation, were not associated with changes in mRNA abundance [26]. The observation indicates that slow ribosome movements caused by non-optimal codons and mRNA instability can be uncoupled at least in certain situations and that the prevalence of the association between the two features requires additional investigation.

To examine the evolutionary conservation of the association between codon optimality and mRNA stability further, we have analyzed previously published RNA kinetic data in the fission yeast Schizosaccharomyces pombe, a yeast species that is evolutionarily distant from S. cerevisiae [38]. Our analysis revealed a significant association between codon optimality and mRNA stability in S. pombe, supporting the generality of the phenomenon. Our results also suggest that independent adaptations of codon optimality and other transcript features for elevated expression can contribute to formation of the optimal gene expression landscape.

Results and discussion

Comparison of previously published RNA kinetic datasets

As a prerequisite for our analysis to examine associations between codon optimality and mRNA stability, we required robust sets of genome-wide decay rate measurements. Towards this goal, we compiled previously published genome-wide RNA kinetic data in S. cerevisiae and S. pombe. The datasets were generated in multiple laboratories using various experimental methods and are summarized in Tables 1 and 2.

Table 1 Genome-wide RNA kinetic data in S. cerevisiae

Full size table

Table 2 Genome-wide RNA kinetic data in S. pombe

Full size table

In S. cerevisiae, a number of studies have measured mRNA synthesis and decay rates on a genomic scale. Although the methods used in these studies differ, kinetic rates are directly measured. Under the assumption that mRNA metabolism is in a steady state, some studies measured mRNA synthesis rates as well as mRNA abundance and calculated mRNA half-lives, while others measured mRNA decay rates and abundance to calculate mRNA synthesis rates (Table 1).

mRNA synthesis rates can be measured by multiple methods including genomic run-on (GRO) [39]. Direct measurement of mRNA half-lives is performed either by transcription shutoff or metabolic labeling. Although transcription shutoff is usually achieved by an addition of transcription inhibitor, in S. cerevisiae, many studies have used the rpb1-1 temperature sensitive mutant of the RBP1 gene, encoding the largest subunit of RNA polymerase II [40]. An alternative to the rbp1-1 strain is the rbp1-frb strain, in which Rbp1 is depleted from the nucleus upon an addition of the drug rapamycin via the “anchor-away” technique [8, 41]. Metabolic labeling using 4-thiouridine (4sU) or 4-thiouracil (4tU) can measure mRNA half-lives with minimal perturbation [42–47]. In one scheme, time course experiments are performed after an addition of 4sU/4tU to cell cultures [44, 46–48], which we refer to as “4sU/4tU labeling.” In another scheme, samples are analyzed after an addition of Uracil to cultures that have been grown in the presence of 4sU/4tU [45], which we refer to as “4sU/4tU chase” (Table 1).

Genome-wide RNA kinetic measurement experiments also differ in methods of mRNA quantification, which can be performed by microarray, RNA-seq, and direct RNA sequencing (DRS). A critical factor in mRNA quantification is whether the procedure enriches polyadenylated species relative to deadenylated species for a given gene. The enrichment of polyadenylated species can be achieved by physical separation using an oligo(dT) column, which we refer to as “oligo(dT) selection.” It can also be achieved by selective cDNA synthesis using an oligo(dT) primer, which we refer to as “oligo(dT) priming” (Table 1). It was previously noted that oligo(dT) selection can fail to capture RNA species that retain substantial poly(A) tails and result in highly skewed kinetic values [10, 49].

We first compared mRNA half-lives and synthesis rates obtained from 14 previously published RNA kinetic data in S. cerevisiae (Methods and Table 1). The analysis led to the following observations. First, as previously noted by others [10, 44, 47], ranges of mRNA half-lives varied substantially depending on the dataset (Fig. 1a), and correlations across different datasets were generally poor (Fig. 1c and Additional file 1: Figure S1). Second, however, datasets that were generated via 4sU/4tU labeling protocols (“Cramer (1),” “Cramer (2),” and “Gresham”) [44, 46, 48] showed similar ranges of values (Fig. 1a) and fair levels of correlations (Fig. 1c and Additional file 1: Figure S1). Although datasets that were generated via the rbp1-1 or rbp1-frb system (“Brown (1),” “Brown (2),” “Peltz,” “Struhl,” “Young,” “Hughes,” “Coller (2),” “Pilpel,” and “Coller (1)”) [8, 10, 50–54] also showed fair levels of correlations (Fig. 1c), their absolute values varied substantially (Fig. 1a). We note that mRNA half-lives obtained by Weis and colleagues via 4tU chase and oligo(dT) selection (“Weis”) [45] substantially differed from the 4sU/4tU labeling data (“Cramer (1),” “Cramer (2),” and “Gresham”), which were generated via protocols without oligo(dT) selection (Table 1, Fig. 1c, and Additional file 1: Figure S1). Third, unlike mRNA half-lives, mRNA synthesis rates from all datasets were positively correlated with one another regardless of the methods used for measurements (Fig. 2c and Additional file 2: Figure S3). We note, however, that the absolute values differed substantially depending on the datasets (Fig. 2a).

Although it is not possible to rigorously determine which datasets are most biologically accurate, we suggest that the results obtained via 4sU/4tU labeling without oligo(dT) selection (“Cramer (1)”, “Cramer (2),” and “Gresham”) are more likely to reflect the physiological nature of gene regulation for the following reasons. First, methods that involve transcription shutoff are subject to artifacts due to stress responses caused by global transcription shutoff [47]. Second, in the GRO protocols, cells are treated with sarkosyl, which perturbs cell physiology. Third, at least in S. cerevisiae, it has been shown that an addition of 4sU/4tU at a relevant concentration does not impact gene expression profiles within a relevant time frame [44, 47] and that an addition of 4tU at a relevant concentration does not noticeably affect cell growth [46]. Fourth, as described above, oligo(dT) selection can lead to skewed values [10]. We, therefore, focused on the three datasets (“Cramer (1),” “Cramer (2),” and “Gresham”) in subsequent analyses in S. cerevisiae.

We also compared mRNA half-life values in seven datasets from four studies in S. pombe (“Mata (1),” “Mata (2),” “Mata (3),” “Mata (4),” “Cramer,” “Mata (5),” and “Gagneur”), which were all generated via 4sU/4tU labeling (Methods and Table 2). In addition, we also analyzed genome-wide chromatin immunoprecipitation (ChIP) data for RNA polymerase II (RNAPII) by Bahler and colleagues (“Bahler”) [55] because an association of RNAPII can serve as a proxy for mRNA synthesis. The analysis led to the following observations. First, mRNA half-lives in the different datasets showed moderate to strong correlations with one another (Fig. 1d and Additional file 3: Figure S2). Second, mRNA synthesis rates were also well correlated across the datasets (Fig. 2d, Additional file 4: Figure S4, and Additional file 5: Figure S5). Third, however, the absolute values of mRNA half-lives and synthesis rates differed substantially depending on the datasets (Figs. 1b and 2b). These observations represent the technical challenges in quantifying absolute kinetic values but suggest that relative values are reasonably reliable.

Although all these datasets were generated via 4sU/4tU labeling, the quality of data is likely to be heterogeneous (Table 2). For example, the two datasets obtained from an early study by Mata and colleagues (“Mata (1)” and “Mata (2)”) do not contain replicates [56]. Moreover, in the experiments, cells were labeled with 4sU for 15 and 30 min [56], which could potentially have affected cell physiology. The same caveats apply to the “Mata (3)” and “Mata (4)” datasets, which contain values recomputed by Cramer and colleagues from the “Mata (1)” and “Mata (2)” datasets, respectively, taking into account labeling bias and cell division [47]. The dataset by Cramer and colleagues (“Cramer”) were obtained from two replicate experiments in which cells were labeled with 4sU for 6 min [47]. The more recent data by Mata and colleagues (“Mata (5)”) contain arithmetic means of values obtained from seven replicate experiments in which cells were labeled with 4sU for either 7 or 10 min [57]. The most recent data by Gagneur, Cramer, and colleagues (“Gagneur”) are based on two replicate experiments in which samples were analyzed at five time points within 10 min after an addition of 4tU as well as at the steady state. Moreover, unlike in other studies, the kinetic rates were obtained by a mathematical model that takes into account pre-mRNA splicing and, therefore, are likely to be more accurate [58]. To control the quality, in subsequent analyses, we focused on the two datasets obtained from the most recent studies, which consist of larger numbers of replicates/data points (“Mata (5)” and “Gagneur”).

Correlations between mRNA half-lives, synthesis rates, and abundance

We next examined relationships between mRNA half-lives, stability, and abundance in the selected datasets, which we considered the most reliable. The analysis led to the following observations, which are consistent with previous studies [44, 55]. First, mRNA half-lives were positively correlated with mRNA abundance in both S. cerevisiae (Additional file 6: Figures S6A-C) and S. pombe (Additional file 7: Figures S7A and B). Second, mRNA synthesis rates were also positively correlated with mRNA levels in both organisms (Additional file 6: Figures S6D-F, Additional file 7: Figure S7C, and D). Third, mRNA half-lives and synthesis rates did not show a consistent relationship across datasets in S. cerevisiae (Additional file 6: Figure S6G-I) whereas, in S. pombe, very weak positive correlations were detected (Additional file 7: Figure S7G and H).

Overall, the results are consistent with a global picture of gene regulation in dividing cells of S. cerevisiae and S. pombe, where both mRNA synthesis rates and half-lives are positively correlated with mRNA abundance. Rather than being a fine tuner, mRNA stability control appears to function cooperatively with the transcriptional control to contribute to the diversification of mRNA abundance.

Association between codon optimality and mRNA stability

To examine whether codon optimality is associated with mRNA stability in S. pombe, we used a method previously developed by Coller and colleagues with a minor modification (Methods) [10]. As a metric of optimality of each codon, we used both the “relative adaptiveness value” for the tRNA adaptation index (tAI) (Methods) [59], also known as classical translation efficiency (cTE) [60], and normalized translation efficiency (nTE) [60]. tAI takes into account tRNA gene copy numbers and wobble base paring based on the assumption that tRNA gene copy numbers correspond to aminoacyl tRNA concentrations. nTE additionally takes into account mRNA abundance based on the idea that an effective tRNA concentration can also be affected by the levels of mRNAs that require the tRNA for translation.

We first calculated a metric called the codon occurrence to mRNA stability correlation coefficient (CSC), which was originally defined as a Pearson's correlation coefficient between the frequency of occurrence of each codon in mRNAs and mRNA half-lives [10]. In our analysis, we used a Spearman's correlation coefficient instead of a Pearson's correlation coefficient since some data contained outliers. Codons that are enriched in stable mRNAs have positive CSC values, whereas those that are enriched in unstable mRNA have negative CSC values. To determine the statistical significance of the association between codon optimality and the CSC, we split codons into two groups based on the sign of the CSC and performed chi-square tests as in the previous study [10].

Coller and colleagues have observed the association between codon optimality and mRNA half-lives in their own data generated via the rbp1-1 system (“Coller (1)”) as well as in the data by Cramer and colleagues, which was generated via 4sU labeling (“Cramer (1)”) [10, 44]. To reassess these results in S. cerevisiae, we computed the CSC values using the dataset by Coller and colleagues (“Coller (1)”) as well as those on which we chose to focus in this study (“Cramer (1),” “Cramer (2),” and “Gresham”). This analysis led to the following observations, which confirmed the previously discovered association in S. cerevisiae. First, CSC values obtained from the four datasets were highly correlated with one another (Additional file 8: Figure S8A-F). Second, the CSC values were significantly associated with the both metrics of codon optimality (cTE and nTE) (Fig. 3a, Additional file 9: Figure S9A, B, and Table 3) [10]. Third, consistent with this, mRNA half-lives were positively correlated with geometric mean values of tAI computed for individual genes (tAI_g) (Methods) [59] (Fig. 3c, Additional file 9: Figure S9D, and E). Moreover, we obtained essentially the same results when we used percent optimal codon content of individual genes instead of the tAI_g values (Additional file 10: Figure S10A-F). Fourth, the CSC values were positively correlated with the tAI values (Additional file 11: Figure S11A, D, and G). Fifth, similarly to the earlier study [10], the positive correlation between codon optimality and the CSC disappeared when +1 or +2 frameshifts were computationally introduced, suggesting that other transcript features, such as nucleotide composition and RNA structures, are unlikely to contribute to the observed correlation (Additional file 11: Figure S11B, C, E, F, H, and I).

Table 3 Association between codon optimality and mRNA half-lives

Full size table

We then performed similar analyses using the two RNA kinetic datasets selected in S. pombe (“Mata (5)” and “Gagneur”). This led to the following observations. First, the CSC values obtained from the two datasets were highly correlated with each other (Additional file 8: Figure S8G). Second, the CSC values were significantly associated with the both metrics of codon optimality (cTE and nTE) (Fig. 3b, Additional file 9: Figure S9C, and Table 3). Third, consistent with this, mRNA half-lives were positively correlated with the tAI_g values as well as with optimal codon content (Fig. 3d, Additional file 9: Figure S9F, and Additional file 10: Figure S10G-J). Fourth, the CSC values were positively correlated with the tAI values (Additional file 12: Figure S12A and D). Fifth, the positive correlation between codon optimality and the CSC disappeared when +1 or +2 frameshifts were computationally introduced (Additional file 12: Figure S12B, C, E, and F).

These results suggest that the association between codon optimality and mRNA stability is conserved when examined at a gnome-wide scale in S. pombe. Given the evolutionary distance between the two yeast species [38], this implies that the association is a broadly conserved phenomenon.

Although we are only presenting a correlation between codon optimality and decay rates, several observations suggest that codon optimality can impact the rate of mRNA decay through a mechanistic connection, at least in S. cerevisiae. First, in the replacement of abundant codons with minor synonymous codons in the S. cerevisiae PGK1 transcript reduced the mRNA level, consistent with reduced codon optimality increasing the decay rate [31]. Second, resynthesis of mRNAs with increased or decreased codon optimality but synonymous codons leads to a corresponding change in the mRNA decay rate [10]. Third, specific regions of minor synonymous codons can lead to increased mRNA decay rates for the S. cerevisiae MATalpha1 mRNA [32, 33]. An important issue for future work will be determining the mechanistic coupling between codon optimality and mRNA decay mechanisms. Notably, new studies have started elucidating the molecular mechanisms underlying the coupling [35, 36, 61].

It has long been appreciated that highly expressed genes are often enriched in optimal codons [11]. To investigate the association between codon optimality and mRNA half-lives further, we computed the Spearman's partial correlations between the two variables controlling for mRNA abundance. In S. cerevisiae, the positive correlation between codon optimality in all three metrics and mRNA half-lives remained significant after analyses were controlled for mRNA abundance (Additional file 13: Table S1). This is consistent with the intimate relationship between codon optimality and mRNA stability in this organism. In S. pombe, the positive correlation remained for both the datasets only when the tAI values were used as a metric of codon optimality (Additional file 13: Table S1). This may suggest that, in S. pombe, the relationship between codon optimality and mRNA stability is weaker than in S. cerevisiae.

Association between codon optimality and mRNA synthesis rates

As described above, highly expressed genes are often enriched in optimal codons [11]. Since mRNA abundance is determined by rates of mRNA production and degradation, we next examined the association between codon optimality and mRNA synthesis rates in S. cerevisiae and S. pombe. For this purpose, we calculated a Spearman's correlation coefficient between the frequency of occurrence of each codon in mRNAs and mRNA synthesis rates, which we termed the codon occurrence to mRNA production correlation coefficient (CPC). In S. pombe, we also calculated the CPC values based on the genome-wide ChIP data for RNAPII by Bahler and colleagues [55]. The analysis led to the following observations. First, CPC values obtained from all the analyzed datasets were highly correlated (Additional file 14: Figure S13). Second, the CPC values were significantly associated with tAI (cTE) as well as with nTE in both S. cerevisiae and S. pombe (Fig. 4a, b, Additional file 15: Figure S14A, B, Additional file 16: Figure S15A, B, and Table 4). Third, mRNA synthesis rates were positively correlated with the tAI_g values as well as optimal codon content (Additional file 15: Figures S14C, D, Additional file 16: Figure S15C, D, and Additional file 17: Figure S16). Fourth, the CPC values were positively correlated with the tAI values (Additional file 18: Figure S17A, D, G, Additional file 19: Figure S18A, D, and G). Fifth, the positive correlation between codon optimality and the CPC disappeared when +1 or +2 frameshifts were computationally introduced [10] (Additional file 18: Figure S17B, C, E, F, H, I, Additional file 19: Figure S18B, C, E, F, H, and I).

Table 4 Association between codon optimality and mRNA synthesis rates

Full size table

These results suggest that, in both S. cerevisiae and S. pombe, not only mRNA half-lives but also mRNA synthesis rates are correlated with codon optimality and that the long-appreciated association between codon optimality and mRNA abundance is due to regulation of both mRNA synthesis and degradation. The association between codon optimality and mRNA synthesis rates would be most readily explained by independent adaptations of codon usage and other gene features that modulate mRNA synthesis rates for elevated expression during evolution.

To investigate the association between codon optimality and mRNA synthesis rates in more detail, we computed the Spearman's partial correlations between the two variables controlling for mRNA abundance (Additional file 20: Table S2). With one exception in which we performed the analysis using optimal codon content in the nTE classification and the “Mata (5)” dataset, we found no significant positive partial correlations between codon optimality and mRNA synthesis rates once we controlled for mRNA abundance. This can be consistent with the relationship between codon optimality and mRNA synthesis rates being indirect in both organisms.

Association between codon optimality and DNA/RNA sequence motifs

The above analysis revealed a consistent correlation between aspects of mRNAs that promote gene expression. Cis-acting DNA/RNA sequence motifs are also critical determinants in the regulation of mRNA production and stability. It is possible that the association between codon optimality and mRNA production/stability is in part due to associations between codon optimality and such sequence elements. In S. cerevisiae, to our knowledge, identification of sequence elements that are associated with mRNA synthesis rates and/or half-lives based on 4sU/4tU labeling data has not been performed. Moreover, our attempt to identify such sequence motifs was not successful. Consistent with this, a recent study suggests that inherent transcript properties including codon content rather than specific sequence motifs account for a large fraction of variation in mRNA decay rates in this organism [62]. On the other hand, in S. pombe, Gagneur, Cramer, and colleagues have identified DNA/RNA sequence motifs that are associated with mRNA synthesis rates and/or half-lives based on their 4tU labeling data [58].

To address the question of whether the sequence elements and codon optimality have been under similar selective pressures for optimal gene expression, we compared the tAI_g values between genes containing the previously identified motifs [58] and those lacking them in S. pombe. Interestingly, we observed that the tAI_g values of genes with motifs that were located in promoter and/or 5' UTR regions and associated with long half-lives and/or fast synthesis rates are significantly greater than those of genes without them (Fig. 5a, c-g, k, l, and Table 5). On the other hand, there was no significant difference in the tAI_g values between genes containing motifs that are associated with short half-lives and those lacking them (Fig. 5b, h, i, and Table 5) with one exception of the TTAATGA motif located in 3' UTR (Fig. 5j and Table 5). Essentially the same results were obtained when optimal codon content was compared instead of the tAI_g values (Table 5).

Table 5 Association between codon optimality and DNA/RNA sequence motifs

Full size table

These results are consistent with the idea that codon usage and certain DNA/RNA sequence motifs have been under similar selective pressures. Interestingly, the 5' UTR elements are enriched near the 5' end of mRNA not near the start codon [58] and, therefore, are unlikely to define the start codon context, which is already known to be associated with codon optimality in certain organisms as well as with mRNA stability in S. pombe [21, 55, 63, 64]. Rather, the motifs may modulate 5 ' cap accessibility, which is associated with translation initiation efficiency [29, 65]. Because of the causal relationship between efficient translation initiation and mRNA stability [66, 67], these observations collectively suggest that the association between codon optimality and mRNA stability is at least in part attributable to similar selective pressures acting on codon optimality and 5' UTR sequence features that may modulate translation initiation efficiency.

Conclusion

Our study in S. cerevisiae and S. pombe suggests that the genome-wide association between codon optimality and mRNA stability is a conserved phenomenon. It also suggests that the association in S. pombe is at least in part attributable to independent adaptations of codon usage and other transcript features during evolution. Although this study did not find such transcript features in S. cerevisiae, it cannot be excluded that independent adaptations contribute to the association also in this organism. As has been performed in S. cerevisiae, it will be important to experimentally examine causality between the two variables in S. pombe. Determining relative contributions from direct mechanistic links and selective pressures will be a key step to a fuller understanding of mRNA stability control.

Methods

Data sources

Coding sequences and annotations of S. cerevisiae (version R64-1-1) and S. pombe (version asm294v2.26) were obtained from the Saccharomyces genome database [68], Pombase [69], and Ensembl Genomes [70]. Sources of RNA kinetic data are summarized in Tables 1 and 2. mRNA abundance data were taken from previous studies by Ito and colleagues [71], Cramer and colleagues [44, 48], Gresham and colleagues [46], and Bahler and colleagues [72]. The relative adaptiveness values for tRNA adaptation index (tAI) were taken from a previous study by Tuller and colleagues [73]. The gene-wise average tAI values (tAI_g) were computed using the codonR program developed by dos Reis and colleagues [59]. Classification of optimal and non-optimal codons was taken from a previous study by Frydman and colleagues [60]. Genome-wide ChIP data for RNAPII in S. pombe was taken from a previous study by Bahler and colleagues [55].

Data filtering and processing

Out of all 6717 annotated ORFs in S. cerevisiae, we included all 4789 nuclear-encoded ORFs that are annotated as “verified” (Additional file 21: Table S3) [68]. Out of all 5142 annotated ORFs in S. pombe, we included all nuclear-encoded ORFs except those designated as dubious and SPAC1556.06, which has multiple transcript isoforms [69]. This resulted in 5051 genes (Additional file 22: Table S4). From the RNA kinetic data by Gagneur, Cramer, and colleagues [58], we excluded multicistronic transcript units. We used “minute,” “molecule per minute per cell,” and “molecule per cell” as units of mRNA half-lives, synthesis rates, and abundance, respectively. When the values were expressed otherwise in the source data, we converted them accordingly. In our calculations, we used a total number of mRNAs per cell of 60,000 and that of 41,000 for S. cerevisiae and S. pombe, respectively [72, 74]. When mRNA synthesis rates were not available in the source data, based on the steady-state assumption [75], we computed them from mRNA half-lives and abundance using the equations λ = log2/hl and μ = m (α + λ), where λ is the mRNA decay rate [min⁻¹], hl is the mRNA half-life [min], μ is the mRNA synthesis rate [RNA min⁻¹ cell⁻¹], m is the mRNA abundance [cell⁻¹], and α is the cell growth rate [min⁻¹]. When mRNA abundance was not available in the source data, we used data obtained by Ito and colleagues [71] and those obtained by Bahler and colleagues [72] for S. cerevisiae and S. pombe, respectively. For the datasets in S. cerevisiae, we used mRNA abundance obtained in matched types of culture media. That is, we used mRNA abundance in the rich nutrient YPD medium for the “Brown (1),” “Brown (2),” “Hughes,” “Peltz,” “Pilpel,” “Struhl,” “Coller (1),” and “Coller (2)” datasets, whereas we used that in the minimal nutrient SC medium for the “Weis” dataset (Additional file 21: Table S3). For mRNA half-live data obtained via the rbp1-1 and rbp1-frb systems, where cell growth is halted, or those obtained without taking into account cell division, we set α to zero. For the “Weis” data, we computed α from a cell cycle length of 150 min using the equation α = log2/ccl, where ccl is the cell cycle length [min].

Statistical analyses and graphics

All statistical analyses were performed using R [76]. The chisq.test() function was used to perform chi-square tests. The cor.test() function was used to calculate Spearman's correlation coefficients. The wilcox.test() function was used to perform Wilcoxon rank sum tests. The pcor.test() function in the ppcor package was used to calculate Spearman's partial correlations. The heatscatter() function in the LSD package was used to draw scatterplots. The heatmap.2() function in the gplots package was modified to draw heatmaps.

Calculation of CSC and CPC

We calculated the CSC as described previously except that we used Spearman's correlation coefficients instead of Pearson's correlation coefficients [10]. We calculated the CPC by replacing mRNA half-lives with mRNA synthesis rates in the CSC calculation.

Motif analysis

The DNA/RNA motifs were taken from a previous study by Gagneur, Cramer, and colleagues [58]. The motifs TATTTAT, TATTTA and ATTTAT were combined into a group (denoted as TATTTAT). The motifs TTAATGA, TTAATG, and TAATGA were also combined (denoted as TTAATGA). Only significant motif instances were considered (P < 0.05). As a background gene set, we considered 3814 genes for which mRNA half-lives and synthesis rates were estimated. Information about the presence and absence of the motifs can be found in Additional file 23: Table S5.

Abbreviations

3' UTR:: Three prime untranslated region
4sU:: 4-thiouridine
4tU:: 4-thiouracil
5' UTR:: Five prime untranslated region
ChIP:: Chromatin immunoprecipitation
CPC:: Codon occurrence to mRNA production correlation coefficient
CSC:: Codon occurrence to mRNA stability correlation coefficient
cTE:: Classical translation efficiency
DRS:: Direct RNA sequencing
GRO:: Genomic run-on
nTE:: Normalized translation efficiency
RNAPII:: RNA polymerase II
tAI:: tRNA adaptation index

References

Hafner M, Landthaler M, Burger L, Khorshid M, Hausser J, Berninger P, Rothballer A, Ascano Jr M, Jungkamp AC, Munschauer M, et al. Transcriptome-wide identification of RNA-binding protein and microRNA target sites by PAR-CLIP. Cell. 2010;141(1):129–41.
Article CAS PubMed PubMed Central Google Scholar
Ray D, Kazan H, Cook KB, Weirauch MT, Najafabadi HS, Li X, Gueroussov S, Albu M, Zheng H, Yang A, et al. A compendium of RNA-binding motifs for decoding gene regulation. Nature. 2013;499(7457):172–7.
Article CAS PubMed PubMed Central Google Scholar
Goodarzi H, Najafabadi HS, Oikonomou P, Greco TM, Fish L, Salavati R, Cristea IM, Tavazoie S. Systematic discovery of structural elements governing stability of mammalian messenger RNAs. Nature. 2012;485(7397):264–8.
Article CAS PubMed PubMed Central Google Scholar
Rabani M, Kertesz M, Segal E. Computational prediction of RNA structural motifs involved in posttranscriptional regulatory processes. Proc Natl Acad Sci U S A. 2008;105(39):14885–90.
Article CAS PubMed PubMed Central Google Scholar
Trcek T, Larson DR, Moldon A, Query CC, Singer RH. Single-molecule mRNA decay measurements reveal promoter- regulated mRNA stability in yeast. Cell. 2011;147(7):1484–97.
Article CAS PubMed PubMed Central Google Scholar
Bregman A, Avraham-Kelbert M, Barkai O, Duek L, Guterman A, Choder M. Promoter elements regulate cytoplasmic mRNA decay. Cell. 2011;147(7):1473–83.
Article CAS PubMed Google Scholar
Mitchell SF, Parker R. Principles and properties of eukaryotic mRNPs. Mol Cell. 2014;54(4):547–58.
Article CAS PubMed Google Scholar
Geisberg JV, Moqtaderi Z, Fan X, Ozsolak F, Struhl K. Global analysis of mRNA isoform half-lives reveals stabilizing and destabilizing elements in yeast. Cell. 2014;156(4):812–24.
Article CAS PubMed PubMed Central Google Scholar
Feng L, Niu DK. Relationship between mRNA stability and length: an old question with a new twist. Biochem Genet. 2007;45(1–2):131–7.
Article CAS PubMed Google Scholar
Presnyak V, Alhusaini N, Chen YH, Martin S, Morris N, Kline N, Olson S, Weinberg D, Baker KE, Graveley BR, et al. Codon optimality is a major determinant of mRNA stability. Cell. 2015;160(6):1111–24.
Article CAS PubMed PubMed Central Google Scholar
Plotkin JB, Kudla G. Synonymous but not the same: the causes and consequences of codon bias. Nat Rev Genet. 2011;12(1):32–42.
Article CAS PubMed Google Scholar
Hershberg R, Petrov DA. Selection on codon bias. Annu Rev Genet. 2008;42:287–99.
Article CAS PubMed Google Scholar
Powell JR, Moriyama EN. Evolution of codon usage bias in Drosophila. Proc Natl Acad Sci U S A. 1997;94(15):7784–90.
Article CAS PubMed PubMed Central Google Scholar
Ikemura T. Correlation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genes: a proposal for a synonymous codon choice that is optimal for the E. coli translational system. J Mol Biol. 1981;151(3):389–409.
Article CAS PubMed Google Scholar
Ikemura T. Correlation between the abundance of yeast transfer RNAs and the occurrence of the respective codons in protein genes. Differences in synonymous codon choice patterns of yeast and Escherichia coli with reference to the abundance of isoaccepting transfer RNAs. J Mol Biol. 1982;158(4):573–97.
Article CAS PubMed Google Scholar
Moriyama EN, Powell JR. Codon usage bias and tRNA abundance in Drosophila. J Mol Evol. 1997;45(5):514–23.
Article CAS PubMed Google Scholar
Duret L. tRNA gene number and codon usage in the C. elegans genome are co-adapted for optimal translation of highly expressed genes. Trends Genet. 2000;16(7):287–9.
Article CAS PubMed Google Scholar
Andersson SG, Kurland CG. Codon preferences in free-living microorganisms. Microbiol Rev. 1990;54(2):198–210.
CAS PubMed PubMed Central Google Scholar
Bulmer M. The selection-mutation-drift theory of synonymous codon usage. Genetics. 1991;129(3):897–907.
CAS PubMed PubMed Central Google Scholar
Kudla G, Murray AW, Tollervey D, Plotkin JB. Coding-sequence determinants of gene expression in Escherichia coli. Science. 2009;324(5924):255–8.
Article CAS PubMed PubMed Central Google Scholar
Pop C, Rouskin S, Ingolia NT, Han L, Phizicky EM, Weissman JS, Koller D. Causal signals between codon bias, mRNA structure, and the efficiency of translation and elongation. Mol Syst Biol. 2014;10:770.
Article CAS PubMed PubMed Central Google Scholar
Ciandrini L, Stansfield I, Romano MC. Ribosome traffic on mRNAs maps to gene ontology: genome-wide quantification of translation initiation rates and polysome size regulation. PLoS Comput Biol. 2013;9(1), e1002866.
Article CAS PubMed PubMed Central Google Scholar
Chu D, Kazana E, Bellanger N, Singh T, Tuite MF, von der Haar T. Translation elongation can control translation initiation on eukaryotic mRNAs. EMBO J. 2014;33(1):21–34.
Article CAS PubMed Google Scholar
Gromadski KB, Rodnina MV. Kinetic determinants of high-fidelity tRNA discrimination on the ribosome. Mol Cell. 2004;13(2):191–200.
Article CAS PubMed Google Scholar
Lee TH, Blanchard SC, Kim HD, Puglisi JD, Chu S. The role of fluctuations in tRNA selection by the ribosome. Proc Natl Acad Sci U S A. 2007;104(34):13661–5.
Article CAS PubMed PubMed Central Google Scholar
Yu CH, Dang Y, Zhou Z, Wu C, Zhao F, Sachs MS, Liu Y. Codon usage influences the local rate of translation elongation to regulate co-translational protein folding. Mol Cell. 2015;59(5):744–54.
Article CAS PubMed PubMed Central Google Scholar
Gardin J, Yeasmin R, Yurovsky A, Cai Y, Skiena S, Futcher B. Measurement of average decoding rates of the 61 sense codons in vivo. Elife. 2014;3.
Koutmou KS, Radhakrishnan A, Green R. Synthesis at the speed of codons. Trends Biochem Sci. 2015;40(12):717–8.
Article CAS PubMed Google Scholar
Weinberg DE, Shah P, Eichhorn SW, Hussmann JA, Plotkin JB, Bartel DP. Improved ribosome-footprint and mRNA measurements provide insights into dynamics and regulation of yeast translation. Cell Rep. 2016;14(7):1787–99.
Article CAS PubMed Google Scholar
Hussmann JA, Patchett S, Johnson A, Sawyer S, Press WH. Understanding biases in ribosome profiling experiments reveals signatures of translation dynamics in yeast. PLoS Genet. 2015;11(12), e1005732.
Article CAS PubMed PubMed Central Google Scholar
Hoekema A, Kastelein RA, Vasser M, de Boer HA. Codon replacement in the PGK1 gene of Saccharomyces cerevisiae: experimental approach to study the role of biased codon usage in gene expression. Mol Cell Biol. 1987;7(8):2914–24.
Article CAS PubMed PubMed Central Google Scholar
Caponigro G, Muhlrad D, Parker R. A small segment of the MAT alpha 1 transcript promotes mRNA decay in Saccharomyces cerevisiae: a stimulatory role for rare codons. Mol Cell Biol. 1993;13(9):5141–8.
Article CAS PubMed PubMed Central Google Scholar
Hennigan AN, Jacobson A. Functional mapping of the translation-dependent instability element of yeast MATalpha1 mRNA. Mol Cell Biol. 1996;16(7):3833–43.
Article CAS PubMed PubMed Central Google Scholar
Boel G, Letso R, Neely H, Price WN, Wong KH, Su M, Luff JD, Valecha M, Everett JK, Acton TB, et al. Codon influence on protein expression in E. coli correlates with mRNA levels. Nature. 2016;529(7586):358–63.
Article CAS PubMed PubMed Central Google Scholar
Mishima Y, Tomari Y. Codon usage and 3' UTR length determine maternal mRNA stability in zebrafish. Mol Cell. 2016;61(6):874–85.
Article CAS PubMed Google Scholar
Bazzini AA, Del Viso F, Moreno-Mateos MA, Johnstone TG, Vejnar CE, Qin Y, Yao J, Khokha MK, Giraldez AJ. Codon identity regulates mRNA stability and translation efficiency during the maternal-to-zygotic transition.EMBO J. 2016;35(19):2087-103.
Goodarzi H, Nguyen HC, Zhang S, Dill BD, Molina H, Tavazoie SF. Modulated expression of specific tRNAs drives gene expression and cancer progression. Cell. 2016;165(6):1416–27.
Article CAS PubMed Google Scholar
Sipiczki M. Where does fission yeast sit on the tree of life? Genome Biol. 2000;1(2):REVIEWS1011.
Article CAS PubMed PubMed Central Google Scholar
Perez-Ortin JE, de Miguel-Jimenez L, Chavez S. Genome-wide studies of mRNA synthesis and degradation in eukaryotes. Biochim Biophys Acta. 2012;1819(6):604–15.
Article CAS PubMed Google Scholar
Nonet M, Scafe C, Sexton J, Young R. Eucaryotic RNA polymerase conditional mutant that rapidly ceases mRNA synthesis. Mol Cell Biol. 1987;7(5):1602–11.
Article CAS PubMed PubMed Central Google Scholar
Haruki H, Nishikawa J, Laemmli UK. The anchor-away technique: rapid, conditional establishment of yeast mutant phenotypes. Mol Cell. 2008;31(6):925–32.
Article CAS PubMed Google Scholar
Cleary MD, Meiering CD, Jan E, Guymon R, Boothroyd JC. Biosynthetic labeling of RNA with uracil phosphoribosyltransferase allows cell-specific microarray analysis of mRNA synthesis and decay. Nat Biotechnol. 2005;23(2):232–7.
Article CAS PubMed Google Scholar
Dolken L, Ruzsics Z, Radle B, Friedel CC, Zimmer R, Mages J, Hoffmann R, Dickinson P, Forster T, Ghazal P, et al. High-resolution gene expression profiling for simultaneous kinetic parameter analysis of RNA synthesis and decay. RNA. 2008;14(9):1959–72.
Article CAS PubMed PubMed Central Google Scholar
Miller C, Schwalb B, Maier K, Schulz D, Dumcke S, Zacher B, Mayer A, Sydow J, Marcinowski L, Dolken L, et al. Dynamic transcriptome analysis measures rates of mRNA synthesis and decay in yeast. Mol Syst Biol. 2011;7:458.
Article CAS PubMed PubMed Central Google Scholar
Munchel SE, Shultzaberger RK, Takizawa N, Weis K. Dynamic profiling of mRNA turnover reveals gene-specific and system-wide regulation of mRNA decay. Mol Biol Cell. 2011;22(15):2787–95.
Article CAS PubMed PubMed Central Google Scholar
Neymotin B, Athanasiadou R, Gresham D. Determination of in vivo RNA kinetics using RATE-seq. RNA. 2014;20(10):1645–52.
Article CAS PubMed PubMed Central Google Scholar
Sun M, Schwalb B, Schulz D, Pirkl N, Etzold S, Lariviere L, Maier KC, Seizl M, Tresch A, Cramer P. Comparative dynamic transcriptome analysis (cDTA) reveals mutual feedback between mRNA synthesis and degradation. Genome Res. 2012;22(7):1350–9.
Article CAS PubMed PubMed Central Google Scholar
Sun M, Schwalb B, Pirkl N, Maier KC, Schenk A, Failmezger H, Tresch A, Cramer P. Global analysis of eukaryotic mRNA degradation reveals Xrn1-dependent buffering of transcript levels. Mol Cell. 2013;52(1):52–62.
Article CAS PubMed Google Scholar
Herrick D, Parker R, Jacobson A. Identification and comparison of stable and unstable mRNAs in Saccharomyces cerevisiae. Mol Cell Biol. 1990;10(5):2269–84.
Article CAS PubMed PubMed Central Google Scholar
Wang Y, Liu CL, Storey JD, Tibshirani RJ, Herschlag D, Brown PO. Precision and functional specificity in mRNA decay. Proc Natl Acad Sci U S A. 2002;99(9):5860–5.
Article CAS PubMed PubMed Central Google Scholar
Duttagupta R, Tian B, Wilusz CJ, Khounh DT, Soteropoulos P, Ouyang M, Dougherty JP, Peltz SW. Global analysis of Pub1p targets reveals a coordinate control of gene expression through modulation of binding and stability. Mol Cell Biol. 2005;25(13):5499–513.
Article CAS PubMed PubMed Central Google Scholar
Holstege FC, Jennings EG, Wyrick JJ, Lee TI, Hengartner CJ, Green MR, Golub TR, Lander ES, Young RA. Dissecting the regulatory circuitry of a eukaryotic genome. Cell. 1998;95(5):717–28.
Article CAS PubMed Google Scholar
Grigull J, Mnaimneh S, Pootoolal J, Robinson MD, Hughes TR. Genome-wide analysis of mRNA stability using transcription inhibitors and microarrays reveals posttranscriptional control of ribosome biogenesis factors. Mol Cell Biol. 2004;24(12):5534–47.
Article CAS PubMed PubMed Central Google Scholar
Shalem O, Dahan O, Levo M, Martinez MR, Furman I, Segal E, Pilpel Y. Transient transcriptional responses to stress are generated by opposing effects of mRNA production and degradation. Mol Syst Biol. 2008;4:223.
Article CAS PubMed PubMed Central Google Scholar
Lackner DH, Beilharz TH, Marguerat S, Mata J, Watt S, Schubert F, Preiss T, Bahler J. A network of multiple regulatory layers shapes gene expression in fission yeast. Mol Cell. 2007;26(1):145–55.
Article CAS PubMed PubMed Central Google Scholar
Amorim MJ, Cotobal C, Duncan C, Mata J. Global coordination of transcriptional control and mRNA decay during cellular differentiation. Mol Syst Biol. 2010;6:380.
Article CAS PubMed PubMed Central Google Scholar
Hasan A, Cotobal C, Duncan CD, Mata J. Systematic analysis of the role of RNA-binding proteins in the regulation of RNA stability. PLoS Genet. 2014;10(11), e1004684.
Article CAS PubMed PubMed Central Google Scholar
Eser P, Wachutka L, Maier KC, Demel C, Boroni M, Iyer S, Cramer P, Gagneur J. Determinants of RNA metabolism in the Schizosaccharomyces pombe genome. Mol Syst Biol. 2016;12(2):857.
Article CAS PubMed PubMed Central Google Scholar
dos Reis M, Savva R, Wernisch L. Solving the riddle of codon usage preferences: a test for translational selection. Nucleic Acids Res. 2004;32(17):5036–44.
Article CAS PubMed PubMed Central Google Scholar
Pechmann S, Frydman J. Evolutionary conservation of codon optimality reveals hidden signatures of cotranslational folding. Nat Struct Mol Biol. 2013;20(2):237–43.
Article CAS PubMed Google Scholar
Radhakrishnan A, Chen YH, Martin S, Alhusaini N, Green R, Coller J. The DEAD-box protein Dhh1p couples mRNA decay and translation by monitoring codon optimality. Cell. 2016;167(1):122-32.
Neymotin B, Ettore V, Gresham D. Multiple transcript properties related to translation affect mRNA degradation rates in Saccharomyces cerevisiae. G3 (Bethesda). 2016. doi:10.1534/g3.116.032276. [Epub ahead of print] PMID: 27633789.
Miyasaka H. The positive relationship between codon usage bias and translation initiation AUG context in Saccharomyces cerevisiae. Yeast. 1999;15(8):633–7.
Article CAS PubMed Google Scholar
Miyasaka H. Translation initiation AUG context varies with codon usage bias and gene length in Drosophila melanogaster. J Mol Evol. 2002;55(1):52–64.
Article CAS PubMed Google Scholar
Godefroy-Colburn T, Ravelonandro M, Pinck L. Cap accessibility correlates with the initiation efficiency of alfalfa mosaic virus RNAs. Eur J Biochem. 1985;147(3):549–52.
Article CAS PubMed Google Scholar
Schwartz DC, Parker R. Mutations in translation initiation factors lead to increased rates of deadenylation and decapping of mRNAs in Saccharomyces cerevisiae. Mol Cell Biol. 1999;19(8):5247–56.
Article CAS PubMed PubMed Central Google Scholar
LaGrandeur T, Parker R. The cis acting sequences responsible for the differential decay of the unstable MFA2 and stable PGK1 transcripts in yeast include the context of the translational start codon. RNA. 1999;5(3):420–33.
Article CAS PubMed PubMed Central Google Scholar
Saccharomyces Genome Database. http://downloads.yeastgenome.org/. Accessed 9 May 2016.
Pombase. http://www.pombase.org/. Accessed 9 May 2016.
Ensembl Genomes. ftp://ftp.ensemblgenomes.org/. Accessed 9 May 2016.
Miura F, Kawaguchi N, Yoshida M, Uematsu C, Kito K, Sakaki Y, Ito T. Absolute quantification of the budding yeast transcriptome by means of competitive PCR between genomic and complementary DNAs. BMC Genomics. 2008;9:574.
Article CAS PubMed PubMed Central Google Scholar
Marguerat S, Schmidt A, Codlin S, Chen W, Aebersold R, Bahler J. Quantitative analysis of fission yeast transcriptomes and proteomes in proliferating and quiescent cells. Cell. 2012;151(3):671–83.
Article CAS PubMed PubMed Central Google Scholar
Tuller T, Waldman YY, Kupiec M, Ruppin E. Translation efficiency is determined by both codon bias and folding energy. Proc Natl Acad Sci U S A. 2010;107(8):3645–50.
Article CAS PubMed PubMed Central Google Scholar
Zenklusen D, Larson DR, Singer RH. Single-RNA counting reveals alternative modes of gene expression in yeast. Nat Struct Mol Biol. 2008;15(12):1263–71.
Article CAS PubMed PubMed Central Google Scholar
Perez-Ortin JE, Alepuz PM, Moreno J. Genomics and gene transcription kinetics in yeast. Trends Genet. 2007;23(5):250–7.
Article CAS PubMed Google Scholar
R Core Team. R: A Language and Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing; 2015. http://www.R-project.org/.
Google Scholar
Pelechano V, Perez-Ortin JE. There is a steady-state transcriptome in exponentially growing yeast cells. Yeast. 2010;27(7):413–22.
Article CAS PubMed Google Scholar

Download references

Acknowledgement

We thank Patrick Cramer and Bjoern Schwalb (Max Planck Institute for Biophysical Chemistry) for processed RNA kinetic data. This work was financially supported by the Howard Hughes Medial Institute (to RP).

Funding

This work was supported by the Howard Hughes Medial Institute (to RP).

Availability of data materials

The datasets supporting the conclusions of this article are included within the article and its additional files.

Authors’ contributions

YH and RP conceived and designed the study. YH performed the analysis. YH and RP wrote the manuscript. Both authors read and approved the final manuscript.

Competing interests

The authors declare that they have no competing interests.

Consent for publication

Not applicable.

Ethics approval and consent to participate

Not applicable.

Author information

Authors and Affiliations

Department of Chemistry and Biochemistry, University of Colorado Boulder, Boulder, CO, 80303, USA
Yuriko Harigaya & Roy Parker
Howard Hughes Medical Institute, University of Colorado Boulder, Boulder, CO, 80303, USA
Yuriko Harigaya & Roy Parker

Authors

Yuriko Harigaya
View author publications
You can also search for this author in PubMed Google Scholar
Roy Parker
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yuriko Harigaya.

Additional files

Additional file 1: Figure S1.

Comparison of mRNA half-life measurements in S. cerevisiae. The pairwise scatterplots compare mRNA half-lives in 14 datasets. The datasets are ordered as in Fig. 1c. The upper triangle panels show Spearman correlation coefficients (top) and P values (bottom). The axis range is from 0 to 60 min. (PDF 1243 kb)

Additional file 2: Figure S3.

Comparison of mRNA synthesis rates in S. cerevisiae. The pairwise scatterplots compare mRNA synthesis rates in 14 datasets. The datasets are ordered as in Fig. 2c. The upper triangle panels show Spearman correlation coefficients (top) and P values (bottom). The axis range is from 0 to 1 RNA per minute. (PDF 1203 kb)

Additional file 3: Figure S2.

Comparison of mRNA half-life measurements in S. pombe. The pairwise scatterplots compare mRNA half-lives in seven datasets. The datasets are ordered as in Fig. 1d. The upper triangle panels show Spearman correlation coefficients (top) and P values (bottom). The axis range is from 0 to 120 min. (PDF 456 kb)

Additional file 4: Figure S4.

Comparison of mRNA synthesis rates in S. pombe. The pairwise scatterplots compare mRNA half-lives in seven datasets. The datasets are ordered as in Fig. 2d. The upper triangle panels show Spearman correlation coefficients (top) and P values (bottom). The axis range is from 0 to 1 RNA per minute. (PDF 434 kb)

Additional file 5: Figure S5.

Comparison of mRNA synthesis rates with ChIP intensity signals for RNAPII in S. pombe. (A-G) Scatterplot comparing mRNA synthesis rates [RNA/min] in seven datasets and RNAPII ChIP signals in an arbitrary unit (a. u.) obtained by Bahler and colleagues [55]. Spearman's ρ and P value are shown. (PDF 984 kb)

Additional file 6: Figure S6.

Correlations between mRNA half-lives, synthesis rates, and abundance in S. cerevisiae. (A) Scatterplot comparing mRNA half-lives and abundance in the “Cramer (1)” data. Spearman's ρ and P value are shown. (B) Same as (A) but for the “Cramer (2)” data. (C) Same as (A) but for the “Gresham” data. (D) Scatterplot comparing mRNA synthesis rates and abundance in the “Cramer (1)” data. (E) Same as (D) but for the “Cramer (2)” data. (F) Same as (D) but for the “Gresham” data. (G) Scatterplot comparing mRNA half-lives and synthesis rates in the “Cramer (1)” data. (H) Same as (G) but for the “Cramer (2)” data. (I) Same as (G) but for the “Gresham” data. (PDF 1187 kb)

Additional file 7: Figure S7.

Correlations between mRNA half-lives, synthesis rates, and abundance in S. pombe. (A) Scatterplot comparing mRNA half-lives in the “Mata (5)” data and mRNA abundance [RNA/cell] obtained by Bahler and colleagues [72]. Spearman's ρ and P value are shown. (B) Same as (A) but for mRNA half-lives in the “Gagneur” data. (C) Scatterplot comparing mRNA synthesis rates computed from the “Mata (5)” data and mRNA abundance as shown in (A). (D) Same as (C) but for mRNA synthesis rates in the “Gagneur” data. (E) Scatterplot comparing mRNA half-lives and synthesis rates in the “Mata (5)” data. (F) Same as (E) but for the “Gagneur” data. (PDF 818 kb)

Additional file 8: Figure S8.

Correlations between the CSC values obtained from different RNA kinetic datasets in S. cerevisiae (“Cramer (1),” “Cramer (2),” “Gresham,” and “Coller (1)”) (A-F) and S. pombe (“Mata (5)” and “Gagneur”) (G). Spearman's ρ and P value are shown. The circles and cross signs represent optimal and non-optimal codons, respectively. (PDF 38 kb)

Additional file 9: Figure S9.

Codon optimality and mRNA half-lives are significantly associated in S. cerevisiae and S. pombe. (A) The CSC plotted for each codon based on S. cerevisiae mRNA half-lives in the “Cramer (1)” dataset. The white and gray bars represent optimal and non-optimal codons, respectively. The classification of codon optimality is based on the S. cerevisiae cTE. (B) Same as (A) but for the “Cramer (2)” dataset. (C) The CSC plotted for each codon based on S. pombe mRNA half-lives in the “Mata (5)” dataset. The classification of codon optimality is based on the S. pombe cTE. (D) Scatterplot comparing tAI_g and mRNA half-lives in the “Cramer (1)” dataset in S. cerevisiae. Spearman's ρ and P value are shown. (E) Same as (D) but for the “Cramer (2)” data. (F) Same as (D) but for the “Mata (5)” dataset in S. pombe. (PDF 427 kb)

Additional file 10: Figure S10.

Correlations between optimal codon content and mRNA half-lives in S. cerevisiae and S. pombe. (A) Scatterplot comparing optimal codon content based on the cTE classification and mRNA half-lives in the “Cramer (1)” data in S. cerevisiae. Spearman's ρ and P value are shown. (B) Same as (A) but for the “Cramer (2)” data. (C) Same as (A) but for the “Gresham” data. (D) Same as (A) but based on the nTE classification. (E) Same as (D) but for the “Cramer (2)” data. (F) Same as (D) but for the “Gresham” data. (G) Same as (A) but for the “Mata (5)” data in S. pombe. (H) Same as (G) but for the “Gagneur” data. (I) Same as (G) but based on the nTE classification. (J) Same as (I) but for the “Gagneur” data. (PDF 1341 kb)

Additional file 11: Figure S11.

Introduction of +1 and +2 frameshifts eliminates the positive correlation between codon optimality and the CSC in S. cerevisiae. (A) Scatterplot comparing the tAI values and the CSC based on the “Cramer (1)” data (ρ = 0.76, P = 1.7 × 10⁻¹²). The circles and cross signs represent optimal and non-optimal codons, respectively. (B) Same as (A) but upon introduction of +1 frameshifts (ρ = −0.12, P = 0.37). (C) Same as (A) but upon introduction of +2 frameshifts (ρ = −0.34, P = 7.7 × 10⁻³). (D) Same as (A) but based on the “Cramer (2)” data (ρ = 0.62, P = 9.3 × 10⁻⁸). (E) Same as (B) but based on the “Cramer (2)” data (ρ = −0.16, P = 0.21). (F) Same as (C) but based on the “Cramer (2)” data (ρ = −0.29, P = 0.02). (G) Same as (A) but based on the “Gresham” data (ρ = 0.58, P = 1.0 × 10⁻⁶). (H) Same as (B) but based on the “Gresham” data (ρ = 0.03, P = 0.76). (I) Same as (C) but based on the “Gresham” data (ρ = −0.23, P = 0.07). (PDF 29 kb)

Additional file 12: Figure S12.

Introduction of +1 and +2 frameshifts eliminates the positive correlation between codon optimality and the CSC in S. pombe. (A) Scatterplot comparing the tAI values and the CSC based on the “Mata (5)” data (ρ = 0.85, P = 7.6 × 10⁻¹⁸). The circles and cross signs represent optimal and non-optimal codons, respectively. (B) Same as (A) but upon introduction of +1 frameshifts (ρ = −0.16, P = 0.22). (C) Same as (A) but upon introduction of +2 frameshifts (ρ = −0.30, P = 0.02). (D) Same as (A) but based on the “Gagneur” data (ρ = 0.81, P = 4.6 × 10⁻¹⁵). (E) Same as (B) but based on the “Gagneur” data (ρ = −0.16, P = 0.02). (F) Same as (C) but based on the “Gagneur” data (ρ = −0.37, P = 3.8 × 10⁻³). (PDF 24 kb)

Additional file 13: Table S1.

Spearman's correlation coefficients and Spearman's partial correlations controlling for mRNA abundance to assess an association between codon optimality and mRNA half-lives. (XLS 31 kb)

Additional file 14: Figure S13.

Correlations between the CPC values obtained from different RNA kinetic datasets in S. cerevisiae (“Cramer (1),” “Cramer (2),” and “Gresham”) (A-C) and S. pombe (“Mata (5)” and “Gagneur”) as well as those obtained from the RNAPII ChIP data in S. pombe (“Bahler”) (D-F) [55]. Spearman's ρ and P value are shown. The circles and cross signs represent optimal and non-optimal codons, respectively. (PDF 36 kb)

Additional file 15: Figure S14.

Codon optimality and mRNA synthesis rates are significantly associated in S. cerevisiae. (A) The CPC plotted for each codon based on mRNA synthesis rates in the “Cramer (1)” dataset. The white and gray bars represent optimal and non-optimal codons, respectively. The classification of codon optimality is based on the S. cerevisiae cTE. (B) Same as (A) but based on the “Cramer (2)” dataset. (C) Scatterplot comparing tAI_g and mRNA synthesis rates in the “Cramer (1)” dataset. Spearman's ρ and P value are shown. (D) Same as (C) but for the “Cramer (2)” dataset. (PDF 275 kb)

Additional file 16: Figure S15.

Codon optimality and mRNA synthesis rates are significantly associated in S. pombe. (A) The CPC plotted for each codon based on the RNAPII ChIP signals by Bahler and colleagues [55]. The white and gray bars represent optimal and non-optimal codons, respectively. The classification of codon optimality is based on the S. pombe cTE. (B) Same as (A) but for the CPC based on mRNA synthesis rates in the “Mata (5)” dataset. (C) Scatterplot comparing tAI_g and RNAPII ChIP signals by Bahler and colleagues. Spearman's ρ and P value are shown. (D) Same as (C) but for mRNA synthesis rates in the “Mata (5)” dataset. (PDF 323 kb)

Additional file 17: Figure S16.

Correlations between optimal codon content and mRNA synthesis rates in S. cerevisiae and S. pombe. (A) Scatterplot comparing optimal codon content based on the cTE classification and mRNA synthesis rates in the “Cramer (1)” data in S. cerevisiae. Spearman's ρ and P value are shown. (B) Same as (A) but for the “Cramer (2)” data. (C) Same as (A) but for the “Gresham” data. (D) Same as (A) but based on the nTE classification. (E) Same as (D) but for the “Cramer (2)” data. (F) Same as (D) but for the “Gresham” data. (G) Same as (A) but for the S. pombe RNAPII ChIP data by Bahler and colleagues. (H) Same as (A) but for the “Mata (5)” data in S. pombe. (I) Same as (G) but for the “Gagneur” data. (J) Same as (G) but based on the nTE classification. (K) Same as (J) but for the “Mata (5)” data. (L) Same as (J) but for the “Gagneur” data. (PDF 1653 kb)

Additional file 18: Figure S17.

Introduction of +1 and +2 frameshifts eliminates the positive correlation between codon optimality and the CPC in S. cerevisiae. (A) Scatterplot comparing the tAI values and the CPC based on the “Cramer (1)” data (ρ = 0.79, P = 6.6 × 10⁻¹⁴). The circles and cross signs represent optimal and non-optimal codons, respectively. (B) Same as (A) but upon introduction of +1 frameshifts (ρ = −0.10, P = 0.44). (C) Same as (A) but upon introduction of +2 frameshifts (ρ = −0.28, P = 0.03). (D) Same as (A) but based on the “Cramer (2)” data (ρ = 0.74, P = 1.4 × 10⁻¹¹). (E) Same as (B) but based on the “Cramer (2)” data (ρ = −0.16, P = 0.22). (F) Same as C but based on the “Cramer (2)” data (ρ = −0.05, P = 0.72). (G) Same as (A) but based on the “Gresham” data (ρ = 0.75, P = 3.4 × 10⁻¹²). (H) Same as (B) but based on the “Gresham” data (ρ = −0.17, P = 0.18). (I) Same as (C) but based on the “Gresham” data (ρ = −0.15, P = 0.26). (PDF 29 kb)

Additional file 19: Figure S18.

Introduction of +1 and +2 frameshifts eliminates the positive correlation between codon optimality and the CPC in S. pombe. (A) Scatterplot comparing the tAI values and the CPC based on the RNAPII ChIP signals by Bahler and colleagues (ρ = 0.80, P = 5.5 × 10⁻¹⁵). The circles and cross signs represent optimal and non-optimal codons, respectively. (B) Same as (A) but upon introduction of +1 frameshifts (ρ = −0.14, P = 0.29). (C) Same as (A) but upon introduction of +2 frameshifts (ρ = −0.22, P = 0.09). (D) Same as (A) but for the CPC based on mRNA synthesis rates in the “Mata (5)” data (ρ = 0.83, P = 1.4 × 10⁻¹⁶). (E) Same as (B) but for the CPC based on mRNA synthesis rates in the “Mata (5)” data (ρ = −0.15, P = 0.25). (F) Same as (C) but for the CPC based on mRNA synthesis rates in the “Mata (5)” data (ρ = −0.19, P = 0.13). (G) Same as (D) but based on the “Gagneur” data (ρ = 0.82, P = 4.2 × 10⁻¹⁶). (H) Same as (E) but based on the “Gagneur” data (ρ = −0.20, P = 0.11). (I) Same as (F) but based on the “Gagneur” data (ρ = −0.28, P = 0.03). (PDF 29 kb)

Additional file 20: Table S2.

Spearman's correlation coefficients and Spearman's partial correlations controlling for mRNA abundance to assess an association between codon optimality and mRNA synthesis rates. (XLS 31 kb)

Additional file 21: Table S3.

tAI_g values, percent optimal codons, mRNA half-lives synthesis rates, and abundance in S. cerevisiae. (XLS 3062 kb)

Additional file 22: Table S4.

tAI_g values, percent optimal codons, mRNA half-lives, synthesis rates, and abundance in S. pombe. (XLS 2000 kb)

Additional file 23: Table S5.

Presence (1) and absence (0) of DNA/RNA sequence motifs in S. pombe. (XLS 538 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Harigaya, Y., Parker, R. Analysis of the association between codon optimality and mRNA stability in Schizosaccharomyces pombe . BMC Genomics 17, 895 (2016). https://doi.org/10.1186/s12864-016-3237-6

Download citation

Received: 14 May 2016
Accepted: 01 November 2016
Published: 08 November 2016
DOI: https://doi.org/10.1186/s12864-016-3237-6

Analysis of the association between codon optimality and mRNA stability in Schizosaccharomyces pombe

Abstract

Background

Results

Conclusions

Background

Results and discussion

Comparison of previously published RNA kinetic datasets

Correlations between mRNA half-lives, synthesis rates, and abundance

Association between codon optimality and mRNA stability

Association between codon optimality and mRNA synthesis rates

Association between codon optimality and DNA/RNA sequence motifs

Conclusion

Methods

Data sources

Data filtering and processing

Statistical analyses and graphics

Calculation of CSC and CPC

Motif analysis

Abbreviations

References

Acknowledgement

Funding

Availability of data materials

Authors’ contributions

Competing interests

Consent for publication

Ethics approval and consent to participate

Author information

Authors and Affiliations

Corresponding author

Additional files

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Genomics

Contact us