DNA methylation and gene expression in Mimulus guttatus
© Colicchio et al. 2015
Received: 29 January 2015
Accepted: 29 May 2015
Published: 7 July 2015
The presence of methyl groups on cytosine nucleotides across an organism’s genome (methylation) is a major regulator of genome stability, crossing over, and gene regulation. The capacity for DNA methylation to be altered by environmental conditions, and potentially passed between generations, makes it a prime candidate for transgenerational epigenetic inheritance. Here we conduct the first analysis of the Mimulus guttatus methylome, with a focus on the relationship between DNA methylation and gene expression.
We present a whole genome methylome for the inbred line Iron Mountain 62 (IM62). DNA methylation varies across chromosomes, genomic regions, and genes. We develop a model that predicts gene expression based on DNA methylation (R2 = 0.2). Post hoc analysis of this model confirms prior relationships, and identifies novel relationships between methylation and gene expression. Additionally, we find that DNA methylation is significantly depleted near gene transcriptional start sites, which may explain the recently discovered elevated rate of recombination in these same regions.
The establishment here of a reference methylome will be a useful resource for the continued advancement of M. guttatus as a model system. Using a model-based approach, we demonstrate that methylation patterns are an important predictor of variation in gene expression. This model provides a novel approach for differential methylation analysis that generates distinct and testable hypotheses regarding gene expression.
DNA cytosine methylation is an epigenetic modification that acts in conjunction with histone modification and small RNAs to regulate gene expression [1–3] and control transposable elements [4, 5]. In addition, DNA methylation appears to alter mutation rates  and to decrease rates of recombination . It is found in organisms spanning the eukaryotic phylogeny [8, 9], and can occur in many sequence contexts. In plants, cytosine methylation can be found in CG, CHG, or CHH contexts, where H is any nucleotide besides G . It appears that much of the methylome is stable within an individual; however, the methylome does exhibit predictable plastic responses to developmental and environmental cues [11, 12].
Recent work has greatly expanded our knowledge of the mechanisms involved in maintaining and modifying DNA methylation in plants [13–18], yet we still do not fully understand how specific patterns of DNA methylation in and near coding sequences control gene expression. In Arabidopsis thaliana, CG DNA methylation in regulatory sequences is negatively correlated with gene expression [3, 19], possibly through limiting promoter accessibility. Contrastingly, gene body CG methylation is elevated in moderate to highly expressed genes [3, 10, 20], potentially though the removal of histone variant H2A.Z . Similar patterns of association between the distribution of plant CG methylation and gene expression have been found in the wild rice , tomato , and maize . Additionally, Arabidopsis genes within differentially methylated regions tended to be more highly expressed in individuals with increased CG methylation, but lower in individuals with increased non-CG (CHG and CHH) methylation . However, the interaction between gene expression and different forms of DNA methylation in and around genes has not been fully explored. For example, the impact of non-CG methylation on gene expression is especially understudied, despite its established role in regulating transposable elements through pre- and post-transcriptional silencing .
The standard method for characterizing genomic patterns of DNA methylation is to classify genes into methylation quantiles and then compare gene expression across these groups [3, 20, 22, 26–29]. Here, we adopt an explicit model-based approach, predicting gene expression from gene methylation and other basic gene-specific features (exon length, intron length, and exon number). We compare the methylome of an inbred line, to gene expression from a distinct recombinant inbred line, and test how well DNA methylation, in combination with other stable genetic factors, predict gene expression across lines and tissue types. The explanatory power of stable epigenetic variation on gene expression is relatively unknown (although see  for model-based approaches to predicting gene expression via promoter motifs in Saccharomyces cerevisiae, and  for a Sanger sequencing approach to gene expression modeling based on histone and DNA methylation in rice). With the model-based approach presented here, we are able to assess the scale to which constitutive epigenetic variation effects global gene expression, and the patterns of DNA methylation through which this regulation is manifest.
Previous studies of Mimulus guttatus have demonstrated transgenerational epigenetic inheritance [32–35]. Herbivore induced defensive traits can be transmitted between generations, and the observed transcriptional basis of this response , has made it a promising model system in the burgeoning field of ecological epigenetics [36–39]. However, along with identifying transmissible epigenetic marks, it is vital to understand the role that stable epigenetic regulation has on gene expression. Here we present the first M. guttatus methylome. We utilize a novel modeling approach to untangle the complex interactions between methylation and gene expression. We show that non-CG gene body methylation may have a significant effect on gene expression despite occurring at relatively low levels. Utilizing a GO term enrichment approach, we demonstrate that certain functional categories are over-represented in genes with high gene body CG methylation. We provide evidence that there are differences in methylation and gene expression between chromosomes, such that mean gene expression is significantly lower across some chromosomes than others. Finally, we look at transcriptional start sites across the genome, where recent evidence suggests increased recombination in M. guttatus , and find a corresponding decrease in DNA methylation.
DNA extraction and bisulfite sequencing
We germinated seeds from the M. guttatus Iron Mountain inbred line, IM62, the line that was sequenced to establish the M. guttatus reference genome  ( http://phytozome.jgi.doe.gov). When the second leaf pair of seedlings was just visible we collected leaf tissue from multiple seedlings, flash froze it in liquid nitrogen, and stored it at −80 °C. We preformed DNA extractions using a CTAB protocol . We pooled DNA from multiple seedlings before library construction in order to limit the effects of aberrant intra-individual variation . From this pooled sample we generated sequencing template for whole genome bisulfite sequencing (WGBS) following the PBAT (Post-Bisulfite Adaptor Tagging) protocol . With 1 ng of unmethylated lambda DNA obtained from Promega used as a spike-in control for conversion efficiency, 100 ng of genomic DNA from M. guttatus was treated with bisulfite using EZ DNA Methylation kit from Zymo Research. Two rounds of random primer extension for tagging bisulfite treated DNA with adaptors were performed using primers for single-end library construction as described in . The concentration of templates was determined by qPCR with Library Quantification Kits from KAPA biosystems. A single lane of 100 cycle reactions on HiSeq 2500 was assigned for the library sequencing.
We used the software BMap  (http://itolab.med.kyushu-u.ac.jp/BMap/index.html) to map bisulifte treated reads to the M. guttatus v2.0 reference genome (http://phytozome.jgi.doe.gov). In short, BMap first searches candidate genomic loci for each read in two duplicated genome sequences, one with every C in the genome converted to a T (C2T), and one with G to A (G2A), using an approach called adaptive seed . Next BMap creates pairwise alignments between the read and original DNA sequence of every candidate loci, and calculates scores for each alignment allowing mismatches between T in the reads with C in the reference. Finally an alignment with the highest score is reported for each read. We used default parameters for mapping with BMap. Using alignments exported by BMap, methylation status for every cytosine in every read was called, and counts both supporting the methylated and unmethylated state are assigned for every cytosine residue of the reference genome. Methylation levels for CG, CHG and CHH contexts are exported to different files and analyzed independently.
Global methylome analysis
We estimated the number of total and methylated cytosines mapped across the genome on a per-nucleotide basis for the M. guttatus IM62 seedling methylome. Percent methylation was calculated for each 1 kb window across the genome for total methylation, as well as methylation in each of the three sequence contexts. Centromere positions were estimated from characteristic repeat sequences .
Gene methylation analysis
Using the M. guttatus v2.0 annotations , we calculated the percent methylation in each sequence context for each of the 24,130 annotated genes. Only the 17,043 for which we had gene expression data  were used for down-stream analysis. For each annotated gene we defined three regions: up-stream as the 1kb up-stream of the transcriptional start site, gene body as the transcribed portion of the gene, and down-stream as the 1kb downstream of the 3′ UTR. Gene expression values were generated previously by RNAseq from seedling tissue of genetically distinct M. guttatus – a recombinant inbred line derived from cross between divergent populations .
In order to determine if gene methylation and expression varied across chromosomes we preformed four ANOVAs with chromosome as an explanatory variable and CG, CHG, CHH, and log-transformed gene expression as response variables.
Gene ontology terms were already assigned to genes , and were utilized both to calculate the total number of GO terms per gene, as well as to perform a Fisher’s Exact test to determine what, if any, types of genes were enriched or depleted in our set of highly CG methylated genes, and our set of chromosomes exhibiting significantly reduced gene expression levels.
In order to choose a predictive gene expression model, we included methylation in each of three contexts, percent methylation in gene bodies, up-stream and down-stream regions, intron length (sum of all introns for a gene), exon length (sum of all exons for a gene), number of exons, and interaction terms up to the third degree. Gene length, intron size, and intron number are all known to be positively correlated with gene expression in plants , opposite the trend observed in animals . We used a Bayesian information criterion (BIC)  to inform our restricted maximum likelihood (REML) model selection (done in order to limit the number of parameters included in our model, and in turn reduce over fitting). Additionally, genes were parsed randomly into thirds, and parameters were tunes for each of these three groups independently. These models were then used to predict gene expression in the remaining to gene groups to provide 3-fold cross-validation . We Z-transformed values to make parameter estimates comparable, making a value of 0 represent the mean value for a variable, with positive or negative deviations reflecting the number of standard deviations a value is from the mean.
We identified transposable elements across the M. guttatus genome from the repeat-masked genome assembly . Genomic repeats larger than 100 base pairs were selected and percent methylation in all three sequence contexts was identified for these repeats.
Results and discussion
Mimulus guttatus methylation across sequence contexts and genomic regions
Proportion of cytosines methylated
1st 500bp of Gene Body
Methylation effect on gene expression
Summary of REML genetic architecture and methylation fit on log transformed gene expression
Root Mean Square Error
Mean of Response
Analysis of variance in gene expression predictive model
Sum of Squares
Prob > F
p < 0.0001
Sorted estimate of parameter effects on log transformed gene expression
Prob > |t|
Gene Body CHG2
Number of Exons*Intron Length
Exon Length* Intron Length
Gene Body CG * Exon Length
Gene Body CG2 * Exon Length
Gene Body CG*Gene Body CHH
Up-stream CHH* Percent CG
Exon Length2 * Number of Exons
Gene Body CHG
Gene Body CG2
Gene Body CG
Exon Length * Number of Exons
Gene Body CHH
Number of Exons
Gene Body CHG * Exon Length
Up-stream CG* Up-stream CHH
Up-stream CG* Gene Body CG
Exon Length * Intron Length * Number of Exons
Gene Body CHG3
Gene body CG methylation
Quadratic Effects: f 1 − 0.1m cg 2 = f 2. The squared gene body CG methylation term has the second largest effect size of any methylation term (after gene body CHG methylation) on gene expression, and leads to a predicted local m cg maximum for gene expression (due to it being a negative parabola, Fig. 6a, green line). This maximum is found at m cg = − 0.35 (47 %). As gene methylation increases or decreases relative to a moderate 45 % methylation, gene expression is expected to decrease (Fig. 6a; green line).
Cubic Effects: f 2 − 0.03m cg 3 = f 3. The cubed gene body CG methylation term is also negative; compared to our quadratic model, this leads to higher predicted gene expression for genes with lower than average methylation, and lower for genes with higher than average methylation. This slightly lowers the predicted local maximum of gene expression to m cg = − 0.43 (45 %) (Fig. 6a, blue line). These data agree with previous findings that there is a non-linear relationship between gene body CG methylation and gene expression with an intermediate optimum .
At u cg = 1 (82 %) we find that the local maximum for gene expression is at m cg = − 0.62 (40.2 %), while at u cg = − 1 (24.1 %) the local maximum for gene expression is at m cg = − 0.29 (48.8 %). As up-stream CG methylation decreases (Fig. 6b, from purple to yellow lines), gene body CG methylation is expected to have a more positive effect on gene expression.
While it has long been noted that regulatory region methylation is linked with reduced gene expression, here we find evidence that the difference in methylation between these regions also appears to correlate with gene expression. The negative interaction term between up-stream and gene body CG methylation predicts that distinctly different levels of methylation up-stream and within genes tends to correspond with higher levels of gene expression. When gene body CG methylation and regulatory methylation are both high, gene expression tends to be low (Fig. 6b, purple lines at high gene body CG values). However, as either decreases (Fig. 6b, purple lines at low gene body CG values, or yellow lines at high CG methylation values), gene expression is expected to increase.
Three positive interaction terms: f 4 + m cg (.02u chh +.02m chh +.04l exon ) = f 5: While only up-stream CG methylation showed a negative interaction with gene body CG methylation, three terms have positive linear interactions: Up-stream CHH methylation, gene body CHH methylation, and exon length. These can be treated in much the same way as our negative interaction term. Depending on the values of these terms, they can offset each other and lead to the removal of any interaction effect. For example if exon length (l exon ) = − 1 and up-stream (u chh ) and gene body (m chh ) CHH methylation = 1 these positive interaction terms cancel out (−.4 + .2 + .2 = 0). However, if we consider them varying in the same direction, they can have a striking effect on the relationship between gene body CG methylation and gene expression. At u chh = m chh = l exon = 1 (and the negative interaction term u cg = 0), we see the local maximum is at a methylation level of m cg = − 0.05 (55.1 %). If our negative interaction term u cg = − 1 , this increases to m cg = + 0.05 (57.3 %) (Fig. 6b, varying the values of our interaction terms, u chh = m chh = l exon = − u cg from −1.6 to 1.6, as the summed interaction term increases (lines become yellow) the local maxima for gene expression does so as well). When u chh = m chh = l exon = − u cg < 0, gene body CG methylation is almost purely repressive. At a summed interaction value less than −0.7 there is no longer a local maximum, and CG methylation has a purely negative effect on gene expression.
Quadratic Interaction Terms: log(GE) = f 5+.03m cg 2 l exon = f 6. Finally the interaction between the quadratic gene body CG methylation term and exon length is included in this model. As our quadratic term increases, not only does the position of the local gene expression maximum increase, so to does the inflection point (the point at which the function changes from concave to convex). Now, at the same linear interaction values tested above (u chh = m chh = l exon = − u cg = 1), our local maximum occurs at m cg = 0.07 (57.8 %) (Fig. 6c). As exon sizes increase, the effect of gene body CG methylation is expected to rapidly become more positive, and peak gene expression is predicted to occur at higher m cg levels. At l exon = 3 (3.5 kb), we find that the local maximum for gene expression occurs at m cg = 0.90 (78.1 %) and at l exon = 4 (6kb) there is no longer a local maximum for m cg , and the highest expected gene expression occurs at m cg approaching 100 % (largest gene size in Fig. 6c). It appears that for genes with smaller exons, moderately methylated genes are most highly expressed, but as genes become larger so to does the level of gene methylation that is associated with more highly expressed genes. Our gene size by gene body CG methylation results confirm a pattern observed by Zilberman et al.  in the first genome-wide methylome analysis in Arabidopsis in which found only a marginal relationship between gene size and gene expression, except for the genes in which gene bodies were methylated and then they found a positive relationship between gene size and gene expression.
Individual effects of interaction terms: log(GE) = f 5 + .08l exon − .02l exon 2 = f 6: Finally, we consider the effect of multiple terms simultaneously. Up until this point we have only included gene body CG methylation effects, and its interaction terms, while not including the independent effects of the term with which it interacts. Independent of gene body CG methylation, we find that gene expression tends to increase as the standardized exon length increases from −1 (500bp) to 2 (2kb), and beyond this point we expect a decline. In the absence of interaction terms, only considering independent effects of gene body CG methylation and exon length, we would estimate that peak gene expression occurs at an exon length of 2kb, and methylation of 45 %. Here we show that the effect of gene body CG methylation on expression is extremely size dependent, and that gene expression is expected to be highest for large highly gene body CG methylated genes, but lowest for small highly gene body CG methylated genes (Fig. 6d). It may be that as exon length increases, gene methylation is necessary to stabilize transcription, while for smaller genes it is not necessary for this purpose, and rather plays a repressive effect due to condensing chromatin near the transcription start site.
In this same way all other independent and interaction terms could be added to this model, parameters considered, and hypotheses tested. As nine distinct parameters are included (with 27 total terms) in this model the results quickly become difficult to conceptualize or visualize, yet through full-model construction, followed by simplification methods as presented above it is possible to decipher complex higher order regulatory interactions. We briefly discuss the effects of the other significant gene size and methylation terms in this model.
Intron length shows significant first, second, and third order effects with a gene expression peak at an intron size of approximately 1700 base pairs. Additionally, a positive interaction term with both exon length and number of introns suggests that generally, longer genes with more introns tend to be more highly expressed. Although relatively large genes do tend to be most highly expressed, there are negative quadratic terms for both exon and intron length that suggest after a certain point, increasing exon and intron length should be associated with decreased gene expression.
Non-CG gene body methylation
Gene body CHG methylation had significant linear, quadratic, and cubic independent terms, and an exon length interaction term. Gene body CHG methylation has a negative effect on gene expression across nearly its full range of possible values (Fig. 5), and it appears that it is the increase from no CHG methylation to slight CHG methylation that reduces gene expression. After this point the effect of CHG methylation appears to be minimal. The negative exon length interaction term suggests that long genes with CHG methylation tend to be more significantly repressed than smaller genes.
Gene body CHH methylation was found to have a negative effect on gene expression (Fig. 5), but a positive interaction with gene body CG methylation. Thus, as gene body CHH methylation increases, gene body CG methylation is expected to have a more positive effect on gene expression, but mean gene expression, independent of gene body CG methylation, is expected to decrease. Like CHG methylation, a manual inspection reveals that the jump from no CHH methylation to low levels of CHH methylation leads to a decrease in gene expression, but after this, the effects of increased methylation are minimal.
While it has been suggested that non-CG gene body methylation may be misattributed to genomic regions that are actually pseudogenes or paralogs [52, 56], here we find evidence that in at least some cases these genes are still expressed, albeit at lower levels than non-methylated genes. One possible explanation is that non-CG methylation of genes may be a first step on the path toward pseudogenization , whereby genes become targeted by non-CG methylation, gene expression is reduced, mutational constraints become lightened, and eventually the gene becomes entirely non-functional. Additionally, it may be that tightly developmentally controlled small RNAs are responsible for the majority of this methylation, and the use of identical tissue for methylation and gene expression analysis would identify a stronger role of gene body non-CG methylation on gene expression. Finally, even trace amounts of non-CG gene body methylation may be indicative of the presence of small RNAs, and RNA-directed DNA methylation (RdDM) . It could be that the methylation of just a few nucleotides by a single 24nt siRNA is enough to reduce gene expression, without significantly altering the methylation state of the whole gene.
Regulatory region methylation
Along with a negative interaction with gene body CG methylation, up-stream CG methylation also has a direct negative effect on gene expression (Fig. 5) and a negative interaction with up-stream CHH methylation. Not only does up-stream CG methylation limit the positive effect of gene body CG methylation on predicted gene expression, it also directly reduces predicted expression. Up-stream CHH methylation has both a significant positive linear effect on gene expression (Fig. 5), and a positive interaction with gene body CG methylation. The negative interaction term with up-stream CG methylation suggests that while up-stream CHH methylation generally has a positive effect on gene expression, when it is found alongside CG methylation, this effect is negated. While down-stream CHH methylation did not interact with gene body CG methylation, it was also found to have a positive effect on gene expression (Fig. 5).
A previous study in Arabidopsis similarly found that there was a positive correlation between gene expression and regulatory CHH methylation (albeit not in a regression framework) . They posit that as gene expression increases, unstable transcripts are produced as by-products at both the 5′ and 3′ ends of genes. In turn, this lead to the production of small RNAs that can target and cause CHH methylation bracketing highly expressed genes through RNA directed DNA methylation (RdDM). The possibility that increased gene expression causes increased regulatory CHH methylation, and not vice-versa does not introduce bias in this framework, but rather reinforces that our interpretations do not imply causality.
Gene expression modeling overview
While the traditional method of looking for simple associations between methylation state and gene expression has provided some insight into epigenetic regulation, here we demonstrate that modeling approaches can provide additional insight into these systems. We explain a surprisingly high (20.1 %) amount of the variation in log(gene expression) simply through methylation and gene architecture variation. We considered a potential 454 parameters in our model before settling on 29, but it is important to note that many other factors such as presence of enhancers within the gene body and distance to transposable elements, likely also modify the role of methylation on expression. By considering exon and intron length within this model we take the first steps to account for these potential confounding factors of methylation on expression. It is worth stressing that the gene expression and methylome data were not only collected from different individuals, but also different genetic lines, using different vegetative tissue types, and grown under slightly different greenhouse conditions. It is certainly possible that a similar model, tuned across multiple paired methylome and gene expression samples, could predict gene expression with greater precision. This portion of gene expression variation explained represents that which is at least relatively stable across individual genotypes, tissue, and conditions. While here we apply this model to gene expression on a gene-by-gene basis, through altering the response variable to another parameter of a gene, such as it’s mutation rate, gene expression variance, or the tissues in which it is expressed, this model could be extended to look for other roles of DNA methylation on gene function and evolution.
Results from this and other [3, 20, 29, 59] studies suggest that gene body CG methylation needs to be considered to have a quadratic effect on gene expression, and that this effect is highly dependent on exon size. Thus, genes can either be parsed according to exon length prior to estimating the role of gene body CG methylation on expression, or the interaction between exon length and methylation should be considered in the model. Other forms of methylation appear to have a more straightforward role in regulating gene expression, and in some cases it may suffice to predict that, for example, as up-stream CG methylation increases at a gene, its expression will likely decrease.
Gene ontology analysis of genes with high CG gene body methylation
Decreased methylation near transcription start sites
Decreased methylation near transcription start sites (TSS) was one of the earliest discovered phenomena of gene methylation . However, new evidence in M. guttatus  provides us with a novel framework in which to view this pattern. Hellsten et al.  identified an approximately two-fold increase in recombination near gene start sites (the beginning of exon 1 being most enriched), and postulated that this may be related to nucleosome depleted open chromatin at these regions as is the case in Arabidopsis  and rice . At the time of their publication however, there was no evidence for a similar trend in Mimulus. Here, evidence of depleted methylation near TSS (Table 1; Fig. 8) provides support to the theory that open chromatin (unmethylated) near TSS may increase local recombination rates. It appears that at least in yeast double stranded breaks occur most frequently in open chromatin regions , which may explain the observed increase in recombination near transcription start sites. It is likely that the increased recombination near TSS is simply a by-product of the dual forces exerted by DNA methylation, one involved in gene regulation, and another limiting double stranded breaks. The ability for DNA methylation to alter both of these processes provides an interesting link between gene regulation and DNA recombination that may or may not prove to be of evolutionary significance. Further studies linking methylation and recombination at a nucleotide level should further clarify this trend.
Transposable element methylation
Transposable element frequencies, classes, and methylation
We find that DNA methylation in all contexts is enriched in transposable elements relative to genes, however this is most significant for non-CG methylation (Table 1). This suggests that both RNA dependent DNA methylation (RdDM) is targeting and silencing transposable elements in M. guttatus as is this case in other angiosperms. Found at 2,380 copies, the helB8c family of helitron elements is far and away the most common transposon in the Mimulus genome (more abundant than the next seven TE families combined; Table 5). Helitrons are a relatively newly discovered class of type 2 transposable elements that propagate through a rolling circle mechanism that is still somewhat mysterious . One thing that is clear, is that these elements have been highly successful in propagating across flowering plants, making up 2 % of the Arabidopsis genome ; a single family of helitrons makes up 6 % of the maize genome , making it the most abundant DNA transposon identified. Here, we provide evidence for the success of these elements across the diversity of flowering plants.
Much remains unknown about the gene regulatory information contained in an organism’s methylome, but here we provide further evidence of complex interactions between gene methylation and expression. DNA methylation may actively alter gene expression, itself be altered by gene expression, or both methylation and expression may be jointly determined by a distinct genetic feature. Still the ability to explain over a fifth of the variation in log transformed gene expression by local DNA methylation, and basic genetic architecture (exon length, intron length, exon number), is promising and has numerous potential applications. Recent efforts have shown that the plant methylome is relatively stable throughout development , unlike gene expression. In this way methylation at a gene likely reflects moderately stable epigenetic control of gene expression, while developmentally activated transcription factors and small RNAs may provide highly plastic gene expression control throughout development. Through combining differential methylation analyses across tissue types, environmental treatments, or genetic lines with a modeling approach as described here; our understanding of the role of epigenetic variation in gene regulation can be greatly increased.
Nicholas McCool for plant care and DNA extractions. Masahiko Shimizu for laboratory aid, travel assistance, and general support. The University of Kansas Genome Sequencing Facility for preforming sequencing operations. This work was supported by NSF IOS-0951254 to JKK, LCH and AG Scoville. JMC’s travel to the University of Tokyo was supported by an NSF RCN microMorph grant, and a University of Kansas Botany Endowment grant to JMC. NIH grant R01 GM073990 (PI JKK).
- Flavell R. Inactivation of gene expression in plants as a consequence of specific sequence duplication. Proc Natl Acad Sci. 1994;91(9):3490–6.PubMed CentralPubMedView ArticleGoogle Scholar
- Tate PH, Bird AP. Effects of DNA methylation on DNA-binding proteins and gene expression. Curr Opin Genet Dev. 1993;3(2):226–31.PubMedView ArticleGoogle Scholar
- Zilberman D, Gehring M, Tran RK, Ballinger T, Henikoff S. Genome-wide analysis of Arabidopsis thaliana DNA methylation uncovers an interdependence between methylation and transcription. Nat Genet. 2006;39(1):61–9.PubMedView ArticleGoogle Scholar
- Lippman Z, Gendrel A-V, Black M, Vaughn MW, Dedhia N, McCombie WR, et al. Role of transposable elements in heterochromatin and epigenetic control. Nature. 2004;430(6998):471–6.PubMedView ArticleGoogle Scholar
- Miura A, Yonebayashi S, Watanabe K, Toyama T, Shimada H, Kakutani T. Mobilization of transposons by a mutation abolishing full DNA methylation in Arabidopsis. Nature. 2001;411(6834):212–4.PubMedView ArticleGoogle Scholar
- Xia J, Han L, Zhao Z. Investigating the relationship of DNA methylation with mutation rate and allele frequency in the human genome. BMC Genomics. 2012;13(8):1–9.Google Scholar
- Mirouze M, Lieberman-Lazarovich M, Aversano R, Bucher E, Nicolet J, Reinders J, et al. Loss of DNA methylation affects the recombination landscape in Arabidopsis. Proc Natl Acad Sci. 2012;109(15):5880–5.PubMed CentralPubMedView ArticleGoogle Scholar
- Feng S, Cokus SJ, Zhang X, Chen P-Y, Bostick M, Goll MG, et al. Conservation and divergence of methylation patterning in plants and animals. Proc Natl Acad Sci. 2010;107(19):8689–94.PubMed CentralPubMedView ArticleGoogle Scholar
- Huff JT, Zilberman D. Dnmt1-independent CG methylation contributes to nucleosome positioning in diverse eukaryotes. Cell. 2014;156(6):1286–97.PubMed CentralPubMedView ArticleGoogle Scholar
- Gruenbaum Y, Naveh-Many T, Cedar H, Razin A. Sequence specificity of methylation in higher plant DNA. 1981.Google Scholar
- Bond DM, Baulcombe DC. Small RNAs and heritable epigenetic variation in plants. Trends Cell Biol. 2014;24(2):100–7.PubMedView ArticleGoogle Scholar
- Kinoshita T, Jacobsen SE. Opening the door to epigenetics in PCP. Plant Cell Physiol. 2012;53(5):763–5.PubMed CentralPubMedView ArticleGoogle Scholar
- Eichten SR, Schmitz RJ, Springer NM. Epigenetics: beyond chromatin modifications and complex genetic regulation. Plant Physiol. 2014;165(3):933–47.PubMed CentralPubMedView ArticleGoogle Scholar
- Cao X, Jacobsen SE. Locus-specific control of asymmetric and CpNpG methylation by the DRM and CMT3 methyltransferase genes. Proc Natl Acad Sci. 2002;99 suppl 4:16491–8.PubMed CentralPubMedView ArticleGoogle Scholar
- Law JA, Du J, Hale CJ, Feng S, Krajewski K, Palanca AMS, et al. Polymerase IV occupancy at RNA-directed DNA methylation sites requires SHH1. Nature. 2013;498(7454):385–9.PubMed CentralPubMedView ArticleGoogle Scholar
- Law JA, Jacobsen SE. Establishing, maintaining and modifying DNA methylation patterns in plants and animals. Nat Rev Genet. 2010;11(3):204–20.PubMed CentralPubMedView ArticleGoogle Scholar
- Leonhardt H, Page AW, Weier H-U, Bestor TH. A targeting sequence directs DNA methyltransferase to sites of DNA replication in mammalian nuclei. Cell. 1992;71(5):865–73.PubMedView ArticleGoogle Scholar
- Lindroth AM, Cao X, Jackson JP, Zilberman D, McCallum CM, Henikoff S, et al. Requirement of CHROMOMETHYLASE3 for maintenance of CpXpG methylation. Science. 2001;292(5524):2077–80.PubMedView ArticleGoogle Scholar
- Zhang X, Yazaki J, Sundaresan A, Cokus S, Chan SW, Chen H, et al. Genome-wide high-resolution mapping and functional analysis of DNA methylation in arabidopsis. Cell. 2006;126(6):1189–201.PubMedView ArticleGoogle Scholar
- Li X, Zhu J, Hu F, Ge S, Ye M, Xiang H, et al. Single-base resolution maps of cultivated and wild rice methylomes and regulatory roles of DNA methylation in plant gene expression. BMC Genomics. 2012;13(1):300.PubMed CentralPubMedView ArticleGoogle Scholar
- Coleman-Derr D, Zilberman D. Deposition of histone variant H2A. Z within gene bodies regulates responsive genes. PLoS Genet. 2012;8(10):e1002988.PubMed CentralPubMedView ArticleGoogle Scholar
- Zhong S, Fei Z, Chen YR, Zheng Y, Huang M, Vrebalov J, et al. Single-base resolution methylomes of tomato fruit development reveal epigenome modifications associated with ripening. Nat Biotechnol. 2013;31(2):154–9.PubMedView ArticleGoogle Scholar
- Eichten SR, Briskine R, Song J, Li Q, Swanson-Wagner R, Hermanson PJ, et al. Epigenetic and genetic influences on DNA methylation variation in maize populations. Plant Cell. 2013;25(8):2783–97.PubMed CentralPubMedView ArticleGoogle Scholar
- Schmitz RJ, Schultz MD, Urich MA, Nery JR, Pelizzola M, Libiger O, et al. Patterns of population epigenomic diversity. Nature. 2013;495(7440):193–8.PubMed CentralPubMedView ArticleGoogle Scholar
- Saze H, Tsugane K, Kanno T, Nishimura T. DNA methylation in plants: relationship to small RNAs and histone modifications, and functions in transposon inactivation. Plant Cell Physiol. 2012;53(5):766–84.PubMedView ArticleGoogle Scholar
- Gent JI, Ellis NA, Guo L, Harkess AE, Yao Y, Zhang X, et al. CHH islands: de novo DNA methylation in near-gene chromatin regulation in maize. Genome Res. 2013;23(4):628–37.PubMed CentralPubMedView ArticleGoogle Scholar
- Takuno S, Gaut BS. Gene body methylation is conserved between plant orthologs and is of evolutionary consequence. Proc Natl Acad Sci. 2013;110(5):1797–802.PubMed CentralPubMedView ArticleGoogle Scholar
- Li Q, Eichten SR, Hermanson PJ, Springer NM. Inheritance patterns and stability of DNA methylation variation in maize near-isogenic lines. Genetics. 2014;196(3):667–76.PubMed CentralPubMedView ArticleGoogle Scholar
- Wang J, Marowsky NC, Fan C. Divergence of gene body DNA methylation and evolution of plant duplicate genes. PLoS One. 2014;9(10):e110357.PubMed CentralPubMedView ArticleGoogle Scholar
- Yuan Y, Guo L, Shen L, Liu JS. Predicting gene expression from sequence: a reexamination. PLoS Comput Biol. 2007;3(11):e243.PubMed CentralPubMedView ArticleGoogle Scholar
- Li X, Wang X, He K, Ma Y, Su N, He H, et al. High-resolution mapping of epigenetic modifications of the rice genome uncovers interplay between DNA methylation, histone methylation, and gene expression. Plant Cell Online. 2008;20(2):259–76.View ArticleGoogle Scholar
- Colicchio JM, Monnahan PJ, Kelly JK, Hileman LC. Gene expression plasticity resulting from parental leaf damage in Mimulus guttatus. New Phytol. 2015;205(2):894–906.Google Scholar
- Holeski L. Within and between generation phenotypic plasticity in trichome density of Mimulus guttatus. J Evol Biol. 2007;20(6):2092–100.PubMedView ArticleGoogle Scholar
- Holeski LM, Chase‐Alone R, Kelly JK. The genetics of phenotypic plasticity in plant defense: trichome production in Mimulus guttatus. Am Nat. 2010;175(4):391–400.PubMedView ArticleGoogle Scholar
- Scoville AG, Barnett LL, Bodbyl‐Roels S, Kelly JK, Hileman LC. Differential regulation of a MYB transcription factor is correlated with transgenerational epigenetic inheritance of trichome density in Mimulus guttatus. New Phytolo. 2011;191(1):251–63.View ArticleGoogle Scholar
- Holeski LM, Jander G, Agrawal AA. Transgenerational defense induction and epigenetic inheritance in plants. Trends Ecol Evol. 2012;27(11):618–26.PubMedView ArticleGoogle Scholar
- Holeski LM, Zinkgraf MS, Couture JJ, Whitham TG, Lindroth RL. Transgenerational effects of herbivory in a group of long-lived tree species: maternal damage reduces offspring allocation to resistance traits, but not growth. J Ecol. 2013;101(4):1062–73.View ArticleGoogle Scholar
- Latzel V, Allan E, Bortolini Silveira A, Colot V, Fischer M, Bossdorf O. Epigenetic diversity increases the productivity and stability of plant populations. Nat Commun. 2013;4:2875.PubMedView ArticleGoogle Scholar
- Kilvitis H, Alvarez M, Foust C, Schrey A, Robertson M, Richards C. Ecological Epigenetics. In: Landry CR, Aubin-Horth N, editors. Ecological Genomics, vol. 781. Netherlands: Springer; 2014. p. 191–210.View ArticleGoogle Scholar
- Hellsten U, Wright KM, Jenkins J, Shu S, Yuan Y, Wessler SR, et al. Fine-scale variation in meiotic recombination in Mimulus inferred from population shotgun sequencing. Proc Natl Acad Sci. 2013;110(48):19478–82.PubMed CentralPubMedView ArticleGoogle Scholar
- Holeski L, Keefover-Ring K, Bowers MD, Harnenz Z, Lindroth R. Patterns of Phytochemical Variation in Mimulus guttatus (Yellow Monkeyflower). J Chem Ecol. 2013;39(4):525–36.PubMedView ArticleGoogle Scholar
- Hardcastle T. High-throughput sequencing of cytosine methylation in plant DNA. Plant Methods. 2013;9(1):16.PubMed CentralPubMedView ArticleGoogle Scholar
- Miura F, Enomoto Y, Dairiki R, Ito T. Amplification-free whole-genome bisulfite sequencing by post-bisulfite adaptor tagging. Nucleic Acids Res. 2012;40(17):e136.PubMed CentralPubMedView ArticleGoogle Scholar
- Kiełbasa SM, Wan R, Sato K, Horton P, Frith MC. Adaptive seeds tame genomic sequence comparison. Genome Res. 2011;21(3):487–93.PubMed CentralPubMedView ArticleGoogle Scholar
- Flagel LE, Willis JH, Vision TJ. The standing pool of genomic structural variation in a natural population of Mimulus guttatus. Genome Biol Evol. 2014;6(1):53–64.PubMed CentralPubMedView ArticleGoogle Scholar
- Goodstein DM, Shu S, Howson R, Neupane R, Hayes RD, Fazo J, et al. Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 2012;40(D1):D1178–86.PubMed CentralPubMedView ArticleGoogle Scholar
- Ren X-Y, Vorst O, Fiers MW, Stiekema WJ, Nap J-P. In plants, highly expressed genes are the least compact. Trends Genet. 2006;22(10):528–32.PubMedView ArticleGoogle Scholar
- Castillo-Davis CI, Mekhedov SL, Hartl DL, Koonin EV, Kondrashov FA. Selection for short introns in highly expressed genes. Nat Genet. 2002;31(4):415–8.PubMedGoogle Scholar
- Posada D, Buckley TR. Model selection and model averaging in phylogenetics: advantages of Akaike information criterion and Bayesian approaches over likelihood ratio tests. Syst Biol. 2004;53(5):793–808.PubMedView ArticleGoogle Scholar
- Kohavi R. A study of cross-validation and bootstrap for accuracy estimation and model selection. IJCAI. 1995;1995:1137–45.Google Scholar
- Kelly JK, Koseva B, Mojica JP. The genomic signal of partial sweeps in Mimulus guttatus. Genome Biol Evol. 2013;5(8):1457–69.PubMed CentralPubMedView ArticleGoogle Scholar
- Schmitz RJ, He Y, Valdes-Lopez O, Khan SM, Joshi T, Urich MA, et al. Epigenome-wide inheritance of cytosine methylation variants in a recombinant inbred population. Genome Res. 2013;23(10):1663–74.PubMed CentralPubMedView ArticleGoogle Scholar
- Shen H, He H, Li J, Chen W, Wang X, Guo L, et al. Genome-wide analysis of DNA methylation and gene expression changes in two Arabidopsis ecotypes and their reciprocal hybrids. Plant Cell. 2012;24(3):875–92.PubMed CentralPubMedView ArticleGoogle Scholar
- Jones PA. Functions of DNA methylation: islands, start sites, gene bodies and beyond. Nat Rev Genet. 2012;13(7):484–92.PubMedView ArticleGoogle Scholar
- Bartee L, Malagnac F, Bender J. Arabidopsis cmt3 chromomethylase mutations block non-CG methylation and silencing of an endogenous gene. Genes Dev. 2001;15(14):1753–8.PubMed CentralPubMedView ArticleGoogle Scholar
- Seymour DK, Koenig D, Hagmann J, Becker C, Weigel D. Evolution of DNA methylation patterns in the brassicaceae is driven by differences in genome organization. PLoS Genet. 2014;10(11):e1004785.PubMed CentralPubMedView ArticleGoogle Scholar
- Li X, Li W, Wang H, Cao J, Maehashi K, Huang L, et al. Pseudogenization of a sweet-receptor gene accounts for cats’ indifference toward sugar. PLoS Genet. 2005;1(1):e3.PubMed CentralView ArticleGoogle Scholar
- Wassenegger M. RNA-directed DNA methylation. In: Plant Gene Silencing. Springer; 2000: 83–100.Google Scholar
- Yang H, Chang F, You C, Cui J, Zhu G, Wang L, et al. Whole‐genome DNA methylation patterns and complex associations with gene structure and expression during flower development in Arabidopsis. Plant J. 2015;81(2):268–81.PubMedView ArticleGoogle Scholar
- Zhang W, Zhang T, Wu Y, Jiang J. Genome-wide identification of regulatory DNA elements and protein-binding footprints using signatures of open chromatin in Arabidopsis. Plant Cell Online. 2012;24(7):2719–31.View ArticleGoogle Scholar
- Yu J, Hu S, Wang J, Wong GK-S, Li S, Liu B, et al. A draft sequence of the rice genome (Oryza sativa L. ssp. indica). Science. 2002;296(5565):79–92.PubMedView ArticleGoogle Scholar
- Pan J, Sasaki M, Kniewel R, Murakami H, Blitzblau Hannah G, Tischfield Sam E, Zhu X, et al. A Hierarchical Combination of Factors Shapes the Genome-wide Topography of Yeast Meiotic Recombination Initiation. Cell, 2011;144(5):719–731.Google Scholar
- Xiong W, He L, Lai J, Dooner HK, Du C. HelitronScanner uncovers a large overlooked cache of Helitron transposons in many plant genomes. Proc Natl Acad Sci. 2014;111(28):10263–8.PubMed CentralPubMedView ArticleGoogle Scholar
- Hollister JD, Gaut BS. Population and evolutionary dynamics of helitron transposable elements in Arabidopsis thaliana. Mol Biol Evol. 2007;24(11):2515–24.PubMedView ArticleGoogle Scholar
- Eichten SR, Vaughn MW, Hermanson PJ, Springer NM. Variation in DNA methylation patterns is more common among maize inbreds than among tissues. Plant Genome 2013, 6(2).Google Scholar
- Cokus SJ, Feng S, Zhang X, Chen Z, Merriman B, Haudenschild CD, et al. Shotgun bisulphite sequencing of the Arabidopsis genome reveals DNA methylation patterning. Nature. 2008;452(7184):215–9.PubMed CentralPubMedView ArticleGoogle Scholar
- Wickham H. ggplot2: elegant graphics for data analysis: Springer; 2009.Google Scholar
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.