Dynamic DNA cytosine methylation in the Populus trichocarpa genome: tissue-level variation and relationship to gene expression
© Vining et al; licensee BioMed Central Ltd. 2012
Received: 5 September 2011
Accepted: 17 January 2012
Published: 17 January 2012
DNA cytosine methylation is an epigenetic modification that has been implicated in many biological processes. However, large-scale epigenomic studies have been applied to very few plant species, and variability in methylation among specialized tissues and its relationship to gene expression are poorly understood.
We surveyed DNA methylation from seven distinct tissue types (vegetative bud, male inflorescence [catkin], female catkin, leaf, root, xylem, phloem) in the reference tree species black cottonwood (Populus trichocarpa). Using 5-methyl-cytosine DNA immunoprecipitation followed by Illumina sequencing (MeDIP-seq), we mapped a total of 129,360,151 36- or 32-mer reads to the P. trichocarpa reference genome. We validated MeDIP-seq results by bisulfite sequencing, and compared methylation and gene expression using published microarray data. Qualitative DNA methylation differences among tissues were obvious on a chromosome scale. Methylated genes had lower expression than unmethylated genes, but genes with methylation in transcribed regions ("gene body methylation") had even lower expression than genes with promoter methylation. Promoter methylation was more frequent than gene body methylation in all tissues except male catkins. Male catkins differed in demethylation of particular transposable element categories, in level of gene body methylation, and in expression range of genes with methylated transcribed regions. Tissue-specific gene expression patterns were correlated with both gene body and promoter methylation.
We found striking differences among tissues in methylation, which were apparent at the chromosomal scale and when genes and transposable elements were examined. In contrast to other studies in plants, gene body methylation had a more repressive effect on transcription than promoter methylation.
Keywords: Epigenetics, epigenomics, DNA methylation, 5-methylcytosine, Populus
"Epigenetic" implies changes in regulatory states of genes or genomic DNA without changes in DNA sequence. The archetypical epigenetic modification in eukaryotic genomes is the addition of a methyl group to the fifth carbon of cytosine to produce 5-methylcytosine (5meC) [1, 2], reviewed in . Cytosine DNA methylation is an epigenetic modification that is shared by many eukaryotic organisms. Along with various other epigenetic modifications such as methylation, phosphorylation and acetylation of histone amino acids, cytosine methylation is an important regulator of biological processes including transposon silencing, heterochromatin organization, genomic imprinting, and gene expression.
The distribution of cytosine methylation is highly variable within plant genomes . This overall methylation pattern, which is conserved among diverse plant taxa, is often described as "mosaic," as it consists of interspersed methylated and unmethylated regions [5, 6]. The patterns of 5meC, mechanisms for de novo and maintenance methylation and the requirement for specific proteins for cytosine methylation have been best studied in Arabidopsis thaliana, where roughly 20% of the genome is methylated in whole seedlings [7, 8]. Cytosine methylation is strongly enriched in heterochromatin at pericentromeric and subtelomeric repeats, and at rDNA clusters [7, 9]. Repetitive sequences, which consist largely of transposons, retrotransposons, and tandem or inverted repeats, are highly methylated [5, 10, 11]. A novel and unexpected finding from genome-wide surveys was that a third of A. thaliana genes are methylated within their transcribed regions ("gene body methylation") [7, 8], while perhaps 16% of rice (Oryza sativa) genes are enriched for 5meC . The relationship between gene body methylation and transcription is currently not well understood. While promoter methylation is generally associated with lower transcription in A. thaliana, the relationship of gene body methylation to expression is complex, with methylation tending to occur most often in genes transcribed at moderate to high, but not very high, levels [4, 7, 8].
In plants, 5meC can occur in all sequence contexts (CG, CHG and CHH, where H refers to A, C or T) [13, 14]. The mechanisms responsible for establishment and maintenance of 5meC are best studied in A. thaliana, where the maintenance methyltransferase MET1 targets hemimethylated CG sites, and the de novo methyltransferases DRM2 and CMT3 target CHG and CHH sites. Disruption of maintenance methylation results in abnormal developmental phenotypes including stunting, malformed leaves, decreased apical dominance, lower fertility, disrupted heterochrony, delayed flowering time and abnormal flower morphology [15, 16], while DRM2 and CMT3 mutants display defects in RNA-mediated silencing , as well as dwarfing and abnormal leaf phenotypes . The activity of methyltransferases appears synergistic, at least in some cases, so that deletion of DRM1/2/CMT3 affects CG methylation maintenance by MET1 . Together, these results suggest that 5meC in all contexts can affect several aspects of chromatin regulation, with consequences for plant development and differentiation.
Tissue-level variation in methylation has been noted in several plant species. For example, in Arabidopsis, about 6% of cytosines were found to be methylated in immature floral tissue , while 24% of CG, 6.7% of CHG, and 1.7% of CHH were methylated in young plants . Few studies have compared high-resolution methylation profiles among tissues within a plant species. In rice, whole genome methylation patterns were found to be similar among mature leaves, embryos, seedling shoots and roots, but hypomethylation was correlated with preferential expression in endosperm . Patterns of 5meC in LTR transposable elements differed between rice leaves and roots and affected transcription of neighboring genes , a phenomenon common to the SINE-containing FWA promoter of A. thaliana [23, 24].
In addition to well-established roles in transposable element silencing and genomic imprinting, DNA methylation may be involved in plant adaptation to stress [25, 26]. In A. thaliana, genome-wide methylation increased in the progeny of plants exposed to temperature extremes, ultraviolet light , flood, and salt but decreased in progeny of drought-stressed plants [28, 29]. In hybrid poplars (P. deltoides × P. nigra), shoot apices from drought-stressed juvenile trees exhibited genotype-dependent 5meC variation . Differential DNA methylation patterns in poplar clones that have acquired differential transcriptome responses to drought stress have been observed .
While much has been learned from work on annual plants, in-depth investigations of cytosine methylation patterns in long-lived plants have been sparse. Because of their long term tissue differentiation and perennial exposure to environmental stresses, DNA methylation may play a greater role in both tree development and homeostasis. Studies of gross cellular DNA methylation indicate that it may vary substantially during tree development, whether assessed in vivo or in vitro. In apical buds of chestnut trees, Castanea sativa, 5meC increased during bud set and decreased during bud burst . In Monterey pine, Pinus radiata, 5meC levels in needles of reproductively mature trees were double that of juvenile needles . In shoots of chestnut and Monterey pine, a gradual increase in DNA methylation accompanied aging over 5-8 years [34, 35]. Increased methylation in mature vs. juvenile leaves was associated with loss of capacity for in vitro organogenesis in P. radiata. In micropropagated Acacia, shoots with juvenile leaves exhibited higher DNA methylation levels than shoots with mature leaves . Transient DNA methylation of ovules accompanied embryogenesis in chestnut . As noted above, in poplar drought stress induced changes in total cellular DNA methylation  and was associated with transcriptome changes within separately propagated clones .
A variety of experimental techniques can be applied to study genome-wide DNA methylation (reviewed in ). On a gross scale, the proportion of 5meC can be estimated by HPLC or HPCE, as has been done to show differences in 5meC among tissue types or treatments [33, 35]. The drawback of these methods is the lack of sequence-specific information. Immunoprecipitation with an antibody raised against 5-methylcytidine (MeDIP), followed by genome tiling array hybridization or high-throughput sequencing of the precipitated DNA (MeDIP-seq), has been used to enumerate and compare methylated regions in Homo sapiens [40, 41], Mus musculus, Neurospora and A. thaliana [7, 8]. The most detailed, single-base resolution maps are generated by sequencing of genomic DNA treated with sodium bisulfite, which converts unmethylated cytosines to uracils but leaves 5meC unconverted . However, this technique requires very high sequencing depth and is not suitable for mapping to repetitive genomic regions where uniqueness can be confounded by the presence of C to T SNPs. Genome-wide bisulfite sequencing was first used in Arabidopsis , but has now also been used to assess genome methylation in Oryza sativa and P. trichocarpa [6, 21], as well as mammals including H. sapiens and M. musculus. For the present work, we chose MeDIP-seq of many different tissue types because it provides comprehensive methylome coverage at a lower cost than genome-wide bisulfite sequencing.
The black cottonwood, Populus trichocarpa, is widely recognized as a reference species for tree biology. It has been studied in great detail over the past 30 years, and many resources are readily available, including a draft genome sequence (http://www.phytozome.net/poplar), custom microarrays, and extensive transcriptome data . For our studies we used genome assembly version 2.2 in combination with published expression microarray data from multiple tissue types [48, 49]. While mature leaves from P. trichocarpa have recently been subjected to genome-wide bisulfite sequencing , high-resolution epigenomic methods have not yet been applied to discern tissue-level variation. We investigated variation in genome-level cytosine methylation among all of the major types of differentiated poplar tissues. To this end, we sequenced methylated DNA obtained by MeDIP from seven P. trichocarpa tissues on an Illumina GAIIx. We found overall patterns of cytosine methylation that are consistent with those seen in Arabidopsis, but observed differences in methylation patterns among tissue types not previously studied. We also found a different pattern of association of gene body methylation to gene expression.
Collection of MeDIP-seq data
Table: Summary of MeDIP-seq experimental results. Columns: Total Reads (Illumina Yield); No. Mapped Reads; Percent Mapped Reads.
Validation of MeDIP-seq results by bisulfite sequencing
Bisulfite sequencing of eight selected targets was used to confirm quality of the MeDIP-seq data. Regions were selected to represent a range of RPKM values and maximum per-nucleotide coverage values (Additional files 2, 3), and were mainly at 5' ends of genes in promoters and coding regions. There was a strong correlation with both RPKM (R2 = 0.92) and maximum per-nucleotide coverage (R2 = 0.93) (Additional file 4). Cytosines in all three sequence contexts (CG, CHG, CHH) were methylated in the target regions, but in targets with an overall low cytosine percentage, the CHH context was more frequently methylated than CG or CHG (Additional file 5), and was more variable than the other contexts among the three examined bud stages (Additional file 6).
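The two quantities compared in this validation can be illustrated with a minimal sketch. It assumes the conventional definition of RPKM (reads per kilobase of region per million mapped reads) and assumes that the reported R2 is a squared Pearson correlation; the function names and any input values are hypothetical and are not taken from the study's pipeline.

```python
def rpkm(read_count, region_length_bp, total_mapped_reads):
    """Reads per kilobase of region per million mapped reads (conventional definition)."""
    if region_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("region length and total mapped reads must be positive")
    # 1e9 = 1e3 (bp per kilobase) * 1e6 (reads per million)
    return read_count * 1e9 / (region_length_bp * total_mapped_reads)


def r_squared(xs, ys):
    """Squared Pearson correlation between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy * sxy / (sxx * syy)
```

In such a comparison, `xs` would hold the bisulfite-derived percentage of methylated cytosines per target region and `ys` the corresponding RPKM (or maximum per-nucleotide coverage) values.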
Mapping of MeDIP-seq reads to the genome
Coverage of the total genome was calculated for each tissue type separately for uniquely mapping reads and for distributed repeats. Uniquely mapped non-clonal reads represented 9 to 59% of the total number of reads (Table 1). For non-immunoprecipitated control samples, uniquely mapping reads covered ~80% of the genome and k-mer redistributed repeats covered 23% of the genome; these percentages included overlaps at the ends of reads between the two types. In contrast, reads from MeDIP samples that were aligned to unique positions in the reference genome covered 26%-56% of the genome, while an additional 14%-19% of the genome was covered by distributed k-mer repeats (Additional file 7A). Within the covered portion of the genome, average coverage was deeper for distributed k-mer repeats (ranging from 4.9 reads/bp in xylem to 14.6 reads/bp in bud) than for uniquely mapping reads (ranging from 0.8 reads/bp in root to 2.5 reads/bp in bud) (Additional data file 7B).
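The distinction drawn here between the fraction of the genome covered and the average depth within the covered portion can be sketched as follows. This is a toy illustration under stated assumptions: reads are given as 0-based half-open intervals on a single sequence, and the function name is hypothetical. A per-base list is only practical for small examples, not for a full genome.

```python
def coverage_summary(genome_length, reads):
    """Return (breadth, mean_depth) for reads given as (start, end) half-open intervals.

    breadth    = fraction of positions covered by at least one read
    mean_depth = average number of reads per covered position
    """
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    depth = [0] * genome_length
    for start, end in reads:
        # Clip each read to the sequence boundaries before counting.
        for pos in range(max(start, 0), min(end, genome_length)):
            depth[pos] += 1
    covered = sum(1 for d in depth if d > 0)
    breadth = covered / genome_length
    mean_depth = sum(depth) / covered if covered else 0.0
    return breadth, mean_depth
```

Computing the two numbers separately for uniquely mapping reads and for distributed repeat reads would give the kind of paired breadth and depth figures reported above.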
Table: Number of methylated 1 kb windows called by three methods. Column: No. methylated 1 kb windows.
Mapping of MeDIP-seq reads to genes
Variable methylation of transposable element classes
Differentiation in methylation among tissues
On a chromosome scale, overall MeDIP-seq read coverage was similar across tissue types, but there were visually striking regions of large-scale heterogeneity among tissues that ranged from approximately 100 kb to 2 Mb in length (Figures 1 and 2, Additional file 9; several examples are indicated with asterisks in Figure 1). Many of these areas of methylation heterogeneity had low gene density and contained clusters of transposable elements, but one region we examined more closely was a cluster of leucine-rich repeat (LRR) genes. The regions with the highest methylation tended to show the highest tissue-associated variation in methylation (Additional file 11). However, there were also large chromosomal sections where methylation signals were relatively low for all tissues, but a particular tissue was consistently highest (e.g., the right half of chromosome 3 and the left half of chromosome 8 in Figure 1).
Statistical comparison of called methylated 1 kb windows between tissues.
Statistical comparison of called methylated genic features between tissues.
Gene ontology of methylated genes in male catkins
To determine the functional classification of body-methylated genes specific to male catkins, we tested for enrichment of gene ontology (GO) categories. This analysis revealed significant enrichment (p < 0.05) in 168 specific gene ontology categories, including those related to translation/protein metabolism (264 genes), nucleic acid binding (322 genes) and RNA metabolism (135 genes). Some of the enriched GO categories observed are illustrated in Additional file 12.
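The text does not state which statistical test was used for GO enrichment. A common choice for this kind of analysis is a one-sided hypergeometric test, sketched below as an assumption rather than as the authors' method; the function and parameter names are hypothetical.

```python
from math import comb


def hypergeom_enrichment_p(population, annotated, sample, hits):
    """Upper-tail hypergeometric p-value, P(X >= hits).

    population: number of genes in the background set
    annotated:  background genes carrying the GO category
    sample:     number of genes in the tested set (e.g. body-methylated genes)
    hits:       tested genes carrying the GO category
    """
    if not (0 <= annotated <= population and 0 <= sample <= population):
        raise ValueError("annotated and sample must lie between 0 and population")
    max_hits = min(annotated, sample)
    total = comb(population, sample)
    # Sum the probability of every outcome at least as extreme as the observed one.
    tail = sum(
        comb(annotated, k) * comb(population - annotated, sample - k)
        for k in range(max(hits, 0), max_hits + 1)
    )
    return tail / total
```

When many categories are tested, as with the 168 categories reported here, a multiple-testing correction would normally be applied to the resulting p-values; whether and how that was done is not specified in the text.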
Association of methylation and gene expression
We compared the categorized gene feature methylation to tissue-specific gene expression data from previous expression microarray studies . We did this on both a global scale, looking at methylation and expression data pooled across all tissue types, and on a per-tissue basis. We also analyzed the association of gene expression to methylation at particular genic features. Clustering of tissues by only their overall gene expression patterns suggested that floral tissues, the bud samples, and root and xylem had the most similar gene expression profiles (Additional file 13). However, when tissues were clustered based on only RPKM data, the patterns were highly dissimilar (Additional file 14). The biological replications clustered for both the male and female inflorescence tissues, as well as for the buds and input samples. However, the biological replications for the root, leaf and phloem tissues did not cluster, and the positions of all tissues bore little similarity to what was observed based on gene expression data. The lack of concordance was also observed when biological replications were pooled and methylation of gene bodies or promoters clustered (data not shown). Thus, at the gross genome level, tissue specific gene expression and methylation had no obvious association.
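The clustering comparison described above can be sketched with a small agglomerative procedure. The distance measure and linkage rule are not given in the text, so this example assumes a correlation distance (1 minus Pearson r) with average linkage; all names are hypothetical, and constant profiles are not handled.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length, non-constant sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def average_linkage(profiles):
    """Cluster samples given as {name: [values]}; return the merge history.

    Each merge is (cluster_a, cluster_b, distance), with clusters as tuples of names.
    """
    dist = {
        frozenset((a, b)): 1 - pearson(profiles[a], profiles[b])
        for a, b in combinations(profiles, 2)
    }

    def cluster_distance(c1, c2):
        # Average of all pairwise sample distances between the two clusters.
        return sum(dist[frozenset((a, b))] for a in c1 for b in c2) / (len(c1) * len(c2))

    clusters = [(name,) for name in profiles]
    merges = []
    while len(clusters) > 1:
        c1, c2 = min(combinations(clusters, 2), key=lambda pair: cluster_distance(*pair))
        merges.append((c1, c2, cluster_distance(c1, c2)))
        clusters = [c for c in clusters if c not in (c1, c2)] + [tuple(sorted(c1 + c2))]
    return merges
```

Running the same procedure once on per-tissue expression profiles and once on per-tissue RPKM profiles, then comparing the merge orders, mirrors the concordance check described in this paragraph.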
We have taken advantage of two of poplar's characteristics as a model tree species--high quality genomic resources and extensive, highly elaborated tissue types--to interrogate epigenomic variation at genome scale. Previous studies have either compared total DNA methylation among different tissue types, or have looked at one or very few tissues at genomic scale. Most high-resolution studies of methylation in Arabidopsis and rice have used seedlings or young plants, which are complex mixtures of different tissues rather than discrete tissue types, each composed of their own complex cell types. Only a handful of previous studies have examined genome-wide, high-resolution methylation differences among coherent tissue types , and these have mainly focused on comparisons of cytosine methylation in endosperm and embryo during seed development .
The MeDIP-seq method does not provide single-base resolution as does genome scale bisulfite sequencing, and therefore does not allow detailed analysis of cytosine context in methylated regions. However, region-specific bisulfite sequencing validated our MeDIP-seq results, showing that both calculated RPKM and maximum coverage per nucleotide reflect the underlying percentage of methylated cytosines in regions with varying cytosine content and position relative to genes. Within bisulfite-sequencing target regions, cytosines in all three (CG, CHG, CHH) sequence contexts were identified, with CG and CHG methylation being more consistent within tissues. However, in the two targets with cytosine content < 10%, cytosines in CHH context were methylated more frequently than those in the other two contexts. One of these targets was 5' of a gene model, and the other spanned the 5' end of a gene model coding region. Previous studies have reported that CGs are more frequently methylated than CHGs or CHHs, especially in coding regions, and 5meCHH, while less frequent in general, is more common in repeat regions and short transposable elements [6, 11, 20].
Differential tissue methylation was extensive at genic and non-genic regions
MeDIP-seq read calculations and RPKM and CPPD statistical analysis based on differences among 1-kb genome windows showed that only ~2% of the P. trichocarpa genome was methylated in all seven assayed tissues. In contrast, 64% of the genome was ubiquitously unmethylated. The difference implies that one-third of the genome was differentially methylated among the tissues studied. Comparisons of promoter- and gene body methylated genes likewise showed extensive tissue differential methylation; 11 to 16% or less were methylated in common among tissues. We know of no comparable estimates of tissue-level variation in other plants; the few studies that have compared gene-level variation in different tissue types have used small numbers of tissues and reported low 5meC variation among tissues, most of which were accounted for by variation in transposable element methylation . At least some of the chromosome blocks we observed that had highly tissue-differentiated methylation were also rich in transposable elements (Additional file 9).
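The one-third figure follows from the two reported percentages: 100% minus ~2% methylated in all tissues minus 64% unmethylated in all tissues leaves about 34%. The sketch below shows how such a three-way partition of windows could be tallied from per-tissue calls; the input layout and names are hypothetical and do not reproduce the study's RPKM and CPPD calling procedure.

```python
def classify_windows(calls):
    """Partition windows by methylation calls across tissues.

    calls: list of tuples of booleans, one tuple per window and one boolean
           per tissue (True = window called methylated in that tissue).
    Returns the fraction of windows in each of three classes.
    """
    if not calls:
        raise ValueError("at least one window is required")
    counts = {"all_methylated": 0, "all_unmethylated": 0, "differential": 0}
    for window in calls:
        if all(window):
            counts["all_methylated"] += 1
        elif not any(window):
            counts["all_unmethylated"] += 1
        else:
            # Methylated in some tissues but not in others.
            counts["differential"] += 1
    return {name: count / len(calls) for name, count in counts.items()}


# Arithmetic behind the one-third estimate, using the percentages in the text.
differential_percent = 100 - 2 - 64  # 34
```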
Chromosome methylation supports the locations of putative centromeres
We separately mapped unique MeDIP-seq reads and k-mer repeat reads, distributing k-mer repeats over all their genome occurrences. The P. trichocarpa genome is highly duplicated, with ~41% of the assembled genome considered repetitive (based on 16-mer counts > 34; ). Due to the difficulty of assembling repetitive genome regions, k-mer repeats were masked from the original genome assembly . This repeat exclusion may be the reason that unique reads covered a much larger proportion of genome space than k-mer repeat reads. On a chromosomal scale, repeat regions were also correlated with genome gaps; the v2.2 assembly includes a large number (2,499) of scaffolds that are not yet assigned to specific chromosomes.
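Distributing repeat reads over all of their genome occurrences can be done in more than one way, and the weighting used in the study is not described in this section. The sketch below assumes one simple scheme, in which a read with n alignment positions contributes 1/n to each position's 1-kb window, so that each read still counts once in total. The data layout and function name are hypothetical.

```python
from collections import defaultdict


def distribute_reads(alignments, window_size=1000):
    """Accumulate fractional read counts per genome window.

    alignments: {read_id: [(chromosome, position), ...]} listing every
                occurrence of each read in the reference.
    Returns {(chromosome, window_index): weighted_count}.
    """
    signal = defaultdict(float)
    for hits in alignments.values():
        if not hits:
            continue  # unmapped read, contributes nothing
        weight = 1.0 / len(hits)  # assumption: equal share per occurrence
        for chromosome, position in hits:
            signal[(chromosome, position // window_size)] += weight
    return dict(signal)
```

An alternative scheme would add a full count at every occurrence, which inflates depth in repeat-rich windows; the choice affects how repeat coverage compares with unique-read coverage.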
Our chromosome methylation maps showed concentrations of MeDIP-seq reads, in particular k-mer repeats, on more than half of P. trichocarpa chromosomes. These regions correspond with areas of low gene density, which are expected for centromere/pericentromere locations. A similar chromosome methylation profile, with high methylation in centromeric and pericentromeric regions, has been observed in Arabidopsis [7, 51]. Centromeric satellite repeats are generally methylated and silenced , although repeats associated with centromere-specific histone CENH3 are hypomethylated compared to their counterparts in pericentromeric heterochromatin [54, 55]. Genes near centromeres are also likely to be methylated . Chromosomes lacking a single methylation peak had either more than one distinct methylation peak (e.g. LG V, LG XII), or broader, indistinct methylated regions (e.g. LG XIII, LG XVII). These regions likely reflect the large chromosomal rearrangements and segmental duplications that mark the evolutionary history of Populus.
Retroelements showed extensive and differential tissue methylation
Our data showed that protein-coding genes were underrepresented in the methylated fraction of the genome, while transposable elements and other simple repeats were generally methylated. LTR-gypsy retroelements are abundant in heterochromatic centromeric and pericentromeric regions in plants, and are the most plentiful repetitive element in the P. trichocarpa genome . We found that this retroelement class was also enriched in the methylated fraction of the genome in all tissue types. Four other retroelement categories (DNA cacta, LINE, LTR copia, retroelement) were also overrepresented in the methylated genome fraction, which is not surprising given the extensive evidence of methylation-mediated transposable element silencing in eukaryotic genomes [56, 57]. Two classes of LINE elements (LINE CR1, LINE L2) were underrepresented among the methylated genome fraction, and one class of LINE elements (LINE LTE) was overrepresented in xylem, phloem and male catkins, but underrepresented in buds, female catkins, leaves and roots. Thus, these elements showed considerable differential methylation by tissue. LINE elements are more abundant in Populus compared to other plant genomes, and there appears to have been a recent expansion of this element class in the genome .
Genes were extensively methylated
Four to 12% of annotated protein-coding genes were methylated, with the level varying widely among tissues as discussed above. This is lower than the estimated 30% of methylated transcribed regions in Arabidopsis, but closer to the 16% predicted for rice . The pattern of methylation within and around protein-coding genes was consistent with that seen in previous studies [6, 14, 20, 50], with methylation high 5' and 3' of the transcribed portion of genes. Within the transcribed region, methylation was lowest near the transcription start and stop sites and increased with distance from them within the gene body. Interestingly, we observed a prominent methylation peak ~200 bp 5' of the transcription start site. A similar peak was seen in methylation profiles of A. thaliana embryos and endosperm in one study , but not in a second study . In Oryza spp., a small 5' peak was seen for methylation in CHH context, but not CHG or CG context , while no spike in methylation in any sequence context in this region was identified elsewhere . The cause of both the apparent peak and the incongruity of results remains unclear.
Promoter and gene body methylation is negatively correlated with transcription
Our data showed that promoter-methylated genes had a wider expression range and higher median expression than body-methylated genes in most tissues. Methylation upstream or downstream of genes is generally understood to repress transcription [7, 8, 59]. Our results support this notion, as promoter-methylated genes had lower expression than genes that were not called methylated at any feature. Surprisingly, our results also indicated that gene body methylation was more repressive of transcription than promoter methylation. This contradicts what has been reported for Arabidopsis, where body-methylated genes are often highly transcriptionally active [7, 59]. However, the relationship between gene body methylation and gene expression in plants appears to be confounded by additional factors such as gene length , and additional local epigenetic modifications. DNA methylation in gene bodies may not cause either absence or presence of transcription at all but rather mark splice junctions and thus be correlated to gene expression .
Several studies have examined the transcriptional effects of combinations of 5meC and histone modifications. In Arabidopsis seedlings, histone 3 lysine 4 monomethylation (H3K4me1) was highly correlated with CG-context methylation in transcribed regions of transcriptionally active genes , while H3K27me3 was anticorrelated . In Zea mays roots and shoots, genes with low levels of transcription had either 5meC or H3K27me3, also in an apparent mutually exclusive pattern . In rice shoots, a complex pattern was observed, with hypermethylated genes tending to have fewer histone modifications and lower transcription, while hypomethylated genes exhibited a range of expression, with concurrent H3K4me3 associated with higher transcription levels, and concurrent H3K27me3 associated with lower transcription levels . The emerging picture is of a complex hierarchy of combinations of 5meC with other epigenetic modifications, in addition to overall sequence context and chromatin context, that ultimately regulate transcription.
We examined the correlation between methylation and tissue predominance of gene expression in two ways: by comparing hierarchical clustering patterns of gene methylation and expression, and by querying methylation status of sets of genes deemed to be expressed in a tissue-preferential manner. Hierarchical clustering patterns revealed no large scale, consistent tissue-level patterns between methylation and expression. In Zea mays, DNA methylation in shoots and roots was also uncorrelated with differential gene expression on a genome scale . However, when methylation profiles of sets of genes with tissue-biased expression were examined, they did show differences in promoter and gene body methylation. Though small on average, the differences were highly statistically significant and consistent between promoters and gene bodies. This analysis suggests that DNA methylation may indeed play a role in directing or maintaining tissue differential gene expression, though its extent appears modest. To our knowledge, this is the first observation of genome scale tissue differentiation of gene expression with DNA methylation in plants.
Male catkins showed a unique pattern of methylation and associated gene expression
Surprisingly, male catkins had a far greater number of genes with body methylation than other sampled tissues, and the level of methylation of these genes was lower than that observed in other tissues. Expression of gene body-methylated genes was also higher than in other tissues except for female catkins. Three retroelement categories (DNA hAT, LINE1, LTR unknown) were underrepresented in the methylated fraction in male catkins, but overrepresented in all other tissue types. These unusual patterns may reflect the demethylation and reactivation of several types of transposable elements in pollen vegetative nuclei, with the associated siRNA cascade silencing transposable elements in sperm nuclei. Our male catkins were collected at anthesis (pollen release) and the majority of their biomass appeared to be made up of dehiscing (drying and opening) anthers; pollen DNA can therefore be expected to be highly represented in our male catkin data. Perhaps hypermethylation of surrounding transposable elements could also result in some associated low-level methylation of protein-coding genes, resulting in the unusual pattern of genic methylation seen. Genes that were body-methylated only in male catkins and not in the other tissue types had lower expression in male catkins than in all other tissue types except leaves, and gene ontology categorization of these genes showed enrichment of categories related to protein metabolism, cellular signaling, and DNA/RNA binding. At least some of these genes may play a role in pollen-associated changes in small RNA metabolism and associated DNA methylation. In contrast, female catkins did not show a distinctive pattern of DNA methylation or associated gene expression, even though genome-wide demethylation has been observed in endosperm relative to embryo tissue .
Active demethylation is brought about by DEMETER, which is expressed specifically in the central cell of the female gametophyte and removes methylated cytosines via a mechanism involving single-strand break repair [63, 64]. We believe the difference between male and female catkins was mainly because we collected female catkins during early pollen release, well before endosperm and embryo development was likely to have begun on a large scale. In addition, examination of a subset of our collected female inflorescences did not show any signs of seed development when a subsample of ovules was dissected (data not shown).
Epigenomic studies have been applied to very few plant species to date. Our study is the first description of epigenomic differentiation among tissues in any tree or perennial plant species at genome scale resolution. We sequenced methylated DNA from seven distinct tissues representing a wide range of developmental variation. Although the general pattern of chromosome and genic methylation agrees with that of Arabidopsis and rice, there were a number of important differences or elaborations that may relate to the distinctive biology and evolution of poplar, and warrant further analysis. These include the degree of tissue-specific methylation throughout the genome and its association with genes; the negative association of gene body methylation with gene expression; the modest but consistent association of tissue-differential gene expression with promoter and gene body methylation; the peak in methylation 5' to genes; and the distinctive pattern of male catkin transposon and gene body methylation. The genomic catalog presented here will also provide a foundation to inform a variety of other investigations, including those related to natural variation in rate of recombination throughout the genome, position effects observed during genetic engineering, and the interspecific heterosis and gender differentiation (dioecy) that are observed in poplar and many other plant species.
Genomic DNA for most tissues was obtained from P. trichocarpa clone Nisqually-1, the genotype that was used for the published genome sequence . Mature leaves were collected in September 2008, and buds were collected in August-September 2008, December 2008 and March 2009 from two-year-old trees at a field site in Corvallis, Oregon, USA. Fine roots and xylem and phloem ~15 cm below the apical bud were collected in August 2009 from two-year-old Nisqually trees maintained in a lath house at Oregon State University, Corvallis, Oregon. Male and female catkins were collected at anthesis in March 2009 from mature wild P. trichocarpa in Corvallis, Oregon. Male catkins were collected at the start of pollen shed, and female stigmas had adhering pollen, but dissection of a small sample (~20) of ovules from several different inflorescences showed no signs of seed development.
Methylated DNA immunoprecipitation
The DNA extraction method was based on a previously published method . Approximately 250 mg of tissue was ground to a fine powder in liquid nitrogen, then homogenized in extraction buffer (1 ml; 50 mM Tris [pH 8], 5 mM EDTA, 0.35 M sorbitol, 10% [w/v] polyethylene glycol [MW 8000], 1% [w/v] N-laurylsarcosine, 0.1% [w/v] bovine serum albumin (BSA), 0.1% [v/v] β-mercaptoethanol, 1% hexadecyltrimethylammonium bromide, 2 M NaCl). The homogenate was incubated at 60-65°C for 60 min in sterile 1.5 ml microcentrifuge tubes, followed by extraction with ~500 μl of 24:1 phenol:chloroform. Following centrifugation at 13,000 × g for 10 min, the aqueous layer (200-300 μl) was moved to a new, sterile 1.5 ml microfuge tube. DNA was precipitated with two volumes of ice-cold 95% ethanol at 4°C for 2-24 hours and subsequently pelleted by centrifugation at 13,000 × g for 5 min. The pellet was rinsed with 500 μl of 70% ethanol, then dried in a speed-vac for 5 min and finally resuspended in 50 μl TE buffer. Fifty microliters of 10 μg/ml RNase enzyme (Qiagen, Valencia, CA) were added and the mixture incubated at 37°C for 60 min to digest RNA. DNA concentration was determined using an ND-1000 spectrophotometer (Thermo Fisher Scientific, Waltham, MA).
Prior to immunoprecipitation, genomic DNA was sheared to 200-1000 bp fragments and ligated to Illumina sequencing adaptors as described previously. Ten to twenty micrograms of genomic DNA were diluted to 300 μl in TE buffer. The DNA was sheared for 18 min with 30 sec on/off cycling at 4°C in a Diagenode Bioruptor (Sparta, NJ). The sheared fragments were recovered using a Qiaquick PCR purification kit (Qiagen) according to the manufacturer's instructions (final elution volume 52 μl). The fragments were end-repaired by mixing 50 μl of the DNA sample, 25 μl sterile distilled H2O, 10 μl T4 DNA ligase buffer (Invitrogen, Carlsbad, CA), 4 μl 20 mM dNTP mix, 5 μl T4 DNA polymerase (Invitrogen or New England Biolabs, Ipswich, MA), 1 μl Klenow DNA polymerase (Invitrogen or New England Biolabs) and 5 μl T4 polynucleotide kinase (New England Biolabs), and incubating for 30 min at room temperature. Illumina sequencing adaptors (San Diego, CA) were ligated to the DNA after end repair. Prior to MeDIP, the DNA was denatured in a 100°C heat block for 10 min and snap-cooled on ice for 5 min. The cooled single-stranded DNA was immunoprecipitated overnight on a rotator at 4°C with 1 μl of anti-5me-cytidine antibody (Diagenode, #MAb-5MECYT-100) in immunoprecipitation buffer (100 mM Na-Phosphate, pH 7.0; 1.4 M NaCl; 0.5% Triton X-100). Bound DNA was precipitated with sheep anti-mouse IgG Dynabeads (M-280, Invitrogen). The bound DNA was washed three times with immunoprecipitation buffer for 10 min at room temperature with shaking, resuspended in 250 μl proteinase K digestion buffer (5 mM Tris, pH 8.0; 1 mM EDTA, pH 8.0; 0.05% SDS) with 7 μl of 10 mg/ml proteinase K, and incubated for 3 hrs on an end-over-end rotator at 50°C to digest the antibodies and release the 5meC-containing DNA. The DNA was extracted once with 250 μl phenol and once with 250 μl chloroform, then precipitated by adding 500 μl ethanol with 400 mM NaCl. To improve recovery, 1 μl glycogen (20 mg/ml) was added. DNA pellets were washed with 70% ethanol, resuspended in 50 μl TE buffer and stored at -20°C.
Immunoprecipitated DNA was tested for enrichment of methylated regions by duplex PCR targeting genomic regions expected to be differentially methylated. The expected methylated target was a putative retroelement (Poptr1_1/LG_XV:6357939-6358210, Additional file 2). The expected unmethylated target was a histone H2B gene (Poptr1_1/LG_II:21650848-21651585, Additional file 3). Relative enrichment was assessed qualitatively by brightness of bands on an electrophoretic gel.
Illumina sequencing library preparation
The immunoprecipitated DNA was amplified by PCR with primers PE_PCR1.0 and PE_PCR2.0 (Additional data file 3) to produce sequencing libraries. The number of PCR cycles required to produce a library for Illumina sequencing of recovered DNA was determined by testing a range of cycle numbers (15, 18, 21 cycles). For each library, three separate 20 μl PCRs with the appropriate number of cycles were combined. DNA was purified on a Qiagen PCR purification column (final elution volume of 52 μl TE buffer). DNA samples were quantified using a NanoDrop ND-1000 spectrophotometer, then diluted to 10 nM for sequencing on an Illumina 1G or GAIIx Genome Analyzer.
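The dilution to 10 nM requires converting a spectrophotometer reading (ng/μl) into molarity. The following is a minimal sketch of that arithmetic, not the authors' procedure. It assumes double-stranded DNA at roughly 660 g/mol per base pair; the function names and the mean fragment length are hypothetical inputs, since the text does not report library fragment sizes.

```python
# Approximate average mass of one double-stranded DNA base pair (g/mol).
# This is a commonly used rule-of-thumb value, not a figure from the paper.
BP_MASS_G_PER_MOL = 660.0


def molarity_nM(conc_ng_per_ul, mean_fragment_bp):
    """Convert a dsDNA concentration in ng/ul to nM for a given mean fragment length."""
    return conc_ng_per_ul * 1e6 / (BP_MASS_G_PER_MOL * mean_fragment_bp)


def stock_volume_ul(conc_ng_per_ul, mean_fragment_bp, target_nM=10.0, final_ul=20.0):
    """Volume of stock library needed to make final_ul at target_nM (C1*V1 = C2*V2)."""
    stock_nM = molarity_nM(conc_ng_per_ul, mean_fragment_bp)
    if stock_nM < target_nM:
        raise ValueError("stock is already more dilute than the target")
    return final_ul * target_nM / stock_nM
```

For a library whose stock works out to 100 nM, `stock_volume_ul` returns 2 μl of stock for a 20 μl final volume, with the remainder made up in buffer.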
One microgram of genomic DNA from three biological replicates of each of three tissue types (autumn buds, winter buds, spring buds) was bisulfite-treated following the instructions included with the EpiTect Bisulfite kit (Qiagen). Prior to bisulfite treatment, aliquots from genomic DNA samples representing three biological replicates of each bud stage were pooled in equimolar amounts to serve as an untreated control. Targets for bisulfite sequencing were chosen to represent a variety of MeDIP-seq coverage levels (Additional file 2). The bisulfite-sequencing targets chosen for this study had a cytosine content ranging from 7.1%-24.0%. PCR primers were designed with Primer3 software, manually selecting regions with few cytosine bases in order to minimize primer degeneracy. PCRs were performed with Platinum Taq DNA polymerase (Invitrogen) in 25 μl reaction volumes containing 100 ng template DNA and 10 ng of each primer. PCR products were cloned following instructions included in TOPO TA cloning kits (Invitrogen). Ten clones amplified and isolated for each target region were sequenced from each of the three tissue types, and four clones were sequenced from the untreated pool. Sequences were aligned using ClustalW. Cytosine context was tallied and averaged for each set of clones.
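The tallying of cytosine context across clones can be sketched as follows. This is an illustration under stated assumptions, not the authors' script: each clone is assumed to be already aligned to the untreated reference with no gaps and equal length, only the top strand is scored, and the function names are hypothetical. After bisulfite treatment and PCR, a retained C is read as methylated and a T at a reference C as unmethylated.

```python
from collections import defaultdict


def cytosine_context(ref, i):
    """Classify the cytosine at ref[i] as CG, CHG or CHH (H = A, C or T).

    Returns None when too few downstream bases exist to decide.
    """
    downstream = ref[i + 1:i + 3]
    if downstream[:1] == "G":
        return "CG"
    if len(downstream) < 2:
        return None
    return "CHG" if downstream[1] == "G" else "CHH"


def methylation_by_context(ref, clones):
    """Fraction of methylated cytosines per context, pooled over all clones."""
    methylated = defaultdict(int)
    scored = defaultdict(int)
    for i, base in enumerate(ref):
        if base != "C":
            continue
        context = cytosine_context(ref, i)
        if context is None:
            continue
        for clone in clones:
            if clone[i] == "C":      # protected from conversion: methylated
                methylated[context] += 1
                scored[context] += 1
            elif clone[i] == "T":    # converted: unmethylated
                scored[context] += 1
            # any other base is treated as a sequencing error and skipped
    return {ctx: methylated[ctx] / scored[ctx] for ctx in scored}
```

Pooling over clones gives one value per context and tissue; averaging per clone first would weight clones equally instead, and the text does not say which was used.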
Bioinformatic Processing and Statistical Analyses
Illumina 40- or 36-nt sequencing reads were trimmed to a length of 32 nt. Where reads were identical ("clonal reads"), all but one were removed (Additional files 15, 16). Reads were then aligned to the P. trichocarpa V2.2 reference genome and the P. trichocarpa chloroplast genome (http://genome.ornl.gov/poplar_chloroplast/) with Eland (http://www.illumina.com/Documents/products/datasheets/datasheet_genomic_sequence.pdf) and HashMatch. Eland alignments were performed using default parameters, which allow two mismatches per 32-mer read. HashMatch alignments require perfect matches. Reads that aligned to the chloroplast or mitochondrial genomes (allowing up to two mismatches) were removed unless they were also perfect matches to the nuclear genome. Eland alignments were used to calculate the overall coverage per nucleotide as a measure of sequencing depth for reads that align at unique positions, again allowing up to two mismatches. In a separate but parallel process, HashMatch was used to identify reads that align to multiple locations. These k-mer repeat reads were divided equally among all locations to which they aligned (allowing fractional counts), and coverage per nucleotide was calculated. Uniquely aligning reads were excluded from this branch of the pipeline, which we refer to as "distributed" and/or "k-mer" repeats (Additional file 15). Sequencing depth or "coverage" was quantified by calculating Reads Per Kilobase of target sequence per Million reads mapped (RPKM). The RPKM measure was applied to one-kilobase (kb) windows tiled across the entire genome, which comprised 378,538 windows. To check for potential bias toward cytosine-rich regions, numbers of cytosines per window were tallied and compared to RPKM calculations (Additional data file 17); no relationship of RPKM to cytosine density was observed.
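The windowed RPKM calculation and the equal division of multiply aligning reads can be sketched as below. This is a simplified illustration rather than the pipeline itself: the function names and input layout are hypothetical, reads are assigned to a window by their start coordinate (an assumption), and unique and multi-location reads share one function, whereas the text describes them as separate branches.

```python
from collections import defaultdict

WINDOW_BP = 1000  # 1-kb tiled windows, as in the text


def remove_clonal(read_sequences):
    """Keep one copy of each identical read sequence, preserving first-seen order."""
    return list(dict.fromkeys(read_sequences))


def window_counts(alignments, window_bp=WINDOW_BP):
    """Sum read weights per (chromosome, window index).

    alignments maps a read id to its list of (chromosome, start) hits.
    A read with n hits contributes 1/n to each hit, so fractional counts occur.
    """
    counts = defaultdict(float)
    for hits in alignments.values():
        weight = 1.0 / len(hits)
        for chrom, start in hits:
            counts[(chrom, start // window_bp)] += weight
    return dict(counts)


def rpkm(counts, total_mapped_reads, window_bp=WINDOW_BP):
    """Reads per kilobase of window per million mapped reads."""
    scale = 1e9 / (total_mapped_reads * window_bp)
    return {window: count * scale for window, count in counts.items()}
```

With 1-kb windows the per-kilobase term is 1, so RPKM reduces to the window count divided by the number of mapped reads in millions.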
To study methylation-gene expression associations, RPKM was determined for specific features associated with annotated gene models (promoter, 5'UTR, gene body, exons, introns, 3'UTR, intergenic regions). To identify windows whose RPKM was significantly above that of the non-immunoprecipitated control (input), a false discovery rate (FDR) was calculated from 1-kb tiled windows for four lanes of input and for the pooled values of all MeDIP tissues (sample). The arithmetic difference between input lanes was calculated, the distribution of these differences was determined for all possible permutations of input-input differences, and the mean of these distributions was calculated. This process was repeated for the differences obtained by subtracting the average of all four input lanes from the sample. These distributions of differences were used to determine an RPKM cutoff that yielded 100 significant windows in the sample-input comparison for every one significant window in the input-input comparison, and thus a 1% FDR. By this procedure we arrived at an RPKM cutoff of 4.83. Genome feature context (promoter, intron, intergenic, etc.) was assigned to the collection of methylated windows. The results of this analysis were used for all tests of the relationship of methylation to expression, and for descriptions of tissue-specific methylation patterns.
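The cutoff search described above can be sketched as follows, under simplifying assumptions. The function name is hypothetical, and the sketch takes the two sets of per-window differences as given, with the same number of windows in each; how the input-input differences are combined across lane permutations is not reproduced here. The empirical FDR at a cutoff is the number of input-input windows at or above it divided by the number of sample-input windows at or above it.

```python
from bisect import bisect_left


def fdr_cutoff(null_diffs, sample_diffs, target_fdr=0.01):
    """Smallest observed sample-input difference whose empirical FDR meets the target.

    null_diffs:   per-window input-input differences (the null distribution)
    sample_diffs: per-window sample-input differences
    Returns None if no cutoff reaches the target FDR.
    """
    null_sorted = sorted(null_diffs)
    sample_sorted = sorted(sample_diffs)
    for cutoff in sample_sorted:
        # Windows at or above the cutoff in each comparison.
        called = len(sample_sorted) - bisect_left(sample_sorted, cutoff)
        false_calls = len(null_sorted) - bisect_left(null_sorted, cutoff)
        if false_calls / called <= target_fdr:
            return cutoff
    return None
```

A 1% target corresponds to the ratio in the text of one significant input-input window for every 100 significant sample-input windows.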
As alternative methods to quantify enrichment, we identified 1-kb windows with significantly more aligned reads in the MeDIP sample than in the input, based on Poisson and negative binomial distributions. First, we normalized input counts to those of each MeDIP sample. For each window, if the input count fell below the average input count for all windows, it was reset to that average value. We next used the cumulative Poisson probability distribution (CPPD) to estimate the probability of observing equal or greater read counts in the MeDIP sample than in the same window in the normalized input sample. Windows with a probability below 0.0001 were considered significant (comparisons of input samples showed that this method yielded approximately a 1% FDR).
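The Poisson test can be sketched with the standard library alone. This is a minimal illustration, not the authors' implementation: the function names are hypothetical, and normalization is assumed to mean scaling the input so that its total count equals the MeDIP total, which the text does not specify.

```python
import math


def poisson_upper_tail(k, lam):
    """P(X >= k) for X ~ Poisson(lam), with lam > 0 and integer k."""
    if k <= 0:
        return 1.0
    # Sum the probability mass below k in log space, then take the complement.
    below = sum(
        math.exp(-lam + i * math.log(lam) - math.lgamma(i + 1)) for i in range(k)
    )
    return max(0.0, 1.0 - below)


def poisson_enriched(medip_counts, input_counts, alpha=1e-4):
    """Flag windows whose MeDIP count is unexpectedly high given the input."""
    scale = sum(medip_counts) / sum(input_counts)       # assumed normalization
    normalized = [c * scale for c in input_counts]
    floor = sum(normalized) / len(normalized)           # genome-wide average
    expected = [max(c, floor) for c in normalized]      # reset low windows
    return [
        poisson_upper_tail(m, lam) < alpha
        for m, lam in zip(medip_counts, expected)
    ]
```

Resetting low input windows to the average keeps the expected count from approaching zero, so windows with sparse input coverage are not called significant on the strength of a handful of reads.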
Finally, we used a negative binomial distribution to estimate the statistical significance of the peaks. The motivation for using the negative binomial is that it is a two-parameter distribution that, unlike the Poisson, in which the mean and variance are equal, allows us to fit the observed variance of the data. We had observed that the variance across biological replicates was often larger than the mean, and was therefore not consistently fit by a Poisson distribution. The same observation has been made for RNA-seq data, where the use of negative binomials to estimate the significance of differential counts for a gene between samples has become standard. The implementation of this approach was in all other ways identical to the Poisson method described above, except that the probability of observing the MeDIP counts in a window, compared to those of the input samples, was estimated using the negative binomial distribution. The parameters of the negative binomial distribution for each tissue were fit by measuring the variance in our data across biological replicates using the Matlab function nbinfit. The p-value for each window was then estimated using the Matlab function nbincdf, which computes the negative binomial cumulative distribution function. The first pass of this analysis used a p-value cutoff of 10^-4, which corresponded to an estimated false discovery rate of 5% based on variation among biological replicates of the bud tissue samples. These parameters called only 653 windows methylated in all tissues, 326,478 windows non-methylated in all tissues, and 51,405 windows differentially methylated among tissues. In a second pass, a peak was called in each window that had a p-value of 10^-3, which corresponded to an estimated false discovery rate of 20%, also based on biological replicates of the bud tissue samples.
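To make the two-parameter fit concrete, here is a hedged Python sketch of a negative binomial fit and upper-tail probability. It is not a port of the Matlab workflow: nbinfit estimates parameters by maximum likelihood, whereas this sketch substitutes a simpler method-of-moments fit, and all function names are hypothetical. The parameterization counts failures before r successes with success probability p, so the mean is r(1-p)/p and the variance is r(1-p)/p^2.

```python
import math


def fit_nb_moments(counts):
    """Method-of-moments estimate of (r, p); requires variance > mean."""
    n = len(counts)
    mean = sum(counts) / n
    var = sum((c - mean) ** 2 for c in counts) / (n - 1)
    if var <= mean:
        raise ValueError("no overdispersion: a Poisson model would fit instead")
    p = mean / var
    r = mean * mean / (var - mean)
    return r, p


def nb_pmf(k, r, p):
    """P(X = k) for the negative binomial, computed in log space."""
    log_pmf = (
        math.lgamma(r + k) - math.lgamma(r) - math.lgamma(k + 1)
        + r * math.log(p) + k * math.log1p(-p)
    )
    return math.exp(log_pmf)


def nb_upper_tail(k, r, p):
    """P(X >= k): probability of a count at least as large as k."""
    return max(0.0, 1.0 - sum(nb_pmf(i, r, p) for i in range(k)))
```

A window would then be called at the first-pass threshold when `nb_upper_tail` for its MeDIP count falls below 1e-4. The variance exceeding the mean is exactly the condition under which the fit exists, which is the overdispersion the text describes.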
Using the results of this analysis, the agreement among the three methods was calculated by dividing the number of methylated windows called by all of the methods by the total number of windows called by any of the methods. Genes with methylation at promoters and/or within annotated transcribed regions (gene bodies) were compared to archival expression microarray data to determine the correlation between methylation and expression. Mann-Whitney tests were used to compare RPKM or gene expression of groups of genes among tissues, assuming independence of genes within biological replicates, and sign tests were used to evaluate the statistical significance of consistency among tissues in genic methylation and expression patterns.
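The agreement measure is the size of the intersection of the call sets divided by the size of their union. A minimal sketch follows; the function name is hypothetical, and the return value of 0.0 when no method calls any window is an assumption, since the text does not address that case.

```python
def method_agreement(*call_sets):
    """Windows called by every method divided by windows called by any method."""
    sets = [set(calls) for calls in call_sets]
    called_by_any = set().union(*sets)
    if not called_by_any:
        return 0.0  # assumed convention when nothing is called
    called_by_all = set.intersection(*sets)
    return len(called_by_all) / len(called_by_any)
```

Each argument is the collection of window identifiers called methylated by one method, for example the RPKM cutoff, the Poisson test and the negative binomial test.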
Enrichment of gene ontology (GO) categories within sets of methylated genes was tested using the AgriGO singular enrichment analysis tool (http://bioinfo.cau.edu.cn/agriGO/analysis.php) applied to the Poplar v2.2 genome reference gene ontology set, using default parameters except for the selection of the Bonferroni multiple-test correction method. GO enrichments were visualized using Cytoscape v.1.4 (http://www.cytoscape.org). The RPKM data are available for browsing and downloading using GBrowse version 2.13 (http://gmod.org) at http://poplar.cgrb.oregonstate.edu. All MeDIP-seq data were submitted to the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA) database (accession #SRA039208.1).
We are grateful to Chris Sullivan, Mark Dasenko, Scott Givan, Steve Drake and Matthew Peterson at the OSU Center for Genome Research and Biocomputing (CGRB) for their help in sequencing and data processing; Elizabeth Etherington, Cathleen Ma and Ruoqing Zhu of the Department of Forest Ecosystems and Society for their help in project management and material collections; and Palitha Dharmawardana of the Department of Botany and Plant Pathology for his help in accessing the microarray data. We thank the U.S. Department of Energy Plant Feedstock Genomics program for funding (DE-FG02-08ER64665).
- Wyatt G: Recognition and estimation of 5-methylcytosine in nucleic acids. Biochemical Journal. 1951, 48: 581.
- Doskocil J, Sorm F: Distribution of 5-methylcytosine in pyrimidine sequences of deoxyribonucleic acids. Biochim Biophys Acta. 1962, 55: 953-959. 10.1016/0006-3002(62)90909-5.
- Goll MG, Bestor TH: Eukaryotic cytosine methyltransferases. Annu Rev Biochem. 2005, 74: 481-514. 10.1146/annurev.biochem.74.010904.153721.
- Vaughn MW, Tanurdzić M, Lippman Z, Jiang H, Carrasquillo R, Rabinowicz PD, Dedhia N, McCombie WR, Agier N, Bulski A, Colot V, Doerge RW, Martienssen RA: Epigenetic natural variation in Arabidopsis thaliana. PLoS Biol. 2007, 5: e174. 10.1371/journal.pbio.0050174.
- Suzuki MM, Bird A: DNA methylation landscapes: provocative insights from epigenomics. Nat Rev Genet. 2008, 9: 465-476.
- Zemach A, McDaniel IE, Silva P, Zilberman D: Genome-wide evolutionary analysis of eukaryotic DNA methylation. Science. 2010, 328: 916-919. 10.1126/science.1186366.
- Zhang X, Yazaki J, Sundaresan A, Cokus S, Chan SW-L, Chen H, Henderson IR, Shinn P, Pellegrini M, Jacobsen SE, Ecker JR: Genome-wide high-resolution mapping and functional analysis of DNA methylation in arabidopsis. Cell. 2006, 126: 1189-1201. 10.1016/j.cell.2006.08.003.
- Zilberman D, Gehring M, Tran RK, Ballinger T, Henikoff S: Genome-wide analysis of Arabidopsis thaliana DNA methylation uncovers an interdependence between methylation and transcription. Nat Genet. 2007, 39: 61-69. 10.1038/ng1929.
- Bender J: DNA methylation and epigenetics. Annu Rev Plant Biol. 2004, 55: 41-68. 10.1146/annurev.arplant.55.031903.141641.
- Fukuda T, Sakai M, Takano H, Ono K, Takio S: Hypermethylation of retrotransposons in the liverwort Marchantia paleacea var. diptera. Plant Cell Rep. 2004, 22: 594-598. 10.1007/s00299-003-0739-x.
- Widman N, Jacobsen SE, Pellegrini M: Determining the conservation of DNA methylation in Arabidopsis. Epigenetics. 2009, 4: 119-124. 10.4161/epi.4.2.8214.
- He G, Zhu X, Elling AA, Chen L, Wang X, Guo L, Liang M, He H, Zhang H, Chen F, Qi Y, Chen R, Deng X-W: Global epigenetic and transcriptional trends among two rice subspecies and their reciprocal hybrids. Plant Cell. 2010, 22: 17-33. 10.1105/tpc.109.072041.
- Chan SW-L, Henderson IR, Jacobsen SE: Gardening the genome: DNA methylation in Arabidopsis thaliana. Nat Rev Genet. 2005, 6: 351-360.
- Lister R, O'Malley RC, Tonti-Filippini J, Gregory BD, Berry CC, Millar AH, Ecker JR: Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell. 2008, 133: 523-536. 10.1016/j.cell.2008.03.029.
- Finnegan EJ, Peacock WJ, Dennis ES: Reduced DNA methylation in Arabidopsis thaliana results in abnormal plant development. Proc Natl Acad Sci USA. 1996, 93: 8449-8454. 10.1073/pnas.93.16.8449.
- Kankel MW, Ramsey DE, Stokes TL, Flowers SK, Haag JR, Jeddeloh JA, Riddle NC, Verbsky ML, Richards EJ: Arabidopsis MET1 cytosine methyltransferase mutants. Genetics. 2003, 163: 1109-1122.
- Cao X, Aufsatz W, Zilberman D, Mette MF, Huang MS, Matzke M, Jacobsen SE: Role of the DRM and CMT3 methyltransferases in RNA-directed DNA methylation. Curr Biol. 2003, 13: 2212-2217. 10.1016/j.cub.2003.11.052.
- Chan SW-L, Henderson IR, Zhang X, Shah G, Chien JS-C, Jacobsen SE: RNAi, DRD1, and histone methylation actively target developmentally important non-CG DNA methylation in arabidopsis. PLoS Genet. 2006, 2: e83. 10.1371/journal.pgen.0020083.
- Singh A, Zubko E, Meyer P: Cooperative activity of DNA methyltransferases for maintenance of symmetrical and non-symmetrical cytosine methylation in Arabidopsis thaliana. Plant J. 2008, 56: 814-823. 10.1111/j.1365-313X.2008.03640.x.
- Cokus SJ, Feng S, Zhang X, Chen Z, Merriman B, Haudenschild CD, Pradhan S, Nelson SF, Pellegrini M, Jacobsen SE: Shotgun bisulphite sequencing of the Arabidopsis genome reveals DNA methylation patterning. Nature. 2008, 452: 215-219. 10.1038/nature06745.
- Zemach A, Kim MY, Silva P, Rodrigues JA, Dotson B, Brooks MD, Zilberman D: Local DNA hypomethylation activates genes in rice endosperm. Proc Natl Acad Sci USA. 2010, 107: 18729-18734. 10.1073/pnas.1009695107.
- Kashkush K, Khasdan V: Large-scale survey of cytosine methylation of retrotransposons and the impact of readout transcription from long terminal repeats on expression of adjacent rice genes. Genetics. 2007, 177: 1975-1985. 10.1534/genetics.107.080234.
- Soppe WJ, Jacobsen SE, Alonso-Blanco C, Jackson JP, Kakutani T, Koornneef M, Peeters AJ: The late flowering phenotype of fwa mutants is caused by gain-of-function epigenetic alleles of a homeodomain gene. Mol Cell. 2000, 6: 791-802. 10.1016/S1097-2765(05)00090-0.
- Lippman Z, Gendrel A-V, Black M, Vaughn MW, Dedhia N, McCombie WR, Lavine K, Mittal V, May B, Kasschau KD, Carrington JC, Doerge RW, Colot V, Martienssen R: Role of transposable elements in heterochromatin and epigenetic control. Nature. 2004, 430: 471-476. 10.1038/nature02651.
- Chinnusamy V, Zhu J-K: Epigenetic regulation of stress responses in plants. Curr Opin Plant Biol. 2009, 12: 133-139. 10.1016/j.pbi.2008.12.006.
- Mirouze M, Reinders J, Bucher E, Nishimura T, Schneeberger K, Ossowski S, Cao J, Weigel D, Paszkowski J, Mathieu O: Selective epigenetic control of retrotransposition in Arabidopsis. Nature. 2009, 461: 427-430. 10.1038/nature08328.
- Lang-Mladek C, Popova O, Kiok K, Berlinger M, Rakic B, Aufsatz W, Jonak C, Hauser M-T, Luschnig C: Transgenerational inheritance and resetting of stress-induced loss of epigenetic gene silencing in Arabidopsis. Mol Plant. 2010, 3: 594-602. 10.1093/mp/ssq014.
- Boyko A, Blevins T, Yao Y, Golubov A, Bilichak A, Ilnytskyy Y, Hollunder J, Hollander J, Meins F, Kovalchuk I: Transgenerational adaptation of Arabidopsis to stress requires DNA methylation and the function of Dicer-like proteins. PLoS ONE. 2010, 5: e9514. 10.1371/journal.pone.0009514.
- Boyko A, Kovalchuk I: Transgenerational response to stress in Arabidopsis thaliana. Plant Signal Behav. 2010, 5: 995-998. 10.4161/psb.5.8.12227.
- Gourcilleau D, Bogeat-Triboulot M-B, Le Thiec D, Lafon-Placette C, Delaunay A, El Soud W, Brignolas F, Maury S: DNA methylation and histone acetylation: genotypic variations in hybrid poplars, impact of water deficit and relationships with productivity. Ann For Sci. 2010, 67: 208-218. 10.1051/forest/2009101.
- Raj S, Bräutigam K, Hamanishi ET, Wilkins O, Thomas BR, Schroeder W, Mansfield SD, Plant AL, Campbell MM: Clone history shapes Populus drought responses. Proc Natl Acad Sci USA. 2011, 108: 12521-12526. 10.1073/pnas.1103341108.
- Santamaría ME, Hasbún R, Valera MJ, Meijón M, Valledor L, Rodríguez JL, Toorop PE, Cañal MJ, Rodríguez R: Acetylated H4 histone and genomic DNA methylation patterns during bud set and bud burst in Castanea sativa. J Plant Physiol. 2009, 166: 1360-1369. 10.1016/j.jplph.2009.02.014.
- Fraga MF, Cañal MJ, Rodríguez R: Phase-change related epigenetic and physiological changes in Pinus radiata D. Don. Planta. 2002, 215: 672-678. 10.1007/s00425-002-0795-4.
- Valledor L, Hasbún R, Meijón M, Rodríguez JL, Santamaría E, Viejo M, Berdasco M, Feito I, Fraga MF, Cañal MJ, Rodríguez R: Involvement of DNA methylation in tree development and micropropagation. Plant Cell, Tissue and Organ Culture. 2007, 91: 75-86. 10.1007/s11240-007-9262-z.
- Fraga MF, Rodríguez R, Cañal MJ: Genomic DNA methylation-demethylation during aging and reinvigoration of Pinus radiata. Tree Physiol. 2002, 22: 813-816. 10.1093/treephys/22.11.813.
- Valledor L, Meijón M, Hasbún R, Jesús Cañal M, Rodríguez R: Variations in DNA methylation, acetylated histone H4, and methylated histone H3 during Pinus radiata needle maturation in relation to the loss of in vitro organogenic capability. J Plant Physiol. 2010, 167: 351-357. 10.1016/j.jplph.2009.09.018.
- Baurens F-C, Nicolleau J, Legavre T, Verdeil J-L, Monteuuis O: Genomic DNA methylation of juvenile and mature Acacia mangium micropropagated in vitro with reference to leaf morphology as a phase change marker. Tree Physiol. 2004, 24: 401-407. 10.1093/treephys/24.4.401.
- Viejo M, Rodríguez R, Valledor L, Pérez M, Cañal MJ, Hasbún R: DNA methylation during sexual embryogenesis and implications on the induction of somatic embryogenesis in Castanea sativa Miller. Sex Plant Reprod. 2010, 23: 315-323. 10.1007/s00497-010-0145-9.
- Laird PW: Principles and challenges of genomewide DNA methylation analysis. Nat Rev Genet. 2010, 11: 191-203.
- Weber M, Davies JJ, Wittig D, Oakeley EJ, Haase M, Lam WL, Schübeler D: Chromosome-wide and promoter-specific analyses identify sites of differential DNA methylation in normal and transformed human cells. Nat Genet. 2005, 37: 853-862. 10.1038/ng1598.
- Weber M, Hellmann I, Stadler MB, Ramos L, Pääbo S, Rebhan M, Schübeler D: Distribution, silencing potential and evolutionary impact of promoter DNA methylation in the human genome. Nat Genet. 2007, 39: 457-466. 10.1038/ng1990.
- Mukhopadhyay R, Yu W, Whitehead J, Xu J, Lezcano M, Pack S, Kanduri C, Kanduri M, Ginjala V, Vostrov A, Quitschke W, Chernukhin I, Klenova E, Lobanenkov V, Ohlsson R: The binding sites for the chromatin insulator protein CTCF map to DNA methylation-free domains genome-wide. Genome Res. 2004, 14: 1594-1602. 10.1101/gr.2408304.
- Lewis ZA, Honda S, Khlafallah TK, Jeffress JK, Freitag M, Mohn F, Schübeler D, Selker EU: Relics of repeat-induced point mutation direct heterochromatin formation in Neurospora crassa. Genome Res. 2009, 19: 427-437.
- Frommer M, McDonald LE, Millar DS, Collis CM, Watt F, Grigg GW, Molloy PL, Paul CL: A genomic sequencing protocol that yields a positive display of 5-methylcytosine residues in individual DNA strands. Proc Natl Acad Sci USA. 1992, 89: 1827-1831. 10.1073/pnas.89.5.1827.
- Eckhardt F, Lewin J, Cortese R, Rakyan VK, Attwood J, Burger M, Burton J, Cox TV, Davies R, Down TA, Haefliger C, Horton R, Howe K, Jackson DK, Kunde J, Koenig C, Liddle J, Niblett D, Otto T, Pettett R, Seemann S, Thompson C, West T, Rogers J, Olek A, Berlin K, Beck S: DNA methylation profiling of human chromosomes 6, 20 and 22. Nat Genet. 2006, 38: 1378-1385. 10.1038/ng1909.
- Meissner A, Mikkelsen TS, Gu H, Wernig M, Hanna J, Sivachenko A, Zhang X, Bernstein BE, Nusbaum C, Jaffe DB, Gnirke A, Jaenisch R, Lander ES: Genome-scale DNA methylation maps of pluripotent and differentiated cells. Nature. 2008, 454: 766-770.
- Tuskan GA, DiFazio S, Jansson S: The Genome of Black Cottonwood, Populus trichocarpa (Torr. & Gray). Science. 2006, 313: 1596-1604. 10.1126/science.1128691.
- Dharmawardhana P, Brunner AM, Strauss SH: Genome-wide transcriptome analysis of the transition from primary to secondary stem development in Populus trichocarpa. BMC Genomics. 2010, 11: 150. 10.1186/1471-2164-11-150.
- Rodgers-Melnick E, Mane SP, Dharmawardhana P, Slavov GT, Crasta OR, Strauss SH, Brunner AM, Difazio SP: Contrasting patterns of evolution following whole genome versus tandem duplication events in Populus. Genome Res. 2011.
- Feng S, Cokus SJ, Zhang X, Chen P-Y, Bostick M, Goll MG, Hetzel J, Jain J, Strauss SH, Halpern ME, Ukomadu C, Sadler KC, Pradhan S, Pellegrini M, Jacobsen SE: Conservation and divergence of methylation patterning in plants and animals. Proc Natl Acad Sci USA. 2010, 107: 8689-8694. 10.1073/pnas.1002720107.
- Gehring M, Bubb KL, Henikoff S: Extensive demethylation of repetitive elements during seed development underlies gene imprinting. Science. 2009, 324: 1447-1451. 10.1126/science.1171609.
- Douglas C, DiFazio S: Genetics and Genomics of Populus. The Populus genome and comparative genomics. Edited by: Jansson S, Bhalerao R, Groover A. 2010, Berlin: Springer-Verlag, 67-90.
- May BP, Lippman ZB, Fang Y, Spector DL, Martienssen RA: Differential regulation of strand-specific transcripts from Arabidopsis centromeric satellite repeats. PLoS Genet. 2005, 1: e79. 10.1371/journal.pgen.0010079.
- Zhang W, Lee H-R, Koo D-H, Jiang J: Epigenetic modification of centromeric chromatin: hypomethylation of DNA sequences in the CENH3-associated chromatin in Arabidopsis thaliana and maize. Plant Cell. 2008, 20: 25-34. 10.1105/tpc.107.057083.
- Zhang X: The epigenetic landscape of plants. Science. 2008, 320: 489-492. 10.1126/science.1153996.
- Lisch D: Epigenetic regulation of transposable elements in plants. Annu Rev Plant Biol. 2009, 60: 43-66. 10.1146/annurev.arplant.59.032607.092744.
- Selker EU, Tountas NA, Cross SH, Margolin BS, Murphy JG, Bird AP, Freitag M: The methylated component of the Neurospora crassa genome. Nature. 2003, 422: 893-897. 10.1038/nature01564.
- Hsieh T-F, Ibarra CA, Silva P, Zemach A, Eshed-Williams L, Fischer RL, Zilberman D: Genome-wide demethylation of Arabidopsis endosperm. Science. 2009, 324: 1451-1454. 10.1126/science.1172417.
- Zhang X, Shiu S-H, Shiu S, Cal A, Borevitz JO: Global analysis of genetic, epigenetic and transcriptional polymorphisms in Arabidopsis thaliana using whole genome tiling arrays. PLoS Genet. 2008, 4: e1000032. 10.1371/journal.pgen.1000032.
- Zhang X, Bernatavichute YV, Cokus S, Pellegrini M, Jacobsen SE: Genome-wide analysis of mono-, di- and trimethylation of histone H3 lysine 4 in Arabidopsis thaliana. Genome Biol. 2009, 10: R62. 10.1186/gb-2009-10-6-r62.
- Roudier F, Teixeira FK, Colot V: Chromatin indexing in Arabidopsis: an epigenomic tale of tails and more. Trends Genet. 2009, 25: 511-517. 10.1016/j.tig.2009.09.013.
- Wang X, Elling AA, Li X, Li N, Peng Z, He G, Sun H, Qi Y, Liu XS, Deng XW: Genome-wide and organ-specific landscapes of epigenetic modifications and their relationships to mRNA and small RNA transcriptomes in maize. Plant Cell. 2009, 21: 1053-1069. 10.1105/tpc.109.065714.
- Choi Y, Gehring M, Johnson L, Hannon M, Harada JJ, Goldberg RB, Jacobsen SE, Fischer RL: DEMETER, a DNA glycosylase domain protein, is required for endosperm gene imprinting and seed viability in arabidopsis. Cell. 2002, 110: 33-42. 10.1016/S0092-8674(02)00807-3.
- Johnson MA, Bender J: Reprogramming the epigenome during germline and seed development. Genome Biol. 2009, 10: 232. 10.1186/gb-2009-10-8-232.
- Crowley T, Muralitharan M, Stevenson T: Isolation of Conifer DNA: A superior method for the elimination of polysaccharides. Plant Molecular Biology Reporter. 21: 97a-97d.
- Pomraning KR, Smith KM, Freitag M: Genome-wide high throughput analysis of DNA methylation in eukaryotes. Methods. 2009, 47: 142-150. 10.1016/j.ymeth.2008.09.022.
- Filichkin SA, Priest HD, Givan SA, Shen R, Bryant DW, Fox SE, Wong W-K, Mockler TC: Genome-wide mapping of alternative splicing in Arabidopsis thaliana. Genome Res. 2010, 20: 45-58. 10.1101/gr.093302.109.
- Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B: Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008, 5: 621-628. 10.1038/nmeth.1226.
- Anders S, Huber W: Differential expression analysis for sequence count data. Genome Biol. 2010, 11: R106. 10.1186/gb-2010-11-10-r106.
- Snedecor G, Cochran W: Statistical Methods. 1967, Iowa State University Press, 6
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.