Genome-wide conserved consensus transcription factor binding motifs are hyper-methylated
© Choy et al; licensee BioMed Central Ltd. 2010
Received: 12 May 2010
Accepted: 27 September 2010
Published: 27 September 2010
DNA methylation can regulate gene expression by modulating the interaction between DNA and proteins or protein complexes. Conserved consensus motifs exist across the human genome ("predicted transcription factor binding sites": "predicted TFBS") but the large majority of these are proven by chromatin immunoprecipitation and high throughput sequencing (ChIP-seq) not to be biological transcription factor binding sites ("empirical TFBS"). We hypothesize that DNA methylation at conserved consensus motifs prevents promiscuous or disorderly transcription factor binding.
Using genome-wide methylation maps of the human heart and sperm, we found that all conserved consensus motifs, as well as the subset of those that reside outside CpG islands, have an aggregate profile of hyper-methylation. In contrast, empirical TFBS with conserved consensus motifs have a profile of hypo-methylation. 40% of empirical TFBS with conserved consensus motifs resided in CpG islands whereas only 7% of all conserved consensus motifs were in CpG islands. Finally, we identified a minority subset of TF whose profiles are either hypo-methylated or neutral at their respective conserved consensus motifs, implying that these TF may be responsible for establishing or maintaining an un-methylated DNA state, or that their binding is not regulated by DNA methylation.
Our analysis supports the hypothesis that at least for a subset of TF, empirical binding to conserved consensus motifs genome-wide may be controlled by DNA methylation.
DNA methylation is a well-studied component of epigenetics that, in the mammalian system, involves the 5' covalent modification of cytosine nucleotides by a methyl group. In humans, cytosine methylation almost always occurs in the context of a CG di-nucleotide, except in undifferentiated cells where methylation was recently identified in cytosines that do not precede guanines (non-CG methylation) [1, 2]. Regions of high CG density, termed "CpG islands", are usually un-methylated and found mainly in the 5' promoter ends of genes. However, high resolution maps of genome-wide methylation now show that cytosine methylation occurs throughout the genome, particularly in bodies of highly expressed genes, and up to 4.25% of cytosines in the human genome are methylated. Although the functional difference between CG and non-CG methylation requires further investigation, it is clear that DNA methylation itself significantly regulates gene expression and affects cellular processes in disease and development. For example, genome-wide methylation is altered during aging [5–7] and malignant transformation, and recent evidence supports the notion that methylation can be modulated by diet and environment [9–11]. Moreover, evidence of rapid and dynamic DNA methylation/de-methylation in vivo [12, 13] challenges the conventional view that DNA methylation is a stable or permanent epigenetic mark.
Mechanisms to explain aberrant de novo methylation in these contexts include (a) targeted recruitment of DNA methyl-transferases by cis-acting factors such as G9a or EZH2, or (b) loss of boundaries or "protective" transcription factors leading to the spread of DNA methylation into affected regions in the genome [15, 16]. Indeed, several non-redundant sequences matching the consensus motifs for transcription factors such as SP1 have been identified at sites that are resistant to de novo methylation in cancer. De novo methylated CpG islands in cancer, however, were characterized by the lack of sequence motif combinations and the absence of activating TF binding.
Conversely, the classical mechanism by which DNA methylation regulates transcription is through altered accessibility of transcription factor complexes to their cognate DNA binding sites [4, 18]. This mechanism is supported by many locus-specific examples [19, 20], but one that links the mechanism to environmental influences is the rodent model of maternal grooming. "Highly groomed" neonates developed hypo-methylation in the first exon of the glucocorticoid receptor gene, which in turn permits binding of the transcription factor NGFI-A to this DNA regulatory region and up-regulates glucocorticoid receptor expression. In contrast, "lesser groomed" neonates developed methylation in the same DNA regulatory sequence with corresponding inhibition of NGFI-A binding and down-regulation of glucocorticoid receptor expression.
Conserved consensus motifs have been predicted for transcription factor binding across the human genome, and empirical transcription factor binding sites (TFBS) have been determined biologically using the genome-wide technique which couples chromatin immunoprecipitation and high throughput sequencing (ChIP-seq). We have previously examined the genome-wide methylome of human hearts and sperm. We therefore set out to analyze the methylation state of TFBS in these methylation maps.
Conserved transcription factor consensus motifs (predicted TFBS) are hyper-methylated
We analyzed genome-wide DNA methylation profiles in 4 normal adult human hearts (a post-mitotic organ) and human sperm (germ cell) by employing the technique of MeDIP-seq. Analysis of our MeDIP-seq datasets was performed using the Bayesian deconvolution algorithm called BATMAN. Using BATMAN we assigned methylation scores across the genome for hearts and sperm. Because we hypothesized that the interaction between transcription factor complexes and their cognate DNA binding sites is modulated and influenced by methylation of cytosines within the DNA sequence [4, 18], we examined methylation profiles at genome-wide sites of transcription factor binding.
List of 106 transcription factor families (from UCSC genome web browser, Conserved TFBS track) and their detailed methylation profiles in hearts and sperm.
AHR-ARNT, AML1, AP, AREB6, ARP1, ATF, BACH, BRACH, CDP, CEBP, CHOP, COMP1, COUP, CP2, CREB, EN1, ER, FAC1, GATA, GCNF, GFI1, GR, HEN1, HMX1, HOX, HSF, HTF, IK, ISRE, LMO2COM, LUN1, LYF1, MEIS1, MIF1, MRF2, MSX1, MYB, MYCMAX, MYOD, MYOGNF1, MZF1, NCX, NF1, NFE2, NFKB, NRSF, OLF1, EP300, P53, PAX, PBX1, PPAR, RFX1, ROAZ, RORA, RP58, RREB1, SEF1, SPZ1, SREBP, SRF, STAT, TAL1-E47, TCF, TGIF, USF, XBP1, YY1, ZIC, ZID
CETS1P54, E2F, E4BP4, EGR, ELK1, MAZR, NFY, NRF1, SRY
CHX10, FOX, FREAC, LHX3, MEF2, POU, RSRFC4, S8, HFH, SOX, SP1, TATA, TBP
BRN2, CREL, HLF, IRF, NFAT, NGFIC, NMYC, TST1 CDC5, OCT
CART1, NKX, EVI1, HNF
Empirical TFBS with conserved consensus motifs are hypo-methylated
List of 17 transcription factor families from ENCODE (UCSC genome web browser) and other published sources, and their detailed methylation profiles in hearts and sperm.
Next we compared predicted TFBS (HMR Conserved TFBS) to empirical TFBS (ENCODE ChIP-seq) for the same 17 TF. This revealed that only 40,876 locations were both predicted TFBS and empirical TFBS (empirical TFBS containing the expected conserved consensus motif, which we called "set 4", Figure 1); i.e. 3.4% (40,876 out of 1,187,431) of the empirical TFBS were predicted by motif and conservation, and 5.3% (40,876 out of 771,221) of the predicted TFBS were biologically proven TFBS as determined by ChIP-seq. In contrast to the aggregate hyper-methylation profile at all predicted TFBS ("set 1", Figures 2A and 2B) and at the subset of 17 TFBS ("set 3", Figures 3A and 3B), predicted TFBS that were biologically proven TFBS ("set 4") showed an aggregate profile of hypo-methylation in both hearts and sperm (Figures 3C and 3D). Table 2 shows the detailed methylation profile for each TF in "set 4". All TF were associated with either a hypo-methylation or neutral profile in hearts, whereas 3 out of 17 TF showed a hyper-methylation profile in sperm. The latter detailed analysis may reflect specific differences in empirical TF binding between a post-mitotic organ and germ cell.
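The overlap figures above can be reproduced with a short sketch. The Python below is illustrative only: the interval coordinates are invented toy values, the helper names `overlaps` and `intersect` are hypothetical, and the naive pairwise scan stands in for the UCSC Table Browser or BEDTools intersection actually used in this study. Only the three site counts are taken from the text.

```python
def overlaps(a, b):
    """True if half-open intervals (chrom, start, end) share at least one base."""
    return a[0] == b[0] and a[1] < b[2] and b[1] < a[2]


def intersect(predicted, empirical):
    """Predicted sites overlapping any empirical site (naive pairwise scan)."""
    return [p for p in predicted if any(overlaps(p, e) for e in empirical)]


# Toy coordinates, invented for illustration only.
predicted = [("chr1", 100, 112), ("chr1", 500, 510), ("chr2", 40, 52)]
empirical = [("chr1", 90, 300), ("chr2", 1000, 1200)]
set4 = intersect(predicted, empirical)

# Site counts reported in the text.
N_OVERLAP = 40_876
N_EMPIRICAL = 1_187_431
N_PREDICTED = 771_221

pct_of_empirical = 100 * N_OVERLAP / N_EMPIRICAL
pct_of_predicted = 100 * N_OVERLAP / N_PREDICTED

print(set4)
print(f"{pct_of_empirical:.1f}% of empirical TFBS carry a conserved motif")
print(f"{pct_of_predicted:.1f}% of predicted TFBS are empirically bound")
```

For genome-scale inputs a sorted sweep or an interval tree would replace the pairwise scan; the quadratic version is kept here only because it makes the overlap rule explicit.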
Empirical TFBS with conserved consensus motif are more likely to reside in CpG islands than predicted TFBS
CpG islands (CGI) are CG-rich genomic regions often located at the 5' promoter region of genes. Since CpG islands are largely hypo-methylated [1, 2] and the interaction between transcription factor complexes and DNA may be regulated by CGI/promoter methylation, we asked what proportion of our sets of genomic locations corresponded to CGI. Only 7% of the subset of 17 predicted TFBS (i.e. 7% of "set 3") resided in CGI, whereas 40% of locations of empirical TFBS containing the expected conserved consensus motif (i.e. 40% of "set 4") were in CGI.
Interaction between DNA and proteins or protein complexes can be modulated by DNA methylation. Indeed there are examples of DNA methylation-dependent binding for transcription factors such as CTCF and NGFI-A [10, 24]. Promiscuous or disorderly transcription factor binding may therefore be controlled by DNA methylation at potential binding sites throughout the genome where there are conserved consensus motifs. Here our global analysis largely supports this hypothesis for TF in general. Although the possibility remains that we are only sampling methylation profiles at these sites of conserved consensus motifs at a single time point, and previous or subsequent TF binding may occur as a result of dynamic changes in DNA methylation, this is the first genome-wide study to associate conserved consensus motifs (predicted TFBS) with DNA hyper-methylation. We found similar aggregate methylation profiles for the various sets of TFBS in parallel analyses using methylation maps from both hearts and sperm.
Analysis of all conserved consensus motifs throughout the genome and the subset of those that reside outside CGI showed an aggregate profile of hyper-methylation, but detailed analysis of individual TF suggests that there may be subsets of TF that behave differently. These may indeed represent specific TF whose combinatorial function is to establish or maintain the un-methylated DNA state [15, 16]. Indeed our finding of SP1 and NRF1 in this latter group corresponds to previous reports [16, 17, 25, 26] proposing this function for these two TF.
We have further found that only a very small number of predicted TFBS containing a conserved consensus motif are biologically proven TFBS (empirical TFBS). Conversely, only a very small subset of empirical TFBS has the expected conserved consensus motif. Most importantly, we found that while conserved consensus motifs without biologically proven TF binding have a hyper-methylated profile, sites of biologically proven TFBS have the opposite hypo-methylation profile. Although the scales of methylation scores (% BATMAN) in our analysis are generally narrow (e.g. Figure 2, from trough to peak: 53.2 - 53.6%), these scores represent composite/aggregate scores at over 3 M locations in the genome, and confidence intervals as indicated on the graphs do not show overlap from peaks to troughs, reflecting the significance of altered methylation patterns in these regions. Moreover, these methylation scores are not representative of "whole-genome" methylation but only of the local regions that are being analyzed in each graph (e.g. 3,749,417 regions in Figure 2 but a different set of 40,876 regions in Figure 3C); peak-to-trough scores therefore differ between analyses.
Interestingly, we also found that only a very small proportion of the sites of conserved consensus motifs without biologically proven TF binding were within CGIs (7%), whereas a larger proportion of sites of biologically proven TF binding were within CGIs (40%). The lack of methylation modulation in sites of biologically proven TF binding outside of CGIs (Figure 5C) serves as a negative control for the other profiles of methylation differences that we have detected, but may also indicate that at these sites of empirical TF binding, a neutral methylation profile allows potential TF binding. Alternatively, potential TF binding at these sites may not be regulated by DNA methylation. Most importantly, and in contrast, predicted consensus TFBS that lie outside CGIs maintain a significant hyper-methylation pattern (Figures 4C and 4D).
Our data provide genome-wide evidence that the majority of conserved consensus motifs in the human genome are hyper-methylated, whereas biologically proven TFBS with conserved consensus motifs are hypo-methylated. This implicates a role for DNA methylation in preventing promiscuous or disorderly TF binding, at least for the majority of TF.
Human myocardium was collected by a protocol approved by Cambridgeshire Research Ethics Committee (UK) (REC reference: 06/Q0104/64).
Human left ventricular myocardium
Left ventricular (LV) tissue was obtained from non-donor suitable healthy male individuals involved in road traffic accidents. At the time of donor harvest, whole hearts were removed and transported in cold cardioplegic solution (cardioplegia formula and Hartmann's solution) similar to the procedure described before at Imperial College, London. Following analysis by a cardiovascular pathologist, left ventricular segments were cut and immediately snap frozen.
Genomic DNA isolation
Genomic DNA was isolated from LV samples using the Genomic DNA Buffer Set and Anionic columns (Qiagen, Crawley, UK). Samples (200 mg) were homogenized in G2 Lysis Buffer containing 80 μg/ml RNaseA, using a hand-held homogenizer (Polytron, Switzerland), and thereafter digested with 1 mg/ml Proteinase K (Roche Diagnostics, Burgess Hill, UK) overnight. Fully digested samples were centrifuged at 5000 × g for 10 min and gDNA was isolated from the supernatant using Genomic tip-500/G anionic columns (Qiagen) according to manufacturer's instructions. Integrity and purity of genomic DNA (gDNA) from each tissue was verified by Nanodrop (Thermo Scientific, Wilmington, DE) and the QIAxcel system (Qiagen).
Methylated-cytosine DNA Immunoprecipitation - high throughput sequencing (MeDIP-seq)
Genomic DNA was sheared × 3 for 10 min each time using a Bioruptor probe (Diagenode, Belgium) on ice at High setting (30 sec On, 30 sec Off), and passed through Qiagen QIAprep Spin columns. The extent of shearing was confirmed by running 300 ng of each sample on 1.5% agarose gel. All samples were sheared to the same extent, ranging from 100 - 500 bp with the majority of fragments at 200 bp.
Using the Illumina DNA Sample Prep Kit (FC-102-1001-1, Essex, UK), 5 μg of each sheared gDNA sample was end-repaired, adenosine-bases were added to blunt ends and respective adaptors were ligated to DNA fragments, according to manufacturer's instructions. After each step, samples were cleaned using QIAquick Spin columns (Qiagen). Subsequently, samples were heated at 95°C for 10 min and immediately cooled on ice for 10 min. 2.2 μg of single-stranded gDNA was used for MeDIP and the rest stored at -20°C as the input.
MeDIP was performed as previously described. Briefly, this was done using 7.5 μg of 5-methyl-cytosine antibody (MAb-5MECYT-500, Diagenode) in 500 μl IP buffer (10 mM sodium phosphate, pH 7.0, 140 mM NaCl, 0.05% Triton X-100) and incubated for 2.5 h at 4°C whilst rotating. 40 μl of 50% Protein-A agarose slurry (sc-2001, Santa Cruz, Germany) in IP buffer was added and incubated for a further 2.5 h, whilst rotating at 4°C. Protein-A agarose beads were subsequently spun down and washed × 3, 10 min each time with IP buffer before eluting with 250 μl of digestion buffer, rotating at 55°C for another 2.5 h. Enriched methylated gDNA was purified using × 2 phenol:chloroform isolation, chloroform wash and precipitation using NaCl. Following washes with 70% ethanol, samples were quantified and a non-saturating amplification was performed using Illumina Primers 1.1 and 1.2 and 14-cycle PCR as recommended by Illumina. Next, samples were cleaned using QIAquick Spin columns and quantified on Bioanalyzer. 20 ng of each sample was used to confirm enrichment of a methylated locus (OXT) and a concomitant depletion of an un-methylated locus (UBE2B) versus the input by qPCR, as previously described. MeDIP samples were loaded onto a 2% agarose gel and the 150 - 250 bp bands were cut, and DNA was eluted using the Qiagen Gel extraction kit and further quantified using Bioanalyzer. Since we used "Illumina Library Single end Primers 1" (92 bp long), we expected our "short libraries" to contain inserts ranging between 50 and 150 bp long. High throughput sequencing was performed (GeneService, Cambridge, UK) for each of the libraries on 2 channels of the Illumina GAII machine to a sequencing depth of at least 14 million reads of 35 bp length for each library.
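The expected insert size quoted above follows from subtracting the adapter length from the excised gel band. A minimal sketch of that arithmetic, assuming the full 92 bp is added to every library fragment (the function name `insert_range` is hypothetical):

```python
ADAPTER_BP = 92        # primer/adapter length quoted in the text
BAND_BP = (150, 250)   # gel band excised, in bp


def insert_range(band, adapter_bp):
    """Subtract the adapter length from both ends of the band size range."""
    low, high = band
    return (low - adapter_bp, high - adapter_bp)


expected_inserts = insert_range(BAND_BP, ADAPTER_BP)
print(expected_inserts)  # (58, 158)
```

The result, 58 to 158 bp, is consistent with the approximate 50 - 150 bp range stated in the text.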
Data sets, genomic features and data analysis
MeDIP-seq data of human hearts were analyzed using a Bayesian deconvolution strategy, BATMAN [22]. BATMAN scores from four normal human hearts were averaged using a Perl script (written by MKC and HGG). MeDIP-seq data of human sperm cells analyzed using the same algorithm came from a published resource. MeDIP-seq data for normal human hearts will be deposited in GEO (Accession number). Average plots of methylation densities were calculated using an algorithm previously described. Transcription factor binding motifs conserved in human/mouse/rat and not containing repetitive elements were from UCSC Genome Browser (http://genome.ucsc.edu/; TFBS Conserved track). ChIP-seq co-ordinates for 17 transcription factors were obtained from ENCODE projects deposited in UCSC Genome Browser and other published work (see references). Intersections between datasets were computed using the Table Browser in UCSC Genome Browser or BEDTools (http://sourceforge.net/projects/bedtools/).
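The averaging and aggregate-profile steps can be illustrated as follows. This is a sketch under stated assumptions, not the Perl script or the published plotting algorithm used in the study: it assumes scores are stored per fixed-width window keyed by `(chrom, window_start)`, uses a normal-approximation 95% confidence interval, and all function names and toy values are hypothetical.

```python
from math import sqrt
from statistics import mean, stdev


def average_samples(samples):
    """Average per-window scores across samples, keeping windows present in all."""
    shared = set(samples[0]).intersection(*samples[1:])
    return {w: mean(s[w] for s in samples) for w in shared}


def aggregate_profile(scores, centers, flank, step):
    """Mean score and 95% CI half-width at each offset from the site centers.

    scores:  dict mapping (chrom, window_start) -> score, windows `step` bp wide
    centers: list of (chrom, position) site midpoints
    """
    profile = {}
    for offset in range(-flank, flank + 1, step):
        values = []
        for chrom, pos in centers:
            window = (chrom, (pos + offset) // step * step)
            if window in scores:
                values.append(scores[window])
        if len(values) >= 2:  # stdev needs at least two observations
            half_width = 1.96 * stdev(values) / sqrt(len(values))
            profile[offset] = (mean(values), half_width)
    return profile


# Toy data, invented for illustration only.
averaged = average_samples([
    {"a": 1.0, "b": 3.0},
    {"a": 3.0, "b": 5.0, "c": 9.0},
])

toy_scores = {("chr1", start): score for start, score in
              [(0, 40.0), (100, 60.0), (200, 40.0),
               (300, 40.0), (400, 60.0), (500, 40.0)]}
toy_centers = [("chr1", 150), ("chr1", 450)]
profile = aggregate_profile(toy_scores, toy_centers, flank=100, step=100)
print(averaged)
print(profile)
```

Because each offset pools scores from every site, the confidence interval narrows as the number of sites grows, which is why small peak-to-trough differences can still be resolved when millions of locations are aggregated.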
CpG island annotation
This was obtained from the UCSC Genome Browser (annotated according to ). CpG islands were predicted by searching the sequence one base at a time, scoring each dinucleotide (+17 for CG and -1 for others) and identifying maximally scoring segments. Each segment was then evaluated for the following criteria: GC content of 50% or greater, length greater than 200 bp, ratio greater than 0.6 of observed number of CG dinucleotides to the expected number on the basis of the number of Gs and Cs in the segment.
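The three evaluation criteria can be expressed as a small check. The sketch below is illustrative and is not the UCSC implementation: the function name is hypothetical, the segment-finding step (dinucleotide scoring and maximal-segment search) is omitted, and the expected CG count is assumed to be (number of C × number of G) / segment length, which the text does not spell out.

```python
def passes_cpg_island_criteria(seq):
    """Apply the GC-content, length, and observed/expected CG criteria to a segment."""
    seq = seq.upper()
    length = len(seq)
    if length <= 200:                     # length must be greater than 200 bp
        return False
    n_c = seq.count("C")
    n_g = seq.count("G")
    if (n_c + n_g) / length < 0.5:        # GC content of 50% or greater
        return False
    observed = seq.count("CG")
    expected = n_c * n_g / length         # assumed formula for the expected CG count
    return expected > 0 and observed / expected > 0.6


print(passes_cpg_island_criteria("CG" * 150))             # CG-dense segment
print(passes_cpg_island_criteria("C" * 150 + "G" * 150))  # GC-rich but CG-poor
print(passes_cpg_island_criteria("AT" * 150))             # no G or C at all
```

The second example shows why the observed/expected ratio is needed in addition to GC content: a segment can be entirely G and C yet contain almost no CG dinucleotides.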
- Lister R, Pelizzola M, Dowen RH, Hawkins RD, Hon G, Tonti-Filippini J, Nery JR, Lee L, Ye Z, Ngo QM, Edsall L, Antosiewicz-Bourget J, Stewart R, Ruotti V, Millar AH, Thomson JA, Ren B, Ecker JR: Human DNA methylomes at base resolution show widespread epigenomic differences. Nature. 2009, 462: 315-322. 10.1038/nature08514.
- Laurent L, Wong E, Li G, Huynh T, Tsirigos A, Ong CT, Low HM, Kin Sung KW, Rigoutsos I, Loring J, Wei CL: Dynamic changes in the human methylome during differentiation. Genome Res. 2010, 20: 320-331. 10.1101/gr.101907.109.
- Ball MP, Li JB, Gao Y, Lee JH, LeProust EM, Park IH, Xie B, Daley GQ, Church GM: Targeted and genome-scale strategies reveal gene-body methylation signatures in human cells. Nat Biotechnol. 2009, 27: 361-368. 10.1038/nbt.1533.
- Jaenisch R, Bird A: Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nat Genet. 2003, 33 (Suppl): 245-254. 10.1038/ng1089.
- Fraga MF, Ballestar E, Paz MF, Ropero S, Setien F, Ballestar ML, Heine-Suner D, Cigudosa JC, Urioste M, Benitez J, Boix-Chornet M, Sanchez-Aguilera A, Ling C, Carlsson E, Poulsen P, Vaag A, Stephan Z, Spector TD, Wu YZ, Plass C, Esteller M: Epigenetic differences arise during the lifetime of monozygotic twins. Proc Natl Acad Sci USA. 2005, 102: 10604-10609. 10.1073/pnas.0500398102.
- Christensen BC, Houseman EA, Marsit CJ, Zheng S, Wrensch MR, Wiemels JL, Nelson HH, Karagas MR, Padbury JF, Bueno R, Sugarbaker DJ, Yeh RF, Wiencke JK, Kelsey KT: Aging and environmental exposures alter tissue-specific DNA methylation dependent upon CpG island context. PLoS Genet. 2009, 5: e1000602. 10.1371/journal.pgen.1000602.
- Rakyan VK, Down TA, Maslau S, Andrew T, Yang TP, Beyan H, Whittaker P, McCann OT, Finer S, Valdes AM, Leslie RD, Deloukas P, Spector TD: Human aging-associated DNA hypermethylation occurs preferentially at bivalent chromatin domains. Genome Res. 2010, 20: 434-439. 10.1101/gr.103101.109.
- Esteller M: Epigenetics in cancer. N Engl J Med. 2008, 358: 1148-1159. 10.1056/NEJMra072067.
- Jirtle RL, Skinner MK: Environmental epigenomics and disease susceptibility. Nat Rev Genet. 2007, 8: 253-262. 10.1038/nrg2045.
- Weaver IC, Cervoni N, Champagne FA, D'Alessio AC, Sharma S, Seckl JR, Dymov S, Szyf M, Meaney MJ: Epigenetic programming by maternal behavior. Nat Neurosci. 2004, 7: 847-854. 10.1038/nn1276.
- Kucharski R, Maleszka J, Foret S, Maleszka R: Nutritional control of reproductive status in honeybees via DNA methylation. Science. 2008, 319: 1827-1830. 10.1126/science.1153069.
- Miller CA, Sweatt JD: Covalent modification of DNA regulates memory formation. Neuron. 2007, 53: 857-869. 10.1016/j.neuron.2007.02.022.
- Metivier R, Gallais R, Tiffoche C, Le Peron C, Jurkowska RZ, Carmouche RP, Ibberson D, Barath P, Demay F, Reid G, Benes V, Jeltsch A, Gannon F, Salbert G: Cyclical DNA methylation of a transcriptionally active promoter. Nature. 2008, 452: 45-50. 10.1038/nature06544.
- Tachibana M, Matsumura Y, Fukuda M, Kimura H, Shinkai Y: G9a/GLP complexes independently mediate H3K9 and DNA methylation to silence transcription. Embo J. 2008, 27: 2681-2690. 10.1038/emboj.2008.192.
- Turker MS: Gene silencing in mammalian cells and the spread of DNA methylation. Oncogene. 2002, 21: 5388-5393. 10.1038/sj.onc.1205599.
- Han L, Lin IG, Hsieh CL: Protein binding protects sites on stable episomes and in the chromosome from de novo methylation. Mol Cell Biol. 2001, 21: 3416-3424. 10.1128/MCB.21.10.3416-3424.2001.
- Gebhard C, Benner C, Ehrich M, Schwarzfischer L, Schilling E, Klug M, Dietmaier W, Thiede C, Holler E, Andreesen R, Rehli M: General transcription factor binding at CpG islands in normal cells correlates with resistance to de novo DNA methylation in cancer cells. Cancer Res. 2010, 70: 1398-1407. 10.1158/0008-5472.CAN-09-3406.
- Jones PA, Takai D: The role of DNA methylation in mammalian epigenetics. Science. 2001, 293: 1068-1070. 10.1126/science.1063852.
- Adams RL: DNA methylation. The effect of minor bases on DNA-protein interactions. Biochem J. 1990, 265: 309-320.
- Bell AC, Felsenfeld G: Methylation of a CTCF-dependent boundary controls imprinted expression of the Igf2 gene. Nature. 2000, 405: 482-485. 10.1038/35013100.
- Movassagh M, Choy MK, Goddard M, Bennett MR, Down TA, Foo RS: Differential DNA methylation correlates with differential expression of angiogenic factors in human heart failure. PLoS One. 2010, 5: e8564. 10.1371/journal.pone.0008564.
- Down TA, Rakyan VK, Turner DJ, Flicek P, Li H, Kulesha E, Graf S, Johnson N, Herrero J, Tomazou EM, Thorne NP, Backdahl L, Herberth M, Howe KL, Jackson DK, Miretti MM, Marioni JC, Birney E, Hubbard TJ, Durbin R, Tavare S, Beck S: A Bayesian deconvolution strategy for immunoprecipitation-based DNA methylome analysis. Nat Biotechnol. 2008, 26: 779-785. 10.1038/nbt1414.
- Farnham PJ: Insights from genomic profiling of transcription factors. Nat Rev Genet. 2009, 10: 605-616. 10.1038/nrg2636.
- McGowan PO, Sasaki A, D'Alessio AC, Dymov S, Labonté B, Szyf M, Turecki G, Meaney MJ: Epigenetic regulation of the glucocorticoid receptor in human brain associates with childhood abuse. Nat Neurosci. 2009, 12: 342-348. 10.1038/nn.2270.
- Straussman R, Nejman D, Roberts D, Steinfeld I, Blum B, Benvenisty N, Simon I, Yakhini Z, Cedar H: Developmental programming of CpG island methylation profiles in the human genome. Nat Struct Mol Biol. 2009, 16: 564-571. 10.1038/nsmb.1594.
- Macleod D, Charlton J, Mullins J, Bird AP: Sp1 sites in the mouse aprt gene promoter are required to prevent methylation of the CpG island. Genes Dev. 1994, 8: 2282-2292. 10.1101/gad.8.19.2282.
- Adamson DL, Money-Kyrle AR, Harding SE: Functional evidence for a cyclic-AMP related mechanism of action of the beta(2)-adrenoceptor in human ventricular myocytes. J Mol Cell Cardiol. 2000, 32: 1353-1360. 10.1006/jmcc.2000.1171.
- Kolasinska-Zwierz P, Down T, Latorre I, Liu T, Liu XS, Ahringer J: Differential chromatin marking of introns and expressed exons by H3K36me3. Nat Genet. 2009, 41: 376-381. 10.1038/ng.322.
- Quinlan AR, Hall IM: BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010, 26: 841-842. 10.1093/bioinformatics/btq033.
- Gardiner-Garden M, Frommer M: CpG islands in vertebrate genomes. J Mol Biol. 1987, 196: 261-282. 10.1016/0022-2836(87)90689-9.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.