Skip to main content

Genome-wide analysis of DNA methylation patterns in horse



DNA methylation is an epigenetic regulatory mechanism that plays an essential role in mediating biological processes and determining phenotypic plasticity in organisms. Although the horse reference genome and whole transcriptome data are publically available the global DNA methylation data are yet to be known.


We report the first genome-wide DNA methylation characteristics data from skeletal muscle, heart, lung, and cerebrum tissues of thoroughbred (TH) and Jeju (JH) horses, an indigenous Korea breed, respectively by methyl-DNA immunoprecipitation sequencing. The analysis of the DNA methylation patterns indicated that the average methylation density was the lowest in the promoter region, while the density in the coding DNA sequence region was the highest. Among repeat elements, a relatively high density of methylation was observed in long interspersed nuclear elements compared to short interspersed nuclear elements or long terminal repeat elements. We also successfully identified differential methylated regions through a comparative analysis of corresponding tissues from TH and JH, indicating that the gene body regions showed a high methylation density.


We provide report the first DNA methylation landscape and differentially methylated genomic regions (DMRs) of thoroughbred and Jeju horses, providing comprehensive DMRs maps of the DNA methylome. These data are invaluable resource to better understanding of epigenetics in the horse providing information for the further biological function analyses.


DNA methylation is a stably inherited epigenetic modification in eukaryotes. The regulation and characteristics of the DNA methylation still remain enigmatic, although the importance of it has been demonstrated in many biological processes such as gene expression regulation, genomic imprinting, X chromosome inactivation, maintenance of genomic stability by transposon silencing. It has also been implicated in the development of diseases such as cancer [17]. DNA methylation is also essential for the proper differentiation and development of mammalian tissues [8, 9]. For instance, the knockout of genes encoding the DNA-methyltransferase (DNMT) enzymes, which are responsible for de novo methylation of DNA, results in embryonic lethality in mice [10, 11]. In mammals, methycytosine is observed mostly at CpG dinucleotides, except for the CpGs in CpG islands [12]. DNA methylation is unevenly distributed in genomes: the intergenic regions, and repetitive elements are usually hypermethylated, while the 5′ and 3′ flanking regions of genes are relatively hypomethylated compared with the intragenic regions [1315]. Recently, whole genome methylation has been extensively examined in mammalian species [16, 17] due to advanced sequencing technologies.

Previous studies have revealed the patterns of global DNA methylation in a single or few tissues across species [1823], or in multiple tissues or developmental stages in a single organism [8, 18, 2428]. The DNA methylation pattern is generally conserved, and through comparative analyses of DNA methylation across mammalian species, it has been suggested to play a role in tissue-specific gene regulation [20]. When tissue-specific differentially methylated regions (T-DMRs) in human and mouse tissues including heart, colon, kidney, testis, spleen, and muscle were compared, they could be distinguished clearly according to the corresponding tissues based on their methylation status [27]. It is probable that there are a large number of potentially important functional differences in methylation levels across species. In primates, relative tissue methylation levels generally differ among species [20]. However, there is insufficient evidence indicating that methylation differences exist at subspecies or breeds level.

Thoroughbred horse (TH) is a horse breed that has been manipulated by humans for improved speed, agility, and endurance in England. THs have been selected for racing ability. Thus the genetic traits related to athletic performance against TH have been extensively studied, including genotyping and transcriptome analysis [2935]. Jeju horse (JH; a natural monument No. 347) is an indigenous Korean horse, is physically a small and rugged pony [36]. They have been raised for meet source, farm labor, riding, and racing in Jeju Island, South Korea. Detailed genetic characterization of JH is thought to be crucial for the conservation of and for effective breeding strategies of this indigenous animal. Thus, many studies have been performed to analyses phylogenetic relationships, and discovering genetic marker [3739]. However, until now, there have been no studies associate the traits of Jeju horse with epigenetic patterns. With the advent of next-generation sequencing (NGS) and genome-wide association studies, some studies were performed using NGS and microarray technology in thoroughbred horses [35, 40, 41]. These studies concentrated only on gene expression and genetic markers of athletic ability during and after exercise. Methylation analyses in animals exhibiting racing traits have not yet been reported. Many previous studies suggested that exercise induces methylation changes [42, 43], and athletic ability is closely associated with methylation [44, 45]. The regulation of methylation profiling related to exercise genes is important for exercising horses. Therefore, identifying methylation profiles related to exercise ability will be invaluable in studying athletic traits in racing horses. Nonetheless, there are no studies about the influence of methylation on the racing ability of TH let alone JH while the traits governing the economics of horse racing, such as the racing ability, speed, disease resistance, and recovery ability, are of important resource in the horse industry.

Here, we report the data and analyses of genome-wide DNA methylation patterns in the skeletal muscle, heart, lung, and cerebrum of TH and JH, and tissue-specific DNA methylation differences between the two horse breeds produced by methyl-DNA immunoprecipitation sequencing (MeDIP-seq).


Global methylation analysis of thoroughbred and Jeju horses

We profiled the global DNA methylation status of physicality-associated organs (skeletal muscle, heart, lung, and cerebrum) of TH and JH using MeDIP sequencing. About 21 - 24 million raw reads from each samples were sequenced resulting in on average 820 K/mm2 of cluster density, producing about 1.05 - 1.2 Gbp. After low-quality data filtration, about 81.8% - 87.5% reads, assessed as clean data, were analyzed and mapped (Additional file 1: Table S1). On average, 17.5 and 16.0 million unique mapped reads were obtained from the four tissues of TH and JH, respectively, with a high-quality read lignment against the horse reference genome (Additional file 1: Table S1).

In the identification the global DNA methylation pattern, the number of methylated peaks in MeDIP-seq is important [46]. We obtained 61,000–112,000 methylated peaks in the TH and JH tissues (skeletal muscle, heart, lung and cerebrum), using the peak detection methodology which covers approximately 2.51-4.35% of the horse genome (2.7 Gbp) (Additional file 1: Table S1 and Table 1). These methylation peaks were observed with a moderate correlation of chromosomal length and gene number between methylation regions (Additional file 1: Figure S1). The degree of methylation was high in the intergenic regions containing repeats, followed by the intron and coding sequence (CDS) regions in both TH and JH (Table 1). However, the methylation density in the CDS region was higher than that in the intergenic region, whereas the methylation density in the other intragenic region such as 3’-UTR, intron, upstream 2 kb at transcription start site (TSS), and 5’-UTR was lower than that of the intergenic region (Figure 1A and 1B). Repeat elements showed a relatively high methylation density. In comparison with most of the repetitive elements, long interactive nuclear elements (LINE), short interactive nuclear elements (SINE), and long terminal repeat (LTR) elements exhibited a high level of methylation density in both TH and JH (Figure 1C, D). In this study, we demonstrated that the depletion or decrease of methylation density was found around TSS as well as promoter regions in both TH and JH, whereas the gradual increase of that was found in gene body (Figure 1E).

Table 1 Peak distribution in different components of the thoroughbred horse and the Jeju horse
Figure 1

The average methylation density in different genomic regions. Methylation density within the gene regions, intergenic regions, and repeats were calculated by dividing the peak length in that region by the area of that region for thoroughbred (A) and Jeju (B) horse-derived DNA. Further repeats were classified in different classes and the average methylation level of each class was calculated in thoroughbred (C) and Jeju (D) horses. (E) Distribution of methylation density around gene body, including upstream to downstream 2 kb, was calculated for all RefSeq genes.

The methylation of CpG islands in the promoter regions is known to regulate gene expression and it was reported to be hypomethylated in the vertebrate genome [47]. The horse genome contained a total of 109,505 CpG islands. Of these CpG islands, about 12.3% (n = 13,467) were methylated in the skeletal muscle of TH, 7.65% (n = 8,377) in the heart of TH, 12.84% (n = 14,056) in the lung of TH, and 10.12% (n = 11,082) in the cerebrum of TH (Table 2). In addition, about 11.27% (n = 12,345) were methylated in the skeletal muscle of JH, 10.26% (n = 11,232) in the heart of JH, 12.73% (n = 13,939) in the lung of JH, and 7.82% (n = 8,560) in the cerebrum of JH. Therefore, we observed the most abundant CpG island methylation in the lung tissue in both TH and JH. Most of the methylated CpG islands were located in the intergenic regions in both the TH and JH. In the case of the gene body region, methylated CpG islands were present largely in the intron regions, followed by the CDS regions.

Table 2 Summary of methylated CGIs in the different tissues of the thoroughbred and Jeju horses

Differential DNA methylation in thoroughbred and Jeju horses

We observed a total of 35,467 differentially methylated regions (DMRs) in the four different TH and JH tissues, indicating differences in their methylation profiles (Additional file 1: Table S2). The TH’s skeletal muscle was hypermethylated compared to that of JH, whereas the heart, lung, and cerebrum of TH showed a hypomethylated pattern compared to those of JH (Figure 2A). We also analyzed methylation events in the intergenic, gene body, and promoter regions in the four tissues of TH and JH. As shown in Figure 2B, the gene body region in the skeletal muscle of TH showed a relatively high level of methylation, whereas the gene body in the heart of TH showed a high hypomethylation pattern, compared to other tissues. We also examined DMRs within the repeat region, and found that SINE and LINE elements showed a high level of methylation in skeletal muscle compared to that of JH. The satellite regions indicated a high hypermethylation density in lung tissue compared to that of JH (Figure 2C). Here, based on our DMR data, we provide the DMRs associated with comprehensive maps of the DNA methylome of TH and JH (Figure 3).

Figure 2

Genomic distribution of differentially methylated regions (DMRs) in the thoroughbred horse compared to the Jeju horse. (A) The number of hyper- and hypomethylated DMRs in 4 different tissues of thoroughbred horses. (B) Distribution of hyper- and hypomethylation density in different genomic regions such as intergenic, gene body, and promoter regions. (C) Hyper- and hypomethylation density in repeat regions, classified according to the family.

Figure 3

Comprehensive maps of the entire DNA methylome of thoroughbred and Jeju horses. Circular representation of the hyper- and hypomethylation levels for four different tissues of thoroughbred horse.

MeDIP-seq data validation

To validate the results obtained with MeDIP-seq data, three regions were selected in the horse genome for analysis by bisulfite sequencing. We randomly chose one region with a relatively high level of methylation, one region with a moderate level of methylation and one region of differential methylation region between TH and JH. The bisulfite sequencing results showed a high degree of consistency with the MeDIP-seq data (Figure 4, Additional file 1: Figure S2, and Additional file 1: Figure S3). These results indicated that our genome-wide methylation results obtained by MeDIP-seq are reliable.

Figure 4

The validation of MeDIP-seq data by bisulfite sequencing (BSP). A high methylated region obtained from MeDIP-seq data was selected randomly and its methylation pattern was profiled by BSP. The box indicated amplification regions. CpG dinucleotides are represented by circles on vertical bars. Each line represents an independent clone, and methylated CpGs are marked by filled circles, unmethylated CpGs by open circles.

Analysis of functional categories of DMR-containing genes

To explore the biological functions associated with DMR-containing genes in the thoroughbred horse, we analyzed the gene ontology (GO) categories of these genes using DAVID ( [48]. All genes analyzed with GO annotations were used as the reference list. We selected some categories associated with exercise ability in the horse [49]. Several categories were related to exercise ability; however, we chose the category sets associated with overexpression and tissue capacity functions (Figure 5A). Comparison of gene methylation showed that there were 12,128 DMRs among TH and JH. DMRs and genes that are unique or shared among the four tissue types examined are shown in Figure 5B. Genes having high numbers of DMRs are dominant in the muscle (4327) and heart (4062). These two tissues have more DMR-containing genes than the cerebrum and lung; in particular, TH’s muscle tissue has the highest number of hypermethylated DMR-containing genes among the four tissues analyzed. The frequency of hypomethylation in the cerebrum, lung, and heart tissues was higher in TH than the JH.

Figure 5

Functional classification and comparison of differentially methylated regions (DMRs). (A) GO analysis of biological function. (B) The Venn diagram for comparison of DMRs that are unique or shared in four tissues derived from thoroughbred and Jeju horses. (C) k-mean clustering (k = 5) analysis of differential methylated genes.

Tissue-specific DMRs were identified by k-mean clustering in the methylation regions in the four tissues. Several genes containing DMRs were clustered, and were divided into 11 clusters (Figure 5C). The k-mean clustering of 1188 genes revealed differential methylation in each tissue (P = 0.0005 ~ 0.00051 for skeletal muscle-related clusters (MC1-MC3), P = 0.0005 for heart-related clusters (HC1-HC3), P = 0.00128 for lung-related clusters (LC1 and LC2), and P = 0.017 ~ 0.015 for cerebrum-related clusters (CC1-CC3)). Clusters of tissue-specific DMRs were located upstream of the TSS, which is 5 kb upstream of genes in the skeletal muscle and cerebrum. However, in heart and lung tissues, each cluster of DMRs was evenly spread over the region upstream and downstream of the TSS site. In heart tissue, tissue-specific DMR clusters were detected in several genes, while in the lung tissue, tissue-specific DMR clusters were detected in 85 genes.


We report the analyses and data generated by methyl-DNA immunoprecipitation sequencing to provide the genome-wide DNA methylation patterns in skeletal muscle, heart, lung, and cerebrum tissues of TH and JH. In the horse genome, gene body regions showed a higher methylation density than the intergenic regions. Also the repetitive elements had a high methylation density while CpG islands showed a low methylation density. These patterns revealed in this study were similar to those previously reported in other species, from plants to mammalians [13, 17, 50].

The promoter and 5'-UTR regions play an important role in the regulation of gene expression and they have been reported to be hypomethylated [51]. In the case of the gene body region, except for the 5'-UTR, DNA methylation contributed to chromatin structure alteration and regulation of the transcription elongation efficiency [52]. We report an increased level of methylation in the CDS, intron, and 3'-UTR regions in TH and JH, these results are similar to those from previously reported animal studies [22, 28]. Repeat elements occupied about 30–50% of the mammalian genome; among these, LINE elements were predominantly interspersed. In the horse genome, LINE elements were also the most predominantly interspersed repeat elements [53]. Repeat elements are usually associated with genomic instability through structural changes such as transposition, translocation, and recombination [54, 55]. To maintain genomic stability, DNA methylation functions as a silencing mechanism for repeat elements [56]. Thus, a major proportion of genomic methylation has been reported to occur in repeat elements, which is supported by our data. We found that DNA methylation was predominantly seen in LINE elements, consistent with findings from previous animal studies [47]. Additionally, SINE and LTR elements were hypermethylated in the horse genome, similar to the results in other animal studies [47, 57]. Methylation of these elements is known to be a crucial factor in the maintenance of genomic stability through the suppression of transcription, transposition, and recombination [17]. Thus, hypermethylation of repeat elements in the horse genome might play an essential role, as a defense mechanism to maintain genomic stability in the presence of active repeat elements. CpG islands have been universally reported to be regions of gene regulation via methylcytosine, possibly through the mechanism of transcriptional repression. These regions in the mammalian genome are known to be generally demethylated, in spite of having a high GC content [4]. Intragenic and intergenic methylated CpG islands affect functional gene expression through the regulation of promoter activity, and intergenic methylated CpG islands play a crucial role in the regulation of alternative promoters and splicing [48]. In this study, we found that about 10.73% and 10.52% of the CpG islands were methylated in TH and JH genomes, respectively, which is similar to that observed in the human genome (about 6–8%) [8]. Further analysis of the density of methylated CpG islands in intragenic regions showed a higher methylation level in exons (11.06 ± 1.78) than in introns (1.28 ± 0.28) in the horse genome. These results were consistent with the findings in humans and rats [8, 17]. Taken together, we provide a comprehensive data and information of the whole methylome in horse, They can enable researchers to perform in depth analyses of the roles played by DNA methylation in horses and probably in other mammals.

DNA methylation is one of the main epigenetic modification mechanisms; thus, the study of DMRs within tissues or individual organisms is important. In several studies, various levels of DNA methylation could regulate tissue-specific transcription and may be important during development and differentiation [58]. Thus, the analysis of DMRs among tissues is essential in understanding tissue specific gene expression. In particular, methylation analysis between breeds in a well-known subspecies can provides invaluable information on the evolutionary divergence and evidence for useful traits. We successfully identified differentially methylated regions within four tissues in two horse breeds. Similar results have been reported in pig tissues from various breeds [59, 60] that can be compared. Differential methylation patterns were observed in seven tissues (muscle, heart, liver, spleen, lung, kidney, and stomach) from Laiwu, a specific pig breed [60]. In addition, the level of methylation in the liver tissue genome of other breeds of pigs (such as Berkshire, Duroc, and Landrace) also differed [59]. Distribution patterns of DNA methylation are generally conserved among these three pig breeds, but some DMRs were detected in the coding genes and repetitive element regions in liver tissue. In this study, we also observed that distribution of DNA methylation in the two breeds showed generally conserved pattern but, some DMRs were detected a high density in the gene body, including the coding regions and introns. Gene body methylation, especially intronic DNA methylation, may be associated with alternative splicing [61]. Thus, these results suggest that methylation has important effects on gene transcription in individual breeds. Furthermore, in the repeat region, the density of DMRs was dominant. Thus, the high density of DMRs in repeat regions could also induce differences in transcript variation and expression. In summary, differences in DNA methylation patterns and the density of DMRs in the four tissues of individual breeds may play a crucial role in the process of development and the corresponding gene expression.

Gene containing DMRs in the tissues of TH showed high representation in the categories of ATP binding and cytoskeletal protein binding. ATP binding functions play a role during exercise, as they affect ATPase activity. ATPase activity-induced ATP lysis subsequently caused intermediate molecular interactions using the energy of ATP lysis [60]. In TH, these functions may play important roles and the dominant expression of these gene categories is required. In particular, during exercise, ATP binding could induce muscle contraction [62]. After ATP binds the myosin head, muscle contractions are initiated due to the detachment of myosin from actin filaments [63]. DMRs in the tissues of TH are also overrepresented in cytoskeletal protein binding. Generally, the cytoskeleton plays important roles in both intracellular transport and cellular division [63]. In eukaryotic cells, the cytoskeleton can be classified into three types: microfilaments, intermediate filaments, and microtubules [64]. Muscle activity-related units such as actin, keratin, and tubulin are included in the cytoskeleton. Thus, genes having DMRs could influence their binding and activities, thus differentially affecting the exercise ability in TH. These functional categories of genes with DMRs suggest that DNA methylation has an important effect on the regulation of genes categorized as being involved in ATP and cytoskeletal binding. Thus, these differences in methylation status in the tissues of TH and JH may indicate differences in their gene expression levels and exercise ability characteristics.


We have generated, for the first time, DNA methylomes for TH and JH. We provide the DNA methylation landscape and differentially methylated genomic regions in these horse species, indicating that DMRs represent comprehensive maps of the DNA methylome in TH and JH. These DNA methylome maps could be useful for further studies of epigenetic gene regulation in various horse breeds. The epigenetic system existing in the horse genome lays the foundation for studying the involvement of epigenetic modifications in horse domestication and improvement and provides a more systemic analysis of DNA methylation.


Ethics statement

The animal protocol used in this study has been reviewed by the Pusan National University-Institutional Animal Care and Use Committee (PNU-IACUC) on their ethical procedures and scientific care, and it has been approved (Approval Number PNU-2013-0411).

Genomic DNA extraction

The healthy thoroughbred (retired racing horse, Korea Racing Authority registered number: 016222; 5 years old; a castrated horse) and Jeju horses (tested Jeju native horse breed registered number: P06071M1; 6 years old; male) were sacrificed in compliance with the international guidelines for experimental animals, and the tissues were separated and stored at -80°C. The use of these samples was approved by the National Institute of Subtropical Agriculture in Jeju Island, South Korea. Genomic DNA was isolated from 4 tissue samples from each healthy horse (skeletal muscle, heart, lung, and cerebrum) using the DNeasy Blood & Tissue Kit (Qiagen, Hilden, Germany) according to the manufacturer’s manual for MeDIP-Seq and bisulfite-treatment experiments. DNA concentration and quality were estimated by UV spectrophotometry on a NanoDrop ND-1000 (NanoDrop, Wilmington, DE, USA). For quality control, we selected only those DNA samples in which the A260/A280 ratio range was 1.6 to 2.2, the A260/A230 ratio was >1.6, and the main band was identified by agarose gel electrophoresis.

Methyl-DNA immunoprecipitation sequencing

Eight genomic DNA samples with 1 μgof genomic DNA starting material (DNA concentration of 0.1 μg/μl) were sonicated to produce DNA fragments ranging from 100 to 500 bp. After DNA end-repair and the generation of 3'-dA overhangs using the Paired-End DNA Sample Prep Kit (Illumina, San Diego, CA, USA), the DNA samples were ligated to Illumina sequencing adaptors. The fragments were denatured and then immunoprecipitated using magnetic methylated DNA immunoprecipitation kit including a 1:10 diluted antibody mix (0.3 ul antibody, 0.6 ul buffer A, and 2.10 ul distilled water) following the manufacturer’s recommendation (Diagenode, Delville, NJ, USA). The immunoprecipitated DNA was quantified by quantitative real-time PCR (qPCR). DNA fragments between 200 and 300 bp were excised from the gel and purified using a gel extraction kit (Qiagen). The products were quantified with a Quant-iTTM dsDNA High Sensitivity Assay Kit (Invitrogen, Carlsbad, CA, USA) on an Agilent 2100 Analyzer (Agilent Technologies, Santa Clara, CA, USA). After qPCR analysis, DNA libraries were subjected to paired-end sequencing with a 50-bp read length using the Illumina HiSeq 2000 platform (Illumina). After the completion of a sequencing run, raw image files were processed by Illumina Real-Time Analysis (RTA) for image analysis and base calling. Sequencing reads have been submitted to the NCBI Short Read Archive (SRA) under an SRA accession no.SRP041333.

Bioinformatics analysis

Raw sequence data were first processed to filter out adapters and low-quality reads with the follow criteria; (1) N’s per read ≥ 10%, (2) average of quality score (QS) per read < 20, (3) number of nucleotides with < QS 20 per read ≥ 5%, and (4) having called the same bases in paired-end reads. The filtered data were then aligned to the horse reference genome (EquCab2) using the SOAPaligner (version 2.21) with mismatches of no more than 2 bp [65]. Uniquely mapped reads were retained for further analyses. To identify genomic regions that are enriched in a pool of specifically immunoprecipitated DNA fragments, genome-wide peak scanning was carried out by MACS (version 1.4.2) with a cutoff of P-value of 1 × 10-4 to exclude false positive peaks or noises [66]. In addition, an option of ‘--mfold’ to select the regions with MFOLD range of high-confidence enrichment ration against background to build model was used with lower limit 10 and upper limit 30. The distribution of peaks in different regions of the horse genome in each sample, including the promoter, 5'-untranslated region (UTR), 3'-UTR, exons, introns, intergenic regions, CpG islands (CGIs), and repeats, was analyzed. Methylated peaks corresponding to different genomic regions were selected by mapping at least 50% of the peak on a particular genomic region. In particular, CGI can be defined by 3 criteria: length greater than 200 bp, ≥50% GC content, ≥0.6 of CpG observed/expected ≥0.6. The methylation densities in the different regions of the genome were also compared.

To identify candidate differentially methylated regions (DMRs) in any 2 samples, their peaks were merged, and the number of reads within those peaks were assessed with chi-square and FDR statistics (P < 0.05). DMRs with a greater than 2-fold difference in read numbers were finally selected and classified as hyper- or hypo-methylated regions. All DMR-containing genes were used for subsequent gene ontology (GO) enrichment analyses using the DAVID Functional Annotation Tool with P < 0.05 [49]. Moreover, co-existing DMRs within genes among different tissues were plotted and centered at a transcription start site (TSS) using seqMINER with the k-mean clustering method [67].

Bisulfite sequencing (BSP)

Three pairs of primers (Additional file 1: Table S3) were designed with MethPrimer tool (, including one pair for the validation of relatively high methylated region, one pair for relatively moderate methylated region, and one pair for differentially methylated regions between TH and JH. Bisulfite modification of 1 μg of genomic DNA was performed using the Imprint® DNA Modification kit by standard methods (SIGMA). The bisulfite-treated DNA was amplified by PCR with BSP specific primer pair. After a hot start, PCRs were carried out for 40 cycles of 94°C for 40 sec, 50-55°C for 40 sec, and 72°C for 40 sec. PCR products were separated on a 1.5% agarose gel, purified with the LaboPass gel extraction kit (COSMO GENETCH) and cloned into the pGEM-T-easy vector (Promega). The cloned DNA was isolated using the Plasmid DNA mini-prep kit (GeneAll). Positive clones were randomly collected for sequencing at COSMO GENETCH company (Seoul, Korea).

Availability of supporting data

All sequencing reads from this study have been submitted to the NCBI Sequence Read Archive; SRA ( under accession no. SRP041333.


  1. 1.

    Sasaki H, Allen ND, Surani MA: DNA methylation and genomic imprinting in mammals. EXS. 1993, 64: 469-486.

    CAS  PubMed  Google Scholar 

  2. 2.

    Courtier B, Heard E, Avner P: Xce haplotypes show modified methylation in a region of the active X chromosome lying 3' to Xist. Proc Natl Acad Sci U S A. 1995, 92 (8): 3531-3535.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  3. 3.

    Siegfried Z, Eden S, Mendelsohn M, Feng X, Tsuberi BZ, Cedar H: DNA methylation represses transcription in vivo. Nat Genet. 1999, 22 (2): 203-206.

    CAS  PubMed  Article  Google Scholar 

  4. 4.

    Bird A: DNA methylation patterns and epigenetic memory. Genes Dev. 2002, 16 (1): 6-21.

    CAS  PubMed  Article  Google Scholar 

  5. 5.

    Robertson KD: DNA methylation and human disease. Nat Rev Genet. 2005, 6 (8): 597-610.

    CAS  PubMed  Article  Google Scholar 

  6. 6.

    Conerly M, Grady WM: Insights into the role of DNA methylation in disease through the use of mouse models. Dis Model Mech. 2010, 3 (5–6): 290-297.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  7. 7.

    Kulis M, Esteller M: DNA methylation and cancer. Adv Genet. 2010, 70: 27-56.

    PubMed  Article  Google Scholar 

  8. 8.

    Illingworth R, Kerr A, Desousa D, Jorgensen H, Ellis P, Stalker J, Jackson D, Clee C, Plumb R, Rogers J, Humphray S, Cox T, Langford C, Bird A: A novel CpG island set identifies tissue-specific methylation at developmental gene loci. PLoS Biol. 2008, 6 (1): e22-

    PubMed Central  PubMed  Article  Google Scholar 

  9. 9.

    Jaenisch R, Bird A: Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nat Genet. 2003, 33 (Suppl): 245-254.

    CAS  PubMed  Article  Google Scholar 

  10. 10.

    Li E, Bestor TH, Jaenisch R: Targeted mutation of the DNA methyltransferase gene results in embryonic lethality. Cell. 1992, 69 (6): 915-926.

    CAS  PubMed  Article  Google Scholar 

  11. 11.

    Okano M, Bell DW, Haber DA, Li E: DNA methyltransferases Dnmt3a and Dnmt3b are essential for de novo methylation and mammalian development. Cell. 1999, 99 (3): 247-257.

    CAS  PubMed  Article  Google Scholar 

  12. 12.

    Weber M, Davies JJ, Wittig D, Oakeley EJ, Haase M, Lam WL, Schubeler D: Chromosome-wide and promoter-specific analyses identify sites of differential DNA methylation in normal and transformed human cells. Nat Genet. 2005, 37 (8): 853-862.

    CAS  PubMed  Article  Google Scholar 

  13. 13.

    Zhang X, Yazaki J, Sundaresan A, Cokus S, Chan SW, Chen H, Henderson IR, Shinn P, Pellegrini M, Jacobsen SE, Ecker JR: Genome-wide high-resolution mapping and functional analysis of DNA methylation in arabidopsis. Cell. 2006, 126 (6): 1189-1201.

    CAS  PubMed  Article  Google Scholar 

  14. 14.

    Zilberman D, Gehring M, Tran RK, Ballinger T, Henikoff S: Genome-wide analysis of Arabidopsis thaliana DNA methylation uncovers an interdependence between methylation and transcription. Nat Genet. 2007, 39 (1): 61-69.

    CAS  PubMed  Article  Google Scholar 

  15. 15.

    Gehring M, Bubb KL, Henikoff S: Extensive demethylation of repetitive elements during seed development underlies gene imprinting. Science. 2009, 324 (5933): 1447-1451.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  16. 16.

    Ruike Y, Imanaka Y, Sato F, Shimizu K, Tsujimoto G: Genome-wide analysis of aberrant methylation in human breast cancer cells using methyl-DNA immunoprecipitation combined with high-throughput sequencing. BMC Genomics. 2010, 11: 137-

    PubMed Central  PubMed  Article  Google Scholar 

  17. 17.

    Sati S, Tanwar VS, Kumar KA, Patowary A, Jain V, Ghosh S, Ahmad S, Singh M, Reddy SU, Chandak GR, Raghunath M, Sivasubbu S, Chakraborty K, Scaria V, Sengupta S: High resolution methylome map of rat indicates role of intragenic DNA methylation in identification of coding region. PLoS One. 2012, 7 (2): e31621-

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  18. 18.

    Irizarry RA, Ladd-Acosta C, Wen B, Wu Z, Montano C, Onyango P, Cui H, Gabo K, Rongione M, Webster M, Ji H, Potash JB, Sabunciyan S, Feinberg AP: The human colon cancer methylome shows similar hypo- and hypermethylation at conserved tissue-specific CpG island shores. Nat Genet. 2009, 41 (2): 178-186.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  19. 19.

    Feng S, Cokus SJ, Zhang X, Chen PY, Bostick M, Goll MG, Hetzel J, Jain J, Strauss SH, Halpern ME, Ukomadu C, Sadler KC, Pradhan S, Pellegrini M, Jacobsen SE: Conservation and divergence of methylation patterning in plants and animals. Proc Natl Acad Sci U S A. 2010, 107 (19): 8689-8694.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  20. 20.

    Enard W, Fassbender A, Model F, Adorjan P, Paabo S, Olek A: Differences in DNA methylation patterns between humans and chimpanzees. Curr Biol. 2004, 14 (4): R148-R149.

    CAS  PubMed  Article  Google Scholar 

  21. 21.

    Gama-Sosa MA, Midgett RM, Slagel VA, Githens S, Kuo KC, Gehrke CW, Ehrlich M: Tissue-specific differences in DNA methylation in various mammals. Biochim Biophys Acta. 1983, 740 (2): 212-219.

    CAS  PubMed  Article  Google Scholar 

  22. 22.

    Zemach A, McDaniel IE, Silva P, Zilberman D: Genome-wide evolutionary analysis of eukaryotic DNA methylation. Science. 2010, 328 (5980): 916-919.

    CAS  PubMed  Article  Google Scholar 

  23. 23.

    Igarashi J, Muroi S, Kawashima H, Wang X, Shinojima Y, Kitamura E, Oinuma T, Nemoto N, Song F, Ghosh S, Held WA, Nagase H: Quantitative analysis of human tissue-specific differences in methylation. Biochem Biophys Res Commun. 2008, 376 (4): 658-664.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  24. 24.

    Weber M, Hellmann I, Stadler MB, Ramos L, Paabo S, Rebhan M, Schubeler D: Distribution, silencing potential and evolutionary impact of promoter DNA methylation in the human genome. Nat Genet. 2007, 39 (4): 457-466.

    CAS  PubMed  Article  Google Scholar 

  25. 25.

    Rakyan VK, Down TA, Thorne NP, Flicek P, Kulesha E, Gräf S, Tomazou EM, Backdahl L, Johnson N, Herberth M, Howe KL, Jackson DK, Miretti MM, Fiegler H, Marioni JC, Birney E, Hubbard TJ, Carter NP, Tavaré S, Beck S: An integrated resource for genome-wide identification and analysis of human tissue-specific differentially methylated regions (tDMRs). Genome Res. 2008, 18 (9): 1518-1529.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  26. 26.

    Eckhardt F, Lewin J, Cortese R, Rakyan VK, Attwood J, Burger M, Burton J, Cox TV, Davies R, Down TA, Haefliger C, Horton R, Howe K, Jackson DK, Kunde J, Koenig C, Liddle J, Niblett D, Otto T, Pettett R, Seemann S, Thompson C, West T, Rogers J, Olek A, Berlin K, Beck S: DNA methylation profiling of human chromosomes 6, 20 and 22. Nat Genet. 2006, 38 (12): 1378-1385.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  27. 27.

    Kitamura E, Igarashi J, Morohashi A, Hida N, Oinuma T, Nemoto N, Song F, Ghosh S, Held WA, Yoshida-Noro C, Nagase H: Analysis of tissue-specific differentially methylated regions (TDMs) in humans. Genomics. 2007, 89 (3): 326-337.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  28. 28.

    Gibbs JR, van der Brug MP, Hernandez DG, Traynor BJ, Nalls MA, Lai SL, Arepalli S, Dillman A, Rafferty IP, Troncoso J, Johnson R, Zielke HR, Ferrucci L, Longo DL, Cookson MR, Singleton AB: Abundant quantitative trait loci exist for DNA methylation and gene expression in human brain. PLoS Genet. 2010, 6 (5): e1000952-

    PubMed Central  PubMed  Article  Google Scholar 

  29. 29.

    Hill EW, Gu J, Eivers SS, Fonseca RG, McGivney BA, Govindarajan P, Orr N, Katz LM, MacHugh DE: A sequence polymorphism in MSTN predicts sprinting ability and racing stamina in thoroughbred horses. PLoS One. 2010, 5 (1): e8645-

    PubMed Central  PubMed  Article  Google Scholar 

  30. 30.

    McGivney BA, Browne JA, Fonseca RG, Katz LM, Machugh DE, Whiston R, Hill EW: MSTN genotypes in Thoroughbred horses influence skeletal muscle gene expression and racetrack performance. Anim Genet. 2012, 43 (6): 810-812.

    CAS  PubMed  Article  Google Scholar 

  31. 31.

    Bower MA, McGivney BA, Campana MG, Gu J, Andersson LS, Barrett E, Davis CR, Mikko S, Stock F, Voronkova V, Bradley DG, Fahey AG, Lindgren G, MacHugh DE, Sulimova G, Hill EW: The genetic origin and history of speed in the Thoroughbred racehorse. Nat Commun. 2012, 3: 643-

    PubMed  Article  Google Scholar 

  32. 32.

    Webbon P: Harnessing the genetic toolbox for the benefit of the racing Thoroughbred. Equine Vet J. 2012, 44 (1): 8-12.

    CAS  PubMed  Article  Google Scholar 

  33. 33.

    Hill EW, Gu J, McGivney BA, MacHugh DE: Targets of selection in the Thoroughbred genome contain exercise-relevant gene SNPs associated with elite racecourse performance. Anim Genet. 2010, 41 (Suppl 2): 56-63.

    PubMed  Article  Google Scholar 

  34. 34.

    Hill EW, McGivney BA, Gu J, Whiston R, Machugh DE: A genome-wide SNP-association study confirms a sequence variant (g.66493737C > T) in the equine myostatin (MSTN) gene as the most powerful predictor of optimum racing distance for Thoroughbred racehorses. BMC Genomics. 2010, 11: 552-

    PubMed Central  PubMed  Article  Google Scholar 

  35. 35.

    Park KD, Park J, Ko J, Kim BC, Kim HS, Ahn K, Do KT, Choi H, Kim HM, Song S, Lee S, Jho S, Kong HS, Yang YM, Jhun BH, Kim C, Kim TH, Hwang S, Bhak J, Lee HK, Cho BW: Whole transcriptome analyses of six thoroughbred horses before and after exercise using RNA-Seq. BMC Genomics. 2012, 13: 473-

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  36. 36.

    Cho BW, Lee KW, Kang HS, Kim SK, Shin TS, Kim YG: Application of polymerase chain reaction with short oligonucletide primers of arbitrary sequence for the genetic analysis of Cheju native horse. J Agr Tech Dev Inst. 2001, 5: 109-114.

    Google Scholar 

  37. 37.

    Cho GJ: Genetic Relationship and Characteristics Using microsatellite. J Life Sci. 2007, 17 (5): 699-705.

    Article  Google Scholar 

  38. 38.

    Kim KI, Yang YH, Lee SS, Park C, Ma R, Bouzat JL, Lewin HA: Phylogenetic relationships of Cheju horses to other horse breeds as determined by mtDNA D-loop sequence polymorphism. Anim Genet. 1999, 30 (2): 102-108.

    CAS  PubMed  Article  Google Scholar 

  39. 39.

    Shin JA, Yang YH, Kim HS, Yun YM, Lee KK: Genetic polymorphism of the serum proteins of horses in Jeju. J Vet Sci. 2002, 3 (4): 255-263.

    PubMed  Google Scholar 

  40. 40.

    Schroder W, Klostermann A, Stock KF, Distl O: A genome-wide association study for quantitative trait loci of show-jumping in Hanoverian warmblood horses. Anim Genet. 2012, 43 (4): 392-400.

    CAS  PubMed  Article  Google Scholar 

  41. 41.

    Corbin LJ, Blott SC, Swinburne JE, Sibbons C, Fox-Clipsham LY, Helwegen M, Parkin TD, Newton JR, Bramlage LR, McIlwraith CW, Bishop SC, Woolliams JA, Vaudin M: A genome-wide association study of osteochondritis dissecans in the Thoroughbred. Mamm Genome. 2012, 23 (3–4): 294-303.

    PubMed  Article  Google Scholar 

  42. 42.

    Barres R, Yan J, Egan B, Treebak JT, Rasmussen M, Fritz T, Caidahl K, Krook A, O'Gorman DJ, Zierath JR: Acute exercise remodels promoter methylation in human skeletal muscle. Cell Metab. 2012, 15 (3): 405-411.

    CAS  PubMed  Article  Google Scholar 

  43. 43.

    Gomez-Pinilla F, Zhuang Y, Feng J, Ying Z, Fan G: Exercise impacts brain-derived neurotrophic factor plasticity by engaging mechanisms of epigenetic regulation. Eur J Neurosci. 2011, 33 (3): 383-390.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  44. 44.

    Brutsaert TD, Parra EJ: What makes a champion? Explaining variation in human athletic performance. Respir Physiol Neurobiol. 2006, 151 (2–3): 109-123.

    PubMed  Article  Google Scholar 

  45. 45.

    Terruzzi I, Senesi P, Montesano A, La Torre A, Alberti G, Benedini S, Caumo A, Fermo I, Luzi L: Genetic polymorphisms of the enzymes involved in DNA methylation and synthesis in elite athletes. Physiol Genomics. 2011, 43 (16): 965-973.

    CAS  PubMed  Article  Google Scholar 

  46. 46.

    Hu Y, Xu H, Li Z, Zheng X, Jia X, Nie Q, Zhang X: Comparison of the genome-wide DNA methylation profiles between fast-growing and slow-growing broilers. PLoS One. 2013, 8 (2): e56411-

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  47. 47.

    Jones PA: Functions of DNA methylation: islands, start sites, gene bodies and beyond. Nat Rev Genet. 2012, 13 (7): 484-492.

    CAS  PubMed  Article  Google Scholar 

  48. 48.

    da Huang W, Sherman BT, Lempicki RA: Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009, 4 (1): 44-57.

    PubMed  Article  Google Scholar 

  49. 49.

    Booth FW, Chakravarthy MV, Spangenburg EE: Exercise and gene expression: physiological regulation of the human genome through physical activity. J Physiol. 2002, 543 (Pt 2): 399-411.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  50. 50.

    Li Q, Li N, Hu X, Li J, Du Z, Chen L, Yin G, Duan J, Zhang H, Zhao Y, Wang J, Li N: Genome-wide mapping of DNA methylation in chicken. PLoS One. 2011, 6 (5): e19428-

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  51. 51.

    Klose RJ, Bird AP: Genomic DNA methylation: the mark and its mediators. Trends Biochem Sci. 2006, 31 (2): 89-97.

    CAS  PubMed  Article  Google Scholar 

  52. 52.

    Lorincz MC, Dickerson DR, Schmitt M, Groudine M: Intragenic DNA methylation alters chromatin structure and elongation efficiency in mammalian cells. Nat Struct Mol Biol. 2004, 11 (11): 1068-1075.

    CAS  PubMed  Article  Google Scholar 

  53. 53.

    Adelson DL, Raison JM, Garber M, Edgar RC: Interspersed repeats in the horse (Equus caballus); spatial correlations highlight conserved chromosomal domains. Anim Genet. 2010, 41 (Suppl 2): 91-99.

    PubMed  Article  Google Scholar 

  54. 54.

    Portela A, Esteller M: Epigenetic modifications and human disease. Nat Biotechnol. 2010, 28 (10): 1057-1068.

    CAS  PubMed  Article  Google Scholar 

  55. 55.

    Gordenin DA, Lobachev KS, Degtyareva NP, Malkova AL, Perkins E, Resnick MA: Inverted DNA repeats: a source of eukaryotic genomic instability. Mol Cell Biol. 1993, 13 (9): 5315-5322.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  56. 56.

    Walsh CP, Chaillet JR, Bestor TH: Transcription of IAP endogenous retroviruses is constrained by cytosine methylation. Nat Genet. 1998, 20 (2): 116-117.

    CAS  PubMed  Article  Google Scholar 

  57. 57.

    Gao F, Luo Y, Li S, Li J, Lin L, Nielsen AL, Sørensen CB, Vajta G, Wang J, Zhang X, Du Y, Yang H, Bolund L: Comparison of gene expression and genome-wide DNA methylation profiling between phenotypically normal cloned pigs and conventionally bred controls. PLoS One. 2011, 6 (10): e25901-

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  58. 58.

    Grunau C, Hindermann W, Rosenthal A: Large-scale methylation analysis of human genomic DNA reveals tissue-specific differences between the methylation profiles of genes and pseudogenes. Hum Mol Genet. 2000, 9 (18): 2651-2663.

    CAS  PubMed  Article  Google Scholar 

  59. 59.

    Bang WY, Kim SW, Kwon SG, Hwang JH, Kim TW, Ko MS, Cho IC, Joo YK, Cho KK, Jeong JY, Kim CW: Swine liver methylomes of Berkshire, Duroc and Landrace breeds by MeDIPS. Anim Genet. 2013, 44 (4): 463-466.

    CAS  PubMed  Article  Google Scholar 

  60. 60.

    Yang C, Zhang M, Niu W, Yang R, Zhang Y, Qiu Z, Sun B, Zhao Z: Analysis of DNA methylation in various swine tissues. PLoS One. 2011, 6 (1): e16229-

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  61. 61.

    Shukla S, Kavak E, Gregory M, Imashimizu M, Shutinoski B, Kashlev M, Oberdoerffer P, Sandberg R, Oberdoerffer S: CTCF-promoted RNA polymerase II pausing links DNA methylation to splicing. Nature. 2011, 479 (7371): 74-79.

    CAS  PubMed  Article  Google Scholar 

  62. 62.

    Hirano M, Anderson DE, Erickson HP, Hirano T: Bimodal activation of SMC ATPase by intra- and inter-molecular interactions. EMBO J. 2001, 20 (12): 3238-3250.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  63. 63.

    Karki S, Holzbaur EL: Cytoplasmic dynein and dynactin in cell division and intracellular transport. Curr Opin Cell Biol. 1999, 11 (1): 45-53.

    CAS  PubMed  Article  Google Scholar 

  64. 64.

    Fletcher DA, Mullins RD: Cell mechanics and the cytoskeleton. Nature. 2010, 463 (7280): 485-492.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  65. 65.

    Li R, Yu C, Li Y, Lam TW, Yiu SM, Kristiansen K, Wang J: SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics. 2009, 25 (15): 1966-1967.

    CAS  PubMed  Article  Google Scholar 

  66. 66.

    Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nusbaum C, Myers RM, Brown M, Li W, Liu XS: Model-based analysis of ChIP-Seq (MACS). Genome Biol. 2008, 9 (9): R137-

    PubMed Central  PubMed  Article  Google Scholar 

  67. 67.

    Ye T, Krebs AR, Choukrallah MA, Keime C, Plewniak F, Davidson I, Tora L: seqMINER: an integrated ChIP-seq data interpretation platform. Nucleic Acids Res. 2011, 39 (6): e35-

    CAS  PubMed Central  PubMed  Article  Google Scholar 

Download references


This work was supported by a grant from the Next generation BioGreen 21 Program (No. PJ0081062011), Rural Development Administration, Republic of Korea.

Author information



Corresponding authors

Correspondence to Jong Bhak or Heui-Soo Kim.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

HSK, and JB designed and supervised the experiments and analyses. HSK, YMY, HKL, KDP, BWC, and JB supervised the progress of the project. HSK, YMY, KDP, KTD, and BWC prepared skeletal muscle, heart, lung, and cerebrum tissues of thoroughbred and Jeju horses. KA, and THK generated sequences from the samples. CPH, JWM, SS, DSK, and SK conducted the bioinformatics analyses. HSK, JRL, JWM, and CPH designed the validation experiments, and JRL, YDJ, JHB, YC, JE, JAG and YJK conducted the experiments. JRL, YDJ, CPH, JAG, HSK, and JB wrote the manuscript and HSH, KH, JMY, and HJC participated in improving the manuscript. All authors read and approved the final manuscript.

Ja-Rang Lee, Chang Pyo Hong, Jae-Woo Moon contributed equally to this work.

Electronic supplementary material


Additional file 1: Figure S1: Pearson’s correlation between methylated peaks, chromosome length, and gene number. The peaks were plotted against chromosome length and gene number (B). Figure S2. Validation of MeDIP-seq data by bisulfite sequencing with relatively moderate methylated region. Box indicated amplification regions. CpG dinucleotides are represented by circles on vertical bars. Each line represented an independent clone, and methylated CpGs are marked by filled circles, unmethylated CpGs by open circles. Figure S3. Validation of MeDIP-seq data by bisulfite sequencing with differentially methylated regions in skeletal muscle. Box indicated amplification regions. CpG dinucleotides are represented by circles on vertical bars. Each line represented an independent clone, and methylated CpGs are marked by filled circles, unmethylated CpGs by open circles. Table S1. The general information of MeDIP-seq data in different each tissues from the Thoroughbred and Jeju horse. Table S2. Comparison of the number of differentially methylated region in throughbred and Jeju horse. Table S3. The information of primers for BSP. (DOCX 590 KB)

Authors’ original submitted files for images

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Lee, JR., Hong, C.P., Moon, JW. et al. Genome-wide analysis of DNA methylation patterns in horse. BMC Genomics 15, 598 (2014).

Download citation


  • Thoroughbred horse
  • Jeju horse
  • Genome-wide DNA methylation
  • Differential methylated region (DMR)
  • MeDIP-seq