Cross species comparison of C/EBPα and PPARγ profiles in mouse and human adipocytes reveals interdependent retention of binding sites
- Søren F Schmidt†1,
- Mette Jørgensen†2,
- Yun Chen2,
- Ronni Nielsen1,
- Albin Sandelin2Email author and
- Susanne Mandrup1Email author
© Schmidt et al; licensee BioMed Central Ltd. 2011
Received: 23 December 2010
Accepted: 16 March 2011
Published: 16 March 2011
The transcription factors peroxisome proliferator activated receptor γ (PPARγ) and CCAAT/enhancer binding protein α (C/EBPα) are key transcriptional regulators of adipocyte differentiation and function. We and others have previously shown that binding sites of these two transcription factors show a high degree of overlap and are associated with the majority of genes upregulated during differentiation of murine 3T3-L1 adipocytes.
Here we have mapped all binding sites of C/EBPα and PPARγ in human SGBS adipocytes and compared these with the genome-wide profiles from mouse adipocytes to systematically investigate what biological features correlate with retention of sites in orthologous regions between mouse and human. Despite a limited interspecies retention of binding sites, several biological features make sites more likely to be retained. First, co-binding of PPARγ and C/EBPα in mouse is the most powerful predictor of retention of the corresponding binding sites in human. Second, vicinity to genes highly upregulated during adipogenesis significantly increases retention. Third, the presence of C/EBPα consensus sites correlate with retention of both factors, indicating that C/EBPα facilitates recruitment of PPARγ. Fourth, retention correlates with overall sequence conservation within the binding regions independent of C/EBPα and PPARγ sequence patterns, indicating that other transcription factors work cooperatively with these two key transcription factors.
This study provides a comprehensive and systematic analysis of what biological features impact on retention of binding sites between human and mouse. Specifically, we show that the binding of C/EBPα and PPARγ in adipocytes have evolved in a highly interdependent manner, indicating a significant cooperativity between these two transcription factors.
The adipose tissue plays a central role in maintaining whole body lipid and glucose homeostasis as well as insulin sensitivity [1, 2]. Adipocytes are derived from fibroblastic precursors in the adipose tissue through a tightly regulated differentiation process. The molecular basis for the regulation of adipogenesis has been studied extensively in vitro using a variety of preadipocyte cell culture models. In particular, studies of the 3T3-L1 cell line derived from mouse embryo fibroblasts  have been valuable for gaining insight into the ordered cascade of molecular events required for adipogenesis [4–6]. More recently, human cell culture models have become available including the SGBS cell line derived from preadipocytes from a patient with Simpson-Golabi-Behmel syndrome .
The transcription factor peroxisome proliferator activated receptor (PPARγ) is a key regulator of adipogenesis, required for in vitro as well as in vivo differentiation of adipocytes [8, 9]. In addition to PPARγ, a number of other important transcriptional regulators of adipocyte differentiation have been identified , including members of the CCAAT/enhancer binding protein (C/EBP) family [10–12]. C/EBPα is induced late in adipocyte differentiation and is known to cooperate with PPARγ in induction of at least a subset of adipocyte-specific genes. In addition, these two factors induce the expression of each other [13, 14]. Two other members of the C/EBP family, C/EBPβ and -δ, are directly involved in the transcriptional induction of PPARγ and C/EBPα .
We recently used chromatin immunoprecipitation (ChIP) combined with deep sequencing (ChIP-seq) to generate genome-wide maps of the binding sites of PPARγ and its heterodimerization partner retinoid × receptor (RXR) during differentiation of 3T3-L1 adipocytes . In addition, we profiled RNA polymerase II (RNAPII) occupancy to measure active transcription at different time points during differentiation. This study revealed that PPARγ:RXR binding was highly enriched in the vicinity of genes upregulated during adipogenesis. In fact, the majority (75%) of all highly up-regulated genes have PPARγ:RXR binding in the immediate vicinity . Similarly, Lazar and colleagues  and others [17, 18] used ChIP in combination with hybridization to genomic tiling microarrays (ChIP-chip) or cloning followed by sequencing (ChIP-PET) to map PPARγ binding sites in 3T3-L1 adipocytes (reviewed in ). Intriguingly, these studies have revealed that the C/EBPα consensus site is highly over-represented under the binding regions of PPARγ. Lazar and colleagues profiled C/EBPα binding sites in mature 3T3-L1 adipocytes and found a remarkable overlap between C/EBPα and PPARγ binding (> 60% of all PPARγ sites) on a genome-wide scale . Importantly, 60% of the genes induced during adipogenesis have both C/EBPα and PPARγ binding sites within 50 kb of a transcription start site (TSS), and knockdown studies indicated that both C/EBPα and PPARγ are required for robust gene expression of a few selected adipocyte specific target genes. Cumulatively, these results indicate that both PPARγ and C/EBPα are directly involved in the activation of the majority of adipocyte-specific genes and that they cooperate through binding to adjacent sites on DNA.
Genome-wide profiling has also made it possible to study the evolution of gene regulation by mapping the sites for the same transcription factors in different species. This is typically done by aligning genomes of the two species and tabulating the number of detected sites in one species that are in the corresponding region in the other species. This means that a genomic region might be highly conserved in terms of nucleotides but may or may not bind the transcription factor in question in both species. To distinguish between genome sequence conservation and transcription factor binding site conservation, we will use the word "retention" to describe binding sites that are present in both species in the corresponding genomic region.
Notably, whereas the functional gene targets of a particular transcription factor are generally well conserved between species, it has been shown that the majority of binding sites for all transcription factors investigated to date are species-specific [20–25], reviewed in . This is surprising given that sequence conservation has been successfully used to enhance regulatory site prediction in proximal promoters (phylogenetic foot printing) [27, 28]; however, this might reflect that these older studies were focused on limited sets of sites often located in tissue-specific promoters, while genome-wide methods, such as ChIP-seq, are independent of previous annotations. Consistent with the species specificity of transcription factor binding sites, Rosen and colleagues recently compared PPARγ binding in 3T3-L1 adipocytes and in vitro differentiated primary human adipose stromal cells (hASC), and found that only 21.3% of the murine binding sites were retained in human adipocytes. By contrast, the association of PPARγ with adipocyte gene regulation appeared to be better retained than the specific binding sites, and the overall gene expression profiles were well conserved . Rosen and colleagues also showed that genes associated with a conserved PPARγ binding site are more likely to be upregulated during adipogenesis than genes associated with a species specific site , indicating that retention of PPARγ binding is increased near upregulated genes.
While previous reports agree that retention of transcription factor binding sites is limited, systematic analysis of the biological features determining whether a site is retained or not has not been performed. In particular, the interdependence between retention of the transcription factor binding sites of two transcription factors has not been investigated. Here, we used ChIP-seq to generate genome-wide binding profiles of C/EBPα and PPARγ in human SGBS adipocytes, compared these to previously published profiles in mouse 3T3-L1 adipocytes [15, 16], and systematically studied what features affect whether a binding site is retained between species or not.
We find that PPARγ binding sites have higher retention near genes upregulated during adipogenesis, and that regions bound by both factors are even more likely to be retained. Interestingly, PPARγ binding site retention in these co-bound regions is increased by the presence of a C/EBPα consensus site, suggesting that C/EBPα may facilitate PPARγ binding to DNA. At the same time, and independent of C/EBPα and PPARγ sequence patterns, sequence conservation in the larger region surrounding the actual binding sites has a positive impact on retention of both C/EBPα and PPARγ binding sites, indicating that other DNA sequence patterns also affect binding of these two factors to DNA.
Genome-wide mapping of C/EBPα and PPARγ binding in human SGBS adipocytes
To compare C/EBPα and PPARγ binding in mouse and human adipocytes, we used ChIP-seq to generate genome-wide profiles of C/EBPα and PPARγ binding in human SGBS adipocytes. Mapped ChIP tags were analyzed using peak finder methods, which identify regions enriched for ChIP tags. For simplicity, we will refer to such regions as "sites", but it is important to notice that these regions are typically 100-400 nt wide (up to 1000 nt wide in ChIP-chip studies) and thus several magnitudes larger than the actual binding site (typically 5-20 nt). We used the MACS peak finder  to identify regions consistently bound by C/EBPα and PPARγ in two biologically independent experiments. We detected 52,733 C/EBPα and 23,328 PPARγ binding sites in human SGBS adipocytes, and in accordance with previous findings in mouse , we found that 49.5% of PPARγ binding sites are also bound by C/EBPα. The number of PPARγ sites is comparable to the 39,968 sites recently reported by Rosen and colleagues for in vitro differentiated hASCs . However, the numbers are notably higher than the corresponding numbers in 3T3-L1 cells, where 16,760  and 15,461 (this study) C/EBPα sites and 5299  and 6952  PPARγ sites were reported. It remains to be investigated whether this represents a true species-specific difference, or whether this is more related to technical issues such as antibody specificity or cell culture conditions.
Correlating binding sites with gene expression patterns during adipogenesis
We previously identified 1650 genes which were differentially expressed during 3T3-L1 adipogenesis as assessed by RNAPII association to the respective genes . In that study, these genes were clustered in 5 different expression clusters depending on their relative expression profile during differentiation. For the purpose of this study we combined the two clusters containing genes downregulated during 3T3-L1 adipogenesis. This resulted in four new clusters: downregulated (1), transiently upregulated (2), moderately upregulated (3), and highly upregulated (4) (Figure 1B). For each gene belonging to a particular cluster, we divided the genomic region at +/- 100 kb around TSS into 10 kb intervals ("bins") and counted the number of mouse PPARγ binding sites within each such intervals. The PPARγ binding sites are enriched in the vicinity of all regulated genes compared to randomly selected regions (Figure 1C). Moreover and consistent with previous findings [15, 16], PPARγ binding sites in mouse adipocytes are highly enriched in the vicinity of genes moderately (3) and highly (4) upregulated during adipogenesis (Figure 1C). Similarly, PPARγ binding sites are also enriched in the vicinity of the human gene orthologs of the genes in these clusters (Figure 1C), consistent with a role for PPARγ in the regulation of these genes in SGBS adipocytes.
Measuring retention of mouse binding sites in human
To study the retention of C/EBPα and PPARγ binding sites between mouse and human adipocytes, we used human-mouse genome alignments  to identify human genome regions orthologous to C/EBPα and PPARγ sites in mouse (see Methods). To make sure that the sites were orthologous at genome sequence level, a mouse site had to map to a single location in the human genome, which in turn had to map back to the original site in mouse. We did not address site loss or gain due to large deletions or insertions in one species. This resulted in 2176 PPARγ and 4899 C/EBPα mouse sites that could be mapped to the human genome (62.5% and 56.3% of all detected mouse PPARγ and C/EBPα binding sites, respectively).
Conversely, there are sites that we with high confidence can label as retained between species. We noticed that there are larger genomic regions in which the majority of C/EBPα and PPARγ binding are retained as exemplified by the PPARG2 locus (Figure 3C). Given the crucial role of PPARγ in adipocyte differentiation, it is tempting to suggest that the reason that these particular sites are retained is their importance for the overall regulatory regime, since they likely regulate a key transcription factor. Prompted by this observation, we proceeded to systematically identify biological features that are positively correlated with increased retention of PPARγ and C/EBPα sites, with the hope of increasing our understanding of adipocyte gene regulation.
Retention of PPARγ binding sites is increased in the vicinity of genes up-regulated during 3T3-L1 adipogenesis
Several studies have reported that retention of binding sites is generally higher in the vicinity of putative key target genes [20–25]. However, species-specific binding sites are also enriched in these regions, and no studies have addressed whether the relative retention of binding sites (i.e the fraction of the sites in a mouse region that is retained vs. lost in human) is increased in the vicinity of putative target genes of the transcription factor. The assignment of target genes for binding sites is not trivial. Most studies agree that sites proximal to a gene are more likely to regulate the gene in question but this is not an absolute truth. In light of this, we used a combination of distance and gene expression to associate genes and binding sites. Thus, to systematically investigate how binding site retention is affected by the vicinity to putative target genes, we labeled sites in the following way. First, for each binding site, we located the closest gene TSS. If this TSS was closer than 100 kb, and the gene was within one of the pre-defined expression clusters shown in Figure 1 (a differentially expressed gene), the site was labeled with the corresponding cluster number (cluster 1 to 4). Sites where the closest TSS was closer than 100 kb but the gene was not part of any expression cluster (a gene with no measurable difference in expression during adipogenesis) were defined to belong to a new cluster, cluster 5 ("constitutive genes"). Finally, all sites with no gene TSS within 100 kb were defined to belong to a new cluster: cluster 6 ("distal sites"). There are obviously many ways to assign binding sites to putative target genes. However, the overall results described below are not specific to the cluster assignment method used. For instance, assigning sites to expression clusters only based on the identity of the closest gene gave very similar results (data not shown).
Turnover of PPARγ binding is increased in the vicinity of genes upregulated during 3T3-L1 adipogenesis
Retention of C/EBPα and PPARγ binding is increased at regions bound by both C/EBPα and PPARγ
Since we now established that both the expression pattern of nearby genes and co-binding of PPARγ and C/EBPα have an impact on retention of sites, we proceeded to see how these two associations are depending on one another. About 20% of C/EBPα-only sites are retained in human, regardless of expression pattern of nearby genes (Figure 6D), but C/EBPα binding sites overlapping PPARγ sites have a significantly higher retention if they are in the vicinity of moderately or highly upregulated genes (cluster 3 and 4) (Fisher Exact Test, P = 8.6*10^-15). Thus, the increased retention of C/EBPα binding in the vicinity of highly upregulated genes (Figure 4A), originates from increased retention of the binding sites that overlap with PPARγ binding.
For PPARγ-only sites there are large differences in the amount of retention depending on which gene expression cluster the sites are assigned to, with sites close to highly upregulated genes most likely to be retained (Fisher Exact Test, P = 0.01). By contrast, retention of PPARγ sites overlapping with C/EBPα is less affected by proximity to highly upregulated genes (Fisher Exact Test, P = 0.2, PPARγ overlap cluster 4 vs. all other cluster) (Figure 6C). Interestingly, retention of PPARγ sites near highly upregulated genes is not affected by overlap with C/EBPα (Fisher Exact Test, P = 0.8179, PPARγ overlap cluster 4 vs. PPARγ only cluster 4). Overall these findings are supported by the enrichment analysis (Figure 6E), although in this case, the enrichment of C/EBPα sites that overlap with PPARγ sites near moderately and highly upregulated genes is only close to significant (Wilcoxon test, P = 0.05707, clusters 3+4 vs. all other clusters), and the enrichment of PPARγ sites that overlap with C/EBPα sites near highly upregulated genes is now significantly higher than for other PPARγ sites overlapping C/EBPα (Wilcoxon test, P = 0.001).
C/EBPα sequence patterns are indicative of PPARγ retention in human but not vice versa
Interestingly, we found that PPARγ binding sites overlapping C/EBPα sites in mouse are more likely to be retained as PPARγ binding sites in human if they have a C/EBP consensus site (Fisher Exact Test, P = 0.009)(Figure 7B). We noticed the same tendency for PPARγ binding sites not overlapping C/EBPα sites in mouse, although this trend is not strong enough to be statistically significant (Fisher Exact Test, P = 0.16). However, the opposite is not true, i.e. PPARγ consensus sites have no impact on the retention of mouse C/EBPα binding sites in human (Fisher Exact, P = 0.88) (Figure 6B).
One way to interpret this is that for regions bound by both factors, the C/EBP consensus site is being selected for over evolution rather than the PPARγ consensus site, which would indicate that PPARγ binding is dependent on sequence-specific C/EBP binding. There are several mechanisms that could explain this, including PPARγ binding to suboptimal sites with the help of C/EBPα, indirect binding by tethering to C/EBPα, and indirect binding via long range chromosomal loops [34, 35].
It is likely that in addition to C/EBPα, several other transcription factors cooperate with PPARγ in the transactivation of nearby genes, and some of these factors may also cooperate with PPARγ in DNA binding. Such cooperativity would be expected to contribute to the retention of PPARγ sites. We investigated how the evolutionary constraint on the overall DNA sequence within the binding region of each mouse PPARγ and C/EBPα binding site affects retention of these binding sites in human. We found that regions with high mouse-human DNA sequence identity have higher retention of both C/EBPα and PPARγ binding (Figure 7C). Interestingly, the "high identity quartile" is not enriched for regions containing C/EBPα and PPARγ consensus sites (Figure 7D and 7E), indicating that other sequence patterns/transcription factors are also important for retention of C/EBPα and PPARγ binding.
In this study, we used ChIP-seq to generate genome-wide profiles of the binding sites of the major adipogenic transcription factors, C/EBPα and PPARγ, in mature human SGBS adipocytes. We identified 52,733 C/EBPα and 23,328 PPARγ binding sites, and consistent with previous studies in mouse adipocytes [15, 16] we found that PPARγ binding sites are highly enriched in the vicinity of genes upregulated during adipogenesis. Furthermore, and also consistent with what has been reported for mouse adipocytes , a high percentage (49.5%) of the PPARγ binding sites overlap with C/EBPα binding sites. Thus, these data indicate conservation of the overall regulatory regime of C/EBPα and PPARγ between mouse and human adipocytes, including their potential direct cooperativity through binding to adjacent sites. However, despite the conservation of the overall regulatory regime and putative target genes between mouse and human adipocytes the retention of mouse binding sites in human is limited, i.e. most sites are species-specific. Similar results demonstrating limited retention of binding sites despite extensive conservation of association with putative functional targets have previously been reported for other transcription factors [20, 22–25], and was recently also reported by Rosen and colleagues when comparing PPARγ binding sites in primary human in vitro differentiated adipocytes with the binding sites identified in murine 3T3-L1 cells .
Despite the fact that most binding sites of PPARγ and C/EBPα are species-specific, there are some sites that we can confidently classify as retained between human and mouse. There are also larger genomic regions, like the PPARG2 locus, where most sites are retained (Figure 3C). This observation prompted us to systematically investigate what biological features determine whether a site is retained or not. Thus, we have systematically investigated several biological features that could affect the retention of mouse C/EBPα and PPARγ binding sites, using two complementary methods with different advantages, i.e. a binary approach where overlaps of sites are counted, and an enrichment approach where the signal strength of human ChIP reads is assessed. The binary approach has the advantage of being conservative (i.e. only strong signals are being labeled as bound sites), and it is easier to interpret the outcomes as sites can be either lost or retained. Conversely, the enrichment method can take weaker sites into account and is not dependent on specific thresholds in peak finding; however, this method will not clearly define what sites are retained and lost, as it imposes no thresholds on the data.
Regardless of method, we find that there is a significantly increased retention as well as turn-over of PPARγ binding sites in the vicinity of genes upregulated during adipogenesis compared with binding sites more distant from such genes. By contrast, retention of C/EBPα binding is only slightly increased in the vicinity of genes that are highly upregulated. The difference indicates that there is a higher evolutionary pressure to maintain specific PPARγ binding sites compared with specific C/EBPα binding sites for the regulation of adipocyte differentiation and function. This may reflect the higher importance of PPARγ over C/EBPα in adipocyte differentiation, as most clearly demonstrated by Spiegelmann and colleagues, who showed that that ectopic expression of PPARγ in C/EBPα -/- mouse embryo fibroblasts (MEF) , but not C/EBPα in PPARγ -/- MEFs , can induce adipogenesis. In addition, since there are more C/EBPα binding sites than PPARγ binding sites there may be less evolutionary constraint on the individual C/EBPα sites. This could reflect the lower binding specificity of C/EBPα, since it is easier to create a new site if the factor can bind to many different sequences.
We also observed that mouse sites that are strongly bound by either factor are more likely to be retained, but the causality is unclear. Stronger binding sites could be under higher selective pressure, since they are more likely to have a vital function. In addition, stronger binding sites could be harder to erode over time, as these are likely to represent sites where multiple factors such as co-binding transcription factors and chromatin accessibility contribute to a high occupancy of the site. Thus, strongly bound sites in a common ancestor between two species might require more evolutionary changes to be reduced to a fully inactive site in one species.
The remarkable overlap between PPARγ and C/EBPα binding reported previously [15, 16] indicates cooperativity between the two transcription factors. In support of this, we find that retention of both C/EBPα and PPARγ binding is increased, when the binding sites overlap with a binding site of the other factor, suggesting that a potential synergy between these two factors is of such significance that selective pressure on these overlapping sites is increased. Interestingly, while retention of C/EBPα-only sites is not affected by vicinity to regulated genes, the C/EBPα sites that overlap with PPARγ sites display increased retention in the vicinity of upregulated genes, further indicating an importance of C/EBPα co-binding with PPARγ in adipocyte gene regulation.
We also analyzed the sequence of human genomic regions corresponding to overlapping PPARγ and C/EBPα binding sites in mouse adipocytes. These analyses revealed that presence of the C/EBP consensus sequence in the human sequence not only predicts the retention of C/EBPα binding but, surprisingly, also retention of PPARγ binding at sites that overlap with C/EBPα binding sites in mouse. By contrast, the PPAR consensus sequence in the human sequence only predicts retention of PPARγ binding, not retention of C/EBPα binding at sites overlapping with PPARγ binding sites. This indicates that at these overlapping sites, the C/EBP consensus sequence in the binding region is being selected for over evolution to a higher degree than the PPAR consensus site, suggesting that PPARγ binding is directed by C/EBPα binding. By analogy, differential binding of nuclear factor κB (NFκB) in 10 human individuals correlates with changes in a consensus sites of a reported cooperating transcription factor, signal transducer and activator of transcription 1 (Stat 1) , and similar results have been found in studies in yeast  and fungi .
Possible mechanisms explaining the interdependence between observed C/EBP consensus site conservation and PPARγ retention include C/EBP-assisted binding of PPARγ to an adjacent consensus site as well as indirect binding of PPARγ to DNA by tethering to C/EBPα or by long range intrachromosomal loops [34, 35]. Lazar and colleagues have previously investigated the interdependence of C/EBPα and PPARγ binding in mature 3T3-L1 adipocytes and failed to see an effect of C/EBPα knockdown on PPARγ binding to selected target sites in mature 3T3-L1 adipocytes . However, since only a few sites were investigated it is difficult to conclude on the general importance. Furthermore, it is possible that other members of the C/EBP family, which all share the same consensus sequence, play a role in the establishment of the PPARγ transcriptional complex during adipogenesis and a more limited role once the complex is established. Intriguingly, recent data from our laboratory indicate that C/EBPβ, which is expressed early in differentiation, may play a role in early chromatin remodeling of PPARγ binding sites (Siersbæk, Nielsen, John, Sung, Baek, Loft, Hager, and Mandrup, EMBO Journal, in press)).
Odom and colleagues have previously shown that the transcription factor binding regions that were retained between multiple species had higher sequence constraint than species specific binding regions , however the interdependence between conservation of the region around the bound site and the retention of the consensus site was not addressed. Interestingly, we show here that the high identity quartile of binding sites (i.e. the binding regions with the highest sequence conservation) displayed increased retention of C/EBPα and PPARγ binding independent of conservation of their respective consensus sites. This strongly indicates that additional sequence patterns are important for C/EBPα and PPARγ binding. Such sequence patterns could among others be directing binding of cooperating transcription factors.
Here we have performed a comprehensive and systematic analysis to investigate what parameters affect the retention of transcription factors binding between the mouse and human genome. We show that retention of PPARγ and C/EBPα binding between mouse and human adipocytes is interdependent, which strongly indicates that these transcription factors cooperate at the level of gaining access to their target sites in the genome. In addition, our data show that overall sequence conservation of the binding region contributes to retention, suggesting that other sequence patterns contribute to retention. Future experimental analyses will be required to dissect the interdependence of PPARγ and C/EBPα and possibly other transcription factors at a molecular level.
The 3T3-L1 fibroblasts were differentiated to adipocytes by stimulation with dexamethasone (DEX), 1-methyl-3-isobutylxanthine (MIX) and insulin as described previously . The cells were harvested for ChIP experiments at day 6 of differentiation. SGBS cells were generously provided by Dr. Martin Wabitsch at the University of Ulm, Germany and differentiated to adipocytes using a procedure modified from previous publications [7, 41]. Briefly, SGBS cells were grown to confluence in Dulbecco's Modified Eagle's Medium/Nutrient Mixture F-12 Ham's supplemented with 10% fetal bovine serum, 33 μM biotin, 17 μM pantothenate, 100 μg/ml streptomycin, 62.5 μg/ml penicillin, 1 ng/μl fibroblast growth factors (FGF) 1, and 90 μg/μl heparin. At two days postconfluency, SGBS cells were stimulated to differentiate with serum-free growth medium supplemented with 10 nM insulin, 200 pM triiodothyronine, 1 μM cortisol, 2 μM BRL 49653, 0.115 mg/ml MIX, 0.25 mmol/L DEX, and 0.01 mg/ml human transferrin. After 3 days, the medium was replaced with the differentiation medium without FGF1 and heparin, and after 6 days Rosiglitazone/BRL49653, MIX, and DEX was removed from the medium. The cells were harvested for ChIP experiments at day 10 of differentiation.
ChIP experiments were performed according to standard protocol as described in . Antibodies used were: PPARγ (H-100, sc7196, Santa Cruz), C/EBPα (14AA, sc61). ChIP-seq sample preparation for sequencing was performed according to the manufacturer's instructions (Illumina). Sequence reads from each ChIP-seq library were mapped to the reference genomes mm9 for mouse and hg18 for human using the the BWA alignment software allowing for 2 mismatches . PPARγ and C/EBPα binding sites were identified using MACS  with a p-value cutoff of e-10, and using sonicated input as a control. C/EBPα and PPARγ ChIP-seqs in 3T3-L1 were performed in monoplicates, and to identify high confidence binding sites we compared our data with previously published C/EBPα and PPARγ ChIP-chip data . ChIP-seq binding regions overlapped by ChIP-chip binding regions were used for downstream analysis. C/EBPα and PPARγ ChIP-seqs in SGBS cells were performed in duplicates from two independent experiments, and binding sites identified in both experiments were used for downstream analysis. Some of the detected sites both in SGBS and 3T3-L1 cells were much wider than expected, possibly due to multiple nearby binding events or noise. To avoid these and focus on reliable sites, we filtered away the 5% widest binding sites.
Finding regions bound by both PPARγ and C/EBPα
Initial analysis of the overlap between C/EBPα and PPARγ binding sites in mouse showed that in the large majority of cases either most of the site of one factor is overlapped by the site of the other factor or it is not overlapped at all (Additional file 1 Fig. S2). Thus, altering the threshold for how large the overlap should be for sites to be considered overlapping, does not have substantial impact of the downstream analysis. Based on the percentage overlap shown in Additional file 1 Fig. S2 and the length distributions for the binding sites in mouse, a C/EBPα site overlapping a PPARγ site with at least 1% of the total width of the C/EBPα site was defined as "overlapping". The PPARγ binding sites are in general more narrow than C/EBPα, so we used a higher threshold of 10% overlap for PPARγ. In human regions corresponding to mouse sites we relaxed this criterion to 1 nt overlap, since there is an implicit criterion for both sites to be in the same region in the orthology analysis described below. Changing the criteria lead to no substantial difference in terms of overall results (data not shown).
Finding human genomic regions orthologous to the mouse binding sites
We used the UCSC lift over tool [44, 45] together with chained BlastZ alignments  to lift over the binding sites from mm9 to hg18 genome assemblies. We used minMatch = 0.1, which is the recommended setting for inter-species comparisons. We then lifted the binding sites back again and tested if they overlapped a minimum of 90% of the original region. Only binding sites that could be uniquely mapped both ways were considered for the rest of the analyses.
Defining retained, species-specific binding sites and turnover events
For each lifted mouse binding site we checked if >10% of the site was overlapped by a human site, or vice versa. If any of these criteria were true, the mouse site was considered retained. Mouse binding sites that were not retained was considered specific to mouse and lost in human. In the same way, human binding sites that are not overlapped 10% by a lifted mouse binding site or vice versa were considered specific to human.
A lost binding site may have been replaced with a new, species-specific binding site - i.e. binding site turnover. For the binding sites lost in human, we checked if a human-specific binding site is located +/-10 kb (max 10 kb between the midpoints of the two binding sites) from the original lifted binding site. If this was the case, the site was considered subject to a turnover event.
Calculating ChIP-seq enrichment in human
As a complementary approach to the above, we considered ChIP-seq counts in human regions orthologous to mouse binding sites as a continuous variable rather than counting the overlap of human ChIP-seq peaks defined by the peak finder. This makes sense, as the number of ChIP-seq tags should correspond to the binding strength of a respective factor.
To make this comparable across regions, for a human region defined as orthologous to a mouse binding site, we counted the number of human ChIP tags in a window centered on the region with a window width corresponding to the 75th percentile of all human binding sites, for the respective factor: 520 nt for C/EBPα and 605 nt for PPARγ. We then estimated the number of ChIP-seq tags expected by random by calculating the mean ChIP-seq counts of 2000 randomly sampled genomic regions with the same widths as above. A log2 fold change value was calculated by dividing the observed number of ChIP-seq tags in the orthologous region with the random expectation. To avoid small number effects, we added a pseudo-count of 1 to both random regions and observed binding site counts.
Assigning genes and expression clusters to mouse binding sites
For each binding site we identified all genes (knownGene table from UCSC genome browser database ), where the TSS is located +/-100 kb from the center of the binding site. The binding site was labeled with the label of the closest gene that was part of an expression cluster. If none of the nearby genes were in a cluster, the binding site was placed in cluster 5 ("constitutive genes"); whereas the site was considered to belong to cluster 6 ("distal sites") if there were no TSS within 100 kb of the binding site.
Finding human gene orthologs to the differentially expressed mouse genes
For each differentially expressed mouse gene we used the UCSC lift over tool together with chained BlastZ alignments to lift over the TSS +/-50 bp from mm9 to hg18. We then found the closest gene (knownGene table from UCSC) within 5 kb of the lifted TSS. Using this method we found orthologous genes for 1482 of the 1626 mouse genes. We discarded from further analyses the 113 TSS regions that could not be lifted, and the 49 of the lifted TSSs that were not located in proximity (+/-5 kb) of any UCSC human gene.
Sequence content analysis
We scanned both strands of each human region defined above with position weight matrices (PWMs) (reviewed in ) from the JASPAR database (models MA0065.2 and MA0019.1, respectively, for PPARγ and C/EBPα) using the ASAP tool  based on a background model following a uniform distribution. To decide a relevant threshold for accepting a predicted binding site, we repeated the analysis in randomly selected regions with the respective PWM. A threshold of 80% of the scoring range of each model could separate the ChIP'ed regions and random regions best, so we used this for further analysis.
Calculating human-mouse identity
According to the binding sites in mouse described above, we extracted the alignments between mouse and human by mafFrag from the Kent source utilities http://genomewiki.ucsc.edu. We then applied Alistat, a utility included in the HMMER package, http://hmmer.janelia.org/ to calculate the identity score defined as (#Identity/minimum (length1, length2)), where #identity is the number of exact identities in an alignment and length1, length2 are the unaligned lengths of the two sequences.
Raw sequencing reads, aligned reads, wiggle files, and MACS regions for all ChIP-seq samples described here have been deposited to the NCBI GEO database http://www.ncbi.nlm.nih.gov/geo/ under accession number GSE27450.
CAAT/enhancer binding protein
human adipose stromal cells
mouse embryo fibroblast
peroxisome proliferator activated receptor
position weight matrices
RNA polymerase II
retinoid × receptor
Simpson Golabi Sehmel syndrom
Transcription start site
The authors thank members of the Mandrup and Sandelin laboratories for valuable discussions and input to the manuscript. We are grateful to Dr. Martin Wabitsch, University of Ulm for the human SGBS cells. This work was supported by grants to SM from the Novo Nordisk Foundation and the NIH NIDDK/ODS (5R01-DK063070) and by grants from European Research Council (FP7/2007-2013/ERC grant agreement 204135) and the Novo Nordisk Foundation to AS.
- Guilherme A, Virbasius JV, Puri V, Czech MP: Adipocyte dysfunctions linking obesity to insulin resistance and type 2 diabetes. Nat Rev Mol Cell Biol. 2008, 9: 367-377. 10.1038/nrm2391.PubMed CentralPubMedView ArticleGoogle Scholar
- Rosen ED, Spiegelman BM: Adipocytes as regulators of energy balance and glucose homeostasis. Nature. 2006, 444: 847-853. 10.1038/nature05483.PubMed CentralPubMedView ArticleGoogle Scholar
- Green H, Kehinde O: Sublines of Mouse 3T3 Cells That Accumulate Lipid. Cell. 1974, 1: 113-116. 10.1016/0092-8674(74)90126-3.View ArticleGoogle Scholar
- MacDougald OA, Mandrup S: Adipogenesis: forces that tip the scales. Trends Endocrinol Metab. 2002, 13: 5-11. 10.1016/S1043-2760(01)00517-3.PubMedView ArticleGoogle Scholar
- Rosen ED, MacDougald OA: Adipocyte differentiation from the inside out. Nat Rev Mol Cell Biol. 2006, 7: 885-896. 10.1038/nrm2066.PubMedView ArticleGoogle Scholar
- Farmer SR: Transcriptional control of adipocyte formation. Cell Metab. 2006, 4: 263-273. 10.1016/j.cmet.2006.07.001.PubMed CentralPubMedView ArticleGoogle Scholar
- Wabitsch M, Brenner RE, Melzner I, Braun M, Moller P, Heinze E, et al: Characterization of a human preadipocyte cell strain with high capacity for adipose differentiation. Int J Obes Relat Metab Disord. 2001, 25: 8-15. 10.1038/sj.ijo.0801520.PubMedView ArticleGoogle Scholar
- Rosen ED, Sarraf P, Troy AE, Bradwin G, Moore K, Milstone DS, et al: PPAR gamma is required for the differentiation of adipose tissue in vivo and in vitro. Molecular Cell. 1999, 4: 611-617. 10.1016/S1097-2765(00)80211-7.PubMedView ArticleGoogle Scholar
- Barak Y, Nelson MC, Ong ES, Jones YZ, Ruiz-Lozano P, Chien KR, et al: PPAR gamma is required for placental, cardiac, and adipose tissue development. Molecular Cell. 1999, 4: 585-595. 10.1016/S1097-2765(00)80209-9.PubMedView ArticleGoogle Scholar
- Wang Nd, Milton JF, Bradley A, Ou CN, Abdelsayed SV, Wilde MD, et al: Impaired Energy Homeostasis in C/EBPalpha Knockout Mice. Science. 1995, 269: 1108-1112. 10.1126/science.7652557.PubMedView ArticleGoogle Scholar
- Linhart HG, Ishimura-Oka K, DeMayo F, Kibe T, Repka D, Poindexter B, et al: C/EBPalpha is required for differentiation of white, but not brown, adipose tissue. Proceedings of the National Academy of Sciences of the United States of America. 2001, 98: 12532-12537. 10.1073/pnas.211416898.PubMed CentralPubMedView ArticleGoogle Scholar
- Tanaka T, Yoshida N, Kishimoto T, Akira S: Defective adipocyte differentiation in mice lacking the C/EBPbeta and/or C/EBPdelta gene. EMBO J. 1997, 16: 7432-7443. 10.1093/emboj/16.24.7432.PubMed CentralPubMedView ArticleGoogle Scholar
- Wu Z, Rosen ED, Brun R, Hauser S, Adelmant G, Troy AE, et al: Cross-regulation of C/EBP alpha and PPAR gamma controls the transcriptional pathway of adipogenesis and insulin sensitivity. Mol Cell. 1999, 3: 151-158. 10.1016/S1097-2765(00)80306-8.PubMedView ArticleGoogle Scholar
- Rosen ED, Hsu CH, Wang X, Sakai S, Freeman MW, Gonzalez FJ, et al: C/EBPalpha induces adipogenesis through PPARgamma: a unified pathway. Genes Dev. 2002, 16: 22-26. 10.1101/gad.948702.PubMed CentralPubMedView ArticleGoogle Scholar
- Nielsen R, Pedersen TA, Hagenbeek D, Moulos P, Siersbaek R, Megens E, et al: Genome-wide profiling of PPARgamma:RXR and RNA polymerase II occupancy reveals temporal activation of distinct metabolic pathways and changes in RXR dimer composition during adipogenesis. Genes Dev. 2008, 22: 2953-2967. 10.1101/gad.501108.PubMed CentralPubMedView ArticleGoogle Scholar
- Lefterova MI, Zhang Y, Steger DJ, Schupp M, Schug J, Cristancho A, et al: PPARgamma and C/EBP factors orchestrate adipocyte biology via adjacent binding on a genome-wide scale. Genes Dev. 2008, 22: 2941-2952. 10.1101/gad.1709008.PubMed CentralPubMedView ArticleGoogle Scholar
- Hamza MS, Pott S, Vega VB, Thomsen JS, Kandhadayar GS, Ng PW, et al: De-novo identification of PPARgamma/RXR binding sites and direct targets during adipogenesis. PLoS One. 2009, 4: e4907-10.1371/journal.pone.0004907.PubMed CentralPubMedView ArticleGoogle Scholar
- Wakabayashi K, Okamura M, Tsutsumi S, Nishikawa NS, Tanaka T, Sakakibara I, et al: The peroxisome proliferator-activated receptor gamma/retinoid × receptor alpha heterodimer targets the histone modification enzyme PR-Set7/Setd8 gene and regulates adipogenesis through a positive feedback loop. Mol Cell Biol. 2009, 29: 3544-3555. 10.1128/MCB.01856-08.PubMed CentralPubMedView ArticleGoogle Scholar
- Siersbaek R, Nielsen R, Mandrup S: PPARgamma in adipocyte differentiation and metabolism--novel insights from genome-wide studies. FEBS Lett. 2010, 584: 3242-3249. 10.1016/j.febslet.2010.06.010.PubMedView ArticleGoogle Scholar
- Boj SF, Servitja JM, Martin D, Rios M, Talianidis I, Guigo R, et al: Functional targets of the monogenic diabetes transcription factors HNF-1alpha and HNF-4alpha are highly conserved between mice and humans. Diabetes. 2009, 58: 1245-1253. 10.2337/db08-0812.PubMed CentralPubMedView ArticleGoogle Scholar
- Mikkelsen TS, Xu Z, Zhang X, Wang L, Gimble JM, Lander ES, et al: Comparative epigenomic analysis of murine and human adipogenesis. Cell. 2010, 143: 156-169. 10.1016/j.cell.2010.09.006.PubMed CentralPubMedView ArticleGoogle Scholar
- Kunarso G, Chia NY, Jeyakani J, Hwang C, Lu X, Chan YS, et al: Transposable elements have rewired the core regulatory network of human embryonic stem cells. Nat Genet. 2010, 42: 631-634. 10.1038/ng.600.PubMedView ArticleGoogle Scholar
- Odom DT, Dowell RD, Jacobsen ES, Gordon W, Danford TW, MacIsaac KD, et al: Tissue-specific transcriptional regulation has diverged significantly between human and mouse. Nat Genet. 2007, 39: 730-732. 10.1038/ng2047.PubMed CentralPubMedView ArticleGoogle Scholar
- Schmidt D, Wilson MD, Ballester B, Schwalie PC, Brown GD, Marshall A, et al: Five-vertebrate ChIP-seq reveals the evolutionary dynamics of transcription factor binding. Science. 2010, 328: 1036-1040. 10.1126/science.1186176.PubMed CentralPubMedView ArticleGoogle Scholar
- Wilson MD, Barbosa-Morais NL, Schmidt D, Conboy CM, Vanes L, Tybulewicz VL, et al: Species-specific transcription in mice carrying human chromosome 21. Science. 2008, 322: 434-438. 10.1126/science.1160930.PubMed CentralPubMedView ArticleGoogle Scholar
- Wilson MD, Odom DT: Evolution of transcriptional control in mammals. Curr Opin Genet Dev. 2009, 19: 579-585. 10.1016/j.gde.2009.10.003.PubMedView ArticleGoogle Scholar
- Wasserman WW, Palumbo M, Thompson W, Fickett JW, Lawrence CE: Human-mouse genome comparisons to locate regulatory sites. Nat Genet. 2000, 26: 225-228. 10.1038/79965.PubMedView ArticleGoogle Scholar
- Lenhard B, Sandelin A, Mendoza L, Engstrom P, Jareborg N, Wasserman WW: Identification of conserved regulatory elements by comparative genome analysis. J Biol. 2003, 2: 13-10.1186/1475-4924-2-13.PubMed CentralPubMedView ArticleGoogle Scholar
- Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, et al: Model-based analysis of ChIP-Seq (MACS). Genome Biol. 2008, 9: R137-10.1186/gb-2008-9-9-r137.PubMed CentralPubMedView ArticleGoogle Scholar
- Schwartz S, Kent WJ, Smit A, Zhang Z, Baertsch R, Hardison RC, et al: Human-mouse alignments with BLASTZ. Genome Res. 2003, 13: 103-107. 10.1101/gr.809403.PubMed CentralPubMedView ArticleGoogle Scholar
- Dermitzakis ET, Clark AG: Evolution of transcription factor binding sites in Mammalian gene regulatory regions: conservation and turnover. Mol Biol Evol. 2002, 19: 1114-1121.PubMedView ArticleGoogle Scholar
- Wasserman WW, Sandelin A: Applied bioinformatics for the identification of regulatory elements. Nat Rev Genet. 2004, 5: 276-287. 10.1038/nrg1315.PubMedView ArticleGoogle Scholar
- Stormo GD: DNA binding sites: representation and discovery. Bioinformatics. 2000, 16: 16-23. 10.1093/bioinformatics/16.1.16.PubMedView ArticleGoogle Scholar
- Fullwood MJ, Liu MH, Pan YF, Liu J, Xu H, Mohamed YB, et al: An oestrogen-receptor-alpha-bound human chromatin interactome. Nature. 2009, 462: 58-64. 10.1038/nature08497.PubMed CentralPubMedView ArticleGoogle Scholar
- Carroll JS, Liu XS, Brodsky AS, Li W, Meyer CA, Szary AJ, et al: Chromosome-wide mapping of estrogen receptor binding reveals long-range regulation requiring the forkhead protein FoxA1. Cell. 2005, 122: 33-43. 10.1016/j.cell.2005.05.008.PubMedView ArticleGoogle Scholar
- Wu Z, Rosen ED, Brun R, Hauser S, Adelmant G, Troy AE, et al: Cross-regulation of C/EBP alpha and PPAR gamma controls the transcriptional pathway of adipogenesis and insulin sensitivity. Mol Cell. 1999, 3: 151-158. 10.1016/S1097-2765(00)80306-8.PubMedView ArticleGoogle Scholar
- Kasowski M, Grubert F, Heffelfinger C, Hariharan M, Asabere A, Waszak SM, et al: Variation in transcription factor binding among humans. Science. 2010, 328: 232-235. 10.1126/science.1183621.PubMed CentralPubMedView ArticleGoogle Scholar
- Zheng W, Zhao H, Mancera E, Steinmetz LM, Snyder M: Genetic analysis of variation in transcription factor binding in yeast. Nature. 2010, 464: 1187-1191. 10.1038/nature08934.PubMed CentralPubMedView ArticleGoogle Scholar
- Tuch BB, Galgoczy DJ, Hernday AD, Li H, Johnson AD: The evolution of combinatorial gene regulation in fungi. PLoS Biol. 2008, 6: e38-10.1371/journal.pbio.0060038.PubMed CentralPubMedView ArticleGoogle Scholar
- Helledie T, Grontved L, Jensen SS, Kiilerich P, Rietveld L, Albrektsen T, et al: The gene encoding the Acyl-CoA-binding protein is activated by peroxisome proliferator-activated receptor gamma through an intronic response element functionally conserved between humans and rodents. J Biol Chem. 2002, 277: 26821-26830. 10.1074/jbc.M111295200.PubMedView ArticleGoogle Scholar
- Newell FS, Su H, Tornqvist H, Whitehead JP, Prins JB, Hutley LJ: Characterization of the transcriptional and functional effects of fibroblast growth factor-1 on human preadipocyte differentiation. FASEB J. 2006, 20: 2615-2617. 10.1096/fj.05-5710fje.PubMedView ArticleGoogle Scholar
- Nielsen R, Grontved L, Stunnenberg HG, Mandrup S: Peroxisome proliferator-activated receptor subtype- and cell-type-specific activation of genomic target genes upon adenoviral transgene delivery. Mol Cell Biol. 2006, 26: 5698-5714. 10.1128/MCB.02266-05.PubMed CentralPubMedView ArticleGoogle Scholar
- Li H, Durbin R: Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009, 25: 1754-1760. 10.1093/bioinformatics/btp324.PubMed CentralPubMedView ArticleGoogle Scholar
- Fujita PA, Rhead B, Zweig AS, Hinrichs AS, Karolchik D, Cline MS, et al: The UCSC Genome Browser database: update 2011. Nucleic Acids Res. 2010Google Scholar
- Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, et al: The human genome browser at UCSC. Genome Res. 2002, 12: 996-1006.PubMed CentralPubMedView ArticleGoogle Scholar
- Hsu F, Kent WJ, Clawson H, Kuhn RM, Diekhans M, Haussler D: The UCSC Known Genes. Bioinformatics. 2006, 22: 1036-1046. 10.1093/bioinformatics/btl048.PubMedView ArticleGoogle Scholar
- Marstrand TT, Frellsen J, Moltke I, Thiim M, Valen E, Retelska D, et al: Asap: a framework for over-representation statistics for transcription factor binding sites. PLoS One. 2008, 3: e1623-10.1371/journal.pone.0001623.PubMed CentralPubMedView ArticleGoogle Scholar
- Hintze JL, Nelson RD: Violin plots: A box plot-density trace synergism. American Statistician. 1998, 52: 181-184. 10.2307/2685478.Google Scholar