Transcriptional co-expression and co-regulation of genes coding for components of the oxidative phosphorylation system
© van Waveren and Moraes; licensee BioMed Central Ltd. 2008
Received: 13 June 2007
Accepted: 14 January 2008
Published: 14 January 2008
Mitochondrial oxidative phosphorylation (OXPHOS) is critical for energy (ATP) production in eukaryotic cells. It was previously shown that genes coding for mitochondrial proteins involved in energy production are co-expressed at the RNA level. Because the OXPHOS enzymes are multimeric complexes, we tested the hypothesis that genes coding for components of specific complexes are also co-regulated at the transcriptional level and share common regulatory elements in their promoters.
We observed for the first time that not only are OXPHOS genes co-expressed as a group, but genes within each of the five OXPHOS enzyme complexes are also co-expressed, revealing a higher degree of complexity in gene co-regulation. In silico analysis of homologous promoter sequences in mammals identified the likely core promoter elements for most genes encoding OXPHOS subunits/assembly factors. The results included a significant abundance of previously identified sites (e.g. NRF1, NRF2, ERRA and YY1), as well as several sites that had not been previously detected. Although we identified patterns that correlated with OXPHOS gene expression, we did not detect an OXPHOS complex-specific arrangement of transcription factor binding sites within the core promoter that could explain the tight co-expression of these functionally related genes.
This study mapped the core promoters of most OXPHOS related genes and provided an example of gene expression regulation based on the final protein arrangement within a linear metabolic pathway.
Microarray data have been widely used to study gene co-expression by studying genes with similar expression patterns across a set of samples. Over the past few years, several lines of evidence suggest that co-expressed genes (which display high correlation values (CV) amongst different expression experiments) are likely to encode proteins that participate in the same metabolic pathway, form a common structural complex, or might be regulated by the same mechanism [5, 6].
In eukaryotes, the regulatory mechanisms underlying co-regulation of multiple genes are extremely complex. At the mRNA level, co-expression of OXPHOS genes has become evident from recent transcriptome-wide analyses across and within different species. In humans, large-scale analyses across different tissues (Shyamsundar et al. 2005) and across multiple datasets and conditions have revealed a co-expression cluster significantly enriched in OXPHOS genes. In mouse, co-expression of OXPHOS genes across different tissues has also been described. Lastly, two macroevolutionary studies observed that several biological functional groups were repeatedly identified as co-expressed over large evolutionary distances and a wide variety of conditions. One of these clusters was significantly enriched in OXPHOS genes [10, 11]. Hence, it became apparent that a significant number of genes involved in the energy generation pathway, and in particular OXPHOS, not only share the same metabolic pathway (ATP synthesis) and interact at the protein level, but also share tight co-expression at the mRNA level within and across different conditions and organisms. This suggests that these genes might have a common regulatory mechanism that accounts for this striking pattern of co-expression.
Energetic demands vary substantially between different cells and tissues of an organism. For example, in mammals, the mitochondrial content of cardiac myocytes or brown adipose cells is very high compared with that of type IIb skeletal muscle fibers. In general, the energy demand of a specific tissue correlates with the level of expression of genes encoding components of the OXPHOS system. The mechanisms controlling this nucleo-mitochondrial communication are just starting to emerge.
There are diverse regulatory mechanisms that may underlie co-expression. However, most studies have been devoted to the identification of proteins that regulate the transcription of nuclear-encoded mitochondrial genes as well as factors regulating mitochondrial transcription. Recent evidence points to both transcription factors (TFs) and transcriptional coactivators as important players in the regulation of mitochondrial biogenesis. DNA-binding TFs including nuclear respiratory factors 1 and 2 (NRF1, NRF2), estrogen related receptor alpha (ERRA), Sp1, yin yang 1 (YY1), CREB and E-box factors have all been implicated in the expression of some nuclear-coded respiratory chain subunits, among other genes [13–15]. The transcriptional coactivator peroxisome proliferator-activated receptor gamma coactivator-1 (PGC-1α) and related family members (PGC-1β and PRC) are known to interact with NRF1, NRF2 and ERRA, stimulating the expression of several OXPHOS genes [16–20]. PGC-1α was identified as a coactivator involved in mitochondrial gene regulation during adaptive thermogenesis in brown adipose tissue. PGC-1α is mainly expressed in tissues with high energy demand and high mitochondrial content, such as heart, brain, kidney and brown fat, and is induced by fasting and exposure to cold [21–25]. Overexpression of PGC-1α induces the expression of a large number of genes participating in mitochondrial metabolism, including the TFs ERRA and NRF2 [14, 26]. On the other hand, PGC-1β was shown to regulate the expression of various OXPHOS genes as well as NRF1 and ERRA. Therefore, PGC-1α and PGC-1β could play a role as master regulators of the respiratory capacity of a cell, either by physically interacting with ERRA, NRF1 and NRF2 at the promoters of OXPHOS-related genes or by increasing the expression of these transcription factors. However, while these TFs and coactivators directly regulate OXPHOS genes, it is unclear how they are integrated in response to environmental cues.
The main conclusion we can draw from the above is that the expression of OXPHOS genes can be modulated at the transcriptional level. There is mounting evidence that TFs and coactivators, particularly PGC-1α, are able to cooperatively alter the expression of OXPHOS genes. Although OXPHOS gene expression can be regulated at different levels, transcription is likely to be a major determinant of coordinated expression. Therefore, understanding the complete set of TFs and coactivators that regulate OXPHOS gene expression should yield insights into how this expression is controlled. Although significant progress has been made in this respect, the list of factors regulating the expression of all OXPHOS genes remains incomplete, and how these regulators are integrated under physiological conditions is poorly understood.
In this study, OXPHOS gene co-expression across different tissues in humans and mice was confirmed in an independent large microarray dataset. Detailed analyses showed that subunits of individual complexes co-express preferentially with each other rather than with subunits of other complexes. In addition, phylogenetic footprinting across human, mouse and rat helped to define each OXPHOS gene core promoter, and resulted in a comprehensive list of factors that could participate in their transcriptional control.
Most OXPHOS genes are co-expressed across human and mouse normal tissues
From various genome-wide expression studies, a significant group of OXPHOS genes appears to be co-expressed under various physiological conditions in several species. To identify which particular OXPHOS genes are co-expressed in different conditions/species, we obtained the complete human and mouse microarray datasets from the Genomics Institute of the Novartis Research Foundation (GNF) tissue compendium. Using custom-designed whole-genome gene expression arrays hybridized in duplicate with panels of mRNAs derived from 79 human and 61 mouse tissues and cell types, this compendium evaluates the relative expression of 44,775 human and 36,182 mouse transcripts. Analyzing all transcripts would introduce significant noise into the analysis; the data were therefore subjected to global normalization and filtered for a compilation of mitochondria-related genes that contains all transcripts coding for mitochondrial proteins, as well as transcripts coding for OXPHOS-related factors such as those involved in oxidative stress response, fatty acid biosynthesis and glycolysis (see methods).
In order to study co-expression of OXPHOS genes among all the human and mouse "mitochondrial genelists", an unsupervised hierarchical cluster analysis was performed. This approach was used to visualize relationships among the expression patterns of OXPHOS genes and other groups of genes. Figure 1b shows the result of the hierarchical clustering analysis of the filtered genes using the Pearson correlation similarity measure. Genes are displayed on the horizontal axis and tissues on the vertical axis. The nodes containing genes that are enriched in a particular biological pathway are represented on the right side of each tree. The ordered list of genes based on Figure 1b is given in Additional file 1.
In both mice and humans, the hierarchical tree cluster analysis resulted in a cluster of genes enriched in OXPHOS genes. These results confirm previous observations that a significant number of OXPHOS genes share a common pattern of gene expression across different tissues and this co-expression is conserved across different species. Other gene groups that were significantly co-expressed among different tissues correspond to genes coding for fatty acid biosynthesis pathway, oxidative stress response, mitochondrial translation machinery, mitochondrial chaperone activity, glycolysis and calcium binding. Tissue clustering showed that tissues with high or low expression of OXPHOS genes clustered together. Tissues with known high energy consumption such as heart, skeletal muscle, kidney and liver displayed increased expression of OXPHOS genes compared to tissues with lower energy demand such as skin and lung. This pattern of expression was observed for other mitochondrial genes as well, which suggests that tissues with high energy demand have not only an increase in the expression of energy generating genes but also of other mitochondrial components. These proteins may be necessary to structurally and metabolically sustain increased energy production.
Subunits of each complex tend to have tighter co-expression with subunits of the same complex than with subunits of other complexes
Table: Co-expression of genes belonging to the same OXPHOS complex. Datasets: Mouse GNF, Mouse U74A GNF, Human GNF, Human Stanford. Recovered labels: 30% of genes; all the rest; number of genes; total number of genes.
This analysis showed that although OXPHOS subunits co-express among themselves within and across species, gene members coding for each multimeric complex also co-express preferentially with members of the same complex. This novel observation suggests that there is a higher degree of complexity in OXPHOS gene regulation than had been so far detected.
Promoter Analysis of OXPHOS Genes
From the above study it is apparent that OXPHOS genes co-express at a tissue level. This raises the hypothesis that these genes have a common co-regulatory mechanism that allows co-expression among different tissues. Previous studies suggest that transcriptional initiation is the most common form of regulation of OXPHOS genes. However, a comprehensive study of transcriptional regulation of OXPHOS genes has not been performed until now. Most studies concentrate on the characterization of one or a few OXPHOS gene promoters and are limited to a single species. Therefore, in this study, an exhaustive analysis of all OXPHOS gene promoters in human, mouse and rat was performed. Promoter analysis was performed for all known nuclear-coded OXPHOS structural subunits and assembly factors, as well as 130 randomly selected mitochondrial genes. All available promoter sequences from human, mouse and rat were extracted for all genes analyzed. These species were chosen because their genomes are divergent enough for phylogenetic footprinting analysis to be performed while still displaying high sequence conservation, and because they are the best annotated. An example of phylogenetic footprinting is described in Additional file 3. For this study, core promoter regions were defined as DNA sequences 500 bp upstream and downstream of the transcription start site (TSS). Analyzing sequences extending more than 500 bp from the TSS resulted in a significant amount of noise related to promoters of neighboring genes. In addition, previous studies suggest that TFBS close to the TSS (~300 bp) are more likely to participate in transcription initiation [31, 32]. The complete phylogenetic footprinting alignments of the different OXPHOS polypeptides, highlighting the conserved regions, can be found in additional files (Core Promoter Additional files 4, 5, 6, 7, 8).
Respiration is a vital process in essentially every eukaryotic cell; thus OXPHOS genes are considered housekeeping genes and are expressed in all cell types. In vertebrates, CpG islands are usually associated with housekeeping gene promoters. We tested for the presence of CpG islands in all human OXPHOS subunit promoters. Overall, CpG islands are present in the promoter regions of 83 out of the 101 human OXPHOS genes analyzed. Since CpG dinucleotides are usually lost during the course of evolution, this analysis suggests that there is high evolutionary pressure for sequence conservation of OXPHOS promoter sequences, and that these conserved stretches of DNA probably reflect important transcriptional regulatory elements and possibly nucleosome positioning that would contribute to transcriptional regulation.
A hallmark of several OXPHOS promoters studied until now is the absence of a typical TATA box adjacent to the TSS. These promoters are called TATA-less . The TATA box is recognized by the constitutive transcription factor TBP (TATA box binding protein). Therefore, in order to analyze all OXPHOS promoters for the presence of a TATA box, all homologous promoters were aligned using DiAlignTF (Genomatix) and searched for conserved TBP binding sites within 100 bp of the TSS. Only 23% of OXPHOS promoters have conserved TBP binding sites in this region. These results confirm and extend previous observations that nuclear coded OXPHOS gene promoters are mostly TATA-less.
OXPHOS genes have common TF binding sites in their promoters
Our studies, as well as others previously published, make clear that OXPHOS genes are co-regulated. Therefore, assuming that functionally important TFBS are conserved across evolution, searching for conserved TFBS in OXPHOS promoters should yield important information on the molecular basis of OXPHOS transcriptional regulation.
To search for conserved TFBS, phylogenetic footprinting was performed on human, mouse and rat orthologous promoters for all OXPHOS genes. Using DiAlignTF (Genomatix, DE), orthologous promoter sequences were aligned and short stretches of sequence similarity (typical of TFBS) were searched against a database of known TFBS. The most abundant TFBS families identified belong to TFs previously linked to transcription of some OXPHOS genes: ETSF (Human and murine ETS1 factors), which includes NRF2; SP1F (GC-Box factors SP1/GC); NRF1; EREF (Estrogen response elements), which includes ERRA TFBS; CREB (cAMP-responsive element binding proteins); and YY1F (Activator/repressor binding to transcription initiation site). We also identified TFs known to regulate housekeeping genes, such as EGRF (EGR/nerve growth factor induced protein C & related factors), ZBPF (Zinc binding protein factors) and EBOX (E-box binding factors), among others. Please refer to supplementary material (Additional file 9) for a complete list of results. ETSF belongs to the ETS family of transcription factors and participates in the transcriptional regulation of a myriad of genes. NRF2 is a member of the ETS family; however, to differentiate true NRF2 binding sites in the promoters of OXPHOS genes, NRF2 was also analyzed as an independent TFBS. NRF1 and ERRA, as previously described, have been implicated in the regulation of transcription of various mitochondrial genes, among others. SP1 is a ubiquitous transcription factor expressed in almost every cell. It has been shown to be necessary for the transcription of some TATA-less promoters and is an important component of the eukaryotic cellular transcriptional machinery [35, 36]. In human promoters, SP1 and EBOX transcription factor binding sites are recognized by the transcription factors NF-Y, SP1, and USF, which are known to be constitutive TFs participating in basal gene transcription.
Our promoter analysis identified a high percentage of promoters with these TFBS conserved in OXPHOS genes. Since OXPHOS genes are housekeeping genes, these TFBS might have a role in basal transcription initiation. EGRF belongs to the family of TFs involved in the transcription of immediate-early gene products expressed in response to diverse stimuli. YY1 is also a ubiquitous TF that has been implicated in the transcription of TATA-less promoters and regulates the transcriptional initiation of several mitochondrial genes. The transcriptional repressor ZBPF binds elements found predominantly in genes that participate in lipid metabolism, as well as several genes involved in processes related to energy metabolism and vascular disease. The CREB family of TFBS corresponds to the cAMP response element (CRE), which is recognized by CREB, ATF1, and FOS/JUN and ATF2/JUN heterodimers; these TF families are widely distributed and known to regulate the expression of various unrelated genes. However, OXPHOS genes have been shown to be regulated by CREB under certain physiological conditions [30, 39]. Therefore, an enrichment of this family of TFBS among OXPHOS subunits is not surprising.
NRF1, NRF2, YY1 and ERRA correspond to TFs that have been previously linked to the expression of some OXPHOS genes. To compare the data obtained by the in silico approach used in this study with experimentally verified TFBS, we searched for studies in which functional analysis of these TFBS was performed in OXPHOS promoters. In many cases, less stringent DiAlignTF parameters identified sites that had been shown in the literature to have a functional effect on transcription. For this reason, and only for this comparison, all promoters were analyzed using less stringent parameters ('SEL' mode in Additional file 9). Of 27 experimentally verified sites found in the literature, 23 (85%) could be predicted by our in silico analysis, showing a high degree of concordance between both methods (see Additional file 10). This analysis showed that the in silico approach used in this study is able to identify functional TFBS studied in vivo.
Overall, these results show that phylogenetic footprinting resulted in the detection of several TFBS that are enriched in OXPHOS gene promoters. Some TFBS that displayed a high abundance in OXPHOS gene promoters, like SP1F, did not show an association with OXPHOS gene expression, which suggests that they might participate in non-specialized basal transcriptional initiation of these genes. The vast majority of the OXPHOS promoters analyzed have not been characterized previously. Therefore, these results confirm that the unbiased computational approach chosen does lead to biologically relevant information.
Conserved TFBS Show Positional Bias around the TSS
A fundamental question in a bioinformatics approach to promoter analysis is to determine which of the predicted TFBSs are biologically relevant. In the above study, the presence of a TFBS conserved across human, mouse and rat was taken as a guide to answer this question. In addition, the relative position of each conserved TFBS to the corresponding TSS can also be analyzed. Previous studies have relied on this approach, and found that TFBS that cluster relative to the TSS (i.e. with a positional bias) have a high likelihood of being biologically significant .
Taking a closer look around the TSS, it became apparent that YY1F TFBS are preferentially conserved at the TSS or within about 10 bp of it, suggesting a strong role in transcription initiation of OXPHOS genes (Figure 4b). On the other hand, NRF1, NRF2, ETSF and SP1F TFBSs cluster slightly upstream of the TSS (~50–70 bp upstream of the TSS). Each TFBS clusters in a slightly different position within 150 bp upstream of the TSS. This raises the possibility that these factors function together in transcriptional regulation.
In summary, there seems to be a clear positional bias around the TSS for the most abundant TFBS identified by phylogenetic footprinting. In particular, the known OXPHOS-related TFBS NRF1, NRF2 and YY1F, but not ERRA, all cluster within ~150 bp of the TSS. This suggests that most conserved TFBS identified for OXPHOS promoters are probably transcriptionally functional in human, mouse and rat, and raises the possibility that TFBS identified outside of this region might not be functional in the promoters of the ubiquitously expressed genes studied. These results further support the biological relevance of our in silico approach.
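The positional-bias idea can be illustrated with a small binning routine. This is only a sketch under stated assumptions: the function name `position_histogram`, the 10 bp bin width and the example coordinates are all hypothetical and are not taken from the study. Positions are expressed relative to the TSS, with negative values upstream.

```python
from collections import Counter

def position_histogram(positions, bin_size=10):
    """Count TFBS positions (relative to the TSS, negative = upstream)
    in fixed-width bins. Each bin is labelled by its lower edge, so with
    bin_size=10 the label -60 covers positions -60 to -51."""
    return Counter((p // bin_size) * bin_size for p in positions)

# Hypothetical positions for one TFBS family across several promoters.
example_positions = [-65, -61, -55, -52, 3, -58]
histogram = position_histogram(example_positions)
# The most populated bin indicates where the sites cluster.
peak_bin, peak_count = histogram.most_common(1)[0]
```

A family whose sites pile up in one or two adjacent bins shows the kind of clustering described above, whereas a flat histogram would indicate no positional preference.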
TFBS Orientation on OXPHOS promoters
The coordinate regulation of the expression of gene groups can occur at different levels, such as transcription or translation [41, 42]. In this study, we focused on the coordinated regulation of mRNA levels of genes related to OXPHOS function. Our results clearly show the power of an in silico approach to define promoter regions and to study gene co-expression and co-regulation at the transcriptional level. Combining high-throughput gene expression data with promoter analysis has enabled the detection of sites that are recognized by single TFs or families of TFs which might play an important role in the co-expression of OXPHOS genes. Although previous studies have shown the co-expression and defined TFBS for genes coding for mitochondrial proteins [14, 26], in this study we focused on OXPHOS-related genes.
From our gene expression studies, we were able to confirm co-expression of OXPHOS genes at the mRNA level. OXPHOS genes are known as housekeeping genes, and it would be expected that they are transcribed at similar levels in all cell types. However, there is a clear difference in relative mRNA abundance of a significant number of these genes, mostly related to the energy requirements of each tissue. Although differences in the number, size and protein content of mitochondria from different tissues are well recognized in the field, differences in mRNA levels are a relatively recent discovery. These findings suggest that there must be an intricate mechanism of co-regulation of these genes at the mRNA level.
An unexpected finding was the significant tendency for genes within each OXPHOS complex to co-express with each other rather than with genes of other complexes. This suggests that there is coordinated control of the mRNA levels of genes coding for each complex. It is well established that there is a tendency for genes participating in the same metabolic process, or interacting physically, to co-express. Therefore, these results suggest that, besides a master regulator of OXPHOS gene expression driving their co-expression at a tissue level, there is an underlying fine-tuning mechanism regulating the expression of OXPHOS genes within each complex. In our studies, we did not detect a complex-specific pattern of TFBSs. Although this may be due to the low statistical power of the analyses (few conserved TFBS per gene promoter), it is possible that this "fine tuning" is orchestrated by cis elements relatively distant from the promoters or by mRNA degradation. Future studies will need to be designed to identify the mechanisms responsible for co-expression of genes within OXPHOS complexes.
Previous studies have identified TFBS for NRF1, NRF2, ERRA, and YY1 in the promoters of some OXPHOS subunits [14, 30]. Although only a few OXPHOS genes have previously been characterized for the functional role of these TFBS (Additional file 10), these studies correlate extremely well with the predicted TFBS identified by our in silico approach. In this study, human, mouse and rat promoters of all known OXPHOS and accessory subunits were analyzed, whereas published data usually concentrate on the characterization of one promoter in one species. Therefore, this study constitutes the first comprehensive analysis of all OXPHOS gene promoters, making a good case that an unbiased computational approach yields biologically relevant data.
Interestingly, statistical analysis of the most abundant TFBS identified showed that NRF1, EREF, YY1F and CREB correlate significantly with the OXPHOS gene expression pattern. These factors may act independently or synergistically in the transcriptional regulation of OXPHOS genes. The presence of multiple factors may serve to integrate diverse signals into mitochondrial biogenesis. In the mitochondrial field, these TFs were assumed to be master regulators of mitochondrial gene transcription. The results obtained in this study extend this concept to a broader level. Many of these TFs are not specific for OXPHOS-related genes (e.g. YY1F, CREB) and are also common in the promoters associated with other metabolic networks, but others are more commonly found in OXPHOS-related genes (e.g. NRF1, NRF2). Therefore, although the co-expression of genes coding for mitochondrial proteins can be explained by the presence of TFBS for TFs such as NRF1, NRF2 and ERRA, both the specific combination of TFs and other events controlling mRNA levels (e.g. coactivators, RNA degradation and long-range cis effects) may ultimately explain the co-regulation.
From analyzing the positional bias around the TSS for the factors identified, it became apparent that most of the conserved TFBS identified cluster in specific positions close to the TSS. This suggests that the TFBS identified by phylogenetic footprinting are likely to be biologically functional if present within these clusters.
We showed that the coordinated expression of known OXPHOS genes goes beyond a mitochondrial or even OXPHOS pattern, to the level of individual complexes. This finding implies that either common promoter elements or a feedback mechanism from the assembled complexes influences the levels of mRNAs. Although the identification of the core promoters of 98 OXPHOS genes and their conserved TFBS provided the initial clues to understand this process, further work will be required to fully define the precise mechanisms responsible for this co-expression.
Microarray Data Handling
The human and mouse expression atlas was obtained from GNF [44, 28]. This atlas contains custom-designed whole-genome gene expression arrays from mRNAs derived from 79 human and 61 mouse tissues and cell types performed in duplicates and evaluates the expression of 44,775 human and 36,182 mouse transcripts. Data was analyzed using Genespring microarray analysis software package (Agilent Technologies, Palo Alto, CA). To control for chip-wide variations in intensity, each array was normalized to the 50th percentile of genes. In addition, to control for the differences in detection efficiency between spots, each probeset (the collection of probes designed to interrogate a given gene sequence) was normalized to its median across all chips or arrays. These normalization steps enabled comparisons of relative change in gene expression levels between experiments. Duplicate arrays were then averaged and considered as one.
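The two normalization steps and the averaging of duplicates can be sketched as follows. This is an illustrative approximation, not the GeneSpring implementation: the function names, the data layout (a dictionary mapping each probeset to a list of intensities, one per array) and the toy values are assumptions, and the sketch presumes strictly positive intensities so that no median is zero.

```python
from statistics import median

def normalize(expr):
    """expr: dict mapping probeset -> list of raw intensities (one per array).
    Step 1: divide each array by its own median (50th percentile).
    Step 2: divide each probeset by its median across all arrays."""
    probes = list(expr)
    n_arrays = len(expr[probes[0]])
    array_medians = [median(expr[p][j] for p in probes) for j in range(n_arrays)]
    per_array = {
        p: [expr[p][j] / array_medians[j] for j in range(n_arrays)]
        for p in probes
    }
    normalized = {}
    for p, values in per_array.items():
        probe_median = median(values)
        normalized[p] = [v / probe_median for v in values]
    return normalized

def average_duplicates(values, pairs):
    """Average duplicate arrays; pairs lists the (index, index) of each duplicate."""
    return [(values[a] + values[b]) / 2 for a, b in pairs]

# Toy data (hypothetical): three probesets measured on three arrays.
toy = {"a": [2, 4, 8], "b": [4, 8, 16], "c": [6, 12, 24]}
toy_normalized = normalize(toy)
toy_averaged = average_duplicates([1.0, 3.0, 2.0, 4.0], [(0, 1), (2, 3)])
```

In the toy data every array is a scaled copy of the first, so both steps remove all variation and each normalized value equals 1; real data retain the tissue-specific differences that the clustering relies on.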
Data were filtered for mitochondrial genes. A complete list of human mitochondrial genes was obtained from two online compilations, which contain data obtained by several proteomic approaches in an effort to obtain the complete list of mitochondrial genes. MitoRES provided 808 genes, and MitoProteome, a mitochondrial protein database generated from experimental evidence such as mass spectrometry and from other public databases, provided 869 genes. The two lists were merged, excluding repetitions. For mouse, a complete list of mitochondrial genes was obtained from MitoRES (732 genes). The mitochondrial genes were combined with genes belonging to "mitochondrial related pathways" that were not present in the lists, such as glycolysis, fatty acid biosynthesis and oxidative stress genes. The resulting compilation contains 1290 probesets in the human and 1029 probesets in the mouse array datasets. Of these, 147 represent human and 111 represent mouse OXPHOS genes, which code for both structural and accessory subunits of the respiratory chain. Both human and mouse lists of mitochondrial probesets were used in expression profile analysis and were called the "mitochondrial compilation".
Hierarchical clustering analysis was performed on the "mitochondrial compilation" genelists using Pearson correlation similarity. Genespring software package was used to create a graphical view of the data.
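As a rough illustration of clustering on Pearson similarity, the sketch below performs agglomerative clustering with distance 1 − r. The linkage rule is an assumption: the text only states that Pearson correlation similarity was used, so average linkage is chosen here for illustration and may differ from the GeneSpring setting. Function names and the toy profiles are hypothetical, and profiles must not be constant (zero variance makes r undefined).

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def cluster(profiles):
    """Average-linkage agglomerative clustering with distance 1 - r.
    profiles: dict name -> expression profile.
    Returns the merges in order as (cluster_a, cluster_b, distance)."""
    names = list(profiles)
    dist = {
        (a, b): 1 - pearson(profiles[a], profiles[b])
        for a in names for b in names if a != b
    }

    def linkage(c1, c2):
        # Mean pairwise distance between members of the two clusters.
        return sum(dist[(a, b)] for a in c1 for b in c2) / (len(c1) * len(c2))

    clusters = [(name,) for name in names]
    merges = []
    while len(clusters) > 1:
        candidates = [
            (linkage(clusters[i], clusters[j]), i, j)
            for i in range(len(clusters))
            for j in range(i + 1, len(clusters))
        ]
        d, i, j = min(candidates)
        merges.append((clusters[i], clusters[j], d))
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return merges

# Toy profiles (hypothetical): g1/g2 rise across tissues, g3/g4 fall.
toy_profiles = {
    "g1": [1, 2, 3, 4],
    "g2": [2, 4, 6, 8],
    "g3": [4, 3, 2, 1],
    "g4": [8, 6, 4.5, 2],
}
toy_merges = cluster(toy_profiles)
```

The order of merges defines the tree: genes with similar tissue profiles join first, and anti-correlated groups are joined last.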
For the OXPHOS correlation test (explained below), two more datasets were used: the U74A mouse tissue compendium from GNF and the human compilation of normal tissues available at the Stanford MicroArray Database [49, 50].
OXPHOS correlation test
To investigate whether OXPHOS subunits co-express preferentially with subunits of the same complex rather than with subunits of other complexes, a "correlation similarity" test was designed. For each microarray dataset, a similarity matrix was made. Pearson's correlation was used to measure the pair-wise similarity between the expression profiles of all OXPHOS genes in the matrix (Figure 2). If one gene was represented by more than one probeset, the probeset with the highest average correlation value (CV) was included in the analysis and the rest were discarded. Next, for each OXPHOS gene (in a column), all other genes were rank ordered on the basis of their Pearson correlation value. A cutoff was set at the top 30% of the ranked CVs (Figure 2ii). The percentage of complex X subunits within the cutoff was calculated for each gene. Then, the null hypothesis that 'any gene has the same percentage of the different complexes subunits within the cutoff' was tested using Student's t-test. This analysis was performed for the two mouse and two human tissue atlases mentioned above.
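The ranking step of the correlation test can be sketched as below. This is a simplified illustration under assumptions: it presumes one probeset per gene has already been selected, the function names and toy profiles are hypothetical, the way the 30% cutoff is rounded to a whole number of genes is a choice made here, and the t-test itself is not included.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def same_complex_fraction(profiles, complex_of, cutoff=0.30):
    """For each gene, rank all other genes by Pearson correlation and
    return the fraction of genes in the top `cutoff` share of the ranking
    that belong to the same complex as the query gene."""
    genes = list(profiles)
    fractions = {}
    for gene in genes:
        others = [g for g in genes if g != gene]
        ranked = sorted(
            others,
            key=lambda g: pearson(profiles[gene], profiles[g]),
            reverse=True,
        )
        k = max(1, round(cutoff * len(ranked)))  # rounding rule is an assumption
        top = ranked[:k]
        same = sum(complex_of[g] == complex_of[gene] for g in top)
        fractions[gene] = same / k
    return fractions

# Toy data (hypothetical): four complex I genes and three complex V genes.
toy_profiles = {
    "A1": [1, 2, 3, 4, 5], "A2": [1, 2, 3, 5, 6],
    "A3": [2, 2, 3, 4, 6], "A4": [1, 3, 3, 4, 5],
    "B1": [5, 4, 3, 2, 1], "B2": [6, 4, 3, 2, 1], "B3": [5, 4, 4, 2, 1],
}
toy_complex = {g: ("I" if g.startswith("A") else "V") for g in toy_profiles}
toy_fractions = same_complex_fraction(toy_profiles, toy_complex)
```

Comparing each gene's fraction with the share of same-complex genes among all other genes gives the quantity that the statistical test evaluates.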
Because some complexes have very few nuclear coded genes, the above analysis did not pass our statistical test for complexes with few subunits (complex II, complex III, and complex IV). To bypass this restriction, we grouped the results of the OXPHOS correlation test for all probesets from the four datasets before applying the statistical test.
Promoter Sequence Extraction and Analysis
Genomatix and the UCSC Genome Browser (human genome May 2004 build) were used to extract 1000 bp of DNA sequence around the transcriptional start site (TSS, as annotated by RefSeq) for the human, mouse and rat genomes (500 bp upstream and 500 bp downstream of the TSS). These 1000 bp were considered to contain the core promoters. In cases where more than one promoter was present for a gene in a given genome, the promoters identified by ElDorado (Genomatix) as conserved across the three species were chosen for further analysis. For the sequences of the promoters analyzed in this study, see supplementary data (Additional files 4, 5, 6, 7, 8). In genomes where there was no annotation for a specific gene, the BLAT search tool was used to locate the corresponding EST using an orthologous mRNA sequence for that gene. In most cases, this approach could identify the gene in the queried genome and enabled the promoter extraction. All OXPHOS genes and other arbitrarily selected mitochondrial genes (n = 235) were analyzed by phylogenetic footprinting.
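The window definition (500 bp on each side of the TSS) can be made concrete with a strand-aware helper. This is a sketch, not the Genomatix or UCSC extraction procedure: the function names are hypothetical, coordinates are assumed to be 0-based with the TSS counted as the first downstream base, and the chromosome is passed as a plain string.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def promoter_window(tss, strand, flank=500):
    """Return (start, end), 0-based half-open, covering `flank` bp upstream
    and `flank` bp downstream of the TSS (the TSS is the first downstream base)."""
    if strand == "+":
        return tss - flank, tss + flank
    # On the minus strand, upstream lies at higher genomic coordinates.
    return tss - flank + 1, tss + flank + 1

def extract_promoter(chrom_seq, tss, strand, flank=500):
    """Slice the window and report it in the orientation of transcription."""
    start, end = promoter_window(tss, strand, flank)
    start, end = max(0, start), min(len(chrom_seq), end)
    seq = chrom_seq[start:end]
    if strand == "+":
        return seq
    return seq.translate(COMPLEMENT)[::-1]  # reverse complement

# Toy example (hypothetical sequence) with a 2 bp flank for readability.
toy_chrom = "AAAACCCCGGGGTTTT"
plus_promoter = extract_promoter(toy_chrom, 8, "+", flank=2)
minus_promoter = extract_promoter(toy_chrom, 8, "-", flank=2)
```

Reporting minus-strand promoters as the reverse complement keeps upstream on the left for every gene, which is what allows positions to be compared across promoters.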
The presence of CpG islands was assessed with the UCSC Genome Browser built-in application. The algorithm considers 200 bp segments of DNA and evaluates them for a GC content of 50% or greater. All human promoter sequences were visually inspected in the browser for the presence of CpG islands.
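The GC-content criterion described above can be sketched with a sliding window. This is a simplified illustration, not the UCSC implementation; the function names are hypothetical. The commonly used CpG island definition (Gardiner-Garden and Frommer) also considers the ratio of observed to expected CpG dinucleotides, so a helper for that ratio is included, although the text above only describes the GC-content step.

```python
def gc_fraction(seq):
    """Fraction of G and C bases in a sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)

def cpg_obs_exp(seq):
    """Observed/expected CpG ratio: (#CpG * N) / (#C * #G)."""
    seq = seq.upper()
    c, g = seq.count("C"), seq.count("G")
    if c == 0 or g == 0:
        return 0.0
    return seq.count("CG") * len(seq) / (c * g)

def gc_rich_windows(seq, window=200, min_gc=0.5):
    """0-based start offsets of all windows whose GC content is >= min_gc."""
    return [i for i in range(len(seq) - window + 1)
            if gc_fraction(seq[i:i + window]) >= min_gc]

# Toy sequence: 200 bp of pure GC followed by 200 bp of pure AT.
toy = "GC" * 100 + "AT" * 100
hits = gc_rich_windows(toy)
```

Windows starting in the first 100 positions still contain at least 50% GC, so they pass the threshold; later windows do not.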
Alignment of Orthologous Promoters
Orthologous promoter sequences from human, mouse and rat were aligned using DiAlignTF (Genomatix, Germany). This application aligns orthologous promoters and identifies conserved TFBSs within them. DiAlignTF is a DNA alignment program that constructs alignments from gap-free pairs of similar sequence segments. The program is therefore especially suited to detecting short local similarities, characteristic of short TFBSs, in otherwise completely unrelated sequences. DiAlignTF was used with its default parameters, the Genomatix Matrix Family Library Version 5.0 (January 2005), and the orthologous promoters for each gene. This type of alignment is often called phylogenetic footprinting. For this analysis, 98 OXPHOS orthologous promoters (human, mouse and rat) as well as 134 other gene promoters participating in several mitochondrial pathways were examined.
Because several transcription factors bind similar TFBSs, Genomatix groups these into TFBS families. For example, the NRF2 binding site is similar to those of other ETS family members, and therefore ETSF (ETS family) contains the NRF2 binding site. Since NRF2 has been implicated in the expression of some OXPHOS subunits, promoters were analyzed for the presence of both ETSF (family) and NRF2 (single factor) binding sites. NRF1, on the other hand, has a unique TFBS and therefore is not part of any TFBS family.
Conserved families of TFBSs in all orthologous promoters were retrieved. The presence of at least one copy of a conserved TFBS family in the aligned promoter sequences was examined. The presence of more than one copy of the same TFBS family was not recorded. Binding sites for the transcription factor families V$SF1F, V$RORA and V$ERER (Genomatix nomenclature) are very similar and therefore often overlap. The V$ERER family contains the ERRA binding site; because ERRA has previously been associated with OXPHOS gene transcription, only this family was considered in this study. All alignment results obtained are available in the supplementary material (Additional files 4, 5, 6, 7, 8).
Human, mouse and rat promoters were used to search all OXPHOS promoters for the presence of a TATA box. If a TATA box protein binding site (V$TBPF) was found to be conserved within 100 bp of the TSS of the aligned promoters, it was considered a likely functional TATA box. If no V$TBPF binding site was present, the genes were considered TATA-less.
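The TATA box rule above amounts to a simple distance check, sketched here. The function name and input format are hypothetical: the input is assumed to be a list of positions (in bp, relative to the TSS) of conserved V$TBPF sites for one gene. How conserved sites lying farther than 100 bp from the TSS were treated is not stated in the text; this sketch assumes they do not count as a functional TATA box.

```python
def classify_tata(tbpf_positions, max_distance=100):
    """Label a promoter from the positions (relative to the TSS) of its
    conserved V$TBPF sites. Sites beyond `max_distance` are ignored
    (an assumption; the text does not specify this case)."""
    if any(abs(pos) <= max_distance for pos in tbpf_positions):
        return "TATA"
    return "TATA-less"

# Invented examples: a site 30 bp upstream, no site, and a distant site.
labels = [classify_tata([-30]), classify_tata([]), classify_tata([-350])]
```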
Statistical Analysis of transcription factor binding sites (TFBS)
To test whether TFBSs identified by phylogenetic footprinting are enriched in OXPHOS gene promoters relative to random mitochondrial gene promoters, a statistical test was designed. This test was performed with all promoters analyzed by phylogenetic footprinting using DiAlignTF default parameters (n = 235). An ordered list of all genes was generated using the Pearson correlation of each gene's expression pattern to the average expression pattern of OXPHOS genes (the median of the expression patterns of OXPHOS genes that clustered in the hierarchical gene tree (Figure 2)). This resulted in a similarity-ordered list in which the highest-ranking genes are those whose expression pattern was most similar to that of OXPHOS subunits. Each gene was then annotated for the presence of each TFBS in its promoter (Additional file 11). The data were then subjected to a nonparametric Wilcoxon Rank-Sum test for two independent samples to assess, for each TFBS, whether genes that contain the TFBS tend to rank high on the list (i.e. have an expression pattern similar to OXPHOS genes). A p-value less than 0.05 was considered a significant association. This analysis was performed for the human and the mouse GNF tissue atlas.
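The rank-sum comparison can be sketched as follows, using the normal approximation to the Wilcoxon Rank-Sum statistic. This is an illustrative sketch, not the software used in the study: the function names are hypothetical, the inputs are assumed to be each gene's correlation to the average OXPHOS profile split by presence or absence of a given TFBS, and no tie correction is applied to the variance.

```python
from math import sqrt, erfc

def average_ranks(values):
    """1-based ranks in ascending order; tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def rank_sum_test(with_site, without_site):
    """Return (z, two-sided p) for the rank-sum of `with_site` versus
    `without_site`, via the normal approximation (no tie correction)."""
    n1, n2 = len(with_site), len(without_site)
    ranks = average_ranks(list(with_site) + list(without_site))
    w = sum(ranks[:n1])                       # rank sum of the first group
    mean_w = n1 * (n1 + n2 + 1) / 2
    sd_w = sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    z = (w - mean_w) / sd_w
    return z, erfc(abs(z) / sqrt(2))

# Invented correlations to the average OXPHOS profile.
genes_with_tfbs = [0.90, 0.80, 0.85, 0.70, 0.95]
genes_without_tfbs = [0.10, 0.20, 0.30, 0.15, 0.25]
z, p = rank_sum_test(genes_with_tfbs, genes_without_tfbs)
```

A positive `z` indicates that genes carrying the site tend to have higher correlation values; for small samples an exact test would be preferable to the normal approximation used here.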
Positional and Binding bias of each TFBS
To examine if conserved TFBSs cluster at a specific position from the TSS, the relative position to the TSS of each conserved TFBS identified by phylogenetic footprinting was documented for human, mouse and rat. All OXPHOS promoters and mitochondrial related promoters analyzed by phylogenetic footprinting were included in this analysis.
To study whether there was a bias in TFBS orientation, the relative orientation (with respect to the direction of transcription) of each conserved TFBS analyzed above was determined. A nonparametric Sign test was used to determine whether there was a preferred orientation of TF binding. TFBS orientation depends on the Genomatix library of position weight matrices (PWM).
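An exact two-sided Sign test on orientation counts can be sketched as below. The function name and the example counts are hypothetical; the sketch assumes that, for each TFBS, the number of conserved sites in the same orientation as transcription is compared with the number in the opposite orientation under the null hypothesis that both are equally likely.

```python
from math import comb

def sign_test(n_forward, n_reverse):
    """Exact two-sided Sign test p-value under H0: both orientations
    occur with probability 0.5."""
    n = n_forward + n_reverse
    k = min(n_forward, n_reverse)
    # Probability of a split at least as extreme on one side.
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)

# Invented counts: a balanced split and a strongly biased one.
p_balanced = sign_test(5, 5)
p_biased = sign_test(10, 0)
```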
- COX: Cytochrome c oxidase (a.k.a. Complex IV)
- CREB: cAMP response element binding protein
- CV: Pearson's correlation value
- cyt c: cytochrome c
- ERRA: Estrogen related receptor alpha
- ETC: electron transport chain
- GNF: Genomics Institute of the Novartis Research Foundation
- HIF: hypoxia inducible factor
- MELAS: Mitochondrial myopathy, Encephalopathy, Lactic acidosis and Stroke-like episodes
- NRF1: Nuclear respiratory factor 1
- NRF2: Nuclear respiratory factor 2
- OXPHOS: Oxidative Phosphorylation System
- PGC-1: Peroxisome proliferator-activated receptor gamma coactivator-1
- ROS: reactive oxygen species
- RT-PCR: reverse transcriptase polymerase chain reaction
- TCA cycle: tricarboxylic acid cycle
- TFAM: mitochondrial transcription factor A
- TFBS: Transcription factor binding site
- TSS: transcription start site
- YY1: Yin Yang 1
We are grateful to Dr. Sawsan Khuri for insightful discussions and critical comments on the manuscript. This work was supported by Public Health Service grants NS041777, CA085700 and EY10804.
- Saraste M: Oxidative phosphorylation at the fin de siecle. Science. 1999, 283: 1488-1493. 10.1126/science.283.5407.1488.
- Attardi G, Schatz G: Biogenesis of mitochondria. Annu Rev Cell Biol. 1988, 4: 289-333. 10.1146/annurev.cb.04.110188.001445.
- Grivell LA: Nucleo-mitochondrial interactions in mitochondrial gene expression. Crit Rev Biochem Mol Biol. 1995, 30: 121-164. 10.3109/10409239509085141.
- Poyton RO, McEwen JE: Crosstalk between nuclear and mitochondrial genomes. Annu Rev Biochem. 1996, 65: 563-607. 10.1146/annurev.bi.65.070196.003023.
- Eisen MB, Spellman PT, Brown PO, Botstein D: Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci U S A. 1998, 95: 14863-14868. 10.1073/pnas.95.25.14863.
- Marcotte EM, Pellegrini M, Thompson MJ, Yeates TO, Eisenberg D: A combined algorithm for genome-wide prediction of protein function. Nature. 1999, 402: 83-86. 10.1038/47048.
- Brazhnik P, de la Fuente A, Mendes P: Gene networks: how to put the function in genomics. Trends Biotechnol. 2002, 20: 467-472. 10.1016/S0167-7799(02)02053-X.
- Lee HK, Hsu AK, Sajdak J, Qin J, Pavlidis P: Coexpression analysis of human genes across many microarray data sets. Genome Res. 2004, 14: 1085-1094. 10.1101/gr.1910904.
- Mootha VK, Bunkenborg J, Olsen JV, Hjerrild M, Wisniewski JR, Stahl E, Bolouri MS, Ray HN, Sihag S, Kamal M, Patterson N, Lander ES, Mann M: Integrated analysis of protein composition, tissue diversity, and gene regulation in mouse mitochondria. Cell. 2003, 115: 629-640. 10.1016/S0092-8674(03)00926-7.
- Stuart JM, Segal E, Koller D, Kim SK: A gene-coexpression network for global discovery of conserved genetic modules. Science. 2003, 302: 249-255. 10.1126/science.1087447.
- Bergmann S, Ihmels J, Barkai N: Similarities and differences in genome-wide expression data of six organisms. PLoS Biol. 2004, 2: E9. 10.1371/journal.pbio.0020009.
- Nogueira V, Rigoulet M, Piquet MA, Devin A, Fontaine E, Leverve XM: Mitochondrial respiratory chain adjustment to cellular energy demand. J Biol Chem. 2001, 276: 46104-46110. 10.1074/jbc.M107425200.
- Scarpulla RC: Nuclear activators and coactivators in mammalian mitochondrial biogenesis. Biochim Biophys Acta. 2002, 1576: 1-14.
- Mootha VK, Handschin C, Arlow D, Xie X, St Pierre J, Sihag S, Yang W, Altshuler D, Puigserver P, Patterson N, Willy PJ, Schulman IG, Heyman RA, Lander ES, Spiegelman BM: Erralpha and Gabpa/b specify PGC-1alpha-dependent oxidative phosphorylation gene expression that is altered in diabetic muscle. Proc Natl Acad Sci U S A. 2004, 101: 6570-6575. 10.1073/pnas.0401401101.
- Laganiere J, Tremblay GB, Dufour CR, Giroux S, Rousseau F, Giguere V: A polymorphic autoregulatory hormone response element in the human estrogen-related receptor alpha (ERRalpha) promoter dictates peroxisome proliferator-activated receptor gamma coactivator-1alpha control of ERRalpha expression. J Biol Chem. 2004, 279: 18504-18510. 10.1074/jbc.M313543200.
- Vega RB, Huss JM, Kelly DP: The coactivator PGC-1 cooperates with peroxisome proliferator-activated receptor alpha in transcriptional control of nuclear genes encoding mitochondrial fatty acid oxidation enzymes. Mol Cell Biol. 2000, 20: 1868-1876. 10.1128/MCB.20.5.1868-1876.2000.
- Wu Z, Puigserver P, Andersson U, Zhang C, Adelmant G, Mootha V, Troy A, Cinti S, Lowell B, Scarpulla RC, Spiegelman BM: Mechanisms controlling mitochondrial biogenesis and respiration through the thermogenic coactivator PGC-1. Cell. 1999, 98: 115-124. 10.1016/S0092-8674(00)80611-X.
- Baar K, Wende AR, Jones TE, Marison M, Nolte LA, Chen M, Kelly DP, Holloszy JO: Adaptations of skeletal muscle to exercise: rapid increase in the transcriptional coactivator PGC-1. Faseb J. 2002, 16: 1879-1886. 10.1096/fj.02-0367com.
- Czubryt MP, McAnally J, Fishman GI, Olson EN: Regulation of peroxisome proliferator-activated receptor gamma coactivator 1 alpha (PGC-1 alpha) and mitochondrial function by MEF2 and HDAC5. Proc Natl Acad Sci U S A. 2003, 100: 1711-1716. 10.1073/pnas.0337639100.
- Louet JF, Hayhurst G, Gonzalez FJ, Girard J, Decaux JF: The coactivator PGC-1 is involved in the regulation of the liver carnitine palmitoyltransferase I gene expression by cAMP in combination with HNF4 alpha and cAMP-response element-binding protein (CREB). J Biol Chem. 2002, 277: 37991-38000. 10.1074/jbc.M205087200.
- Puigserver P, Wu Z, Park CW, Graves R, Wright M, Spiegelman BM: A cold-inducible coactivator of nuclear receptors linked to adaptive thermogenesis. Cell. 1998, 92: 829-839. 10.1016/S0092-8674(00)81410-5.
- Yoon JC, Puigserver P, Chen G, Donovan J, Wu Z, Rhee J, Adelmant G, Stafford J, Kahn CR, Granner DK, Newgard CB, Spiegelman BM: Control of hepatic gluconeogenesis through the transcriptional coactivator PGC-1. Nature. 2001, 413: 131-138. 10.1038/35093050.
- Herzig S, Long F, Jhala US, Hedrick S, Quinn R, Bauer A, Rudolph D, Schutz G, Yoon C, Puigserver P, Spiegelman B, Montminy M: CREB regulates hepatic gluconeogenesis through the coactivator PGC-1. Nature. 2001, 413: 179-183. 10.1038/35093131.
- Ichida M, Nemoto S, Finkel T: Identification of a specific molecular repressor of the peroxisome proliferator-activated receptor gamma Coactivator-1 alpha (PGC-1alpha). J Biol Chem. 2002, 277: 50991-50995. 10.1074/jbc.M210262200.
- Schreiber SN, Knutti D, Brogli K, Uhlmann T, Kralli A: The transcriptional coactivator PGC-1 regulates the expression and activity of the orphan nuclear receptor estrogen-related receptor alpha (ERRalpha). J Biol Chem. 2003, 278: 9013-9018. 10.1074/jbc.M212923200.
- Schreiber SN, Emter R, Hock MB, Knutti D, Cardenas J, Podvinec M, Oakeley EJ, Kralli A: The estrogen-related receptor alpha (ERRalpha) functions in PPARgamma coactivator 1alpha (PGC-1alpha)-induced mitochondrial biogenesis. Proc Natl Acad Sci U S A. 2004, 101: 6472-6477. 10.1073/pnas.0308686101.
- Vianna CR, Huntgeburth M, Coppari R, Choi CS, Lin J, Krauss S, Barbatelli G, Tzameli I, Kim YB, Cinti S, Shulman GI, Spiegelman BM, Lowell BB: Hypomorphic mutation of PGC-1beta causes mitochondrial dysfunction and liver insulin resistance. Cell Metab. 2006, 4: 453-464. 10.1016/j.cmet.2006.11.003.
- Su AI, Wiltshire T, Batalov S, Lapp H, Ching KA, Block D, Zhang J, Soden R, Hayakawa M, Kreiman G, Cooke MP, Walker JR, Hogenesch JB: A gene atlas of the mouse and human protein-encoding transcriptomes. Proc Natl Acad Sci U S A. 2004, 101: 6062-6067. 10.1073/pnas.0400782101.
- Mootha VK, Lepage P, Miller K, Bunkenborg J, Reich M, Hjerrild M, Delmonte T, Villeneuve A, Sladek R, Xu F, Mitchell GA, Morin C, Mann M, Hudson TJ, Robinson B, Rioux JD, Lander ES: Identification of a gene causing human cytochrome c oxidase deficiency by integrative genomics. Proc Natl Acad Sci U S A. 2003, 100: 605-610. 10.1073/pnas.242716699.
- Scarpulla RC: Transcriptional activators and coactivators in the nuclear control of mitochondrial function in mammalian cells. Gene. 2002, 286: 81-89. 10.1016/S0378-1119(01)00809-5.
- FitzGerald PC, Shlyakhtenko A, Mir AA, Vinson C: Clustering of DNA sequences in human promoters. Genome Res. 2004, 14: 1562-1574. 10.1101/gr.1953904.
- Cooper SJ, Trinklein ND, Anton ED, Nguyen L, Myers RM: Comprehensive analysis of transcriptional promoter structure and function in 1% of the human genome. Genome Res. 2006, 16: 1-10. 10.1101/gr.4222606.
- Luykx P, Bajic IV, Khuri S: NXSensor web tool for evaluating DNA for nucleosome exclusion sequences and accessibility to binding factors. Nucleic Acids Res. 2006, 34: W560-5. 10.1093/nar/gkl158.
- Hsu T, Trojanowska M, Watson DK: Ets proteins in biological control and cancer. J Cell Biochem. 2004, 91: 896-903. 10.1002/jcb.20012.
- Block KL, Shou Y, Thorton M, Poncz M: The regulated expression of a TATA-less, platelet-specific gene, alphaIIb. Stem Cells. 1996, 14 Suppl 1: 38-47.
- Weis L, Reinberg D: Accurate positioning of RNA polymerase II on a natural TATA-less promoter is independent of TATA-binding-protein-associated factors and initiator-binding proteins. Mol Cell Biol. 1997, 17: 2973-2984.
- Khachigian LM, Collins T: Early growth response factor 1: a pleiotropic mediator of inducible gene expression. J Mol Med. 1998, 76: 613-616. 10.1007/s001090050258.
- Wagner S, Hess MA, Ormonde-Hanson P, Malandro J, Hu H, Chen M, Kehrer R, Frodsham M, Schumacher C, Beluch M, Honer C, Skolnick M, Ballinger D, Bowen BR: A broad role for the zinc finger protein ZNF202 in human lipid metabolism. J Biol Chem. 2000, 275: 15685-15690. 10.1074/jbc.M910152199.
- Arnould T, Vankoningsloo S, Renard P, Houbion A, Ninane N, Demazy C, Remacle J, Raes M: CREB activation induced by mitochondrial dysfunction is a new signaling pathway that impairs cell proliferation. Embo J. 2002, 21: 53-63. 10.1093/emboj/21.1.53.
- Xie X, Lu J, Kulbokas EJ, Golub TR, Mootha V, Lindblad-Toh K, Lander ES, Kellis M: Systematic discovery of regulatory motifs in human promoters and 3' UTRs by comparison of several mammals. Nature. 2005, 434: 338-345. 10.1038/nature03441.
- Enriquez JA, Fernandez-Silva P, Montoya J: Autonomous regulation in mammalian mitochondrial DNA transcription. Biol Chem. 1999, 380: 737-747. 10.1515/BC.1999.094.
- Di Liegro CM, Bellafiore M, Izquierdo JM, Rantanen A, Cuezva JM: 3'-untranslated regions of oxidative phosphorylation mRNAs function in vivo as enhancers of translation. Biochem J. 2000, 352 Pt 1: 109-115. 10.1042/0264-6021:3520109.
- DeRisi JL, Iyer VR, Brown PO: Exploring the metabolic and genetic control of gene expression on a genomic scale. Science. 1997, 278: 680-686. 10.1126/science.278.5338.680.
- GNF: GNF. [http://wombat.gnf.org/index.html]
- MitoRes: MitoRES. [http://www2.ba.itb.cnr.it/MitoNuc/]
- MitoProteome: MitoProteome. [http://www.mitoproteome.org/]
- Taylor SW, Fahy E, Zhang B, Glenn GM, Warnock DE, Wiley S, Murphy AN, Gaucher SP, Capaldi RA, Gibson BW, Ghosh SS: Characterization of the human heart mitochondrial proteome. Nat Biotechnol. 2003, 21: 281-286. 10.1038/nbt793.
- Su AI, Cooke MP, Ching KA, Hakak Y, Walker JR, Wiltshire T, Orth AP, Vega RG, Sapinoso LM, Moqrich A, Patapoutian A, Hampton GM, Schultz PG, Hogenesch JB: Large-scale analysis of the human and mouse transcriptomes. Proc Natl Acad Sci U S A. 2002, 99: 4465-4470. 10.1073/pnas.012025199.
- Shyamsundar R, Kim YH, Higgins JP, Montgomery K, Jorden M, Sethuraman A, van de Rijn M, Botstein D, Brown PO, Pollack JR: A DNA microarray survey of gene expression in normal human tissues. Genome Biol. 2005, 6: R22. 10.1186/gb-2005-6-3-r22.
- Database SMA: Stanford MicroArray Database. [http://genome-www5.stanford.edu/]
- Genomatix: Genomatix. [http://www.genomatix.de]
- browser UCSC: UCSC browser. [http://genome.ucsc.edu/]
- Gardiner-Garden M, Frommer M: CpG islands in vertebrate genomes. J Mol Biol. 1987, 196: 261-282. 10.1016/0022-2836(87)90689-9.
- Johnston SD, Liu X, Zuo F, Eisenbraun TL, Wiley SR, Kraus RJ, Mertz JE: Estrogen-related receptor alpha 1 functionally binds as a monomer to extended half-site sequences including ones contained within estrogen-response elements. Mol Endocrinol. 1997, 11: 342-352. 10.1210/me.11.3.342.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.