Computational study of associations between histone modification and protein-DNA binding in yeast genome by integrating diverse information
© Wang; licensee BioMed Central Ltd. 2011
Received: 8 October 2010
Accepted: 1 April 2011
Published: 1 April 2011
In parallel with the rapid development of high-throughput technologies, in vivo and in vitro experiments for genome-wide identification of protein-DNA interactions have been developed. Nevertheless, a few questions remain in the field, such as how to distinguish true protein-DNA binding (functional binding) from non-specific protein-DNA binding (non-functional binding). Previous research tackled the problem by integrated analysis of multiple available data sources. However, few systematic studies have examined the possible relationships between histone modification and protein-DNA binding. Here this issue was investigated using publicly available histone modification data in yeast.
Two separate histone modification datasets were studied, at both the open reading frame (ORF) and the promoter region of binding targets for 37 yeast transcription factors (TFs). Both results revealed a distinct histone modification pattern between the functional protein-DNA binding sites and non-functional ones for almost half of all TFs tested. This difference is much stronger at the ORF than at the promoter region. In addition, a protein-histone modification interaction pathway can only be inferred from the functional protein binding targets.
Overall, the results suggest that histone modification information can be used to distinguish the functional protein-DNA binding from the non-functional, and that the regulation of various proteins is controlled by the modification of different histone lysines such as the protein-specific histone modification levels.
The binding of transcription factors (TFs) to DNA sequences is an essential step in genome regulation. In parallel with the rapid development of high-throughput methods for measuring genome-wide protein-DNA interactions (e.g., ChIP-chip, ChIP-Seq, DamID, and protein binding microarrays), many state-of-the-art computer programs (e.g., MEME, MatrixREDUCE, and MDScan) have been developed to identify TF binding motifs. Nevertheless, several questions remain in the field, such as how to distinguish true TF-DNA binding (functional TF binding sites) from non-specific TF-DNA binding (non-functional ones). Here the functional TF binding site is defined as the promoter region of a gene that, bound by a TF, is a true regulatory target (e.g., a strong correlation between the inferred TF activity and mRNA expression of a gene that is bound by the TF [8, 9]); the non-functional TF binding site refers to non-specific TF-DNA binding, such as a TF that is bound to the promoter region of a gene but does not regulate the expression of that gene. Finding the true regulatory targets of a TF with present technology is a challenge, which has inspired many researchers over the past several years to seek computational solutions such as integrative modeling of mRNA expression data and ChIP-chip data, biophysical modeling of orthologous promoter sequences, prediction of the functionality of protein-DNA interactions, and distinguishing direct from indirect TF-DNA interactions by integrating diverse information.
Although some of the previous studies considered the effect of nucleosomes on TF-DNA interactions (e.g., nucleosome occupancy affects transcription by decreasing the accessibility of DNA to protein binding), most of them ignored an important aspect that is also closely associated with functional TF binding: changes in chromatin structure are affected by histone modifications such as methylation and acetylation [14, 15]. In a few recent papers [9, 16], the effect of histone modifications on protein-DNA interactions was emphasized. In particular, several bioinformatics studies revealed the importance of considering histone modification information in computational algorithms for identifying new regulatory elements and predicting promoters and enhancers in the human and mouse genomes [18, 19]. However, no conclusive remarks were made about the associations between histone modification and functional TF binding. This may be due to the ongoing debate on models of the functions of histone modification. Currently, three major models have been proposed to explain the role of histone modification in genome regulation: 1) charge neutralization, by which histone modification can relax chromatin structure by neutralizing positive charges on the histones; 2) histone code, by which combinatorial histone modifications can regulate downstream gene functions; and 3) signaling pathway [22, 23], by which multiple histone modifications can provide bi-stability and robustness through feedback loops. Motivated by this unsolved question, a systematic study of associations between TF-DNA binding and histone modification in yeast was carried out by integrative analysis of diverse datasets [8, 9, 24–27].
Pre-processing of datasets
ChIP-chip experimental data for 203 yeast TFs in rich medium conditions were obtained from the work of Harbison et al. Yeast nucleosome occupancy under normal conditions was taken from Lee et al. The histone acetylation dataset was from Kurdistani et al.; the dataset contained acetylation levels on 11 histone lysines in both yeast promoters and open reading frames (ORFs) (H2AK7; H2BK11 and 16; H3K9, 14, 18, 23, and 27; H4K8, 12, and 16). Because the measured histone modifications in any given promoter are affected by the degree to which that region is occupied by nucleosomes, the 11 acetylation levels were normalized by the nucleosome occupancy (H3 and H4) measured by Lee et al. More specifically, the average of the H3 and H4 histone levels was computed for each probe, and then the histone acetylation level of that probe was divided by the corresponding mean nucleosome occupancy. Additionally, histone modification data from Pokholok et al. were used, which included acetylation levels on three histone lysines (H4; H3K9 and 14), methylation levels on five histone lysines (H3K4me1, 4me2, 4me3; H3K36me3; H3K79me3), nucleosome occupancy (H3 and H4), and histone acetyltransferase (ESA1 and GCN5) occupancy data under normal conditions. Here the histone modification signals were also normalized by the local nucleosome occupancy, as described for the previous dataset. Because the arrays differ in genome-wide coverage (e.g., the data from Kurdistani et al. contain only ~1580 promoters and ~2384 ORFs, whereas the high-resolution microarray data from Pokholok et al. include ~5522 ORFs and ~5504 promoters), the two histone modification datasets were analyzed separately. All datasets were transformed to Z-scores before further data analysis was performed.
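The occupancy normalization and Z-score transformation described above can be sketched as follows. This is a minimal illustration rather than the original pipeline: the function names, the toy probe values, the assumption that signals are positive (non-log) ratios, and the use of the population standard deviation are all assumptions made for the example.

```python
from statistics import mean, pstdev


def normalize_by_occupancy(acetylation, h3, h4):
    """Divide each probe's acetylation signal by the mean of its H3 and H4 occupancy.

    Assumes positive (non-log) occupancy values so that the division is meaningful.
    """
    normalized = []
    for ac, occ3, occ4 in zip(acetylation, h3, h4):
        occupancy = (occ3 + occ4) / 2.0  # mean nucleosome occupancy of the probe
        normalized.append(ac / occupancy)
    return normalized


def z_scores(values):
    """Transform values to Z-scores (population standard deviation; an assumption)."""
    mu = mean(values)
    sigma = pstdev(values)
    return [(v - mu) / sigma for v in values]


# Hypothetical signals for four probes (illustrative numbers only).
acetylation = [2.0, 3.0, 1.0, 4.0]
h3 = [1.0, 1.5, 0.5, 2.0]
h4 = [1.0, 0.5, 0.5, 2.0]

normalized = normalize_by_occupancy(acetylation, h3, h4)
z = z_scores(normalized)
```

In this toy case the third and fourth probes differ in raw acetylation signal only because they differ in nucleosome occupancy, so their normalized values coincide.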
Gene assignment, putative functional binding target and data analysis
Based on the original gene annotation tables from [25–27], an in-house Perl script was used to map nucleosome occupancy and histone modification levels to each gene and its corresponding promoter region; if multiple probes were assigned to the same gene or promoter region, their mean value was used. Information on computationally inferred functional and non-functional TF binding sites for 37 yeast TFs under normal conditions was taken from the publication by Gao et al. TFs with fewer than five probes overlapping between the binding data and the histone modification data were excluded. To examine possible correlations between histone modifications and transcription factor binding, a two-tailed t-test was used to quantify the difference in mean TF binding and in mean histone modification for both the functional binding probes (bind and couple) and the non-functional ones (bind but not couple). In general, the t-test was used to score the difference between the average TF binding affinity (or histone modification level) of a predefined group of probes (e.g., functional binding probes) and that of all other probes on the array. Subsequently, the t-values were clustered and visualized in a color-coded heat map to uncover TF binding (or histone modification) enriched in the probed regions forming a given group. The same procedure was successfully applied in a number of earlier studies [28, 31]. Finally, to evaluate the robustness of the t-test, the rank-sum test was applied to the same datasets, and the log10-transformed p-values were displayed in the heat map.
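The group-versus-rest scoring described above can be illustrated with a short sketch. The text does not state which variant of the t-test was used, so Welch's unequal-variance statistic is assumed here; the function names, probe identifiers, and signal values are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(group, rest):
    """Welch's t-statistic for the difference in means (group minus rest).

    The choice of Welch's variant is an assumption made for this sketch.
    """
    standard_error = sqrt(variance(group) / len(group) + variance(rest) / len(rest))
    return (mean(group) - mean(rest)) / standard_error


def score_group(signal_by_probe, group_probes):
    """Score a predefined probe group against all other probes on the array."""
    group = [v for p, v in signal_by_probe.items() if p in group_probes]
    rest = [v for p, v in signal_by_probe.items() if p not in group_probes]
    return welch_t(group, rest)


# Hypothetical Z-scored histone modification levels for seven probes.
signal = {"p1": 2.0, "p2": 2.5, "p3": 3.0, "p4": 0.0, "p5": 0.5, "p6": 1.0, "p7": -0.5}
functional_probes = {"p1", "p2", "p3"}  # illustrative group of functional binding probes

t_value = score_group(signal, functional_probes)
```

A positive t-value indicates that the modification level is higher in the selected group than on the rest of the array; one such value per TF and per modification would fill one cell of the heat map.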
Protein-histone modification interaction networks
In order to investigate possible correlations between histone modification at the ORF and TF binding to the corresponding promoter, a computational strategy was used to build a protein-histone modification interaction network: 1) for the binding targets of each of 32 TFs, enrichment of proteins (203 yeast TFs in total) binding to the promoter was tested by performing a two-tailed t-test for the selected functional (or non-functional) binding sites versus the rest of the binding sites in the yeast genome; 2) then, for the binding targets of the same 32 TFs, the same t-tests were used to evaluate the histone modification changes (8 histone modifications in total) at the corresponding ORF; 3) subsequently, the t-values from the previous tests were combined; more specifically, the histone modifications at the ORF of functional (or non-functional) binding sites were combined with the enrichment of TF binding at the corresponding promoter; 4) in each of the two newly compiled datasets, one for functional binding sites and the other for non-functional ones, proteins (203 TFs and 8 histone modifications) were grouped into 18 clusters by using a published computational approach that combines the stress function, the neural gas algorithm, and the K-nearest neighbour method, where the number of protein clusters was automatically estimated by the stress function; 5) finally, Gaussian Graphical Models [33, 34] were applied to the centers of the 18 clusters to infer the protein-histone modification interaction network. In the predicted network, the nodes represent the 18 protein clusters and the edges indicate associations between pairs of nodes, where the strength of an interaction is given by the partial correlation coefficient. For every node that is connected to the network, its representative proteins are labeled.
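Edge weights in a Gaussian Graphical Model are partial correlations, that is, the correlation between two nodes after the linear effect of the other nodes is removed. The sketch below illustrates the idea for three variables only, where the first-order formula coincides with the full partial correlation; it is not the network inference procedure used in the study, and the function names and toy profiles are assumptions.

```python
from math import sqrt
from statistics import mean


def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def partial_corr(x, y, z):
    """Correlation of x and y after removing the linear effect of z."""
    rxy, rxz, ryz = pearson(x, y), pearson(x, z), pearson(y, z)
    return (rxy - rxz * ryz) / sqrt((1 - rxz ** 2) * (1 - ryz ** 2))


# Hypothetical cluster-center profiles: x and y both follow z closely.
z = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
x = [1.5, 1.5, 3.5, 3.5, 5.5, 5.5]
y = [1.5, 1.5, 2.5, 4.5, 5.5, 5.5]

raw = pearson(x, y)              # high, because both profiles track z
direct = partial_corr(x, y, z)   # much lower once z is accounted for
```

The contrast between the raw and the partial correlation is the reason an edge in such a network is read as a direct association rather than one mediated by a third node.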
Bayesian Neural Networks
where H represents the model hypothesis space such as network structure and regularization, M is a probability framework of the objective function described in equation (1), and Z_M is a normalization factor. By using a Gaussian approximation to the posterior probability, we minimize objective function (1) and determine the re-estimation formulas for the hyperparameter α according to the weight assumptions E_w. A detailed description of the computational implementation of Bayesian Neural Networks using the Gaussian approximation for the posterior distribution is available in previous publications [35–37].
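For orientation, the quantities named above have a standard form in the Gaussian-approximation (evidence) framework for Bayesian neural networks. The expressions below give that textbook form and are stated as an assumption, not as the paper's own equations; the data-error term E_D, its hyperparameter β, the most probable weights w_MP, and the eigenvalues λ_i (of the Hessian of βE_D at w_MP) are symbols introduced here for illustration.

```latex
\begin{align*}
M(\mathbf{w}) &= \beta E_D(\mathbf{w}) + \alpha E_w(\mathbf{w}),
  \qquad E_w(\mathbf{w}) = \tfrac{1}{2}\sum_i w_i^2, \\
P(\mathbf{w} \mid D, \alpha, \beta, H) &= \frac{\exp\{-M(\mathbf{w})\}}{Z_M(\alpha, \beta)}, \\
\alpha^{\mathrm{new}} &= \frac{\gamma}{2E_w(\mathbf{w}_{\mathrm{MP}})},
  \qquad \gamma = \sum_i \frac{\lambda_i}{\lambda_i + \alpha}.
\end{align*}
```

Under these assumptions, minimizing M(w) yields the most probable weights, and γ counts the effective number of well-determined parameters used to re-estimate α.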
Using Bayesian Neural Networks to find functional protein-DNA binding
As already discussed in the previous section, the Bayesian Neural Network is a supervised non-linear model, which has several advantages when applied to classification tasks: 1) the computational algorithm is robust, 2) it can learn from the data without any prior assumptions, and 3) its non-linearity allows it to model complex real-world relationships. Thus, Bayesian Neural Networks were used to classify functional and non-functional binding sites based on histone modification levels at the ORFs. First, we trained a classifier on the training data for each TF via Bayesian Neural Networks (one hidden layer with two hidden neurons); then the trained classifier was applied to independent test data, and the percentage and total number of correct classifications were recorded. To avoid the bias that may be introduced by the selection of training and test data, we randomly divided the available binding sites in half into a training set and a test set. The random splitting was repeated 10 times for each TF, and the reported classification accuracy is the mean percentage of correct classifications (MPCC) over the 10 randomly selected test datasets. A corresponding 10-fold cross validation was also performed.
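The evaluation protocol (repeated random half splits, with accuracy averaged into the MPCC) can be sketched as below. To keep the example self-contained, a nearest-centroid rule stands in for the classifier; it is explicitly not a Bayesian Neural Network. The function names, the seed, and the toy feature vectors are assumptions made for illustration.

```python
import random
from statistics import mean


def nearest_centroid_fit(X, y):
    """Stand-in classifier (NOT a Bayesian neural network): one centroid per class."""
    centroids = {}
    for label in set(y):
        rows = [x for x, lab in zip(X, y) if lab == label]
        centroids[label] = [mean(col) for col in zip(*rows)]
    return centroids


def nearest_centroid_predict(centroids, x):
    """Return the label whose centroid is closest to x (squared Euclidean distance)."""
    def dist2(c):
        return sum((a - b) ** 2 for a, b in zip(x, c))
    return min(centroids, key=lambda label: dist2(centroids[label]))


def mpcc(X, y, n_repeats=10, seed=0):
    """Mean percentage of correct classifications over repeated random half splits."""
    rng = random.Random(seed)
    indices = list(range(len(X)))
    accuracies = []
    for _ in range(n_repeats):
        rng.shuffle(indices)
        half = len(indices) // 2
        train, test = indices[:half], indices[half:]
        model = nearest_centroid_fit([X[i] for i in train], [y[i] for i in train])
        correct = sum(nearest_centroid_predict(model, X[i]) == y[i] for i in test)
        accuracies.append(100.0 * correct / len(test))
    return mean(accuracies)


# Hypothetical ORF histone modification features: 1 = functional, 0 = non-functional.
X = [[2.0 + 0.1 * i, 2.0 - 0.1 * i] for i in range(6)] + \
    [[0.0 + 0.1 * i, 0.0 - 0.1 * i] for i in range(6)]
y = [1] * 6 + [0] * 6

score = mpcc(X, y)
```

Fixing the seed makes the ten splits reproducible, so that accuracies can be compared across TFs on the same partitions.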
Histone acetylation (Kurdistani et al.)
Among the non-functional binding sites in Figure 1, there is an almost constant acetylation level across all histone lysines, which may be either high or low; for the functional binding sites, however, the equilibrium of the histone acetylation levels on the different lysines is broken, which results in a TF-specific perturbation of acetylation levels. For example, the functional binding sites of FHL1 (a transcriptional activator) show very high acetylation levels on H3K9, 14, 18, 23, and 27, but relatively little change in acetylation on H2BK11 and 16; on the other hand, the functional binding sites of NRG1 (a transcriptional repressor) display high acetylation levels on H4K8 and 12 but low acetylation levels on H3K9, 14, 18, and 23, and on H2BK11 and 16. From the literature, we know that the effect of histone acetylation is dependent on the specific histone lysines that may initiate different downstream functions, such as the binding of additional histone acetyltransferases (HATs), modification of the chromatin structure, and recruitment of a particular transcription factor or nucleosome remodeling complex.
Nevertheless, Figure 1 also shows that the discriminative power of histone modifications is much less clear for other TFs (e.g., NDD1, MCM1, FKH2, ACE2, YAP1, and SUM1). Of these six TFs, the first four are related to the yeast cell cycle and the other two are usually not functional under growth conditions. This suggests that for certain TFs, we need to consider more diverse histone modification information, such as histone deacetylation and methylation levels, in order to distinguish the functional binding sites from the non-functional ones. Taken together, the results indicate that TF-specific histone acetylation at yeast ORFs might be used as a biomarker of functional protein binding.
Histone modification (Pokholok et al.)
In Figure 2, acetylation levels on histone lysines for both FHL1 and NRG1 show a trend similar to that observed in Figure 1. For these two TFs, a similar variation in three methylation levels on the histone lysines (H3K4me3, 36me3, and 79me3) was also observed; these three methylation marks are involved in the activation of transcription, as acetylation of histone lysines usually is. Interestingly, for both cell cycle related TFs (e.g., MCM1, FKH2, NDD1, and ACE2) and TFs that are not functional under growth conditions (e.g., YAP1 and SUM1), the discriminative power of histone modifications in Figure 2 is much stronger than that in Figure 1. This may be due to the histone acetyltransferase occupancy and methylation levels (e.g., H3K79me3, H3K36me3, H3K4me3, and H3K4me2) included in Figure 2. Thus, the new results support our previous hypothesis from Figure 1: the functional regulation of different TFs is controlled by histone modifications on different lysines; the difference in histone modifications is much stronger at the ORF than at the promoter; and, in particular, the more diverse the histone modification information, the stronger its discriminative power.
A protein-histone modification interaction network (Pokholok et al.)
In Figure 3 (functional TF binding), several interesting correlations were found between protein binding to the promoter and histone modification at the corresponding ORF: for example, 1) at the center of the network, two clusters (clusters 13 and 15 with one protein in each, FHL1 and RAP1, respectively) are strongly connected to each other (partial correlation coefficient of 0.91), while also being associated with three other clusters (clusters 1, 3, and 7) that contain histone H3 and histone acetyltransferases; 2) cluster 15 (RAP1) is negatively correlated (partial correlation coefficient of -0.41) with cluster 3 (histone H3), but cluster 13 (FHL1) is positively associated with both cluster 3 and cluster 14 (histone H4 and H3K4me1). Additionally, much histone crosstalk was observed: for instance, 1) cluster 14 (H4 and H3K4me1) is negatively correlated with cluster 7 (histone acetyltransferases ESA1 and GCN5; H3K4me3 and H3K36me3; H3K9ac and H3K14ac), but cluster 13 (FHL1) is positively associated with the same cluster; 2) cluster 7 is also correlated with cluster 6, where we found both histone methylation and histone acetylation (e.g., H3K4me2, H3K79me3, H4ac); 3) cluster 6 is connected with cluster 2, where three of five proteins belong to a chromatin remodeling complex (e.g., HIR1, HIR2, HIR3) that contributes to both nucleosome formation and regulation of histone gene transcription. In summary, the inferred network reveals a number of interesting findings, such as evidence for histone crosstalk, data suggesting that different proteins are affected by different histone modifications, and data supporting that histone modifications are negatively correlated with nucleosome density, while being positively associated with both the chromatin remodeling complex and the binding of FHL1 and RAP1 to the promoter.
In Additional file 1, Figure S4 (non-functional TF binding), all histone modifications plus the nucleosome (H3 and H4) and HAT (ESA1 and GCN5) occupancies are grouped in the same cluster (cluster 7). In other words, there is no difference in histone modification changes across the 32 yeast TFs when a TF binds to DNA but does not function. In particular, many interesting protein-histone modification interactions in Figure 3 are not present here: for example, cluster 7 neither directly interacts with the chromatin remodeling complex nor is it associated with the binding of RAP1 and FHL1 to the promoter, although the two proteins are still highly connected to each other (clusters 8 and 15). This suggests that the majority of protein-histone modification interactions will disappear if the protein binds to the promoter region of a gene but does not regulate the gene expression.
Classification of functional and non-functional binding sites by using measured histone modifications at the ORFs
Table 1. Mean percentage of correct classifications and mean total number of correct classifications for 32 yeast TFs. Columns: mean percentage of correct classifications in 10 test datasets; mean total number of correct classifications in 10 test datasets.
For the first dataset, Additional file 1 Table S1a shows the MPCC of 10 randomly selected test datasets, with 5 TFs (~24%) showing a good prediction rate on the test set (MPCC ≥ 70%) but 13 TFs (~62%) classified poorly (MPCC < 60%). Among the poorly classified TFs, ~69% (9 TFs) are associated with the yeast cell cycle. For the second dataset (Table 1), a clear improvement in classification accuracy is observed: 14 of the 32 TFs (~44%) had MPCC ≥ 70%, and the trained classifier performed poorly on only 4 TFs (~13%; MPCC < 60%). Here ~67% of the TFs with lower classification accuracies (18 TFs with MPCC < 70%) are TFs related to the yeast cell cycle. In brief, for the first training data that contains only histone acetylation information, good classification accuracy was achieved for around one third of all TFs tested (e.g., Table S1a); however, for the second dataset that includes both histone methylation and histone acetylation features, almost half of all TFs tested were well classified by histone modifications (e.g., Table 1). Additional file 1 Figures S5a and S6a show the mean confusion matrix and the mean classification performance (prediction compared with true target), respectively, of 10 randomly selected test datasets [26, 27].
Discussion and Conclusions
Two histone modification datasets [26, 27] were investigated here. Both results confirm that there is a distinct pattern of histone modifications between functional TF binding sites and non-functional ones for almost half of all TFs tested (Figures 1 and 2, respectively). For example, 1) for the functional TF binding sites, different TFs modify acetylation (methylation) levels on different histone lysines; 2) for the non-functional TF binding sites, the acetylation (methylation) levels on different histone lysines are almost constant; 3) the difference in histone modifications between the functional TF binding sites and the non-functional ones is stronger at the ORF region than at the promoters, which also becomes clear when we directly compare the mean histone modification changes between the two groups (Additional file 1, Figure S7); and 4) a protein-histone modification interaction network can only be inferred from the functional protein binding targets. In summary, both the histone crosstalk and protein-histone modification interactions may play important roles in functional TF binding, since many of them disappear under non-functional conditions.
In particular, the discriminative power of histone modifications is much greater at ORFs than at the promoter. This finding is backed by several lines of evidence in the literature. First, in yeast, the methylation levels on histone lysines are either positively or negatively correlated with transcription rates, and the main peaks of enrichment for methylation are often within the ORFs (e.g., H3K4me1, 4me2, 4me3; H3K36me3 and H3K79me3) [23, 41]. Second, although acetylation at many sites correlates with transcription rate, some acetylation marks (e.g., H4K16ac, H4K8ac, H2BK11ac, and H2BK16ac) at yeast intergenic regions do not correlate well with transcription. Third, in different human cell types, histone modification levels and gene expression are very well correlated, and the main peaks of enrichment for those important modifications are within the ORFs (e.g., H3K4me3, H3K79me1, H4K20me1, and H3K27ac). Finally, in both the yeast and fruit fly genomes, experimental observations have shown that the enrichment of H3K36me3 levels at the ORFs can be used to distinguish different chromatin types [13, 43]. Thus, the high levels of ORF enrichment for histone modification, especially the methylation levels, could potentially reflect the activity of protein-DNA binding in the promoter region. In general, all the above-mentioned molecular mechanisms support the hypothesis that ORF histone modification data are better associated with TF binding at the promoter than promoter histone modification data are; however, further investigation is still needed to determine and verify the underlying mechanism.
In addition, Bayesian Neural Networks were used to train a classifier on the training histone data, and the trained classifier was then applied to an independent set of histone data in order to predict the functional TF binding sites. The results are encouraging (Table 1) because almost half of the tested TFs could reach a prediction accuracy of ~70%, although only eight histone modifications were considered in the training set. In Table 1, especially among the top 5 ranked TFs, we observed TF-specific histone modification at the ORFs of functional binding sites (Figure 4), which suggests that the functional regulation of different TFs is controlled by histone modifications on different lysines (Figure 2). However, most of the currently examined histone modifications are associated with transcriptional activation, and there is no information about the histone modifications associated with transcriptional silencing or repression (e.g., histone deacetylation and the H3K27me and H4K20me methylation marks [23, 46]) in the training data. Therefore, the lack of information on certain histone modifications may cause the poor prediction rate for some TFs. For instance, in Table 1, ~67% of TFs with MPCC < 70% are TFs related to the yeast cell cycle; the cell cycle TFs are often associated with Rpd3 target genes, and the Rpd3 protein belongs to the yeast histone deacetylases (HDACs) that may play an important role in yeast cell cycle regulation; after excluding the cell cycle TFs from Figures 1 and 2, clustering analysis was performed again, but the clustering patterns were not dramatically changed (Additional file 1, Figures S8 and S9, respectively). Thus, the results suggest that if the training data include more post-translational modifications of the histones (e.g., the above-mentioned HDACs, H3K27me, and H4K20me, as well as phosphorylation and ubiquitylation), then the trained classifier may achieve a better prediction accuracy on the test data.
Below is a brief description of the possible protein-histone modification interaction network from Figure 3. First, for correlations between protein binding to the promoter and histone modification at the corresponding ORF, cluster 15 (RAP1) has a strong negative association with cluster 3 (histone H3, etc.), but cluster 13 (FHL1) has a strong positive interaction with the same cluster, and cluster 14 (histone H4, H3K4me1, etc.) and cluster 3 are positively correlated with each other. These interactions are consistent with the literature: for example, RAP1 (a general transcription factor) opens chromatin to facilitate binding by other TFs such as GCN4, and then the bound TF recruits HATs (e.g., GCN5 and ESA1), resulting in histone acetylation; FHL1 has been thought to interact with the histone acetylase ESA1 and to activate transcription of proteins; by searching the BioGRID database, a direct protein-protein interaction between RAP1 and FHL1 was found, as well as interactions between RAP1 and nine other proteins associated with HATs (e.g., SAS4, SAS5, RTT109; for detailed information, please refer to Additional file 6), although FHL1 only interacted with one HAT (EAF6). Additionally, both FHL1 and RAP1 are known to actively participate in modifying chromatin structure [48, 51] and regulating acetylation/methylation levels on histone lysines [27, 49, 52]. Thus, current literature supports the view that RAP1 and FHL1 complement each other to control the chromatin-opening and HAT recruitment activities (a bi-stability of chromatin state).
Second, cluster 13 (FHL1) is positively correlated with cluster 7, but cluster 14 (histone occupancy) is negatively associated with the same cluster. In cluster 7, both HATs (GCN5 and ESA1) and histone modifications for active genes (e.g., H3K4me3, H3K36me3, H3K9ac, and H3K14ac) are found. Therefore, the gene transcription rate is negatively correlated with the nucleosome density but positively associated with the binding of FHL1 (which may recruit HATs) to the promoter region.
Third, for histone crosstalk, cluster 6 is only connected with cluster 7 and cluster 2. In cluster 6, there are three histone modifications (H4ac, H3K4me2, H3K79me3), which are less associated with the transcription rate than H3K9ac and H3K14ac. In cluster 2, three of five proteins are subunits of a HIR complex (e.g., HIR1, HIR2, HIR3), a nucleosome assembly complex that contributes to nucleosome formation. Based on the BioGRID database, HIR complexes directly interact with at least 25 proteins (e.g., IES3, ASF1, ARP8, SNF5, SWI3) that are involved in chromatin remodeling, and 9 other proteins that are involved in histone modification (e.g., LEO1, ESA1, SAS2; for detailed information please refer to Additional file 6). Thus, at the end of a protein-histone modification interaction network, proteins involved in chromatin remodeling, such as ATP-dependent chromatin remodelers, may play a key role in generating new histone modifications, e.g., modification of the chromatin structure that may influence gene activity either positively or negatively.
I thank Prof. Harmen Bussemaker for introducing me to the project, Prof. Ben Davidson for critical reading of the manuscript, and three referees for their constructive comments, which helped improve the paper. The publication cost of the paper is supported by the Inger and John Fredriksen Foundation for Ovarian Cancer Research. Access to high performance computing resources at the University of Oslo is supported by the Norwegian Cancer Society (419666-107277-PR-2007-0065) and the NOTUR project (nn4605k).
- Hanlon SE, Lieb JD: Progress and challenges in profiling the dynamics of chromatin and transcription factor binding with DNA microarrays. Current opinion in genetics & development. 2004, 14 (6): 697-705.View ArticleGoogle Scholar
- Schones DE, Zhao K: Genome-wide approaches to studying chromatin modifications. Nature reviews. 2008, 9 (3): 179-191. 10.1038/nrg2270.View ArticlePubMedGoogle Scholar
- van Steensel B, Henikoff S: Identification of in vivo DNA targets of chromatin proteins using tethered dam methyltransferase. Nature biotechnology. 2000, 18 (4): 424-428. 10.1038/74487.View ArticlePubMedGoogle Scholar
- Bulyk ML, Huang X, Choo Y, Church GM: Exploring the DNA-binding specificities of zinc fingers with DNA microarrays. Proceedings of the National Academy of Sciences of the United States of America. 2001, 98 (13): 7158-7163. 10.1073/pnas.111163698.View ArticlePubMedPubMed CentralGoogle Scholar
- Bailey TL: Discovering novel sequence motifs with MEME. Current protocols in bioinformatics/editoral board, Andreas D Baxevanis [et al. 2002, Chapter 2: Unit 2 4-Google Scholar
- Foat BC, Morozov AV, Bussemaker HJ: Statistical mechanical modeling of genome-wide transcription factor occupancy data by MatrixREDUCE. Bioinformatics (Oxford, England). 2006, 22 (14): e141-149. 10.1093/bioinformatics/btl223.View ArticleGoogle Scholar
- Liu XS, Brutlag DL, Liu JS: An algorithm for finding protein-DNA binding sites with applications to chromatin-immunoprecipitation microarray experiments. Nature biotechnology. 2002, 20 (8): 835-839.View ArticlePubMedGoogle Scholar
- Gao F, Foat BC, Bussemaker HJ: Defining transcriptional networks through integrative modeling of mRNA expression and transcription factor binding data. BMC bioinformatics. 2004, 5: 31-10.1186/1471-2105-5-31.View ArticlePubMedPubMed CentralGoogle Scholar
- Ucar D, Beyer A, Parthasarathy S, Workman CT: Predicting functionality of protein-DNA interactions by integrating diverse evidence. Bioinformatics (Oxford, England). 2009, 25 (12): i137-144. 10.1093/bioinformatics/btp213.View ArticleGoogle Scholar
- Wang J: Computational biology of genome expression and regulation--a review of microarray bioinformatics. J Environ Pathol Toxicol Oncol. 2008, 27 (3): 157-179.View ArticlePubMedGoogle Scholar
- Ward LD, Bussemaker HJ: Predicting functional transcription factor binding through alignment-free and affinity-based analysis of orthologous promoter sequences. Bioinformatics (Oxford, England). 2008, 24 (13): i165-171. 10.1093/bioinformatics/btn154.View ArticleGoogle Scholar
- Gordan R, Hartemink AJ, Bulyk ML: Distinguishing direct versus indirect transcription factor-DNA interactions. Genome research. 2009, 19 (11): 2090-2100. 10.1101/gr.094144.109.View ArticlePubMedPubMed CentralGoogle Scholar
- Wang J, Ward L, Bussemaker H: Classification of Saccharomyces cerevisiae promoter regions into distinct chromatin classes reveals the existence of nucleosome-depleted hotspots of transcription factor occupancy. arXiv:10100713v1. 2008Google Scholar
- Mellor J: The dynamics of chromatin remodeling at promoters. Molecular cell. 2005, 19 (2): 147-157. 10.1016/j.molcel.2005.06.023.
- Jiang C, Pugh BF: Nucleosome positioning and gene regulation: advances through genomics. Nature reviews. 2009, 10 (3): 161-172.
- Ernst J, Plasterer HL, Simon I, Bar-Joseph Z: Integrating multiple evidence sources to predict transcription factor binding in the human genome. Genome research. 20 (4): 526-536. 10.1101/gr.096305.109.
- Won KJ, Chepelev I, Ren B, Wang W: Prediction of regulatory elements in mammalian genomes using chromatin signatures. BMC bioinformatics. 2008, 9: 547. 10.1186/1471-2105-9-547.
- Heintzman ND, Stuart RK, Hon G, Fu Y, Ching CW, Hawkins RD, Barrera LO, Van Calcar S, Qu C, Ching KA, et al: Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome. Nature genetics. 2007, 39 (3): 311-318. 10.1038/ng1966.
- Won KJ, Ren B, Wang W: Genome-wide prediction of transcription factor binding sites using an integrated model. Genome biology. 11 (1): R7. 10.1186/gb-2010-11-1-r7.
- Wolffe AP, Hayes JJ: Chromatin disruption and modification. Nucleic acids research. 1999, 27 (3): 711-720. 10.1093/nar/27.3.711.
- Kouzarides T: Chromatin modifications and their function. Cell. 2007, 128 (4): 693-705. 10.1016/j.cell.2007.02.005.
- Schreiber SL, Bernstein BE: Signaling network model of chromatin. Cell. 2002, 111 (6): 771-778. 10.1016/S0092-8674(02)01196-0.
- Li B, Carey M, Workman JL: The role of chromatin during transcription. Cell. 2007, 128 (4): 707-719. 10.1016/j.cell.2007.01.015.
- Harbison CT, Gordon DB, Lee TI, Rinaldi NJ, Macisaac KD, Danford TW, Hannett NM, Tagne JB, Reynolds DB, Yoo J, et al: Transcriptional regulatory code of a eukaryotic genome. Nature. 2004, 431 (7004): 99-104. 10.1038/nature02800.
- Lee W, Tillo D, Bray N, Morse RH, Davis RW, Hughes TR, Nislow C: A high-resolution atlas of nucleosome occupancy in yeast. Nature genetics. 2007, 39 (10): 1235-1244. 10.1038/ng2117.
- Pokholok DK, Harbison CT, Levine S, Cole M, Hannett NM, Lee TI, Bell GW, Walker K, Rolfe PA, Herbolsheimer E, et al: Genome-wide map of nucleosome acetylation and methylation in yeast. Cell. 2005, 122 (4): 517-527. 10.1016/j.cell.2005.06.026.
- Kurdistani SK, Tavazoie S, Grunstein M: Mapping global histone acetylation patterns to gene expression. Cell. 2004, 117 (6): 721-733. 10.1016/j.cell.2004.05.023.
- Boorsma A, Foat BC, Vis D, Klis F, Bussemaker HJ: T-profiler: scoring the activity of predefined groups of genes using gene expression data. Nucleic acids research. 2005, 33 (Web Server issue): W592-595. 10.1093/nar/gki484.
- Eisen MB, Spellman PT, Brown PO, Botstein D: Cluster analysis and display of genome-wide expression patterns. Proceedings of the National Academy of Sciences of the United States of America. 1998, 95 (25): 14863-14868. 10.1073/pnas.95.25.14863.
- Saldanha AJ: Java Treeview--extensible visualization of microarray data. Bioinformatics (Oxford, England). 2004, 20 (17): 3246-3248. 10.1093/bioinformatics/bth349.
- Moorman C, Sun LV, Wang J, de Wit E, Talhout W, Ward LD, Greil F, Lu XJ, White KP, Bussemaker HJ, et al: Hotspots of transcription factor colocalization in the genome of Drosophila melanogaster. Proceedings of the National Academy of Sciences of the United States of America. 2006, 103 (32): 12027-12032. 10.1073/pnas.0605003103.
- Wang J: A new framework for identifying combinatorial regulation of transcription factors: a case study of the yeast cell cycle. Journal of biomedical informatics. 2007, 40 (6): 707-725. 10.1016/j.jbi.2007.02.003.
- Wang J, Cheung LW, Delabie J: New probabilistic graphical models for genetic regulatory networks studies. Journal of biomedical informatics. 2005, 38 (6): 443-455. 10.1016/j.jbi.2005.04.003.
- Wang J, Myklebost O, Hovig E: MGraph: graphical models for microarray data analysis. Bioinformatics (Oxford, England). 2003, 19 (17): 2210-2211. 10.1093/bioinformatics/btg298.
- Mackay D: Bayesian Methods for Adaptive Models. PhD thesis, California Institute of Technology. 1991.
- Wang J: The effect of prior assumptions over the weights in BayesPI with application to study protein-DNA interactions from ChIP-based high-throughput data. BMC bioinformatics. 11: 412. 10.1186/1471-2105-11-412.
- Wang J, Morigen: BayesPI - a new model to study protein-DNA interactions: a case study of condition-specific protein binding parameters for Yeast transcription factors. BMC bioinformatics. 2009, 10: 345. 10.1186/1471-2105-10-345.
- Nabney I: NETLAB: Algorithms for Pattern Recognition. 2001, London: Springer.
- Shahbazian MD, Grunstein M: Functions of site-specific histone acetylation and deacetylation. Annual review of biochemistry. 2007, 76: 75-100. 10.1146/annurev.biochem.76.052705.162114.
- Mahony S, Benos PV: STAMP: a web tool for exploring DNA-binding motif similarities. Nucleic acids research. 2007, 35 (Web Server issue): W253-258. 10.1093/nar/gkm272.
- Millar CB, Grunstein M: Genome-wide patterns of histone modifications in yeast. Nat Rev Mol Cell Biol. 2006, 7 (9): 657-666. 10.1038/nrm1986.
- Karlic R, Chung HR, Lasserre J, Vlahovicek K, Vingron M: Histone modification levels are predictive for gene expression. Proceedings of the National Academy of Sciences of the United States of America. 107 (7): 2926-2931. 10.1073/pnas.0909344107.
- Filion GJ, van Bemmel JG, Braunschweig U, Talhout W, Kind J, Ward LD, Brugman W, de Castro IJ, Kerkhoven RM, Bussemaker HJ, et al: Systematic Protein Location Mapping Reveals Five Principal Chromatin Types in Drosophila Cells. Cell. 2010, 143 (2): 212-224. 10.1016/j.cell.2010.09.009.
- Pekowska A, Benoukraf T, Ferrier P, Spicuglia S: A unique H3K4me2 profile marks tissue-specific gene regulation. Genome research. 2010, 20 (11): 1493-1502. 10.1101/gr.109389.110.
- Peterson CL, Laniel MA: Histones and histone modifications. Curr Biol. 2004, 14 (14): R546-551. 10.1016/j.cub.2004.07.007.
- Henikoff S: Nucleosome destabilization in the epigenetic regulation of gene expression. Nature reviews. 2008, 9 (1): 15-26. 10.1038/nrg2206.
- Robert F, Pokholok DK, Hannett NM, Rinaldi NJ, Chandy M, Rolfe A, Workman JL, Gifford DK, Young RA: Global position and recruitment of HATs and HDACs in the yeast genome. Molecular cell. 2004, 16 (2): 199-209. 10.1016/j.molcel.2004.09.021.
- Morse RH: RAP, RAP, open up! New wrinkles for RAP1 in yeast. Trends Genet. 2000, 16 (2): 51-53. 10.1016/S0168-9525(99)01936-8.
- Guo X, Tatsuoka K, Liu R: Histone acetylation and transcriptional regulation in the genome of Saccharomyces cerevisiae. Bioinformatics (Oxford, England). 2006, 22 (4): 392-399. 10.1093/bioinformatics/bti823.
- Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz A, Tyers M: BioGRID: a general repository for interaction datasets. Nucleic acids research. 2006, 34 (Database issue): D535-539. 10.1093/nar/gkj109.
- Morse RH: Getting into chromatin: how do transcription factors get past the histones?. Biochemistry and cell biology = Biochimie et biologie cellulaire. 2003, 81 (3): 101-112. 10.1139/o03-039.
- Pham H, Ferrari R, Cokus SJ, Kurdistani SK, Pellegrini M: Modeling the regulatory network of histone acetylation in Saccharomyces cerevisiae. Molecular systems biology. 2007, 3: 153. 10.1038/msb4100194.
- Lee JS, Smith E, Shilatifard A: The language of histone crosstalk. Cell. 142 (5): 682-685. 10.1016/j.cell.2010.08.011.
- Tsai HK, Lu HH, Li WH: Statistical methods for identifying yeast cell cycle transcription factors. Proceedings of the National Academy of Sciences of the United States of America. 2005, 102 (38): 13532-13537. 10.1073/pnas.0505874102.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.