An integrated approach to characterize transcription factor and microRNA regulatory networks involved in Schwann cell response to peripheral nerve injury
- Li-Wei Chang†1,
- Andreu Viader†2,
- Nobish Varghese1,
- Jacqueline E Payton1,
- Jeffrey Milbrandt2 and
- Rakesh Nagarajan1Email author
© Chang et al.; licensee BioMed Central Ltd. 2013
Received: 28 January 2012
Accepted: 29 January 2013
Published: 6 February 2013
The regenerative response of Schwann cells after peripheral nerve injury is a critical process directly related to the pathophysiology of a number of neurodegenerative diseases. This SC injury response is dependent on an intricate gene regulatory program coordinated by a number of transcription factors and microRNAs, but the interactions among them remain largely unknown. Uncovering the transcriptional and post-transcriptional regulatory networks governing the Schwann cell injury response is a key step towards a better understanding of Schwann cell biology and may help develop novel therapies for related diseases. Performing such comprehensive network analysis requires systematic bioinformatics methods to integrate multiple genomic datasets.
In this study we present a computational pipeline to infer transcription factor and microRNA regulatory networks. Our approach combined mRNA and microRNA expression profiling data, ChIP-Seq data of transcription factors, and computational transcription factor and microRNA target prediction. Using mRNA and microRNA expression data collected in a Schwann cell injury model, we constructed a regulatory network and studied regulatory pathways involved in Schwann cell response to injury. Furthermore, we analyzed network motifs and obtained insights on cooperative regulation of transcription factors and microRNAs in Schwann cell injury recovery.
This work demonstrates a systematic method for gene regulatory network inference that may be used to gain new information on gene regulation by transcription factors and microRNAs.
KeywordsTranscriptional regulatory network MicroRNA regulatory network Myelination Schwann cells
Schwann cells (SCs), the main glia cells in the peripheral nervous system, are among a limited number of mammalian cells capable of dedifferentiation. The ability of SCs to dedifferentiate is critical to their role in supporting peripheral nerve regeneration. Following peripheral nerve injury, SCs mount a regenerative response involving coordinated dedifferentiation, proliferation and redifferentiation that supports axonal regrowth and helps restore peripheral nerve function . Like other cellular processes tightly coupled with cell fate determination and developmental timing control, this SC injury response requires precise spatiotemporal regulation of gene expression. This is achieved via an intricate transcriptional program that maintains the balance between positive and negative regulators of SC differentiation . In addition to transcriptional control, recent studies have shown that SC myelination [3–5] and response to injury  are also post-transcriptionally modulated by microRNAs (miRNAs).
Although the role of individual TFs that regulate SC myelination has been investigated, cooperation and interaction among different TFs involved in the response of SCs to nerve damage remain largely unknown. More importantly, how miRNAs integrate into the genetic program of TFs to modulate SC gene expression remains unclear. A comprehensive delineation of the TF and miRNA regulatory network underlying the SC injury response may shed light on fundamental aspects of SC biology. This information could also help fulfill the therapeutic potential of modulating the SC injury response in a number of neurodegenerative diseases characterized by peripheral axonopathy.
Systematically inferring TF and miRNA regulatory networks is difficult to achieve by experimental methods and has motivated development of computational approaches. Computational tools have been created to construct TF and miRNA regulatory networks using information such as gene expression profiling, miRNA expression profiling, and predicted TF and miRNA binding sites [7–11]. However, most of these tools utilized a subset of these data and few studies have combined all of these datasets to infer TFs and miRNA regulatory networks. For example, MIR@NT@N  uses TF and miRNA target prediction but does not use mRNA and miRNA expression data. Moreover, transcriptional regulation of miRNAs is often not included due to the challenge in reliable prediction of miRNA promoters [12, 13]. In addition, chromatin immunoprecipitation with sequencing (ChIP-Seq) data for TFs experimentally characterize TF regulatory targets and have been combined and co-analyzed with mRNA expression profiling data , but usually only a small number of TFs were included. Multiple ChIP-Seq datasets from independent experiments were seldom compiled and incorporated into computational network inference. These limitations highlight the need for additional tools to systematically integrate genomic profiling datasets to better understand gene regulatory networks that govern complex biological systems.
In this study we developed a computational pipeline, InteGRaNet, to infer the gene regulatory network involved in Schwann cell response to injury. This network includes TF-mRNA, TF-miRNA and miRNA-mRNA regulatory interactions. This pipeline utilizes previously developed and new computational tools to integrate mRNA and miRNA expression data, ChIP-Seq data, and in-silico TF and miRNA target predictions. Starting with a set of genes and/or miRNAs obtained from expression profiling analysis, our approach initially constructs a network by connecting TFs to genes or miRNAs using TF targets identified from ChIP-Seq experiments. This network is then expanded to include additional regulatory targets of TFs and miRNAs by using genome-wide target prediction. We apply our computational pipeline to infer the Schwann cell injury response network and study the regulatory interactions around Egr2, a known key regulator of myelination. Furthermore, we study cooperative TF/miRNA regulation involved in the Schwann cell injury response network. This work demonstrates a systematic approach to integrate multiple genomic datasets and to infer TF and miRNA regulatory networks, which may be used to better understand coordinated gene regulation by TFs and miRNAs in complex biological systems.
Overview of TF and miRNA regulatory network inference
Identification of an initial set of genes involved in the SC injury response network
To better understand the functions of these gene clusters, we performed Gene Ontology (GO) term enrichment analysis on the IRGCs. Consistent with initial dedifferentiation/proliferation and subsequent redifferentiation of SCs after nerve injury, we found that PGC genes (upregulated immediately after crush) were enriched for functional categories involved in cell proliferation, including cell cycle and chromatin assembly. MGC genes (donwregulated immediately after crush), on the other hand, were enriched for functional categories involved in SC differentiation, including lipid metabolic process and myelination (Figure 2B). The enriched categories in these clusters were therefore consistent with their expression pattern after nerve injury. Genes in the four IRGCs were used as the initial set of nodes for TF and miRNA network inference.
Identification of potential TF and miRNA regulators in the SC injury response network
Transcriptional activators or repressors are likely to be correlated or anti-correlated with the expression of their target genes. Thus, to identify potential regulators of genes in the IRGCs we identified TFs that were correlated or inversely correlated with the expression profile of each IRGC. As a result, 6, 2, 23, and 2 TFs were found to have correlated expression with the four IRGCs, respectively. Three TFs, Cbfb, Taf9, and Mef2a, were found to have inversely correlated expression with Cluster 2. As shown in a recent study, miRNA regulators may be anti-correlated or correlated with the expression of their targets, functioning as either a reinforcer or a fine-tuner . Thus, to identify miRNA regulators for genes in IRGCs, we analyzed the nerve miRNA expression measured before and after crush injury . Comparing miRNA expression profiles with mRNA expression profiles, we found that, of the 87 miRNAs expressed in peripheral nerve, 17, 26, 6, and 6 miRNAs were correlated with the expression profile of the four IRGCs, respectively (Additional file 3: Table S3). 5 and 8 miRNAs were anti-correlated with the expression profile of Cluster 3 and Cluster 4 (Additional file 4: Table S4). Overall, this analysis identified 30 miRNAs that were expressed similar to MGC genes and 6 miRNAs that were expressed similar to PGC genes (Figure 2C, Additional file 5: Table S5). To check if the dynamically regulated miRNAs regulate key gene functions involved in myelination, we examined whether genes in the GO categories enriched in dynamically regulated IRGC genes (Figure 2B) were potential targets of the dynamically regulated miRNAs. For 12 of the 16 GO categories enriched in IRGCs, more than 50% of the annotated genes have a seed sequence match to at least one of these correlated or anti-correlated miRNAs (Figure 2D). This result suggests that these correlated or anti-correlated miRNAs are likely to be regulators of dynamically regulated genes in IRGCs.
Identification of TF-mRNA and TF-miRNA interactions using ChIP-Seq data
ChIP-Seq datasets used to identify TF regulatory interactions in the SC injury response network
GEO accession number
Number of peaks
Number of targets
The overall performance of TSSvote is difficult to assess due to the limited number of experimentally validated miRNA transcription start sites. When the performance of TSSvote was tested by a compiled benchmark set of 21 experimentally determined human and mouse miRNA TSSs (Additional file 6: Table S6), TSSvote predicted 52% of these test TSSs within 500 bp and 81% of them within 2500 bp, outperforming all other currently available methods tested (as measured by the number of miRNA TSSs predicted within a given error range; Figure 3B). The predictions of TSSvote were further supported by the fact that a large proportion of miRNAs (63% of intergenic and 45% of intragenic miRNAs in mouse) were located within 10 kb from the pre-miRNA sequence (Figure 3C, Additional file 7: Table S7). Furthermore, the miRNA promoter sequences (as defined above) were more conserved than randomly selected intergenic sequences of the same length (Chi-square P-value=1.45E-51) (Figure 3D) and contained significantly more TF binding sites than random sequences (Chi-square P-value=1.28E-34) (Figure 3E).
Additional TF target prediction using genome-wide TFBS enrichment analysis
Although ChIP-Seq data of TFs allowed the extraction of experimentally characterized TF-mRNA and TF-miRNA interactions, this information was only available for a subset of TFs. Moreover, because ChIP-Seq experiments might have been performed under different conditions, some transcriptional regulatory interactions critical to the SC injury response may not be identified. To address this shortcoming, we included computationally predicted transcriptional regulatory interactions based on an improved version of a previously developed statistical model for genome-wide TF binding site enrichment  (see Methods). Briefly, this approach calculated a binding probability score for each TF-gene pair using all the evolutionarily conserved TF binding sites (TFBS) in proximal promoters and evaluated a P-value using TFBS permutation (see Methods, Additional file 8: Figure S1). In this study, the model was improved by using a phylogenetic tree-based scoring function to incorporate evolutionary conservation information from more species. Using this model, we predicted 108,204 mouse and 132,516 human TF-mRNA interactions. By applying this model to the miRNA promoters predicted by TSSvote (Additional file 8: Figure S1) we also predicted a total of 2,658 mouse and 5,395 human TF-miRNA interactions. Using these predictions, we expanded the regulatory network to include 79 TF-TF interactions and 70 TF-miRNA interactions, connecting 34 TFs and 22 miRNAs.
Identification of miRNA-mRNA interactions using computational prediction
The previous ChIP-Seq data analysis and genome-wide TF target prediction identified TF-mRNA and TF-miRNA interactions. To identify miRNA-mRNA interactions we performed computational miRNA target prediction. A previous study showed that better performance of miRNA target prediction may be achieved by combining multiple currently available algorithms in order to reach reasonable specificity while minimizing loss of sensitivity . Therefore we chose to combine TargetscanS  and pictar , which provides higher specificity, with miRanda , which provide higher sensitivity, to identify targets of miRNAs. Only targets of miRNAs predicted by at least two of these three methods were included in the network construction. Using this approach, we identified 57,980 mouse and 75,570 human miRNA-mRNA interactions. The average number of targets is 250 genes per miRNA, respectively, which is close to the speculated number of targets per miRNA . Using this result, 43 miRNA-mRNA interactions were added to the SC injury response network.
Expanding network to include master TF regulators of coexpressed mRNAs or miRNAs
Up to this step, the TFs included in the regulatory network were identified by their correlation or inverse correlation with the dynamically regulated IRGCs. However, master regulators of genes in the IRGCs may share a similar expression profile but with a lag time, or they may be constantly expressed throughout SC injury response while being modulated by mechanisms other than transcriptional control. These TFs will be missed by expression correlation-based discovery but could be identified as common regulators of genes in IRGCs based on enrichment of their TF binding sites. Therefore, we applied a previously developed tool, the Promoter Analysis Pipeline (PAP) , to identify curated TF binding sites that were enriched in the proximal promoter sequences of genes in each IRGC. As a result, we found several TFBS significantly enriched in genes in clusters 1, 2 and 4 based on a Bonferroni corrected P-value cutoff of 0.05 (Additional file 10: Table S9) (see Methods). These TFs included E2f1 and Nfyc that were correlated with the IRGCs and had been added to the network in Step 2. Remarkably, known functions of these TFs were consistent with the enriched GO terms for the corresponding gene clusters they regulate (e.g. Nfkb1 for inflammatory response, Egr2 for myelination, and E2f1 for cell cycle). Applying the same analysis to miRNAs correlated with the IRGCs, we found one TF, Spz1, whose binding sites were enriched in miRNAs correlated with cluster 2. These master TFs were added to the SC injury response network as additional nodes.
Expanding network to include master miRNA regulators of coexpressed genes
Similar to TFs, common miRNA regulators of genes in the IRGCs may not have expression profiles that are tightly correlated or anti-correlated with their target genes. These miRNAs may be identified by the enrichment of their predicted target genes in the IRGCs. Thus, for each miRNA we calculated the hypergeometric P-value for its target enrichment (see Methods). As a result, we found 2, 2, and 3 miRNAs with a significant enrichment P-value for clusters 1, 2, and 3 respectively (Additional file 11: Table S10). Like the search of master TF regulators using TFBS enrichment, this analysis identified three miRNAs (let-7a, let-7f and miR-145) that were correlated with the expression of the IRGCs. Interestingly, miR-140, whose predicted targets were enriched in cluster 3, was not identified using expression correlation with IRGCs due to its low expression on microarray. However, qPCR experiments showed that miR-140 was indeed expressed in nerve and had an expression profile correlated with MGCs . These results showed that analysis of miRNA target enrichment may identify miRNA regulators whose expression was not correlated or anti-correlated with its targets or miRNA regulators whose expression was not accurately measured on microarray.
Expanding network to include regulatory interactions for additional regulators
Availability of the InteGRaNet pipeline and datasets for network construction
The network construction pipeline we developed in this study including all the raw datasets is available to the public. These include 71,346 mouse and 64,367 human TF-mRNA interactions identified by the compendium of public ChIP-Seq data, high quality sets of 1,183 mouse and 1,511 human TF-miRNA interactions identified by miRNA TSS prediction and ChIP-Seq data, 108,204 mouse and 132,516 human computationally predicted TF-mRNA interactions, 2,658 mouse and 5,395 human computationally predicted TF-miRNA interactions, and 57,980 mouse and 75,570 human miRNA-mRNA interactions predicted by three algorithms. A Perl script can take a list of genes and miRNAs these data files as input and creates a network in a text format. These data files and the script are available upon request.
Comparison to current algorithms for TF and miRNA network construction
To test the performance of our approach, we compared the InteGRaNet pipeline to currently available methods for constructing TF and miRNA regulatory networks, including GenMiR++ , MIR@NT@N , mirConnX , MAGIA  and EdgeExpressDB . These methods use similar but different approaches and have different strengths and limitations. Of the six algorithms, MIR@NT@N, mirConnX, MAGIA and InteGRaNet predict all three types of interactions, i.e. TF-mRNA, TF-miRNA and miRNA-mRNA regulation. EdgeExpressDB only predicts TF-mRNA and miRNA-mRNA but not TF-miRNA interactions; GenMiR++ only infers miRNA-mRNA interactions using expression profiling data. Thus, while EdgeExpressDB and GenMiR++ can be used to predicted particular types of interactions, they are limited in comprehensive inference of comprehensive TF and miRNA networks. MIR@NT@N, mirConnX, MAGIA and InteGRaNet all use a pre-curated/pre-calculated set of TF and miRNA targets and combine this dataset with user inputted mRNA and miRNA expression data. Of these four methods, mirConnX allows users to change the weight of the predefined target dataset in network construction, whereas the other three do not provide this option. mirConnX and InteGRaNet use sophisticated statistical models to calculate TF and miRNA targets, whereas MIR@NT@N and MAGIA merely extract information from existing databases. Finally, EdgeExpressDB uses one human leukemia dataset to generate networks and does not allow users to use their own data to construct regulatory networks.
To benchmark these algorithms, we first compiled a set of known interactions using the GeneGO database (http://www.genego.com). GeneGO includes manually curated regulatory interactions from the literature. Using genes and miRNAs in our Schwann cell injury recovery network, the GeneGO database search returned a network that consisted of 871 connections, including 772 TF-mRNA, 30 TF-miRNA and 69 miRNA-mRNA interactions. Because these interactions were based on previous studies and were only a part of the complete SC injury network, interactions found by computational algorithms but not by GeneGO may not be false positives. Also, because GeneGO interactions were found in diverse biological systems and were not specific to Schwann cells, GeneGO interactions that were not found by computational algorithms might not be false negatives. For these reasons, it was difficult to evaluate the sensitivity and specificity of the algorithms.
The GeneGO network also allowed us to evaluate the effect of including ChIP-Seq data in InteGRaNet. While the performance of InteGRaNet without ChIP-Seq data was similar to InteGRaNet with ChIP-Seq data in predicting TF-mRNA interactions (Figure 6B), ChIP-Seq data significantly improved the performance in predicting TF-miRNA interactions (Figure 6C). Predictions of miRNA-mRNA interactions did not use ChIP-Seq data and thus were not affected.
Percentage of predicted interactions made by each algorithm that are also predicted by at least one other algorithm
Effect of model parameters on InteGRaNet performance
The Egr2 subnetwork revealed biological insights on regulation of myelination
Egr2 was in turn predicted to directly regulate the expression of the TF Hic1 and 3 miRNAs (let-7f, let-7a, and miR-22) (Figure 8A). These targets were particularly interesting as they may allow Egr2 to broadly regulate a number of genes and signaling cascades important for SC differentiation. For example, let-7a and let-7f, as well as other let-7 family members, are known to interact with a variety of targets to enhance cellular differentiation . Similarly, the tumor suppressor miR-22 has been shown to target the 3’UTR of Pten and modulate Akt signaling, which critically determines the extent of SC myelination [35, 36]. These results showed that our method to delineate the genetic networks driving the SC injury response elucidated Egr2 regulatory pathways that were consistent with current knowledge on the regulation of SC differentiation.
Examination of the Egr2 subnetwork also revealed that Egr2 participates in a number of potentially important regulatory network motifs [17, 27, 37]. In a network motif such as a coherent or incoherent feedforward loop, TFs and miRNAs cooperate to reinforce or modulate the transcriptional control of the common target gene. In the Egr2 subnetwork, Egr2 was found to participate in feedforward loops involving the tumor suppressor Hic1 and the miRNAs let-7a and let-7f (Figure 8B). The expression of Hic1 is not tightly correlated with Egr2, suggesting that the function of this feedforward loop is to maintain the expression level of Hic1 within a small range. In addition, we uncovered two feedback loops of Egr2 (Figure 8C). In the Egr2/Hic1/miR-124 negative feedback loop, Egr2 regulates Hic1, which induces miR-124 to inhibit the expression of Egr2. Using this feedback loop, Egr2 modulates its own expression with an oscillatory behavior. When Egr2 expression is too high or too low, it raises or lowers its own expression level through Hic1/miR-124. As a result, Egr2 expression is maintained within a range. Moreover, the expression of the mediator of this loop, Hic1, is also closely modulated by the Egr2/Let-7/Hic1 loop mentioned above, ensuring the robustness of this mechanism. Finally, Egr2 forms a positive feedback loop with let-7 and Gabpa. Egr2 activates let-7, which inhibits Gabpa, an inhibitor of Egr2. This loop allows Egr2 to assuage the inhibitory effect of Gabpa and increases its own expression. These feedforward and feedback loops cooperate to maintain the expression of Egr2 within a constant range. Together, these Egr2 network motifs suggest that the cooperation between miRNAs and TFs ensures rapid and robust transitions between the distinct differentiation states of SCs that are necessary to support nerve regeneration.
TF and miRNA feedforward loops in the SC injury response network
In this study, we developed a computational pipeline for TF and miRNA regulatory network inference that integrates expression profiling data of mRNAs and miRNAs, TF regulatory targets derived from ChIP-Seq data, and computational TF and miRNA target prediction. Our method takes a step-wise, bottom-up approach that starts with dynamically regulated co-expressed gene clusters as the basic network node set and sequentially adds TFs and miRNAs and their regulatory interactions to the network. By applying our approach to comprehensive delineation of the gene regulatory network underlying the SC response to nerve injury, we showed that this method allows inference of integrated gene regulation by TFs and miRNAs in complex biological settings. Our method was able to provide new insights into fundamental aspects of the SC regenerative response, indicating its potential to help elucidate the complexities of biological processes governed by intricate networks of TFs and miRNAs.
An important step in our approach was to use available Chip-Seq data to derive mRNA and miRNA targets of TFs. While a significant number of transcription factor ChIP-Seq data has been accumulated, only few studies have combined these datasets and used this resource to study transcriptional regulation of mRNAs , and no studies have co-analyzed these datasets to infer transcriptional regulation of miRNAs. This is partly due to the lack of reliable prediction of miRNA promoters. Our TSSvote algorithm incorporates several sequence features that imply transcription start sites and our method does not rely on experimental data that probe promoter usage, which may be dependent on the experimental conditions. Testing our algorithm using experimentally validated miRNA TSS sites showed that the accuracy of our prediction was within 2500 bp in 81% of the cases and the performance was better than current methods (Figure 3B). These predicted miRNA TSS allowed for the identification of ChIP-Seq peaks that were located within miRNA promoters. The identified miRNA promoters also allowed for the computational prediction of TFs that regulate miRNAs (Additional file 8: Figure S1). Our computational predictions are expected to be more accurate than previously reported methods [12, 13] due to the more accurate miRNA promoter annotation and a more robust TF binding site analysis model.
A notable strength of our methods is that it integrates multiple types of experimental and computational data via a modular approach. Thus, individual components of the network inference pipeline may be improved or replaced separately, and additional information about TF or miRNA regulation may be added to the prediction model. For example, the computational prediction of TF targets may be further improved by incorporating epigenetic information . Also, additional regulatory mechanisms, such as regulation by non-coding RNAs and by RNA binding proteins , may be added into the network once experimental data or computational prediction become available for these interactions.
The three major components in our pipeline include identification of TF targets using ChIP-Seq data, identification of TF targets using computational prediction and identification of miRNA target using computational prediction. All these components use a set of parameters and cutoffs to perform target identification or prediction, and their performance depends on the selection of cutoffs, with a lower cutoff generating more targets and a high cutoff generating fewer targets. While an arbitrary choice of cutoff is inevitable, we attempted to optimize our cutoff selection using independent datasets. For TF targets identified by ChIP-Seq data, the number of identified targets depended on the ChIP-Seq peak calling algorithm and its parameters. This performance of peak calling can be optimized but is beyond the scope of our manuscript. For computational prediction of TF targets, the P-value cutoff was selected and optimized in a previous publication by comparing to an independent study . For computational prediction of miRNA targets, the cutoff was selected based on a previous estimate of the number of targets per miRNA .
A key component of our method was the prediction of TF-target interactions by computational models of TFBS enrichment. Regulatory networks inferred by large-scale genome-wide prediction methods like ours are often difficult to validate thoroughly and experimentally. However, our prediction method was based on a published statistical model that was validated using multiple datasets, including compiled sets of co-regulated genes and multiple ChIP-chip datasets . Furthermore, the improved version of this model was compared to a large set of independent ChIP-Seq experiment data for 70 TFs and demonstrated good consistency for both TF-mRNA and TF-miRNA regulation. In addition, regulatory pathways identified in the subnetwork around Egr2, a well known transcription regulator of myelination, are consistent with current knowledge of regulation of SC myelination (Figure 8A). Remarkably, the post-transcriptional regulation of Egr2 by two miRNAs, miR-124 and miR-140, identified in our network were validated experimentally using luciferase assays (Additional file 12: Figure S2) . These results suggest that our method produced an informative and reliable regulatory network for SC injury response.
To demonstrate that our network construction method may be used to gain insight on gene regulation and regulatory pathways in complex biological systems, we applied our method to study the TF and miRNA regulatory networks governing the SC injury response. This response involves the cycling of SCs between distinct differentiation states that support nerve regeneration. Proper cycling is accomplished through the reciprocal regulation of genes driving SC dedifferentiation and myelination respectively, through transcriptional control by TFs as well as post-transcriptional modulation by miRNAs [3–6]. The importance of transcriptional and post-transcriptional control in the SC injury response as well as the reciprocal nature of the genetic programs driving this process make SC injury recovery an ideal system for studying the cooperation of TF and miRNA mediated gene regulation. When we examined the SC injury response subnetwork around Egr2, a known key regulator of myelination, we found other regulators previously associated with SC differentiation. Furthermore, we found that Egr2 interacts with miRNAs in feedforward and feedback loops, which may be important for modulating the expression of both Egr2 and its targets.
miRNAs and TFs tend to cooperate in coherent or incoherent feed-forward loops, in which miRNAs may function as either a reinforcer or a modulator, to control the expression of a target gene [17, 27, 37]. In our analysis of network motifs in the SC injury response network, we found that genes involved in proliferation tend to be regulated by coherent loops, where their repression during SC injury response is reinforced by miRNAs. Genes involved in myelination, on the other hand, tend to be regulated by incoherent loops, where their activation during SC injury response is “fine-tuned” by miRNAs (Figure 9). This suggests that fast and precise timing of the activation/inactivation of genes associated with the immature state of SCs is most critical for the dedifferentiation of these glia after nerve injury. In contrast, proper remyelination seems not to require as carefully controlled timing of gene expression, but instead depend mostly on achieving precise functional levels of myelin-related proteins. This is particularly interesting because myelin formation and maintenance is very sensitive to gene dosage effects. In fact, both abnormally low or high levels of specific myelin proteins can cause peripheral neuropathy in humans .
We present in this work a novel approach to TF and miRNA regulatory network inference. Our approach systematically integrates multiple types of experimental data and computational prediction on gene regulation and thus produces more reliable gene regulatory networks. Applying our approach to the SC injury response dataset demonstrates that our method may be used to gain new insight on gene regulation by TFs and miRNAs.
Identification of dynamically regulated SC injury response gene clusters (IRGCs)
SC mRNA expression profiling before and after crush and transection injury were performed using Affymetrix MU74Av2 chips in a previous study (Nagarajan et al., 2002). Gene expression levels were measured for uninjured nerves, on days 4, 7 and 10 after crush injury, and on days 1, 4, 7, and 10 after transection injury. Mouse gene expression profiling data during SC development were collected from an independent study . This dataset profiled mRNA expression on days 0, 2, 4, and 10 after birth. Expression data were processed and normalized using Affymetrix MAS5 algorithm. A nerve-expressed gene was defined as one that was called present in at least one data point during SC injury response or development. k-means clustering was used to cluster genes based on the combined expression profiles of injury response and development. Gene clusters that contained known myelin genes and that were differentially expressed before and after crush injury based on a t-test were identified. Clusters with similar expression profiles based on the Pearson correlation coefficient were identified and merged. The average expression profile, i.e. the centroid, was calculated for each cluster. Nerve-expressed genes that were correlated with the centroids based on a Pearson correlation coefficient cutoff of 0.8 were identified as the final coexpressed IRGCs.
Identification of miRNA regulators of SC injury response gene clusters
SC miRNA expression profiling before and after crush injury were performed using HTG Molecular qNPA miRNA microarrays in a previous study . Expression of 1046 miRNAs was profiled using this microarray platform on days 0, 4 and 14 after crush injury. miRNA expression data were filtered using the following criteria: miRNAs that had an expression level lower than the average expression of the control miRNA probesets were removed from further analysis. miRNAs for which the expression of one duplicate probeset at all time points were significantly higher than that of the other duplicate probeset based on a Mann–Whitney U test were also removed from further analysis. After this filtering procedure, the average expression of the two duplicate probesets at each time point was used as the expression at that time point. miRNAs that were correlated or anti-correlated with the expression of IRGCs were identified using miRNA expression data and crush injury mRNA expression data on days 0, 4, and 10.
Analysis of ChIP-Seq datasets
Publicly available ChIP-Seq datasets for human and mouse transcription factors were compiled and collected from literature search. Peak locations identified in the original studies were used if they are available. When peak locations were not available, Partek Genomic Suite with default parameters was used to identify peaks using raw alignment data. All peak locations were converted to genomic coordinates of human genome build hg18 or mouse genome build mm9. Peak locations of human datasets were then mapped to the mouse genome using UCSC’s liftover tool. Peaks that were located within the promoters of mRNAs or miRNAs were identified using NCBI’s gene annotation for mRNAs and computationally predicted miRNA TSS (see below). When peaks were mapped across species, only peaks that were located within proximal promoters and mapped to proximal promoters of orthologous genes (based on HomoloGene) were retained in the further analysis.
Computational prediction of miRNA transcription start sites
To predict miRNA TSS, all human and mouse miRNAs were categorized as intergenic or intragenic miRNAs. Intragenic miRNAs were defined as miRNAs located between the start and end of a protein coding gene that is on the same strand (termed the host gene). miRNAs that are not intragenic were defined as intergenic. For intergenic miRNAs, the TSS search range was defined as the genomic sequence between the end of the upstream gene and the start of the pre-miRNA. For intragenic miRNAs, the TSS search range was defined as the genomic sequence between the start of the host gene and the start of the pre-miRNA. To predict miRNA TSS, a new algorithm, TSSvote, was developed to score each 100 bp window within the TSS search range based on transcription related sequence features. Mapping locations of known transcripts or ESTs and CpG islands were downloaded from the UCSC genome browser. CAGE tags were downloaded from the FANTOM project . H3K4me3 histon modification marks were collected from a compiled set of H3K4me3 ChIP-Seq studies [43–51]. Conservation score was calculated as the number of aligned species in the 100 bp sequence window divided by the number of species in which the pre-miRNA was conserved. Using these sequence features, TSSvote calculated the score of each sequence window by score = 2δtranscript/EST + δ CpG + δ CAGE + δH 3K 4me 3 + conservation where δ feature equals one if a given feature is located within the sequence window. Otherwise, δ feature equals zero. For each miRNA, the sequence window within the TSS search range that had the highest score was predicted as the miRNA TSS. When multiple sequence windows had the same score, the sequence window closest to the miRNA was assigned as the predicted TSS.
Computational prediction of TF regulatory targets
To predict TF regulatory targets, we applied a previously developed computational model of transcription factor binding site (TFBS) enrichment  with several extended features, including more TF binding models and an improved phylogenetic model for TFBS conservation. Briefly, multiple sequence alignments of ten vertebrates, whose genomes were completely sequenced with a good coverage (>6x), were obtained from the UCSC genome browser download site. Using NCBI’s mouse genome annotation (build 37.1), for each mouse gene the multiple alignments of genomic sequence from -100 kb of the TSS to the end of the gene itself were extracted. Within this range, the sequence between -10 kb and +5 kb of the TSS and the sequence regions that have a regulatory potential (RP) score  larger than 0.1 were identified and collected as the TFBS search space. To search for TFBS, a total of 867 vertebrate position weight matrix models (PWMs) of TFs were compiled from the TRANSFAC , JASPAR , and UniProbe  databases. Using these PWMs, putative TFBS were identified in the TFBS search space using the program patser with the default score cutoff, and the evolutionary conservation of each site was determined using multiple sequence alignments.
where X is the collection of all sites, sx is the PWM score of binding site x and wx is the total phylogenetic tree branch length of all the species in which binding site x is conserved, based on a previously published tree . Note that in this scoring formula the common branch length shared by two close species was only counted once. In this model, a site that is conserved in a distantly related species will gain a higher weight than one conserved in a closely related species.
Using this scoring model, the probability score for binding and the P-value were calculated based on all the identified TFBSs. Because the consolidated database of TF binding weight matrix models may have multiple models for the same TF, the bias in P-value calculation implanted by multiple hypothesis testing was removed by performing a Bonferroni correction on the raw P-value for each individual weight matrix of the same TF. Regulatory targets of TFs were identified using an adjusted P-value cutoff of 0.005, which was determined by comparing the number of computationally predicted targets to the number of ChIP-Seq identified targets for available TFs. The same analysis was applied to identify TFs that regulate miRNAs using miRNA promoters, which were defined as the sequence between -5 kb and +1 kb of the miRNA TSS predicted by TSSvote. For more details on the computational model please refer to the original paper that described the model .
Computational prediction of miRNA regulatory targets
We combined miRNA target predicted by three algorithms, including TargetscanS , pictar , and miRanda . miRNA targets predicted by TargetscanS were downloaded from http://www.targetscan.org/ (Release 4.2). miRNA targets predicted by pictar were downloaded from http://pictar.mdc-berlin.de/ and targets predicted by miRanda were downloaded from http://www.microrna.org (September 2008 release).
Identification of master TF regulators of coexpressed mRNAs or miRNAs
Common TF regulators of coexpressed mRNAs were identified by the previously developed Promoter Analysis Pipeline (PAP) tool , which is available via a web interface or API at http://bioinformatics.wustl.edu/webTools/PromoterAnalysis.do. PAP searches for the enriched TF binding sites in the promoter sequences of coexpressed mRNAs or miRNAs. Briefly, an R-score was calculated for each gene in the mouse genome based on the ranking of the probability score for binding. Genes that are more likely to be regulated by a TF will have a higher R-score for that TF. For a set of coexpressed genes, the average of the R-scores of the member genes were calculated for each TF. The P-value for a given R-score was then calculated by using randomly selected gene clusters of the same size. A Bonferroni corrected P-value cutoff of 0.05 was used to identify TFs that had significantly higher average R-scores as common regulators. The same analysis was applied to identify common TF regulators of coexpressed miRNAs based on miRNA R-scores, which were calculated using the probability score for binding for miRNAs.
Identification of master miRNA regulators of coexpressed mRNA genes
Common miRNA regulators of coexpressed mRNA genes were identified by the enrichment of miRNA targets in the coexpresed genes. The hypergeometric P-value for enrichment was calculated for a miRNA using the total number of nerve expressed genes that were predicted as targets of any miRNA (population size), the number of coexpressed mRNA genes (sample size), the number of nerve expressed genes that were predicted as targets of the miRNA (number of successes in population), and the number of coexpressed genes that were predicted as targets of the miRNA (number of successes in sample). Common miRNA regulators were identified using a hypergeometric P-value cutoff of 0.05.
Plasmids: pre-mir-124 was obtained through PCR amplification from genomic DNA. The resulting fragment was cloned between the BamHI and Nhe I sites in the miRNASelect pEP-MIR Cloning and Expression Vector (Cell Biolabs) using the InFusion HD cloning system (Clonetech) according to the manufacurer’s recommendations. Pre-mir-124 included the miRNA stem loop and ~100 nt of flanking sequence on either side. For luciferase assays, the 3’UTR region of Egr2 was PCR amplified from genomic DNA using the following primers: Egr2 3’UTR: F, AAAGCT GCGCACTAGTGATGAAGCTCTGGCTGACACACCA; R, ATCCTTTATTAAGCTTACCA TAGTCAATAAGCCATCCAT. DNA fragments were cloned downstream of the luciferase gene between the HindIII and SpeI sites in the pMIR-REPORT miRNA Expression Reporter Vector (Ambion). The 3’UTR of Egr2 lacking the miR-124 pad was cloned in an analogous manner. pRL-CMV Renilla Luciferase Reporter vector (promega) was used as a transfection control.
Luciferase assays: HEK293T cells were seeded at a density of 50,000 cells/well in 24 well plates in DMEM media (Invitrogen) supplemented with 10% fetal bovine serum (FBS), 2 mM L-glutamine. Cell were transfected 24 h later, with either a pEP-MIR vector expressing a pre-miRNA or with the pEP-mir Null control and with the pMIR-REPORT luciferase reporter vector containing the appropriate 3’UTR linked to luciferase. pRL-CMV Renilla Luciferase Reporter vector (Promega) was used as a transfection control. A total of 200 ng of plasmid DNA/well were transfected at a ratio of 50:1:0.5 (miRNA : luciferase reporter : transfection Ctrl). Cells were harvested 48 h post-transfection and assayed using a Dual-Luciferase Reporter Assay System (Promega) according to the manufacturer’s protocol.
Injury response gene cluster
Myelination gene cluster
Proliferation gene cluster
Transcription start site
Transcription factor binding site
Promoter Analysis Pipeline
Position weight matrix
We thank the Center for Biomedical Informatics (CBMI), which provided the in silico analysis service. The CBMI is partially supported by NCI Cancer Center Support Grant P30 CA91842 to the Alvin J. Siteman Cancer Center and by ICTS/CTSA Grant UL1RR024992 from the National Center for Research Resources (NCRR), a component of the National Institutes of Health (NIH), and NIH Roadmap for Medical Research. This work is also supported by NIH Neuroscience Blueprint Center Core Grant P30NS057105 to Washington University, the HOPE Center for Neurological Disorders, National Institutes of Health Grants NS040745 (JM), AG13730 (JM). LC is supported by an NIH Pathway to Independence Award K99LM010824.
- Geuna S, Raimondo S, Ronchi G, Di Scipio F, Tos P, Czaja K, Fornaro M: Chapter 3: Histology of the peripheral nerve and changes occurring during nerve regeneration. Int Rev Neurobiol. 2009, 87: 27-46.View ArticlePubMed
- Jessen KR, Mirsky R: Negative regulation of myelination: relevance for development, injury, and demyelinating disease. Glia. 2008, 56: 1552-1565.View ArticlePubMed
- Bremer J, O’Connor T, Tiberi C, Rehrauer H, Weis J, Aguzzi A: Ablation of Dicer from murine Schwann cells increases their proliferation while blocking myelination. PLoS One. 2010, 5: e12450-PubMed CentralView ArticlePubMed
- Pereira JA, Baumann R, Norrmen C, Somandin C, Miehe M, Jacob C, Luhmann T, Hall-Bozic H, Mantei N, Meijer D, Suter U: Dicer in Schwann cells is required for myelination and axonal integrity. J Neurosci. 2010, 30: 6763-6775.View ArticlePubMed
- Yun B, Anderegg A, Menichella D, Wrabetz L, Feltri ML, Awatramani R: MicroRNA-deficient Schwann cells display congenital hypomyelination. J Neurosci. 2010, 30: 7722-7728.PubMed CentralView ArticlePubMed
- Viader A, Chang LW, Fahrner T, Nagarajan R, Milbrandt J: MicroRNAs modulate Schwann cell response to nerve injury by reinforcing transcriptional silencing of dedifferentiation-related genes. J Neurosci. 2011, 31: 17358-17369.PubMed CentralView ArticlePubMed
- Lionetti M, Biasiolo M, Agnelli L, Todoerti K, Mosca L, Fabris S, Sales G, Deliliers GL, Bicciato S, Lombardi L, Bortoluzzi S, Neri A: Identification of microRNA expression patterns and definition of a microRNA/mRNA regulatory network in distinct molecular groups of multiple myeloma. Blood. 2009, 114: e20-e26.View ArticlePubMed
- Huang JC, Babak T, Corson TW, Chua G, Khan S, Gallie BL, Hughes TR, Blencowe BJ, Frey BJ, Morris QD: Using expression profiling data to identify human microRNA targets. Nat Methods. 2007, 4: 1054-1059.View Article
- Le Bechec A, Portales-Casamar E, Vetter G, Moes M, Zindy PJ, Saumet A, Arenillas D, Theillet C, Wasserman WW, Lecellier CH, Feirderich E: MIR@NT@N: a framework integrating transcription factors, microRNAs and their targets to identify sub-network motifs in a meta-regulation network model. BMC Bioinforma. 2011, 12: 67-View Article
- Huang GT, Athanassiou C, Benos PV: mirConnX: condition-specific mRNA-microRNA netwoek integrator. Nucleic Acids Res. 2011, 39: W416-W423.PubMed CentralView ArticlePubMed
- Bisognin A, Sales G, Coppe A, Bortoluzzi S, Romualdi C: MAGIA2: from miRNA and genes expression data integrative analysis to microRNA-transcription factor mixed regulatory circuits (2012 update). Nucleic Acids Res. 2012, 40: W13-W21.PubMed CentralView ArticlePubMed
- Steverin J, Waterhouse AM, Kawaji H, Lassmann T, van Nimwegen E, Balwierz PJ, de Hoon MJ, Hume DA, Carninci P, Hayashizake Y, Suzuki H, Daub CO: FANTOM4 EdgeExpressDB: an integrated database of promoters, genes, microRNAs, expression dynamics and regulatory interactions. Genome Biol. 2009, 10: R39-View Article
- Shalgi R, Lieber D, Oren M, Pilpel Y: Global and local architecture of the mammalian microRNA-transcription factor regulatory network. PLoS Comput Biol. 2007, 3: e131-PubMed CentralView ArticlePubMed
- Novershtern N, Subramanian A, Lawton LN, Mak RH, Haining WN, McConkey ME, Habib N, Yosef N, Chang CY, Shay T, Frampton GM, Drake AC, Leskov I, Nilsson B, Preffer F, Dombkowski D, Evans JW, Liefeld T, Smutko JS, Chen J, Friedman N, Young RA, Golub TR, Regev A, Ebert BL: Densely interconnected transcriptional circuits control cell states in human hematopoiesis. Cell. 2011, 144: 296-309.PubMed CentralView ArticlePubMed
- Nagarajan R, Le N, Mahoney H, Araki T, Milbrandt J: Deciphering peripheral nerve myelination by using Schwann cell expression profiling. Proc Natl Acad Sci USA. 2002, 99: 8998-9003.PubMed CentralView ArticlePubMed
- Verheijen MH, Chrast R, Burrola P, Lemke G: Local regulation of fat metabolism in peripheral nerves. Genes Dev. 2003, 17: 2450-2464.PubMed CentralView ArticlePubMed
- Tsang J, Zhu J, van Oudenaarden A: MicroRNA-mediated feedback and feedforward loops are recurrent network motifs in mammals. Mol Cell. 2007, 26: 753-767.PubMed CentralView ArticlePubMed
- Saini HK, Enright AJ, Griffiths-Jones S: Annotation of mammalian primary microRNAs. BMC Genomics. 2008, 9: 564-PubMed CentralView ArticlePubMed
- Corcoran DL, Pandit KV, Gordon B, Bhattacharjee A, Kaminski N, Benos PV: Features of mammalian microRNA promoters emerge from polymerase II chromatin immunoprecipitation data. PLoS One. 2009, 4: e5279-PubMed CentralView ArticlePubMed
- Marson A, Levine SS, Cole MF, Frampton GM, Brambrink T, Johnstone S, Guenther MG, Johnston WK, Wernig M, Newman J, Calabrese JM, Dennis LM, Volkert TL, Gupta S, Love J, Hannett N, Sharp PA, Bartel DP, Jaenisch R, Young RA: Connecting microRNA genes to the core transcriptional regulatory circuitry of embryonic stem cells. Cell. 2008, 134: 521-533.PubMed CentralView ArticlePubMed
- Ozsolak F, Poling LL, Wang Z, Liu H, Liu XS, Roeder RG, Zhang X, Song JS, Fisher DE: Chromatin structure analyses identify miRNA promoters. Genes Dev. 2008, 22: 3172-3183.PubMed CentralView ArticlePubMed
- Chang LW, Payton JE, Yuan W, Ley TJ, Nagarajan R, Stormo GD: Computational identification of the normal and perturbed genetic networks involved in myeloid differentiation and acute promyelocytic leukemia. Genome Biol. 2008, 9: R38-PubMed CentralView ArticlePubMed
- Sethupathy P, Megraw M, Hatzigeorgiou AG: A guide through present computational approaches for the identification of mammalian microRNA targets. Nat Methods. 2006, 3: 881-886.View ArticlePubMed
- Lewis BP, Burge CB, Bartel DP: Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell. 2005, 120: 15-20.View ArticlePubMed
- Krek A, Grun D, Poy MN, Wolf R, Rosenberg L, Epstein EJ, MacMenamin P, da Piedade I, Gunsalus KC, Stoffel M, Rajewsky N: Combinatorial microRNA target predictions. Nat Genet. 2005, 37: 495-500.View ArticlePubMed
- John B, Enright AJ, Aravin A, Tuschl T, Sander C, Marks DS: Human MicroRNA targets. PLoS Biol. 2004, 2: e363-PubMed CentralView ArticlePubMed
- Martinez NJ, Walhout AJ: The interplay between transcription factors and microRNAs in genome-scale regulatory networks. BioEssays. 2009, 31: 435-445.PubMed CentralView ArticlePubMed
- Chang LW, Nagarajan R, Magee JA, Milbrandt J, Stormo GD: A systematic model to predict transcriptional regulatory mechanisms based on overrepresentation of transcription factor binding profiles. Genome Res. 2006, 16: 405-413.PubMed CentralView ArticlePubMed
- Svaren J, Meijer D: The molecular machinery of myelin gene transcription in Schwann cells. Glia. 2008, 56: 1541-1551.PubMed CentralView ArticlePubMed
- Mechta-Grigoriou F, Gerald D, Yaniv M: The mammalian Jun proteins: redundancy and specificity. Oncogene. 2001, 20: 2378-2389.View ArticlePubMed
- Nickols JC, Valentine W, Kanwal S, Carter BD: Activation of the transcription factor NF-kappaB in Schwann cells is required for peripheral myelin formation. Nat Neurosci. 2003, 6: 161-167.View ArticlePubMed
- Parkinson DB, Bhaskaran A, Arthur-Farraj P, Noon LA, Woodhoo A, Lloyd AC, Feltri ML, Wrabetz L, Behrens A, Mirsky R, Jessen KR: c-Jun is a negative regulator of myelination. J Cell Biol. 2008, 181: 625-637.PubMed CentralView ArticlePubMed
- Stevens B, Fields RD: Response of Schwann cells to action potentials in development. Science. 2000, 287: 2267-2271.View ArticlePubMed
- Melton C, Judson RL, Blelloch R: Opposing microRNA families regulate self-renewal in mouse embryonic stem cells. Nature. 2010, 463: 621-626.PubMed CentralView ArticlePubMed
- Ogata T, Iijima S, Hoshikawa S, Miura T, Yamamoto S, Oda H, Nakamura K, Tanaka S: Opposing extracellular signal-regulated kinase and Akt pathways control Schwann cell myelination. J Neurosci. 2004, 24: 6724-6732.View ArticlePubMed
- Cotter L, Ozcelik M, Jacob C, Pereira JA, Locher V, Baumann R, Relvas JB, Suter U, Tricaud N: Dlg1-PTEN interaction regulates myelin thickness to prevent damaging peripheral nerve overmyelination. Science. 2010, 328: 1415-1418.View ArticlePubMed
- Shalgi R, Brosh R, Oren M, Pilpel Y, Rotter V: Coupling transcriptional and post-transcriptional miRNA regulation in the control of cell fate. Aging (Albany NY). 2009, 1: 762-770.
- Hannah R, Joshi A, Wilson NK, Kinston S, Gottgens B: A compendium of genome-wide hematopoietic transcription factor maps supports the identification of gene regulatory control mechanisms. Exp Hematol. 2011, 39: 531-541.View ArticlePubMed
- Ramsey SA, Knijnenburg TA, Kennedy KA, Zak DE, Gilchrist M, Gold ES, Johnson CD, Lampano AE, Litvak V, Navarro G, Stolyar T, Aderem A, Shmulevich I: Genome-wide histone acetylation data improve prediction of mammalian transcription factor binding sites. Bioinformatics. 2010, 26: 2071-2075.PubMed CentralView ArticlePubMed
- Viswanathan SR, Daley GQ, Gregory RI: Selective blockade of microRNA processing by Lin28. Science. 2008, 320: 97-100.PubMed CentralView ArticlePubMed
- Prukop T, Nave KA, Sereda MW, Meyer zu Horste G: Myelin disorders: Causes and perspectives of Charcot-Marie-Tooth neuropathy. J Mol Neurosci. 2006, 28: 77-88.View ArticlePubMed
- Severin J, Waterhouse AM, Kawaji H, Lassmann T, van Nimwegen E, Balwierz PJ, de Hoon MJ, Hume DA, Carninci P, Hayashizaki Y, Suzuki H, Daub CO, Forrest AR: FANTOM4 EdgeExpressDB: an integrated database of promoters, genes, microRNAs, expression dynamics and regulatory interactions. Genome Biol. 2009, 10: R39-PubMed CentralView ArticlePubMed
- Mikkelsen TS, Ku M, Jaffe DB, Issac B, Lieberman E, Giannoukos G, Alvarez P, Brockman W, Kim TK, Koche RP, Lee W, Mendenhall E, O’Donovan A, Presser A, Russ C, Xie X, Meissner A, Wernig M, Jaenisch R, Nusbaum C, Lander ES, Bernstein BE: Genome-wide maps of chromatin state in pluripotent and lineage-committed cells. Nature. 2007, 448: 553-560.PubMed CentralView ArticlePubMed
- Wei G, Wei L, Zhu J, Zang C, Hu-Li J, Yao Z, Cui K, Kanno Y, Roh TY, Watford WT, Schones DE, Peng W, Sun HW, Paul WE, O’Shea JJ, Zhao K: Global mapping of H3K4me3 and H3K27me3 reveals specificity and plasticity in lineage fate determination of differentiating CD4+ T cells. Immunity. 2009, 30: 155-167.PubMed CentralView ArticlePubMed
- Robertson AG, Bilenky M, Tam A, Zhao Y, Zeng T, Thiessen N, Cezard T, Fejes AP, Wederell ED, Cullum R, Euskirchen G, Krzywinski M, Birol I, Snyder M, Hoodless PA, Hirst M, Marra MA, Jones SJ: Genome-wide relationship between histone H3 lysine 4 mono- and tri-methylation and transcription factor binding. Genome Res. 2008, 18: 1906-1917.PubMed CentralView ArticlePubMed
- Zhao XD, Han X, Chew JL, Liu J, Chiu KP, Choo A, Orlov YL, Sung WK, Shahab A, Kuznetsov VA, Bourque G, Oh S, Ruan Y, Ng HH, Wei CL: Whole-genome mapping of histone H3 Lys4 and 27 trimethylations reveals distinct genomic compartments in human embryonic stem cells. Cell Stem Cell. 2007, 1: 286-298.View ArticlePubMed
- Pan G, Tian S, Nie J, Yang C, Ruotti V, Wei H, Jonsdottir GA, Stewart R, Thomson JA: Whole-genome analysis of histone H3 lysine 4 and lysine 27 methylation in human embryonic stem cells. Cell Stem Cell. 2007, 1: 299-312.View ArticlePubMed
- Araki Y, Wang Z, Zang C, Wood WH, Schones D, Cui K, Roh TY, Lhotsky B, Wersto RP, Peng W, Becker KG, Zhao K, Weng NP: Genome-wide analysis of histone methylation reveals chromatin state-based regulation of gene transcription and function of memory CD8+ T cells. Immunity. 2009, 30: 912-925.PubMed CentralView ArticlePubMed
- Barski A, Cuddapah S, Cui K, Roh TY, Schones DE, Wang Z, Wei G, Chepelev I, Zhao K: High-resolution profiling of histone methylations in the human genome. Cell. 2007, 129: 823-837.View ArticlePubMed
- Cheung I, Shulha HP, Jiang Y, Matevossian A, Wang J, Weng Z, Akbarian S: Developmental regulation and individual differences of neuronal H3K4me3 epigenomes in the prefrontal cortex. Proc Natl Acad Sci USA. 2010, 107: 8824-8829.PubMed CentralView ArticlePubMed
- Guenther MG, Frampton GM, Soldner F, Hockemeyer D, Mitalipova M, Jaenisch R, Young RA: Chromatin structure and gene expression programs of human embryonic and induced pluripotent stem cells. Cell Stem Cell. 2010, 7: 249-257.PubMed CentralView ArticlePubMed
- Kolbe D, Taylor J, Elnitski L, Eswara P, Li J, Miller W, Hardison R, Chiaromonte F: Regulatory potential scores from genome-wide three-way alignments of human, mouse, and rat. Genome Res. 2004, 14: 700-707.PubMed CentralView ArticlePubMed
- Matys V, Kel-Margoulis OV, Fricke E, Liebich I, Land S, Barre-Dirrie A, Reuter I, Chekmenev D, Krull M, Hornischer K, Voss N, Stegmaier P, Lewicki-Potapov B, Saxel H, Kel AE, Wingender E: TRANSFAC and its module TRANSCompel: transcriptional gene regulation in eukaryotes. Nucleic Acids Res. 2006, 34: D108-D110.PubMed CentralView ArticlePubMed
- Portales-Casamar E, Thongjuea S, Kwon AT, Arenillas D, Zhao X, Valen E, Yusuf D, Lenhard B, Wasserman WW, Sandelin A: JASPAR 2010: the greatly expanded open-access database of transcription factor binding profiles. Nucleic Acids Res. 2010, 38: D105-D110.PubMed CentralView ArticlePubMed
- Newburger DE, Bulyk ML: UniPROBE: an online database of protein binding microarray data on protein-DNA interactions. Nucleic Acids Res. 2009, 37: D77-D82.PubMed CentralView ArticlePubMed
- Chun S, Fay JC: Identification of deleterious mutations within three human genomes. Genome Res. 2009, 19: 1553-1561.PubMed CentralView ArticlePubMed
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.