- Research article
- Open Access
Reprogramming of lysosomal gene expression by interleukin-4 and Stat6
BMC Genomicsvolume 14, Article number: 853 (2013)
Lysosomes play important roles in multiple aspects of physiology, but the problem of how the transcription of lysosomal genes is coordinated remains incompletely understood. The goal of this study was to illuminate the physiological contexts in which lysosomal genes are coordinately regulated and to identify transcription factors involved in this control.
As transcription factors and their target genes are often co-regulated, we performed meta-analyses of array-based expression data to identify regulators whose mRNA profiles are highly correlated with those of a core set of lysosomal genes. Among the ~50 transcription factors that rank highest by this measure, 65% are involved in differentiation or development, and 22% have been implicated in interferon signaling. The most strongly correlated candidate was Stat6, a factor commonly activated by interleukin-4 (IL-4) or IL-13. Publicly available chromatin immunoprecipitation (ChIP) data from alternatively activated mouse macrophages show that lysosomal genes are overrepresented among Stat6-bound targets. Quantification of RNA from wild-type and Stat6-deficient cells indicates that Stat6 promotes the expression of over 100 lysosomal genes, including hydrolases, subunits of the vacuolar H+ ATPase and trafficking factors. While IL-4 inhibits and activates different sets of lysosomal genes, Stat6 mediates only the activating effects of IL-4, by promoting increased expression and by neutralizing undefined inhibitory signals induced by IL-4.
The current data establish Stat6 as a broadly acting regulator of lysosomal gene expression in mouse macrophages. Other regulators whose expression correlates with lysosomal genes suggest that lysosome function is frequently re-programmed during differentiation, development and interferon signaling.
Cells must be able to flexibly adjust the structural and functional capacity of their compartments in order to adapt to stress or changing nutrients, to assume specialized tissue functions and to maintain homeostasis.
The biogenesis of cellular organelles involves the assembly and targeting of numerous proteins and membrane lipids, and often these processes are orchestrated by transcription factors whose activities are adjusted in response to stress or developmental cues. While much is known regarding the regulation of lipids, mitochondria, peroxisomes and the ER [1–6], understanding the transcriptional regulation of lysosomal function remains less advanced.
Lysosomes are defined by acidic luminal pH, characteristic membrane proteins and lipids, and the presence of multiple acidic hydrolases that catalyze the degradation of material reaching the compartment through fluid-phase endocytosis, phagocytosis or autophagy [7–10]. Abnormalities of lysosomal function, content, number, morphology or gene expression are characteristic of multiple inherited lysosomal storage diseases, of cellular senescence, organismal ageing, atherosclerosis, Alzheimer’s and other neurodegenerative diseases [11–17]. Ectopic secretion of lysosomal proteases can lead to excessive extracellular matrix degradation, which in turn contributes to metastasis, emphysema, atherosclerosis, arthritis, osteoporosis and the formation of aneurysms [14, 18–20].
Large-scale gene expression correlation analyses have shown that a number of lysosomal genes form coordinated clusters, or synexpression groups, suggesting that expression of these targets is co-regulated under varying conditions [21–23]. Sardiello et al. performed a pattern search of lysosomal promoters, leading to the identification of a specific E-box, which was found to be recognized by a basic helix-loop-helix transcription factor called TFEB [21, 23]. Ectopically expressed TFEB causes an upregulation of multiple lysosomal genes, leading to increased numbers of lysosomes, enhanced degradation of endocytic substrates, and lysosomal exocytosis [21, 24].
Transcriptional regulation of lysosomal function has been studied mainly during autophagy, and in this context several transcription factors have been shown to play roles in lysosomal gene regulation, including GATA-1 , FoxO3  and TFEB [26–29].
Lysosomal substrates of extracellular origin impose a particular load on macrophages and other phagocytic myeloid cells that process microbes, senescent cells and effete tissue material [11, 30]. How the degradative capacity of lysosomes in such cells is regulated during stress and differentiation remains poorly understood.
Here, we used expression correlation analyses to search for novel regulators of lysosome-specific genes. We found that transcription factors whose expression correlates with lysosomal genes are often involved in differentiation, embryonic development and interferon signaling. The strongest candidate that emerged from our computations was Signal Transducer and Activator of Transcription-6 (Stat6), a transcription factor regulated by IL-4 and IL-13. The roles of IL-4 and Stat6 in modulating lysosomal gene expression were evaluated in a primary cell culture model of alternatively activated mouse macrophages using data based on gene expression profiling, quantitative PCR and chromatin immunoprecipitations. Results obtained with macrophages from wild-type and Stat6-deficient mice demonstrate that Stat6 positively regulates a large number of lysosomal genes in an IL-4-dependent manner.
Identification of transcriptional networks through correlation analysis
Previous studies have shown that the mRNA levels of transcriptional regulators are often predictive of the expression of their target genes [31–36]. Based on this premise, we asked whether mRNA correlation analyses across multiple datasets might reveal novel regulators of lysosomal gene expression.
Calculations were performed using expression profiles based on specific mouse and human Affymetrix microarray platforms for which large numbers of independent datasets are available at the NCBI GEO repository . We then processed these files to generate average expression values for named, full-length mRNAs. A list of known transcription factors was assembled from gene ontology (GO) annotations and the literature [38, 39].
To verify the usefulness of the processed expression data for extracting transcriptional regulators, we initially interrogated the datasets for two pathways whose regulation is already well understood. We began by calculating a matrix of Pearson correlations between 19 mouse genes in the cholesterol biosynthesis pathway and 1,683 known transcription factors. The results were aggregated according to transcription factor and ranked. The resulting list was led by Srebf2, which encodes the transcription factor SREBP-2, known in fact to be a pivotal regulator of cholesterol metabolism and of the genes in the reference set (Figure 1A; Additional file 1) [6, 40].
To further trial this strategy on a pathway for organelle biogenesis, we selected a group of 97 highly coordinated ER genes and found that this cluster correlates most strongly with Xbp1, an established master regulator of the ER stress response and of ER biogenesis (Figure 1B; Additional files 1, 2 and 3) .
In a complementary approach, Srebf2 and Xbp1 were each used as reference to calculate expression correlations with 16,771 mouse genes. Of the 10 genes that correlated most strongly with Srebf2, five are involved in cholesterol metabolism (p = 2.8E-09, hypergeometric test), and of the 10 genes most strongly correlated with Xbp1 nine encode proteins associated with the ER (p = 3.2E-10) (Additional file 4). These values are similar to results returned by the Netview tool, which returns lists of ‘nearest neighbors’ based on co-occurrence in expression quantile groups ; according to Netview, the sets of 10 nearest neighbors for Srebf2 and Xbp1 each include five genes associated with cholesterol metabolism or the ER, respectively (data not shown). Taken together, the above results confirm that correlation analyses of co-regulated gene groups across our processed datasets can identify transcription factors that coordinate their expression.
Correlation analysis of lysosomal gene expression
We next asked which transcription factors might correlate with the expression of lysosomal genes. Calculations were performed for 1066 mouse and 1412 human DNA-binding transcription factors for which high-quality data are available in the processed microarray datasets. In each case the 500 highest-ranking correlators were then analyzed for GO set enrichment using the Bioconductor GOstats package . The resulting tables were searched for the terms ‘lysosome’ or ‘vacuole’, returning 49 transcription factors that scored positive at a significance cutoff of p ≤ 0.001 and according to both mouse and human datasets (Additional file 5; a searchable database of the complete results is available at http://genecan.sgul.ac.uk). The 49 transcriptional regulators include two factors, MITF and TFEB, that have previously been shown to direct the expression of lysosomal genes during differentiation and autophagy, respectively [26–29, 42]. Further validating the data is the presence of 7 regulators that are known to physically associate with the endomembrane system (ATF6, HCLS1, LASS2, CREB3, NFE2L1, NFE2L2, and TSG101). A majority (65%), however, have been implicated during embryonic development or differentiation (CEBPA, CEBPB, CEBPD, DDIT3, EGR2, EPAS1, FOS, HCLS1, HDAC5, HHEX, IKZF1, IRF1, IRF8, LMO2, LYL1, MAF, MAFB, MAFG, MITF, MNT, NFE2L2, NR1H2, NR1H3, PPARG, RELA, SPIB, STAT1, STAT3, STAT5A, STAT6, TFEB, TSG101). Also prominent are transcription factors involved in interferon signaling (CIITA, IRF1, IRF2, IRF5, IRF8, IRF9, NR1H2, NR1H3, PPARG, STAT1, STAT2).
The above results suggest that lysosomal gene sets are re-programmed in the context of different transcriptional networks. We therefore sought to identify subsets of lysosomal genes that are likely to be coordinately regulated. Correlations were calculated among a list of 269 lysosomal genes that had been assembled from GO annotations and reviews of the lysosomal proteome [38, 43, 44]. Computations were performed across 1,444 mouse datasets, and the genes were clustered hierarchically according to average correlation coefficients. Inspection of the resulting dendrogram and heat map identify three principal clusters, suggesting that the genes in each of these groups are often controlled collectively (Figure 2; Additional files 6 and 7).
Cluster 1 consists of 38 genes encoding lysosomal proteins whose functional profiles are generally similar to that of the entire lysosomal gene set. However, an unexpectedly large number of Cluster 1 genes (58%) have also been found at the plasma membrane (GO:0044459) or extracellularly (GO:0005576); of lysosomal genes outside of cluster 1, only 33% fall into either of these categories.
Cluster 2 consists of 41 genes that cover a range of different lysosomal functions, but subunits of the vacuolar H+ ATPase (Atp6ap2, Atp6v1a, Atp6v1b2, Atp6v1c1, Atp6v1d, Atp6v1g1, Atp6v1h) and components of the endo/lysosomal trafficking machinery (Lamp2, M6pr, Rab14, Rab7, Rab9, Snap23, Vamp7, Vps4b) are particularly prominent. The expression of genes in Cluster 1 is correlated negatively with 78% of Cluster 2, suggesting that these gene sets are frequently regulated reciprocally.
Cluster 3, the largest coherent group, includes 128 lysosomal genes whose protein products are involved in all aspects of lysosome physiology, including hydrolysis, acidification, transport and antigen presentation.
Collectively, the preceding analyses support the conclusion that distinct subsets of lysosomal genes can be coordinately regulated, hinting at the existence of dedicated transcriptional networks that control the expression of these clusters. The relatively low average correlation values indicate that such networks would be active only in a subset of physiological contexts.
Transcription factors co-regulated with lysosomal syn-expression groups
Next, in order to explore the contexts and possible regulation of the three lysosomal gene clusters, each group was used as reference set for correlation analyses with known transcription factors across both mouse and human datasets. The complete results of these calculations are given in Additional file 7, and the ten highest-ranking transcription factors are listed in Figure 3.
A large proportion of the transcription factors that correlate with Cluster 1 are known to play roles in embryonic development and morphogenesis (CRX, GRHL2, HNF1A, HOXC6, MESP2, PAX8, SIM1, SMYD1, TBX19, TBX6; GO:0032502). Based on Pubmed literature searches, none have been associated with the regulation of lysosomal or vacuolar function. Two genes, Pax8 and Smyd1, are common to both the mouse and human top-10 lists. Pax8 is important for thyroid and kidney development [45, 46]. Twenty-nine percent of Cluster 1 genes (Aldob, Aqp2, Atp6v0a4, Atp6v1b1, Atp6v1c2, Cckar, Cubn, Dnase1, Kcne1, Slc30a2, Sult1c2) are highly expressed in mouse kidney (> 2 standard deviations above the mean) , which may help explain the moderate correlation of Pax8 with the Cluster 1 gene set. Smyd1 encodes a skeletal muscle and heart-specific histone methyltransferase, also known as Bop, that acts as a transcriptional repressor [48, 49]. Whether Smyd1 contributes to the repression of the lysosomal Cluster 2 genes is currently not known.
Five of the top-10 transcription factors that correlate with Cluster 2 (NFE2L2, HBP1, XBP1, HIF1A, CTNNB1) have been linked to both oxidative stress and autophagy. The intersection of the mouse and human regulators contains three genes (NFE2L2, HBP1 and RNF6), with NFE2L2 ranking highest. The NFE2L2 gene encodes the protein nuclear factor-erythroid 2-related factor 2 (Nrf2), a basic-leucine zipper transcription factor that activates genes with antioxidant response elements (AREs) under conditions of oxidative stress . Based on published chromatin immunoprecipitation-sequencing (ChIP-Seq) data from mouse embryonic fibroblasts (MEFs), Nrf2 binds to the promoters of five of the cluster 2 genes (CD164, CLN5, CTSO, SCPEP1, TMEM55A) (p = 0.014, hypergeometric test) and to eight additional lysosomal genes . In lymphoid cells Nrf2 binds to the promoter of one of the cluster 2 genes (IDS) plus nine other lysosomal genes , with FNBP1 and GABARAPL1 being recognized in both MEFs and lymphoid cells. According to published microarray studies comparing wild-type and Nrf2-deficient mouse tissues, the expression of only a small and varying number of Cluster 2 genes are affected by Nrf2 in liver, small intestine and prostate [53–56]. Consistent across independent studies is the Nrf2-dependent regulation of Ctsb (cathepsin B) in liver, small intestine and prostate [54, 56]; moreover, Nrf2 binds close to the Scpep1 (serine carboxypeptidase 1) gene in MEFs and regulates its expression in liver and small intestine [51, 54]. In summary, Nrf2 regulates a small number of lysosomal genes in a tissue-specific manner, but available evidence does not yet support the concept that Nrf2 accounts for the coordinate regulation of a majority Cluster 2 genes.
The transcription factor whose expression profiles correlated most strongly with those of the lysosomal Cluster 3 was mouse Stat6, and following an analogous analysis of human datasets, STAT6 ranked near the top as well (Figure 3). Also correlating strongly with Cluster 3 are other regulators of immune function (Irf3, Nr1h2, Rela, Stat2, Stat3) and transcription factors bound to the membrane of the ER (Atf6, Creb3, Creb3l2, Lass2, Nfe2l1).
Given its strong correlation with the largest lysosomal cluster, Stat6 was chosen for further in-depth analyses.
As a cutoff-free approach to identifying which GO categories best correlate with Stat6 we used the Gene Set Enrichment Analysis (GSEA) tool developed by Subramanian et al. . GSEA calculates enrichment scores that reflect to what degree members of a particular GO category are concentrated at the extreme of a ranked gene list. The method also defines a ‘leading-edge’ subset of genes that account for the core of the enrichment score. The greatest enrichment scores for genes that correlate with mouse Stat6 were seen for the GO terms ‘lysosomal membrane’ and ‘lysosome’ (nominal p = 0; Figure 4B and C; Additional file 8). In the human datasets the highest-scoring categories are related to type I interferon and Toll-like receptor signaling; however, the GO sets ‘lysosomal membrane’ and ‘lysosome’ also ranked high (nominal p = 0; Figure 4G and H; Additional file 8). The ‘leading-edge’ subsets for the GO term ‘lysosome’ contained 134 mouse and 126 human lysosomal genes, respectively (Figure 4C and H), of which 98 were common to both sets (73-78% overlap; Additional file 8). These results confirm that Stat6 mRNA levels are often coordinated with those of multiple lysosomal genes.
IL-4 and Stat6 regulate lysosomal gene expression in macrophages
Stat6 is a member of the ‘signal transducer and activator of transcription’ family and expressed in most tissues [47, 59]. The principal activators of Stat6 are IL-4 and IL-13, whose binding to cognate receptor complexes leads to recruitment of JAKs and JAK-mediated Stat6 phosphorylation, whereupon Stat6 dimerizes, moves to the nucleus and binds to specific promoter elements [60, 61]. In addition to being stimulated by IL-4 and IL-13, Stat6 can become active in response to other cytokines and certain pathogens [60, 62, 63]. To obtain a detailed view of lysosomal gene regulation by IL-4 and Stat6, we analyzed IL-4-induced changes of gene expression in macrophages from wild-type and Stat6-deficient mice . Detailed results are given in Additional file 9 and are graphically summarized in Figure 5. In the absence of IL-4 the Stat6 genotype had no effect on lysosomal gene expression, as would have been expected. However, in cells exposed to IL-4 an absence of Stat6 was associated with significant (p ≤ 0.05) changes in the expression of 149 (55%) known lysosomal genes. Comparing gene expression in Stat6-deficient versus wild-type cells, 46 lysosomal genes were increased and 103 lysosomal genes were decreased (Additional file 9).
The 103 lysosomal genes that are positively regulated by Stat6 encode proteins involved in all aspects of lysosomal function, including 39% of known lysosomal hydrolases, most subunits of the vacuolar H+ ATPase, components of the vesicular transport machinery and others (Figure 5A). Of these 103 genes, 53% are among the leading edge of mouse lysosomal genes whose expression correlates most significantly with Stat6 (Figure 4D; p = 0.031).
In wild-type macrophages exposed to IL-4, fifty-one lysosomal genes were induced and 60 lysosomal genes were suppressed (Figure 5B and C), reflecting a complex reconfiguration of gene expression in this cell type [58, 64]. The contribution of Stat6 to lysosomal gene expression, however, is generally positive: in Stat6-deficient cells the activating effects of IL-4 were almost completely abolished (Figure 5B), whereas the suppressing effects of IL-4 largely persisted (Figure 5C). Surprisingly, 82 lysosomal genes were suppressed by IL-4 in Stat6-deficient, but not in wild-type cells (Figure 5C), indicating that Stat6 usually antagonizes the inhibitory effect of IL-4 on the expression of these genes.
Next we sought to verify the microarray data through an alternative method. For this, we cultured bone marrow macrophages from wild-type and Stat6-deficient mice with IL-4 for 6 hours, 24 hours or 10 days, and RNA was extracted for reverse transcription and quantitative PCR. Most targets were chosen from among the lysosomal genes that, according to Figure 5D, were decreased at least two-fold in Stat6-/- versus Stat6+/+ macrophages grown in the presence of IL-4. Expression of the Ppia gene (encoding cyclophilin A) remained relatively unchanged under the four conditions and was used to normalize values for the remaining genes. The Arg1 gene (encoding arginase I), an established target of IL-4/Stat6 signaling [66, 67], was strongly induced by IL-4 in wild-type cells, but its expression remained low in Stat6-deficient cells (Figure 6). Similar results were obtained for most of the lysosomal genes that were analyzed, with the strongest regulation by IL-4 seen for Mmp13, Acp5, Ctsk, Atp6v0d2, Ifi30 and Ctsl. Only Myo7a and Hgsnat, which appeared moderately induced by IL-4 based on microarray analyses (Figure 5D), changed much less significantly according to the qPCR data (data not shown). Overall, however, the qPCR results are in good agreement with the microarray data and confirm that IL-4 controls the expression of multiple lysosomal genes in a Stat6-dependent manner.
Stat6 binds close to lysosomal gene loci
To explore whether Stat6 might bind to genomic loci encoding lysosomal genes, we interrogated publicly available ChIP-seq data from IL-4 treated macrophages . According to peak coordinates from macrophages grown with IL-4 for 4 hours, Stat6 binds in the vicinity of 4,520 named genes (± 5 kb of transcription start site), 153 of which have been annotated as encoding lysosomal proteins (p = 6E-62, hypergeometric test).
As an unbiased approach to determining whether genes associated with specific functions are statistically overrepresented among the 4,520 Stat6-bound targets, the list was subjected to gene set enrichment analysis using GOstats . Excluding categories with more than 1000 members, the three highest-ranking categories returned by this analysis were ‘ribosomal subunit’ (GO:0044391; p < 6E-11), ‘cytosolic part’ (GO:0044445; p < E-10) and ‘lysosome’ (GO:0005764; p < 2E-10) (Additional file 10). These data indicate that genes encoding lysosomal proteins make up a significant fraction of the genomic loci that Stat6 physically interacts with.
Of the 153 lysosomal genes whose loci are bound by Stat6 according to the current measure, 46% are among the genes whose mRNA levels were significantly reduced in Stat6-deficient macrophages grown with IL-4 (see Figure 5 and Additional file 9). However, this fraction increases to 72% if Stat6 peaks located anywhere inside a gene are also counted as targets.
In order to study the kinetics of Stat6 binding to lysosomal loci, Stat6 ChIP-seq peaks from different time points after IL-4 addition were aligned to genomic maps of the genes whose mRNA levels were analyzed in Figure 6. As expected, no IL-4 induced Stat6 binding could be seen around the Ppia locus, whereas several Stat6 peaks appeared at the Arg1 gene as early as 15 min after IL-4 addition (Figure 7). Similarly, IL-4 rapidly induced the binding of Stat6 to all of the 10 lysosomal genes that are shown, and binding patterns remained relatively stable for the subsequent 4 hours. At the Atp6v1h, Ctsl and Ifi30 loci additional Stat6 peaks appear at later time points, suggesting contributions of co-factors whose activities might increase with delayed kinetics. To independently verify the Stat6 ChIP-seq data, several peaks were selected for verification by ChIP-qPCR, and in each case the results were confirmatory (Additional file 11). Collectively, mRNA quantifications and ChIP-seq data show that Stat6-mediated activation of lysosomal gene expression coincides with the binding of Stat6 to the affected promoters, strongly suggesting that Stat6 plays a direct role in augmenting the transcription of these targets.
Stat6 sites at lysosomal genes are near active chromatin
Actively transcribed genes are often controlled through promoter and enhancer elements characterized by binding sites for multiple, tissue-specific transcription factors and by the presence of nucleosomes with activating epigenetic modifications such as monomethylated histone H3 lysine-4 (H3K4me1) and acetylated H3 lysine-27 (H3K27ac) [68, 69]. In macrophages, cell type-specific gene expression depends in part on Pu.1, an ETS-domain transcription factor required for the development of myeloid and B-lymphoid cell types [70–72]. Pu.1 has previously been shown to cooperate with Stat6 in controlling the expression of several genes . To characterize Stat6-bound genomic regions near lysosomal genes, we studied publicly available ChIP-seq data to explore the co-localization of Stat6 with H3K4me1, H3K27ac and Pu.1 in samples from wild-type and Stat6-deficient macrophages that had been grown ± IL-4 for 4 hours . As a point of reference we selected 196 Stat6 peaks that were observed consistently after 1, 2 and 4 hours of IL-4 exposure and within 5 kb of the transcription start sites of lysosomal genes (133 lysosomal genes; range, 1–4 peaks per gene; average, 1.5 peaks per gene) (Additional file 12). Of these 196 peaks, 75% were marked by the presence of H3K4me1, H3K27ac and Pu.1 in wild-type cells grown without IL-4, showing that Stat6 predominantly binds to regions characterized by activating epigenetic markers (Figure 8 and Additional file 12).
In macrophages, latent enhancers have been defined as sites devoid of H3K4me1, H3K27ac and Pu.1 that acquire H3K4me1 upon stimulation , and none of the Stat6 peaks near lysosomal genes fall into this category (Additional file 12).
Enhancer elements containing H3K4me1, but no H3K27ac, have been described as “poised” for activation [73, 74]. Among the regions bound by Stat6 near lysosomal genes, 86% (169/196) have pre-existing nucleosomes containing H3K4me1, but only 4% (8/196) contain H3K4me1 alone and acquire H3K27ac upon IL-4 exposure; at 5 of these sites (near Atp6v0d2, Htt, Ids, Plekhf1, Srgn) the IL-4 induced acetylation of H3K27 was dependent on Stat6 (Figure 8 and Additional file 12).
H3K27ac marks are usually concentrated near promoters and their presence is predictive of gene expression . Inspection of the data in Additional file 12 shows that 89% (174/196) of the lysosomal Stat6 peak regions already contain H3K27ac in untreated wild-type cells. We identified 9 lysosomal Stat6 peak regions at which H3K27ac was induced by IL-4, and this modification was Stat6-dependent near the same 5 genes at which IL-4/Stat6 promote monomethylation of H3K4 (Atp6v0d2, Htt, Ids, Plekhf1, Srgn), indicating that Stat6 coordinates activating chromatin modifications at these promoters. Two of the affected targets, Atp6v0d2 and Plekhf1, are among the lysosomal genes whose mRNA levels are most strongly regulated by IL-4 and Stat6 (Figures 5, 6, 7 and 8). At many of the lysosomal genes whose expression is controlled by Stat6, IL-4 exposure led to a pronounced expansion of pre-existing H3K27ac marks around the Stat6 peaks (e.g. Arg1, Plekhf1, Atp6v0a1, Atp6v1b2, Atp6v1h, Ctsl, Acp5), and at most of these sites the spreading of H3K27ac was dependent on Stat6 (Figures 7 and 8). In summary, Stat6 binds near lysosomal genes at sites marked by active chromatin configurations, and at several lysosomal genes Stat6 contributes to the establishment or expansion of these markers. These results further strengthen the concept that Stat6 plays pivotal roles in activating the expression of lysosomal genes in macrophages.
In the current study we used gene expression correlation analyses to search for DNA-binding transcription factors whose activities might relate to lysosomal function. The strongest candidate that emerged from our data was Stat6, a widely expressed transcription factor that is activated in response to specific cytokines and pathogens. In support of a role for Stat6 upstream of lysosomal gene expression we demonstrate that IL-4 induced Stat6 positively regulates a wide range of lysosomal genes in mouse macrophages.
Our in silico strategy was based on a large body of work showing that the expression of transcription factors and their target genes are often positively related [31–36]. If the expression of a group of lysosomal genes was transcriptionally coordinated through the action of a transcription factor, we reasoned, it might be possible to identify such a regulator through correlation analyses across a great number of microarray data. Association of transcriptional regulators with their target genes, based on expression data, has previously been demonstrated using a number of methods, including mutual information scoring [33, 76], probabilistic methods , differential equations , Gibbs sampling  and Spearman correlations . Here, we applied a simplified clustering approach by calculating Pearson correlations between lists of known transcription factors and potential target genes. Correlation values were averaged across hundreds of expression datasets, and genes were ranked accordingly. In one round of calculations genes most highly correlated with individual known transcription factors were screened for GO set enrichment which, at a cutoff of p ≤ 0.001, produced a list of 49 DNA-binding regulators whose expression is highly correlated with lysosomal genes. In a complementary approach transcription factors were sorted according to their correlation with groups of co-regulated lysosomal genes.
The list of 49 regulators includes MITF and TFEB, two proteins that have been previously identified as regulators of lysosomal gene expression in the context of autophagy, and differentiation [26–29, 42]. Further validating our approach was the presence of proteins that are known to be physically associated with compartments of the endomembrane system, including ATF6, HCLS1, LASS2, CREB3, NFE2L1, NFE2L2, and TSG101. Strikingly, 65% of the 49 regulators have been implicated during embryonic development or differentiation, and 22% are involved in interferon signaling, suggesting that these processes are frequently accompanied by reconfiguration of the lysosomal system. Augmented expression of lysosomal genes during development and differentiation might support the generation of tissue-specific, lysosome-related organelles . Moreover, endo/lysosomal factors are increasingly being implicated during cell migration and polarity, both important aspects of development, as well as of wound healing and cancer . Positive correlation between lysosomal and interferon signaling genes points to the front-line role of lysosomes in the defense against pathogens [81–83].
Cluster analysis of all known lysosomal genes led to the identification of several subgroups whose expression appears to be coordinated. Unexpectedly, one cluster was characterized by being correlated negatively with a large fraction of other lysosomal genes, indicating that the expression of these groups is often mutually exclusive. The mechanistic origin and physiological relevance of these results is not yet understood. The largest lysosomal cluster includes large fractions of known acidic hydrolases and vacuolar H+ ATPase subunits, supporting the impression from previous studies that core lysosomal functions are transcriptionally coordinated [21–23]. The transcription factor most strongly correlated with this cluster was Stat6. A member of the signal transducer and activator of transcription family , Stat6 is activated predominantly through JAK-mediated tyrosine phosphorylation in response to IL-4 or IL-13. Stat6-deficient mice are viable, but suffer defects in the differentiation of several immune and non-immune cell types, exhibit increased susceptibility to infection by certain viral, bacterial and helminthic pathogens and show attenuated allergic responses . Conversely, ectopically activated Stat6 is frequently found in tumor samples [85–87].
In monocytes, Stat6 signaling promotes the differentiation into a class of alternatively activated macrophages . Based on microarray data obtained from primary mouse macrophages cultured with IL-4, we found that the expression of 103 lysosomal genes was dependent on Stat6, reflecting 40% of the known lysosomal proteome and 54% of lysosomal genes expressed in this cell type; 45 of these genes have been associated with human diseases . Of particular interest is the role of Stat6 in controlling the vacuolar H+ ATPase that, by virtue of maintaining the acidic pH in the endo/lysosomal system, is pivotal to all aspects of lysosomal function . Of 15 different subunits and associated factors that make up the vacuolar H+ ATPase [90, 91], 14 were found to be controlled by Stat6, and three subunits (encoded by Atp6v0a1, Atp6v0d2 and Atp6v1b2) were among the lysosomal genes most strongly induced by IL-4.
Previous studies have shown that IL-4 increases the expression of the lysosomal proteases cathepsin L and S in mouse macrophages and of cathepsin S in human bronchial and conjunctival epithelial cells [92–94]; however, the transcription factor responsible for this regulation had not yet been identified. Through analysis of microarray data we found that in mouse macrophages exposure to IL-4 augments the expression of eight lysosomal protease genes, and in 7 of these cases (Ctsk, Ctsl, Ctso, Ctsz, Mmp13, Scpep1, Tpp1) the IL-4 effect was dependent on Stat6. In total, the IL-4/Stat6 system was found to control the expression of 39% of known lysosomal hydrolases, a group that in addition to proteases, include glycosidases, lipases and other degradative enzymes. Genes involved in vesicular targeting to lysosomes and in the movement of substances across the lysosomal membrane are also regulated by IL-4/Stat6. These effects are likely to contribute to the heightened influx of endocytic substrates and the increased capacity for lysosomal degradation that have previously been observed in IL-4 treated macrophages [94–96].
Alternatively activated macrophages have been implicated in tissue repair , and we speculate that IL-4/Stat6-mediated expression of lysosomal enzymes may facilitate the repair-associated turnover of extracellular matrix, for example through secretion of acidic hydrolases into the extracellular space [20, 98], or through intracellular digestion of phagocytosed collagen fibrils . In support of this model, the lysosomal genes that are most strongly affected by IL-4 and Stat6 in macrophages, encoding cathepsins L and K, tartrate-resistant acid phosphatase, collagenase-3 and vacuolar H+ ATPase, have all been shown to play important roles in extracellular matrix degradation [98, 100–102]. Furthermore, Stat6-controlled expression of several lysosomal and extracellular proteases has been implicated in tissue destruction during pulmonary emphysema and is thought to contribute to the invasiveness of glioma tumours [103–105].
In wild-type macrophages IL-4 effects a complex reprogramming of gene expression, with similar numbers of lysosomal genes being induced and suppressed. Stat6, however, appears to mediate only the activating effects of IL-4 on lysosomal mRNAs. In macrophages devoid of Stat6 the induction of lysosomal genes by IL-4 was blocked or severely reduced. On the other hand, in Stat6-/- cells the suppressive effects of IL-4 remained largely unchanged, pointing to an IL-4 induced signaling branch that operates independently of Stat6. Similar interdigitation of IL-4 and Stat6 signaling has been observed in the context of Th2 differentiation [106, 107].
Unexpectedly, the expression of 82 lysosomal genes was reduced by IL-4 in Stat6-deficient, but not in wild-type cells. Stat6 thus acts both to mediate increased expression as well as to counteract an unknown, inhibitory pathway that is triggered by IL-4. These results reveal that the role of Stat6 in this context is significantly broader than might have been expected from the number of IL-4 induced genes alone. The mechanism behind this effect is not yet known. Binding of Stat6 to affected promoters might for example block access or regulation by IL-4-induced inhibitory factors, compensate for the loss of other positive regulators, or counter the effects of repressive epigenetic alterations. Stat6 deficiency in mouse T cells has been shown to cause a marked increase of trimethylated lysine-27 on histone 3 (H3K27me3), indicating that Stat6 supports transcription in part by antagonizing inhibitory chromatin modifications . However, to what degree Stat6 exerts this effect in macrophages still has to be explored in detail.
ChIP data show that about 70% of the lysosomal loci that are regulated by Stat6 in macrophages have nearby Stat6 binding sites, indicating that Stat6 is likely exerting proximal effects in directing the expression of these targets. A similar number of Stat6 sites near lysosomal genes have been identified in a ChIP-seq analysis in mouse Th2 cells; however, very few of these genes were induced on the mRNA level . While phosphorylation on tyrosine 641 is sufficient to allow the binding of Stat6 dimers to cognate DNA elements, Stat6 is unable to activate transcription on its own [106–108], and it must cooperate with other proteins to promote gene expression . In this context, activating chromatin modifications are likely to be important, as most Stat6 peaks overlap with regions of histone H3 acetylation, and the most strongly regulated Stat6 targets experience extensive expansions of H3K27ac marks in an Il-4 and Stat6-dependent manner. Exactly what additional factors are involved in IL-4/Stat6-controlled lysosomal gene expression, and whether they act in concert with or downstream of Stat6 remains to be studied.
Understanding how the structure and function of organelles are molded during embryonic development and differentiation is a major goal of cell and developmental biology. The aim of this study was to identify transcriptional networks that are associated with the re-programming specifically of lysosome-related genes. Through large-scale analyses of published microarray data we identified more than 50 DNA-binding transcription factors whose expression correlates with significant numbers of lysosomal genes. Affiliations identified in this manner indicate that mRNAs for lysosomal genes are frequently modulated in concert with regulators that are active during of differentiation, development, interferon signaling and oxidative stress, suggesting broad re-programming of lysosomal genes in these contexts. Depending on network structure, expression of transcription factors can correlate with their downstream target genes, and for most of the regulators identified here such directing roles in lysosomal gene control remains to be explored. However, Stat6, the strongest candidate emerging from our correlation analysis, was clearly identified as an upstream regulator for a large number of lysosomal genes in IL-4 treated mouse macrophages. According to the effects of IL-4, lysosomal genes can be grouped into three principal categories (Figure 9); lysosomal genes in category I are induced by IL-4 through a Stat6-dependent mechanism; genes in category II tend to be suppressed by IL-4, but this effect is negated in the presence of Stat6; category III genes are suppressed by IL-4 through a pathway operating independently of Stat6. In summary, this work illuminates the principal contexts of lysosomal gene regulation, identifies a novel pathway of lysosomal gene control and advances understanding of the cell and molecular biology of alternative macrophage differentiation.
Cell culture media were from PAA Laboratories (Yeovil, UK) and fetal bovine serum (FBS) was from Sigma Aldrich (Gillingham, UK).
Expression correlation analyses
All data manipulations and calculations were performed with custom Unix/Linux or R version 2..0 scripts . 1,517 microarray data series based on Affymetrix mouse genome 430 2.0 arrays (Mouse_430_2; Geo platform accession number GPL1261) and 1,744 data series based on Affymetrix Human Genome U133 Plus 2.0 arrays (HG_U133_Plus2; GEO platform accession number GPL570), each containing at least 6 samples, were downloaded from the NCBI GEO ftp site . The data in each file were reduced to high-quality probe sets attributable to single, full-length mRNAs (labeled “grade A” in Affymetrix annotation files; Mouse430_2.na32.annot.csv and HG_U133_Plus_2.na32.annot.csv). As the number of probe sets on these platforms varies for individual targets, values for genes represented by more than one probe set were averaged, leaving data for 16,772 mouse and 17,237 human genetic endpoints. Each dataset was then saved as an R data frame in binary format for subsequent analyses. Gene expression correlation matrices were calculated using the R ‘cor’ function (Pearson method).
Data matrices were subjected to hierarchical clustering using the R ‘hclust’ function (complete method), and heat maps were generated with the ‘heatmap.2’ function from the gplots library . Dendrograms were split into clusters using the R ‘cutree’ function.
Microarray data analysis
For the analyses that resulted in Additional file 9 and Figure 5, raw data (cel files) were background-corrected and normalized using the Bioconductor GCRMA package . Only probe sets based on full-length cDNAs, assigned to a single gene and annotated as “grade A” according to Affymetrix annotation files were used for further analyses. Probe IDs were replaced with gene symbols, and data for genes represented by more than one probe set were averaged. Genes with average expression values of less than one standard deviation (SD) under all four conditions were considered not expressed and excluded. In the remaining matrix (10,232 genes) values below 1 SD were set to 1 SD. Variances between pairs of triplicate data were then compared with an F test (R ‘var.test’ function); if the resulting p value was greater than 0.05, an unpaired Student’s t test was performed, otherwise Welch’s t test was used (R ‘t.test’ function). The p values returned by the t tests were corrected for multiple hypothesis testing using the false discovery rate method (R ‘p.adjust’ function). Log (base 2) fold changes were set to zero when p values returned by p.adjust were greater than 0.05.
GO annotations are based on lists that we downloaded from the AmiGO gene ontology site . Lists of known transcription regulators were based on a compilation published by Ravasi et al.  that was extended based on recent GO annotations (GO:0000981, GO:0000982, GO:0000983, GO:0000988, GO:0000989, GO:0001010, GO:0001011, GO:0001071, GO:0001076, GO:0001087, GO:0001133, GO:0003700, GO:0003705, GO:0004879). Lists of lysosomal genes were based on combined mouse and human annotations for the GO terms ‘lysosome’ (GO:0005764) and ‘vacuole’ (GO:0005773), which were extended based on reviews of the lysosomal proteome [38, 43, 44]. In some cases gene lists were further modified by removing genes for which no high-quality data were available in the given data set.
Analysis of published ChIP-seq data
Files with peak coordinates in bed format were downloaded from the NCBI GEO depository [GEO:GSE38379]. The data were annotated using the Bioconductor ChIPpeakAnno package (version 2.6.1.) , and peaks were graphically aligned to genomic loci using R code based on the Gviz (version 1.2.1.), GenomicFeatures (version 1.10.2.) and Lattice packages [113–115].
Promoter sequence analysis
Promoter sequences ± 2 kb relative to transcription start sites were searched for perfect matches to the Stat6 consensus binding motif (TTCNNNNGAA) using the RSAT tool (DNA-pattern method, 0 substitutions) [116, 117].
Stat6-deficient mice (C.129S2-Stat6tm1Gru/J) were obtained from Jackson Laboratories . Animals were housed in Tecniplast blue line IVC cages at a negative pressure of 25pa with 75 air changes an hour and fed a standard CRM diet (SDS Diets Services, Essex, UK). Work involving mice was approved by the UK Home Office and the University of Debrecen Medical and Health Science Center Ethics Committee (license number 120/2009/DE MAB), respectively.
For studies performed in London, cells were cultured at 37°C in an atmosphere of 5% CO2. Medium A refers to RPMI supplemented with 10% FBS, 50 U/ml penicillin and 50 μg/ml streptomycin. To obtain bone marrow macrophages, the femurs of mice were submersed in PBS, crushed with hemostats, filtered through 70-μm cell strainers (BD Falcon), washed with PBS and plated in medium A on untreated 10-cm Petri dishes overnight. Unattached cells were then set up in 6-well plates at 5.5x106 cells per well in medium B plus 20 ng/ml murine recombinant M-CSF (Peprotech) plus or minus murine recombinant IL-4 (Peprotech).
For work performed in Debrecen, cells were isolated and differentiated as previously described . Bone-marrow was flushed from the femur of wild-type C57BI6/J male animals. Cells were purified through a Ficoll-Paque gradient (Amersham Biosciences, Arlington Heights, IL) and cultured in DMEM containing 20% endotoxin-reduced fetal bovine serum and 30% L929 conditioned medium for 5 days.
RNA analysis by qPCR
Total RNA was isolated using Trizol reagent (Invitrogen) and 1–2 μg used as template in 20-μl reverse transcription reactions using Tetro Reverse Transcriptase (Bioline) or a Superscript III CellDirect cDNA synthesis kit (Invitrogen). Quantitative PCR reactions were performed in a Bio-Rad CFX96 thermocycler and set up using a Platinum SYBR Green qPCR Supermix (Invitrogen) or SYBR green dye from Diagenode, each in a total volume of 10 μl containing 0.5 μl cDNA and 200 nM primers. Primer sequences are given in Additional file 13. Standard curves with serial template dilutions were included with each run.
Chromatin immunoprecipitation (ChIP)
ChIP was performed as previously described  with minor modifications. Briefly, cells were crosslinked with DSG (Sigma) for 30 minutes and then with formaldehyde (Sigma) for 10 minutes. After fixation chromatin was sonicated with a Diagenode Bioraptor to generate 200-1000 bp fragments. Chromatin was immunoprecipitated with pre-immune IgG (Millipore, 12–370), or with a polyclonal antibody against STAT6 (Santa Cruz, sc-981). Chromatin antibody complexes were precipitated with anti-IgA paramagnetic beads (Life Technologies). After 6 washing steps complexes were eluted and the crosslinks reversed. DNA fragments were column purified (Qiagen, MinElute). DNA was quantified with a Qubit fluorometer (Invitrogen). Immunoprecipitated DNA was quantified by qPCR and normalized to values obtained after amplification of unprecipitated (input) DNA.
Graphics were generated with custom R scripts, in some cases using extensions provided by the gplots, ggplot2 and other packages as indicated [110, 121]. R-generated graphic files in portable document format (PDF) were further edited in Adobe Illustrator.
Availability of supporting data
A searchable database with the results of transcription factor gene expression correlation analyses is available at http://genecan.sgul.ac.uk.
Laszlo Nagy and Axel Nohturfft share senior authorship.
Gene set enrichment analysis
Acetylated histone H3-lysine-27
monomethylated histone H3 lysine-4
Affymetrix Human Genome U133 Plus 2.0 array
Affymetrix mouse genome 430 2.0 array
quantitative polymerase chain reaction
Signal transducer and activator of transcription 6
Michalik L, Auwerx J, Berger JP, Chatterjee VK, Glass CK, Gonzalez FJ, Grimaldi PA, Kadowaki T, Lazar MA, O’Rahilly S, Palmer CN, Plutzky J, Reddy JK, Spiegelman BM, Staels B, Wahli W: International Union of Pharmacology. LXI. Peroxisome proliferator-activated receptors. Pharmacol Rev. 2006, 58 (4): 726-741. 10.1124/pr.58.4.5.
Jornayvaz FR, Shulman GI: Regulation of mitochondrial biogenesis. Essays Biochem. 2010, 47: 69-84. 10.1042/bse0470069.
Rebelo AP, Dillon LM, Moraes CT: Mitochondrial DNA transcription regulation and nucleoid organization. J Inherit Metab Dis. 2011, 34 (4): 941-951. 10.1007/s10545-011-9330-8.
Ron D, Walter P: Signal integration in the endoplasmic reticulum unfolded protein response. Nat Rev Mol Cell Biol. 2007, 8 (7): 519-529. 10.1038/nrm2199.
Ljubicic V, Joseph AM, Saleem A, Uguccioni G, Collu-Marchese M, Lai RY, Nguyen LM, Hood DA: Transcriptional and post-transcriptional regulation of mitochondrial biogenesis in skeletal muscle: effects of exercise and aging. Biochim Biophys Acta. 2010, 1800 (3): 223-234. 10.1016/j.bbagen.2009.07.031.
Nohturfft A, Zhang SC: Coordination of lipid metabolism in membrane biogenesis. Annu Rev Cell Dev Biol. 2009, 25: 539-566. 10.1146/annurev.cellbio.24.110707.175344.
de Duve C: The lysosome turns fifty. Nat Cell Biol. 2005, 7 (9): 847-849. 10.1038/ncb0905-847.
Pryor PR, Luzio JP: Delivery of endocytosed membrane proteins to the lysosome. Biochim Biophys Acta. 2009, 1793 (4): 615-624. 10.1016/j.bbamcr.2008.12.022.
Yang Z, Klionsky DJ: An overview of the molecular mechanism of autophagy. Curr Top Microbiol Immunol. 2009, 335: 1-32.
Saftig P, Klumperman J: Lysosome biogenesis and lysosomal membrane proteins: trafficking meets function. Nat Rev Mol Cell Biol. 2009, 10 (9): 623-635. 10.1038/nrm2745.
Part 16: Lysosomal disorders. Online Metabolic and Molecular Bases of Inherited Disease. Edited by: Valle D, Beaudet AL, Vogelstein B, Kinzler KW, Antonarakis SE, Ballabio A. Updated in 2013. [http://www.ommbid.com],
Hwang ES, Yoon G, Kang HT: A comparative analysis of the cell biology of senescence and aging. Cell Mol Life Sci. 2009, 66 (15): 2503-2524. 10.1007/s00018-009-0034-2.
de Magalhaes JP, Curado J, Church GM: Meta-analysis of age-related gene expression profiles identifies common signatures of aging. Bioinformatics. 2009, 25 (7): 875-881. 10.1093/bioinformatics/btp073.
Jerome WG: Advanced atherosclerotic foam cell formation has features of an acquired lysosomal storage disorder. Rejuvenation Res. 2006, 9 (2): 245-255. 10.1089/rej.2006.9.245.
Lee JH, Yu WH, Kumar A, Lee S, Mohan PS, Peterhoff CM, Wolfe DM, Martinez-Vicente M, Massey AC, Sovak G, Uchiyama Y, Westaway D, Cuervo AM, Nixon RA: Lysosomal proteolysis and autophagy require presenilin 1 and are disrupted by Alzheimer-related PS1 mutations. Cell. 2010, 141 (7): 1146-1158. 10.1016/j.cell.2010.05.008.
Nixon RA, Yang DS, Lee JH: Neurodegenerative lysosomal disorders: a continuum from development to late age. Autophagy. 2008, 4 (5): 590-599.
Jegga AG, Schneider L, Ouyang X, Zhang J: Systems biology of the autophagy-lysosomal pathway. Autophagy. 2011, 7 (5): 477-489. 10.4161/auto.7.5.14811.
Elias JA, Kang MJ, Crothers K, Homer R, Lee CG: State of the art. Mechanistic heterogeneity in chronic obstructive pulmonary disease: insights from transgenic mice. Proc Am Thorac Soc. 2006, 3 (6): 494-498. 10.1513/pats.200603-068MS.
Lutgens SP, Cleutjens KB, Daemen MJ, Heeneman S: Cathepsin cysteine proteases in cardiovascular disease. FASEB J. 2007, 21 (12): 3029-3041. 10.1096/fj.06-7924com.
Gocheva V, Joyce JA: Cysteine cathepsins and the cutting edge of cancer invasion. Cell Cycle. 2007, 6 (1): 60-64. 10.4161/cc.6.1.3669.
Sardiello M, Palmieri M, di Ronza A, Medina DL, Valenza M, Gennarino VA, Di Malta C, Donaudy F, Embrione V, Polishchuk RS, Banfi S, Parenti G, Cattaneo E, Ballabio A: A gene network regulating lysosomal biogenesis and function. Science. 2009, 325 (5939): 473-477.
Belcastro V, Siciliano V, Gregoretti F, Mithbaokar P, Dharmalingam G, Berlingieri S, Iorio F, Oliva G, Polishchuck R, Brunetti-Pierri N, di Bernardo D: Transcriptional gene network inference from a massive dataset elucidates transcriptome organization and gene function. Nucleic Acids Res. 2011, 39 (20): 8677-8688. 10.1093/nar/gkr593.
Palmieri M, Impey S, Kang H, di Ronza A, Pelz C, Sardiello M, Ballabio A: Characterization of the CLEAR network reveals an integrated control of cellular clearance pathways. Hum Mol Genet. 2011, 20 (19): 3852-3866. 10.1093/hmg/ddr306.
Medina DL, Fraldi A, Bouche V, Annunziata F, Mansueto G, Spampanato C, Puri C, Pignata A, Martina JA, Sardiello M, Palmieri M, Polishchuk R, Puertollano R, Ballabio A: Transcriptional activation of lysosomal exocytosis promotes cellular clearance. Dev Cell. 2011, 21 (3): 421-430. 10.1016/j.devcel.2011.07.016.
Kang YA, Sanalkumar R, O’Geen H, Linnemann AK, Chang CJ, Bouhassira EE, Farnham PJ, Keles S, Bresnick EH: Autophagy driven by a master regulator of hematopoiesis. Mol Cell Biol. 2012, 32 (1): 226-239. 10.1128/MCB.06166-11.
Pena-Llopis S, Vega-Rubin-de-Celis S, Schwartz JC, Wolff NC, Tran TA, Zou L, Xie XJ, Corey DR, Brugarolas J: Regulation of TFEB and V-ATPases by mTORC1. EMBO J. 2011, 30 (16): 3242-3258. 10.1038/emboj.2011.257.
Settembre C, Zoncu R, Medina DL, Vetrini F, Erdin S, Erdin S, Huynh T, Ferron M, Karsenty G, Vellard MC, Facchinetti V, Sabatini DM, Ballabio A: A lysosome-to-nucleus signalling mechanism senses and regulates the lysosome via mTOR and TFEB. EMBO J. 2012, 31 (5): 1095-1108. 10.1038/emboj.2012.32.
Settembre C, Ballabio A: TFEB regulates autophagy: an integrated coordination of cellular degradation and recycling processes. Autophagy. 2011, 7 (11): 1379-1381. 10.4161/auto.7.11.17166.
Settembre C, Di Malta C, Polito VA, Garcia Arencibia M, Vetrini F, Erdin S, Erdin SU, Huynh T, Medina D, Colella P, Sardiello M, Rubinsztein DC, Ballabio A: TFEB links autophagy to lysosomal biogenesis. Science. 2011, 332 (6036): 1429-1433. 10.1126/science.1204592.
Phagocytosis: The host. Edited by: Gordon S. 1999, Amsterdam: Elsevier, [AM Tartakoff (Series Editor): Advances in Cellular and Molecular Biology of Membranes and Organelles, vol 5.]
Carro MS, Lim WK, Alvarez MJ, Bollo RJ, Zhao X, Snyder EY, Sulman EP, Anne SL, Doetsch F, Colman H, Lasorella A, Aldape K, Califano A, Iavarone A: The transcriptional network for mesenchymal transformation of brain tumours. Nature. 2010, 463 (7279): 318-325. 10.1038/nature08712.
Boulesteix AL, Strimmer K: Predicting transcription factor activities from combined analysis of microarray and ChIP data: a partial least squares approach. Theor Biol Med Model. 2005, 2: 23-10.1186/1742-4682-2-23.
Pe’er D, Regev A, Tanay A: Minreg: inferring an active regulator set. Bioinformatics. 2002, 18 (Suppl 1): S258-S267. 10.1093/bioinformatics/18.suppl_1.S258.
Segal E, Shapira M, Regev A, Pe’er D, Botstein D, Koller D, Friedman N: Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data. Nat Genet. 2003, 34 (2): 166-176. 10.1038/ng1165.
Alon U: Network motifs: theory and experimental approaches. Nat Rev Genet. 2007, 8 (6): 450-461. 10.1038/nrg2102.
Serfling E: Autoregulation–a common property of eukaryotic transcription factors?. Trends Genet. 1989, 5 (5): 131-133.
Barrett T, Troup DB, Wilhite SE, Ledoux P, Rudnev D, Evangelista C, Kim IF, Soboleva A, Tomashevsky M, Marshall KA, Phillippy KH, Sherman PM, Muertter RN, Edgar R: NCBI GEO: archive for high-throughput functional genomic data. Nucleic Acids Res. 2009, 37 (Database issue): D885-D890.
Carbon S, Ireland A, Mungall CJ, Shu S, Marshall B, Lewis S, AmiGO Hub, Web Presence Working Group: AmiGO: online access to ontology and annotation data. Bioinformatics. 2009, 25 (2): 288-289. 10.1093/bioinformatics/btn615.
Ravasi T, Suzuki H, Cannistraci CV, Katayama S, Bajic VB, Tan K, Akalin A, Schmeier S, Kanamori-Katayama M, Bertin N, Carninci P, Daub CO, Forrest AR, Gough J, Grimmond S, Han JH, Hashimoto T, Hide W, Hofmann O, Kamburov A, Kaur M, Kawaji H, Kubosaki A, Lassmann T, van Nimwegen E, MacPherson CR, Ogawa C, Radovanovic A, Schwartz A, Teasdale RD, Tegner J, Lenhard B, Teichmann SA, Arakawa T, Ninomiya N, Murakami K, Tagami M, Fukuda S, Imamura K, Kai C, Ishihara R, Kitazume Y, Kawai J, Hume DA, Ideker T, Hayashizaki Y: An atlas of combinatorial transcriptional regulation in mouse and man. Cell. 2010, 140 (5): 744-752. 10.1016/j.cell.2010.01.044.
Horton JD, Goldstein JL, Brown MS: SREBPs: activators of the complete program of cholesterol and fatty acid synthesis in the liver. J Clin Invest. 2002, 109 (9): 1125-1131.
Falcon S, Gentleman R: Using GOstats to test gene lists for GO term association. Bioinformatics. 2007, 23 (2): 257-258. 10.1093/bioinformatics/btl567.
Cheli Y, Ohanna M, Ballotti R, Bertolotto C: Fifteen-year quest for microphthalmia-associated transcription factor target genes. Pigment Cell Melanoma Res. 2010, 23 (1): 27-40. 10.1111/j.1755-148X.2009.00653.x.
Schroder BA, Wrocklage C, Hasilik A, Saftig P: The proteome of lysosomes. Proteomics. 2010, 10 (22): 4053-4076. 10.1002/pmic.201000196.
Lubke T, Lobel P, Sleat DE: Proteomics of the lysosome. Biochim Biophys Acta. 2009, 1793 (4): 625-635. 10.1016/j.bbamcr.2008.09.018.
Mansouri A, Chowdhury K, Gruss P: Follicular cells of the thyroid gland require Pax8 gene function. Nat Genet. 1998, 19 (1): 87-90. 10.1038/ng0598-87.
Bouchard M, Souabni A, Mandler M, Neubuser A, Busslinger M: Nephric lineage specification by Pax2 and Pax8. Genes Dev. 2002, 16 (22): 2958-2970. 10.1101/gad.240102.
Su AI, Wiltshire T, Batalov S, Lapp H, Ching KA, Block D, Zhang J, Soden R, Hayakawa M, Kreiman G, Cooke MP, Walker JR, Hogenesch JB: A gene atlas of the mouse and human protein-encoding transcriptomes. Proc Natl Acad Sci USA. 2004, 101 (16): 6062-6067. 10.1073/pnas.0400782101.
Tan X, Rotllant J, Li H, De Deyne P, Du SJ: SmyD1, a histone methyltransferase, is required for myofibril organization and muscle contraction in zebrafish embryos. Proc Natl Acad Sci USA. 2006, 103 (8): 2713-2718. 10.1073/pnas.0509503103.
Gottlieb PD, Pierce SA, Sims RJ, Yamagishi H, Weihe EK, Harriss JV, Maika SD, Kuziel WA, King HL, Olson EN, Nakagawa O, Srivastava D: Bop encodes a muscle-restricted protein containing MYND and SET domains and is essential for cardiac differentiation and morphogenesis. Nat Genet. 2002, 31 (1): 25-32.
Ma Q, He X: Molecular basis of electrophilic and oxidative defense: promises and perils of Nrf2. Pharmacol Rev. 2012, 64 (4): 1055-1081. 10.1124/pr.110.004333.
Malhotra D, Portales-Casamar E, Singh A, Srivastava S, Arenillas D, Happel C, Shyr C, Wakabayashi N, Kensler TW, Wasserman WW, Biswal S: Global mapping of binding sites for Nrf2 identifies novel targets in cell survival response through ChIP-Seq profiling and network analysis. Nucleic Acids Res. 2010, 38 (17): 5718-5734. 10.1093/nar/gkq212.
Chorley BN, Campbell MR, Wang X, Karaca M, Sambandan D, Bangura F, Xue P, Pi J, Kleeberger SR, Bell DA: Identification of novel NRF2-regulated genes by ChIP-Seq: influence on retinoid X receptor alpha. Nucleic Acids Res. 2012, 40 (15): 7416-7429. 10.1093/nar/gks409.
Shen G, Xu C, Hu R, Jain MR, Nair S, Lin W, Yang CS, Chan JY, Kong AN: Comparison of (-)-epigallocatechin-3-gallate elicited liver and small intestine gene expression profiles between C57BL/6J mice and C57BL/6J/Nrf2 (-/-) mice. Pharm Res. 2005, 22 (11): 1805-1820. 10.1007/s11095-005-7546-8.
Nair S, Xu C, Shen G, Hebbar V, Gopalakrishnan A, Hu R, Jain MR, Lin W, Keum YS, Liew C, Chan JY, Kong AN: Pharmacogenomics of phenolic antioxidant butylated hydroxyanisole (BHA) in the small intestine and liver of Nrf2 knockout and C57BL/6J mice. Pharm Res. 2006, 23 (11): 2621-2637. 10.1007/s11095-006-9099-x.
Hu R, Xu C, Shen G, Jain MR, Khor TO, Gopalkrishnan A, Lin W, Reddy B, Chan JY, Kong AN: Gene expression profiles induced by cancer chemopreventive isothiocyanate sulforaphane in the liver of C57BL/6J mice and C57BL/6J/Nrf2 (-/-) mice. Cancer Lett. 2006, 243 (2): 170-192. 10.1016/j.canlet.2005.11.050.
Barve A, Khor TO, Nair S, Lin W, Yu S, Jain MR, Chan JY, Kong AN: Pharmacogenomic profile of soy isoflavone concentrate in the prostate of Nrf2 deficient and wild-type mice. J Pharm Sci. 2008, 97 (10): 4528-4545. 10.1002/jps.21311.
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005, 102 (43): 15545-15550. 10.1073/pnas.0506580102.
Ostuni R, Piccolo V, Barozzi I, Polletti S, Termanini A, Bonifacio S, Curina A, Prosperini E, Ghisletti S, Natoli G: Latent enhancers activated by stimulation in differentiated cells. Cell. 2013, 152 (1–2): 157-171.
Uhlen M, Oksvold P, Fagerberg L, Lundberg E, Jonasson K, Forsberg M, Zwahlen M, Kampf C, Wester K, Hober S, Wernerus H, Bjorling L, Ponten F: Towards a knowledge-based Human Protein Atlas. Nat Biotechnol. 2010, 28 (12): 1248-1250. 10.1038/nbt1210-1248.
Wurster AL, Tanaka T, Grusby MJ: The biology of Stat4 and Stat6. Oncogene. 2000, 19 (21): 2577-2584. 10.1038/sj.onc.1203485.
Goenka S, Kaplan MH: Transcriptional regulation by STAT6. Immunol Res. 2011, 50 (1): 87-96. 10.1007/s12026-011-8205-2.
Saeij JP, Coller S, Boyle JP, Jerome ME, White MW, Boothroyd JC: Toxoplasma co-opts host gene expression by injection of a polymorphic kinase homologue. Nature. 2007, 445 (7125): 324-327. 10.1038/nature05395.
Chen H, Sun H, You F, Sun W, Zhou X, Chen L, Yang J, Wang Y, Tang H, Guan Y, Xia W, Gu J, Ishikawa H, Gutman D, Barber G, Qin Z, Jiang Z: Activation of STAT6 by STING is critical for antiviral innate immunity. Cell. 2011, 147 (2): 436-446. 10.1016/j.cell.2011.09.022.
Szanto A, Balint BL, Nagy ZS, Barta E, Dezso B, Pap A, Szeles L, Poliska S, Oros M, Evans RM, Barak Y, Schwabe J, Nagy L: STAT6 transcription factor is a facilitator of the nuclear receptor PPARgamma-regulated gene expression in macrophages and dendritic cells. Immunity. 2010, 33 (5): 699-712. 10.1016/j.immuni.2010.11.009.
Hulsen T, de Vlieg J, Alkema W: BioVenn - a web application for the comparison and visualization of biological lists using area-proportional Venn diagrams. BMC Genomics. 2008, 9 (1): 488-10.1186/1471-2164-9-488.
Rutschman R, Lang R, Hesse M, Ihle JN, Wynn TA, Murray PJ: Cutting edge: Stat6-dependent substrate depletion regulates nitric oxide production. J Immunol. 2001, 166 (4): 2173-2177.
Fairweather D, Cihakova D: Alternatively activated macrophages in infection and autoimmunity. J Autoimmun. 2009, 33 (3–4): 222-230.
Heintzman ND, Stuart RK, Hon G, Fu Y, Ching CW, Hawkins RD, Barrera LO, Van Calcar S, Qu C, Ching KA, Wang W, Weng Z, Green RD, Crawford GE, Ren B: Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome. Nat Genet. 2007, 39 (3): 311-318. 10.1038/ng1966.
Yip KY, Cheng C, Bhardwaj N, Brown JB, Leng J, Kundaje A, Rozowsky J, Birney E, Bickel P, Snyder M, Gerstein M: Classification of human genomic regions based on experimentally determined binding sites of more than 100 transcription-related factors. Genome Biol. 2012, 13 (9): R48-10.1186/gb-2012-13-9-r48.
Scott EW, Simon MC, Anastasi J, Singh H: Requirement of transcription factor PU.1 in the development of multiple hematopoietic lineages. Science. 1994, 265 (5178): 1573-1577. 10.1126/science.8079170.
McKercher SR, Torbett BE, Anderson KL, Henkel GW, Vestal DJ, Baribault H, Klemsz M, Feeney AJ, Wu GE, Paige CJ, Maki RA: Targeted disruption of the PU.1 gene results in multiple hematopoietic abnormalities. EMBO J. 1996, 15 (20): 5647-5658.
Lawrence T, Natoli G: Transcriptional regulation of macrophage polarization: enabling diversity with identity. Nat Rev Immunol. 2011, 11 (11): 750-761. 10.1038/nri3088.
Creyghton MP, Cheng AW, Welstead GG, Kooistra T, Carey BW, Steine EJ, Hanna J, Lodato MA, Frampton GM, Sharp PA, Boyer LA, Young RA, Jaenisch R: Histone H3K27ac separates active from poised enhancers and predicts developmental state. Proc Natl Acad Sci USA. 2010, 107 (50): 21931-21936. 10.1073/pnas.1016071107.
Rada-Iglesias A, Bajpai R, Swigut T, Brugmann SA, Flynn RA, Wysocka J: A unique chromatin signature uncovers early developmental enhancers in humans. Nature. 2011, 470 (7333): 279-283. 10.1038/nature09692.
Karlic R, Chung HR, Lasserre J, Vlahovicek K, Vingron M: Histone modification levels are predictive for gene expression. Proc Natl Acad Sci USA. 2010, 107 (7): 2926-2931. 10.1073/pnas.0909344107.
Basso K, Margolin AA, Stolovitzky G, Klein U, Dalla-Favera R, Califano A: Reverse engineering of regulatory networks in human B cells. Nat Genet. 2005, 37 (4): 382-390. 10.1038/ng1532.
Della Gatta G, Bansal M, Ambesi-Impiombato A, Antonini D, Missero C, di Bernardo D: Direct targets of the TRP63 transcription factor revealed by a combination of gene expression profiling and reverse engineering. Genome Res. 2008, 18 (6): 939-948. 10.1101/gr.073601.107.
Joshi A, De Smet R, Marchal K, Van de Peer Y, Michoel T: Module networks revisited: computational assessment and prioritization of model predictions. Bioinformatics. 2009, 25 (4): 490-496. 10.1093/bioinformatics/btn658.
Raposo G, Fevrier B, Stoorvogel W, Marks MS: Lysosome-related organelles: a view from immunity and pigmentation. Cell Struct Funct. 2002, 27 (6): 443-456. 10.1247/csf.27.443.
Lobert VH, Stenmark H: Cell polarity and migration: emerging role for the endosomal sorting machinery. Physiology (Bethesda). 2011, 26 (3): 171-180. 10.1152/physiol.00054.2010.
Li P, Gregg JL, Wang N, Zhou D, O’Donnell P, Blum JS, Crotzer VL: Compartmentalization of class II antigen presentation: contribution of cytoplasmic and endosomal processing. Immunol Rev. 2005, 207: 206-217. 10.1111/j.0105-2896.2005.00297.x.
Bird PI, Trapani JA, Villadangos JA: Endolysosomal proteases and their inhibitors in immunity. Nat Rev Immunol. 2009, 9 (12): 871-882. 10.1038/nri2671.
Randow F, MacMicking JD, James LC: Cellular self-defense: how cell-autonomous immunity protects against pathogens. Science. 2013, 340 (6133): 701-706. 10.1126/science.1233028.
Stark GR, Darnell JE: The JAK-STAT pathway at twenty. Immunity. 2012, 36 (4): 503-514. 10.1016/j.immuni.2012.03.013.
Benekli M, Baer MR, Baumann H, Wetzler M: Signal transducer and activator of transcription proteins in leukemias. Blood. 2003, 101 (8): 2940-2954. 10.1182/blood-2002-04-1204.
Bruns HA, Kaplan MH: The role of constitutively active Stat6 in leukemia and lymphoma. Crit Rev Oncol Hematol. 2006, 57 (3): 245-253. 10.1016/j.critrevonc.2005.08.005.
Calo V, Migliavacca M, Bazan V, Macaluso M, Buscemi M, Gebbia N, Russo A: STAT proteins: from normal control of cellular events to tumorigenesis. J Cell Physiol. 2003, 197 (2): 157-168. 10.1002/jcp.10364.
Martinez FO, Helming L, Gordon S: Alternative activation of macrophages: an immunologic functional perspective. Annu Rev Immunol. 2009, 27: 451-483. 10.1146/annurev.immunol.021908.132532.
Online Mendelian Inheritance in Man, OMIM: McKusick-Nathans Institute of Genetic Medicine. 2013, Baltimore, MD: Johns Hopkins University, [http://omim.org/]
Forgac M: Vacuolar ATPases: rotary proton pumps in physiology and pathophysiology. Nat Rev Mol Cell Biol. 2007, 8 (11): 917-929. 10.1038/nrm2272.
Miranda KC, Karet FE, Brown D: An extended nomenclature for mammalian V-ATPase subunit genes and splice variants. PLoS One. 2010, 5 (3): e9531-10.1371/journal.pone.0009531.
Matsui K, Yuyama N, Akaiwa M, Yoshida NL, Maeda M, Sugita Y, Izuhara K: Identification of an alternative splicing variant of cathepsin C/dipeptidyl-peptidase I. Gene. 2002, 293 (1–2): 1-7.
Ueta M, Mizushima K, Yokoi N, Naito Y, Kinoshita S: Expression of the interleukin-4 receptor alpha in human conjunctival epithelial cells. Br J Ophthalmol. 2010, 94 (9): 1239-1243. 10.1136/bjo.2009.173419.
Balce DR, Li B, Allan ER, Rybicka JM, Krohn RM, Yates RM: Alternative activation of macrophages by IL-4 enhances the proteolytic capacity of their phagosomes through synergistic mechanisms. Blood. 2011, 118 (15): 4199-4208. 10.1182/blood-2011-01-328906.
Montaner LJ, da Silva RP, Sun J, Sutterwala S, Hollinshead M, Vaux D, Gordon S: Type 1 and type 2 cytokine regulation of macrophage endocytosis: differential activation by IL-4/IL-13 as opposed to IFN-gamma or IL-10. J Immunol. 1999, 162 (8): 4606-4613.
Wainszelbaum MJ, Proctor BM, Pontow SE, Stahl PD, Barbieri MA: IL4/PGE2 induction of an enlarged early endosomal compartment in mouse macrophages is Rab5-dependent. Exp Cell Res. 2006, 312 (12): 2238-2251. 10.1016/j.yexcr.2006.03.025.
Varin A, Gordon S: Alternative activation of macrophages: immune function and cellular biology. Immunobiology. 2009, 214 (7): 630-641. 10.1016/j.imbio.2008.11.009.
Reiser J, Adair B, Reinheckel T: Specialized roles for cysteine cathepsins in health and disease. J Clin Invest. 2010, 120 (10): 3421-3431. 10.1172/JCI42918.
Everts V, van der Zee E, Creemers L, Beertsen W: Phagocytosis and intracellular digestion of collagen, its role in turnover and remodelling. Histochem J. 1996, 28 (4): 229-245. 10.1007/BF02409011.
Lecaille F, Bromme D, Lalmanach G: Biochemical properties and regulation of cathepsin K activity. Biochimie. 2008, 90 (2): 208-226. 10.1016/j.biochi.2007.08.011.
Hayman AR, Jones SJ, Boyde A, Foster D, Colledge WH, Carlton MB, Evans MJ, Cox TM: Mice lacking tartrate-resistant acid phosphatase (Acp 5) have disrupted endochondral ossification and mild osteopetrosis. Development. 1996, 122 (10): 3151-3162.
Li JJ, Johnson AR: Selective MMP13 inhibitors. Med Res Rev. 2011, 31 (6): 863-894. 10.1002/med.20204.
Zheng T, Zhu Z, Wang Z, Homer RJ, Ma B, Riese RJ, Chapman HA, Shapiro SD, Elias JA: Inducible targeting of IL-13 to the adult lung causes matrix metalloproteinase- and cathepsin-dependent emphysema. J Clin Invest. 2000, 106 (9): 1081-1093. 10.1172/JCI10458.
Lee PJ, Zhang X, Shan P, Ma B, Lee CG, Homer RJ, Zhu Z, Rincon M, Mossman BT, Elias JA: ERK1/2 mitogen-activated protein kinase selectively mediates IL-13-induced lung inflammation and remodeling in vivo. J Clin Invest. 2006, 116 (1): 163-173.
Merk BC, Owens JL, Lopes MB, Silva CM, Hussaini IM: STAT6 expression in glioblastoma promotes invasive growth. BMC Cancer. 2011, 11: 184-2407. 10.1186/1471-2407-11-184.
Wei L, Vahedi G, Sun HW, Watford WT, Takatori H, Ramos HL, Takahashi H, Liang J, Gutierrez-Cruz G, Zang C, Peng W, O’Shea JJ, Kanno Y: Discrete roles of STAT4 and STAT6 transcription factors in tuning epigenetic modifications and transcription during T helper cell differentiation. Immunity. 2010, 32 (6): 840-851. 10.1016/j.immuni.2010.06.003.
Elo LL, Jarvenpaa H, Tuomela S, Raghav S, Ahlfors H, Laurila K, Gupta B, Lund RJ, Tahvanainen J, Hawkins RD, Oresic M, Lahdesmaki H, Rasool O, Rao KV, Aittokallio T, Lahesmaa R: Genome-wide profiling of interleukin-4 and STAT6 transcription factor regulation of human Th2 cell programming. Immunity. 2010, 32 (6): 852-862. 10.1016/j.immuni.2010.06.011.
Mikita T, Campbell D, Wu P, Williamson K, Schindler U: Requirements for interleukin-4-induced gene expression and functional characterization of Stat6. Mol Cell Biol. 1996, 16 (10): 5811-5820.
Team RDC: R: A language and environment for statistical computing: Vienna. 2010, Austria: R Foundation for Statistical Computing
Warnes GR, Bolker B, Bonebakker L, Gentleman R, Huber W, Liaw A, Lumley T, Maechler M, Magnusson A, Moeller S, Schwartz M, Venables B: Various R programming tools for plotting data. The Comprehensive R Archive Network. 2013, [http://cran.r-project.org]
Wu Z, Irizarry RA, Gentleman R, Martinez-Murillo F, Spencer F: A model-based background adjustment for oligonucleotide expression arrays. J Am Stat Assoc. 2004, 99 (468): 909-917. 10.1198/016214504000000683.
Zhu LJ, Gazin C, Lawson ND, Pagès H, Lin SM, Lapointe DS, Green MR: ChIPpeakAnno: a Bioconductor package to annotate ChIP-seq and ChIP-chip data. BMC Bioinformatics. 2010, 11: 237-10.1186/1471-2105-11-237. doi: 10.1186/1471-2105-11-237
Carlson M, Pages H, Aboyoun P, Falcon S, Morgan M, Sarkar D, Lawrence M: GenomicFeatures: Tools for making and manipulating transcript centric annotations. Bioconductor. 2013, [http://www.bioconductor.org/]
Hahne F, Durinck S, Ivanek R, Mueller A, Lianoglou S: Gviz: Plotting data and annotation information along genomic coordinates. Bioconductor. 2013, [http://www.bioconductor.org/]
Sarkar D: Lattice: Multivariate Data Visualization with R. 2008, New York: Springer
Schindler U, Wu P, Rothe M, Brasseur M, McKnight SL: Components of a Stat recognition code: evidence for two layers of molecular selectivity. Immunity. 1995, 2 (6): 689-697. 10.1016/1074-7613(95)90013-6.
van Helden J: Regulatory sequence analysis tools. Nucleic Acids Res. 2003, 31 (13): 3593-3596. 10.1093/nar/gkg567.
Kaplan MH, Schindler U, Smiley ST, Grusby MJ: Stat6 is required for mediating responses to IL-4 and for development of Th2 cells. Immunity. 1996, 4 (3): 313-319. 10.1016/S1074-7613(00)80439-2.
Barish GD, Downes M, Alaynick WA, Yu RT, Ocampo CB, Bookout AL, Mangelsdorf DJ, Evans RM: A Nuclear Receptor Atlas: macrophage activation. Mol Endocrinol. 2005, 19 (10): 2466-2477. 10.1210/me.2004-0529.
Barish GD, Yu RT, Karunasiri M, Ocampo CB, Dixon J, Benner C, Dent AL, Tangirala RK, Evans RM: Bcl-6 and NF-kappaB cistromes mediate opposing regulation of the innate immune response. Genes Dev. 2010, 24 (24): 2760-2765. 10.1101/gad.1998010.
Wickham H: ggplot2: elegant graphics for data analysis. 2009, New York: Springer
Eppig JT, Blake JA, Bult CJ, Kadin JA, Richardson JE, Mouse Genome Database Group: The Mouse Genome Database (MGD): comprehensive resource for genetics and genomics of the laboratory mouse. Nucleic Acids Res. 2012, 40 (Database issue): D881-D886.
Kasprzyk A: BioMart: driving a paradigm change in biological data management. 2011, Oxford: Database, bar049
Maglott D, Ostell J, Pruitt KD, Tatusova T: Entrez Gene: gene-centered information at NCBI. Nucleic Acids Res. 2005, 33 (Database issue): D54-D58.
Rozen S, Skaletsky HJ: Primer3 on the WWW for general users and for biologist programmers. Bioinformatics Methods and Protocols: Methods in Molecular Biology. Edited by: Krawetz S, Misener S, Totowa NJ. 2000, Totowa, New Jersey: Humana Press, 365-386.
This paper is dedicated to the memory of Louise Margaret Brignull. Research in the Nohturfft laboratory was supported by a grant from St. George’s Charitable Foundation. ZC is supported by funding from the European Union, the European Social Fund and the State of Hungary (TÁMOP 4.2.4. A/2-11-1-2012-0001 'National Excellence Program' and TÁMOP-4.2.2/B-10/1-2010-0024). LN is supported by a grant from the Hungarian Scientific Research Fund (OTKA K100196), and TÁMOP422_2012_0023 VÉD-ELEM, implemented through the New Hungary Development Plan and co-financed by the European Social Fund and the European Regional Development Fund. NB was supported by a project grant from Asthma UK (06-050) and S.J. by a Chair from Asthma UK (CH1155). This work was supported in part by MRC Centre Grant G1000758 and ERC FP7 Advanced Grant 233015 (to SJ). We are grateful to our colleagues in the Biomics Centre and the Information Technology group at St. George’s University of London for provision of core instrumentation support and to Keith Carr and Giuseppe Sollazzo for invaluable help during website development.
The authors declare that they have no competing interests.
AN conceived of the study, wrote computer programs and wrote the first draft of the manuscript. AN, LM, BD, ZC and NB performed tissue harvest. AN, BD, HS, IV, LB, LM and ZC cultured cells. AN, BD, IV, LB, LM, and ZC extracted RNA and performed quantitative PCR. BD, LN, and ZC performed chromatin immunoprecipitation. NB and SJ designed knockout mouse studies and managed animal husbandry. AN, LB and LM performed data analysis. LB, LM, LN, NB, and SJ helped write the final version of the paper. All authors read and approved the final manuscript.
Louise M Brignull, Zsolt Czimmerer contributed equally to this work.
Electronic supplementary material
Additional file 1: Expression correlation between membrane biogenesis genes and known transcription factors. The data pertain to Figure 1 of the main article, and calculations were performed as detailed in the accompanying legend. (Worksheet 1, “Cholesterol biosynthesis”) Column A lists 19 genes encoding enzymes in the cholesterol biosynthesis pathway that were used as a reference set . Column C lists NCBI GEO accession numbers of 472 Mouse430_2-based gene expression data series that had been selected based on a minimum average coefficient of correlation among the 19 reference genes of 30%. The genes in Column E are based on a list of known transcription factors (see Methods). Values in Column G represent average Pearson correlation coefficients calculated between the corresponding transcription factor in Column E and the reference genes from Column A across the data series listed in Column C. Column F indicates whether the gene encodes a DNA-binding protein according to GO annotations (GO:0003677). Transcription factor genes are listed according to correlation values in descending order. (Worksheet 2, “ER”) Column A lists 95 highly correlated ER genes that have been selected as detailed in the legends to Figure 1b and Additional file 3, after removing known transcription factors. Column C lists NCBI GEO accession numbers of 247 Mouse430_2-based gene expression data series that had been selected based on a minimum average coefficient of correlation among the 95 reference genes of 30%. Columns E-G contain correlation data for transcription factor genes as detailed above. (XLS 283 KB)
Additional file 2: Matrix of expression correlations among ER genes in mouse. This supplement pertains to Figure 1b and Additional file 3. Data were generated as described in the accompanying legends. Tab-delimited. (TXT 11 MB)
Additional file 3: Gene expression correlation analysis identifies distinct subsets of ER genes. These data pertain to Figure 1b of the main article. Pearson correlation coefficients across 1,435 Mouse430_2-based microarray datasets were calculated among 778 ER genes. The gene list was based on a set downloaded from the AmiGO gene ontology database (GO:0005783) and modified by removing genes also associated with the Golgi (GO:0005794) or lysosomes (GO:0005764). The resulting data matrix (Additional file 2) was subjected to hierarchical clustering as described in Methods. The cluster with the largest average ( = 0.101), consisting of 97 ER genes (listed in Additional file 1), is highlighted in red. (PDF 438 KB)
Additional file 4: Genome-wide expression correlation analyses for mouse and human SREBF2 and XBP1. (Worksheet 1, “Mouse”) Pearson correlations were averaged across the indicated number (n) of Mouse430_2-based datasets between the transcription factors Srebf2 or Xbp1 and 16,771 other named genes for which high-quality data were available. (Worksheet 2, “Human”) Pearson correlations between human SREBF2 or XBP1 and 17,236 other named genes averaged across the indicated number (n) of HG_U133_Plus2-based datasets. (XLS 3 MB)
Additional file 6: Matrix of expression correlations among lysosomal genes in mouse. This supplement pertains to Figures 2 and 3. Data were generated as described in the accompanying legends. Tab-delimited. (TXT 1 MB)
Additional file 7: On each worksheet Column A lists lysosomal genes for the indicated cluster from Figure 2. Column C lists NCBI GEO accession numbers of Mouse430_2 or HG_U133_Plus2-based gene expression data series that had been selected based on a minimum average coefficient of correlation among the reference genes of 30%. Values in Column G represent average Pearson correlation coefficients calculated between the corresponding transcription factor in Column E and the reference genes from Column A across the data series listed in Column C. Column F indicates whether the gene encodes a DNA-binding protein according to GO annotations (GO:0003677). Transcription factor genes are listed according to correlation values in descending order. (XLS 844 KB)
Additional file 8: Correlation of Stat6 with lysosomal genes. (Worksheet 1, “Correl. Mouse”) Average expression correlations across 1,093 datasets were calculated between mouse Stat6 and genes represented on Mouse_430_2 arrays as described in the legend to Additional file 4. (Worksheet 2, “Correl. Human”) Average expression correlations across 1,122 datasets were calculated between human STAT6 and genes represented on HG_U133_Plus2 arrays as described in the legend to Additional file 4. (Worksheet 3, “GSEA Mouse”) The correlation-ranked gene list from Worksheet 1 was analyzed for GO term enrichment using the GSEA tool . Gene set reference files in gene matrix transposed (gmt) format were generated by reformatting lists, downloaded from the AmiGO website, containing all mouse and human GO associations (all gene product types and data sources) . Column J lists ‘leading-edge’ lysosomal genes as returned by GSEA; genes common to both the mouse and human leading edges are shown in red; leading-edge lysosomal genes bound by Stat6 according to reference  are boxed. (Worksheet 4, “GSEA Human”) The correlation-ranked gene list from Worksheet 2 was analyzed for GO term enrichment using the GSEA tool as in Worksheet 3. (Worksheet 5, “GOstats Mouse”) The 500 highest-ranking genes in Worksheet 1 were analyzed for GO term enrichment using a custom R script based on the Bioconductor GOstats package . (Worksheet 6, “GOstats Human”) The 500 highest-ranking genes in Worksheet 2 were analyzed for GO term enrichment as in Worksheet 5. (XLS 2 MB)
Additional file 9: Effects of Stat6 and IL-4 on lysosomal gene expression in mouse bone marrow macrophages. The data pertain to Figure 5 of the main article, and calculations were performed as detailed in the accompanying legend and in Methods. Values indicate log (base 2) fold changes of lysosomal genes as determined by gene expression profiling. Genes are color-coded according to GO annotations. Antigen processing and presentation, GO:0019882; glycosidases, GO:0016798; peptidases, GO:0008233; other hydrolases, GO:0016787; vesicular transport, GO:0016192; vacuolar H+ ATPase, GO:0016471. (XLS 60 KB)
Additional file 10: Stat6-bound loci are statistically enriched for genes encoding lysosomal proteins. This analysis is based on publicly available ChIP-seq data as described in the legend to Figure 7. A file with peak coordinates in bed format was downloaded from the NCBI GEO depository [GEO:GSM1022305], and peaks were annotated as described in Methods. We only considered peaks whose centers were located within 5kb of transcription start sites of genes for which both HGNS gene symbols and Entrez gene IDs were available. The resulting list of 4,520 gene symbols was analyzed for GO set enrichment using the Bioconductor GOstats package . The reference set consisted of 23,563 unique gene symbols based on a list of mouse GO annotations (version 1.79) from the Mouse Genome Information (MGI) site for which Entrez gene IDs were also available . (XLS 492 KB)
Additional file 11: Stat6 binding to lysosomal loci verified by ChIP-PCR. Bone marrow-derived macrophages from wild-type and Stat6-deficient mice were cultured in the presence of M-CSF for five days and then switched to media ± recombinant IL-4 for 30 minutes. Chromatin was crosslinked, fragmented and immunoprecipitated with control IgG (left panels) or anti-Stat6 (right panels) as described in Methods. The immunoprecipitated DNA was used as template for qPCR reactions to quantify selected Stat6 peak regions. Coordinates of PCR fragments are (mouse genome build mm9): Arg1, chr10:24650166-24650215; Plekhf1, chr7:39013036-39013202; Mmp13, chr9:7330552-7330598; Atp6v0d2, chr4:19821892-19821947; Atp6v0a1, chr11:100856194-100856250; Atp6v1b2, chr8:71615039-71615173; Ctsl, Chr13:64503469-64503600. (PDF 422 KB)
Additional file 12: Stat6 binds to active chromatin regions. Analyses are based on publicly available ChIP-seq data deposited by Ostuni et al. . Peak coordinates in bed format were downloaded from the NCBI GEO site [GEO:GSM1022256, GSM1022257, GSM1022258, GSM1022259, GSM1022260, GSM1022261, GSM1022262, GSM1022263, GSM1022264, GSM1022265, GSM1022266, GSM1022267, GSM1022297, GSM1022298, GSM1022299] and analyzed with the ChipPeakAnno package . Columns A-C list the merged coordinates of 196 Stat6 peaks selected according to three criteria: (i) Stat6 peaks overlapped in all of three independent data sets obtained after growth of mouse macrophages with IL-4 for 1, 2 and 4 hours [GEO:GSM1022297, GSM1022298, GSM1022299]; (ii) peaks were filtered to exclude those located farther than 5kb from the nearest transcription start site; (iii) we further selected peaks close to genes annotated as encoding lysosomal proteins. Peak names in column E correspond to those in GSM1022299. Peaks were annotated with gene identifiers using information obtained through the BioMart server . Columns H-S indicate whether the Stat6 peaks overlap with peaks obtained with antibodies against H3K26ac (columns H-K), H3K4me1 (columns L-O) or Pu.1 (columns P-S), using chromatin from wild-type or Stat6-deficient macrophages grown ± IL-4 for 4 hours as indicated in the column headers . (XLS 82 KB)
Additional file 13: Oligonucleotides used for qPCR. Primers were designed based on sequence information obtained through the NCBI Entrez Gene server  using the Primer-BLAST software tool . (XLS 45 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.