- Research article
- Open Access
The dynamics of early-state transcriptional changes and aggregate formation in a Huntington’s disease cell model
BMC Genomics volume 18, Article number: 373 (2017)
Huntington’s disease (HD) is a fatal neurodegenerative disorder caused by a CAG expansion in the Huntingtin (HTT) gene. Proteolytic cleavage of mutant huntingtin (Htt) protein with an expanded polyglutamine (polyQ) stretch results in production of Htt fragments that aggregate and induce impaired ubiquitin proteasome, mitochondrial functioning and transcriptional dysregulation. To understand the time-resolved relationship between aggregate formation and transcriptional changes at early disease stages, we performed temporal transcriptome profiling and quantification of aggregate formation in living cells in an inducible HD cell model.
Rat pheochromocytoma (PC12) cells containing a stably integrated, doxycycline-inducible, eGFP-tagged N-terminal human Htt fragment with an expanded polyQ domain were used to analyse gene expression changes at different stages of mutant Htt aggregation. At earliest time points after doxycycline induction no detectable aggregates and few changes in gene expression were observed. Aggregates started to appear at intermediate time points. Aggregate formation and subsequent enlargement of aggregates coincided with a rapid increase in the number of differentially expressed (DE) genes. The increase in number of large aggregates coincided with a decrease in the number of smaller aggregates whereas the transcription profile reverted towards the profile observed before mutant Htt induction. Cluster-based analysis of the 2,176 differentially expressed genes revealed fourteen distinct clusters responding differently over time. Functional enrichment analysis of the two major gene clusters revealed that genes in the up-regulated cluster were mainly involved in metabolic (antioxidant activity and cellular ketone metabolic processes) and genes in the down-regulated cluster in developmental processes, respectively. Promoter-based analysis of the identified gene clusters resulted in identification of a transcription factor network of which several previously have been linked to HD.
We demonstrate a time-resolved relationship between Htt aggregation and changes in the transcriptional profile. We identified two major gene clusters showing involvement of (i) mitochondrial dysfunction and (ii) developmental processes implying cellular homeostasis defects. We identified novel and known HD-linked transcription factors and show their interaction with known and predicted regulatory proteins. Our data provide a novel resource for hypothesis building on the role of transcriptional key regulators in early stages of HD and possibly other polyQ-dependent diseases.
Huntington’s disease (HD) is a neurodegenerative disorder that manifests in behavioural changes as well as a decline in cognitive abilities and motor functions. Disease onset typically starts around the fourth decade of life and after disease onset symptoms progressively increase . HD is caused by an expanded stretch of CAG repeats in the Huntingtin (HTT) gene which is translated into an abnormally large polyglutamine (polyQ) domain near the huntingtin (Htt) protein N-terminus. An increasing number of CAG repeats is related to progressively earlier ages of disease onset. The Htt protein is involved in a variety of cellular processes and interacts with several proteins including transcriptional regulators and nuclear receptors. Mutant Htt is known to exhibit a changed protein interaction profile and to disrupt physiological functioning of its binding partners [2,3,4,5,6], e.g., CREB Binding Protein (CBP) and Specificity Protein 1 (Sp1) [7,8,9]. Even though Htt is ubiquitously expressed in all cell types, medium-sized spiny neurons in the striatum are particularly affected very early in HD .
Transcriptional dysregulation is proposed to represent a key factor in HD development and major changes in gene expression are detected in HD patient post-mortem brain tissue and in disease model systems [11,12,13,14,15,16,17,18]. Protein aggregates in the brain are a hallmark of HD and transcriptional regulators are found to be associated with these aggregates [19,20,21,22,23]. In mouse models Htt aggregates are noted to appear before disease onset [24, 25]. Sequestration of proteins involved in transcriptional regulation into aggregates [20,21,22,23, 26] or ubiquitin proteasome dysfunction [27, 28] is suggested to represent a causative role in HD pathogenesis. However, sequestration of proteins into aggregates does not seem to affect protein concentrations to a biologically significant degree  and both transcriptional dysregulation and ubiquitin proteosome impairment are noted to already take place before nuclear aggregate formation. It is now thought that it is not mutant Htt aggregation but the soluble Htt protein fragments that play an important role in early cellular pathology [30,31,32]. Clearly, the causal relationship between protein aggregation, transcriptional dysregulation and neuronal cell dysfunction at early disease onset are not fully understood.
In the present study, we used an inducible HD cell system to determine the dynamics of early-state transcriptional changes and aggregate formation. We performed transcriptional profiling and quantified aggregate formation over time upon induction of mutant and control Htt protein expression. Our data show that a rapid accumulation of small mutant Htt aggregates coincides with a significant change in the transcriptional profile represented by one major cluster of up-regulated genes involved in metabolic processes and one major cluster of down-regulated genes involved in developmental processes. At later time points small aggregates seem to merge into larger ones and changes in gene expression diminish. We identified a number of transcription factors involved in transcriptional dysregulation and illustrate their interactions that may be part of a larger network commonly affected not only in HD, but possibly also in other polyQ-dependent disorders.
Rat PC12 cells containing exon 1 of the human HTT gene, with either 23 (control) or 74 (HD) CAG repeats, fused to Green Fluorescent Protein (GFP) under control of a doxycycline-inducible promoter  were cultured in six well plates containing high glucose Dulbecco’s modified Eagle’s medium containing pyruvate and L-glutamine (DMEM 41966, Invitrogen Life Technologies) supplemented with 100 U/ml penicillin and streptomycin (Invitrogen Life Technologies), 2 mM L-glutamine (Invitrogen Life Technologies), 10% heat-inactivated horse serum (Invitrogen Life Technologies), 5% TET-approved heat-inactivated fetal bovine serum (Clontech), 100 μg/ml G418 (Invitrogen Life Technologies) and 75 μg/ml hygromycin (Invitrogen Life Technologies) at 37 °C and 10% CO2. Induction of fusion protein was accomplished by incubating the cells with 1 μg/ml doxycycline (Clontech). Cells were harvested at 0 (pre-induction state), 4, 8, 24, 36, 48, 72, 96 and 120 h post-induction.
Induced cells were imaged using a Zeiss Axiovert 200 M wide-field fluorescence microscope equipped with a 100x Plan-Apochromat (1.4NA) oil immersion lens and a Cairn Xenon Arc lamp with monochromator and objective heater. Images were recorded with a cooled CCD camera and analysed using a modified version of the Argos system .
The outline of the cells and the aggregates present in the captured images were detected as follows. A Gaussian filter with σ = 0.2 μm in combination with a threshold intensity of 30 was used to enable detection of the cells. A Gaussian difference with σ = 0.1 and σ = 0.5 μm in combination with a threshold intensity of 80 was used for spot detection. The 20% quantile value of the data was taken as the background value and subtracted from the data. For follow-up analysis, aggregate spots smaller than 1×10−5 μm were considered noise and discarded. For each remaining aggregate, the logarithm of its volume (loge(volume)) was calculated, leading to a range of values from −5.54 to 4.09. Given this range, we binned the aggregates into eleven groups. For each time point after induction, at least four images were analysed and the number of aggregates belonging to each bin was determined. To correct for the varying number of cells present in each image, the number of aggregates found for each bin was divided by the total volume of the cells found in that image.
For each of the at least four images captured per time point, an average eGFP-Htt expression value was obtained calculating the total, background-corrected fluorescence values divided by the combined cell volume.
Total RNA was extracted from PC12 cells and purified from whole cell lysates derived from samples cultured at each experimental condition. RNA isolation was performed with Trizol (Life technologies) following manufacturer’s instruction. tRNAs were removed with RNeasy MinElute clean-up kit (Qiagen). The integrity of isolated RNA was measured using the Agilent 2100 BioAnalyzer (Agilent Technologies) according to the manufacturer’s instructions. Based on the results of the integrity measurements, a quality score for each RNA sample was calculated using the RNA Integrity Number algorithm . The samples used in our experiment were assigned scores ranging from 8.8 to 10, with an average score of 9.9, indicating high quality RNA samples.
A total of 108 samples were prepared for hybridisation: six for each of the nine time points for both control and HD cell lines. Each test sample was labelled with Cy3 and hybridised against a Cy5-labelled common reference sample, which consisted of a pool of equal amounts of RNA from all individual test samples. Labelling of the RNA samples was performed using Ambion aminoallyl amplification. Hybridisation of the samples was randomized across nine slides containing twelve arrays each. Four samples had to be discarded because of hybridization-related problems.
The microarrays used in this study were Nimblegen-Roche chips (12–146 k format) spotted with Nimblegen’s default oligo library (corresponding to 26,419 rat genes, each of which was represented by multiple distinct probes) as well as probes to be used for internal quality controls and array construction. The Nimblegen oligo library was supplemented with probes recognizing three additional rat genes (arginine-glutamic acid dipeptide repeats (Rere), HECT, UBA and WWE domain containing 1 (Huwe1) and Specificity protein 3 (Sp3)), probes to quantify the eGFP-Htt construct expression levels and to detect reverse complementary versions of the probes targeting eGFP-Htt as negative controls. Sequences for these additional probes are included in the supplementary material (Additional file 1).
Reannotation of microarray probes
The annotation of the probes corresponding to rat genes was updated using the Ensembl rat transcriptome (Rnor_5.0). Probe sequences were aligned to the rat transcriptome using blast with a threshold E-value of 1x10−10. Probes uniquely matching a transcript sequence with a bit score of at least 100 were assigned the corresponding Ensembl transcript and gene ID. Probes matching multiple transcript sequences belonging to the same gene were only assigned the Ensembl gene ID. For probes that could not be matched at all, we used Nimblegen’s original RefSeq ID annotation. Additional information such as Entrez gene IDs and gene symbols were retrieved from the ‘rnorvegicus_gene_ensembl’ dataset from the Ensembl database using the biomaRt R/Bioconductor package (version 2.18.0 [36, 37]). After reannotation, not all genes interrogated by the default oligo library were represented by four distinct probes as originally designed (Additional file 2).
Pre-processing and statistical analysis
Microarray expression values were analysed using R statistical software (version 3.1.0) and R/Bioconductor packages. Furthermore, all microarray probe sets related to Nimblegen internal controls and array construction were removed from the dataset to avoid introducing systematic effects during pre-processing of the data. The remaining probes all corresponded to rat genes or one of the additional custom probes. The R/Bioconductor package arrayQualityMetrics  was used to assess whether the microarray data were of good quality. No outlier arrays were observed.
Expression values were background corrected using the normal-exponential (normexp) method (limma package, version 3.20.1 ), with an offset value of 20 to shrink log-transformed red/green ratios towards zero at lower intensities. Loess normalisation (limma package) of the log-ratios was performed to mitigate dye biases within each array. The expression values of all probes corresponding to the same Ensembl gene ID were summarized to a single representative expression value through the median polish procedure (affy package, version 1.42.2 ). When lacking an updated Ensembl gene ID, the original RefSeq gene IDs were used to summarise the probes. Significance of differential expression was assessed using linear modelling and empirical Bayes moderated statistics (limma package). The gene-wise linear models employed included a factor for the combination of cell line (control/HD) and time point, and a factor to correct for potential slide effects present in the data. For both control and HD samples, contrasts were made between each individual time point t and the pre-induction state for that cell line, that is, Ct-C0 and HDt-HD0 respectively. We also investigated which genes responded differently over time in the HD cell line relative to the control cell line to obtain a HD-specific set of significant differentially expressed (DE) genes (HDt-HD0) - (Ct-C0). Correction for multiple testing was done using the Benjamini-Hochberg false discovery rate.
Cluster and enrichment analyses
Expression values were corrected for potential slide effects using ComBat (sva package, version 3.4.0 ) with ‘slide’ as a batch covariate and a model matrix containing factors for the cell lines and time points used. Only genes annotated with an updated Ensembl gene ID and a single Entrez gene ID were used. Expression values of the 2176 HD-specific genes were used for further analyses using Expander (version 6.0) . Expression values derived from arrays representing the same conditions were averaged to a single expression value. Next, genes were clustered by expression pattern using the CLICK algorithm implemented in Expander  with the ‘Default Homogeneity’ option and using Pearson correlation as a distance measure.
The Entrez IDs for the genes comprising the resulting clusters were then used for functional enrichment analysis using the TANGO algorithm with default settings. Reported p-values are the result of testing for the significance of overlap between gene groups and Gene Ontology (GO) terms using a hypergeometric test. P-values were corrected for multiple testing by sampling for each cluster 2,500 random gene sets of the same size as the original cluster and estimating the empirical p-value distribution for the evaluated GO terms. All rat genes represented on the array by probes for which we had updated annotation (Additional file 2) were used as background set. GO terms with a corrected p-value smaller than 0.05 were considered to be significant.
For each of the clusters created by CLICK, promoter analysis was carried out to identify transcription factors for which binding sites were overrepresented in the promoter regions of a set of genes. The PRIMA algorithm  was used to analyse a region from 1000 bases upstream to 200 bases downstream of the transcription start site of each gene. All genes represented on the array for which we had updated annotation were used as background set. Binding site motifs with a p-value < 0.0001 as determined using a generalized hypergeometric test were considered significant.
Protein interaction analysis
Interaction between proteins was studied using the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and its database of known protein interactions (http://string-db.org/; version 10.0). Gene symbols were used as input while selecting Homo sapiens as species. The network was created based on high confidence interactions (cut-off score of 0.7) and visualised using the ‘Molecular Action’ view option to show the type of interaction between the nodes.
Onset and accumulation of mutant HD aggregates
We used fluorescence microscopy to study the temporal expression of eGFP-tagged N-terminal Htt fragments with an expanded polyQ domain in PC12 cells (Q23 as control and Q74 as HD cell line). We determined the localization and aggregation of eGFP-Htt-Q74 at 4 h - 120 h after doxycycline induction (Fig. 1a-p). Additional file 3 shows the eGFP-Htt-Q23 expressing cells at 4 h - 120 h after doxycycline induction. We did not detect aggregate formation in the Q23 control cell line at any of the post-induction time points. We used Argos software to calculate the volume of each individual aggregate and the combined volume of all cells per image (Fig. 1e-h, m-p).
Aggregates detected in cells were binned based on their loge(volume). The first eGFP-Htt-Q74 aggregates were observed 24 h post-induction and were relatively small, mainly occurring in the first 6 bins (Fig. 1q). Over time, the number of aggregates increased until it reached a maximum at 48–72 h post-induction. In this same time frame, the distribution of the aggregate sizes started shifting towards larger volumes, occurring in almost all bins (Fig. 1r). At 96 h post-induction, the total number of aggregates decreased notably. Surprisingly, at 120 h post-induction in a defined set of aggregate units (bins [0,1) and [1,2)) a large increase in the number of aggregates per cell volume was noted compared to the number of aggregates per cell volume observed at 48–72 h post-induction.
Doxycycline-induced expression of eGFP-Htt
The eGFP-Htt protein expression levels were estimated based on fluorescence measurements using the images described earlier (Fig. 2a, b). Upon doxycycline induction, we observed a swift eGFP-Htt-Q74 protein induction for up to 24 h at which point the protein reached a maximum expression level. The eGFP-Htt-Q74 expression level remained unchanged until 72 h and then subsequently decreased, reaching levels at 120 h post-induction similar to those observed after 4 h. eGFP-Htt-Q23 protein levels are increasing more gradually and reach a slightly higher plateau level compared with eGFP-Htt-Q74 levels.
To determine the changes in gene expression in the eGFP-Htt-Q23 control and eGFP-Htt-Q74 HD cell lines, mRNA was harvested in parallel to the microscopic aggregate formation analysis, including the pre-induction state. We used the Nimblegen-Roche 12-146 k microarray platform to analyse temporal gene expression at defined post-induction time points. Several probes were included to measure eGFP-Htt expression levels (Fig. 2c). Additional File 4 shows for each condition the mean log2-ratio with respect to the Cy5-labelled common reference sample of these probes as well as of the probe targeting endogenous rat Htt (ENSRNOG00000011073). Our data show that levels of the probe targeting endogenous Htt did not change upon doxycycline induction in the eGFP-Htt-Q23 control and eGFP-Htt-Q74 HD cell line.
The expression values of these probes were pre-processed along with all other probes on the microarrays. Based on these processed expression values the fold changes in eGFP-Htt expression with respect to the pre-induction state were calculated for each time point in both the control and the HD cells (Fig. 2d and e, respectively). Expression of eGFP-Htt mRNA increased rapidly after doxycycline induction. In the HD cells the doxycycline-induced expression increased faster than in the control cells. Similar to the observed eGFP-Htt-Q74 protein levels, mRNA expression in HD cells showed a peak at the early post-induction time points (4–8 h), followed by a fast decrease (36 h) and a subsequent drop to end-point levels (>48 h). In control cells, we observed a gradual increase, with eGFP-Htt mRNA expression levels reaching a peak at 48 h post-induction, followed by a decrease in expression. Probes with a reverse complementary (rc) sequence to the eGFP-Htt probes were used as negative controls. None of the rc probes exhibited a change in expression level as compared to the pre-induction state during the course of the experiment.
In both control and HD cells, a similar trend was observed for the eGFP-82, eGFP-162, linker and Htt probes (Fig. 2d, e). The only clear difference in comparing control and HD cells is noted in the expression of the probe recognizing the Htt repeat region of the construct. Here, we did not observe an increase in expression in control cells, whereas in the HD cells the expression pattern was similar to that observed for the linker probe. The increase in CAG repeat sequences in the eGFP-Htt-Q74 compared to the -Q23 control may explain improved hybridisation of this probe to the Htt-Q74 site.
Doxycycline induced changes in gene expression profiles
Using a Nimblegen-Roche 12-146 k microarray platform, we assayed approximately 26 k rat genes for transcriptional changes at regular intervals over a period of 120 h after induction of eGFP-Htt-Q23 (control) or eGFP-Htt-Q74 (HD). Principal Component Analysis (PCA) of the processed microarray data revealed a clear separation between the majority of control and HD samples based on the first principal component (Fig. 3a). Note that the HD samples taken 24 h post-induction did not group well with the other HD samples. Furthermore, a combination of the first and second principal component separated the HD samples into an ‘early’ (0-8 h, red oval) and ‘late’ (36-120 h, blue oval) cluster. A similar time-based clustering of the control samples was not as evident, indicating that the changes between control samples over time were smaller than the changes between HD samples.
We compared the expression values of genes in the control and HD cells before doxycycline induction to assess the baseline differences between the two cell lines (Additional file 5). The control and HD cells express the eGFP tagged HTT gene under control of a doxycycline promoter in a distinct genomic site. A set of 2059 genes were significantly differentially expressed (DE) (adjusted P-value < 0.05) between control and HD cells. Functional enrichment analysis on this set of genes revealed an overrepresentation of Gene Ontology (GO) terms (representing biological processes or molecular functions) predominantly related to cell division, apoptosis and stress responses (data not shown). Closer examination revealed that all five eGFP-Htt probes are included in this set of baseline DE genes and exhibited increased expression levels in the HD cells compared to the control cells. These findings indicate a clear and substantial difference between the two cell lines even before induction of the eGFP-Htt constructs. The baseline expression levels in the eGFP tagged polyQ HTT gene in the control and HD cells depend on the genomic integration site of the gene since doxycycline-induced tet-on gene expression systems exhibit relatively strong promoter induction. To prevent genes from being reported as DE due to transcriptional differences present in control versus HD samples before doxycycline induction, we compared measurements between each time point and the pre-induction state for both cell lines separately. We also determined which genes responded differently over time in the HD cell line compared to the control cell line, referred to as HD-specific genes. The latter analysis led to a set of 2,176 DE genes that were found to be DE (moderated F-statistics, adjusted P-value < 0.1) in at least one time point compared to the pre-induction state (Additional file 6).
Overall very few genes are significantly DE in the control cells prior to 120 h post-induction (Fig. 3b). In the HD cells we detected only few changes in gene expression before 36 h post-induction, especially after correcting for effects in the control cell line (Fig. 3b, orange lines). Roughly half the genes that were DE in the HD samples before 36 h post-induction dropped below the significance threshold after correcting for changes observed in the control samples. This suggests that in this time frame these genes exhibit a similar but non-statistically significant up- or down-regulation in the control cell line. These genes could exhibit an altered expression either due to doxycycline-induced overexpression of the HTT gene or due to a mutant Htt protein-induced gain-of-toxic function. At 36 h post-induction, the number of DE genes in HD samples greatly increased, both before and after correcting for changes observed in the control samples. Upon 72 h post-induction, the number of DE genes in the control samples started to increase, slowly at first but much more rapidly after 96 h. We found roughly equal amounts of up- and down-regulated genes in the HD and control samples. Both control and HD cells exhibited around 800 DE genes at 120 h post-induction and 141 of these genes were found to be DE in both control and HD cell lines at this time point. Out of this set, 55 genes exhibited increased expression in control cells while showing decreased expression in the HD cells. The remaining genes were all up-regulated in both cell types. For each time point, roughly half of the DE genes were also DE in the same direction for adjacent time points, indicating a more persistent change in expression.
We performed a cluster-based analysis to study the temporal behaviour of the HD-specific set of genes (Additional file 6). Fourteen distinct clusters were found that represented all of the 2176 selected genes (Fig. 4a and Additional file 7). The large majority of genes showed either one of two distinct patterns illustrating either an increased (Fig. 4a; cluster 1) or decreased expression (Fig. 4a; cluster 2) from around 24 h post-induction. Regardless of the direction (up or down) of the change in expression, a maximum effect was reached around 72 h. In general, few genes exhibited changes in expression before 24 h post-induction.
Functional enrichment analysis of gene clusters
We subjected the obtained gene clusters to functional enrichment analysis to identify overrepresented GO terms. Significantly overrepresented biological processes and molecular functions were only detected for gene clusters 1 and 2 (Fig. 4b). Cluster 1 was enriched for genes involved in metabolic processes, whereas the genes in cluster 2 were mostly involved in developmental processes. The two most significantly overrepresented GO terms in cluster 1 represent ‘antioxidant activity’ and ‘cellular ketone metabolic processes’.
To identify potentially pathogenic processes during early HD, we investigated the transcriptional changes at time points before 36 h post-induction in more detail. Given the modest changes in expression for these early time points, we selected HD-specific DE genes at the 4, 8 and 24 h time points using a less strict significance threshold (nominal P-value < 0.2). To select the more persistent changes in gene expression during this time frame, the resulting set was limited to those genes that were found DE at all three time points. A cluster analysis and subsequent functional enrichment analysis on this set of 404 ‘early response’ genes revealed only processes related to cell division and DNA replication to be overrepresented (Additional file 8). These processes were all detected in a single cluster comprised of genes being persistently down-regulated until the 72 h time point with exception of the 36 h time point (Additional file 8, panel A).
Promoter enrichment analysis of gene clusters
Promoter-based enrichment analysis of the observed gene clusters resulted in the identification of a number of transcription factors with binding sites overrepresented in the promoter regions of the genes included in each cluster (Fig. 4c). Among these transcription factors we observed several transcription factors that have previously been linked with HD such as Sp1, Nuclear transcription factor Y (NF-Y), E2F and Hypermethylated in cancer 1 (HIC1) [7, 8, 45,46,47].
We used the STRING database of known and predicted protein interactions to find potential relationships between the transcription factors identified in the promoter analysis of our detected DE genes. The transcription factors identified were used as input for STRING. The selected proteins were connected through known and predicted interactions and a small network could be established (Additional file 9).
HD is a monogenic neurodegenerative disease displaying widespread transcriptional deregulation as a key feature resulting from a CAG expansion in the HTT gene on chromosome 4p16.3 leading to the production of mutant Htt protein containing a prolonged polyQ tract that is prone to aggregation. The CAG repeat length is the major determinant of the age of onset although significant variability in the age of onset exists, underscoring disease complexity. There is an unmet need for disease-modifying therapies as currently available treatments can only ease disease symptoms. Knowledge of the cause(s) of early onset events provides important information to infer disease state-specific signatures to be used for diagnostic screening and to find targets for disease state-specific interference. Transcriptional deregulation and aggregate formation are key events of early disease onset but their temporal relationship during the earliest cellular changes is largely unresolved.
In the present study, we determined the time-resolved relationship between aggregate formation and changes at transcriptional level during early disease stages in an inducible HD cell model. Obviously, HD is a disease mostly known to occur in neuronal cells and in vitro cell line models have clear limitations to recapitulate HD in vivo pathology. However, considering the late onset of HD disease pathology and the large variability in disease pathology outcome, both in vitro and in vivo experimental models of HD have been invaluable to interpret disease pathogenesis, especially in the early phase of the disease. It is important to realize the challenges of using cell models, i.e., our analysis is based on control and HD cell lines that are intrinsically different, which clearly hampers analysis to interpret differentially expressed genes. It should however be noted that when analyzing patients intrinsic differences between patients are even worse and thereby largely complicating mechanistic insight in early onset phenomena. In this study, we used rat pheochromocytoma cells (PC12) containing a stably integrated doxycycline-inducible eGFP-tagged N-terminal Htt fragment with an expanded polyQ domain as a cell model to study early-state changes in Huntington’s disease. These cells have been well characterized and used by the field. The behavior of cells expressing the eGFP tagged Htt fragment has been extensively studied . Our data show that the kinetics of exon1 mRNA and protein levels in particular of differentially expressed genes in control and HD cell line are different. The eGFP-Htt-Q23 control and eGFP-Htt-Q74 HD cell lines have different baseline expression levels. Therefore, we did not compare individual time points between HD and control samples, but determined which genes responded differently over time between the two cell lines, thus taking differences at baseline into account thereby also correcting for the potential effect of adding doxycylin to the cells.
Our data indicate that the earliest cellular changes can be divided into three phases with distinct cellular phenotypes. The first alteration in transcriptional activity (before the 24 h post-induction time point) occurs in the absence of microscopically detectable aggregates. We observed that in this first phase deregulated genes are typically involved in cell division and DNA replication suggesting that reduced cellular replicative ability might represent a very early effect of Htt polyQ induction. It is intriguing that we detect deregulation of genes involved in cell division and DNA replication in our cell model. Possibly deregulation in these genes reflects a general hampered cell functioning causing overall cellular deregulation of main survival genes in an early phase. However, there is growing evidence that also glial cells are affected in HD and play a major role in the disease pathology [49, 50], making our findings on deregulated genes involved in cell division and replication interesting. Most likely genes affected prior to 24 h post-induction may be particularly sensitive to disruption of their normal functioning by the presence of polyQ-expanded Htt. The second phase (24–36 h post-induction) exhibits significant transcriptional deregulation coinciding with the appearance of the first small aggregates. At later time points when the number of small aggregates drops and large aggregates simultaneously increase (possibly illustrating small aggregate merging) a decrease in transcriptional deregulation is noted that might represent a protective effect due to the sequestration of polyQ-expanded Htt protein into these large aggregates.
Our aggregate volume measurements confirm previous observations in the same cell line , although contrary to Gong et al. (2008), we observe normal growth in the total cell volume (indicating healthy cellular viability) up to 72 h post-induction. Only images captured post-96 h show a substantial decrease in total cell volume (data not shown). Intermediate polyQ-expanded Htt accumulations have been shown to occur before the formation of full sized aggregates . Arrasate et al. (2004) illustrated that neuronal survival was negatively impacted by the concentration of diffuse polyQ-expanded Htt. The presence of aggregates was shown to reduce the concentration of diffuse polyQ-expanded Htt and an increased neuronal survival . It has been suggested that monomeric and oligomeric polyQ-expanded Htt protein exhibit cellular toxicity while formation of larger aggregates provides (at least temporary) protection from these toxic effects [52, 54]. These larger aggregates might eventually exhibit secondary pathogenic effects such as physically blocking vesicle or mitochondrial trafficking in the cell.
To understand which biological pathways are associated with the early-state temporal transcription changes, we used a clustering algorithm to group genes with similar expression patterns followed by a functional enrichment analysis. We identified a number of biological processes overrepresented in up- or down-regulated clustered genes that occur at the intermediate onset. The gene cluster representing upregulated genes shows overrepresentation of genes related to antioxidant activity and ketone metabolism whereas the down-regulated cluster of genes exhibits overrepresentation of several processes involved in development.
Our findings are in line with a study by van Roon-Mom et al. that analysed DE genes at day 1 and 5 after induction using the same cell line . Van Roon-Mom et al. identified mitochondrial (dys)function, oxidative stress responses and a set of genes involved in dopamine biosynthesis as early cellular onset phenomena. We compared our findings with this previous study. To this end, we investigated the transcriptional profiles of the gene set identified by van Roon-Mom et al. consisting of 21 Nrf2-regulated genes in our microarray data. We observed that eight of these genes were also found to be significantly DE in our HD-specific gene set and that these genes showed the same direction of change (up or down) as found by van Roon-Mom et al. (Additional file 6). Eleven of these genes were not significantly DE in our dataset. However, most of these genes did show a change in expression level in the same direction as was found by van Roon-Mom et al. The remaining two genes were not represented by probes on our array and could therefore not be examined. Furthermore, all five genes involved in dopamine biosynthesis identified by van Roon-Mom et al. were also found to be significantly down-regulated in our study (Additional file 6). In addition to the van Roon-Mom et al. study we show the trend of DE genes at a number of intermediate time points. Especially after 3 days we observe more extensive transcriptional deregulation. Taken together our study is a follow-up analysis of the previous transcription analysis study in the PC12 HD cell model.
Mitochondrial dysfunctionality and impaired energy (glucose) metabolism are well-known phenomena associated with Huntington’s disease, as well as other neurodegenerative disorders such as Parkinson’s disease or Amyotrophic Lateral Sclerosis (ALS) [56,57,58,59]. If neurons are unable to generate sufficient energy from glucose due to a limited glucose availability or ability to metabolise glucose, neurons may increase the production of enzymes capable of utilising an alternative energy source. At impaired glucose levels, ketone bodies are an alternative source of energy for neurons . Intriguingly, the overrepresentation of developmental processes suggests that HD belongs to the category of developmental disorders, which also includes schizophrenia, familial Alzheimer’s disease and Spinocerebellar ataxia 1 (SCA-1) . Studies of pre-symptomatic individuals carrying an expanded HTT gene have shown subtle developmental defects . Affected patients often experience weight loss besides severe motor abnormalities [63, 64]. In addition, pre-symptomatic children carrying the Htt polyQ expansion showed a lower body mass index . Moreover, brain scans reveal that HD patients have an enlarged cortex size and that pre-symptomatic individuals have a decreased intracranial volume . Energy regulation, cellular homeostasis and brain growth could typically represent early HD symptoms that arise before well-known HD symptoms and that predispose neurons to die later in life .
We identified a number of transcription factor binding sites that were overrepresented in promoter regions of the observed DE genes. Some of these transcription factors have previously been directly or indirectly linked to HD, i.e., Sp1, NF-Y, E2F, HIC1 and EGR-1 (Fig. 4c). It should be noted that in addition to promoter enrichment, indirect effects of regulatory or coding or non-coding regions could also be involved in causing an altered gene expression profile. Our data show that binding sites for EGR(−1) and Sp1 are overrepresented in both the up- and down-regulated genes in the early-onset gene clusters. EGR-1 and SP1 might exhibit a competitive interplay in genes that contain overlapping binding sites in their promoters for these transcription factors . Interaction between Sp1 and mutant Htt has been suggested to mediate neurotoxicity but on the other hand decreased Sp1 activity has been shown to be beneficial for HD mice . Deregulation of Sp1 activity through alteration of the Sp1 transcriptional complex may be responsible for most of the deleterious effects observed for Sp1, which may be ameliorated by an overall reduction of Sp1 activity. EGR-1 plays a role in the development of striatal medium spiny neurons , the cell type most affected by HD, in which its expression is shown to be regulated by the D1 and D2 dopamine receptors . Reduced expression of D1 and D2 dopamine receptors has been detected in post-mortem tissues in HD patients . We did not detect differential expression of dopamine receptors or EGR-1 (Additional file 6). The differences observed in EGR-1 activity are likely to result from more down-stream transcriptional deregulation.
We identified several particularly interesting transcription factors (Fig. 4c) that might provide a link between HD and other degenerative disorders caused by polyQ-expanded proteins, such as SCA17 and Spinobulbar Muscular Atrophy (SBMA). In SCA17, the activity of trimeric transcription factor NF-Y was shown to be reduced through sequestration of component A (NF-YA) by polyQ-expanded TATA-box binding protein (TBP) . In HD cell systems and the brains of HD mice, two of the NF-Y subunits were found sequestered in mutant Htt aggregates. Moreover, binding of NF-Y to the Heat Shock Protein 70 (Hsp70) promoter was noted to be decreased in HD brain resulting in a decreased heat-shock-induced chaperone response . We noted that NF-Y associates with down-regulated genes but we did not detect a significant change in the expression levels of heat-shock proteins. Possibly, down-regulation of these genes requires additional regulatory elements or signalling that occurs in the fully-differentiated neurons present in brain tissue or neighbouring cells of a different phenotype. The MYC-associated Zinc finger protein (MAZ) and related factor (MAZR) have both been implicated in regulating the expression of the androgen receptor (AR). Intriguingly, polyQ-expansion of AR leads to Spinal-Bulbar Muscular Atrophy [73, 74]. Two other transcription factors, CP2 and Zinc Finger protein 219 (ZNF219), have been implicated in the development of Alzheimer’s  and Parkinson’s disease , respectively. Alzheimer’s and Parkinson’s disease are not caused by repeat expansions in a single characteristic gene although these neurodegenerative disorders exhibit an advanced age disease onset similar as noted in HD. The transcription factors CP2 and ZNF219 might represent a change in transcriptional regulation that is generally noted in neurological diseases. A recent study in which Narayanan et al. analysed differential co-expression between gene pairs in expression data from patients suffering from Alzheimer’s or Huntington’s disease and non-diseased healthy controls showed a pattern of shared deregulation of gene expression between both sets of patients .
Our data support the current theory that small mutant Htt aggregates are more toxic to the cell than larger ones, at least regarding transcriptional dysregulation. The DE genes that we identified are enriched for antioxidant activity and energy metabolism, underscoring the importance of mitochondrial dysfunction and impaired developmental processes in early disease progression. We identified novel HD transcription factors and regulators using our time-resolved gene expression analysis. We also confirmed a number of transcription factors that previously have been linked to HD. We constructed a network of identified transcription factors that may play an important role in the early cellular phenotype and that is likely part of a larger network typically affected in neurodegenerative diseases [9, 21, 22, 78]. Identification of key regulators in our determined network may reveal avenues for active modulation of gene expression in Huntington’s disease and other polyQ-dependent disorders.
Amyotropic lateral sclerosis
CREB binding protein
Early growth response protein 1
Hypermethylated in cancer 1
Heat shock protein 70
HECT, UBA and WWE domain-containing protein 1
Myc-associated zinc finger protein
Nuclear factor Y
NF-E2 related factor 2
Principal component analysis
Arginine-glutamic acid dipeptide repeats
Spinobulbar muscular atrophy
Specificity protein 1
TATA-box binding protein
Transforming growth factor β
Zinc finger protein 219
Zoghbi HY, Orr HT. Glutamine repeats and neurodegeneration. Annu Rev Neurosci. 2000;23:217–47.
Benn CL, et al. Contribution of nuclear and extranuclear polyQ to neurological phenotypes in mouse models of Huntington's disease. Hum Mol Genet. 2005;14(20):3065–78.
Cornett J, et al. Polyglutamine expansion of huntingtin impairs its nuclear export. Nat Genet. 2005;37(2):198–204.
Wheeler VC, et al. Early phenotypes that presage late-onset neurodegenerative disease allow testing of modifiers in Hdh CAG knock-in mice. Hum Mol Genet. 2002;11(6):633–40.
Zuccato C, et al. Widespread disruption of repressor element-1 silencing transcription factor/neuron-restrictive silencer factor occupancy at its target genes in Huntington's disease. J Neurosci. 2007;27(26):6972–83.
Zuccato C, et al. Huntingtin interacts with REST/NRSF to modulate the transcription of NRSE-controlled neuronal genes. Nat Genet. 2003;35(1):76–83.
Dunah AW, et al. Sp1 and TAFII130 transcriptional activity disrupted in early Huntington's disease. Science. 2002;296(5576):2238–43.
Li SH, et al. Interaction of Huntington disease protein with transcriptional activator Sp1. Mol Cell Biol. 2002;22(5):1277–87.
Shimohata T, et al. Expanded polyglutamine stretches interact with TAFII130, interfering with CREB-dependent transcription. Nat Genet. 2000;26(1):29–36.
Reiner A, et al. Differential loss of striatal projection neurons in Huntington disease. Proc Natl Acad Sci U S A. 1988;85(15):5733–7.
Desplats PA, et al. Selective deficits in the expression of striatal-enriched mRNAs in Huntington's disease. J Neurochem. 2006;96(3):743–57.
Hodges A, et al. Regional and cellular gene expression changes in human Huntington's disease brain. Hum Mol Genet. 2006;15(6):965–77.
Lee ST, et al. Altered microRNA regulation in Huntington's disease models. Exp Neurol. 2011;227(1):172–9.
Luthi-Carter R, et al. Dysregulation of gene expression in the R6/2 model of polyglutamine disease: parallel changes in muscle and brain. Hum Mol Genet. 2002;11(17):1911–26.
Luthi-Carter R, et al. Decreased expression of striatal signaling genes in a mouse model of Huntington's disease. Hum Mol Genet. 2000;9(9):1259–71.
Marti E, et al. A myriad of miRNA variants in control and Huntington's disease brain regions detected by massively parallel sequencing. Nucleic Acids Res. 2010;38(20):7219–35.
Runne H, et al. Dysregulation of gene expression in primary neuron models of Huntington's disease shows that polyglutamine-related effects on the striatal transcriptome may not be dependent on brain circuitry. J Neurosci. 2008;28(39):9723–31.
Labadorf A, et al. RNA sequence analysis of human huntington disease brain reveals an extensive increase in inflammatory and developmental gene expression. PLoS One. 2015;10(12), e0143563.
Boutell JM, et al. Aberrant interactions of transcriptional repressor proteins with the Huntington's disease gene product, huntingtin. Hum Mol Genet. 1999;8(9):1647–55.
Kazantsev A, et al. Insoluble detergent-resistant aggregates form between pathological and nonpathological lengths of polyglutamine in mammalian cells. Proc Natl Acad Sci U S A. 1999;96(20):11404–9.
McCampbell A, et al. CREB-binding protein sequestration by expanded polyglutamine. Hum Mol Genet. 2000;9(14):2197–202.
Nucifora Jr FC, et al. Interference by huntingtin and atrophin-1 with cbp-mediated transcription leading to cellular toxicity. Science. 2001;291(5512):2423–8.
Steffan JS, et al. The Huntington's disease protein interacts with p53 and CREB-binding protein and represses transcription. Proc Natl Acad Sci U S A. 2000;97(12):6763–8.
Davies SW, et al. Formation of neuronal intranuclear inclusions underlies the neurological dysfunction in mice transgenic for the HD mutation. Cell. 1997;90(3):537–48.
DiFiglia M, et al. Aggregation of huntingtin in neuronal intranuclear inclusions and dystrophic neurites in brain. Science. 1997;277(5334):1990–3.
Kazantsev A, et al. A bivalent Huntingtin binding peptide suppresses polyglutamine aggregation and pathogenesis in Drosophila. Nat Genet. 2002;30(4):367–76.
Cummings CJ, Orr HT, Zoghbi HY. Progress in pathogenesis studies of spinocerebellar ataxia type 1. Philos Trans R Soc Lond B Biol Sci. 1999;354(1386):1079–81.
Donaldson KM, et al. Ubiquitin-mediated sequestration of normal cellular proteins into polyglutamine aggregates. Proc Natl Acad Sci U S A. 2003;100(15):8892–7.
Yu ZX, et al. Huntingtin inclusions do not deplete polyglutamine-containing transcription factors in HD mice. Hum Mol Genet. 2002;11(8):905–14.
Bennett EJ, et al. Global impairment of the ubiquitin-proteasome system by nuclear or cytoplasmic protein aggregates precedes inclusion body formation. Mol Cell. 2005;17(3):351–65.
Mitra S, Tsvetkov AS, Finkbeiner S. Single neuron ubiquitin-proteasome dynamics accompanying inclusion body formation in huntington disease. J Biol Chem. 2009;284(7):4398–403.
Arrasate M, Finkbeiner S. Protein aggregates in Huntington's disease. Exp Neurol. 2012;238(1):1–11.
Wyttenbach A, et al. Polyglutamine expansions cause decreased CRE-mediated transcription and early gene expression changes prior to cell death in an inducible cell model of Huntington's disease. Hum Mol Genet. 2001;10(17):1829–45.
de Leeuw W, Verschure PJ, van Liere R. Isualization and analysis of large data collections: a case study applied to confocal microscopy data. IEEE Trans Vis Comput Graph. 2006;12(5):1251–8.
Schroeder A, et al. The RIN: an RNA integrity number for assigning integrity values to RNA measurements. BMC Mol Biol. 2006;7:3.
Durinck S, et al. BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis. Bioinformatics. 2005;21(16):3439–40.
Durinck S, et al. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat Protoc. 2009;4(8):1184–91.
Kauffmann A, Gentleman R, Huber W. arrayQualityMetrics--a bioconductor package for quality assessment of microarray data. Bioinformatics. 2009;25(3):415–6.
Smyth GK. Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol. 2004;3:Article3.
Gautier L, et al. affy--analysis of Affymetrix GeneChip data at the probe level. Bioinformatics. 2004;20(3):307–15.
Leek JT, et al. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012;28(6):882–3.
Shamir R, et al. EXPANDER--an integrative program suite for microarray data analysis. BMC Bioinformatics. 2005;6:232.
Sharan R, Maron-Katz A, Shamir R. CLICK and EXPANDER: a system for clustering and visualizing gene expression data. Bioinformatics. 2003;19(14):1787–99.
Elkon R, et al. Genome-wide in silico identification of transcriptional regulators controlling the cell cycle in human cells. Genome Res. 2003;13(5):773–80.
Pelegri C, et al. Cell cycle activation in striatal neurons from Huntington's disease patients and rats treated with 3-nitropropionic acid. Int J Dev Neurosci. 2008;26(7):665–71.
Tang BL, Chua CE. SIRT1 and neuronal diseases. Mol Aspects Med. 2008;29(3):187–200.
Yamanaka T, et al. Mutant Huntingtin reduces HSP70 expression through the sequestration of NF-Y transcription factor. EMBO J. 2008;27(6):827–39.
Cisbani G, Cicchetti F. An in vitro perspective on the molecular mechanisms underlying mutant huntingtin protein toxicity. Cell Death Dis. 2012;3, e382.
Kwan W, et al. Mutant huntingtin impairs immune cell migration in Huntington disease. J Clin Invest. 2012;122(12):4737–47.
Sapp E, et al. Early and progressive accumulation of reactive microglia in the Huntington disease brain. J Neuropathol Exp Neurol. 2001;60(2):161–72.
Gong B, et al. Time-lapse analysis of aggregate formation in an inducible PC12 cell model of Huntington's disease reveals time-dependent aggregate formation that transiently delays cell death. Brain Res Bull. 2008;75(1):146–57.
Poirier MA, et al. Huntingtin spheroids and protofibrils as precursors in polyglutamine fibrilization. J Biol Chem. 2002;277(43):41032–7.
Arrasate M, et al. Inclusion body formation reduces levels of mutant huntingtin and the risk of neuronal death. Nature. 2004;431(7010):805–10.
Grote SK, La Spada AR. Insights into the molecular basis of polyglutamine neurodegeneration from studies of a spinocerebellar ataxia type 7 mouse model. Cytogenet Genome Res. 2003;100(1–4):164–74.
van Roon-Mom WM, et al. Mutant huntingtin activates Nrf2-responsive genes and impairs dopamine synthesis in a PC12 model of Huntington's disease. BMC Mol Biol. 2008;9:84.
Beal MF. Mitochondria take center stage in aging and neurodegeneration. Ann Neurol. 2005;58(4):495–505.
Browne SE, et al. Oxidative damage and metabolic dysfunction in Huntington's disease: selective vulnerability of the basal ganglia. Ann Neurol. 1997;41(5):646–53.
Gu M, et al. Mitochondrial defect in Huntington's disease caudate nucleus. Ann Neurol. 1996;39(3):385–9.
Shukla V, Mishra SK, Pant HC. Oxidative stress in neurodegeneration. Adv Pharmacol Sci. 2011;2011:572634.
Morris AA. Cerebral ketone body metabolism. J Inherit Metab Dis. 2005;28(2):109–21.
Mehler MF, Gokhan S. Mechanisms underlying neural cell death in neurodegenerative diseases: alterations of a developmentally-mediated cellular rheostat. Trends Neurosci. 2000;23(12):599–605.
Nopoulos PC, et al. Smaller intracranial volume in prodromal Huntington's disease: evidence for abnormal neurodevelopment. Brain. 2011;134(Pt 1):137–42.
Marder K, et al. Dietary intake in adults at risk for Huntington disease: analysis of PHAROS research participants. Neurology. 2009;73(5):385–92.
Carroll JB, et al. Treating the whole body in Huntington's disease. Lancet Neurol. 2015;14(11):1135–42.
Lee JK, et al. Measures of growth in children at risk for Huntington disease. Neurology. 2012;79(7):668–74.
Paulsen JS, et al. Brain structure in preclinical Huntington's disease. Biol Psychiatry. 2006;59(1):57–63.
Cui MZ, et al. Transcriptional regulation of the tissue factor gene in human epithelial cells is mediated by Sp1 and EGR-1. J Biol Chem. 1996;271(5):2731–9.
Qiu Z, et al. Sp1 is up-regulated in cellular and transgenic models of Huntington disease, and its reduction is neuroprotective. J Biol Chem. 2006;281(24):16672–80.
Keilani S, et al. Egr-1 induces DARPP-32 expression in striatal medium spiny neurons via a conserved intragenic element. J Neurosci. 2012;32(20):6808–18.
Spektor BS, et al. Differential D1 and D2 receptor-mediated effects on immediate early gene induction in a transgenic mouse model of Huntington's disease. Brain Res Mol Brain Res. 2002;102(1–2):118–28.
Augood SJ, Faull RL, Emson PC. Dopamine D1 and D2 receptor gene expression in the striatum in Huntington's disease. Ann Neurol. 1997;42(2):215–21.
Lee LC, et al. Role of the CCAAT-binding protein NFY in SCA17 pathogenesis. PLoS One. 2012;7(4), e35302.
Jiao L, et al. The prostate cancer-up-regulated Myc-associated zinc-finger protein (MAZ) modulates proliferation and metastasis through reciprocal regulation of androgen receptor. Med Oncol. 2013;30(2):570.
Pero R, et al. PATZ attenuates the RNF4-mediated enhancement of androgen receptor-dependent transcription. J Biol Chem. 2002;277(5):3280–5.
Bertram L, et al. Further evidence for LBP-1c/CP2/LSF association in Alzheimer's disease families. J Med Genet. 2005;42(11):857–62.
Clough RL, Dermentzaki G, Stefanis L. Functional dissection of the alpha-synuclein promoter: transcriptional regulation by ZSCAN21 and ZNF219. J Neurochem. 2009;110(5):1479–90.
Narayanan M, et al. Common dysregulation network in the human prefrontal cortex underlies two neurodegenerative diseases. Mol Syst Biol. 2014;10:743.
Luthi-Carter R, et al. Polyglutamine and transcription: gene expression changes shared by DRPLA and Huntington's disease mouse models reveal context-independent effects. Hum Mol Genet. 2002;11(17):1927–37.
We thank the MicroArray Department, Swammerdam Institute for Life Sciences, University of Amsterdam for the use of their microarray facility and expertise and D.C. Rubinsztein, Cambridge Institute for Medical Research for providing us with the inducible PC12 HD cell lines.
PJV obtained funding for this study from The Huntington patient association, The Netherlands and the Hertzberger fund, University of Amsterdam and from the Netherlands Dutch Research Organization (NWO-Meervoud 83607001 and NWO-VIDI 016041311 to PJV). The funding body did not play a role in the study design, collection, analysis and interpretation of the study.
Availability of data and materials
The microarray data supporting the conclusions of this article is available in the NCBI Gene Expression Omnibus in a MIAME compliant format. The other datasets supporting the conclusions of this article are included within the article (and its additional files).
PJV conceived and designed the experiments. PDM guided bioinformatic data analysis. DGEP performed the experiments. WCdL and IMV performed the microscopy analysis. MvH, PDM and PJV analyzed the data. MvH, PDM, PJV wrote the paper. WMCvR-M gave critical suggestions specially focused on the HD field. All authors read and approved the final manuscript.
The authors declare that they have no competing interest.
Consent for publication
Consent for publication is not applicable.
Ethics approval and consent to participate
Ethics approval and consent are not applicable in this study.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional probes used on the microarray. Probes added to the default Nimblegen library to interrogate additional rat genes (A) or as expression controls for eGFP-Htt levels (B). (PDF 34 kb)
Rat gene representation by microarray library probes. Distribution of the number of probes that represent the various rat genes interrogated by the Nimblegen default probe library after reannotation. The number of genes targeted by probes that could be associated with an Ensembl gene ID are indicated in the second column. The third column indicates the number of genes represented solely by probes that could not be linked to an Ensembl ID. In these cases the original RefSeq IDs were kept. (XLSX 10 kb)
Absence of eGFP-Htt-Q23 aggregate formation upon doxycycline induction. PC12 cells expressing eGFP-Htt-Q23 (control) upon doxycycline induction were imaged at various time points post-induction (4 – 120 h) using fluorescence living cell imaging. A-H show the Htt-Q23 eGFP expressing cells at the various post-induction time points (representative examples of at least four replicates per time point). (PPTX 647 kb)
The table shows the mean log2-ratio with respect to the Cy5-labelled common reference of all probes used to measure eGFP-Htt expression levels, i.e. eGFP-82, eGFP-162, Linker, HTT, HTT repeat, eGFP-82 rc, eGFP-162 rc, Linker rc, HTT rc, HTT repeat rc (Fig. 2c; rc: reverse complementary) and also the probe targeting endogenous Htt (ENSRNOG00000011073) upon doxycline induction in the Q23 control and Q74 Htt cell line. The expression values of these probes were pre-processed along with all other probes on the microarrays and the fold changes in eGFP-Htt expression with respect to the pre-induction state (0 h) in control and HD cells were calculated for each time point as shown in Fig. 2d, e. (XLSX 15 kb)
Expression values of genes (with Ensembl Gene ID) in the control and HD cells were compared before doxycycline induction to assess the baseline differences between the two cell lines. A set of 2,059 genes were significantly differentially expressed (adjusted P-value < 0.05) in control and HD cells. (XLSX 182 kb)
Huntington-specific differentially expressed genes. Genes (with Ensembl Gene ID) found to be DE in at least one time point (adjusted P-value < 0.1, moderated F-test) compared to the pre-induction state in the HD samples after correction for effects observed in the control samples. Genes are identified by Ensembl and Entrez Gene ID and gene symbol in the first three columns. The (adjusted) overall P-value gives the (adjusted) P-value for a gene to be significantly DE in at least one time point measured in the experiment. Each gene was placed in a cluster as determined by the CLICK algorithm. FC shows the fold change in expression compared to the pre-induction state with (adjusted) P-value listing the (adjusted) P-value resulting from significance testing of differential expression of the gene at that time point. Nrf2-regulated genes and genes involved in dopamine biosynthesis identified by van Roon-Mom et al.  are highlighted in yellow and blue, respectively. (XLSX 891 kb)
Expression profiles for all fourteen clusters created using the CLICK algorithm (Expander). Expression patterns for all clusters were standardized to mean zero and standard deviation one. Error bars correspond to one standard deviation. (PDF 360 kb)
Expression profile and functional enrichment analysis of early persistently DE genes. (A) Genes were selected from the HD-specific set of DE genes and further limited to those exhibiting a significant change in expression level during the full first 24 h post-induction using a relaxed significance threshold (P-value < 0.2). The resulting set of 404 genes was then subjected to cluster analysis followed by functional enrichment analysis. (B) Only one cluster was found to contain genes that overrepresented biological processes. Error bars correspond to one standard deviation. (PDF 224 kb)
Network of interactions between the transcription factors identified in the promoter analysis of DE genes in our experiment. Interactions between the transcription factors identified in the PRIMA analysis were extracted using the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and its database of known protein interactions (http://string-db.org/; version 10.0). Gene symbols were used as input while selecting Homo sapiens as species. The network was created based on high confidence interactions (cut-off score of 0.7) and visualised using the ‘Molecular Action’ view option to show the type of interaction between the nodes. The table shows the name of the retrieved transcription factors, the alias used in STRING and a few comments. The network illustrates the detected connections. (PDF 521 kb)
About this article
Cite this article
van Hagen, M., Piebes, D.G.E., de Leeuw, W.C. et al. The dynamics of early-state transcriptional changes and aggregate formation in a Huntington’s disease cell model. BMC Genomics 18, 373 (2017). https://doi.org/10.1186/s12864-017-3745-z
- Huntington's disease
- Inducible HD cell line
- Early HD cellular phenotype
- Temporal analysis
- Gene clusters