- Research article
- Open Access
Genome-wide analysis of the Populus Hsp90 gene family reveals differential expression patterns, localization, and heat stress responses
BMC Genomics volume 14, Article number: 532 (2013)
Members of the heat shock protein 90 (Hsp90) class of proteins are evolutionarily conserved molecular chaperones. They are involved in protein folding, assembly, stabilization, activation, and degradation in many normal cellular processes and under stress conditions. Unlike many other well-characterized molecular chaperones, Hsp90s play key roles in signal transduction, cell-cycle control, genomic silencing, and protein trafficking. However, no systematic analysis of genome organization, gene structure, and expression compendium has been performed in the Populus model tree genus to date.
We performed a comprehensive analysis of the Populus Hsp90 gene family and identified 10 Populus Hsp90 genes, which were phylogenetically clustered into two major groups. Gene structure and motif composition are relatively conserved in each group. In Populus trichocarpa, we identified three paralogous pairs, among which the PtHsp90-5a/PtHsp90-5b paralogous pair might be created by duplication of a genome segment. Subcellular localization analysis shows that PtHsp90 members are localized in different subcellular compartments. PtHsp90-3 is localized both in the nucleus and in the cytoplasm, PtHsp90-5a and PtHsp90-5b are in chloroplasts, and PtHsp90-7 is in the endoplasmic reticulum (ER). Furthermore, microarray and semi-quantitative real-time RT-PCR analyses show that a number of Populus Hsp90 genes are differentially expressed upon exposure to various stresses.
The gene structure and motif composition of PtHsp90s are highly conserved among group members, suggesting that members of the same group may also have conserved functions. Microarray and RT-PCR analyses show that most PtHsp90s were induced by various stresses, including heat stress. Collectively, these observations lay the foundation for future efforts to unravel the biological roles of PtHsp90 genes.
Plants are exposed to various environmental stresses. Primary stresses such as high light intensity, heat shock, drought, chilling, salinity, and chemical pollutants act simultaneously on plants, causing cell injury and producing secondary stresses such as osmotic and oxidative stresses . Plants cannot avoid exposure to these factors, but adapted morphologically and physiologically by some mechanisms. Biosynthesis of many proteins called “stress proteins” is induced to protect cells from these harmful stimuli .
Heat shock proteins (Hsps) are responsible for protein folding, assembly, translocation, and degradation in many normal cellular processes. They stabilize proteins and membranes, and can assist in protein refolding under stress conditions. They also play a crucial role in protecting plants from stresses by reestablishing normal protein conformations and thus cellular homeostasis . Plant Hsps are classified into five families according to their molecular size: Hsp100, Hsp90, Hsp70, Hsp60, and small Hsps (sHsps). They have been well characterized in a few model plants such as the tomato, Arabidopsis, and rice [3, 4].
Hsp90s are a class of chaperone proteins that are highly conserved in prokaryotes and all eukaryotes. They are the major species of molecular chaperones and require ATP for their functions . Although Hsp90s are expressed in most organisms, their expression increases in response to stresses. Distinct from many other well-characterized molecular chaperones, Hsp90s display considerable specificity for their client proteins. Most of their known substrates are signal-transduction proteins such as steroid hormone receptors and signaling kinases . Although the major function of Hsp90s is to assist protein folding, they play key roles in signal transduction, cell-cycle control, protein degradation, genomic silencing, and protein trafficking [5, 6]. Expression of Hsp90 in Arabidopsis is developmentally regulated and is responsive to heat, cold, salinity, heavy metals, phytohormones, and light and dark transitions [4, 7]. Tobacco NbHsp90-1 and Arabidopsis AtHsp90-2 confer pathogen resistance by reacting to resistance proteins (R proteins), which are signal receptors from the pathogen [8, 9]. In addition, Hsp90s interact with the 26S proteasome and play a key role in its ATP-dependent assembly and maintenance in budding yeast . To fulfill their cellular roles, Hsp90s cooperate with other chaperones to form a multiprotein chaperone complex . Moreover, Hsp90s also act as buffers to phenotypic changes and are portrayed as “capacitors for evolution” .
Hsp90s are encoded by multiple genes. They consist of conserved N-terminal and C-terminal domains that are joined by a charged linker region that varies in length. Genes encoding cytosol-, ER-, and plastid-localized Hsp90 proteins have been characterized in several plant species . In the Arabidopsis genome, seven Hsp90 family members have been identified. Sequence analyses of Arabidopsis Hsp90 family genes have revealed two major subfamilies. AtHsp90-1–4 proteins containing the C-terminal pentapeptide MEEVD form the cytoplasmic subfamily; AtHsp90-5–7 form the other subfamily. AtHsp90-5 and AtHsp90-7 are localized in chloroplasts  and the endoplasmic reticulum (ER) , respectively. AtHsp90-6 is localized in mitochondria . Overexpression of cytosolic AtHsp90-2, chloroplast-localized AtHsp90-5, and ER-localized AtHsp90-7 reduces tolerance to salt and drought stresses, but improves tolerance to high concentrations of Ca2+. The induction of ABA-responsive genes is delayed by overexpression of cytosolic AtHsp90-2, but is hardly affected by overexpression of AtHsp90-5 and AtHsp90-7 under conditions of salt and drought stress, which implies that different cellular compartment-localized Hsp90s in Arabidopsis might contribute to responses to abiotic stresses by different functional mechanisms, probably through ABA- or Ca2+-dependent pathways .
The Populus genus comprises woody plants that are important to humans and animals. Completion of the P. trichocarpa genome sequence in 2006 rendered it a model species for research on trees , providing an opportunity to analyze and further understand Hsp90s. To determine the structure-function relationship of Hsp90s in the Populus genus, we performed detailed systematic analyses of genome organization, gene structure, and expression compendium. We report the comprehensive genomic identification and phylogenetic analysis of all 10 members of the Hsp90 gene family in the Populus genus, as well as their expression profiles in different tissues and their responses under heat stress. Our results provide a framework for further functional investigations of these genes.
Results and discussion
Identification of the Hsp90 gene family in P. trichocarpa and other plant species
To identify putative Populus Hsp90 genes, we first searched relevant databases using the corresponding Arabidopsis Hsp90 protein sequences as queries. Additional searches were performed based on keyword querying. After removing redundant sequences, we identified 10 candidate Hsp90 sequences in the genome of P. trichocarpa. All PtHsp90 candidates were analyzed using the Conserved Domain Database (CCD) [18, 19] and Pfam (http://pfam.sanger.ac.uk/). It was previously reported that there are seven Hsp90 genes presented in Arabidopsis. The number of Hsp90 genes in P. trichocarpa genome is in consistency with the ratio of 1.4-1.6 putative poplar homologs for each Arabidopsis gene according to comparative genomics studies . This indicates that the higher number of Hsp90 members in poplar is due to the expansion of gene families during the genome duplication and the genomic evolution followed. The Hsp90 genes identified in P. trichocarpa encode proteins ranging from 698 to 823 amino acids (aa) in length, with predicted isoelectric points (pIs) ranging from 4.85 to 5.53 (Table 1). The polypeptides are also predicted to contain a Histidine kinase-like ATPases (HATPase_c) family motif and a Hsp90 family motif (Additional file 1). HATPase_c domain belongs to the ATP binding superfamily including diverse protein families such as DNA topoisomerase II, molecular chaperones Hsp90, DNA-mismatch-repair enzymes, phytochrome-like ATPases and histidine kinases . Detailed information on the Hsp90 family genes in P. trichocarpa, Arabidopsis, and rice is given in Table 1 and Additional file 1.
To investigate the evolutionary relationships of Hsp90 proteins from different plants, we identified Hsp90 genes from seven other plant species, including the moss Physcomitrella patens, the monocotyledonous angiosperms Oryza sativa, Sorghum bicolor, and Brachypodium distachyon, the dicotyledonous angiosperms Arabidopsis thaliana, Vitis vinifera, and Medicago truncatula. All angiosperm genomes, as well as the moss genome, contain Hsp90 genes. The number of Hsp90s identified is seven in A. thaliana, ten in P. trichocarpa, five in V. vinifera, five in M. truncatula, eight in O. sativa, seven in S. bicolor, eight in B. distachyon, and ten in P. patens. Additional file 3 provides a complete list of all Hsp90 genes identified in the present study.
Phylogenetic analyses of the Hsp90 gene family
To examine the phylogenetic relationships among the Hsp90 genes in P. trichocarpa and other plant species, we first generated a maximum likelihood phylogenetic tree by aligning full-length Hsp90 protein sequences from eight different plant species using PhyML. All of the sequences are classified into two major groups (group I and II), each of which is further divided into two subgroups (subgroup Ia, Ib, IIa and IIb) (Figure 1). The distribution of Hsp90 members in different species varies, and subgroups Ib and IIa are the largest two subgroups. There are two subgroup Ia members in P. trichocarpa, but none in moss and only one in the other species examined (Table 2). There are also more group Ib Hsp90 members in moss than these in the other species analyzed.
Next, we constructed a phylogenetic tree of the Hsp90 protein sequences from Populus, Arabidopsis, and rice using the neighbor-joining (NJ) method (Figure 2A). The tree topologies produced by two algorithms are largely comparable, with only minor differences at interior branches (Figure 2A). Distance and percentage of identity among Populus, Arabidopsis, and rice Hsp90 proteins are given in Additional file 4. Phylogenetic analysis shows that there is high similarity among the cytosolic members and less similarity among the organelle-type members. In addition, both trees show that the most recent duplicated pairs (Hsp90-1a/Hsp90-1b, Hsp90-4a/Hsp90-4b and Hsp90-5a/Hsp90-5b) exhibit high similarity, which indicates that they evolved slowly in sequence and structure, and may still keep their function.
It is more accurate to reflect an evolutionary relationship by using conserved domain sequences . Therefore we also constructed the phylogenetic tree with the conserved Hsp90 motif sequences from Populus, Arabidopsis, and rice using the maximum likelihood method with 1000 bootstrap replicates (Additional file 6). The resulted phylogenetic tree is consistent with the one generated based on the full length protein sequences.
Gene structure and conserved motifs of Hsp90 genes in Populus, Arabidopsis, and rice
To further investigate the structural diversity of Hsp90 genes in Populus, Arabidopsis, and rice, we first constructed a separate phylogenetic tree using the full-length Hsp90 protein sequences from these three species. The Hsp90 proteins are classified into two groups as described above (Figure 2A). Then we analyzed the exon/intron organization in the coding sequence of each Hsp90 gene (Figure 2B). In general, the positions of some spliceosomal introns are conserved in orthologous genes. In many cases, conservation of exon/intron organization or gene structure in paralogous genes is high and sufficient to reveal the evolutionary relationship between introns . In the present study, Hsp90 gene family members within the same group shared similar gene structures in terms of intron number or exon length (Figure 2B). Hsp90 group I comprises the cytosolic Hsp90s whose members have two or three introns, while group II comprises organelle-type Hsp90s, which have 13–19 introns (Figure 2 and Table 1). The gene structure difference between group I and group II Hsp90 might associate with their functions in different biological processes in subcellular compartments. We also investigated intron phases with respect to codons. The intron phases are remarkably well conserved among group members, while the intron arrangements and intron phases are strikingly distinct between groups (Figure 2). This may lend support to the results of phylogenetic and genome duplication analyses. We further examined the exon/intron organization of paralogous pairs of Hsp90 genes to explore traceable intron gain or loss within these genes. Three paralogous pairs in Populus (PtHsp90-1a/1b, PtHsp90-4a/4b, and PtHsp90-5a/5b) show conserved exon/intron structures in terms of intron number or gene length, while OsHsp90-5a shows a single intron gain event during the structural evolution of the OsHsp90-5a/5b paralogous pair. Interestingly, OsHsp90-4 has an additional C-terminal exon compared to other members of the group I Hsp90.
Next, we predicted the major domains of these proteins in all three species using Pfam and CDD . All of the proteins contain a HATPase_c superfamily domain and a Hsp90 family domain (Additional file 1). Although the tools we used are suitable for defining the presence or absence of recognizable domains, they are unable to recognize smaller individual motifs and more divergent patterns. Thus, the program MEME was used to further study the diversification of these proteins . Twenty distinct motifs were identified (Figure 2C). Details of the 20 motifs are presented in Additional file 5. Most of the closely related members have common motif composition, suggesting possible functional similarity among these Hsp90 proteins (Figure 2C). Motif 2 and 6 (corresponding to the HATPase_c superfamily domain at the N-terminus) are found in all Hsp90 proteins from the species we examined. It has reported that both ATP binding and hydrolysis are required for Hsp90 function in vivo[23, 24]. Noticeably, motif 20 (representing the LEA_6 subdomain) is only found in OsHsp90-4. This additional LEA_6 subdomain might explain the specific ability of OsHsp90-4 to acclimatize to various stresses.
Chromosomal location and gene duplication of Hsp90 genes in Populus, Arabidopsis, and rice
Chromosomal mapping of the gene loci shows that the 10 PtHsp90 genes are distributed unevenly among nine chromosomes (Additional file 7). Two PtHsp90 genes are localized on chromosome I, and one is localized on each of chromosome IV, V, VI, VIII, X, XIV, XVI, and XVII. Gene duplication events are thought to occur frequently in organismal evolution [25, 26]. Previous studies report that the Populus genome has experienced at least two genome-wide duplication events (eurosid and salicoid), followed by a series of chromosomal reorganizations involving reciprocal tandem/terminal fusions and translocations . To investigate the possible relationship between Hsp90 genes and segmental chromosome duplication, we also compared the locations of Hsp90 genes in duplicated chromosomal blocks that were previously identified in Populus, Arabidopsis, and rice [17, 27, 28]. Their distributions are shown in Additional file 7 (Populus), Additional file 8 (Arabidopsis), and Additional file 9 (rice). The results suggest that segmental duplication and transposition events are not the major factors that led to the expansion of the Populus Hsp90 gene family. It may be that dynamic changes occurred following segmental duplication and led to the loss of many of the duplicated Hsp90 genes.
A search for duplicated genes using the Plant Genome Duplication Database (PGDD; http://chibba.agtec.uga.edu/duplication/) revealed the existence of three gene pairs (PtHsp90-1a/PtHsp90-1b, PtHsp90-3/PtHsp90-4b, and PtHsp90-5a/PtHsp90-5b) in P. trichocarpa (Additional file 10A) and two pairs (OsHsp90-2/OsHsp90-3 and OsHsp90-5a/OsHsp90-5b) in O. sativa (Additional file 10B). Interestingly, PtHsp90-4a was not assigned as a duplicated gene with PtHsp90-3 and PtHsp90-4b, indicating that PtHsp90-4a had experienced intensive recombination events after the recent duplication with PtHsp90-4b, which led to the great divergence in its adjacent regions. Of the three Hsp90 pairs in Populus that we examined, only one pair, PtHsp90-5a/PtHsp90-5b, remained in a conserved position in segmental duplicated blocks (Additional file 7), suggesting that only this paralogous pair survived during the evolutionary process after chromosome duplication event.
Subcellular localization of Populus Hsp90 proteins
In silico analyses using the protein subcellular localization prediction software WoLF PSORT (http://wolfpsort.org) enabled us to predict the likely protein localization of each of the different candidate Hsp90s in Populus. PtHsp90-3 is predicted to be localized in the nucleus or in the cytosol with high reliability, while PtHsp90-5a and PtHsp90-5b are predicted to be localized in chloroplasts, PtHsp90-6 is predicted to be localized in mitochondria, and PtHsp90-7 is predicted to be localized in the ER. For the other PtHsp90 proteins, the cytosol is predicted to be their most likely location (Table 1). To confirm their predicted localizations, some of these proteins were transiently expressed in tobacco leaf epidermal cells as fusions with the N-terminus of YFP. Four Hsp90 proteins were successfully expressed as fluorescent protein fusions (PtHsp90-3-YFP, PtHsp90-5a-YFP, PtHsp90-5b-YFP, and YFP-PtHsp90-7). Based on sequence analysis, PtHsp90-1a, PtHsp90-1b, PtHsp90-2, PtHsp90-3, PtHsp90-4a, and PtHsp90-4b contain the C-terminal pentapeptide MEEVD (Additional file 2), which is characteristic of cytoplasmic Hsp90 proteins both in plants and in animals. In Arabidopsis, it was confirmed that two cytoplasmic Hsp90s (AtHsp90-1 and AtHsp90-3) are localized both in the nucleus and in the cytoplasm . As shown in Figure 3A, the fluorescent signal of PtHsp90-3-YFP is also detected both in the nucleus and in the cytoplasm. This is consistent with the subcellular localizations of cytoplasmic Arabidopsis Hsp90 proteins [4, 15]. Using the autofluorescence of chlorophyll as a marker, we found that the fluorescent signals of both PtHsp90-5a-YFP and PtHsp90-5b-YFP are well co-localized with red chlorophyll autofluorescence (Figure 3B and 3C). A transit peptide for the import into mitochondria was identified in the N-terminal region of PtHsp90-6, but the intercellular localization of PtHsp90-6 remains to be confirmed experimentally. The PtHsp90-7 protein sequence contains a C-terminal KDEL ER-retention motif (Additional file 2). When YFP-PtHsp90-7 is co-expressed with the well-characterized luminal ER marker GFP-HDEL , it is co-localized with GFP-HDEL (Figure 3D), which confirms its ER localization. These results suggest that the localization of Hsp90s in the same subgroup is relatively conserved among different species. The conserved organelle localization of Hsp90 implies that they might play roles in organelle-specific development or stress response. It has been suggested that mutation of the chloroplast-localized AtHsp90-5 causes altered response to red light, chlorate resistance and constitutively delayed chloroplast development in the cr88 mutant [13, 31, 32]. In animals, a mitochondrial-localized Hsp90 appeared to have a critical role in cell cycle progression, cellular differentiation, and apoptosis [33, 34]. In tobacco, mitochondrial-localized Hsp90 was involved in the N gene-dependent cell death by affecting downstream MAPK cascade function . Mutation of the ER-localized AtHsp90-7 produced floral and shoot meristem phenotypes in the shepherd mutant that closely resemble that of the three clavata (clv) mutants in Arabidopsis[14, 36, 37]. The conserved subcellular localization of Hsp90s might provide clues for their specific cellular functions.
Differential expression patterns of Hsp90 genes in Populus
The expression patterns of genes can provide useful clues for the functions of these genes. To verify the expression profiles of Populus Hsp90 genes, the RNA-seq data of different Populus vegetative tissues (unpublished data) were used to analyze the expression of PtHsp90 genes. PtHsp90-5a and PtHsp90-5b are mainly expressed in the young leaves (YL) and mature leaves (ML) (Figure 4A), which is consistent with their localization in chloroplasts (Figures 3B and 4C). The expression of PtHsp90-5a in the young leaves is stronger than that in the mature leaves, suggesting PtHsp90-5a may play roles in young leaf development. The other PtHsp90 genes are mainly expressed in stems including primary stem (PS) or secondary stem (SS). PtHsp90-1b is highly expressed in secondary stem, while PtHsp90-1a, PtHsp90-6 and PtHsp90-7 are mainly expressed in primary stem. These results imply that these PtHsp90s might be involved in different stages of stem development. The transcription levels of PtHsp90-2, PtHsp90-3, PtHsp90-4a and PtHsp90-4b are higher than these of the other PtHsp90 members. PtHsp90-4a and PtHsp90-4b are ubiquitously highly expressed in almost all detected tissues (Additional file 11). In order to verify the expression profiles of PtHsp90 genes obtained by RNA-seq, qRT-PCR analysis of seven selected PtHsp90 genes was performed on three different tissues (Figure 4B). The average expression of each gene was calculated relatively to the value of the first replication of roots ± standard error (SE) (n≥3). The gene expression pattern detected by qRT-PCR is generally consistent with the RNA-seq results. The different expression patterns of PtHsp90s in different tissues imply that PtHsp90 members may be involved in different biological processes.
Differential stress responses of Hsp90 genes in Populus
In order to reveal the responses of Populus Hsp90 genes to abiotic stresses, we analyzed the expression profiles of PtHsp90s under abiotic stresses such as heat, low nitrogen levels, mechanical wounding, drought, and methyl jasmonate (MeJ) treatment. Affymetrix microarray data (series accession numbers GSE26199, GSE16786 and GSE17230 in the Gene Expression Omnibus [GEO])  were used to analyze the global expression profiles of Populus Hsp90 genes. Previous study divided the physiological condition into four states according to Populus photosynthetic activity from 22°C to 42°C: baseline (22°C, the growth temperature), optimum (31.75°C, temperature producing the maximum net CO2 assimilation rate), 20% inhibition of optimum (38.4°C) and 30% inhibition of optimum (40.5°C) . Most PtHsp90 genes are upregulated under heat stress. The expression of PtHsp90-1a and PtHsp90-1b is highly induced immediately when temperature increases to optimum. PtHsp90-5a and PtHsp90-6 are highly induced when the photosynthesis is inhibited by 30% under heat stress (Figure 5A). In PtHsp90 group I, PtHsp90-1a, PtHsp90-1b, and PtHsp90-3 in both the Soligo and Carpacio genotypes are upregulated under almost all drought stresses tested, including the early response (EAR) to drought at 36 h, and the long-term (10 days) responses to mild stress (LMI) and moderate stress (LMO) (Figure 5C). Nitrogen deficiency stress causes different responses among Hsp90 genes. For instance, PtHsp90-1a and PtHsp90-1b are upregulated in 4-week-old young leaves (YL) and 8-week-old expanded leaves (EL) of genotype 1979 and genotype 3200; PtHsp90-5a and PtHsp90-5b are upregulated in 8-week-old expanded leaves (EL) of the same two genotypes. However, PtHsp90-3, PtHsp90-4a, and PtHsp90-6 are downregulated in 8-week-old expanded leaves in genotype 1979 and/or genotype 3200 (Figure 5B). In response to mechanical wounding stress, six genes (PtHsp90-1a, PtHsp90-1b, PtHsp90-3, PtHsp90-5b, PtHsp90-6, and PtHsp90-7) are significantly downregulated in young leaves and/or expanding leaves 1 week after wounding. In response to MeJ feeding in cell culture, only PtHsp90-1a and PtHsp90-1b are slightly downregulated (Figure 5B).
The responses of PtHsp90 genes to heat stress were analyzed experimentally. Heat-stress treatment comprising pretreatment for 3 h at 37°C and subsequent treatment at 45°C for 3 h, with a 2-h recovery interval, was performed. Most genes are induced by heat stress (Figure 6, Additional file 12). We classified the PtHsp90s into four classes according to their expression profiles under heat stress. Class I genes are induced immediately by both 37°C pretreatment and 45°C treatment (PtHsp90s in this class are positively regulated under both 37°C pretreatment and subsequent treatment at 45°C) (Figure 6B). Notably, PtHsp90-1a is induced 30 min after 37°C pretreatment and significantly induced 3 h after 45°C treatment in leaves. Class II genes are induced by 37°C pretreatment but not affected by 45°C treatment. PtHsp90-7 belongs to this class and its expression is induced by 37°C pretreatment. However, the expression of PtHsp90-7 is not affected by 45°C treatment following 2 h of recovery from 37°C pretreatment (Figure 6C). Class III genes are not affected by 37°C pretreatment but are negatively regulated by 45°C treatment. PtHsp90-2 is not induced by 37°C pretreatment, and its mRNA abundance is reduced after recovery from 37°C pretreatment and subsequent 45°C treatment (Figure 6D). Class IV genes are not affected by either 37°C pretreatment or 45°C treatment significantly. The expression of PtHsp90-5b is still maintained in a low level in 37°C pretreatment and 45°C treatment (Figure 6E).
To verify the expression profiles of PtHsp90 genes in response to heat stress, qRT-PCR analysis was performed for four selected PtHsp90 genes under heat stress (Additional file 12C-F). Notably, PtHsp90-3, PtHsp90-4a and PtHsp90-5a are induced 3 h after 37°C pretreatment and significantly induced 3 h after 45°C treatment in leaves (Additional file 12C-E). The expression of PtHsp90-5b is also induced by heat stress, but the induction is not that dramatic compared with that of the other PtHsp90 genes in both 37°C pretreatment (2-fold) and 45°C treatment (3-fold) (Additional file 12F). In addition, we found that the paralogous pair PtHsp90-5a/PtHsp90-5b shared the same expression profile in different tissues but were different under wounding and heat stresses (Figures 5 and 6). We then analyzed the promotors (2000 bp upstream of the start codon) of PtHsp90-5a and PtHsp90-5b using PlantCARE . The sequence of the promotors share a low sequence identity (43.3%) and two heat shock elements (HSE) exist in the promotor of PtHsp90-5a while none in PtHsp-5b (data not shown), which may contribute to the different expression pattern of the two genes. These results suggest different response mechanisms of PtHsp90 members may exist under heat stress, and provide significant insights into their functions.
We performed a comprehensive analysis of the Populus Hsp90 gene family covering phylogeny, chromosomal location, gene structure, subcellular localization, expression profiling, and heat stress responses. A total of 10 full-length Hsp90 genes were identified in the Populus genome, all of which are clustered into two distinct groups. Exon/intron structure and motif compositions are found to be relatively conserved in each subgroup. The Populus genome contains three paralogous Hsp90 gene pairs, but only PtHsp90-5a/PtHsp90-5b is located in conserved positions in duplicated blocks, suggesting that it may be derived from a segmental duplication event during evolution. Furthermore, subcellular localization analysis revealed that PtHsp90 members are localized in different organelles. In addition, comparative expression profile analysis of Populus Hsp90s revealed that Hsp90s may play various conserved roles in different biological processes in plants. Although the functions of PtHsp90s remain largely unknown and many experiments are needed to determine their precise functions, our phylogenetic and expression analyses of the Populus Hsp90 gene family establishes a solid foundation for future comprehensive functional analyses of PtHsp90s.
Database searching and sequence retrieval
To identify potential members of the Populus Hsp90 gene family, we performed multiple database searches. Published Arabidopsis Hsp90 gene sequences  were retrieved and used as queries in BLAST searches against the Poplar Genome Database (http://www.phytozome.net/poplar.php, release 3.0). BLAST searches were also performed against the poplar genomes at the National Center for Biotechnology Information (NCBI, http://www.ncbi.nlm.nih.gov) and Phytozome (http://www.phytozome.net). Rice Hsp90 gene sequences were downloaded from the Rice Genome Annotation Project Database (http://rice.plantbiology.msu.edu/, release 7). Sequences of M. truncatula, S. bicolor, B. distachyon, V. vinifera, and P. patens were downloaded from Phytozome (http://www.phytozome.net). Local BLAST searches were performed using Arabidopsis Hsp90 protein sequences as queries to identify Hsp90 sequences in these plant species. All of the sequences were manually analyzed to confirm the presence of HATPase and Hsp90 domains using InterProScan (http://www.ebi.ac.uk/Tools/pfa/iprscan/).
WoLF PSORT (http://wolfpsort.org) was used to predict protein subcellular localization. The pI and molecular weight were estimated using the Compute pI/Mw tool from ExPASy (http://web.expasy.org/compute_pi).
Multiple sequence alignment of the full-length protein sequences was performed using ClustalX2 (version 2.1) . A maximum likelihood (ML) phylogenetic tree was constructed using PhyML (v3.0) with the JTT amino acid substitution model, 1000 bootstrap replicates, estimated proportions of invariable sites, four rate categories, estimated gamma distribution parameters, and an optimized starting BIONJ tree [42, 43].
Chromosomal location and gene structure of the Hsp90 genes
The chromosomal locations of the Hsp90 genes were determined using the Populus genome browser (http://www.phytozome.net/poplar). Information on intron/exon structure was collected from the genome annotations of P. trichocarpa from NCBI and Phytozome.
Gene structure analysis
The exon and intron structures of individual Hsp90 genes were illustrated using Gene Structure Display Server (GSDS, http://gsds.cbi.pku.edu.cn/)  by aligning the cDNA sequences with the corresponding genomic DNA sequences from Phytozome.
Conserved motif analysis
Functional motifs or domains of PtHsp90 protein sequences were analyzed using PROSITE and the Conserved Domain database. MEME (http://meme.sdsc.edu)  was used to identify motifs in candidate sequences. MEME was run locally with the following parameters: number of repetitions = any, maximum number of motifs = 20, and optimum motif width = 30 to 70 residues.
Transient expression and imaging
Transient expression in Nicotiana benthamiana lower leaf epidermal cells was performed as described by Zheng et al.  with slight modifications. Plants were cultivated under short-day conditions (8 h light/16 h dark). When the agrobacterium culture reached the stationary growth phase at 28°C with agitation, cells were pelleted and resuspended in infiltration buffer (100 μM acetosyringone in 10 mM MgCl2).
Publicly available microarray data analyses
For abiotic and hormonal treatments, Affymetrix microarray data available in the NCBI GEO database under the series accession numbers GSE26199 (heat stress), GSE17230 (drought stress) and GSE16786 were analyzed [39, 46, 47]. GSE16786 is composed of the following five subsets: GSE14893 (nitrogen limitation, genotype 1979), GSE14515 (nitrogen limitation, genotype 3200), GSE16783 (1 week after leaf wounding), GSE16785 (90 h after leaf wounding), and GSE16773 (methyl jasmonate-elicited suspension cell cultures). The Affymetrix CEL files representing different abiotic and hormonal treatments were downloaded from the GEO database and preprocessed using GeneSpring GX (V11.5) software (Agilent Technologies). The data were normalized using the GCRMA algorithm and then log transformed. The averages were calculated. After normalization and log transformation of data for all of the Populus genes presented on the chip, the log signal intensity values for Populus probe IDs corresponding to the Hsp90 gene models (v1.1) were extracted as a subset for further analyses. Expression was shown as fold change in experimental treatment samples relative to control samples. Tab-delimited files for the average log signal intensity values were imported into Genesis (v1.75) to generate the heat maps .
Probe sets corresponding to PtHsp90 genes were identified using the online Probe Match tool POParray (http://aspendb.uga.edu/poparray). For probe sets matching several Populus Hsp90 gene models, only these exhibiting consistently high hybridization signals across multiple samples were considered.
Plant material and growth conditions
Plant materials were collected from clonally propagated 1-year-old hybrid poplar (P. alba × P. glandulosa) clones (84K) grown in a growth chamber under long-day conditions (16 h light/8 h dark) at 23–25°C. Poplar saplings were subjected to heat treatment. Briefly, chamber was heated to 37°C for 3 h (pretreatment), returned to 23°C for 2 h, heated to 45°C for 3 h (treatment), and then allowed to recover for 2 h. Two biological replicates were performed. Leaves from three different plants were harvested at seven selected time points during heat stress treatment, frozen immediately in liquid nitrogen, and stored at −80°C for further analysis.
RNA isolation and semi-quantitative RT-PCR
Total RNA was extracted using the RNeasy Plant Mini Kit (Qiagen) with on-column treatment with RNase-free DNase I (Qiagen) to remove any contamination of genomic DNA according to the manufacturer’s instructions. RNA integrity was verified by 2% agar gel electrophoresis. First-strand cDNA synthesis was carried out with approximately 1 μg RNA using the SuperScript III reverse transcription kit (Invitrogen) and random primers according to the manufacturer’s procedure. Primers with melting temperatures of 58–60°C, lengths of 20–27 bp, and amplicon lengths of 160–260 bp were designed using Primer3 software (http://frodo.wi.mit.edu/primer3/input.htm). All primer sequences are listed in Additional file 13.
Real-time PCR was conducted on 7500 Real Time PCR System (Applied Biosystems, CA, USA) using SYBR Premix Ex Taq™ Kit (TaKaRa, Tokyo, Japan). Reactions were prepared in a total volume of 20 μl containing: 10 μl of 2×SYBR Premix, 2 μl of cDNA template, 0.4 μl of each specific primer to a final concentration of 200 nM. The reactions were performed as the following conditions: initial denaturation step of 95°C for 30 s followed by two-step thermal cycling profile of denaturation at 95°C for 10 s, and combined primer annealing/extension at 60°C for 34 s for 40 cycles. Negative PCR control without templates was performed for each primer pair. To verify the specificity of each primer pair, a melting curve analysis was performed ranging from 60°C to 95°C with temperature increasing steps of 0.06°C/s (5 acquisitions per °C) at the end of each run. The final threshold cycle (Ct) values were the mean of eight values including two biological replicates for each treatment and four technical replicates. The PtActin gene was used as an internal control.
Wang W, Vinocur B, Shoseyov O, Altman A: Role of plant heat-shock proteins and molecular chaperones in the abiotic stress response. Trends Plant Sci. 2004, 9 (5): 244-252. 10.1016/j.tplants.2004.03.006.
Gupta SC, Sharma A, Mishra M, Mishra RK, Chowdhuri DK: Heat shock proteins in toxicology: how close and how far?. Life Sci. 2010, 86 (11–12): 377-384.
Hu W, Hu G, Han B: Genome-wide survey and expression profiling of heat shock proteins and heat shock factors revealed overlapped and stress specific response under abiotic stresses in rice. Plant Sci. 2009, 176 (4): 583-590. 10.1016/j.plantsci.2009.01.016.
Krishna P, Gloor G: The Hsp90 family of proteins in Arabidopsis thaliana. Cell Stress Chaperone. 2001, 6 (3): 238-246. 10.1379/1466-1268(2001)006<0238:THFOPI>2.0.CO;2.
Young JC, Moarefi I, Hartl FU: Hsp90: a specialized but essential protein-folding tool. J Cell Biol. 2001, 154 (2): 267-273. 10.1083/jcb.200104079.
Richter K, Buchner J: Hsp90: chaperoning signal transduction. J Cell Physiol. 2001, 188 (3): 281-290. 10.1002/jcp.1131.
Milioni D, Hatzopoulos P: Genomic organization of hsp90 gene family in Arabidopsis. Plant Mol Biol. 1997, 35 (6): 955-961. 10.1023/A:1005874521528.
Liu Y, Burch-Smith T, Schiff M, Feng S, Dinesh-Kumar SP: Molecular chaperone Hsp90 associates with resistance protein N and its signaling proteins SGT1 and Rar1 to modulate an innate immune response in plants. J Biol Chem. 2004, 279 (3): 2101-2108. 10.1074/jbc.M310029200.
Hubert DA, Tornero P, Belkhadir Y, Krishna P, Takahashi A, Shirasu K, Dangl JL: Cytosolic HSP90 associates with and modulates the Arabidopsis RPM1 disease resistance protein. EMBO J. 2003, 22 (21): 5679-5689. 10.1093/emboj/cdg547.
Imai J, Maruya M, Yashiroda H, Yahara I, Tanaka K: The molecular chaperone Hsp90 plays a role in the assembly and maintenance of the 26S proteasome. EMBO J. 2003, 22 (14): 3557-3567. 10.1093/emboj/cdg349.
Zhang Z, Quick MK, Kanelakis KC, Gijzen M, Krishna P: Characterization of a plant homolog of hop, a cochaperone of hsp90. Plant Physiol. 2003, 131 (2): 525-535. 10.1104/pp.011940.
Rutherford SL, Lindquist S: Hsp90 as a capacitor for morphological evolution. Nature. 1998, 396 (6709): 336-342. 10.1038/24550.
Cao D, Froehlich JE, Zhang H, Cheng CL: The chlorate-resistant and photomorphogenesis-defective mutant cr88 encodes a chloroplast-targeted HSP90. Plant J. 2003, 33 (1): 107-118. 10.1046/j.1365-313X.2003.016011.x.
Ishiguro S, Watanabe Y, Ito N, Nonaka H, Takeda N, Sakai T, Kanaya H, Okada K: SHEPHERD is the Arabidopsis GRP94 responsible for the formation of functional CLAVATA proteins. EMBO J. 2002, 21 (5): 898-908. 10.1093/emboj/21.5.898.
Prassinos C, Haralampidis K, Milioni D, Samakovli D, Krambis K, Hatzopoulos P: Complexity of Hsp90 in organelle targeting. Plant Mol Biol. 2008, 67 (4): 323-334. 10.1007/s11103-008-9322-8.
Song H, Zhao R, Fan P, Wang X, Chen X, Li Y: Overexpression of AtHsp90.2, AtHsp90.5 and AtHsp90.7 in Arabidopsis thaliana enhances plant sensitivity to salt and drought stresses. Planta. 2009, 229 (4): 955-964. 10.1007/s00425-008-0886-y.
Tuskan GA, Difazio S, Jansson S, Bohlmann J, Grigoriev I, Hellsten U, Putnam N, Ralph S, Rombauts S, Salamov A: The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science. 2006, 313 (5793): 1596-1604. 10.1126/science.1128691.
Marchler-Bauer A, Lu S, Anderson JB, Chitsaz F, Derbyshire MK, DeWeese-Scott C, Fong JH, Geer LY, Geer RC, Gonzales NR: CDD: a conserved domain database for the functional annotation of proteins. Nucleic Acids Res. 2011, 39: D225-229. 10.1093/nar/gkq1189.
Marchler-Bauer A, Anderson JB, Chitsaz F, Derbyshire MK, DeWeese-Scott C, Fong JH, Geer LY, Geer RC, Gonzales NR, Gwadz M: CDD: specific functional annotation with the conserved domain database. Nucleic Acids Res. 2009, 37: D205-210. 10.1093/nar/gkn845.
Dutta R, Inouye M: GHKL, An emergent ATPase/kinase superfamily. Trends Biochem Sci. 2000, 25 (1): 24-28. 10.1016/S0968-0004(99)01503-0.
Hardison RC: A brief history of hemoglobins: plant, animal, protist, and bacteria. Proc Natl Acad Sci U S A. 1996, 93 (12): 5675-5679. 10.1073/pnas.93.12.5675.
Bailey TL, Williams N, Misleh C, Li WW: MEME: discovering and analyzing DNA and protein sequence motifs. Nucleic Acids Res. 2006, 34: W369-373. 10.1093/nar/gkl198.
Obermann WM, Sondermann H, Russo AA, Pavletich NP, Hartl FU: In vivo function of Hsp90 is dependent on ATP binding and ATP hydrolysis. J Cell Biol. 1998, 143 (4): 901-910. 10.1083/jcb.143.4.901.
Panaretou B, Prodromou C, Roe SM, O'Brien R, Ladbury JE, Piper PW, Pearl LH: ATP binding and hydrolysis are essential to the function of the Hsp90 molecular chaperone in vivo. EMBO J. 1998, 17 (16): 4829-4836. 10.1093/emboj/17.16.4829.
Kent WJ, Baertsch R, Hinrichs A, Miller W, Haussler D: Evolution's cauldron: duplication, deletion, and rearrangement in the mouse and human genomes. Proc Natl Acad Sci U S A. 2003, 100 (20): 11484-11489. 10.1073/pnas.1932072100.
Mehan MR, Freimer NB, Ophoff RA: A genome-wide survey of segmental duplications that mediate common human genetic variation of chromosomal architecture. Hum Genomics. 2004, 1 (5): 335-344. 10.1186/1479-7364-1-5-335.
Blanc G, Hokamp K, Wolfe KH: A recent polyploidy superimposed on older large-scale duplications in the Arabidopsis genome. Genome Res. 2003, 13 (2): 137-144. 10.1101/gr.751803.
Guyot R, Keller B: Ancestral genome duplication in rice. Genome. 2004, 47 (3): 610-614. 10.1139/g04-016.
Yoshida T, Ohama N, Nakajima J, Kidokoro S, Mizoi J, Nakashima K, Maruyama K, Kim J-M, Seki M, Todaka D: Arabidopsis HsfA1 transcription factors function as the main positive regulators in heat shock-responsive gene expression. Mol Genet Genomics. 2011, 286 (5–6): 321-332.
Haseloff J, Siemering KR, Prasher DC, Hodge S: Removal of a cryptic intron and subcellular localization of green fluorescent protein are required to mark transgenic Arabidopsis plants brightly. Proc Natl Acad Sci USA. 1997, 94 (6): 2122-2127. 10.1073/pnas.94.6.2122.
Lin Y, Cheng CL: A chlorate-resistant mutant defective in the regulation of nitrate reductase gene expression in Arabidopsis defines a new HY locus. Plant Cell. 1997, 9 (1): 21-35.
Cao D, Lin Y, Cheng CL: Genetic interactions between the chlorate-resistant mutant cr88 and the photomorphogenic mutants cop1 and hy5. Plant Cell. 2000, 12 (2): 199-210.
Felts SJ, Owen BA, Nguyen P, Trepel J, Donner DB, Toft DO: The hsp90-related protein TRAP1 is a mitochondrial protein with distinct functional properties. J Biol Chem. 2000, 275 (5): 3305-3312. 10.1074/jbc.275.5.3305.
Gesualdi NM, Chirico G, Pirozzi G, Costantino E, Landriscina M, Esposito F: Tumor necrosis factor-associated protein 1 (TRAP-1) protects cells from oxidative stress and apoptosis. Stress. 2007, 10 (4): 342-350. 10.1080/10253890701314863.
Takabatake R, Ando Y, Seo S, Katou S, Tsuda S, Ohashi Y, Mitsuhara I: MAP kinases function downstream of HSP90 and upstream of mitochondria in TMV resistance gene N-mediated hypersensitive cell death. Plant Cell Physiol. 2007, 48 (3): 498-510. 10.1093/pcp/pcm021.
Fletcher JC: Shoot and floral meristem maintenance in Arabidopsis. Annu Rev Plant Biol. 2002, 53 (1): 45-66. 10.1146/annurev.arplant.53.092701.143332.
Sangster TA, Queitsch C: The HSP90 chaperone complex, an emerging force in plant development and phenotypic plasticity. Curr Opin Plant Biol. 2005, 8 (1): 86-92. 10.1016/j.pbi.2004.11.012.
Barrett T, Edgar R: Gene expression omnibus: microarray data storage, submission, retrieval, and analysis. Methods Enzymol. 2006, 411: 352-369.
Weston DJ, Karve AA, Gunter LE, Jawdy SS, Yang X, Allen SM, Wullschleger SD: Comparative physiology and transcriptional networks underlying the heat shock response in Populus trichocarpa, Arabidopsis thaliana and Glycine max. Plant Cell Environ. 2011, 34 (9): 1488-1506. 10.1111/j.1365-3040.2011.02347.x.
Lescot M, Déhais P, Thijs G, Marchal K, Moreau Y, Van de Peer Y, Rouzé P, Rombauts S: PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res. 2002, 30 (1): 325-327. 10.1093/nar/30.1.325.
Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R: Clustal W and Clustal X version 2.0. Bioinformatics. 2007, 23 (21): 2947-2948. 10.1093/bioinformatics/btm404.
Guindon S, Gascuel O: A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 2003, 52 (5): 696-704. 10.1080/10635150390235520.
Jones DT, Taylor WR, Thornton JM: The rapid generation of mutation data matrices from protein sequences. Comput Appl Biosci. 1992, 8 (3): 275-282.
Guo AY, Zhu QH, Chen X, Luo JC: GSDS: a gene structure display server. Yi chuan. 2007, 29 (8): 1023-1026. 10.1360/yc-007-1023.
Zheng H, Camacho L, Wee E, Batoko H, Legen J, Leaver CJ, Malho R, Hussey PJ, Moore I: A Rab-E GTPase mutant acts downstream of the Rab-D subclass in biosynthetic membrane traffic to the plasma membrane in tobacco leaf epidermis. Plant Cell. 2005, 17 (7): 2020-2036. 10.1105/tpc.105.031112.
Cohen D, Bogeat-Triboulot MB, Tisserant E, Balzergue S, Martin-Magniette ML, Lelandais G, Ningre N, Renou JP, Tamby JP, Le Thiec D: Comparative transcriptomics of drought responses in Populus: a meta-analysis of genome-wide expression profiling in mature leaves and root apices across two genotypes. BMC Genomics. 2010, 11 (1): 630-10.1186/1471-2164-11-630.
Yuan Y, Chung JD, Fu X, Johnson VE, Ranjan P, Booth SL, Harding SA, Tsai CJ: Alternative splicing and gene duplication differentially shaped the regulation of isochorismate synthase in Populus and Arabidopsis. Proc Natl Acad Sci U S A. 2009, 106 (51): 22020-22025. 10.1073/pnas.0906869106.
Sturn A, Quackenbush J, Trajanoski Z: Genesis: cluster analysis of microarray data. Bioinformatics. 2002, 18 (1): 207-208. 10.1093/bioinformatics/18.1.207.
This work was supported by the National Basic Research Program of China [2012CB114500] and the National Natural Science Foundation of China  to ML, and the National High Technology Research and Development Program of China [2013AA102702] to JC.
The authors declare that they have no competing interests.
JZ carried out all the analysis and interpreted the results. JL, LZ and BL helped in Populus materials collection and total RNA extraction. ML and JC conceived the project, supervised the analysis and critically revised the manuscript. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1:Conserved domains of Hsp90 proteins in Arabidopsis, Populus , and rice. The major domains were identified using Pfam (http://pfam.sanger.ac.uk/). A multiple alignment of Hsp90 proteins from Arabidopsis (At), Populus (Pt), and rice (Os) was performed using Clustal X2.1, and a phylogenetic tree was constructed using MEGA 4.0 by the neighbor-joining (NJ) method with 1000 bootstrap replicates. (JPG 811 KB)
Additional file 2:List of all Hsp90 gene sequences identified in Populus and rice. The list comprises seven Arabidopsis Hsp90 sequences and Hsp90 sequences identified from Populus and rice in this study. Amino acid sequences were deduced from their corresponding coding sequences, and genomic DNA sequences were obtained from Phytozome (http://www.phytozome.net/poplar, release 2.1). (XLS 356 KB)
Additional file 3:List of Hsp90 protein sequences identified from eight plant species examined in this study.(TXT 46 KB)
Additional file 4:Distance and percentage of identity among Arabidopsis, Populus, and rice Hsp90 proteins. Amino acid identity among Populus, Arabidopsis, and rice Hsp90 proteins was analyzed in a pairwise fashion. (JPG 1 MB)
Additional file 5:Sequence logos for the conserved motifs of Hsp90 proteins in Arabidopsis, Populus , and rice. Conserved motifs and sequence logos were generated using the MEME search tool. Numbers on the horizontal axis represent sequence positions in the motifs and the vertical axis represents the information content in bits. (XLS 540 KB)
Additional file 6:Phylogenetic relationships of Hsp90 conserved motif sequences in Arabidopsis, Populus, and rice. A multiple alignment of Hsp90 proteins from A. thaliana (At), P. trichocarpa (Pt) and O. sativa (Os) was performed using Clustal X2.1, and a phylogenetic tree was constructed using conserved Hsp90 motif sequences by the maximum likelihood method with 1000 bootstrap replicates. (JPG 992 KB)
Additional file 7:Chromosomal locations of PtHsp90 genes. The schematic diagram shows the 10 Hsp90 genes mapped to nine chromosomes. Homologous blocks derived from segmental duplication are indicated using the same colors. The diagram of genome-wide chromosome organization resulting from genome duplication events in Populus is adapted from Tuskan et al. . (JPG 676 KB)
Additional file 9:Chromosomal locations of rice Hsp90 genes. The lines join the segmental duplicated homologous blocks that are indicated using the same colors. (JPG 739 KB)
Additional file 10:Gene duplication relationships in the Hsp90 gene family in Populus trichocarpa and Oryza sativa . Paralogous gene pairs generated by gene duplication within the Hsp90 family of P. trichocarpa (A) and O. sativa (B) were analyzed using the Plant Genome Duplication Database (http://chibba.agtec.uga.edu/duplication/). Each query gene displays only ±500 kb regions. Gene lines connect gene pairs. Blue lines represent the other anchor gene pairs in the region, and the red line represents the query locus. (JPG 303 KB)
Additional file 12:Expression analysis of selected PtHsp90 genes under heat stress. A. Conditions of heat stress. Seedlings were heated to 37°C for 3 h (pretreatment), returned to 23°C for 2 h, heated to 45°C for 3 h (treatment), and then allowed to recover for 2 h. 1, control; 2, 30 min after pretreatment at 37°C; 3, 2 h after pretreatment at 37°C; 4, 1 h after recovery at 23°C; 5, 30 min after treatment at 45°C; 6, 2 h after treatment at 45°C; 7, 2 h after recovery at 23°C. B. Analysis of expression profiles of PtHsp90s in response to heat stress in Populus leaves by semi-quantitative RT-PCR. The constitutively expressed PtActin was used as an internal control. Three independent experiments were performed under identical conditions. C-F. The relative mRNA abundance of four selected PtHsp90 genes was normalized with respect to reference gene PtActin under heat stress using qRT-PCR. Three biological replicates each with four technique replicates were performed and bars represent standard deviations (SD) of the replicates. (JPG 581 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Zhang, J., Li, J., Liu, B. et al. Genome-wide analysis of the Populus Hsp90 gene family reveals differential expression patterns, localization, and heat stress responses. BMC Genomics 14, 532 (2013). https://doi.org/10.1186/1471-2164-14-532
- Expression analysis
- Gene family
- Gene structure
- Phylogenetic analysis