Genome-wide identification of heat shock proteins (Hsps) and Hsp interactors in rice: Hsp70s as a case study
BMC Genomics volume 15, Article number: 344 (2014)
Heat shock proteins (Hsps) perform a fundamental role in protecting plants against abiotic stresses. Although researchers have made great efforts on the functional analysis of individual family members, Hsps have not been fully characterized in rice (Oryza sativa L.) and little is known about their interactors.
In this study, we combined orthology-based approach with expression association data to screen rice Hsps for the expression patterns of which strongly correlated with that of heat responsive probe-sets. Twenty-seven Hsp candidates were identified, including 12 small Hsps, six Hsp70s, three Hsp60s, three Hsp90s, and three clpB/Hsp100s. Then, using a combination of interolog and expression profile-based methods, we inferred 430 interactors of Hsp70s in rice, and validated the interactions by co-localization and function-based methods. Subsequent analysis showed 13 interacting domains and 28 target motifs were over-represented in Hsp70s interactors. Twenty-four GO terms of biological processes and five GO terms of molecular functions were enriched in the positive interactors, whose expression levels were positively associated with Hsp70s. Hsp70s interaction network implied that Hsp70s were involved in macromolecular translocation, carbohydrate metabolism, innate immunity, photosystem II repair and regulation of kinase activities.
Twenty-seven Hsps in rice were identified and 430 interactors of Hsp70s were inferred and validated, then the interacting network of Hsp70s was induced and the function of Hsp70s was analyzed. Furthermore, two databases named Rice Heat Shock Proteins (RiceHsps) and Rice Gene Expression Profile (RGEP), and one online tool named Protein-Protein Interaction Predictor (PPIP), were constructed and could be accessed at http://bioinformatics.fafu.edu.cn/.
Plants have evolved a spectrum of molecular programs to adapt to environmental stresses. To survive, plants undergo dramatic changes in physiological and molecular mechanisms . For instance, heat shock proteins (Hsps) are stimulated in response to a wide array of stress conditions and perform a fundamental role in protecting plants against abiotic stresses [1, 2].
Hsps can be classified into five major categories based on molecular mass: small heat shock protein (sHsp) family, chaperonin (Hsp60/GroEL) family, 70-kDa heat shock protein (Hsp70/DnaK) family, Hsp90 family and Hsp100/ClpB family . In Arabidopsis, at least 19 genes encoding sHsps, 16 chaperonins, 18 genes encoding Hsp70s, seven Hsp90s, and four Hsp100/ClpBs have been identified through genome-wide analysis [4–9]. Rice is the most important staple food crop in the world and the principal model for other monocotyledonous species . In recent years, researchers have made great efforts on the functional analysis of individual Hsp family members in rice [11–14], however Hsps still have not been fully characterized and little is known about their interactors .
Furthermore, detailed studies have established that the overexpression of Hsp70 genes enhanced the plant’s tolerance to environmental stresses [15–17]. Transgenic rice lines that overexpress sHsp17.7 exhibit increased drought tolerance during the seedling stage . However, the cellular mechanisms underlying Hsp function under abiotic stress are not fully understood . The completion of the Rice Genome Sequencing Project and high-throughput experimental methods have generated valuable data that can be used to identify proteins that interact with Hsps in rice, and consequently decipher the functions of Hsps.
Many computational approaches have been proposed to predict protein-protein interactions. In terms of test dataset types, these approaches can be grouped into three classes: sequence-oriented methods [19–22], gene expression profile-based methods  and structure-oriented methods . Interolog, a sequence-oriented method, has been widely used to construct protein-protein interactions (PPIs) in diverse organisms [10, 25–27]. This method is based on the principle that orthologous pairs can be detected by mapping those known interactions in the source organism onto the target organism . The gene expression profile-based methods identify genes that exhibit correlated changes in expression over conditions, since they tend to have similar functions or be involved in cellular processes [23, 28]. Each protein interaction mapping technique has different advantages and disadvantages , and the techniques are complementary to some extent. In this study, we integrated interolog- and gene expression profile-based methods to identify the interactors of Hsps in rice.
To carry out more reliable functional analysis, we first conducted a genome-wide screening for the true Hsps in rice using integration of orthology and expression association data. Then, we used interolog- and expression profile-based methods to identify Hsp70s interactors in rice response to abiotic stresses. Through mining the signal behind their interactors, we further investigated the pattern of binding sites and the interaction network of Hsp70s in response to abiotic stresses.
Gene expression in rice subjected to abiotic stresses
Four sets of gene expression data from rice seedlings exposed to drought, salt, cold and heat treatment were collected (Table 1) from the Gene Expression Omnibus (GEO) . The K-nearest neighbor (KNN) impute method was used to estimate the missing values in GeneChips . A total of 22,707 probe-sets with detectable expression values were selected from these GeneChips. Within-slide normalization (Figure 1) and multiple-slide normalization (Figure 2) were performed sequentially to minimize systematic variations.
Then, we identified heat-responsive (HR) probe-sets and estimated the global gene-gene pairwise relationships. In this study, we applied boxplots [32, 33] to identify HR probe-sets, which were defined as a group of probe-sets that were significantly up- or down-regulated by heat treatments. A total of 1,135 (5%) HR probe-sets that were expressed differentially under heat stress were detected (Figure 3). Among them, 651 probe-sets were up-regulated, while 484 probe-sets were down-regulated. Meanwhile, bootstrap analysis  was performed to estimate the absolute median value of Pearson Correlation Coefficients (PCC) between any pair of genes. The bootstrapped 95% confidence interval for the population ranged from 0.5648 to 0.5842 (Figure 4).
Genome-wide identification of Hsps in rice
Hsps screening in the rice proteome consisted of three steps. First, 41 candidate protein sequences, which were annotated as Hsps and contained the characteristic domains (Additional file 1: Table S1) of Hsps in Uniprot database , were downloaded. These sequences included 23 small Hsps (sHsps), eight Hsp70s, four Hsp60s, three Hsp90s and three Hsp100/ClpBs. Second, 10 of the 41 candidate proteins, whose expression value was absent in GSE6901 (GeneChips for drought, salt, and cold treatments) or GSE14275 (GeneChip for heat treatment), were filtered out. Third, since Hsps can stimulate a wide range of HR genes [3, 36], and those genes involved in similar functions or cellular processes are likely to have similar expression profiles over conditions . So we supposed the true Hsp genes should have a higher expression correlation with HR probe-sets compared with other genes. Therefore, 27 candidate genes, whose expression patterns were similar to that of the HR probe-sets (Table 2), were ultimately recognized as Hsps, including 12 sHsps, six Hsp70s, three Hsp60s, three Hsp90s and three Hsp100/ClpBs (Table 3). The average absolute value of the PCC between them and HR probe-sets reached 0.76, which was markedly greater than that of the global pairwise values (0.5648-0.5842) and the value of the Ubq5/control (0.5089).
Genome-wide identification of the interactors of Hsps in rice, with a focus on Hsp70s
Using the interolog method, 9,132 potential PPIs related to Hsps in rice (Additional file 1: Table S3) were mapped from the experimentally identified PPI in yeast . The predicted PPIs corresponding to 6 Hsp70s accounted for nearly 45% of the total interactions (4,091 out of 9,132). Therefore, in this paper, Hsp70s were selected as a case study.
Each of 6 Hsp70s sequences was used as a query to search its interactors in rice based on interlog method. After that, we applied an expression profile-based method to reduce the false-positive rate of Hsp70s PPIs predicted by interolog. The expression relationship between each interacting partner was further measured by Pearson Correlation Coefficients (PCCs). We found that the absolute PCC of 1,072 PPIs related to Hsp70s, including 430 interactors, were greater than 0.90 (Additional file 2: Supplemental Data 1A). Upon exposure to abiotic stresses, the expression of 166 interactors showed a positive relationship with that of Hsp70s, while the expression of 264 interactors was negatively correlated with that of Hsp70s (Table 4).
Assessment of the PPIs of Hsp70s in rice
Two computational methods were used to evaluate the overall quality of the above prediction. Randomized PPIs were generated and used as a control.
First, the co-localization method was applied to assess the Hsp70 PPIs. This method is based on the principle that interacting proteins are more likely to localize to the same cellular compartment than randomized pairs . The subcellular localization annotation of each protein in rice was obtained from WoLF PSORT , a stringent protein localization predictor based on experimental data. All of the predicted Hsp70s interactors contained subcellular localization annotations (Additional file 2: Supplemental Data 1B). We found that 582 PPIs (54% of 1,072 predicted PPIs) localized in common cellular compartments. In contrast, the maximum number of PPIs localized in the same subcellular compartment in 1,000 randomly repeated networks was 553 (51% of 1,072 randomized PPIs) (Figure 5), which was significantly lower than that of the predicted Hsp70 PPIs (empirical p-value < 0.001).
Second, we used the co-function method to test the overall quality of predicted Hsp70s PPIs. This method is based on the assumption that interacting partners tend to participate in the same cellular processes or share similar functions [22, 39]. The 6 Hsp70s contained four different GO terms (GO:0044260, GO:0005524, GO:0051082 and GO:0006457) in biological processes (BPs) or molecular functions (MFs). The result showed that 385 of 430 predicted Hsp70 interactors had GO annotations (Additional file 2: Supplemental Data 1B), and 300 of these interactors (78%) shared at least one common GO term with Hsp70s. The proportion of predicted interactors sharing the term GO:0044260, GO:0005524, GO:0051082 and GO:0006457 were 243 (63%), 267 (69%), 22 (6%) and 30 (8%), respectively, significantly higher than that of 1,000 repeats of randomized Hsp70 interactors (empirical p-value < 0.001) (Figure 6).
Identification of the binding sites of Hsp70s in rice
The above assessments provided strong support for the reliability of the Hsp70 interactors predicted in this paper. Therefore, we used these interactors as the positive dataset, and constructed a negative dataset composed of 10,158 proteins that were less likely to interact with Hsp70s. Since binding sites tend to occur more frequently in interacting proteins than in non-interacting proteins , we sought to detect over-represented domains or motifs by comparing their frequency of occurrence in the two different datasets.
The annotations of rice protein domains were obtained from Pfam . We identified 102 domains of 397 proteins in the positive dataset (Additional file 2: Supplemental Data 1B), and 2,628 domains of 7,746 proteins in the negative dataset. The number of negative samples was much greater than that of positive samples (20:1). To reduce this bias, we implemented one-tailed Fisher’s exact test  to detect the over-represented domains in the coordinated datasets (i.e., 397 positive samples versus 794 samples in the negative dataset; a ratio of 1:2), and used the Benjamini and Hochberg (BH) method  to control the false discovery rate (FDR). In addition, the above procedure was repeated 10 times by randomly changing the negative samples. Finally, 13 domains were detected with p-value lower than 0.05 in the 10 replicas (Additional file 3: Supplemental Data 2A). Similarly, we analyzed the binding motifs of Hsp70s in rice. The motif annotations were acquired from PROSITE [44, 45]. There were 113 motifs in 404 proteins among the positive samples (Additional file 2: Supplemental Data 1B), while there were 1,071 motifs in 10,081 proteins among the negative samples. Twenty-eight overrepresented motifs were ultimately investigated (Additional file 3: Supplemental Data 2B).
Functional analysis of Hsp70s in rice
It is expected that the functions of proteins can be deduced from their interactors. As mentioned above, among the 430 interactors of Hsp70s, 385 have BP or MF GO annotations (Additional file 2: Supplemental Data 1B). Furthermore, 147 interactors, whose expression levels positively correlated with that of Hsp70s, contained 109 GO annotations. In contrast, the 238 interactors, whose expression levels negatively correlated with Hsp70s, had 90 different GO annotations. The two distinct groups were defined as Positively Correlated Interactors (PCIs) and Negatively Correlated Interactors (NCIs). Using GO enrichment analysis, we found that 24 BP GO terms and five MF GO terms with p-values less than 0.05, were enriched in the PCIs compared with that in NCIs (Additional file 4: Supplemental Data 3A), suggesting that these biological processes or functions would be induced with the up-regulation of Hsp70s. Meanwhile, 23 BP GO terms and 16 MF GO terms with p-values less than 0.05 were over-represented in the NCIs compared with that in the PCIs (Additional file 4: Supplemental Data 3B), indicating that these biological processes or functions would be induced as Hsp70s down-regulation.
Construction of tools and riceHsp database
We constructed two databases, named Rice Heat Shock Proteins (RiceHsps) and Rice Gene Expression Profile (RGEP), and one online tool, named Protein-Protein Interaction Predictor (PPIP). The RiceHsps was built to store and show our predicted results in this paper. The RGEP was constructed to store the integrated gene expression data for rice subjected to abiotic stresses, including drought, salt, cold and high temperature. It also provided a function for identifier conversion among Michigan State University Osa1 Rice Locus (MSU ID), Rice Annotation Project Locus (RAP ID) and Affymetrix Rice Genome Probe-set (Affymetrix ID) (Figure 7). The tool PPIP was developed based on the interolog method. Once the user uploads at least two protein sequences in FASTA format into the text area, or a sequence file less than 2 Mb, the corresponding orthologous protein pairs, whose interaction has been verified by biochemical experiments in the selected model organism, will be retrieved (Figure 8). These online databases and tool can be accessible at http://bioinformatics.fafu.edu.cn.
Heat shock proteins (Hsps) in rice
Using a combination of orthology and expression association data, we identified 27 heat shock proteins, including 12 sHsps, 6 Hsp70s, 3 Hsp60s, 3 Hsp90s and 3 Hsp100/ClpBs. Using an orthology-based strategy, Sarkar et al. (2009) identified 23 sHsps in rice , 12 of which were confirmed in this paper and showed a strong relationship with HR probe-sets under abiotic stresses. According to orthology- and expression level-based data, Singh et al. (2010) discovered three Hsp100/ClpB proteins in rice , which were consistent with the result of this paper. We further noted that the expression pattern of the three Hsp100/ClpBs closely resembled that of HR probe-sets under abiotic stresses. Recently, Sarkar et al. (2013) identified 32 Hsp70 genes through sequence analysis and orthology-based method , including all the six Hsp70s in this paper. However, in this study, we not only adopted the sequence and orthology information, but also the gene expression association information to identify true Hsps in rice. Given that similar proteins in different species may have different functions, one has to take into account that an orthology-based strategy alone is not adequate to identify true Hsps in rice. Furthermore, it is not reliable to screen Hsps for evaluating the gene expression levels of candidates in rice in response to high-temperature stress, because some Hsps express constitutively . Therefore, we used a combination of orthology and expression association data to identify a highly reliable Hsps in rice.
Binding sites of Hsp70s in rice
Investigating the binding sites of Hsp70s will provide insight into the activity of those proteins and improve our ability to predict the potential risks of a particular mutation. In this study, we identified 13 domains and 28 motifs that occurred more frequently in the positive dataset than in the negative dataset, suggesting that these sequences are potential target sites for Hsp70s in rice. The results were partially supported by biochemical experiments conducted in previous studies. For instance, our results showed that the J-domain (PF00226, PS50076) of DnaJ/Hsp40 was the binding site for DnaK/Hsp70. By point mutation analysis, Wall et al. (1994) demonstrated that the J-domain interacted with DnaK and regulated DnaK activity . Suh et al. (1998) found that the ATPase domain of DnaK was a binding pocket for the J-domain . Horne et al. (2010) suggested that the fusion of the J-domain with p5 (Jdp5) could dramatically stimulate ATP hydrolysis by DnaK, and NMR studies on Jdp5 further indicated that the peptide tethered the J-domain to the ATPase domain of DnaK .
Therefore, the results of this study provided useful clues for experimental biologists in further analyzing the function of Hsp70s.
The Hsp70 interaction network in rice
The Hsp70s network was shown in Figure 9, and described in the following sections. We classified the interaction network into five sub-networks.
Sub-network A: Macromolecular translocation
Our results showed that the small GTPase Ran (LOC_Os01g42530), importin α (LOC_Os01g14950, LOC_Os05g06350) and importin β (LOC_Os05g28510) could bind to Hsp70s. Hsp70 and importin β were previously identified as Ran-interacting proteins (Rips) . The results of this study indicated that the Ras family domain (PF00071) and ATP/GTP-binding site motif A (P-loop) (PS00017) of the small GTPase Ran were potential interacting sites of Hsp70s. Furthermore, the expression of Ran and importin proteins was strongly correlated with that of Hsp70s (PCC > 0.90) under abiotic stresses (Additional file 5: Figure S1; Additional file 1: Table S5). We then constructed a protein-protein interaction network consisting of Hsp70s, GTPase Ran and importin proteins in rice (Figure 9A).
Importin α recognizes the nuclear localization signal (NLS) of nuclear proteins in the cytoplasm, forming a stable complex termed the nuclear pore-targeting complex (PTAC) [50, 51]. Importin β docks the PTAC to the cytoplasmic face of the nuclear pore complex (NPC) , a channel for macromolecules into the nucleus . In addition, the hydrolysis of GTP by the small GTPase Ran has been shown to be essential for the translocation of docked PATC into the nucleus . Therefore, the interaction network between Hsp70s, GTPase Ran and importin proteins in rice might be involved in translocation of macromolecules. Shulga et al. (1996) stated that Hsp70 could act as a molecular chaperone to promote the formation and stability of the nuclear localization signal-containing complex during both targeting and translocation phases of nuclear transport .
Sub-network B: Plant carbohydrate metabolism
The results of this study revealed that Hsp70s interacted with enolase (LOC_Os09g20820), fumaratehydratase (LOC_Os03g21950), malate dehydrogenase (LOC_Os07g43700, LOC_Os01g61380, LOC_Os05g49880) and citrate synthase (LOC_Os02g10070), which were constructed in sub-network B (Figure 9B). Most of these potential interactions have been partly validated by previous studies. In vitro studies indicated that Hsp70 might assist in transporting fumaratehydratase between the cytosol and mitochondria . Furthermore, it has been reported that the Hsp70 complex significantly increased the spontaneous rate of refolding of denatured mitochondrial malate dehydrogenase . Hsp70s have also been demonstrated to reduce the aggregation of citrate synthase under heat stress . Recently, through co-immunoprecipitation (CoIP) assays, Luo et al. (2011) further confirmed that Hsp70 could directly interact with α-enolase .
Our results indicated that the expression levels of Hsp70s were positively and strongly correlated with that of enolase, fumaratehydratase, malate dehydrogenase and citrate synthase in response to abiotic stresses (Additional file 5: Figure S2; Additional file 1: Table S6), implying that Hsp70s might have essential functions in stimulating carbohydrate metabolism by regulating the activity of those key enzymes. In a metabolomics study, Kaplan et al. (2004) also found that carbohydrate metabolism was affected by heat shock in Arabidopsis. The amount of pyruvate and oxaloacetate increased coordinately upon heat shock, while the fumarate and malate (oxaloacetate precursors) contents were similarly elevated, suggesting that the Embden-Meyerhof-Parnas (EMP) pathway and tricarboxylic acid cycle (TCA) cycle would be enhanced by abiotic stresses.
Sub-network C: plant innate immunity
In this study, we found that Hsp70s might cooperate with members of the small GTPaseRac family (LOC_Os01g12900, LOC_Os02g02840, LOC_Os02g20850), Hsp90 (LOC_Os06g50300, LOC_Os08g39140), SKP1 (LOC_Os09g36830) and MAPK6 (LOC_Os06g06090), as shown in Figure 9C. Hsp70, Hsp90 and RAR1 have been documented as the components of Rac1 complex in rice, based on CoIP experiments . Moreover, multiple lines of evidence have shown that Hsp70 was a negative regulator of ASK1/MAP3K, and overexpression of Hsp70 inhibited the MAPK signaling cascade, which was associated with apoptosis [62–64]. Consistent with previous studies, our results further illustrated that the expression level of Hsp70s was positively correlated with that of Rac, Hsp90 and SKP1, and negatively correlated with that of MAPK6 in response to abiotic stresses (Additional file 5: Figure S3; Additional file 1: Table S7). Furthermore, in addition to Rac (PF00071 and PS00017, PS51420), MAPK6 (PF00069 and PS50011, PS00108, PS00107, PS01351) also contained potential binding sites for Hsp70s.
Previous reports have shown that Hsp90 and two co-chaperone-like molecules, RAR1 and SGT1, performed a key role in effector-triggered immunity (ETI), the second line of the plant defense system [61, 65, 66]. Additionally, in vitro studies have indicated that SGT1 can interact with SKP1 and link it to the Hsp90 co-chaperone complexes . Further research found that the SKP1-CULLIN1-F-box (SCF) complex regulated the stability of resistance (R) proteins , suggesting that SKP1 might also be involved in the ETI response. In addition, the small GTPase Rac could function as a critical switch downstream of two types of innate immunity: PAMP-triggered immunity (PTI) and effector-triggered immunity (ETI) . This finding was recently supported by Jung et al. (2013). They found that the OsctHsp70-1 had a functional association with Ras/Raf-mediated MAPK kinase cascades .
Sub-network D: photosystem II repair
Sub-network D showed that Hsp70s might interact with FtsH families (LOC_Os06g51029, LOC_Os01g62500 and LOC_Os01g43150) (Figure 9D). Indeed, this interaction has been previously confirmed by Shen and colleagues . In this study, we found that there was a close positive correlation (PCC > 0.90) between the expression of Hsp70s and FtsH families in rice subjected to abiotic stresses (Additional file 5: Figure S4; Additional file 1: Table S8). The AAA-protein family signatures (PF00004, PS00674) of FtsH proteins were identified as potential target sites for Hsp70s. Previous showed that FtsH family members played an important role in the D1 repair cycle of PSII [70–72]. Using native gel electrophoresis, Yokthongwattana et al. (2001) revealed that Hsp70s could form a complex with intact D1 protein and also with D2 and CP47 , suggesting Hsp70s have a function in the photosystem II (PSII) repair cycle.
Sub-network E: protein kinase activities
In this study, we found that nearly 46% of the Hsp70 interactors (197 out of 430) contained protein kinase domains, including protein kinase C (PKC), protein kinase A (PKA), apoptosis signal-regulating kinase/mitogen-activated protein kinase kinasekinase (ASK/MAP3K), mitogen-activated protein kinase kinase (MAP2K), mitogen-activated protein kinase (MAPK), cyclin-dependent kinase (CDK), Ca2+-dependent protein kinase (CDPK), CBL-interacting protein kinase (CIPK), osmotic stress/abscisic acid-activated protein kinase (SAPK) and wall-associated kinase (WAK) family members. Furthermore, our results showed that the expression level of approximately 81% of those protein kinases (159 out of 197) had a strong negative correlation (PCC < −0.90) with that of Hsp70s. This was consistent with previous studies. Hsp70s were reported to directly interact with PKC, ASK/MAP3K and CDK [63, 74, 75], and inhibit the activities of jun amino-terminal kinase (JNK), ASK/MAP3K, MAPK and CDK [3, 63, 74–76]. Ding et al. (1998) have shown that overexpression of Hsp70 significantly suppressed the enzymatic activities of PKA and PKC . Therefore, it is likely that Hsp70s indiscriminately down-regulate the activity of various protein kinases.
By integrating orthology and functional association data, we identified 27 Hsps in rice, including 12 sHsps, 6 Hsp70s, 3 Hsp60s, 3 Hsp90s and 3 Hsp100/ClpBs. Then, using Hsp70s as a case study, we identified 430 interactors of Hsp70s in rice by combining interolog- and expression profile-based methods. According to the interactors of Hsp70s, we investigated the potential binding sites of Hsp70s, and analyzed the interacting network of Hsp70s in rice. Finally, we constructed two online databases and one tool, which could be accessed at http://bioinformatics.fafu.edu.cn/.
Rice sequence data
Yeast interaction data
Eight hundred and thirty-seven experimentally verified protein-protein interaction (PPI) pairs related to Hsps in yeast (Additional file 1: Table S2) were manually selected from the Database of Interaction Proteins (DIPs version 20101010; http://dip.doe-mbi.ucla.edu/dip/).
Gene expression data for rice subjected to drought, salt, cold or heat treatments were downloaded from GEO (accession number GSE6901 for drought, salt and cold treatments, and GSE14275 for heat treatment). All data were obtained using the same microarray platform (Affymetrix GeneChip Rice Genome Array; platform accession number GPL2025) and rice seedling samples (Table 1).
Preprocessing of microarray data
The impute package (version 1.22.0) [31, 79] in Bioconductor  was used to estimate missing expression data. In addition, probe-sets, whose expression value was absent in GSE6901 or GSE14275, were filtered out. Furthermore, a robust scatterplot smoother (LOWESS)  in R software (version 2.10.1)  was used to perform intensity-dependent within-slide normalization . The Limma package (version 3.2.0) was implemented to scale multiple-slide normalization .
Heat-responsive probe-sets detection
Boxplot [32, 33] in R was implemented to identify heat-responsive (HR) probe-sets. Probe-sets with M-values (log ratios) located beyond the upper or lower fence of the boxplot were considered as HR gene probe-sets.
Estimation of the global median absolute value of Pearson Correlation Coefficient (PCC)
The bootstrap method  was used to evaluate the median absolute value of PCCs between the expression levels of any two probe-sets among GeneChips. First, 10,000 non-redundant probe pairs were randomly selected, and the absolute PCC between each pair was computed. Based on these 10,000 PCC values, 100,000 bootstrap samples were built by sampling with replacement to measure the 95% confidence interval of the global median absolute value of PCC.
Identification of rice Hsps
Rice candidate Hsps were selected from the Uniprot database. These sequences satisfied the following criteria: (1) they possessed the conserved domains of Hsps (Additional file 1: Table S1); (2) they were functionally annotated as Hsps or involved in similar biological processes; (3) the sequence length was in agreement with the molecular mass of different Hsp family members; (4) Evidence at RNA or protein expression level; and (5) they were identified as Hsps in the MSU Rice Genome Annotation Project. After that, their corresponding Affymetrix IDs were retrieved from Ricechip.org (http://www.ricechip.org/). R software was used to calculate the PCC values between expression data of each candidate Hsp and HR gene probe-set.
Prediction of proteins interacting with Hsp70s in rice
For each experimentally verified PPI of Hsps, the pairwise amino acid sequence was locally run through BLASTP (version 2.2.23+)  against the entire rice proteome in an effort to identify orthologs in rice. The E-value cutoff, identity and alignment coverage were set at 10−10, 30% and 40%, respectively. Based on the core principle of interolog [19, 21], corresponding orthologous pairs in rice were predicted to interact with each other. Briefly, if two interacting proteins, A and B, in yeast had the corresponding orthologs A’ and B’ in rice, respectively, A’ might interact with B’.
Expression profile-based method
For each PPI predicted by interolog, we determined the absolute value of PCC between the corresponding gene expression data. R software was used to calculate the PCC values. Generally, the PCC values ranged from −1 to 1. A value of 1 indicated that the gene expression level of protein A would increase as that of protein B increased. In contrast, a value of −1 implied that the gene expression level of protein A would decrease as that of protein B increased. A value of 0 implied that there was no linear correlation between the expressions of these two genes. If the absolute value was less than 0.90, the PPI was filtered out.In addition, t-test was utilized to evaluate whether the paired PCC value was significantly greater or less than 0.
Assessment of PPIs of Hsp70s in rice
Protein localization method
Subcellular localization information of proteins in rice was obtained from WoLF PSORT . In addition, 1,000 randomized networks, in which the interacting partners of Hsp70s were randomly replaced by other proteins containing meaningful subcellular localization annotations in the rice proteome, were used as a control. The above process was repeated 1,000 times.
Function similarity method
The GO annotations of proteins in rice were downloaded from agriGO (http://bioinfo.cau.edu.cn/agriGO/download.php) . Furthermore, 1,000 randomized repeats of Hsp70 interactors were generated. The predicted interactors of Hsp70s were randomly replaced by other proteins possessing GO annotations in the rice proteome. The above procedure was repeated 1,000 times.
Identification of binding sites of Hsp70s in rice
Non-interactors of Hsp70s were used as negative controls. These proteins were collected from the rice proteome, and satisfied the following conditions: first, they could not interact with Hsp70s, based on the interolog prediction; and second, the absolute PCC value between the expression level of the non-interactor and that of any Hsp70 should be less than 0.40.
The domain information of rice proteins was obtained from Pfam (http://pfam.sanger.ac.uk/) . Because of the large number of sequences, we ran the PfamScan program (version 091007)  and HMMER package (version 3.0b3)  locally. Rice protein sequences were searched against Pfam-A domains in PfamScan databases (version 24.0) with an E-value cutoff of 0.0001.
The motif annotations of proteins in rice were acquired from PROSITE (http://prosite.expasy.org/) . The ScanProsite tool  was downloaded and applied locally to scan protein sequences against the PROSITE database (version 20.67).
Fisher’s exact test
A one-tailed Fisher’s exact test was used to detect the over-represented domains and motifs among the Hsp70s interactors in rice compared with the negative interactors. For each domain or motif annotation, a 2 × 2 contingency table was constructed, as shown in Additional file 1: Table S4. Then, R software was used to calculate the p-value to measure the significance level.
To limit the false-positive error rate associated with multiple statistical tests, R software was further used to alter each p-value into the corresponding adjusted p-value based on the BH method . Ultimately, the adjusted p-value was used to determine the potential binding sites. A cutoff value of 0.05 was used in this work.
Hsp70 network in rice
The GO information of the predicted Hsp70 interactors in rice was obtained from agriGO (http://bioinfo.cau.edu.cn/agriGO/). For each GO term, all parent nodes were retrieved according to the archive of the GO database, and the minimum distance from the root (depth) was determined. Only terms beyond the fourth depth were considered. After that, fisher’s exact test was conducted to reveal the over-represented GO terms in the opposite dataset, and the BH method was used to control the false discovery rate (FDR). The Hsp70 network was generated using Cytoscape (http://www.cytoscape.org/).
Construction of tools and the rice Hsps database
The web tools and rice Hsps database were constructed on a LAMP (Linux, Apache, MySQL and PHP) platform. RGEP visualization was developed using two types of open source software, Open Flash Chart (http://teethgrinder.co.uk/open-flash-chart/) and Google Chart Tools (https://developers.google.com/chart/).
Ahuja I, de Vos RC, Bones AM, Hall RD: Plant molecular stress responses face climate change. Trends Plant Sci. 2010, 15 (12): 664-674. 10.1016/j.tplants.2010.08.002.
Timperio AM, Egidi MG, Zolla L: Proteomics applied on plant abiotic stresses: role of heat shock proteins (HSP). J Proteomics. 2008, 71 (4): 391-411. 10.1016/j.jprot.2008.07.005.
Wang W, Vinocur B, Shoseyov O, Altman A: Role of plant heat-shock proteins and molecular chaperones in the abiotic stress response. Trends Plant Sci. 2004, 9 (5): 244-252. 10.1016/j.tplants.2004.03.006.
Lin BL, Wang JS, Liu HC, Chen RW, Meyer Y, Barakat A, Delseny M: Genomic analysis of the Hsp70 superfamily in Arabidopsis thaliana. Cell Stress Chaperones. 2001, 6 (3): 201-208. 10.1379/1466-1268(2001)006<0201:GAOTHS>2.0.CO;2.
Sung DY, Vierling E, Guy CL: Comprehensive expression profile analysis of the Arabidopsis Hsp70 gene family. Plant Physiol. 2001, 126 (2): 789-800. 10.1104/pp.126.2.789.
Krishna P, Gloor G: The Hsp90 family of proteins in Arabidopsis thaliana. Cell Stress Chaperones. 2001, 6 (3): 238-246. 10.1379/1466-1268(2001)006<0238:THFOPI>2.0.CO;2.
Hill JE, Hemmingsen SM: Arabidopsis thaliana type I and II chaperonins. Cell Stress Chaperones. 2001, 6 (3): 190-200. 10.1379/1466-1268(2001)006<0190:ATTIAI>2.0.CO;2.
Scharf KD, Siddique M, Vierling E: The expanding family of Arabidopsis thaliana small heat stress proteins and a new family of proteins containing alpha-crystallin domains (Acd proteins). Cell Stress Chaperones. 2001, 6 (3): 225-237. 10.1379/1466-1268(2001)006<0225:TEFOAT>2.0.CO;2.
Lee U, Rioflorido I, Hong SW, Larkindale J, Waters ER, Vierling E: The Arabidopsis ClpB/Hsp100 family of proteins: chaperones for stress and chloroplast development. Plant J. 2007, 49 (1): 115-127.
Lee I, Seo YS, Coltrane D, Hwang S, Oh T, Marcotte EM, Ronald PC: Genetic dissection of the biotic stress response using a genome-scale gene network for rice. Proc Natl Acad Sci U S A. 2011, 108 (45): 18548-18553. 10.1073/pnas.1110384108.
Sarkar NK, Kim YK, Grover A: Rice sHsp genes: genomic organization and expression profiling under stress and development. BMC Genomics. 2009, 10: 393-10.1186/1471-2164-10-393.
Singh A, Singh U, Mittal D, Grover A: Genome-wide analysis of rice ClpB/HSP100, ClpC and ClpD genes. BMC Genomics. 2010, 11: 95-10.1186/1471-2164-11-95.
Sarkar NK, Kundnani P, Grover A: Functional analysis of Hsp70 superfamily proteins of rice (Oryza sativa). Cell Stress Chaperones. 2013, 18 (4): 427-437. 10.1007/s12192-012-0395-6.
Jung KH, Gho HJ, Nguyen MX, Kim SR, An G: Genome-wide expression analysis of HSP70 family genes in rice and identification of a cytosolic HSP70 gene highly induced under heat stress. Funct Integr Genomics. 2013, 13 (3): 391-402. 10.1007/s10142-013-0331-6.
Sugino M, Hibino T, Tanaka Y, Nii N, Takabe T, Takabe T: Overexpression of DnaK from a halotolerant cyanobacterium Aphanothece halophytica acquires resistance to salt stress in transgenic tobacco plants. Plant Sci. 1999, 146 (2): 81-88. 10.1016/S0168-9452(99)00086-2.
Alvim FC, Carolino SM, Cascardo JC, Nunes CC, Martinez CA, Otoni WC, Fontes EP: Enhanced accumulation of BiP in transgenic plants confers tolerance to water stress. Plant Physiol. 2001, 126 (3): 1042-1054. 10.1104/pp.126.3.1042.
Ono K, Hibino T, Kohinata T, Suzuki S, Tanaka Y, Nakamura T, Takabe T, Takabe T: Overexpression of DnaK from a halotolerant cyanobacterium Aphanothece halophytica enhances the high-temperatue tolerance of tobacco during germination and early growth. Plant Sci. 2001, 160 (3): 455-461. 10.1016/S0168-9452(00)00412-X.
Sato Y, Yokoya S: Enhanced tolerance to drought stress in transgenic rice plants overexpressing a small heat-shock protein, sHSP17.7. Plant Cell Rep. 2008, 27 (2): 329-334. 10.1007/s00299-007-0470-0.
Matthews LR, Vaglio P, Reboul J, Ge H, Davis BP, Garrels J, Vincent S, Vidal M: Identification of potential interaction networks using sequence-based searches for conserved protein-protein interactions or “interologs”. Genome Res. 2001, 11 (12): 2120-2126. 10.1101/gr.205301.
Ng SK, Zhang Z, Tan SH: Integrative approach for computationally inferring protein domain interactions. Bioinformatics. 2003, 19 (8): 923-929. 10.1093/bioinformatics/btg118.
Yu H, Luscombe NM, Lu HX, Zhu X, Xia Y, Han JD, Bertin N, Chung S, Vidal M, Gerstein M: Annotation transfer between genomes: protein-protein interologs and protein-DNA regulogs. Genome Res. 2004, 14 (6): 1107-1118. 10.1101/gr.1774904.
Wu X, Zhu L, Guo J, Zhang DY, Lin K: Prediction of yeast protein-protein interaction network: insights from the Gene Ontology and annotations. Nucleic Acids Res. 2006, 34 (7): 2137-2150. 10.1093/nar/gkl219.
Ideker T, Ozier O, Schwikowski B, Siegel AF: Discovering regulatory and signalling circuits in molecular interaction networks. Bioinformatics. 2002, 18 (Suppl 1): S233-S240. 10.1093/bioinformatics/18.suppl_1.S233.
Lyskov S, Gray JJ: The RosettaDock server for local protein-protein docking. Nucleic Acids Res. 2008, 36 (Web Server issue): W233-W238.
Lehner B, Fraser AG: A first-draft human protein-interaction map. Genome Biol. 2004, 5 (9): R63-10.1186/gb-2004-5-9-r63.
He F, Zhang Y, Chen H, Zhang Z, Peng YL: The prediction of protein-protein interaction networks in rice blast fungus. BMC Genomics. 2008, 9: 519-10.1186/1471-2164-9-519.
Wang TY, He F, Hu QW, Zhang Z: A predicted protein-protein interaction network of the filamentous fungus Neurospora crassa. Mol Biosyst. 2011, 7 (7): 2278-2285. 10.1039/c1mb05028a.
Deane CM, Salwinski L, Xenarios I, Eisenberg D: Protein interactions: two methods for assessment of the reliability of high throughput observations. Mol Cell Proteomics. 2002, 1 (5): 349-356. 10.1074/mcp.M100037-MCP200.
Han JD: Understanding biological functions through molecular networks. Cell Res. 2008, 18 (2): 224-237. 10.1038/cr.2008.16.
Barrett T, Troup DB, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, Marshall KA, Phillippy KH, Sherman PM, Muertter RN, Holko M, Ayanbule O, Yefanov A, Soboleva A: NCBI GEO: archive for functional genomics data sets–10 years on. Nucleic Acids Res. 2011, 39 (Database issue): D1005-D1010.
Troyanskaya O, Cantor M, Sherlock G, Brown P, Hastie T, Tibshirani R, Botstein D, Altman RB: Missing value estimation methods for DNA microarrays. Bioinformatics. 2001, 17 (6): 520-525. 10.1093/bioinformatics/17.6.520.
Benjamini Y: Opening the box of a boxplot. Am Stat. 1988, 42 (4): 257-262.
Frigge M, Hoaglin DC, Iglewicz B: Some implementations of the boxplot. Am Stat. 1989, 43 (1): 50-54.
Efron B: Bootstrap methods: another look at the jackknife. Ann Stat. 1979, 7 (1): 1-26. 10.1214/aos/1176344552.
Bairoch A, Apweiler R, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M, Natale DA, O'Donovan C, Redaschi N, Yeh LS: The universal protein resource (UniProt). Nucleic Acids Res. 2005, 33 (Database issue): D154-D159.
Sung DY, Kaplan F, Lee KJ, Guy CL: Acquired tolerance to temperature extremes. Trends Plant Sci. 2003, 8 (4): 179-187. 10.1016/S1360-1385(03)00047-5.
Xenarios I, Salwinski L, Duan XJ, Higney P, Kim SM, Eisenberg D: DIP, the database of interacting proteins: a research tool for studying cellular networks of protein interactions. Nucleic Acids Res. 2002, 30 (1): 303-305. 10.1093/nar/30.1.303.
Sprinzak E, Sattath S, Margalit H: How reliable are experimental protein-protein interaction data?. J Mol Biol. 2003, 327 (5): 919-923. 10.1016/S0022-2836(03)00239-0.
Horton P, Park KJ, Obayashi T, Fujita N, Harada H, Adams-Collier CJ, Nakai K: WoLF PSORT: protein localization predictor. Nucleic Acids Res. 2007, 35 (Web Server issue): W585-W587.
Pang E, Lin K: Yeast protein-protein interaction binding sites: prediction from the motif-motif, motif-domain and domain-domain levels. Mol Biosyst. 2010, 6 (11): 2164-2173. 10.1039/c0mb00038h.
Bateman A, Coin L, Durbin R, Finn RD, Hollich V, Griffiths-Jones S, Khanna A, Marshall M, Moxon S, Sonnhammer EL, Studholme DJ, Yeats C, Eddy SR: The Pfam protein families database. Nucleic Acids Res. 2004, 32 (Database issue): D138-D141.
Fisher RA: On the interpretation of χ2 from contingency tables, and the calculation of P. J R Stat Soc. 1922, 85 (1): 87-94. 10.2307/2340521.
Benjamini Y, Hochberg Y: Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B Methodol. 1995, 57 (1): 289-300.
Gattiker A, Gasteiger E, Bairoch A: ScanProsite: a reference implementation of a PROSITE scanning tool. Appl Bioinformatics. 2002, 1 (2): 107-108.
Sigrist CJ, Cerutti L, de Castro E, Langendijk-Genevaux PS, Bulliard V, Bairoch A, Hulo N: PROSITE, a protein domain database for functional characterization and annotation. Nucleic Acids Res. 2010, 38 (Database issue): D161-D166.
Wall D, Zylicz M, Georgopoulos C: The NH2-terminal 108 amino acids of the Escherichia coli DnaJ protein stimulate the ATPase activity of DnaK and are sufficient for lambda replication. J Biol Chem. 1994, 269 (7): 5446-5451.
Suh WC, Burkholder WF, Lu CZ, Zhao X, Gottesman ME, Gross CA: Interaction of the Hsp70 molecular chaperone, DnaK, with its cochaperone DnaJ. Proc Natl Acad Sci U S A. 1998, 95 (26): 15223-15228. 10.1073/pnas.95.26.15223.
Horne BE, Li T, Genevaux P, Georgopoulos C, Landry SJ: The Hsp40 J-domain stimulates Hsp70 when tethered by the client to the ATPase domain. J Biol Chem. 2010, 285 (28): 21679-21688. 10.1074/jbc.M110.113118.
Saitoh H, Cooke CA, Burgess WH, Earnshaw WC, Dasso M: Direct and indirect association of the small GTPase ran with nuclear pore proteins and soluble transport factors: studies in Xenopus laevis egg extracts. Mol Biol Cell. 1996, 7 (9): 1319-1334. 10.1091/mbc.7.9.1319.
Imamoto N, Shimamoto T, Takao T, Tachibana T, Kose S, Matsubae M, Sekimoto T, Shimonishi Y, Yoneda Y: In vivo evidence for involvement of a 58 kDa component of nuclear pore-targeting complex in nuclear protein import. EMBO J. 1995, 14 (15): 3617-3626.
Imamoto N, Tachibana T, Matsubae M, Yoneda Y: A karyophilic protein forms a stable complex with cytoplasmic components prior to nuclear pore binding. J Biol Chem. 1995, 270 (15): 8559-8565. 10.1074/jbc.270.15.8559.
Radu A, Moore MS, Blobel G: The peptide repeat domain of nucleoporin Nup98 functions as a docking site in transport across the nuclear pore complex. Cell. 1995, 81 (2): 215-222. 10.1016/0092-8674(95)90331-3.
Mattaj IW, Englmeier L: Nucleocytoplasmic transport: the soluble phase. Annu Rev Biochem. 1998, 67: 265-306. 10.1146/annurev.biochem.67.1.265.
Moore MS, Blobel G: The GTP-binding protein Ran/TC4 is required for protein import into the nucleus. Nature. 1993, 365 (6447): 661-663. 10.1038/365661a0.
Shulga N, Roberts P, Gu Z, Spitz L, Tabb MM, Nomura M, Goldfarb DS: In vivo nuclear transport kinetics in Saccharomyces cerevisiae: a role for heat shock protein 70 during targeting and translocation. J Cell Biol. 1996, 135 (2): 329-339. 10.1083/jcb.135.2.329.
Soltys BJ, Gupta RS: Mitochondrial-matrix proteins at unexpected locations: are they exported?. Trends Biochem Sci. 1999, 24 (5): 174-177. 10.1016/S0968-0004(99)01390-0.
Pleckaityte M, Mistiniene E, Michailoviene V, Zvirblis G: Identification and characterization of a Hsp70 (DnaK) chaperone system from Meiothermus ruber. Mol Genet Genomics. 2003, 269 (1): 109-115.
Chakrabortee S, Tripathi R, Watson M, Schierle GS, Kurniawan DP, Kaminski CF, Wise MJ, Tunnacliffe A: Intrinsically disordered proteins as molecular shields. Mol Biosyst. 2012, 8 (1): 210-219. 10.1039/c1mb05263b.
Luo Q, Jiang L, Chen G, Feng Y, Lv Q, Zhang C, Qu S, Zhu H, Zhou B, Xiao X: Constitutive heat shock protein 70 interacts with alpha-enolase and protects cardiomyocytes against oxidative stress. Free Radic Res. 2011, 45 (11–12): 1355-1365.
Kaplan F, Kopka J, Haskell DW, Zhao W, Schiller KC, Gatzke N, Sung DY, Guy CL: Exploring the temperature-stress metabolome of Arabidopsis. Plant Physiol. 2004, 136 (4): 4159-4168. 10.1104/pp.104.052142.
Thao NP, Chen L, Nakashima A, Hara S, Umemura K, Takahashi A, Shirasu K, Kawasaki T, Shimamoto K: RAR1 and HSP90 form a complex with Rac/Rop GTPase and function in innate-immune responses in rice. Plant Cell. 2007, 19 (12): 4035-4045. 10.1105/tpc.107.055517.
Gabai VL, Meriin AB, Mosser DD, Caron AW, Rits S, Shifrin VI, Sherman MY: Hsp70 prevents activation of stress kinases. A novel pathway of cellular thermotolerance. J Biol Chem. 1997, 272 (29): 18033-18037. 10.1074/jbc.272.29.18033.
Park HS, Cho SG, Kim CK, Hwang HS, Noh KT, Kim MS, Huh SH, Kim MJ, Ryoo K, Kim EK, Kang WJ, Lee JS, Seo JS, Ko YG, Kim S, Choi EJ: Heat shock protein hsp72 is a negative regulator of apoptosis signal-regulating kinase 1. Mol Cell Biol. 2002, 22 (22): 7721-7730. 10.1128/MCB.22.22.7721-7730.2002.
Hwang JR, Zhang C, Patterson C: C-terminus of heat shock protein 70-interacting protein facilitates degradation of apoptosis signal-regulating kinase 1 and inhibits apoptosis signal-regulating kinase 1-dependent apoptosis. Cell Stress Chaperones. 2005, 10 (2): 147-156. 10.1379/CSC-90R.1.
Shirasu K: The HSP90-SGT1 chaperone complex for NLR immune sensors. Annu Rev Plant Biol. 2009, 60: 139-164. 10.1146/annurev.arplant.59.032607.092906.
Kawano Y, Chen L, Shimamoto K: The function of Rac small GTPase and associated proteins in rice innate immunity. Rice. 2010, 3 (2–3): 112-121.
Catlett MG, Kaplan KB: Sgt1p is a unique co-chaperone that acts as a client adaptor to link Hsp90 to Skp1p. J Biol Chem. 2006, 281 (44): 33739-33748. 10.1074/jbc.M603847200.
Cheng YT, Li Y, Huang S, Huang Y, Dong X, Zhang Y, Li X: Stability of plant immune-receptor resistance proteins is controlled by SKP1-CULLIN1-F-box (SCF)-mediated protein degradation. Proc Natl Acad Sci U S A. 2011, 108 (35): 14694-14699. 10.1073/pnas.1105685108.
Shen G, Adam Z, Zhang H: The E3 ligase AtCHIP ubiquitylates FtsH1, a component of the chloroplast FtsH protease, and affects protein degradation in chloroplasts. Plant J. 2007, 52 (2): 309-321. 10.1111/j.1365-313X.2007.03239.x.
Lindahl M, Spetea C, Hundal T, Oppenheim AB, Adam Z, Andersson B: The thylakoid FtsH protease plays a role in the light-induced turnover of the photosystem II D1 protein. Plant Cell. 2000, 12 (3): 419-431. 10.1105/tpc.12.3.419.
Bailey S, Thompson E, Nixon PJ, Horton P, Mullineaux CW, Robinson C, Mann NH: A critical role for the Var2 FtsH homologue of Arabidopsis thaliana in the photosystem II repair cycle in vivo. J Biol Chem. 2002, 277 (3): 2006-2011. 10.1074/jbc.M105878200.
Sakamoto W, Tamura T, Hanba-Tomita Y, Murata M, Sodmergen : The VAR1 locus of Arabidopsis encodes a chloroplastic FtsH and is responsible for leaf variegation in the mutant alleles. Genes Cells. 2002, 7 (8): 769-780. 10.1046/j.1365-2443.2002.00558.x.
Yokthongwattana K, Chrost B, Behrman S, Casper-Lindley C, Melis A: Photosystem II damage and repair cycle in the green alga Dunaliella salina: involvement of a chloroplast-localized HSP70. Plant Cell Physiol. 2001, 42 (12): 1389-1397. 10.1093/pcp/pce179.
Gao T, Newton AC: The turn motif is a phosphorylation switch that regulates the binding of Hsp70 to protein kinase C. J Biol Chem. 2002, 277 (35): 31585-31592. 10.1074/jbc.M204335200.
Marteil G, Gagne JP, Borsuk E, Richard-Parpaillon L, Poirier GG, Kubiak JZ: Proteomics reveals a switch in CDK1-associated proteins upon M-phase exit during the Xenopus laevis oocyte to embryo transition. Int J Biochem Cell Biol. 2012, 44 (1): 53-64. 10.1016/j.biocel.2011.09.003.
Kumar M, Rawat P, Khan SZ, Dhamija N, Chaudhary P, Ravi DS, Mitra D: Reciprocal regulation of human immunodeficiency virus-1 gene expression and replication by heat shock proteins 40 and 70. J Mol Biol. 2011, 410 (5): 944-958. 10.1016/j.jmb.2011.04.005.
Ding XZ, Tsokos GC, Kiang JG: Overexpression of HSP-70 inhibits the phosphorylation of HSF1 by activating protein phosphatase and inhibiting protein kinase C activity. FASEB J. 1998, 12 (6): 451-459.
Ouyang S, Zhu W, Hamilton J, Lin H, Campbell M, Childs K, Thibaud-Nissen F, Malek RL, Lee Y, Zheng L, Orvis J, Haas B, Wortman J, Buell CR: The TIGR rice genome annotation resource: improvements and new features. Nucleic Acids Res. 2007, 35 (Database issue): D883-D887.
Hastie TTR, Narasimhan B, Chu G: Impute: imputation for microarray data. R Package Version 1220. 2009, http://CRAN.R-project.org/package=impute,
Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, Dudoit S, Ellis B, Gautier L, Ge Y, Gentry J, Hornik K, Hothorn T, Huber W, Iacus S, Irizarry R, Leisch F, Li C, Maechler M, Rossini AJ, Sawitzki G, Smith C, Smyth G, Tierney L, Yang JY, Zhang J: Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 2004, 5 (10): R80-10.1186/gb-2004-5-10-r80.
Cleveland W: LOWESS: a program for smoothing scatterplots by robust locally weighted regression. Am Stat. 1981, 35: 1-
Ihaka R, Gentleman R: R: a language for data analysis and graphics. J Comput Graph Stat. 1996, 5 (3): 299-314.
Yang YH, Dudoit S, Luu P, Lin DM, Peng V, Ngai J, Speed TP: Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. Nucleic Acids Res. 2002, 30 (4): e15-10.1093/nar/30.4.e15.
Smyth GK, Speed T: Normalization of cDNA microarray data. Methods. 2003, 31 (4): 265-273. 10.1016/S1046-2023(03)00155-5.
Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL: BLAST+: architecture and applications. BMC Bioinformatics. 2009, 10: 421-10.1186/1471-2105-10-421.
Du Z, Zhou X, Ling Y, Zhang Z, Su Z: agriGO: a GO analysis toolkit for the agricultural community. Nucleic Acids Res. 2010, 38 (Web Server issue): W64-W70.
Eddy SR: A new generation of homology search tools based on probabilistic inference. Genome Inform. 2009, 23 (1): 205-211.
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T: Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003, 13 (11): 2498-2504. 10.1101/gr.1239303.
This work was supported by the Natural Science Foundation of China and Fujian (grant nos. 31270454, 61163047 and 2013J01077), a grant from the Education Department of Fujian (grant no. JA12290) and the Key Subject of Ecology in Fujian (grant nos. 0608507 and 6112C0600).
The authors declare that they have no competing interests.
HQH conceived the study, analyzed the data and revised the manuscript. YFW conducted data analysis, analyzed Hsp70 network and drafted the manuscript. SKL analyzed the network of HSP70. KL developed part of the Perl scripts. QS constructed the dataset. HT and SFQ participated in the construction of online tools. XHC and JH developed and maintained the website. All authors read and approved the final manuscript.
Yongfei Wang, Shoukai Lin, Qi Song contributed equally to this work.
Electronic supplementary material
Additional file 1: Table S1: Domains for heat shock protein query in Uniprot database. Table S2. Number of PPIs related to Hsps in yeast collected from DIP. Table S3. Number ofpredicted protein-protein interaction related to rice Hspsby using interolog method. Table S4. 2 × 2 contingency table for Fisher’s exact test. Table S5. PCC between Hsp70s and Ran, importin proteins respectively. Table S6. PCC between Hsp70s and fumaratehydratase, malate dehydrogenase and citrate synthase respectively. Table S7. PCC between Hsp70s and Racs, Hsp90, SKP1 respectively. Table S8. PCC between Hsp70s and FtsH proteins. (DOC 78 KB)
Additional file 2: Supplemental Data 1A. Predicted PPIs ofHsp70s in rice based on the interolog and gene expression-based methods. Supplemental Data 1B. Predicted interactors of Hsp70s. (XLS 297 KB)
Additional file 3: Supplemental Data 2A. Domains overrepresented among interactors of Hsp70s. Supplemental Data 2B. Motifs overrepresented among interactors of Hsp70s. (XLS 28 KB)
Additional file 4: Supplemental Data 3A. Enriched GO terms among interactors with expression levels positively correlated with Hsp70s. Supplemental Data 3B. Enriched GO terms among interactorswith expression levels negatively correlated with Hsp70s. (XLS 32 KB)
Additional file 5: Figure S1: Gene expression profile of Hsp70s, Ran and importin proteins in response to abiotic stresses. Figure S2. Gene expression profile of Hsp70s, enolase, fumaratehydratase, malate dehydrogenase and citrate synthase in response to abiotic stresses. Figure S3. Gene expression profile of Hsp70s, Racs, Hsp90s, SKP1 in response to abiotic stresses. Figure S4. Gene expression profile of Hsp70s and FtsH proteins in response to abiotic stresses. (DOC 80 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Wang, Y., Lin, S., Song, Q. et al. Genome-wide identification of heat shock proteins (Hsps) and Hsp interactors in rice: Hsp70s as a case study. BMC Genomics 15, 344 (2014). https://doi.org/10.1186/1471-2164-15-344