Genomic survey, expression profile and co-expression network analysis of OsWD40 family in rice
© Ouyang et al; licensee BioMed Central Ltd. 2012
Received: 16 August 2011
Accepted: 20 March 2012
Published: 20 March 2012
WD40 proteins represent a large family in eukaryotes, which have been involved in a broad spectrum of crucial functions. Systematic characterization and co-expression analysis of OsWD40 genes enable us to understand the networks of the WD40 proteins and their biological processes and gene functions in rice.
In this study, we identify and analyze 200 potential OsWD40 genes in rice, describing their gene structures, genome localizations, and evolutionary relationship of each member. Expression profiles covering the whole life cycle in rice has revealed that transcripts of OsWD40 were accumulated differentially during vegetative and reproductive development and preferentially up or down-regulated in different tissues. Under phytohormone treatments, 25 OsWD40 genes were differentially expressed with treatments of one or more of the phytohormone NAA, KT, or GA3 in rice seedlings. We also used a combined analysis of expression correlation and Gene Ontology annotation to infer the biological role of the OsWD40 genes in rice. The results suggested that OsWD40 genes may perform their diverse functions by complex network, thus were predictive for understanding their biological pathways. The analysis also revealed that OsWD40 genes might interact with each other to take part in metabolic pathways, suggesting a more complex feedback network.
All of these analyses suggest that the functions of OsWD40 genes are diversified, which provide useful references for selecting candidate genes for further functional studies.
KeywordsOryza sativa Expression profiles Microarray WD40 gene Co-expression network
Proteins characterized by conserved motifs may belong to a gene family, which were represented by structural or functional similarity and evolutionary relationships. WD40 proteins are a group of proteins that are highly conserved in evolution and are extremely abundant across a wide range of eukaryotic organisms . Structurally, these proteins are characterized by the presence of approximately 40 amino acids core region, which contains a glycine-histidine (GH) dipeptide at the N terminus and a tryptophan-aspartate (WD) dipeptide at the C terminus separated by a region of variable lengths . Usually, the WD40 protein contains several tandemly repeated units of such motif, which are required to form the secondary structure . The structure of several WD40 proteins has been determined, suggesting that the WD40 domain folds into a secondary structural of beta propeller despite large levels of sequence diversity. For example, the mammalian Gβ subunit of heterotrimeric GTPases involved in signal transduction forms a beta propeller structure containing seven WD40 repeats .
WD40 gene family shows low level of sequence conservation with functional diversity in diverse pathways, and many WD40 proteins possess additional domains with other functional activities. Biochemical and structural studies have recognized WD40 proteins to be a broader spectrum of components in cytoplasm and nucleoplasm. They participate in important cellular pathways, including signal transduction, RNA processing, cytoskeleton dynamics, vesicular trafficking, nuclear export, regulation of cell division, and are especially prevalent in chromatin modification and transcriptional regulation [2, 4, 5]. In the model plant Arabidopsis thaliana, the WD 40 proteins have been identified and analyzed comprehensively . The results suggested that these proteins played key roles in plant-specific processes, with diversity in function conferred at least in part by divergence in upstream signaling pathways, downstream regulatory targets and/or structure outside of the WD 40 regions .
A common characteristic feature of the WD40 proteins is that, the WD40 domain mediates diverse protein-protein or protein-DNA interactions, thus interplays with multiple proteins to form dynamic complexes and functions as scaffolding protein [4, 6]. WD40 domain can mediate molecular recognitions with diverse partners through different sides of its surface. The same WD40 proteins can either recruit different substrates in a similar mode or in distinct ways . Considering the multiple interaction modes and the complex roles in cellular processes, it is difficult to identify the partners and pathways in relationship with WD40 proteins. The availability of high-throughput interactomes from different species enables us to understand the networks of the WD40 proteins more comprehensively [7–10]. For instance, WD40 domains were found to take part in more interaction pairs than any other domain in yeast, and being as the one of the most interacting domains in human interactome datasets [7–10]. Meanwhile, WD40 proteins can also act as a component of protein complexes involved in a variety of pathways [11–14], or offer binding sites for other proteins [15, 16], demonstrating that they can interact with the appropriate partner in different processes. The expression profiles in genome-wide scale provide essential data for building the co-expression network, thus allowed us to identify biological processes and gene functions . The comprehensive expression data in CREP database encompassing the entire life cycle of rice (Oryza sativa) provided rich information for associating the WD40 genes in different pathways by co-expression analysis . And deeper understanding of their structures, expression profiles, interactions and functional diversity will be essential for our study in the detailed cellular processes mediated by WD40 proteins.
Rice is one of the major staple foods for world population. It is also an ideal model species for functional genomic analysis and represents an evolutionary lineage within the monocotyledons. In this study, we discuss three important questions through the genome-wide scan and systematic characterization of OsWD40 gene family during the whole life cycle in rice. First, how many members belong to this family and what about their localizations, gene structures and other characteristics? Second, what are the expression patterns of the OsWD40 genes, and what is the connection between the expression levels and their gene functions? Third, how does this gene family evolve or what are the evolutionary relationships between these OsWD40 genes? Therefore, the answers would provide a solid base for future functional genomic studies of the OsWD40 genes in rice.
Collection and identification of the OsWD40 genes in rice
In order to identify the OsWD40 genes in the rice, the consensus protein sequence which is characteristic of WD40 genes in eukaryotes, GECKXVLXGHTSTVTCVAFSPDGPLLASGSRDGTIKIWD, was generated by hmmemit from HMM profile (PF00400). We carried out BLASTP analysis using this sequence as a query in MSU database http://rice.plantbiology.msu.edu/index.shtml, with a threshold E value of ≤ 10. A total of 342 sequences were identified as putative OsWD40 genes. By removal of different transcripts of the same gene, we identified 234 putative OsWD40 genes. These candidates were examined by SMART and Pfam searching for the presence of WD40 domain. Thus, 159 genes with the presence of WD40 domain were confirmed by SMART, and additional 41 genes were identified as containing such domain in Pfam. Therefore, there were a total of 200 OsWD40 genes in the rice genome. For convenience, the 200 OsWD40 genes were named from OsWD40-1 to OsWD40-200 according to their positions on pseudomolecules. As the table containing the accession numbers of each OsWD40 gene is too large within one printed page, these data were exhibited as the additional file (Additional file 1: Table S1). The detailed information of OsWD40 genes were also listed in Additional file 1: Table S1.
Except for the presence of a conserved WD40 domain, the OsWD40 genes vary substantially in the size and sequences of their encoded proteins, and their physicochemical properties (Additional file 1: Table S1). The position of the WD40 domain within the protein also varies. The length of OsWD40 proteins varied from 91 to 3787 amino acids. EXPASY analysis suggested that the OsWD40 protein sequences had large variations in isoelectric point (pI) values (ranging from 4.0839 to 10.3354) and molecular weight (ranging from 9.997 kDa to 420.65 kDa) (Additional file 1: Table S1). Only 61 of the 200 OsWD40 genes were predicted to be stable proteins, while the rest were unstable. Details on other parameters of protein sequences were shown in Additional file 1: Table S1.
Classification and phylogenetic analysis of OsWD40 proteins
To explore the evolutionary relationships of the WD40 genes in rice and Arabidopsis, an unrooted phylogenetic tree was generated from alignments of their full-length protein sequences. The phylogenetic analysis revealed that all WD40s were clustered into five distinct groups (Cluster I to Cluster V), comprising 151, 26, 66, 68, and 122 proteins, respectively (Additional file 2: Figure S2). WD40 proteins from rice and Arabidopsis are present in all groups. The OsWD40 members were more closely to those in the same clade in Arabidopsis than to other OsWD40 proteins in the same species (Additional file 3: Table S3), which indicated synteny and conservation between rice and Arabidopsis proteins. Most members in the same groups or subgroups shared one or more domains outside the WD40 domain, thus was consistent with the subfamily definition revealed above.
Expression profiling of OsWD40 genes during the whole life cycle of rice
To study the transcript accumulation of OsWD40 genes in the entire life cycle of rice, the expression profiling covering 24 developmental stages (Additional file 4: Table S4) in Minghui 63 were analyzed by Affymetrix rice microarray data in CREP database . Probes for 184 of the 200 OsWD40 genes could be identified in the Affymetrix microarray. Thirty-six genes had two probe sets and the higher signal value of the probe sets was used for analysis. Two pairs of genes, OsWD40-28 and OsWD40-42, as well as OsWD40-182 and OsWD40-193, shared the same probe sets, respectively. Only 182 genes showed a "present" detection call at p value of 0.05 in at least one of the investigated tissues, whereas two low expressing genes (OsWD40-151 and OsWD40-127) were either "absent" or "marginal" under these developmental stages. The transcripts of these two genes were at almost undetectable levels in all the stages analyzed, but it is possible that these genes might respond to specific stimuli or their expressions might be limited to specialized cell types that have not been analyzed in this investigation.
Responses of OsWD40 genes under NAA, KT, and GA3 treatments
Four OsWD40 genes showed differential expression under all three phytohormone treatments, among which three genes (OsWD40-25, 138, and 147) were up-regulated, whereas OsWD40-186 was down-regulated. The expression profile of the remaining genes in response to NAA, KT, and GA3 was different. For instance, three genes (OsWD40-33, 116 and 132) were up-regulated under NAA and GA3 treatments, two genes (OsWD40-7 and 77) were differentially expressed under NAA and KT treatments, and six genes (OsWD40-46, 76, 90, 98, 142 and 173) were up-regulated to KT and GA3 treatment, respectively. Meanwhile, ten OsWD40 genes showed differential expression specifically to one phytohormone treatment. Amongst the ten genes, OsWD40-15, 58, 69, 71 and 174 were up-regulated specifically to GA3 treatment. We also found that OsWD40-5, 22, 84 and 88 were up-regulated, whereas OsWD40-67 was down-regulated specifically to KT treatment.
The induction of OsWD40 genes by phytohormones prompted us to check their promoter sequence (2 kb upstream the transcript start site) by searching against the PLACE database http://www.dna.affrc.go.jp/PLACE/signalscan.html. The results suggested that all promoter regions of these 25 OsWD40 genes contained various elements of auxin, gibberellin, and cytokinin (Additional file 6: Table S6).
Chromosomal localization and gene duplication
Segmental duplication and tandem duplication play important roles in generating the members of a gene family during the evolution . Therefore, both segmental and tandem duplication events were investigated for elucidating the potential mechanism of evolution of OsWD40 gene family. Analysis of the MSU RGAP rice segmental duplication database revealed only 24 (12 pairs) OsWD40 genes could be assigned to MSU RGAP segmental duplication blocks at a maximal length distance permitted between collinear gene pairs of 500 kb. The overall similarity of the cDNA sequences of these genes ranged from 32.7% to 96.7% and all of them were found to have their counterparts on duplicated segments (Figure 5, Additional file 7: Table S7). Nineteen OsWD40 genes (nine groups) seemed to be produced from tandem duplications according to the criterion adopted in our analysis (Additional file 8: Table S8). They were separated by a maximum of five intervening genes. Three group of the gene pairs were placed juxtaposed with no intervening gene. The distance between these genes ranged from 3 kb to 35 kb (Additional file 8: Table S8). Interestingly, not all the tandemly duplicated genes in the same cluster had the same direction of transcription. This might suggest the complex behavior of tandem duplications in this family. All these results suggested that much of the diversity of the OsWD40 gene family in rice is due to both tandem duplication and segmental genome duplication events.
Two tandemly duplicated genes in Group 2 shared the same probe and OsWD40-187 in Group 9 did not have probe sets on Affymetrix microarray. Therefore, we analyzed the rest 16 tandemly duplicated genes in 9 groups. The expression pattern for tandem duplicated genes was more complicated (Figure 6B). The expression pattern was quite similar for four pairs of genes (Group 4, 5, 6, 8). Therefore, the gene copies might have maintained their functions during evolution, as evidenced by the similar expression pattern. Four pairs of genes (Group 1, 3, 7, 9) showed divergent expression profiles in most of the investigated tissues, as one of the genes was not expressed at significant levels in most of the tissues. This indicated that one of the members changed its function during the course of evolution. Group 7 containing OsWD40-153 and OsWD40-15 4 have already been elucidated. As reported by Luo et al. , OsFIE1 (OsWD40-154) and OsFIE2 (OsWD40-153) were likely to have duplicated in the ancestor of the grasses. OsFIE1 was expressed only in endosperm with imprinted effect , while OsFIE2 was not imprinted in endosperm and was expressed constitutively. OsWD40-18 in Group 1 was not expressed at significant levels in all tissues, which might be induced by pseudo-functionalization after duplication [20, 21].
Expression correlation and gene ontology (GO) analyses
The network in Figure 7A contained 33 rice genes (nodes) and 54 co-expression links, including 16 OsWD40 genes, 14 MYB related genes and three basic helix-loop-helix (bHLH) genes. Compared with the whole rice genome annotation, these OsWD40 genes seem to affect the regulation of multiple biological processes, such as cellular biosynthetic/metabolic process, transcription, nucleobase, nucleoside, nucleotide and nucleic acid metabolic process, gene expression, biosynthetic process, nitrogen compound metabolic process, macromolecule biosynthetic process and so on. Besides, the molecular functions related to DNA binding and nucleic acid binding were significantly enriched (Figure 8A). The network in Figure 7B contained only 9 rice genes (nodes) and 7 co-expression links. This network suggested the possibility that five OsWD40 genes might play roles in different molecular pathways with the participation of four MADS-box genes. Complex network has also been constructed in Figure 7C, containing 46 rice genes (nodes) and 69 co-expression links. The co-expression genes in this group were associated with histone-related proteins such as histone-lysine N-methyltransferase, histone deacetylase, single myb histone, jmjC domain containing proteins and so on. Analysis of their GO terms identified functional modules enriched for chromosome organization, chromatin organization, organelle organization, and chromatin modification. These genes also acted as important cellular components in nucleus and membrane-bounded organelle, and might take part in methyltransferase and transferase activities (Figure 8B). Our analysis also indicated that the expression of some OsWD40 genes was co-expressed with that of other OsWD40 genes. We found that 58 OsWD40 genes might therefore correlate with or interact with each other, thus forming a more complex feedback network (Figure 7D). However, no significant GO terms were identified.
OsWD40 evolution and classification
Gene duplications are one of the primary driving forces during the evolution, and the variations in family size and distribution of a gene family were related to either tandem or segmental duplications [19, 21]. As stated previously, OsWD40 family expanded from both tandem and segmental duplications, and the number of OsWD40 genes arranged in segmental duplications contributes to the 12% birth of new genes, while tandem duplication events contribute to the 9.5% birth of new genes.
It was interesting that different subfamilies of OsWD40 genes expanded in distinct manners: all segmental duplicated genes were favor for subfamily A, except OsWD40-49 that belonged to subfamily I. However, tandem duplications were belonged to various subfamilies, including subfamily A, B, D, G, and K.
We also noticed that the expression level of three tandemly duplicated genes, OsWD40-77, 84, and 186 were lower than that of their corresponding copies, suggesting they might lose their functions during the evolution. However, these three genes were differentially expressed under phytohormone treatments. Whereas another copy of these tandemly duplicated genes did not show differential expression under phytohormone treatments. Therefore, one would tentatively suppose that while OsWD40-77, 84, and 186 slowly lost their ancestral functions, they might evolve new functions in phytohormone pathways during the evolution.
OsWD40 genes may initiate their diverse functions by performing protein complex with MYB and bHLH transcription factors
The multiple metabolism pathways in plants must be regulated by coordinated expression of different genes. The co-expression networks reflect the correlation of the expression pattern of different genes, and are suggestive in tracing the genes in the same pathway. Here, our co-expression analysis has revealed that the function of OsWD40 proteins might require the participation of various members of MYB and bHLH transcription factors (Figure 7A).
OsWD40-13 was found to be co-expressed with five MYB factors, Os12g13570, Os06g19980, Os01g12860, Os02g34630 and Os06g40710, as well as two bHLH factors Os06g16400 and Os08g42470. OsWD40-13 was homologous to plant SMU genes, which appeared to be involved in splicing of specific pre-mRNAs that affected multiple aspects of development . Thus, whether transcription factors could serve as a link to mRNA process was a suggestive direction in further study. We also noticed that OsWD40-23 was found to be co-expressed with these transcription factors. Compared with the co-expression genes of OsWD40-13, four MYB factors Os02g42870, Os08g25820, Os08g25799, and Os08g41480 might be the potential interaction factors of OsWD40-23 in addition. OsWD40-71 might be a homolog of Arabidopsis LIS, which restricted gametic cell fate in female gametophyte [25, 26]. OsWD40-71 was found to be co-expressed with three MYB factors, Os02g34630, Os01g51154 and Os06g19980, as well as a bHLH factor Os06g16400. These three OsWD40 genes were co-expressed with the same group of MYB and bHLH transcription factors, suggesting that they may take part in correlated molecular pathways by interaction with these partners.
Previous genetic analyses found that ectopic expression of maize WD40 protein PAC1 in an Arabidopsis ttg1 mutant was able to complement the mutant phenotypes , suggesting that WD40 proteins can interact with similar partners. We might speculate that some OsWD40 genes might be in relation with each other by controlling the expression of these transcription factors. Another OsWD40 gene that attracted our attention was OsWD40-20, which was co-expressed with another group of MYB and bHLH transcription factors that were different from the co-expression genes mentioned above. We also found that OsWD40-20 might be related with other partners such as genes in SET family, HDAC family, and HAC (Figure 7C). This result suggested that OsWD40-20 might participate in the regulation of another pathway.
A general WD/Myb/bHLH complex for regulation of the anthocyanin biosynthetic pathway was also found in Antirrhinum majus, Petunia hybrida and Arabidopsis thaliana[28–34]. Meanwhile, a bHLH protein Lc in maize was found to interact with MYB transcription factors to activate anthocyanin expression . Another Lc-like bHLH protein was also found to require a MYB protein to perform its function [31, 34, 36, 37]. Results suggested that many MYBs interacted directly with Lc-like bHLH proteins and the WD40 repeat protein [34, 36]. Therefore, it seems that WD40 proteins allow protein-protein interactions between the bHLH and MYB proteins, and WD40 proteins in rice might also require MYBs and bHLHs to form a transcription complex to participate a range of pathways.
OsWD40 genes may be involved in histone-related functions with members in SET family
Histone expression and histone post-translational modifications play pivotal roles in chromatin remodeling and epigenetic regulation in plant development [38–40]. Our co-expression analysis has revealed that OsWD40 genes may function with histone-related proteins. Although the exact pathways mediate by these genes are still unclear, one might speculate that these OsWD40 genes play important roles in histone modification.
An important group of enzymes involved in histone modification is the histone-lysine N-methyltransferases. These proteins participate in the establishment and/or maintenance of euchromatic or heterochromatic states of active or transcriptionally repressed sequences . Here, a total of 10 histone-lysine N-methyltransferases were identified to be co-expressed with the OsWD40 proteins, both of which contain the SET domain that is responsible for the catalytic activity of the enzymes, suggesting possible interactions between the OsWD40 and SET genes in a family level. One might also tentatively speculate that the WD40 and SET domain must be the key functional structure for interaction by a conserved mechanism.
Functional studies of several Arabidopsis genes encoding WD40 proteins also suggest that they might be implicated in histone modification in different pathways. A WD40 domain cyclophilin, CYCLOPHILIN71 (CYP71), which functions in gene repression and organogenesis in Arabidopsis, serves as a highly conserved histone remodeling factor involved in chromatin-based gene silencing . Another WD40 protein MSI1 in Arabidopsis has also been proposed to exhibit pleiotropic phenotypes by epigenetic regulation [43–45]. Therefore, OsWD40 genes in rice might also involved in similar pathways by histone modulation. In a word, characterization of OsWD40 proteins function in histone modification could therefore open new perspectives for understanding the molecular mechanism of epigenetic regulation.
OsWD40 genes may take part in reproductive pathways with MADS-box transcription factors
WD40 genes identified in different plant species are involved in various developmental processes . In our study, the expression patterns of OsWD40 genes and the co-expression analysis provide useful information for establishing their putative functions. The available evidence suggests that OsWD40 genes may take part in reproductive pathways with MADS-box transcription factors in rice.
MADS-box transcription factors are essential for various aspects of pathways in flower development both in dicotyledon and monocotyledon . Our result suggested that OsWD40-23 was co-expressed with three MADS-box transcription factors, Os03g54170, Os06g36680, and Os01g52680 (Figure 7B). OsWD40-23 was homologous to Arabidopsis FVE/MSI4, a key regulator that interacted with CUL4-DDB1 and a PRC2-like complex to control epigenetic regulation of flowering time . It was also reported that Os03g54170 (OsMADS34) was required for rice inflorescence and spikelet development . Therefore, it would be interesting to investigate whether OsWD40-23 play roles in rice flower development with MADS-box transcription factors. We also found that three OsWD40 genes, OsWD40-31, 48, and 89, were co-expressed with a MADS gene Os04g38770. These four genes were expressed preferentially in stamen (Figure 2), suggesting that they may function in stamen development. All these studies support the results that MADS-box transcription factors are essential for flower developmental processes in relationship with OsWD40 genes.
In conclusion, using an in silico approach, a total of 200 OsWD40 genes were found to be present in rice genome. Genomic framework revealed the potential mechanisms responsible for the evolution of OsWD40 genes in rice. The expression profiling of OsWD40 gene family covering rice life cycle could provide deep insights into their potential functions during rice growth and development. Some genes appear to be differentially expressed in different tissues/organs, vegetative and reproductive development stages, and expression of some genes is influenced under phytohormones. These data will provide the basis for understanding the evolutionary history of OsWD40 members and their roles in rice growth and development. The findings in our work would be useful in selecting candidate genes for functional studies of OsWD40 members in rice. However, future research by adopting transformation strategies or insertion mutagenesis is required to elucidate the precise functions of these OsWD40 genes.
Collection and database search of OsWD40 members in rice
Hidden Markov Model (HMM) profile of WD40 domain (PF00400) downloaded from Pfam http://pfam.sanger.ac.uk/ was employed to identify the putative OsWD40 genes in rice. The BlastP search was carried out using the HMM profile on website of MSU RGAP http://rice.plantbiology.msu.edu/, followed by removal of redundant sequences from the database. The Pfam http://www.sanger.ac.uk/Software/Pfam/ and SMART database http://smart.embl-heidelberg.de/smart/batch.pl were finally used to confirm each predicted WD40 protein. Additional conserved motifs or domains besides WD40 were identified in Pfam database. Based on these domains, we classified the OsWD40 proteins into subfamilies and the sample protein structures of each subfamily were drawn manually.
Chromosomal localization and gene duplication
Each of the OsWD40 genes was mapped on rice chromosomes according to their positions available in MSU RGAP http://rice.plantbiology.msu.edu/. The distribution of OsWD40 genes was drawn by MapInspect http://www.plantbreeding.wur.nl/UK/software_mapinspect.html and modified manually with annotation.
The duplicated genes were elucidated from the segmental genome duplication of rice http://rice.plantbiology.msu.edu/segmental_dup/500kb/segdup_500kb.shtml, with the maximal length distance permitted between collinear gene pairs of 500 kb. Tandem duplicates were defined as genes separated by five or fewer genes. The distance between these genes on the chromosomes was calculated and the percentage of sequence similarity between the proteins encoded by these genes was determined by MegAlign software 4.0.
Structural analysis of the OsWD40 genes
Information about the gene structures, transcripts, full-length cDNA, BAC accessions for each gene and characteristics of corresponding proteins were procured from MSU RGAP and KOME http://cdna01.dna.affrc.go.jp/cDNA/.
Protein sequences of putative OsWD40 members collected from the MSU RGAP and KOME were analyzed by EXPASY PROTOPARAM tool http://www.expasy.org/tools/protparam.html. Information about the number of amino acids, molecular weight, theoretical isoelectric point (pI), amino acid composition, and instability index (instability index of > 40 was considered as unstable ) were obtained by this tool. The conserved domain of the OsWD40 protein in rice was determined by Pfam program.
Phylogenetic analysis of WD40 genes in rice and Arabidopsis
A total of 237 putative WD40 homologues in Arabidopsis were extracted in van Nocker and Ludwig (2003) . Among which four genes were not annotated in TAIR10, thus we use 233 AtWD40 proteins in further phylogenetic analysis.
Multiple sequence alignments were performed using Clustal X version1.83 based on the full sequence of WD40 proteins from rice and Arabidopsis with default parameters. An un-rooted neighbor-joining phylogenetic tree  was constructed by generating 1,000 random bootstrap replicates using MEGA 4.
Genome-wide expression analysis of OsWD40 family
Expression profile of OsWD40 gene family in 24 tissues for Minghui 63 was extracted from the Affymetrix rice microarray data from CREP database in our lab http://crep.ncpgr.cn. The microarray data have been submitted into the NCBI Gene Expression Omnibus (GEO) under the accession number of GSE19024 . The developmental stages and organs of the tissues were described in Additional file 4: Table S4. After normalization and variance stabilization, the average signal value of two biological replicates for each sample, except for samples 2, 3, 14, 15 and 16 (three biological replicates and two technical replicates) was used for analysis. Wherever more than one probe set was available for one gene, the higher signal value of the probe sets was used for analysis. For phytohormone treatments, seedlings at trefoil stage were treated with 0.1 mM NAA, GA3 and KT, respectively. Samples were harvested at the time points of 5, 15, 30 and 60 min after treatments. The samples under the same phytohormone treatment of different time points were mixed together.
Expression values of each gene were logarithmized and cluster analyses were performed using R with euclidean distances and hierarchical cluster method of "complete linkage". The expression patterns of OsWD40 genes were estimated and grouped according to the hierarchical cluster. For data analysis, expression level in each of the tissues was compared against the expression in seed using a student-t test. The genes that are up- or down-regulated by more than two-fold and with p values < 0.05 were considered to be differentially expressed. The average expression of more than two biological replicates for each sample was used for analysis.
Identification of correlated genes and network construction
The permutation test was done to determine the optimal threshold of the PCC [51, 52]. We computed the PCCs for all pairwise relationships between the 1000 randomly selected genes in two sets of transcriptomes (expression profiles for two varieties Minghui 63 and Zhenshan 97 in CREP database of our lab, http://crep.ncpgr.cn) comprising a total of 190 microarray experiments. We estimated a nullhypothesis pairwise correlation distribution by independently permuting the components of each gene expression value and recomputing all correlations. The distribution of the PCCs before and after independent random permutation was observed to choose the optimal thresholds.
OsWD40 genes with standard errors greater than 500 were used for further co-expression analysis, in order to exclude the situation that the correlation of expression level was due to the constitutive expression pattern. The correlated genes with PCCs higher than the optimal thresholds were extracted from the CREP database http://crep.ncpgr.cn and considered as the putative co-expression genes. The PCCs of these candidate genes were recalculated for confirmation, and the statistical significance was further determined using a student-t-test.
A visualization tool of Cytoscape was used to construct the co-expression network composed of the OsWD40 genes and their co-expressed genes. We mapped the correlated genes to the network and identified the function of OsWD40-correlated genes in network clusters. GO enrichment was performed by Singular Enrichment Analysis (SEA) tool in agriGO http://bioinfo.cau.edu.cn/agriGO/index.php with default parameters using the rice MSU6.1 genome annotation as the background. Statistical significance was determined using the Fisher's exact test and the Yekutieli multi-test adjustment .
We thank Dr. Weibo Xie for extracting the expression data from the CREP database. We also thank Prof. Chungen Hu for helpful discussion. This research was supported by grants from the National Natural Science Foundation of China (30971551), the State Key Basic Research and Development Plan of China (2007CB108700), the Natural Science Foundation of Hubei Province (Grant No. 2009CDB267), and the Specialized Research Fund for the Doctoral Program of Higher Education (Grant No. 20100146120034). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
- Stirnimann CU, Petsalaki E, Russell RB, Muller CW: WD40 proteins propel cellular networks. Trends Biochem Sci. 2010, 35 (10): 565-574. 10.1016/j.tibs.2010.04.003.View ArticlePubMedGoogle Scholar
- Smith TF, Gaitatzes C, Saxena K, Neer EJ: The WD repeat: a common architecture for diverse functions. Trends Biochem Sci. 1999, 24 (5): 181-185. 10.1016/S0968-0004(99)01384-5.View ArticlePubMedGoogle Scholar
- Chothia C, Hubbard T, Brenner S, Barns H, Murzin A: Protein folds in the all-beta and all-alpha classes. Annu Rev Biophys Biomol Struct. 1997, 26: 597-627. 10.1146/annurev.biophys.26.1.597.View ArticlePubMedGoogle Scholar
- Smith TF: Diversity of WD-repeat proteins. Subcell Biochem. 2008, 48: 20-30. 10.1007/978-0-387-09595-0_3.View ArticlePubMedGoogle Scholar
- van Nocker S, Ludwig P: The WD-repeat protein superfamily in Arabidopsis: conservation and divergence in structure and function. BMC Genomics. 2003, 4 (1): 50-10.1186/1471-2164-4-50.PubMed CentralView ArticlePubMedGoogle Scholar
- Xu C, Min J: Structure and function of WD40 domain proteins. Protein Cell. 2011, 2 (3): 202-214. 10.1007/s13238-011-1018-1.View ArticlePubMedGoogle Scholar
- Rual JF, Venkatesan K, Hao T, Hirozane-Kishikawa T, Dricot A, Li N, Berriz GF, Gibbons FD, Dreze M, Ayivi-Guedehoussou N: Towards a proteome-scale map of the human protein-protein interaction network. Nature. 2005, 437 (7062): 1173-1178. 10.1038/nature04209.View ArticlePubMedGoogle Scholar
- Yu H, Braun P, Yildirim MA, Lemmens I, Venkatesan K, Sahalie J, Hirozane-Kishikawa T, Gebreab F, Li N, Simonis N: High-quality binary protein interaction map of the yeast interactome network. Science. 2008, 322 (5898): 104-110. 10.1126/science.1158684.PubMed CentralView ArticlePubMedGoogle Scholar
- Collins SR, Kemmeren P, Zhao XC, Greenblatt JF, Spencer F, Holstege FC, Weissman JS, Krogan NJ: Toward a comprehensive atlas of the physical interactome of Saccharomyces cerevisiae. Mol Cell Proteomics. 2007, 6 (3): 439-450.View ArticlePubMedGoogle Scholar
- Stelzl U, Worm U, Lalowski M, Haenig C, Brembeck FH, Goehler H, Stroedicke M, Zenkner M, Schoenherr A, Koeppen S: A human protein-protein interaction network: a resource for annotating the proteome. Cell. 2005, 122 (6): 957-968. 10.1016/j.cell.2005.08.029.View ArticlePubMedGoogle Scholar
- Dragon F, Gallagher JE, Compagnone-Post PA, Mitchell BM, Porwancher KA, Wehner KA, Wormsley S, Settlage RE, Shabanowitz J, Osheim Y: A large nucleolar U3 ribonucleoprotein required for 18S ribosomal RNA biogenesis. Nature. 2002, 417 (6892): 967-970. 10.1038/nature00769.View ArticlePubMedGoogle Scholar
- Zhao J, Hyman L, Moore C: Formation of mRNA 3' ends in eukaryotes: mechanism, regulation, and interrelationships with other steps in mRNA synthesis. Microbiol Mol Biol Rev. 1999, 63 (2): 405-445.PubMed CentralPubMedGoogle Scholar
- Ohnacker M, Barabino SM, Preker PJ, Keller W: The WD-repeat protein pfs2p bridges two essential factors within the yeast pre-mRNA 3'-end-processing complex. EMBO J. 2000, 19 (1): 37-47. 10.1093/emboj/19.1.37.PubMed CentralView ArticlePubMedGoogle Scholar
- Dubrovskaya V, Lavigne AC, Davidson I, Acker J, Staub A, Tora L: Distinct domains of hTAFII100 are required for functional interaction with transcription factor TFIIF beta (RAP30) and incorporation into the TFIID complex. EMBO J. 1996, 15 (14): 3702-3712.PubMed CentralPubMedGoogle Scholar
- van der Voorn L, Ploegh HL: The WD-40 repeat. FEBS Lett. 1992, 307 (2): 131-134. 10.1016/0014-5793(92)80751-2.View ArticlePubMedGoogle Scholar
- Ruiz-Garcia AB, Sendra R, Galiana M, Pamblanco M, Perez-Ortin JE, Tordera V: HAT1 and HAT2 proteins are components of a yeast nuclear histone acetyltransferase enzyme specific for free histone H4. J Biol Chem. 1998, 273 (20): 12599-12605. 10.1074/jbc.273.20.12599.View ArticlePubMedGoogle Scholar
- Nayak RR, Kearns M, Spielman RS, Cheung VG: Coexpression network based on natural variation in human gene expression reveals gene interactions and functions. Genome Res. 2009, 19 (11): 1953-1962. 10.1101/gr.097600.109.PubMed CentralView ArticlePubMedGoogle Scholar
- Wang L, Xie W, Chen Y, Tang W, Yang J, Ye R, Liu L, Lin Y, Xu C, Xiao J: A dynamic gene expression atlas covering the entire life cycle of rice. Plant J. 2010, 61 (5): 752-766. 10.1111/j.1365-313X.2009.04100.x.View ArticlePubMedGoogle Scholar
- Cannon SB, Mitra A, Baumgarten A, Young ND, May G: The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana. BMC Plant Biol. 2004, 4: 10-10.1186/1471-2229-4-10.PubMed CentralView ArticlePubMedGoogle Scholar
- Gu Z, Steinmetz LM, Gu X, Scharfe C, Davis RW, Li WH: Role of duplicate genes in genetic robustness against null mutations. Nature. 2003, 421 (6918): 63-66. 10.1038/nature01198.View ArticlePubMedGoogle Scholar
- Moore RC, Purugganan MD: The early stages of duplicate gene evolution. Proc Natl Acad Sci USA. 2003, 100 (26): 15682-15687. 10.1073/pnas.2535513100.PubMed CentralView ArticlePubMedGoogle Scholar
- Luo M, Platten D, Chaudhury A, Peacock WJ, Dennis ES: Expression, imprinting, and evolution of rice homologs of the polycomb group genes. Mol Plant. 2009, 2 (4): 711-723. 10.1093/mp/ssp036.View ArticlePubMedGoogle Scholar
- Gutierrez-Marcos JF, Costa LM, Dal Pra M, Scholten S, Kranz E, Perez P, Dickinson HG: Epigenetic asymmetry of imprinted genes in plant gametes. Nat Genet. 2006, 38 (8): 876-878. 10.1038/ng1828.View ArticlePubMedGoogle Scholar
- Chung T, Wang D, Kim CS, Yadegari R, Larkins BA: Plant SMU-1 and SMU-2 homologues regulate pre-mRNA splicing and multiple aspects of development. Plant Physiol. 2009, 151 (3): 1498-1512. 10.1104/pp.109.141705.PubMed CentralView ArticlePubMedGoogle Scholar
- Gross-Hardt R, Kagi C, Baumann N, Moore JM, Baskar R, Gagliano WB, Jurgens G, Grossniklaus U: LACHESIS restricts gametic cell fate in the female gametophyte of Arabidopsis. PLoS Biol. 2007, 5 (3): e47-10.1371/journal.pbio.0050047.PubMed CentralView ArticlePubMedGoogle Scholar
- Moll C, von Lyncker L, Zimmermann S, Kagi C, Baumann N, Twell D, Grossniklaus U, Gross-Hardt R: CLO/GFA1 and ATO are novel regulators of gametic cell fate in plants. Plant J. 2008, 56 (6): 913-921. 10.1111/j.1365-313X.2008.03650.x.View ArticlePubMedGoogle Scholar
- Carey CC, Strahle JT, Selinger DA, Chandler VL: Mutations in the pale aleurone color1 regulatory gene of the Zea mays anthocyanin pathway have distinct phenotypes relative to the functionally similar TRANSPARENT TESTA GLABRA1 gene in Arabidopsis thaliana. Plant Cell. 2004, 16 (2): 450-464. 10.1105/tpc.018796.PubMed CentralView ArticlePubMedGoogle Scholar
- Morita Y, Saitoh M, Hoshino A, Nitasaka E, Iida S: Isolation of cDNAs for R2R3-MYB, bHLH and WDR transcriptional regulators and identification of c and ca mutations conferring white flowers in the Japanese morning glory. Plant Cell Physiol. 2006, 47 (4): 457-470. 10.1093/pcp/pcj012.View ArticlePubMedGoogle Scholar
- Quattrocchio F, Wing J, van der Woude K, Souer E, de Vetten N, Mol J, Koes R: Molecular analysis of the anthocyanin2 gene of petunia and its role in the evolution of flower color. Plant Cell. 1999, 11 (8): 1433-1444.PubMed CentralView ArticlePubMedGoogle Scholar
- Schwinn K, Venail J, Shang Y, Mackay S, Alm V, Butelli E, Oyama R, Bailey P, Davies K, Martin C: A small family of MYB-regulatory genes controls floral pigmentation intensity and patterning in the genus Antirrhinum. Plant Cell. 2006, 18 (4): 831-851. 10.1105/tpc.105.039255.PubMed CentralView ArticlePubMedGoogle Scholar
- Spelt C, Quattrocchio F, Mol JN, Koes R: Anthocyanin1 of petunia encodes a basic helix-loop-helix protein that directly activates transcription of structural anthocyanin genes. Plant Cell. 2000, 12 (9): 1619-1632.PubMed CentralView ArticlePubMedGoogle Scholar
- de Vetten N, Quattrocchio F, Mol J, Koes R: The an11 locus controlling flower pigmentation in petunia encodes a novel WD-repeat protein conserved in yeast, plants, and animals. Genes Dev. 1997, 11 (11): 1422-1434. 10.1101/gad.11.11.1422.View ArticlePubMedGoogle Scholar
- Walker AR, Davison PA, Bolognesi-Winfield AC, James CM, Srinivasan N, Blundell TL, Esch JJ, Marks MD, Gray JC: The TRANSPARENT TESTA GLABRA1 locus, which regulates trichome differentiation and anthocyanin biosynthesis in Arabidopsis, encodes a WD40 repeat protein. Plant Cell. 1999, 11 (7): 1337-1350.PubMed CentralView ArticlePubMedGoogle Scholar
- Zhang F, Gonzalez A, Zhao M, Payne CT, Lloyd A: A network of redundant bHLH proteins functions in all TTG1-dependent pathways of Arabidopsis. Development. 2003, 130 (20): 4859-4869. 10.1242/dev.00681.View ArticlePubMedGoogle Scholar
- Goff SA, Cone KC, Chandler VL: Functional analysis of the transcriptional activator encoded by the maize B gene: evidence for a direct functional interaction between two classes of regulatory proteins. Genes Dev. 1992, 6 (5): 864-875. 10.1101/gad.6.5.864.View ArticlePubMedGoogle Scholar
- Bernhardt C, Lee MM, Gonzalez A, Zhang F, Lloyd A, Schiefelbein J: The bHLH genes GLABRA3 (GL3) and ENHANCER OF GLABRA3 (EGL3) specify epidermal cell fate in the Arabidopsis root. Development. 2003, 130 (26): 6431-6439. 10.1242/dev.00880.View ArticlePubMedGoogle Scholar
- Quattrocchio F, Wing JF, van der Woude K, Mol JN, Koes R: Analysis of bHLH and MYB domain proteins: species-specific regulatory differences are caused by divergent evolution of target anthocyanin genes. Plant J. 1998, 13 (4): 475-488. 10.1046/j.1365-313X.1998.00046.x.View ArticlePubMedGoogle Scholar
- Liu C, Lu F, Cui X, Cao X: Histone methylation in higher plants. Annu Rev Plant Biol. 2010, 61: 395-420. 10.1146/annurev.arplant.043008.091939.View ArticlePubMedGoogle Scholar
- Li X, Wang X, He K, Ma Y, Su N, He H, Stolc V, Tongprasit W, Jin W, Jiang J: High-resolution mapping of epigenetic modifications of the rice genome uncovers interplay between DNA methylation, histone methylation, and gene expression. Plant Cell. 2008, 20 (2): 259-276. 10.1105/tpc.107.056879.PubMed CentralView ArticlePubMedGoogle Scholar
- Zhou DX, Hu YF: Regulatory Function of Histone Modifications in Controlling Rice Gene Expression and Plant Growth. Rice. 2010, 3 (2-3): 103-111. 10.1007/s12284-010-9045-8.View ArticleGoogle Scholar
- Pontvianne F, Blevins T, Pikaard CS: Arabidopsis Histone Lysine Methyltransferases. Adv Bot Res. 2010, 53: 1-22.PubMed CentralView ArticlePubMedGoogle Scholar
- Zhu J, Jeong JC, Zhu Y, Sokolchik I, Miyazaki S, Zhu JK, Hasegawa PM, Bohnert HJ, Shi H, Yun DJ: Involvement of Arabidopsis HOS15 in histone deacetylation and cold tolerance. Proc Natl Acad Sci USA. 2008, 105 (12): 4945-4950. 10.1073/pnas.0801029105.PubMed CentralView ArticlePubMedGoogle Scholar
- Bouveret R, Schonrock N, Gruissem W, Hennig L: Regulation of flowering time by Arabidopsis MSI1. Development. 2006, 133 (9): 1693-1702. 10.1242/dev.02340.View ArticlePubMedGoogle Scholar
- Hennig L, Taranto P, Walser M, Schonrock N, Gruissem W: Arabidopsis MSI1 is required for epigenetic maintenance of reproductive development. Development. 2003, 130 (12): 2555-2565. 10.1242/dev.00470.View ArticlePubMedGoogle Scholar
- Alexandre C, Moller-Steinbach Y, Schonrock N, Gruissem W, Hennig L: Arabidopsis MSI1 is required for negative regulation of the response to drought stress. Mol Plant. 2009, 2 (4): 675-687. 10.1093/mp/ssp012.View ArticlePubMedGoogle Scholar
- Kater MM, Dreni L, Colombo L: Functional conservation of MADS-box factors controlling floral organ identity in rice and Arabidopsis. J Exp Bot. 2006, 57 (13): 3433-3444. 10.1093/jxb/erl097.View ArticlePubMedGoogle Scholar
- Pazhouhandeh M, Molinier J, Berr A, Genschik P: MSI4/FVE interacts with CUL4-DDB1 and a PRC2-like complex to control epigenetic regulation of flowering time in Arabidopsis. Proc Natl Acad Sci USA. 2011, 108 (8): 3430-3435. 10.1073/pnas.1018242108.PubMed CentralView ArticlePubMedGoogle Scholar
- Gao XC, Liang WQ, Yin CS, Ji SM, Wang HM, Su XA, Guo CC, Kong HZ, Xue HW, Zhang DB: The SEPALLATA-Like Gene OsMADS34 Is Required for Rice Inflorescence and Spikelet Development. Plant Physiol. 2010, 153 (2): 728-740. 10.1104/pp.110.156711.PubMed CentralView ArticlePubMedGoogle Scholar
- Guruprasad K, Reddy BV, Pandit MW: Correlation between stability of a protein and its dipeptide composition: a novel approach for predicting in vivo stability of a protein from its primary sequence. Protein Eng. 1990, 4 (2): 155-161. 10.1093/protein/4.2.155.View ArticlePubMedGoogle Scholar
- Saitou N, Nei M: The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987, 4 (4): 406-425.PubMedGoogle Scholar
- Butte AJ, Tamayo P, Slonim D, Golub TR, Kohane IS: Discovering functional relationships between RNA expression and chemotherapeutic susceptibility using relevance networks. Proc Natl Acad Sci USA. 2000, 97 (22): 12182-12186. 10.1073/pnas.220392197.PubMed CentralView ArticlePubMedGoogle Scholar
- Carter SL, Brechbuhler CM, Griffin M, Bond AT: Gene co-expression network topology provides a framework for molecular characterization of cellular state. Bioinformatics. 2004, 20 (14): 2242-2250. 10.1093/bioinformatics/bth234.View ArticlePubMedGoogle Scholar
- Du Z, Zhou X, Ling Y, Zhang Z, Su Z: agriGO: a GO analysis toolkit for the agricultural community. Nucleic Acids Res. 2010, W64-70. 38 Web ServerGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.