- Research article
- Open Access
MADS-box gene family in rice: genome-wide identification, organization and expression profiling during reproductive development and stress
BMC Genomics volume 8, Article number: 242 (2007)
MADS-box transcription factors, besides being involved in floral organ specification, have also been implicated in several aspects of plant growth and development. In recent years, there have been reports on genomic localization, protein motif structure, phylogenetic relationships, gene structure and expression of the entire MADS-box family in the model plant system, Arabidopsis. Though there have been some studies in rice as well, an analysis of the complete MADS-box family along with a comprehensive expression profiling was still awaited after the completion of rice genome sequencing. Furthermore, owing to the role of MADS-box family in flower development, an analysis involving structure, expression and functional aspects of MADS-box genes in rice and Arabidopsis was required to understand the role of this gene family in reproductive development.
A genome-wide molecular characterization and microarray-based expression profiling of the genes encoding MADS-box transcription factor family in rice is presented. Using a thorough annotation exercise, 75 MADS-box genes have been identified in rice and categorized into MIKCc, MIKC*, Mα, Mβ and Mγ groups based on phylogeny. Chromosomal localization of these genes reveals that 16 MADS-box genes, mostly MIKCc-type, are located within the duplicated segments of the rice genome, whereas most of the M-type genes, 20 in all, seem to have resulted from tandem duplications. Nine members belonging to the Mβ group, which was considered absent in monocots, have also been identified. The expression profiles of all the MADS-box genes have been analyzed under 11 temporal stages of panicle and seed development, three abiotic stress conditions, along with three stages of vegetative development. Transcripts for 31 genes accumulate preferentially in the reproductive phase, of which, 12 genes are specifically expressed in seeds, and six genes show expression specific to panicle development. Differential expression of seven genes under stress conditions is also evident. An attempt has been made to gain insight into plausible functions of rice MADS-box genes by collating the expression data of functionally validated genes in rice and Arabidopsis.
Only a limited number of MADS genes have been functionally validated in rice. A comprehensive annotation and transcriptome profiling undertaken in this investigation adds to our understanding of the involvement of MADS-box family genes during reproductive development and stress in rice and also provides the basis for selection of candidate genes for functional validation studies.
The MADS-box family members, identified initially as floral homeotic genes, are one of the most extensively studied transcription factor genes in plants [1–8]. The word MADS finds its origin from the first letters of its founding members, Mini Chromosome Maintenance 1 (MCM1) of yeast (Saccharomyces cerevisiae) , Agamous (AG) of Arabidopsis (Arabidopsis thaliana) , Deficiens (DEF) of snapdragon (Antirrhinum majus)  and Serum Response Factor (SRF) of humans (Homo sapiens) . MADS-box transcription factors are characterized by the presence of an approximately 60 amino acids DNA binding domain, known as the MADS-box domain, located in the N-terminal region of the protein. The plant-specific MIKC-type MADS-box proteins include three additional domains followed by the M ADS domain, viz. a less-conserved I ntervening region of ~30 amino acids, a moderately conserved K eratin-like domain of ~70 amino acids mainly involved in heterodimerization, and a highly variable C-terminal region of variable length implicated in transcriptional activation and higher-order complex formation [13–15].
The MADS-box family has been divided into two main groups. The type I consists of ARG80/SRF-like genes of animals and fungi, also designated as M-type genes in plants, and type II contains MEF2-like genes of animals and yeast as well as MIKC-type genes of plants. It is proposed that an ancestral duplication before the divergence of plants and animals gave rise to these groups . The MIKC-type genes are also characterized by the presence of K domain that could have evolved after the divergence of these lineages. The type II genes have been categorized into MIKCc- and MIKC*-type based on structural features [17, 18]. The MIKCc genes have been further classified into 14 clades based on phylogeny [19, 20]. Type I genes have also been categorized into M- and N-type based on the protein motifs identified using the MEME search tool  and also as Mα, Mβ, Mγ and Mδ, based on the phylogenetic relationships between MADS-box regions . The Mδ group, however, corresponds to the MIKC* class described in this report and elsewhere .
The most striking feature of the MADS-box gene family is the diverse functions taken up by its members in different aspects of plant growth and development. These include flowering time control, meristem identity, floral organ identity, formation of dehiscence zone, fruit ripening, embryo development as well as development of vegetative organs such as root and leaf [23–27]. Genome-wide identification and phylogenetic analyses of MADS-box genes have revealed 107 and 71 (only 65 of these are listed in The Institute for Genomic Research (TIGR) Rice Pseudomolecule release 4 database) genes in Arabidopsis and rice, respectively [6, 28].
Though a large amount of expression data based on SAGE, microarrays and other high-throughput transcriptome analysis techniques is available in public databases, the studies involving expression of the entire MADS-box family have so far been restricted to northern blot analysis or reverse transcriptase PCR at limited stages of development . Recently, the comparison of expression profiles resulting from a 22 k rice cDNA microarray-based transcriptome analysis of early panicle development in rice was used to implicate three MADS-box genes, OsMADS1, 14 and 15, in panicle branching . The use of high-throughput genome-wide transcriptome analysis provides an insight into changes in the entire transcriptome across a variety of biological conditions. In combination with the whole genome sequence data and comparative expression analysis with genes of known functions, the transcriptomic data can become an initiation point for systematic investigations into structure-function relationships.
With an overall objective to understand regulation of reproductive organ development in indica rice, we have initiated a program on microarray-based expression profiling of transcription factors and signal transduction components. Here, we report a comprehensive account of identification and phylogenetic analysis of 75 members of MADS-box gene family in rice and their expression profiling during 11 stages of panicle and seed development along with three abiotic stress conditions and 3 stages of vegetative development. This analysis is based on TIGR Rice Pseudomolecule release 4 and KOME (Knowledge-based Oryza Molecular biological Encyclopedia) rice full-length cDNA database. We have identified 10 new members belonging to this gene family besides confirming 65 previously identified genes. Out of 71 previously identified genes by Nam and coworkers , six were not found in version 4 of TIGR. Our analysis also suggests the existence of Mβ-type genes in rice, which was earlier thought to be absent in monocots . The results of expression profiling have been discussed in light of phylogenetic relatedness of the genes and their known functions in rice as well as other systems.
Identification, organization and structure of MADS-box genes
HMM analysis and name search resulted in the identification of 75 MADS-box genes in rice genome. Since, gene names from OsMADS1 to OsMADS58 representing 34 genes already existed in the literature (though not in continuation), newly identified genes were named from OsMADS59 to OsMADS99 (see Table 1). The individual genes were localized on chromosomes based on the 5' and 3' coordinates for respective gene models in TIGR database (Figure 1). The maximum number of genes (16; 21%) were found to be localized on chromosome 1, whereas, chromosomes 10 and 11 had only one MADS-box gene each. Out of five types of MADS-box genes, the Mγ genes were confined to chromosome 1, 3 and 4, while Mβ genes were present only on chromosome 1. No chromosomal bias was observed in the distribution of MIKCc, MIKC* and Mα genes (Figure 1). Analysis of the TIGR rice segmental duplication database revealed 30 MADS-box genes within the duplicated segments of rice chromosomes. Only 16 genes, however, were found to have their counterparts on duplicated segments (Figure 1). Most of the duplicated genes belonged to the MIKCc group. Several MADS-box genes, especially M-type, were also found juxtaposed on chromosomes 1, 4, 5, 6, 8 and 12. Twenty such genes showed significant sequence identity.
Similar to that reported in Arabidopsis, distribution of introns in rice MADS-box family genes was also found bimodal with MIKCc and MIKC* genes containing multiple introns and the Mα, Mβ and Mγ genes usually having no or occasionally up to 4 introns (see Table 1; ). The length of MADS-box proteins varied from 150 to 300 amino acids, with few exceptionally longer or smaller proteins (Table 1). For details on other parameters of nucleic acid and protein sequences, refer to Table 1.
Evolutionary relationships between rice and Arabidopsis MADS-box family genes
To examine the evolutionary relationships of MADS-box genes in rice (including the 10 new genes identified in this study) and Arabidopsis, a tree was constructed using only the conserved MADS-box domain. Five groups, as described by Parenicova and coworkers  were identified containing representative genes of both rice and Arabidopsis. All the Arabidopsis proteins were found to lie in groups similar to those identified previously , except AGL47 and AGL82, which instead of forming a basal branch of the Mβ, grouped with Mγ proteins in our analysis as shown in supplementary figure S1 [see Additional file 1]. OsMADS64 grouped separately with AGL33 of Arabidopsis, which does not cluster with any of the MADS groups described above [see Additional file 1].
A separate phylogenetic tree was also generated from complete protein sequences of all the MADS-box genes in rice and Arabidopsis (Figure 2). Of the 75 rice MADS-box genes, 38 grouped with MIKCc, six with MIKC*, nine with Mβ, 13 with Mα and 10 grouped with Mγ-type Arabidopsis genes. In case of M-type genes, barring OsMADS90, 91 and 96, all other genes exhibited similar groupings as in the MADS-domain-specific phylogenetic analysis (Figure 2). MIKCc proteins were further divided into 14 clades. Representatives of both rice and Arabidopsis could be identified in all the clades except OsMADS32 and the FLC clade, which are exclusive to rice and Arabidopsis, respectively (Figure 2). Phylogenetic trees with bootstrap values, for complete protein sequences as well as coding sequences of rice and Arabidopsis MADS-box genes have been shown as supplementary figures S2 and S3, respectively [see Additional files 2 and 3].
Distribution of conserved motifs
The MEME motif search tool was employed to identify the conserved motifs present in MADS-box proteins (Figure 3). Motifs 1, 6 or 2 specifying the MADS domain were found in all the members of the MADS-box family. All proteins belonging to MIKCc and MIKC* groups had motif 1-type MADS domain. Most Mα proteins also had the motif 1-type MADS domain except OsMADS72 and 77 which contained motif 6. Motif 6 was found to be the most common type of MADS domain in Mβ-type proteins. Distinctively, in case of Mγ proteins a larger MADS domain of 83 amino acids was detected, followed by a coiled coil region and a region of unknown complexity as indicated by Simple Modular Architecture Research Tool (SMART) version 3.4. Together these components formed a larger conserved region of 200 amino acids denoted as motif 2. The conserved motifs 4, 3 and 7, specifying K domain, were found mainly in the MIKCc group members. Of the 38 MIKCc proteins, 31 had all three conserved regions, whereas, motif 7 was absent in case of OsMADS23 and 57; motif 4 in OsMADS22 and 55; and the complete K domain was missing in OsMADS67, 59 and 66. The partial K domain was also detected in some of the non-MIKCc proteins including OsMADS89 and 86 of Mγ, OsMADS68 and 62 of MIKC*, OsMADS63 and OsMADS69 of Mα group.
Certain unknown motifs, numbered 5, 11, 15, 16, 17 and 20, besides the MADS (motif 1) and the K (motifs 3, 4, and 7) domains in MIKCc proteins, were also revealed by MEME motif search. Motif 5 corresponded to the intervening region between the MADS and the K domains, whereas, the rest were found to be distributed in the C-terminal regions. The MADS domain in five M-type proteins, OsMADS88, 99, 96, 93 and 72, was found to be preceded by an N-terminal domain (motif 9). In Mβ proteins, the MADS domain was represented by motif 6 except in OsMADS98 and 92. In addition, other motifs, viz. 5, 8, 10, 12, 14, 16, 17, 19 and 20 were also detected, of which motifs 8, 10, 12, 14 and 19 were exclusive to the Mβ class of proteins. The sequences and lengths of all the motifs are given in supplementary table S1 [see Additional file 4].
Expression profiling of MADS-box genes during vegetative and reproductive development and stress
For expression analysis, the rice panicle and seed development stages were divided into 6 and 5 broad categories, respectively, based on landmark developmental events as described by Itoh and coworkers ; information available at oryzabase  and our preliminary histochemical analysis (data not shown; Table 2). Seedlings subjected to three stress conditions, viz. desiccation, cold and salt stress, were also included in this analysis. Transcriptome profiling of these stages along with mature leaf, root and 7-day-old seedlings was carried out by using GeneChip® Rice Genome Arrays. The raw data from 51 chips representing three biological replicates each from 17 samples was normalized by G ene C hip R obust M ultiarray A nalysis (GCRMA) algorithm . Since five tandemly duplicated Mγ genes OsMADS81, 82, 83, 85 and 99 showed very high sequence identity (94.6 to 98.6%), the unique probe sets for these genes were not available on the GeneChip®. Therefore, a cumulative expression profile for these genes is presented (Figure 4). Expression profiles for OsMADS78 and 79, which were also not represented on the chip, were studied using QPCR (quantitative real-time PCR, Figure 4). The primers used in this study are listed in supplementary table S2 [see Additional file 5]. The initial analysis revealed all but one (OsMADS80) gene to be expressing in at least one of the experimental stages analyzed (Figure 4).
Based on expression profiles during panicle, seed and vegetative development, MADS-box genes were classified into eight groups. Figure 5 shows mean expression profiles for each of the groups. Group I consisted of 12 genes (OsMADS2, 14, 15, 16, 18, 22, 30, 56, 60, 64, 65 and 77) most of which showed high transcript accumulation in all the stages analyzed. However, eight genes (OsMADS1, 4. 5, 6, 7, 8, 17 and 58) of group II showed high expression preferentially in panicle and seed with more than 100 to 500 folds expression in majority of the reproductive tissues in comparison to mature leaf. OsMADS7 and 8 were the most highly expressed genes in panicle and seed stages with more than 1000-fold transcript accumulation in S1 stage of seed development. Group III comprised of the genes OsMADS20, 32, 34 and 72, which showed high expression during early stages of panicle development. The expression declined gradually as the panicles matured. In contrast, the expression of group IV genes (OsMADS3 and 13) increased with the development of the panicles and continued to increase during stages of seed development; the highest expression being 10 and 100 folds for OsMADS3 and 13, respectively. Five genes, OsMADS21, 29, 71, 78 and 79 (group V), expressed preferentially during seed development. The expression of OsMADS21 and 29 was more than 100 folds in S1-S3 stages, whereas, the expression of OsMADS71 increased up to 15 folds in S3 to S5 stages. OsMADS78 and 79 transcripts showed S3-S4 stage-specific accumulation (Figure 4). Group VI comprised of five genes, OsMADS26, 27, 47, 55 and 57 that expressed predominantly in vegetative tissues. The maximum expression of OsMADS57 was observed in mature leaves, whereas, OsMADS27 showed higher expression in roots as well. Three genes (OsMADS31, 63 and 66) that constituted Group VII showed low levels of expression in most stages of panicle and seed development. The peak expression for these genes was observed either in P6 (OsMADS63), S1/S2 (OsMADS31) or S3 (OsMADS66) stages. The remaining genes (mainly M-type) showed very low levels of expression and were clustered together as group VIII (Figure 4). For normalized expression values and calculated fold change values with respect to mature leaf refer to supplementary tables S3 and S4 [see Additional file 6 and 7].
Eight genes showing discrete expression patterns were selected for validation by QPCR analysis. Figure 6 shows a comparison of the QPCR and microarray analysis. The expression patterns obtained for all eight genes using QPCR were similar to that derived from the microarrays. Moreover, most of the characterized genes showed similar expression patterns to that already described in the literature, which strengthens the reliability of the data obtained by using microarrays Expression levels of four MADS-box genes (OsMADS18, 22, 26 and 27) were up regulated by more than two folds in response to cold and dehydration stress treatments (Figure 7). Three genes (OsMADS2, 30 and 55) showed more than 2-fold down regulation in response to dehydration and salt stress. The fold change values with respect to seedlings are given in supplementary table S5 [see Additional file 8]
Expression profiles of putative orthologs of Arabidopsis MIKCc-type genes in rice
Comparative expression profiles of phylogenetically related MIKCc class genes in rice and Arabidopsis is shown in figure 8. These genes have been considered orthologs in Arabidopsis and rice [34–36]. In GLO-like clade, the expression of PI and OsMADS4 was found to be restricted to reproductive tissues, whereas, OsMADS2 showed significant expression in seedlings as well. The expression of another B-class gene, AP3, was also found to be specific to reproductive tissues in Arabidopsis; however its rice ortholog, OsMADS16, also expressed in vegetative tissues besides showing peak expression in reproductive tissues. The expression of AG was found to be restricted to the stages of floral development and initial stages of seed development in Arabidopsis. In rice, its putative orthologs OsMADS3 and OsMADS58 were found to have comparable expression profiles to that of AG, with OsMADS3 showing relatively low level transcript accumulation. In AG clade, the expression profiles of an Arabidopsis D-class gene, AGL11 and its rice counterpart OsMADS13 showing 53% identity at amino acid level were also comparable. SUPPRESSOR OF CONSTANS1 (SOC1) and its putative ortholog OsMADS50 in ricewere found to exhibit low level ubiquitous expression in the stages analyzed. The members of AGL2-like clade, OsMADS7/45 and OsMADS8/24 are duplicated genes with high level of sequence homology to AGL2/SEP2 and AGL14/SEP1, respectively. These genes, along with OsMADS1, 5 and 34 were found to exhibit similar expression patterns as those of SEP genes. In SQUA-like clade, AP1 gene of Arabidopsis and its putative rice orthologs OsMADS14 (RAP1B), 15(RAP1A) and OsMADS18 show high sequence similarity. In reproductive tissues, the expression profiles of OsMADS14, 15 and 18 were found to be very similar to that of AP1, but unlike AP1, the rice genes also expressed in mature leaves.
In AGL6-like clade, duplicated genes, OsMADS6 and OsMADS17, exhibit similar expression patterns as that of AGL6. Both the genes show more than 50% identity with AGL6 at amino acid level suggesting that these could be putative orthologs of AGL6 in rice.
Expression profiles of duplicated genes
Analysis of the TIGR rice segmental duplication database revealed 19 MADS-box genes that were localized on the duplicated segments of the rice genome. A comparison of expression profiles of the duplicated gene pairs, as obtained from microarray data, revealed that except for 3 gene pairs, viz. OsMADS7:8, OsMADS3:58 and OsMADS2:4; expression patterns of other duplicated genes had diverged significantly (Figure 9).
Chromosome 1 was found to have four tandemly duplicated genes within a 12 kb region. Two groups of tandemly duplicated genes with three genes each were localized on chromosome 4. Incidentally, this region overlapped with an intra-chromosomal duplicated segment suggesting that these six genes probably evolved from a single ancestral gene by a combination of segmental and tandem duplication events. All four tandemly duplicated MIKCc-type genes showed varied expression patterns. OsMADS13 and 33 as well as OsMADS14 and 34, although fulfilling our selection criteria, were not considered as being tandemly duplicated because of the high level of sequence divergence, which was evident from their placement in different clades of MIKCc-type genes.
Involvement of MADS-box genes in panicle and seed development in rice
For over a decade, investigations leading to the understanding of genetic and molecular basis of floral development in model eudicots, Arabidopsis and Antirrhinum, have revealed involvement of a number of MADS-box genes in specifying floral organ identity [37, 38]. Attempts have been made to predict the function of MADS-box genes in diverse species based on sequence similarities [24, 39]. However, identification of additional paralogs with very similar sequences and existence of duplicated genes with different expression patterns made it difficult to predict the function based on sequence data alone. Similarity in temporal and spatial expression patterns in combination with the sequence comparisons, however, was found to be a better criterion for establishing orthologous relationships. In this paper, we have presented a comprehensive expression profiling for all the MADS-box genes in rice along with an account of their phylogenetic relationships with the Arabidopsis genes.
Of the 75 genes analyzed in this study, more than 20 were found to exhibit either specific or preferential transcript accumulation during stages of panicle and seed development. Some of these genes have already been characterized as orthologs of Arabidopsis ABCDE class genes, viz.OsMADS14 and 15 are APETALA1 orthologs; OsMADS2 and 4 are PISTILLATA orthologs; OsMADS16 is an APETALA3 ortholog; OsMADS3 and 58 are AGAMOUS orthologs; D class gene, OsMADS13 is putative ortholog of AGL11; OsMADS7 and 8 are orthologous to SEPALLATA2 and 1, respectively [40–48]. Most of the putative orthologous genes in rice and Arabidopsis exhibit similar expression patterns (Figure 8). It was, however, observed that the rice counterparts had a general tendency to express in vegetative organs as well, whereas, the expression of Arabidopsis genes was restricted to reproductive tissues. From the expression data and per cent identity, it seems that duplicated genes, OsMADS6 and OsMADS17 (AGL6-like clade) are orthologous to AGL6. Further experimentation would be required to verify if these genes have similar functions as well.
In addition to some of the well characterized genes described above, there are several others, e.g. OsMADS34, 32, 20, 72, 63, 98, 89, 92 and 86, that show specific up-regulation in panicles but have not yet been functionally validated in rice. This list also includes MIKC*- and M-type genes along with MIKCc genes. Most of the functionally validated MADS-box family genes belong to the MIKCc class, while functions of most of the M-type genes are not yet known in any system. Therefore, this study provides a solid base to select genes for functional validation.
In 2003, Parenicova and coworkers showed that 64 of the 109 Arabidopsis MADS-box genes expressed in siliques . Later, by using high-density transcription factor filter arrays, almost all the MADS-box genes were found to express during silique development . These results suggested that besides being involved in the development of flowers, the MADS-box gene family could be involved in the process of seed development as well. In rice, we have also found (with the exception of OsMADS80)that transcripts for almost all the MADS-box genes are expressed in at least one of the seed development stages analyzed. Interestingly, the highest expression values for OsMADS13, 21, 23, 29, 71, 75, 78 and 79 were observed for seed stages, suggesting that these genes could be involved in development of seeds. Four of these genes belong to type II (MIKCc group), whereas, the remainder are type I (Mα-type). Since, two of the type I (Mγ) genes, AGL80 and PHERES, have previously been implicated in seed development, it might be interesting to investigate the role of other type-I genes showing up-regulation during development of seeds [50, 51].
Rice MIKCc genes
The MIKCc genes have been sub-grouped into 13 clades in Arabidopsis. Representatives of all but the FLC clade were also found in rice. Six genes belonging to the FLC clade have been implicated in control of flowering by vernalization and autonomous pathways in Arabidopsis. Since rice does not require vernalization for flowering, this clade has been suggested to be lost in rice . Recently, Zhao and coworkers (2006) have reported a new monocot-specific clade, OsMADS32-like clade, consisting of OsMADS32 of rice and TaAGL14 and 15 of wheat . The expression of TaAGL14 and 15 was detected in most vegetative stages along with inflorescence and seeds. In contrast, the OsMADS32 transcripts were found to be restricted to early stages of panicle and late seed development, suggesting that the OsMADS32-like clade might have evolved to cater for diverse monocot-specific functions. A comparison of phylogenetic relationships and expression profiles between rice and Arabidopsis MADS-box genes suggests that although most of the basic ABCDE functions have been retained in rice, acquisition of new functions and subfunctionalization of existing gene functions is also apparent.
Mβ-like genes are represented in rice
In earlier studies, no gene of rice could be assigned to the Mβ group of M-type MADS-box genes, hence it was suggested that probably Mβ genes have not been retained in the rice genome [6, 28]. In this study, we have identified nine new genes that grouped with Arabidopsis Mβ type genes. Although bootstrap values are low, separation of this clade from the rest of M-type genes of rice and the presence of conserved motifs in Arabidopsis Mβ protein and the newly identified group is suggestive of the existence of Mβ group in rice as well.
Duplication seems to have played major role in diversification of MADS-box family of genes
Arabidopsis has been reported to have 107 MADS-box genes . However, in rice that has a genome size almost three times as that of Arabidopsis [52, 53], the number of MADS-box genes was found to be only 75. The reason for this could be the variable status of whole genome duplications in Arabidopsis and rice [54, 55]. Surprisingly, however, the number of MIKCc-type genes in both Arabidopsis and rice was found to be similar at 39 and 38, respectively. Therefore, the difference in the total number is mainly due to the variation in the number of M-type genes, which are 37 in rice and 68 in Arabidopsis. It seems that duplication events have contributed significantly towards evolution of M-type genes. Our analysis revealed 16 M-type genes, which could have originated because of tandem duplications (Figure 1). Phylogenetic analysis suggests that rice and Arabidopsis Mγ genes probably had a common ancestor and the expansion occurred independently after divergence of monocots and dicots.
MADS-box genes seem to have evolved mainly through gene duplication events followed by neofunctionalization, subfunctionalization or in some cases pseudogenization of the duplicated gene . However, redundancy being one of the fates of duplication is also common in MADS-box family. We found 30 MADS-box genes lying on segmental duplicated regions of rice chromosomes while only 16 were found to have been retained, suggesting that considerable changes may have taken place following segmental duplication leading to loss of some of the genes. Except one, all paralogous gene pairs belong to MIKCc-type of MADS-box family. Expression data show that most of these duplicated genes have divergent expression patterns that may be because they have undergone neofunctionalization or subfunctionalization, though sufficient experimentation is required to prove this hypothesis. Interestingly, three genes, viz. OsMADS6, 17 and 56, lying on duplicated segments of chromosomes 2, 4 and 10, respectively, show collinearity in gene order. On the other hand, OsMADS50 lying on chromosome 3 shows synteny with only one of these genes, i.e. OsMADS17. They may all have resulted due to duplication of a segment on chromosome 4, but thereafter, evolution of these genes may have been quite independent resulting in loss of micro-collinearity between the duplicated regions.
Stress responsive MADS-box genes in rice
MADS-box genes have been shown to be affected by low temperature stress in tomato  and by application of hormones like cytokinins, gibberellins , ethylene  and auxins  in other plants. Seven MADS-box genes exhibited differential expression in response to cold, salt and/or desiccation stress in rice. So far, none of these genes has been implicated in stress response. Amongst stress-induced genes, OsMADS18 is a member of AP1/SQUA group that has been shown to express widely during development with its transcripts accumulating at high levels specifically in meristematic tissues . It has been shown to interact with OsMADS6, 8/24, 7/45, and 47 [45, 61] and in our analysis its expression pattern was found to overlap with those of OsMADS6, 8 and 7 in reproductive tissues and with OsMADS47 during vegetative development, suggesting that it might be interacting with different partners during reproductive development and stress. Recently, Tardif and coworkers showed that a large number of genes involved in flower development are associated with abiotic stress responses in wheat . Our preliminary analysis involving transcript profiling during reproductive development and abiotic stress conditions has also revealed approximately 400 genes that are up regulated during panicle/seed development and three stress conditions, viz. cold, salt, and dehydration (unpublished data). It would be, therefore, interesting to undertake specific investigations, which could establish the interactions of biochemical pathways that are activated during reproductive development and stress response.
Contribution of MADS-box gene family in flower organ specification is well documented in eudicots; however, functions of many gene members of this class have not been elucidated in rice. A comparison of phylogenetic relationships and expression profiles between rice and Arabidopsis MADS-box genes suggests that although most of the basic ABCDE functions have been retained in rice, acquisition of new functions and subfunctionalization of existing gene functions is also not uncommon. Furthermore, the role of MADS-box transcription factors in seed development and during stress response also needs to be explored. The new information generated is expected to help in selection of appropriate candidate genes for further functional characterization.
Identification of genes, nomenclature and mapping on chromosomes
Name Search and Hidden Markov Model (HMM) were employed to identify the MADS-box genes from rice genome. MADS-box sequences available for all land plants were downloaded from SWISSPROT and TrEMBL  and their HMM profile was generated using HMMER 2.1.1 software package [64, 65]. This profile was used to search the complete proteome of rice available in TIGR  and KOME [67, 68] databases using Basic Local Alignment Search Tool (BLAST; ) with filter off. Name search using MADS, SRF, AGAMOUS and AP1 as keywords in these databases helped in identification of more genes, which could not be identified using HMM profile due to the presence of incomplete MADS-box. Redundant sequences were removed by aligning the protein sequences using ClustalX 1.83  and checking their genomic locus in TIGR. Motif scan was performed using SMART [71, 72] or National Center for Biotechnology Information Conserved Domain Database (NCBI-CD; ) searches with filter off. According to already available nomenclature, 34 MADS-box genes have been named from OsMADS1 to 58. Thus, the newly identified genes were named from OsMADS59 to 99. MADS-box genes were mapped on chromosomes by identifying their chromosomal position given in the TIGR rice database. Information regarding ORF length, amino acids number, molecular weight and isoelectric point of protein was downloaded from TIGR release 4. For OsMADS3 and 20, Gene Runner program version 3.04 was employed to find molecular weight and PI of protein, as it was not available in TIGR.
To identify the number of groups formed by rice MADS-box genes in comparison to Arabidopsis, MADS-box domain comprising of 60 amino acids, identified by SMART from all the MADS-box sequences of Arabidopsis and rice were aligned using ClustalX (version 1.83) program. An un-rooted neighbor-joining (NJ; ) phylogenetic tree was constructed in ClustalX with default parameters. Separate phylogenetic trees were constructed using complete protein sequence and coding sequences of rice and Arabidopsis MADS-box genes. Bootstrap analysis was performed using 1000 replicates. The trees thus obtained were viewed using TREEVIEW software .
Sequence and duplication analysis
To identify the conserved motifs, MEME version 2.2  was employed using following parameters; number of repetitions – any, maximum number of motifs – 20, optimum motif width set to ≥ 6 and ≤ 200. The motifs obtained were annotated using SMART and NCBI-CD search program.
Further, MADS-box gene duplications were mapped on segmental duplications database of TIGR with 100 kb as well as 500 kb distance allowed between collinear genes . For finding tandemly duplicated candidates, genes with intergenic distance not more than 20 kb and having fair degree of overall homology between them were selected. Identity among duplicated genes was calculated using DNASTAR MegAlign 4.03 package.
Collection of plant material
Oryza sativa indica var. IR64 tillers spanning all stages of panicle and seed development were collected from field grown rice. Mature leaves were also harvested from same plants. For all the stages, three biological replicates were harvested from independent populations of plants. After harvesting, panicle and seed samples were frozen in liquid nitrogen and stored at -70°C. For stress treatment, rice seeds were surface-sterilized with 0.1% HgCl2 and soaked in RO (reverse osmosis) water overnight in dark. Next day, the seeds were spread on a meshed float and grown hydroponically at 28 ± 1°C in culture room conditions. After 7 days of growth, the seedlings were transferred to 100 ml beaker for treatment. For salt stress, sodium chloride was used at final concentration of 200 mM for 3 hours. For cold stress, seedlings were kept at 4°C for 3 hours. Desiccation stress was simulated by drying the plants on tissue paper and spreading them on Whatmann 3 mm sheet for 3 hours. The seedlings with their roots kept in water, for 3 hours duration, were used as control.
Affymetrix GeneChip® Rice Genome Arrays representing 49,824 transcripts (48,564 of japonica and 1,260 of indica) have been employed to study the transcriptome profiles of MADS-box genes during reproductive organ development and stress response in rice. Total RNA was isolated from all the tissues, except seeds, using TRIzol method (Invitrogen Inc., USA; ). Due to high carbohydrate content, RNA from seed samples was isolated using the method described earlier . After checking the quality on agarose formaldehyde or TAE gels, the RNA samples were quantified using nanodrop (ND-1000 Spectrophotometer). Five micrograms of RNA with 260:280 ratios of 1.9–2.0 and 260:230 ratios more than 2.0 was used for cDNA synthesis. Labeling and hybridizations were carried out according to Affymetrix manual for one-cycle target labeling (Affymetrix, Santa Clara, CA). Hybridization was performed in GeneChip® Hybridization Oven 640 for 16 hours at 45°C and 60 rpm. GeneChips were washed and stained with streptavidin-phycoerythrin using the fluidics protocol EukGE_WS2V5_450 in Affymetrix fluidic station model 450. Finally, chips were scanned using the GeneChip® Scanner 3000.
Digital expression analysis
The expression data for Arabidopsis, using Affymetrix GeneChip® ATH1 Genome Array, from stages comparable to those used for rice was obtained from Gene Expression Omnibus (GEO) database at the NCBI under the series accession numbers GSE5620, GSE5621, GSE5623, GSE5624, GSE5629, GSE5630, GSE5631, GSE5632 and GSE5634. Total of 55 CEL files representing 21 stages of development as well as stress treatments were downloaded from  and analyzed by using avadis™ microarray data analysis software version 4.2 . The data was normalized using GCRMA followed by log transformation and average calculation. Heat Map was generated for selected genes.
Microarray data analysis
CEL files generated in GeneChip Operating Software (GCOS) were further analyzed using avadis™. Data were normalized using GCRMA algorithm and log transformed. To get the expression values, averages of three biological replicates were used. The expression data for MADS-box genes was extracted by using name search and the gene IDs listed in table 1. Wherever more than one probe set was available for one gene, the probe set designed from 3' end was given preference. Cluster analysis on rows was performed using Euclidean distance metric, and Ward's Linkage rule of Hierarchical clustering. Differential expression analysis was performed taking mature leaf as reference to identify genes expressing more than two folds in panicle and seed, with p values <0.005. Similarly, for identifying stress-induced genes, differential expression analysis was performed with no correction applied and p values less than 0.05. Further K-means clustering was performed to identify the expression patterns shown by genes expressing in panicle and seed. Since 73 genes (69 probe sets) are represented on chip, expression profiles of OsMADS78 and 79 were studied using QPCR. Raw microarray data have been deposited in the Gene Expression Omnibus database at the National Center for Biotechnology Information under the series accession numbers GSE6893 and GSE6901.
Real time PCR reactions were carried out using the same RNA samples, which were used for microarrays as described earlier . In brief, primers were designed for all the genes preferentially from 3' end of the gene using PRIMER EXPRESS version 2.0 (PE Applied Biosystems, USA) with default parameters. Each primer was checked using BLAST tool of NCBI database with filter off for its specificity for respective gene, which was further confirmed by dissociation curve analysis obtained after the PCR reaction. First strand cDNA was synthesized by reverse transcription using 4 μg of total RNA in 100 μl of reaction volume using high-capacity cDNA Archive kit (Applied Biosystems, USA). Diluted cDNA samples were used for Real time PCR analysis with 200 nM of each primer mixed with SYBR Green PCR master as per manufacturer's instructions. The reaction was carried out in 96-well optical reaction plates (Applied Biosystems, USA), using ABI Prism 7000 Sequence Detection System and software (PE Applied Biosystems, USA). To normalize the variance among samples, Actin was used as endogenous control. Relative expression values were calculated after normalizing against the maximum expression value. These data were further normalized to ease the profile matching to that obtained from microarrays.
de Folter S, Angenent GC: trans meets cis in MADS science. Trends Plant Sci. 2006, 11 (5): 224-231. 10.1016/j.tplants.2006.03.008.
Jack T: Plant development going MADS. Plant Mol Biol. 2001, 46 (5): 515-520. 10.1023/A:1010689126632.
Kaufmann K, Melzer R, Theissen G: MIKC-type MADS-domain proteins: structural modularity, protein interactions and network evolution in land plants. Gene. 2005, 347 (2): 183-198. 10.1016/j.gene.2004.12.014.
Nam J, dePamphilis CW, Ma H, Nei M: Antiquity and evolution of the MADS-box gene family controlling flower development in plants. Mol Biol Evol. 2003, 20 (9): 1435-1447. 10.1093/molbev/msg152.
Ng M, Yanofsky MF: Function and evolution of the plant MADS-box gene family. Nat Rev Genet. 2001, 2 (3): 186-195. 10.1038/35056041.
Parenicova L, de Folter S, Kieffer M, Horner DS, Favalli C, Busscher J, Cook HE, Ingram RM, Kater MM, Davies B, Angenent GC, Colombo L: Molecular and phylogenetic analyses of the complete MADS-box transcription factor family in Arabidopsis: new openings to the MADS world. Plant Cell. 2003, 15 (7): 1538-1551. 10.1105/tpc.011544.
Purugganan MD: The MADS-box floral homeotic gene lineages predate the origin of seed plants: phylogenetic and molecular clock estimates. J Mol Evol. 1997, 45 (4): 392-396. 10.1007/PL00006244.
Theissen G, Becker A, Di Rosa A, Kanno A, Kim JT, Munster T, Winter KU, Saedler H: A short history of MADS-box genes in plants. Plant Mol Biol. 2000, 42 (1): 115-149. 10.1023/A:1006332105728.
Passmore S, Maine GT, Elble R, Christ C, Tye BK: Saccharomyces cerevisiae protein involved in plasmid maintenance is necessary for mating of MAT alpha cells. J Mol Biol. 1988, 204 (3): 593-606. 10.1016/0022-2836(88)90358-0.
Yanofsky MF, Ma H, Bowman JL, Drews GN, Feldmann KA, Meyerowitz EM: The protein encoded by the Arabidopsis homeotic gene agamous resembles transcription factors. Nature. 1990, 346 (6279): 35-39. 10.1038/346035a0.
Sommer H, Beltran JP, Huijser P, Pape H, Lonnig WE, Saedler H, Schwarz-Sommer Z: Deficiens, a homeotic gene involved in the control of flower morphogenesis in Antirrhinum majus: the protein shows homology to transcription factors. EMBO J. 1990, 9 (3): 605-613.
Norman C, Runswick M, Pollock R, Treisman R: Isolation and properties of cDNA clones encoding SRF, a transcription factor that binds to the c-fos serum response element. Cell. 1988, 55 (6): 989-1003. 10.1016/0092-8674(88)90244-9.
Yang Y, Fanning L, Jack T: The K domain mediates heterodimerization of the Arabidopsis floral organ identity proteins, APETALA3 and PISTILLATA. Plant J. 2003, 33 (1): 47-59. 10.1046/j.0960-7412.2003.01473.x.
Cho S, Jang S, Chae S, Chung KM, Moon YH, An G, Jang SK: Analysis of the C-terminal region of Arabidopsis thaliana APETALA1 as a transcription activation domain. Plant Mol Biol. 1999, 40 (3): 419-429. 10.1023/A:1006273127067.
Egea-Cortines M, Saedler H, Sommer H: Ternary complex formation between the MADS-box proteins SQUAMOSA, DEFICIENS and GLOBOSA is involved in the control of floral architecture in Antirrhinum majus. EMBO J. 1999, 18 (19): 5370-5379. 10.1093/emboj/18.19.5370.
Alvarez-Buylla ER, Pelaz S, Liljegren SJ, Gold SE, Burgeff C, Ditta GS, Ribas de Pouplana L, Martinez-Castilla L, Yanofsky MF: An ancestral MADS-box gene duplication occurred before the divergence of plants and animals. Proc Natl Acad Sci U S A. 2000, 97 (10): 5328-5333. 10.1073/pnas.97.10.5328.
Henschel K, Kofuji R, Hasebe M, Saedler H, Munster T, Theissen G: Two ancient classes of MIKC-type MADS-box genes are present in the moss Physcomitrella patens. Mol Biol Evol. 2002, 19 (6): 801-814.
Kofuji R, Sumikawa N, Yamasaki M, Kondo K, Ueda K, Ito M, Hasebe M: Evolution and divergence of the MADS-box gene family based on genome-wide expression analyses. Mol Biol Evol. 2003, 20 (12): 1963-1977. 10.1093/molbev/msg216.
Becker A, Theissen G: The major clades of MADS-box genes and their role in the development and evolution of flowering plants. Mol Phylogenet Evol. 2003, 29 (3): 464-489. 10.1016/S1055-7903(03)00207-0.
Zhao T, Ni Z, Dai Y, Yao Y, Nie X, Sun Q: Characterization and expression of 42 MADS-box genes in wheat (Triticum aestivum L.). Mol Genet Genomics. 2006, 276 (4): 334-350. 10.1007/s00438-006-0147-3.
De Bodt S, Raes J, Florquin K, Rombauts S, Rouze P, Theissen G, Van de Peer Y: Genomewide structural annotation and evolutionary analysis of the type I MADS-box genes in plants. J Mol Evol. 2003, 56 (5): 573-586. 10.1007/s00239-002-2426-x.
De Bodt S, Raes J, Van de Peer Y, Theissen G: And then there were many: MADS goes genomic. Trends Plant Sci. 2003, 8 (10): 475-483. 10.1016/j.tplants.2003.09.006.
Alvarez-Buylla ER, Liljegren SJ, Pelaz S, Gold SE, Burgeff C, Ditta GS, Vergara-Silva F, Yanofsky MF: MADS-box gene evolution beyond flowers: expression in pollen, endosperm, guard cells, roots and trichomes. Plant J. 2000, 24 (4): 457-466. 10.1046/j.1365-313x.2000.00891.x.
Riechmann JL, Meyerowitz EM: MADS domain proteins in plant development. Biol Chem. 1997, 378 (10): 1079-1101.
Rounsley SD, Ditta GS, Yanofsky MF: Diverse roles for MADS box genes in Arabidopsis development. Plant Cell. 1995, 7 (8): 1259-1269. 10.1105/tpc.7.8.1259.
Saedler H, Becker A, Winter KU, Kirchner C, Theissen G: MADS-box genes are involved in floral development and evolution. Acta Biochim Pol. 2001, 48 (2): 351-358.
Moore S, Vrebalov J, Payton P, Giovannoni J: Use of genomics tools to isolate key ripening genes and analyse fruit maturation in tomato. J Exp Bot. 2002, 53 (377): 2023-2030. 10.1093/jxb/erf057.
Nam J, Kim J, Lee S, An G, Ma H, Nei M: Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms. Proc Natl Acad Sci U S A. 2004, 101 (7): 1910-1915. 10.1073/pnas.0308430100.
Furutani I, Sukegawa S, Kyozuka J: Genome-wide analysis of spatial and temporal gene expression in rice panicle development. Plant J. 2006, 46 (3): 503-511. 10.1111/j.1365-313X.2006.02703.x.
Itoh J, Nonomura K, Ikeda K, Yamaki S, Inukai Y, Yamagishi H, Kitano H, Nagato Y: Rice plant development: from zygote to spikelet. Plant Cell Physiol. 2005, 46 (1): 23-47. 10.1093/pcp/pci501.
Oryzabase . [http://www.shigen.nig.ac.jp/rice/oryzabase/top/top.jsp]
Wu Z, Irizarry RA, Gentleman R, Murillo FM, Spencer F: A Model Based Background Adjustment for Oligonucleotide Expression Arrays. Technical Report, Department of Biostatistics. Working Papers, Baltimore, MD. 2003
Pelucchi N, Fornara F, Favalli C, Masiero S, Lago C, Enrico Pe M, Colombo L, Kater MM: Comparative analysis of rice MADS-box genes expressed during flower development. Sex Plant Reprod. 2002, 15: 113-122. 10.1007/s00497-002-0151-7.
Kater MM, Dreni L, Colombo L: Functional conservation of MADS-box factors controlling floral organ identity in rice and Arabidopsis. J Exp Bot. 2006, 57 (13): 3433-3444. 10.1093/jxb/erl097.
Kyozuka J, Kobayashi T, Morita M, Shimamoto K: Spatially and temporally regulated expression of rice MADS box genes with similarity to Arabidopsis class A, B and C genes. Plant Cell Physiol. 2000, 41 (6): 710-718.
Lee S, Kim J, Han JJ, Han MJ, An G: Functional analyses of the flowering time gene OsMADS50, the putative SUPPRESSOR OF OVEREXPRESSION OF CO 1/AGAMOUS-LIKE 20 (SOC1/AGL20) ortholog in rice. Plant J. 2004, 38 (5): 754-764. 10.1111/j.1365-313X.2004.02082.x.
Coen ES, Meyerowitz EM: The war of the whorls: genetic interactions controlling flower development. Nature. 1991, 353 (6339): 31-37. 10.1038/353031a0.
Ma H: The unfolding drama of flower development: recent results from genetic and molecular analyses. Genes Dev. 1994, 8 (7): 745-756. 10.1101/gad.8.7.745.
Kramer EM, Dorit RL, Irish VF: Molecular evolution of genes controlling petal and stamen development: duplication and divergence within the APETALA3 and PISTILLATA MADS-box gene lineages. Genetics. 1998, 149 (2): 765-783.
Yamaguchi T, Lee DY, Miyao A, Hirochika H, An G, Hirano HY: Functional diversification of the two C-class MADS box genes OSMADS3 and OSMADS58 in Oryza sativa. Plant Cell. 2006, 18 (1): 15-28. 10.1105/tpc.105.037200.
Greco R, Stagi L, Colombo L, Angenent GC, Sari-Gorla M, Pe ME: MADS box genes expressed in developing inflorescences of rice and sorghum. Mol Gen Genet. 1997, 253 (5): 615-623. 10.1007/s004380050364.
Jia H, Chen R, Cong B, Cao K, Sun C, Luo D: Characterization and transcriptional profiles of two rice MADS-box genes. Plant Science. 2000, 155 (2): 115-122. 10.1016/S0168-9452(00)00191-6.
Kang HG, Jeon JS, Lee S, An G: Identification of class B and class C floral organ identity genes from rice plants. Plant Mol Biol. 1998, 38 (6): 1021-1029. 10.1023/A:1006051911291.
Kyozuka J, Shimamoto K: Ectopic expression of OsMADS3, a rice ortholog of AGAMOUS, caused a homeotic transformation of lodicules to stamens in transgenic rice plants. Plant Cell Physiol. 2002, 43 (1): 130-135. 10.1093/pcp/pcf010.
Moon YH, Jung JY, Kang HG, An G: Identification of a rice APETALA3 homologue by yeast two-hybrid screening. Plant Mol Biol. 1999, 40 (1): 167-177. 10.1023/A:1026429922616.
Prasad K, Vijayraghavan U: Double-stranded RNA interference of a rice PI/GLO paralog, OsMADS2, uncovers its second-whorl-specific function in floral organ patterning. Genetics. 2003, 165 (4): 2301-2305.
Lopez-Dee ZP, Wittich P, Enrico Pe M, Rigola D, Del Buono I, Gorla MS, Kater MM, Colombo L: OsMADS13, a novel rice MADS-box gene expressed during ovule development. Dev Genet. 1999, 25 (3): 237-244. 10.1002/(SICI)1520-6408(1999)25:3<237::AID-DVG6>3.0.CO;2-L.
Chung YY, Kim SR, Kang HG, Noh YS, Park MC, Finkel D, An G: Characterization of two MADS box genes homologous to GLOBOSA. Plant Science. 1995, 109: 45-56. 10.1016/0168-9452(95)04153-L.
de Folter S, Busscher J, Colombo L, Losa A, Angenent GC: Transcript profiling of transcription factor genes during silique development in Arabidopsis. Plant Mol Biol. 2004, 56 (3): 351-366. 10.1007/s11103-004-3473-z.
Kohler C, Hennig L, Spillane C, Pien S, Gruissem W, Grossniklaus U: The Polycomb-group protein MEDEA regulates seed development by controlling expression of the MADS-box gene PHERES1. Genes Dev. 2003, 17 (12): 1540-1553. 10.1101/gad.257403.
Portereiko MF, Lloyd A, Steffen JG, Punwani JA, Otsuga D, Drews GN: AGL80 is required for central cell and endosperm development in Arabidopsis. Plant Cell. 2006, 18: 1862-1872. 10.1105/tpc.106.040824.
International Rice Genome Sequencing Project: The map-based sequence of the rice genome. Nature. 2005, 436 (7052): 793-800. 10.1038/nature03895.
Vij S, Gupta V, Kumar D, Vydianathan R, Raghuvanshi S, Khurana P, Khurana JP, Tyagi AK: Decoding the rice genome. Bioessays. 2006, 28 (4): 421-432. 10.1002/bies.20399.
Paterson AH, Bowers JE, Chapman BA: Ancient polyploidization predating divergence of the cereals, and its consequences for comparative genomics. Proc Natl Acad Sci U S A. 2004, 101 (26): 9903-9908. 10.1073/pnas.0307901101.
Yu J, Wang J, Lin W, Li S, Li H, Zhou J, Ni P, Dong W, Hu S, Zeng C, Zhang J, Zhang Y, Li R, Xu Z, Li S, Li X, Zheng H, Cong L, Lin L, Yin J, Geng J, Li G, Shi J, Liu J, Lv H, Li J, Wang J, Deng Y, Ran L, Shi X, Wang X, Wu Q, Li C, Ren X, Wang J, Wang X, Li D, Liu D, Zhang X, Ji Z, Zhao W, Sun Y, Zhang Z, Bao J, Han Y, Dong L, Ji J, Chen P, Wu S, Liu J, Xiao Y, Bu D, Tan J, Yang L, Ye C, Zhang J, Xu J, Zhou Y, Yu Y, Zhang B, Zhuang S, Wei H, Liu B, Lei M, Yu H, Li Y, Xu H, Wei S, He X, Fang L, Zhang Z, Zhang Y, Huang X, Su Z, Tong W, Li J, Tong Z, Li S, Ye J, Wang L, Fang L, Lei T, Chen C, Chen H, Xu Z, Li H, Huang H, Zhang F, Xu H, Li N, Zhao C, Li S, Dong L, Huang Y, Li L, Xi Y, Qi Q, Li W, Zhang B, Hu W, Zhang Y, Tian X, Jiao Y, Liang X, Jin J, Gao L, Zheng W, Hao B, Liu S, Wang W, Yuan L, Cao M, McDermott J, Samudrala R, Wang J, Wong GK, Yang H: The Genomes of Oryza sativa: a history of duplications. PLoS Biol. 2005, 3 (2): e38-10.1371/journal.pbio.0030038.
Irish VF, Litt A: Flower development and evolution: gene duplication, diversification and redeployment. Curr Opin Genet Dev. 2005, 15 (4): 454-460. 10.1016/j.gde.2005.06.001.
Lozano R, Angosto T, Gomez P, Payan C, Capel J, Huijser P, Salinas J, Martinez-Zapater JM: Tomato flower abnormalities induced by low temperatures are associated with changes of expression of MADS-Box genes. Plant Physiol. 1998, 117 (1): 91-100. 10.1104/pp.117.1.91.
Bonhomme F, Kurz B, Melzer S, Bernier G, Jacqmard A: Cytokinin and gibberellin activate SaMADS A, a gene apparently involved in regulation of the floral transition in Sinapis alba. Plant J. 2000, 24 (1): 103-111. 10.1046/j.1365-313x.2000.00859.x.
Ando S, Sato Y, Kamachi S, Sakai S: Isolation of a MADS-box gene (ERAF17) and correlation of its expression with the induction of formation of female flowers by ethylene in cucumber plants (Cucumis sativus L.). Planta. 2001, 213 (6): 943-952.
Zhu C, Perry SE: Control of expression and autoregulation of AGL15, a member of the MADS-box family. Plant J. 2005, 41 (4): 583-594. 10.1111/j.1365-313X.2004.02320.x.
Fornara F, Parenicova L, Falasca G, Pelucchi N, Masiero S, Ciannamea S, Lopez-Dee Z, Altamura MM, Colombo L, Kater MM: Functional characterization of OsMADS18, a member of the AP1/SQUA subfamily of MADS box genes. Plant Physiol. 2004, 135 (4): 2207-2219. 10.1104/pp.104.045039.
Tardif G, Kane NA, Adam H, Labrie L, Major G, Gulick P, Sarhan F, Laliberte JF: Interaction network of proteins associated with abiotic stress response and development in wheat. Plant Mol Biol. 2007, 63 (5): 703-718. 10.1007/s11103-006-9119-6.
UniProt Knowledgebase. [http://www.expasy.org/sprot/]
Madera M, Gough J: A comparison of profile hidden Markov model procedures for remote homology detection. Nucleic Acids Res. 2002, 30 (19): 4321-4328. 10.1093/nar/gkf544.
TIGR Rice Genome Annotation. [http://www.tigr.org/tdb/e2k1/osa1/]
Knowledge-based Oryza Molecular biological Encyclopedia (KOME). [http://cdna01.dna.affrc.go.jp/cDNA/]
Kikuchi S, Satoh K, Nagata T, Kawagashira N, Doi K, Kishimoto N, Yazaki J, Ishikawa M, Yamada H, Ooka H, Hotta I, Kojima K, Namiki T, Ohneda E, Yahagi W, Suzuki K, Li CJ, Ohtsuki K, Shishiki T, Otomo Y, Murakami K, Iida Y, Sugano S, Fujimura T, Suzuki Y, Tsunoda Y, Kurosaki T, Kodama T, Masuda H, Kobayashi M, Xie Q, Lu M, Narikawa R, Sugiyama A, Mizuno K, Yokomizo S, Niikura J, Ikeda R, Ishibiki J, Kawamata M, Yoshimura A, Miura J, Kusumegi T, Oka M, Ryu R, Ueda M, Matsubara K, Kawai J, Carninci P, Adachi J, Aizawa K, Arakawa T, Fukuda S, Hara A, Hashizume W, Hayatsu N, Imotani K, Ishii Y, Itoh M, Kagawa I, Kondo S, Konno H, Miyazaki A, Osato N, Ota Y, Saito R, Sasaki D, Sato K, Shibata K, Shinagawa A, Shiraki T, Yoshino M, Hayashizaki Y, Yasunishi A: Collection, mapping, and annotation of over 28,000 cDNA clones from japonica rice. Science. 2003, 301 (5631): 376-379. 10.1126/science.1081288.
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25 (17): 3389-3402. 10.1093/nar/25.17.3389.
Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG: The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 1997, 25 (24): 4876-4882. 10.1093/nar/25.24.4876.
Letunic I, Goodstadt L, Dickens NJ, Doerks T, Schultz J, Mott R, Ciccarelli F, Copley RR, Ponting CP, Bork P: Recent improvements to the SMART domain-based sequence annotation resource. Nucleic Acids Res. 2002, 30 (1): 242-244. 10.1093/nar/30.1.242.
Schultz J, Milpetz F, Bork P, Ponting CP: SMART, a simple modular architecture research tool: identification of signaling domains. Proc Natl Acad Sci U S A. 1998, 95 (11): 5857-5864. 10.1073/pnas.95.11.5857.
NCBI Conserved Domains Database. [http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi]
Saitou N, Nei M: The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987, 4 (4): 406-425.
Page RD: TreeView: an application to display phylogenetic trees on personal computers. Comput Appl Biosci. 1996, 12 (4): 357-358.
Bailey TL, Elkan C: The value of prior knowledge in discovering motifs with MEME. Proc Int Conf Intell Syst Mol Biol. 1995, 3: 21-29.
Chomczynski P, Sacchi N: Single-step method of RNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction. Anal Biochem. 1987, 162 (1): 156-159. 10.1016/0003-2697(87)90021-2.
Singh G, Kumar S, Singh P: A quick method to isolate RNA from wheat and other carbohydrate-rich seeds. Plant Mol Biol Rep. 2003, 21: 93a-f.
Gene Expression Omnibus. [http://www.ncbi.nlm.nih.gov/geo/]
Jain M, Kaur N, Garg R, Thakur JK, Tyagi AK, Khurana JP: Structure and expression analysis of early auxin-responsive Aux/IAA gene family in rice (Oryza sativa). Funct Integr Genomics. 2006, 6 (1): 47-59. 10.1007/s10142-005-0005-0.
We acknowledge Dr. Ramesh Hariharan, Dr. Dibyendu Kumar and Mrs. Rashmi Jain for their help in microarray and other computational analyses. Thanks are also due to Professor J. P. Khurana, Drs. J. Able and Meenu Kapoor for critical reading of the manuscript. KOME and TIGR database resources are acknowledged for making available the detailed sequence information on rice. Expression data for Arabidopsis has been obtained from GEO database at NCBI. Senior Research fellowship by the Council for Scientific and Industrial Research (CSIR) to R.A. and S.R. and University Grants Commissions (UGC) fellowship to P.A. is also acknowledged. The project has been funded by the Department of Biotechnology, Government of India.
RA has done the computational analysis, microarray experiments for mature leaf and panicle stages, performed real-time PCR, analyzed the data and drafted the manuscript. PA has performed microarray for root and seed stages. SR has performed microarray experiments for seedling and stress stages. AKS and VPS had grown the rice plants, provided tissue material, and helped in identification of biological stages for microarray experiments. AKT was involved in planning of experiments, revised the final version of the manuscript and headed the project. SK designed and participated in all the experiments and revised the final text of the manuscript. All authors have read and approved the final manuscript.
Electronic supplementary material
Figure S1. Phylogenetic analysis of 173 MADS-box domain sequences from rice (75) and Arabidopsis (98)
Additional file 1: . ~60 residues of conserved MADS-box domain are used for constructing the phylogenetic tree. Arabidopsis genes are marked by A followed by number. Similarly rice genes are marked by O followed by number assigned to each gene. Due to shorter branch lengths of MIKCc-type genes, a magnified view is presented for this group to mark the genes. (PDF 360 KB)
Additional file 2: . An unrooted tree has been drawn using complete MADS-box proteins of rice and Arabidopsis. Bootstrap values from 1000 replicates have been indicated at nodes. Scale bar represents 0.1 amino acid substitution per site. (PDF 400 KB)
Additional file 3: . An unrooted NJ (neighbor-joining) tree based on all the nucleotide substitutions in coding exons of rice and Arabidopsis MADS-box genes. Bootstrap probabilities from 1000 replicates have been indicated at nodes. The genes showing different topology in DNA-based tree in comparison to protein-based tree have been marked by *. Scale bar represents 0.1 substitutions per nucleotide position. (PDF 407 KB)
Additional file 4: Table S1. Sequence and length of motifs identified from rice MADS-box proteins using MEME motif search tool (aa, amino acids). (XLS 21 KB)
Additional file 7: Table S4. Results of differential expression analysis taking mature leaf as reference. Fold changes have been taken with p value less than or equal to 0.005. (XLS 82 KB)
Additional file 8: Table S5. Results obtained after differential expression analysis to extract stress responsive genes using microarrays (seedling taken as reference). (XLS 36 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Arora, R., Agarwal, P., Ray, S. et al. MADS-box gene family in rice: genome-wide identification, organization and expression profiling during reproductive development and stress. BMC Genomics 8, 242 (2007). https://doi.org/10.1186/1471-2164-8-242
- Seed Development
- Duplicate Gene
- Serum Response Factor
- Duplicate Segment
- Simple Modular Architecture Research Tool