- Research article
Assessment of adaptive evolution between wheat and rice as deduced from full-length common wheat cDNA sequence data and expression patterns
BMC Genomicsvolume 10, Article number: 271 (2009)
Wheat is an allopolyploid plant that harbors a huge, complex genome. Therefore, accumulation of expressed sequence tags (ESTs) for wheat is becoming particularly important for functional genomics and molecular breeding. We prepared a comprehensive collection of ESTs from the various tissues that develop during the wheat life cycle and from tissues subjected to stress. We also examined their expression profiles in silico. As full-length cDNAs are indispensable to certify the collected ESTs and annotate the genes in the wheat genome, we performed a systematic survey and sequencing of the full-length cDNA clones. This sequence information is a valuable genetic resource for functional genomics and will enable carrying out comparative genomics in cereals.
As part of the functional genomics and development of genomic wheat resources, we have generated a collection of full-length cDNAs from common wheat. By grouping the ESTs of recombinant clones randomly selected from the full-length cDNA library, we were able to sequence 6,162 independent clones with high accuracy. About 10% of the clones were wheat-unique genes, without any counterparts within the DNA database. Wheat clones that showed high homology to those of rice were selected in order to investigate their expression patterns in various tissues throughout the wheat life cycle and in response to abiotic-stress treatments. To assess the variability of genes that have evolved differently in wheat and rice, we calculated the substitution rate (Ka/Ks) of the counterparts in wheat and rice. Genes that were preferentially expressed in certain tissues or treatments had higher Ka/Ks values than those in other tissues and treatments, which suggests that the genes with the higher variability expressed in these tissues is under adaptive selection.
We have generated a high-quality full-length cDNA resource for common wheat, which is essential for continuation of the ongoing curation and annotation of the wheat genome. The data for each clone's expression in various tissues and stress treatments and its variability in wheat and rice as a result of their diversification are valuable tools for functional genomics in wheat and for comparative genomics in cereals.
Wheat is mainly cultivated in temperate zones and is one of the world's main staple foods. Wheat is polyploid and common wheat is an allohexaploid that has three homoeologous genomes. These genomes have been designated as A, B and D, with the coding regions of the homoeologous genes sharing more than 90% homology. Due to the huge genome size of the hexaploid (17 Gbp)  along with the high content of repeat sequences, it is very difficult to carry out complete sequencing of the entire genome or to perform forward genetics in polyploid wheat. Although common wheat genetic maps with molecular markers and cytological maps with deletion mutants of the chromosome segments have been constructed [2, 3], the number of mapped DNA markers for both map-based cloning and anchoring of the genome positions remains restricted. Comparison of wheat and rice genomic sequences to corresponding full-length cDNAs can provide information on exon-intron boundaries. With this information, it should be possible to set up primers for PCR-based markers such as the simple-sequence repeats . An international effort has been made to determine the DNA sequence of the entire chromosome of group 3 in bread wheat and in its ancestor, Aegilops tauschii (the D genome donor) . However, transformation of wheat is still difficult, and tagged lines with transposable elements or T-DNA are not yet available. While tools for forward genetics have proven insufficient for wheat, a recent report  suggested that RNA interference can suppress the action of three homoeologous genes, even in polyploid wheat. Therefore, reverse genetic approaches based on functional genomics might be quite useful. We thus have been collecting expressed sequence tags (ESTs) from single limited strains of common wheat [7, 8]. Simultaneous efforts from both our own and other laboratories have generated data on more than one million wheat ESTs http://www.ncbi.nlm.nih.gov./dbEST. With the recent innovations that have been developed for the systematic collection and in silico display of comprehensive ESTs from a number of tissues, including abiotic-stressed tissues , it has become possible to characterize the expression profiles of target genes in these particular tissues and treatments.
Sequence-verified full-length cDNA clones with high accuracy that harbor protein coding sequences are critical for advances in structural, functional and comparative genomic studies. Using full-length cDNA sequence data, the protein-coding regions in the genome can be precisely predicted. In the wheat genome, gene annotation using full-length cDNA sequence data is essential, because there are many repetitive sequences and retrotransposons that can cause confusion when trying to predict the gene regions in genomic sequences [see examples in reference ]. In addition, functional annotation should be more reliable for converted amino acid sequence (coding sequence or CDS) predicted from full-length cDNA sequence data. Employing CDS data should enable inference of functional roles based on gene ontology from model plants such as rice and Arabidopsis (InterPro: http://www.ebi.ac.uk/interpro/) .
CDS data can also be used to search for counterparts in related plant species. Wheat and rice both belong to the grass family, the Gramineae, and CDS data can reveal characteristic breeding behavior and the ability to adapt to environmental conditions that led to their diversification more than 50 million years ago . Wheat, which originated in temperate zones, grows on dry land, is a long-day plant, has a shortened rachis, accumulates gluten-rich flour, and its germination is promoted in response to cold temperatures . In contrast, rice, which is cultivated in tropical or subtropical areas, grows in water-rich conditions, is a short-day plant, has an elongated rachis, accumulates storage proteins via two types of protein bodies, and its germination is promoted in response to high temperatures . Although it has long been theorized that selection pressure controls the distinct growth habits of wheat and rice, there has been no systematic investigation on the evolutionary rates of change for a genome-wide set of growth habit-related genes for these plants. The main reason for this is that DNA sequence information for the wheat genome is still limited.
Here we report on the construction of a full-length cDNA library for Chinese Spring wheat and its resulting quality as a starting resource for the complete sequencing of 6,162 independent full-length cDNA clones. We describe the overall characteristics of these full-length cDNA clones and their annotation compared to other model plants. From these clones, we selected 3,487 genes for which expression patterns could be traced with EST data in 28 tissues that are either observed during the wheat life cycle or were also stress-treated [7, 8]. Based on hierarchical cluster classification of expression pattern, the substitution rates (Ka/Ks) of genes common to wheat and rice were calculated in order to assess the variability of the gene grouping in each cluster. Genes preferentially expressed in certain tissues or stress treatments showed higher Ka/Ks values, suggesting that molecular selection occurred during the diversification of wheat and rice.
Results and Discussion
Sequence determination and functional annotation of full-length wheat cDNAs
The CAP-trapper method [14, 15] was used to construct a full-length cDNA library from pooled RNAs derived from the common wheat (Triticum aestivum cv. Chinese Spring). The library included 17 tissues that are either formed during the wheat life cycle or were subjected to an abiotic stress (Table 1). A total of 19,968 randomly selected recombinant clones from the full-length cDNA library have been sequenced from both ends. By grouping these one-path sequences, we obtained 7,149 independent gene clusters, which made it possible to group the homoeogenes expressed from each of the three common wheat genomes (A, B and D genomes) . The largest clones were representative of each gene cluster and harbored the CDSs. After they were rearrayed, we were able to determine the full-length sequences of 6,162 clones. Figure 1 presents a detailed description of the sequence data for the full-length cDNA clones and the size distribution of the inserted DNAs. The full-length wheat cDNA data were used to carry out a blast homology search versus the public DNA database. About 10% of the clones were determined to be wheat-specific genes without any counterparts in the DNA database. Molecular functions of these sequenced cDNAs were predicted using the InterPro database . The distribution of the wheat genes that were classified into various categories of the InterPro database was similar to that for rice and Arabidopsis. These results indicate that the selection of full-length wheat cDNAs from the pooled RNAs reflects their expression within the tissues (Figure 2).
Comparison of gene nucleotide substitution rates in coding regions in wheat and rice
To compare the gene nucleotide substitution rates of wheat and rice, we selected 4,321 wheat genes that had deduced amino acid sequences with more than 80% overlapping homology as determined by the blastx method. In contrast, when the homology search was carried out against the wheat EST database using selected rice genes as the query, the corresponding wheat genes had the highest ranking. In order to examine the expression profiles in various tissues of wheat, including abiotic-stressed tissues, we used blastn (E < 1e-60) to search the wheat EST database (MUGEST: http://www.shigen.nig.ac.jp/wheat/komugi/top/top.jsp) for corresponding ESTs of the 4,321 wheat cDNAs described in references 7 and 8. We selected 3,487 wheat genes to compare their amino acid sequences in wheat and rice (Additional file 1). In addition, we were also able to trace their expression profiles in 28 normal or stress-treated tissues of common wheat (Table 2). Since the most common way to estimate selection constraints on protein evolution is using the ratio of the nonsynonymous (Ka) and synonymous (Ks) substitution rates , we calculated the Ka/Ks ratio for each gene to compare the values for the wheat and rice counterparts. The Ka/Ks value varied from 1.3 to 0, with the average being 0.2363 (Figure 3). The likely occurrence of adaptive evolution for a given gene is indicated by a Ka/Ks value greater than 1 . Out of the 3,487 wheat genes, 12 had Ka/Ks values greater than 1. Although the identity of most of these 12 genes is yet unknown, five have been annotated, including one gene as involved in lipid transfer, one with homology to heat shock protein HSP20, one to proteinase inhibitor, one to thionin and one to an RNA binding protein. Genes for lipid transfer proteins and HSP20 have been demonstrated to be involved in stress responses, such as plant defense  and heat shock . A proteinase inhibitor gene has been reported to regulate plant development  and drought tolerance . Thionin genes are specifically expressed in the seed and control plant defense against Fusarium head blight [22, 23]. RNA binding protein also has a role in both the development of flowers and seeds . Taken together, this evidence suggests that positive selection may have a role for beneficial functions of these genes. Therefore, to some extent these genes characterize the adaptation that must occur in order for growth to continue within the specific habitats for wheat and rice.
Comparative and functional genomic analysis of full-length wheat cDNAs
In order to examine nucleotide substitution rates of the genes in relation to their expression patterns for the various tissues and stress treatments, global expression profiles of the 3,487 full-length cDNA genes in the 28 tissues and treatments were estimated using the full-length cDNAs as scaffolds to cluster ESTs. Expression patterns of the clustered wheat ESTs (contigs) have been previously reported [7, 8]. The number of EST constituents assigned to the 28 cDNA libraries was scored for each contig, which produced a two-way expression profile, i.e., contig vs. library. As seen in Figure 3, hierarchical clustering was constructed based on the EST constituent matrix . To determine the counterparts of the 3,487 full-length cDNAs in the wheat ESTs, we performed a search with blastn (E < 1e-60) against the contigs [7, 8]. Based on the expression patterns of their counterpart contigs after hierarchical clustering, the 3,487 genes were classified into 30 major groups. In addition to the genes that were ubiquitously or nearly ubiquitously expressed in the 28 tissues and treatments (designated as ubiquitous or "ub"), we also classified the genes that were dominantly expressed in each of these 28 tissues and treatments (Figure 3 and Table 2). Out of the 28 cDNA libraries, 13 were assigned to a life cycle group, while 15 were assigned to the stressed-tissue group (Figure 3) . During the clustering, a group was found that showed dominant expression in all of the stress-treated tissues (Figure 3). These genes were designated stress constant or "strc". The number of genes classified into each category ranged from 44 (heat shock) to 182 (dried roots), with a mean of 109.9.
In order to assess the relationship between the variability of the genes and their expression profiles, the Ka/Ks value of each gene was plotted against its expression pattern (listed above the expression profile of each contig in Figure 3). The mean Ka/Ks values for the genes classified in each group are shown in Figure 4. In addition, the Ka/Ks values of the life cycle and stressed-tissue groups are respectively displayed in Figure 4A and 4B. Analysis of variance indicated a significant difference in the Ka/Ks value between the groups. Notably, the Ka/Ks values of the "ub" and "strc" genes were significantly lower than the average of the 3,487 genes, which suggests a selective constraint on these subsets of the genes . Genes preferentially expressed in the callus (Figure 4B) also had significantly lower Ka/Ks values than the average. On the other hand, the 35 genes expressed in six tissues and in the stressed tissues, namely, root (r), the spike at booting (o), the spike at flowering (f), the developing seed 30 days post-anthesis (DPA) (sl), seedlings subjected to a cold treatment (kv) and desiccated seedlings (vd), all had significantly higher mean Ka/Ks values compared to the overall average (Figure 4A, B). This shows that there was a loose selection constraint against these genes during the evolutionary divergence of wheat and rice. Table 3 lists the annotated genes that displayed higher Ka/Ks values (greater than 0.5) for the genes expressed in these six tissues and treatments (for further explanation, see Additional file 2). These six tissues and treatments characterize the growth habit or speciation requirements of wheat and rice. For example, rice roots are usually covered by water, which creates anaerobic conditions around the roots . In contrast, since wheat plants prefer dry land, anaerobic conditions do not occur around the wheat roots . Among the genes that display higher Ka/Ks values, genes encoding transcription factors [29, 30], beta-glucanase  and nodulin , which are regulated by abiotic as well as biotic stresses, were found. Additionally, radial oxygen loss is a critical marker for adaptation of roots to anaerobic conditions [33, 34]. Two genes related to radial oxygen loss had higher Ka/Ks values (Table 1 and Additional file 2). The evidence accumulating from these observations suggests that genes expressed in the roots that are related to signal perception or transduction, transcription regulation, or stress responses have all been exposed, to some extent, to the adaptive selection that occurred during the diversification of wheat and rice.
The spike at the booting stage of both wheat and rice, during which the critical process of meiosis occurs, generates the gametes [35, 36]. Among the four annotated genes in this tissue, two genes are related to lipid metabolism for cellular activity [37, 38] and two genes are associated with redox reactions in photosynthesis  and respiration . All of these had relatively higher Ka/Ks values (Table 3 and Additional file 2), suggesting that the spike at the booting stage is under highly stressed conditions for dry (wheat) and moist (rice) habitats.
The spike that occurs at flowering supplies the field for pollination. Thus, genes that control compatibility for fertilization need to work at this particular stage . Among the genes that were preferentially expressed at this stage, the RNA binding protein showed Ka/Ks values that were higher than 1, which suggests a key role in the pollination by the spikes during flowering . The DNA binding histone-like protein that is specifically expressed in this tissue has a higher Ka/Ks value. Additionally, a methyltransferase involved in epigenetic regulation has highly diverged. Certain stress-related proteins such as glutathione S-transferase , osmotin [43, 44], pectate lyase  and glucan endo-1,3-beta-D-glucosidase  were also characteristically expressed in this tissue. Likewise, a gene encoding a receptor-like kinase is also uniquely expressed in this tissue. In addition, genes that encode pollen allergen-related protein [47, 48] and proteinase inhibitor  were expressed in the spike at flowering. All of these data suggest that selection for recognition between male and female gametes, modulation of nucleic acids, stress responses and signal transduction in the field during pollination might operate on these genes.
In response to internal and external environments, developing seeds at 30 DPA enter into dormancy [50, 51]. Genes for thionin 1 and 2, polysaccharide-related proteins, lipid transfer protein and ankyrins are all involved in controlling plant defense at this stage (Additional file 2). Stage-specific proteins that play substantial roles in the seeds, such as late embryo abundant proteins I and II , stress-related protein  and α-amylase , all showed higher Ka/Ks values during this part of the life cycle. This suggests that genes for signal perception during seed dormancy, along with those for stage-specific and stress-related conditions, might characterize the growth habits of wheat and rice in situations where they are adapting to environmental conditions.
Genes expressed in cold-treated and desiccated seedlings showed higher Ka/Ks values than the others. Genes found to be involved after these treatments are normally associated with cell membranes, cell walls, stress and senescence.
It is assumed that adaptive selection works on a large number of genes, some of which show rapid evolution. Unfortunately, in genes that have rapidly evolved, there is great divergence in the plants that have resulted from genetic divergence, which makes it difficult to trace their orthology. In the current study, after deducing the complete sequence of full-length cDNA clones, we compared coding sequences of common wheat genes to their rice counterparts. The expression patterns in various tissues during the wheat life cycle as well as in stress-treated tissues can be systematically monitored . Thus, based on mean Ka/Ks values, it may be possible to infer the variability of genes in their sequences that will be expressed in their respective tissues or treatments to the plants. Genes ubiquitously expressed throughout the majority of the tissues or stress treatments exhibited lower Ka/Ks values, whereas significantly higher Ka/Ks values were noted both for the genes that were characteristically expressed in four tissues of the wheat life cycle (the root, the spike at the booting stage, the spike at flowering and the seed at 30 DPA), and for the genes expressed in response to the two stress treatments (seedlings undergoing cold treatment and desiccated seedlings). These findings of gene expression patterns in response to stresses are also reported for mammalian genes . Therefore, to some extent, positive selection might play a role in the gene expression that occurs in response to environmental changes.
We have developed a resource of a large number of sequenced full-length cDNAs of common wheat that covers the majority of the functional annotations deduced from gene ontology of rice and Arabidopsis. This full-length wheat cDNA resource is indispensable for gene annotations in the wheat genome for which sequencing is still ongoing, and for the functional analysis of these genes and their products. The information presented here on the full-length wheat cDNA sequences, their variability during the evolution of cereals and their expression profiles in various tissues during both the life cycle and in response to stress treatments should facilitate functional genomics and genome breeding of wheat and other cereals.
Construction of full-length wheat cDNA library, DNA sequencing, and selection of independent cDNA clones
The CAP-trapper method [14, 15] was used to construct a full-length cDNA library from pooled RNAs derived from 17 samples of common wheat (Triticum aestivum cv. Chinese Spring) tissues, a combination of those formed during the wheat life cycle and those subjected to abiotic stresses (Table 1). Subsequently, 39,936 cDNA clones were randomly selected from the library and sequenced by a one-path method from both ends of the inserts. These DNA sequences were assembled with the Phrap method (University of Washington Genome Center; http://www.phrap.org) using the program new_ace-penalty-5 – mismatch 50 – minscore 100. To construct the gene clusters (E < 1e-30), the resultant contigs were clustered using the blastn method . Representative clones from each gene cluster that was predicted to harbor CDSs were then rearrayed. Subsequently, 6,162 full-length cDNA clones were selected, and their inserts were completely sequenced using the primer walking method.
Data mining and characterization of full-length wheat cDNA clones
The wheat cDNAs were annotated using the blast method against the public database (E < 1e-5). Rice counterparts for each of the wheat cDNAs were selected from the public database when a blastx search determined that there was more than 80% overlapping homology in the deduced amino acids. The CDSs of the counterparts were automatically aligned. The alignments were needed to be corrected manually. Then, the Ka/Ks values  were calculated for the codons that corresponded between the wheat and rice homologues.
cDNA expression patterns were monitored using a tissue expression map of wheat, which included 28 cDNA libraries (Table 2). The blastn program (with E < 1e-60) was then used to search the full-length cDNAs for their correct counterparts in the wheat tissue expression map, with the full-length cDNAs used as scaffolds to cluster ESTs. Similarities between the resultant 3,487 cDNAs and the libraries were estimated using Pearson's correlation coefficient . Hierarchical clustering http://rana.lbl.gov/eisen/ was applied in order to compare these cDNAs expression profiles among the 28 wheat libraries. Expression profiles were displayed based on the number of constituents in the cDNAs along with their Ka/Ks value.
Full-length cDNA data
The full-length cDNA data have been deposited in the DDBJ under accession nos. AK330135 to AK336296, and are available at http://www.shigen.nig.ac.jp/wheat/komugi/top/top.jsp.
Bennett MD, Leitch IJ: Nuclear DNA amounts in angiosperms. Ann Bot. 1995, 76: 113-176. 10.1006/anbo.1995.1085.
Xue S, Zhang Z, Lin F, Kong Z, Cao Y, Li C, Yi H, Mei M, Zhu H, Wu J, Xu H, Zhao D, Tian D, Zhang C, Ma Z: A high-density intervarietal map of the wheat genome enriched with markers derived from expressed sequence tags. Theor Appl Genet. 2008, 117: 181-189. 10.1007/s00122-008-0764-9.
Qi LL, Echalier B, Chao S, Lazo GR, Butler GE, Anderson OD, Akhunov ED, Dvorák J, Linkiewicz AM, Ratnasiri A, Dubcovsky J, Bermudez-Kandianis CE, Greene RA, Kantety R, La Rota CM, Munkvold JD, Sorrells SF, Sorrells ME, Dilbirligi M, Sidhu D, Erayman M, Randhawa HS, Sandhu D, Bondareva SN, Gill KS, Mahmoud AA, Ma XF, Miftahudin , Gustafson JP, Conley EJ, Nduati V, Gonzalez-Hernandez JL, Anderson JA, Peng JH, Lapitan NL, Hossain KG, Kalavacharla V, Kianian SF, Pathan MS, Zhang DS, Nguyen HT, Choi DW, Fenton RD, Close TJ, McGuire PE, Qualset CO, Gill BS: A chromosome bin map of 16,000 expressed sequence tag loci and distribution of genes among the three genomes of polyploid wheat. Genetics. 2004, 168: 701-712. 10.1534/genetics.104.034868.
Ishikawa G, Yonemaru J, Saito M, Nakamura T: PCR-based landmark unique gene (PLUG) markers effectively assign homoeologous wheat genes to A, B and D genomes. BMC Genomics. 2007, 8: 135-10.1186/1471-2164-8-135.
Gill BS, Appels R, Botha-Oberholster A-M, Buell CR, Bennetzen JL, Chalhoub B, Chumley F, Dvorak J, Iwanaga M, Keller B, Li W, McCombie WR, Ogihara Y, Quetier F, Sasaki T: A workshop report on wheat genome sequencing: International genome research on wheat consortium. Genetics. 2004, 168: 1087-1096. 10.1534/genetics.104.034769.
Fu D, Uauy C, Blechl A, Dubcovsky J: RNA interference for wheat functional gene analysis. Transgenic Res. 2007, 16: 689-701. 10.1007/s11248-007-9150-7.
Ogihara Y, Mochida K, Nemoto Y, Murai K, Yamazaki Y, Shin-I T, Kohara Y: Correlated clustering and virtual display of gene expression patterns in the wheat life cycle by large-scale statistical analyses of expressed sequence tags. Plant J. 2003, 33: 1001-1011. 10.1046/j.1365-313X.2003.01687.x.
Mochida K, Kawaura K, Shimosaka E, Kawakami N, Shin-I T, Kohara Y, Yamazaki Y, Ogihara Y: Tissue expression map of comprehensive expressed sequence tags and its application to in silico screening of stress response genes in common wheat. Mol Genet Genomics. 2006, 276: 304-312. 10.1007/s00438-006-0120-1.
Salse J, Bolot S, Throude M, Jouffe V, Piegu B, Quraishi UM, Calcagno T, Cooke R, Delseny M, Feuillet C: Identification and characterization of shared duplications between rice and wheat provide new insight into grass genome evolution. Plant Cell. 2008, 20: 11-24. 10.1105/tpc.107.056309.
Apweiler R, Attwood TK, Bairoch A, Bateman A, Birney E, Biswas M, Bucher P, Cerutti L, Corpet F, Croning MD, Durbin R, Falquet L, Fleischmann W, Gouzy J, Hermjakob H, Hulo N, Jonassen I, Kahn D, Kanapin A, Karavidopoulou Y, Lopez R, Marx B, Mulder NJ, Oinn TM, Pagni M, Servant F, Sigrist CJ, Zdobnov EM: The InterPro database, an integrated documentation resource for protein families, domains and functional sites. Nucleic Acids Res. 2001, 29: 37-40. 10.1093/nar/29.1.37.
Gaut BS: Evolutionary dynamics of grass genomes. New Phytol. 2002, 154: 15-28. 10.1046/j.1469-8137.2002.00352.x.
Zohary D, Hopf M: Domestication of Plants in the Old World. 2001, Oxford University Press, New York
Oka HI: Origin of Cultivated Rice. 1988, Elsevier Publications, Amsterdam
Seki M, Narusaka M, Kamiya A, Ishida J, Satou M, Sakurai T, Nakajima M, Enju A, Akiyama K, Oono Y, Muramatsu M, Hayashizaki Y, Kawai J, Carninci P, Itoh M, Ishii Y, Arakawa T, Shibata K, Shinagawa A, Shinozaki K: Functional annotation of a full-length Arabidopsis cDNA collection. Science. 2002, 296: 141-145. 10.1126/science.1071006.
Ogihara Y, Mochida K, Kawaura K, Murai K, Seki M, Kamiya A, Shinozaki K, Carninci P, Hayashizaki Y, Shin-I T, Kohara Y, Yamazaki Y: Construction of a full-length cDNA library from young spikelets of hexaploid wheat and its characterization by large-scale sequencing of expressed sequence tags. Genes Genet Syst. 2004, 79: 227-232. 10.1266/ggs.79.227.
Mochida K, Yamazaki Y, Ogihara Y: Discernment of homoeologous gene expression in hexaploid wheat by SNP analysis of contigs grouped from a large number of expressed sequence tags. Mol Genet Genomics. 2003, 270: 371-377. 10.1007/s00438-003-0939-7.
Li WH: Unbiased estimation of the rates of synonymous and nonsynonymous substitutions. J Mol Evol. 1993, 36: 96-99. 10.1007/BF02407308.
Cheng CS, Samuel D, Liu YJ, Shyu JC, Lai SM, Liu KF, Lyu PC: Binding mechanism of nonspecific lipid transfer proteins and their role in plant defense. Biochemistry. 2004, 43: 13628-13636. 10.1021/bi048873j.
Swindell WR, Huebner M, Weber AP: Transcriptional profiling of Arabidopsis heat shock proteins and transcription factors reveals extensive overlap between heat and non-heat stress response pathways. BMC Genomics. 2007, 8: 125-10.1186/1471-2164-8-125.
Wang J, Shi ZY, Wan XS, Sheu GZ, Zhang JL: The expression pattern of a rice proteinase inhibitor gene OsP18-1 implies its role in plant development. J Plant Physiol. 2008, 165: 1519-1529. 10.1016/j.jplph.2007.08.008.
Huang Y, Xiao B, Xiong L: Characterization of a stress responsive proteinase inhibitor gene with positive effect in improving drought resistance in rice. Planta. 2007, 226: 73-85. 10.1007/s00425-006-0469-8.
Poppenberger B, Berthiller F, Lucyshyn D, Sieberer T, Schuhmacher R, Krska R, Kuchlerk , Glössl J, Luschnig C, Adam G: Detoxification of the Fusarium mycotoxin deoxynivalenol by a UDP-glucosyltransferase from Arabidopsis thaliana. J Biol Chem. 2003, 278: 47905-47914. 10.1074/jbc.M307552200.
Sepulveda-Jimenez G, Rueda-Benitez P, Porta H, Rocha-Sosa M: A red beet (Beta vulgaris) UDP-glucosyl transferase gene induced by wounding, bacterial infiltration and oxidative stress. J Exp Bot. 2005, 56: 605-611. 10.1093/jxb/eri036.
Fusaro AF, Bocca SN, Ramos RL, Barroco RM, Magioli C, Jorge VC, Couthinho TC, Rangel-Lima CM, De Rycke R, Inze D, Engler G, Sachetto-Martins G: AtGRP2, a cold-induced nucleo-cytoplasmic RNA-binding protein, has a role in flower and seed development. Planta. 2007, 225: 1339-1351. 10.1007/s00425-006-0444-4.
Eisen MB, Spellman PT, Brown PO, Botstein D: Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci USA. 1998, 95: 14863-14868. 10.1073/pnas.95.25.14863.
Duret L, Mouchiroud D: Determinants of substitution rates in mammalian genes: Expression pattern affects selection intensity but not mutation rate. Mol Biol Evol. 2000, 17: 68-70.
Tsuji H, Saika H, Tsutsumi N, Hirai A, Nakazono M: Dynamic and reversible changes in histone H3-Lys4 methylation and H3 acetylation occurring at submergence-inducible genes in rice. Plant Cell Physiol. 2006, 47: 995-1003. 10.1093/pcp/pcj072.
Mohammadi M, Kav NN, Deyholos MK: Transcriptional profiling of hexaploid wheat (Triticum aestivum L.) roots identifies novel, dehydration-responsive genes. Plant Cell Environ. 2007, 30: 630-645. 10.1111/j.1365-3040.2007.01645.x.
Minglin L, Yuxiu Z, Tuanyao C: Identification of genes up-regulated in response to Cd exposure in Brassica juncea L. Gene. 2005, 363: 151-158. 10.1016/j.gene.2005.07.037.
Zhang Y, Shih DS: Isolation of an osmotin-like protein gene from strawberry and analysis of the response of this gene to abiotic stresses. J Plant Physiol. 2007, 164: 68-77. 10.1016/j.jplph.2006.02.002.
Simmons CR, Litts JC, Huang N, Rodriguez RL: Structure of a rice beta-glucanase gene regulated by ethylene, cytokinin, wounding, salicylic acid and fungal elicitors. Plant Mol Biol. 1992, 18: 33-45. 10.1007/BF00018454.
Kottapalli KR, Kottapalli P, Agrawal GK, Kikuchi S, Rakwal R: Recessive bacterial leaf blight resistance in rice: complexity, challenges and strategy. Biochem Biophys Res Commun. 2007, 355: 295-301. 10.1016/j.bbrc.2007.01.134.
Colmer TD: Long-distance transport of gases in plants: a perspective on internal aeration and radial oxygen loss from roots. Plant Cell Environ. 2003, 26: 17-36. 10.1046/j.1365-3040.2003.00846.x.
Malik AI, Colmer TD, Lambers , Schortemeyer : Aerenchyma formation and radial O2 loss along adventitious roots of wheat with only the apical root portion exposed to O2 deficiency. Plant Cell Environ. 2003, 26: 1713-1722. 10.1046/j.1365-3040.2003.01089.x.
Martinez-Perez E, Moore G: To check or not to check? The application of meiotic studies to plant breeding. Curr Opin Plant Biol. 2008, 11: 222-227.
Nonomura KI, Nakano M, Fukuda T, Eguchi M, Miyao A, Hirochika H, Kurata N: The novel gene HOMOLOGOUS PAIRING ABERRATION IN RICE MEIOSIS 1 of rice encodes a putative coiled-coil protein required for homologous chromosome pairing in meiosis. Plant Cell. 2004, 16: 1008-1020. 10.1105/tpc.020701.
Clauss K, Baumert A, Nimtz M, Milkowski C, Strack D: Role of a GDSL lipase-like protein as sinapine esterase in Brassicaceae. Plant J. 2008, 53: 802-813. 10.1111/j.1365-313X.2007.03374.x.
Salas JJ, Youssar L, Martinez-Force E, Garces R: The biochemical characterization of a high-stearic acid sunflower mutant reveals the coordinated regulation of stearoyl-acyl carrier protein desaturases. Plant Physiol Biochem. 2008, 46: 109-116. 10.1016/j.plaphy.2007.10.007.
Marri L, Trost P, Trivelli X, Gonnelli L, Pupillo P, Sparla F: Spontaneous assembly of photosynthetic supramolecular complexes as mediated by the intrinsically unstructured protein CP12. J Biol Chem. 2008, 283: 1831-1838. 10.1074/jbc.M705650200.
Martsinkovskaya AI, Poghosyan ZP, Haralampidis K, Murphy DJ, Hatzopoulos P: Temporal and spatial gene expression of cytochrome B5 during flower and fruit development in olives. Plant Mol Biol. 1999, 40: 79-90. 10.1023/A:1026417710320.
Schierup MH, Vekemans X: Genomic consequences of selection on self-incompatibility genes. Curr Opin Plant Biol. 2008, 11: 116-122. 10.1016/j.pbi.2008.01.003.
Jacquard C, Mazeyrat-Gourbeyre F, Devaux P, Boutilier K, Baillieul F, Clement C: Microspore embryogenesis in barley: anther pre-treatment stimulates plant defence gene expression. Planta. 2009, 229: 393-402. 10.1007/s00425-008-0838-6.
D'Angeli S, Altamura MM: Osmotin induces cold protection in olive trees by affecting programmed cell death and cytoskeleton organization. Planta. 2007, 225: 1147-1163. 10.1007/s00425-006-0426-6.
Zhang Y, Shih DS: Isolation of an osmotin-like protein gene from strawberry and analysis of the response of this gene to abiotic stresses. J Plant Physiol. 2007, 164: 68-77. 10.1016/j.jplph.2006.02.002.
Palusa SG, Golovkin M, Shin SB, Richardson DN, Reddy AS: Organ-specific, developmental, hormonal and stress regulation of expression of putative pectate lyase genes in Arabidopsis. New Phytol. 2007, 174: 537-550. 10.1111/j.1469-8137.2007.02033.x.
Opassiri R, Pomthong B, Akiyama T, Nakphaichit M, Onkoksoong T, Ketudata Cairns M, Ketudat Cains M: A stress-induced rice (Oryza sativa L.) beta-glucosidase represents a new subfamily of glycosyl hydrolase family 5 containing a fascin-like domain. Biochem J. 2007, 408: 241-249. 10.1042/BJ20070734.
Balzer HJ, Borisiuk L, Meyer HM, Matzk F, Baumlein H: A pollen allergen-encoding gene is expressed in wheat ovaries. Plant Mol Biol. 1996, 32: 435-445. 10.1007/BF00019095.
Jiang SY, Jasmin PX, Ting YY, Ramachandran S: Genome-wide identification and molecular characterization of Ole_e_I, Allerg_1 and Allerg_2 domain-containing pollen-allergen-like genes in Oryza sativa. DNA Res. 2005, 12: 167-179. 10.1093/dnares/dsi005.
Arai S, Matsumoto I, Emori Y, Abe K: Plant seed cystatins and their target enzymes of endogenous and exogenous origin. J Agric Food Chem. 2002, 50: 6612-6617. 10.1021/jf0201935.
Wilson ID, Barker GL, Lu C, Coghill JA, Beswick RW, Lenton JR, Edwards KJ: Alteration of the embryo transcriptome of hexaploid winter wheat (Triticum aestivum cv. Mercia) during maturation and germination. Funct Integr Genomics. 2005, 5: 144-154. 10.1007/s10142-005-0137-2.
Gu XY, Kainian SF, Foley ME: Dormancy genes from weedy rice respond divergently to seed development environments. Genetics. 2006, 172: 1199-1211. 10.1534/genetics.105.049155.
Delseny M, Bies-Etheve N, Carles C, Hull G, Vicient C, Raynal M, Grellet F, Aspart L: Late embryogenesis abundant (LEA) protein gene regulation during Arabidopsis seed maturation. J Plant Physiol. 2001, 158: 419-427. 10.1078/0176-1617-00353.
Tai HH, Tai GCC, Beardmore T: Dynamic histone acetylation of late embryonic genes during seed germination. Plant Mol Biol. 2005, 59: 909-925. 10.1007/s11103-005-2081-x.
Mrva K, Wallwork M, Mares DJ: α-Amylase and programmed cell death in aleurone of ripening wheat grains. J Exp Bot. 2006, 57: 877-885. 10.1093/jxb/erj072.
Karlin S, Altschul SF: Applications and statistics for multiple high-scoring segments in molecular sequences. Proc Natl Acad Sci USA. 1993, 90: 5873-5877. 10.1073/pnas.90.12.5873.
We would like to thank the members of the Wheat Genomics Consortium of Japan for supplying the original wheat RNAs and their kind support throughout this project. This work was supported by Grants-in-Aid for Scientific Research on Priority Areas in the field of Comparative Genomics, and the National BioResource Project from the Ministry of Education, Culture, Sports, Science and Technology of Japan.
The authors declare that they have no competing interests.
This study was conceived and directed by YO, MS and KS. The full-length cDNA library was constructed by AE, YT and CK under the guidance of JK and YH. Selection of cDNA clones and DNA sequencing were conducted by AE and YT with the help of KM, AT and YS. Data were analyzed by KK and KM. AT, YS, JK, YH, MS, KS and YO participated in the design and coordination of the study. All authors read and approved the final manuscript.
Kanako Kawaura, Keiichi Mochida contributed equally to this work.