Skip to main content
  • Research article
  • Open access
  • Published:

Deep sequencing reveals the complex and coordinated transcriptional regulation of genes related to grain quality in rice cultivars

Abstract

Background

Milling yield and eating quality are two important grain quality traits in rice. To identify the genes involved in these two traits, we performed a deep transcriptional analysis of developing seeds using both massively parallel signature sequencing (MPSS) and sequencing-by-synthesis (SBS). Five MPSS and five SBS libraries were constructed from 6-day-old developing seeds of Cypress (high milling yield), LaGrue (low milling yield), Ilpumbyeo (high eating quality), YR15965 (low eating quality), and Nipponbare (control).

Results

The transcriptomes revealed by MPSS and SBS had a high correlation co-efficient (0.81 to 0.90), and about 70% of the transcripts were commonly identified in both types of the libraries. SBS, however, identified 30% more transcripts than MPSS. Among the highly expressed genes in Cypress and Ilpumbyeo, over 100 conserved cis regulatory elements were identified. Numerous specifically expressed transcription factor (TF) genes were identified in Cypress (282), LaGrue (312), Ilpumbyeo (363), YR15965 (260), and Nipponbare (357). Many key grain quality-related genes (i.e., genes involved in starch metabolism, aspartate amino acid metabolism, storage and allergenic protein synthesis, and seed maturation) that were expressed at high levels underwent alternative splicing and produced antisense transcripts either in Cypress or Ilpumbyeo. Further, a time course RT-PCR analysis confirmed a higher expression level of genes involved in starch metabolism such as those encoding ADP glucose pyrophosphorylase (AGPase) and granule bound starch synthase I (GBSS I) in Cypress than that in LaGrue during early seed development.

Conclusion

This study represents the most comprehensive analysis of the developing seed transcriptome of rice available to date. Using two high throughput sequencing methods, we identified many differentially expressed genes that may affect milling yield or eating quality in rice. Many of the identified genes are involved in the biosynthesis of starch, aspartate family amino acids, and storage proteins. Some of the differentially expressed genes could be useful for the development of molecular markers if they are located in a known QTL region for milling yield or eating quality in the rice genome. Therefore, our comprehensive and deep survey of the developing seed transcriptome in five rice cultivars has provided a rich genomic resource for further elucidating the molecular basis of grain quality in rice.

Background

Rice is the staple food crop of more than 50% of the global population, and development of high yielding and high quality rice varieties is essential. Rice grain quality is assessed by its appearance and by its milling, cooking, eating, and nutritional quality [1–3]. Milling yield (the percentage of whole grain remaining after removal of the hulls and bran layers from paddy rice) is a very important characteristic that greatly affects profits for rice farmers. Milling yield or milling efficiency is determined based on the quality of the paddy rice, the milling equipment used and the skill of the mill operator. Milling yield is influenced by grain hardness, chalky area of the grain, grain size and shape, depth of surface ridges, bran thickness, and milling efficiency [4–7]. Agronomic and field managements also affect grain breakage during milling [5, 8, 9]. Rice eating quality is important because it determines the price of rice in the market. Eating quality is determined by water, protein, starch, and fat content [10–14]. Eating quality is negatively correlated with protein content, stickiness, and hardness of rice [10, 11]. The main factors affecting both eating and cooking quality of rice are amylose content, gel consistency and gelatinization temperature [12, 13, 15, 16]. Cooked rice with high amylose content is flaky, dry, hard and non-sticky while rice with low amylose content is sticky, moist, tender and glossy [12, 13]. Developing cultivars with high milling yield and eating quality have been the main objectives in rice breeding programs in the last few decades.

Milling yield and eating quality are complex traits controlled by quantitative trait loci (QTLs) [17]. In the last several years, many QTLs for eating quality have been mapped in the rice genome. For example, using chromosome segment substitution lines (CSSLs), Wan et al. [18] identified a total of 25 QTLs for nine eating quality traits. Many QTLs affecting different quality traits are mapped in the same chromosomal regions. Six QTLs are non-environment-specific and could be used for marker-assisted selection in rice quality improvement. Recently, Hao et al. [19] constructed 154 CSSLs for QTL mapping of quality traits. In that study, 10 QTLs for rice appearance traits and eight QTLs concerned with physico-chemical traits were detected. QTLs related to glossiness of cooked rice were identified in different genomic regions in Ilpumbyeo, a high grain quality rice in Korea [20]. The amylose content of rice is governed by the waxy (Wx) locus and mapped to chromosome 6 [21–23]. In contrast to the advances in genetic analysis of eating quality, less progress has been made on the genetic analysis of milling quality because the trait has low heritability and is sensitive to environmental factors [24, 25]. Another challenge for milling yield analysis is that many mapping populations for milling yield had varied kernel shape among the individual lines and heterogeneity in grain dimensions confounds the assessment of genetic effects [9, 24, 26–31]. Recently, a mapping study identified six QTLs responsible for head rice (milling) yield using recombinant inbred lines (RILs) derived from crosses of common parent Cypress (high milling) with RT0034 (low milling) and LaGrue (low milling) [9].

The molecular and biochemical basis of grain quality in cereals have been studied in the last decade, and the biochemical processes and many participating genes in the synthesis of starch [32–34], storage proteins [35–39], and lysine within the aspartate family amino acid pathway [40] have been characterized in rice and other cereals. However, how the expression of these genes is coordinated and regulated during grain filling is still poorly understood. Recently, Tian et al. [41] demonstrated that starch synthesis-related genes form a fine network to control eating and cooking qualities by regulating amylose content, gel consistency, and/or gelatinization temperature, and through genetic modification of any of these starch synthesis-related genes, eating and cooking quality can be improved in rice. The expression of 44 genes participating in three pathways (the synthesis of starch, storage proteins, and lysine) during rice grain filling were examined by RT-PCR in the maternal line 93-11 and in the super-hybrid rice line Liang-You-Pei-Jiu (LYP9) [3]. The analysis revealed diverse yet coordinated expression profiles of genes involved in the three pathways in developing seeds. These unique expression patterns of the quality-related genes may influence the final composition and property of starch, protein, and lysine synthesis in rice seeds.

Tools for whole-genome expression analysis like microarrays, serial analysis of gene expression (SAGE) and massively parallel signature sequencing (MPSS) have been widely used for transcriptome analysis in plants in last 10 years [42]. The sequencing-by-synthesis (SBS) second-generation sequencing method has been recently used for transcriptome analysis in many organisms because of its low cost and large sequencing output [43]. In this study, we used both MPSS and SBS to analyze the transcriptome of the developing rice seeds in five cultivars that differed in milling yield and eating quality. Many differentially expressed novel transcripts and genes involved in the biosynthesis of starch, aspartate family amino acids, and storage proteins were identified. Promoter analysis revealed the presence of hundreds of novel conserved patterns of cis regulatory elements in the up-regulated genes and putative co-expressed genes in the rice cultivars with high milling yield and good eating quality. Our comprehensive and deep survey of the developing seed transcriptome in five rice cultivars has provided an excellent starting material for further elucidating the molecular and biochemical basis of milling and eating quality in rice.

Results

Characteristics of the MPSS and SBS libraries and their matching to the rice genome and to EST and full-length cDNA databases

Both MPSS and SBS tags are short cDNA tags or digital gene expression tags, which are mainly derived from the 3' regions of a transcript [44]. About 1.0 to 1.3 million 17-base MPSS signatures and about 2.0 to 4.0 million 20-base SBS signatures were obtained in the 10 libraries (Table 1). These signatures were clustered and then processed with reliability and significance filters as described by Meyers et al. [45] (Additional File 1). For comparison of the expression levels across the libraries, the frequency of signatures in the individual libraries was normalized to one million (transcripts per million or TPM) [45]. The number of distinct signatures ranged from 12,000 to 18,000 in the MPSS libraries and from 77,000 to 165,000 in the SBS libraries. The SBS libraries contained two to three times more significant signatures (≥4 TPM) than the MPSS libraries. About 79 to 85% of the MPSS and 89 to 95% of the SBS significant signatures matched to the japonica (Nipponbare) genomic sequence (Table 1). The significant MPSS and SBS signatures from all five libraries were classified into seven classes based on their location on the annotated genes according to the method previously described by Meyers et al. [45] (Additional File 2).

Table 1 Characteristics of the MPSS and SBS libraries of developing rice seeds

Correlation of the transcriptomic results generated by the MPSS and SBS technologies

From 62 to 77% of the significant signatures overlapped between the MPSS and SBS libraries (Table 1). Further, we used all the significant signatures in the MPSS and SBS libraries of the same cultivar for Pearson correlation coefficient analysis. The correlation coefficient was low when unfiltered MPSS and SBS data were used (Table 2). Removal of a small fraction of outliers (3-8, < 0.001% of the signatures) increased the correlation coefficient significantly in all five libraries (Table 2). For example, the correlation coefficient between the two YR15965 libraries was increased from 0.58 to 0.90 after removal of only four of 5,757 signatures.

Table 2 Correlation of the transcriptome results obtained by the MPSS and SBS technologies

Expression patterns of grain quality-related genes in the cultivars with high milling yield and good eating quality

Data mining of the TIGR rice annotated genes (pseudomolecules version 5) identified 338 grain quality-related genes belonging to starch biosynthesis and degradation, seed storage protein synthesis (glutelin, globulin, and prolamins), seed maturation, seed allergen synthesis, seed development, and biosynthesis and degradation of aspartate family amino acids (aspartate, asparagine, threonine, isoleucine, methionine, and lysine). We examined the expression level of these genes in developing rice seeds of the five cultivars (Additional File 3). In both SBS and MPSS libraries, a total of 419 (16 grain-related genes) and 168 genes (3 grain-related genes) were ≥5-fold up- and down-regulated, respectively, in Cypress relative to both LaGrue and Nipponbare (Table 3). Similarly, 518 (8 grain-related genes) and 106 genes (4 grain-related genes) were ≥5-fold up- and down-regulated, respectively, in Ilpumbyeo relative to both YR15965 and Nipponbare (Table 3). The number of 5-fold up- and down-regulated antisense genes, genes with antisense transcripts, and genes encoding transcription factors (TFs) in Cypress (compared to both LaGrue and Nipponbare) and Ilpumbyeo (compared to both YR15965 and Nipponbare) are also listed in Table 3.

Table 3 The number of over five fold up- and down-regulated genes in Cypress in comparison to LaGrue and Nipponbare and in Ilpumbyeo in comparison to YR15965 and Nipponbare

Genes involved in starch metabolism

Many genes involved in starch metabolism showed similar expression patterns in both SBS and MPSS libraries (Table 4 and Additional File 4A). For example, the genes encoding 1,4-α-glucan branching enzyme (Os02g32660), limit dextrinase (Os04g08270), 1,4-α-glucan branching enzyme (Os06g51084), and α-amylase (Os09g29404) were 5-fold up-regulated in Cypress compared to LaGrue and Nipponbare in both SBS and MPSS libraries (Table 4).

Table 4 List of grain quality genes with similar expression patterns in both SBS and MPSS libraries and up-regulated over five fold in Cypress (in comparison to LaGrue and Nipponbare) and in Ilpumbyeo (in comparison to YR15965 and Nipponbare)

Interestingly, we found that genes encoding enzymes involved in the biosynthesis of starch underwent alternative splicing (Figure 1). For example, genes involved in the breakdown of long linear glucan leading to β-D-glucose-6-phosphate (Os03g55090 encoding phosphorylase and Os03g50480 encoding phosphoglucomutase) underwent alternative splicing in Ilpumbyeo and Cypress (Additional File 4A, 4B, 4C). Similarly, the genes encoding the α-amylases (Os09g29404/Os04g08270/Os04g33040/Os01g51754) and 1-4 α- glucan branching enzyme (Os06g51084) involved in the breakdown of short linear glucan leading to β-D-glucose underwent alternative splicing in Ilpumbyeo and Cypress (Additional File 4A, 4B, 4C). Some of the 5-fold up-regulated genes identified by either MPSS or SBS also had alternative splicing forms, and these included genes encoding glucose-1-phosphate adenylyltransferase large subunit 1 (also called AGPase) (Os01g44220) and 1,4-α-glucan branching enzyme (Os06g51084). Similarly, some of the 5-fold down-regulated genes identified by either MPSS or SBS produced alternative splicing forms, and these included genes encoding 1,4-α-glucan branching enzyme (Os06g51084) and phosphoglucomutase (Os03g50480) (Additional File 4A, 4B, 4C). These results showed the complexity of the transcription of quality-related genes in developing rice seeds.

Figure 1
figure 1

Network of genes involved in starch biosynthesis and degradation, and in the biosynthesis of seed storage, seed maturation, and allergenic proteins http://www.gramene.org. Only the genes with 5-fold up- or down-regulation in Cypress (PSC) or Ilpumbyeo (PSI) compared with that in LaGrue or YR15965 are shown. The positive number in parenthesis indicates up-regulation and the negative number in parenthesis indicates down-regulation. The first value in parenthesis shows the fold change in expression either in LaGrue or YR15965, and the second value shows the fold change in expression in Nipponbare. The italicized and underlined bold number before the parenthesis shows the MPSS/SBS signature class [45]. Green indicates that the gene was identified by SBS only. Red indicates that the gene was identified by MPSS only. Blue indicates that the gene was identified by both MPSS and SBS.

For validation of the MPSS data, two starch biosynthesis-related genes that showed differential expression in the grain libraries were selected for strand specific RT-PCR. These two genes encode AGPase (AK073146) and GBSS I (AK070431). Total RNA was isolated from the developing seeds of Cypress, LaGrue, Ilpumbyeo, YR15965 and Nipponbare at 3, 6, 9, 12 and 15 DAF (days after flowering). A time-course study of the AGPase and GBSS I genes indicated that expression levels were higher in the high milling Cypress than in the low milling LaGrue in the early stages (6 and 9 DAF) of seed development (Figure 2).

Figure 2
figure 2

RT-PCR analysis of the genes encoding GBSS I and AGPase in developing rice seeds at 3, 6, 9, 12, and 15 days after anthesis in five rice cultivars.

Genes encoding essential amino acids

The aspartate family pathway consists of five amino acids (asparagine, aspartate, lysine, methionine, and threonine), and is catalysed primarily by the enzymes aspartate kinase (AK) and dihydrodipicolinate synthase (DHPS). The regulatory network of the genes involved in the biosynthesis and degradation of aspartate family amino acids is plotted in Additional File 5. The genes involved in the metabolism of the aspartate family amino acids with 5-fold up- or down-regulation in Cypress and Ilpumbyeo compared to their controls (LaGrue, YR15965, and Nipponbare) are listed in Additional File 4A, 4B, 4C, and Additional File 5). Some of the important genes for amino acid biosynthesis showed similar expression patterns in both MPSS and SBS libraries (Table 4 and Additional File 4A, 4B, 4C). For example, the genes encoding aspartate transaminase (Os01g55540), methionine adenosyltransferase (Os01g22010), and acetolactate synthase (Os03g21080) were 5-fold up-regulated in Ilpumbyeo compared to YR15965 and Nipponbare in both SBS and MPSS libraries. In contrast, some of the genes involved in aspartate family amino acid biosynthesis were down-regulated, including those encoding threonine synthase (Os01g49890), aspartate kinase (Os03g63330), and malate dehydrogenase (Os10g33800) (Additional File 4A, 4B, 4C). In addition, many of the genes involved in the amino acid biosynthesis also underwent alternative splicing. Among them, some showed 5-fold up-regulation in Ilpumbyeo in either the MPSS or SBS libraries, and these included genes encoding L-3-cyanoalanine synthase (Os04g08350), methionine gamma-lyase (Os09g28050), and asparaginase (Os04g46370), which showed two, two, and three alternative splice forms, respectively (Additional File 4A, 4B, 4C).

Genes encoding seed-storage proteins

The major classes of storage proteins are glutelins, globulins, and prolamins. Some of the genes encoding these classes showed over 5-fold up-regulation in Cypress compared to LaGrue and Nipponbare, and these genes included those encoding glutelin type-B7 precursors (Os02g15070, Os02g15090), globulin-1 S allele precursor (Os03g46100), prolamin PPROL 17 precursor (Os06g31070), and 13 kDa prolamin precursor (Os07g10570) (Table 4 and Additional File 4A). Among the storage-protein genes with over 5-fold up-regulation in Cypress, some produced antisense transcripts like those encoding glutelin type-B7 precursor (Os02g15070, Os02g15090), prolamin PPROL 17 precursor (Os06g31070), and 13 kDa prolamin precursor (Os07g10570) (Additional File 4A). Many genes encoding glutelins and prolamins also underwent alternative splicing or termination in Cypress and Ilpumbyeo. The gene encoding glutelin type-A 2 (Os02g25640) produced 15 and 17 alternative splice forms in MPSS and SBS libraries, and most of them were up-regulated in Cypress but down-regulated in Ilpumbyeo. Among the prolamin-related genes, the prolamin precursor protein gene (Os07g10570) produced five and six alternative splice forms in MPSS and SBS libraries, respectively. The 5-fold induced or suppressed genes encoding globulin, prolamin, and glutelin storage proteins either in Cypress or Ilpumbyeo or both are listed in Additional File 4A and in Figure 1.

Genes encoding seed maturation and allergenic and seed-specific expression proteins

Some of the genes belonging to this group showed similar expression patterns in both MPSS and SBS libraries (Table 4 and Additional File 4A). For example, the genes encoding seed-specific protein Bn15D14A (Os03g58480) and seed-maturation protein LEA4 (Os09g10620) were > 5-fold up-regulated in Cypress compared to LaGrue and Nipponbare in both MPSS and SBS libraries. However, the seed-allergenic protein RA5 precursor gene (Os07g11510) was up-regulated 15-fold in Ilpumbyeo compared to YR15965 (Table 4; Additional File 4A).

Expression patterns of TF genes in cultivars with high milling and good eating quality

TFs were identified using homology search in the rice TF database http://plntfdb.bio.uni-potsdam.de/v3.0/. Clustering analysis was performed to identify TF genes up- and down-regulated in Cypress and Ilpumbyeo compared to the controls (Table 3; Additional File 6). A total of 37 and 14 TF genes showed 5-fold up-regulation in Cypress and Ilpumbyeo libraries, respectively, in both SBS and MPSS libraries (Additional File 6). Similarly, 50 and 5 TF genes were down-regulated in Cypress and Ilpumbyeo, respectively, in both libraries. Some TFs were specifically up-regulated in either Cypress or Ilpumbyeo compared to the controls in both libraries. These TF genes encode PHD-finger family protein (PHD family; Os01g65600), zinc finger CCCH type domain containing protein ZFN-like 2 (C3H family; Os01g68860), transfactor (G2-like; Os06g40710), and bZIP transcription factor family protein (bZIP family; Os06g45140) (Additional File 6).

Identification of the conserved cis motifs among the up-regulated genes in cultivars with high milling and good eating quality

The promoter sequences (1.0 kb before the ATG site) of the highly up-regulated genes (≥50-fold) in Cypress (compared to LaGrue and Nipponbare) and Ilpumbyeo (compared to YR15965 and Nipponbare) identified in both SBS and MPSS libraries were analyzed using the 'PLACE Signal Scan Search' software http://www.dna.affrc.go.jp/htdocs/PLACE/. Many conserved motifs were present in the up-regulated genes in Cypress and Ilpumbyeo, and these included CAATBOX1, WRKY71OS, GATABOX, EBOXBNNAPA, SEF4MOTIFGM7S, CGACGOSAMY3, WBOXHVISO1, CAREOSREP1, CANBNNAPA, AMYBOX1, AACACOREOSGLUB1, BOXIIPCCHS, 2SSEEDPROTBANAPA, ACGTABOX, AMYBOX2, ACGTCBOX, ACGTOSGLUB1, CEREGLUBOX2PSLEGA, and GADOWNAT (Additional File 7). Interestingly, many of the motifs have been reported to play a role in seed development and germination (Additional File 7) [46–70].

Discussion

Rice is a major source of nutrition for most people in the developing world. Although tremendous achievements have been made for the improvement of many agronomic traits in rice in the last three decades, much less progress has been obtained for quality traits due to the lack of simple and efficient selection methods in rice breeding. With rapid advancement in crop molecular breeding, marker-aided selection has been successfully applied in many crop plants. Similarly, new methods for genetic engineering of better crop plants have been reported in the last decade by overexpressing or gene silencing of candidate genes. Although several eating quality QTLs have been identified in previous studies [18, 19], it is not clear whether these QTLs are useful for marker-aided selection or not because the genomic regions of these QTLs have not been further characterized. Recently Nelson et al. [9] identified six main-effect milling yield QTLs in the two RIL populations derived from crosses of common parent Cypress with RT0034, a low-milling yield japonica line and LaGrue, a low-milling yield japonica cultivar, respectively. In this study, we used two high throughput sequencing technologies to profile the transcriptome of five cultivars differing in milling yield and eating quality. Many genes specifically or commonly expressed in the high milling yield cultivar Cypress and the good eating quality cultivar Ilpumbyeo were identified from the MPSS and SBS libraries. These candidate genes are excellent starting materials for the development of molecular markers linked to milling quality in the US and eating quality in Korea for rice breeding. It is also possible that overexpression or silencing of some candidate genes will lead to the generation of transgenic rice plants with superior grain quality.

During the rice seed development, sugars, amino acids, and other important metabolites are transported from source (primarily leaves) to sink (seeds). Once in the seeds, these metabolites are allocated to different biosynthetic pathways (primarily starch metabolism and storage protein biosynthesis) to produce mainly starch and proteins in precise quantities and ratios. Achieving such a defined composition of starch and proteins require the regulation and coordination of various pathways so that, at each developmental stage, the participating enzymes are present in appropriate amounts and in the correct cellular compartments [3]. AGPase and GBSS I play important roles during starch biosynthesis in rice [71]. The genes encoding for AGPase and GBSS I enzymes are highly expressed 7 to 28 days after flowering during grain development, and their expression is highly correlated with the increases in both starch content and grain weight. The AGPase gene is also highly expressed in the high-yield cultivars of both glutinous and non-glutinous rice [71]. In addition, AGPase (Os01g44220) undergoes alternative splicing similar to the AGPase small subunit gene in barley [72]. Duan and Sun [3] showed that a mutation in the GBSS I gene leads to a lower level of functional GBSS I mRNA and correspondingly to a lower level of GBSS I enzyme for amylose synthesis, which causes a reduction in amylose accumulation. During rice seed formation, the genes encoding AGPases are active 3 days before flowering and maintain an intermediate although declining level of activity during seed maturation [3]. Genetic variation survey showed that the polymorphism in the rice waxy gene encoding the GBSS enzyme explains much of the variation in apparent amylose content across 92 important long, medium and short grain US rice cultivars and 101 progeny of a cross between low-amylose and intermediate-amylose breeding lines [73, 74]. The amylose content and the level of waxy protein in 31 rice cultivars from China were correlated with the ability of the cultivar to excise intron I from the leader sequence of the Wx transcript [75]. In this study, we found that the important starch biosynthesis related genes encoding AGPase (Os01g44220), 1,4-α-glucan branching enzyme (Os02g32660), limit dextrinase (Os04g08270), 1,4-α-glucan branching enzyme (Os06g51084), and α-amylase (Os09g29404) were up-regulated in Cypress compared to LaGrue and Nipponbare in six-days old developing seeds. Our time-course RT-PCR analysis also confirmed that expression of AGPase and GBSS I genes was higher in the high milling cultivar Cypress than in the low milling cultivar LaGrue early (6 and 9 DAF) in seed development. These results suggest that these two genes related to starch synthesis may greatly affect milling yield. Starch biosynthesis is also associated with complex genotypic-environmental interactions in maize endosperm [76]. Since the plants in this study were grown in the controlled environmental conditions (growth chambers), the effect of environmental factors on the expression of the starch biosynthesis genes should be tested in the field conditions.

Cereal proteins are generally deficient in lysine, but lysine content might be increased with increased accumulation of the precursor molecules required for the enzymatic reactions involved in lysine metabolism. The key precursor molecules include lactate, acetyl CoA, malate, L-aspartate, L-asparagine, L-aspartate-semialdehyde, homoserine, homocysteine, 2-oxobutanoate, 2-aceto-1-hydroxybutyrate, and α-ketoglutarate, and the enzymes involved in their production are very important (Additional File 5). Enhancing the production of these precursor molecules will require the identification of the genes encoding these enzymes. In this study, we found that the genes encoding malate dehydrogenase (Os03g56280, Os01g46070) and aminotransferase (Os09g28050, Os03g18810) involved in the production of malate and aspartate in Cypress and Ilpumbyeo, respectively, were up-regulated compared to the controls. Genes encoding aspartate transaminase (Os01g55540) and enoyl-CoA hydratase (Os02g43720) enzymes, which are responsible for the production of acetyl CoA, were also up-regulated in Cypress compared to the controls. Similarly, the gene encoding lactoylglutathione lyase (Os05g07940), which is responsible for the production of lactate, was up-regulated in Cypress compared to the controls. As indicated, genetic manipulation of the expression levels of these precursors/enzymes may lead to an increased accumulation of lysine in the endosperm and thus an increased nutritional value of the rice seeds.

In the last decade, oligoarrays, SAGE, MPSS, and SBS have been widely used for transcriptome profiling. MPSS and SBS have been recently used for whole-genome transcription analysis and have generated abundant expression data for many organisms [42, 44, 45]. In this study, both MPSS and SBS technologies were used to analyze the transcriptomes of the 6-days-old developing seeds in five rice cultivars. The number of redundant and non-redundant signatures generated in this study were similar to those in previous reports in rice and Arabidopsis[43, 45, 77]. Although MPSS generates large volume of data, its complicated library-construction procedure and high sequencing cost limit its use in individual laboratories. As the cost of the next-generation sequencing methods has significantly decreased in the last few years, SBS sequencing has become a popular method for transcriptome analysis because it costs 90% less than MPSS and can generate at least three times more transcripts. Furthermore, in the current study, about 30% more transcripts were found in the SBS library than in the MPSS library. Many of these additional signatures are low-copy transcripts, indicating that SBS is a powerful method for identifying rare transcripts [43]. The correlation coefficient is higher between MPSS and SBS than between RL-SAGE and microarray [78], or between RL-SAGE and MPSS or MPSS and microarrays as in previous studies [79]. Therefore, SBS will undoubtedly become the preferred high throughput sequencing method for deep transcriptome analysis in plants.

Conclusion

Breeding for milling yield and eating quality in rice has been a daunting task due to the low genetic inheritability of both traits and the lack of molecular markers linked to the phenotypes. Genetic mapping of the two traits is also challenging because the traits are easily affected by environmental factors in the field. Using two high throughput sequencing methods, we identified many differentially expressed genes in developing rice seeds that may affect milling yield or eating quality. Many of the identified genes are involved in the biosynthesis of starch, aspartate family amino acids, and storage proteins. Some of these potential candidate genes could be used for the development of molecular markers for breeding programs or for the engineering of rice cultivars with high milling yield and eating quality. Our study provides a valuable genomic resource for both improvement of rice grain quality and for the characterization of grain quality pathways at the molecular and biochemical levels.

Methods

Plant materials, developing seeds harvest and growth conditions

Five rice cultivars including Cypress, LaGrue, Ilpumbyeo, YR15965, and Nipponbare were used in the study. Cypress (japonica cultivar) is a long grain cultivar with high yield and high milling quality released by Louisiana State University. Cypress dries down slowly in the field, avoiding grain fissuring, cracking and chalkiness that reduce milling quality http://agebb.missouri.edu/rice/research/99/pg5.htm[80–84]. LaGrue (japonica cultivar), a long grain variety released by the University of Arkansas in 1993, has low milling quality [80–84]. Both Cypress and LaGrue seeds were provided by Dr. Robert Fjellstrom, USDA-ARS Dale Bumpers National Rice Research Center, Stuttgart, Arkansas, USA. Ilpumbyeo (japonica cultivar) is a good eating quality cultivar with low amylose content [85–87]. YR15965 (japonica cultivar) is a low eating quality rice, derived from a cross between Hwayeongbyeo (temperate japonica variety) and Shennung 89-366 (sub-tropical japonica) [86]. Both Ilpumbyeo and YR15965 seeds were provided by Dr. Gynheung An, Crop Biotech Institute, Kyung Hee University, Korea. Nipponbare (japonica cultivar) was used as a control for milling and eating quality with Cypress, LaGrue, Ilpumbyeo and YR15965. All the five cultivars were grown in 3 replications in a Conviron growth chamber at 80% relative humidity with 12 h of light (500 μmol photons m-2 sec-1) at 26°C followed by 12 h of dark at 20°C. The spikelets were labeled on the day of anthesis to identify the age of developing seeds in a panicle. The developing seeds were harvested from the panicles at 3, 6, 9, 12 and 15 D after anthesis. The excised developing seeds from the panicle were freezed immediately in liquid nitrogen.

RNA isolation and RT-PCR

Total RNA was isolated from developing rice seeds harvested from Cypress, LaGrue, Ilpumbyeo, YR15965 and Nipponbare plants using Trizol reagent (Invitrogen). For removal of polysaccharides/polyglycons from the extract, the extracted RNA was purified twice by high salt precipitation according to the manufacturer's instructions. For the MPSS and SBS library construction, RNA isolated from the 6-days (D)-old developing seeds (intermediate stage of grain filling) was used. For the time-course RT-PCR validation experiments, RNA isolated at 3, 6, 9, 12 and 15 D old developing seeds was used. RT-PCR was performed as described previously [78].

MPSS and SBS library construction, sequencing, and bioinformatics

MPSS and SBS libraries were constructed using the RNA obtained from 6 days old developing seeds from Cypress (MPSS library-PSC; SBS library-SPSC), LaGrue (MPSS library-PSL; SBS library-SPSL), Ilpumbyeo (MPSS library-PSI; SBS library-SPSI), YR15965 (MPSS library PSY; SBS library-SPSY) and Nipponbare (MPSS library-PSN; SBS library-SPSN). MPSS and SBS library construction and sequencing were performed essentially as previously described [43, 45, 77]. Data analysis was carried out to identify the genes responsible for milling quality and eating quality. The expression profiles of Cypress were compared with that of LaGrue and Nipponbare to identify the genes responsible for milling quality. Similarly, the expression profiles of Ilpumbyeo were compared with that of YR15965 and Nipponbare to identify the genes responsible for eating quality. Bioinformatic analyses including identification of antisense transcripts, alternate transcripts, and TFs were conducted as previously described [43]. Gramene database http://www.gramene.org was used as a reference database for the identification of genes involved in starch metabolism, aspartate amino acid metabolism, storage and allergenic protein synthesis, and seed maturation [88]. The entire dataset is available at the NCBI's Gene Expression Omnibus (GEO) database through the accession number GSM629225 to GSM629233

References

  1. Juliano BO: Rice in human nutrition. FAO Food and Nutrition Series, No. 26. 1993, Publication Division. FAO of the United Nations, Rome, 35-84.

    Google Scholar 

  2. Wang Z, Gu YJ, Chen G, Xiong F, Li YX: Rice quality and its affecting factors. Mol Plant Breeding. 2003, 1: 231-241.

    Google Scholar 

  3. Duan M, Sun SS: Profiling the expression of genes controlling rice grain quality. Plant Mol Biol. 2005, 59 (1): 165-78. 10.1007/s11103-004-7507-3.

    CAS  PubMed  Google Scholar 

  4. van Ruiten HTL: Rice milling: an overview. Rice: chemistry and technology. Edited by: Juliano BO. 1985, The American Association of Cereal Chemists, St. Paul, 349-388. 2

    Google Scholar 

  5. Webb BD: Criteria of rice quality in the United States. Rice: Chemistry and Technology. Edited by: Juliano BO. 1985, The American Association of Cereal Chemists, St. Paul, 403-442. 2

    Google Scholar 

  6. Siebenmorgen TJ, Meullenet JF: Impact of drying, storage, and milling of rice quality and functionality. Rice: chemistry and technology. Edited by: Champagne ET. 2004, The American Association of Cereal Chemists, St. Paul, 301-328. 3

    Google Scholar 

  7. Zheng TQ, Xu JL, Li ZK, Zhai HQ, Wan JM: Genomic regions associated with milling quality and grain shape identified in a set of random introgression lines of rice (Oryza sativa L.). Plant Breed. 2007, 126: 158-163. 10.1111/j.1439-0523.2007.01357.x.

    CAS  Google Scholar 

  8. Kepiro JL, McClung AM, Chen MH, Yeater KM, Fjellstrom RG: Mapping QTLs for milling yield and grain characteristics in a tropical japonica long grain cross. J Cereal Sci. 2008, 48: 477-485. 10.1016/j.jcs.2007.12.001.

    CAS  Google Scholar 

  9. Nelson JC, McClung AM, Fjellstrom RG, Moldenhauer KAK, Boza E, Jodari F, Oard JH, Linscombe S, Scheffler BE, Yeater KM: Mapping QTL main and interaction influences on milling quality in elite US rice germplasm. Theor Appl Genet. 2011, 122: 291-309. 10.1007/s00122-010-1445-z.

    CAS  PubMed  Google Scholar 

  10. Juliano BO, Onate LU, Mundo AM: Relation of starch composition, protein content, and gelatinization temperature to cooking and eating qualities of milled rice. Food Technol. 1965, 19: 1006-1011.

    CAS  Google Scholar 

  11. Ishima T, Taira H, Taira H, Mikoshiba K: Effect of nitrogenous fertilizer application and protein content in milled rice on organleptic quality of cooked rice. Rep Nat Food Res Inst. 1974, 29: 9-15.

    Google Scholar 

  12. Juliano BO: A simplified assay for milled-rice amylose. Cereal Sci Today. 1971, 16: 334-340.

    Google Scholar 

  13. Juliano BO: The rice caryopsis and its composition. Rice: Chemistry and Technology. Edited by: Houston DF. 1985, American Assoc. Cereal Chemists Inc. St. Paul, 17-74.

    Google Scholar 

  14. Liu W, Zeng J, Jiang G, He Y: QTLs identification of crude fat content in brown rice and its genetic basis analysis using DH and two backcross populations. Euphytica. 2009, 169 (2): 197-205. 10.1007/s10681-009-9922-7.

    CAS  Google Scholar 

  15. Cagampang GB, Perez CM, Juliano BO: A gel consistency test for the eating quality of rice. J Sci Food Agric. 1973, 24: 1589-1594. 10.1002/jsfa.2740241214.

    CAS  PubMed  Google Scholar 

  16. Bao JS, Sun M, Corke H: Analysis of genetic behavior of some starch properties in indica rice (Oryza sativa L.): thermal properties, gel texture, swelling volume. Theor Appl Genet. 2002, 104: 408-413. 10.1007/s001220100688.

    CAS  PubMed  Google Scholar 

  17. Yano M, Sasaki T: Genetic and molecular dissection of quantitative traits in rice. Plant Mol Biol. 1997, 35: 145-153. 10.1023/A:1005764209331.

    CAS  PubMed  Google Scholar 

  18. Wan XY, Wan JM, Su CC, Wang CM, Shen WB, Li JM, Wang HL, Jiang L, Liu SJ, Chen LM, et al: QTL detection for eating quality of cooked rice in a population of chromosome segment substitution lines. Theor Appl Genet. 2004, 110: 71-79. 10.1007/s00122-004-1744-3.

    CAS  PubMed  Google Scholar 

  19. Hao W, Zhu MZ, Gao JP, Sun SY, Lin HX: Identification of quantitative trait loci for rice quality in a population of chromosome segment substitution lines. J Integr Plant Biol. 2009, 51 (5): 500-12. 10.1111/j.1744-7909.2009.00822.x.

    CAS  PubMed  Google Scholar 

  20. Cho YC, Hong HC, Sub JP, Jeong YP, Choi IS, Kim MK, Kim YG, Choi HC, Hwang HG: QTL mapping for grain quality and shape in japonica × javanica in rice. Korean J Breeding. 2004, 36 (1): 408-409.

    Google Scholar 

  21. Tan YF, Li JX, Yu SB, Xing YZ, Xu CG, Zhang QF: The three important traits for cooking and eating qualities of rice grain are controlled by a single locus. Theor Appl Genet. 1999, 99: 642-648. 10.1007/s001220051279.

    CAS  PubMed  Google Scholar 

  22. Septiningsih EM, Trijatmiko KR, Moeljopawiro S, McCouch SR: Identification of quantitative trait loci for grain quality in an advanced backcross population derived from the Oryza sativa variety IR64 and the wild relative O. rufipogon. Theor Appl Genet. 2003, 107: 1433-1441. 10.1007/s00122-003-1376-z.

    CAS  PubMed  Google Scholar 

  23. Zhou PH, Tan YF, He YQ, Xu CG, Zhang Q: Simultaneous improvement for four quality traits of Zhenshan 97, an elite parent of hybrid rice, by molecular marker-assisted selection. Theor Appl Genet. 2003, 106: 326-331.

    CAS  PubMed  Google Scholar 

  24. Ordonez SA, Silva J, Oard JH: Association mapping of grain quality and flowering time in elite japonica rice germplasm. J Cereal Sci. 2010, 51: 337-343. 10.1016/j.jcs.2010.02.001.

    Google Scholar 

  25. Li ZF, Wan JM, Xia JF, Zhai HQ, Ikehashi H: Identification of quantitative trait loci underlying milling quality of rice (Oryza sativa) grains. Plant Breeding. 2004, 123: 229-234. 10.1111/j.1439-0523.2004.00977.x.

    CAS  Google Scholar 

  26. Tan YF, Sun M, Xing YZ, Hua JP, Sun XL, Zhang QF, Corke H: Mapping quantitative trait loci for milling quality, protein content and color characteristics of rice using a recombinant inbred line population derived from an elite rice hybrid. Theor Appl Genet. 2001, 103: 1037-1045. 10.1007/s001220100665.

    CAS  Google Scholar 

  27. Mei H, Luo L, Guo L, Wang Y, Yu X, Ying C, Li Z: Molecular mapping of QTLs for rice milling yield traits. Acta Genet Sin. 2002, 29: 791-797.

    CAS  PubMed  Google Scholar 

  28. Aluko G, Martinez C, Tohme J, Castano C, Bergman C, Oard JH: QTL mapping of grain quality traits from the interspecific cross Oryza sativa × O. glaberrima. Theor Appl Genet. 2004, 109: 630-639. 10.1007/s00122-004-1668-y.

    CAS  PubMed  Google Scholar 

  29. Dong Y, Tsuzuki E, Lin D, Kamiunten H, Terao H, Matsuo M, Cheng S: Molecular genetic mapping of quantitative trait loci for milling quality in rice (Oryza sativa L.). J Cereal Sci. 2004, 40 (2): 109-114. 10.1016/j.jcs.2004.04.008.

    CAS  Google Scholar 

  30. Jiang GH, Hong XY, Xu CG, Li XH, He YQ: Identification of Quantitative Trait Loci for Grain Appearance and Milling Quality Using a Doubled-Haploid Rice Population. J Integr Plant Biol. 2005, 47 (11): 1391-1403. 10.1111/j.1744-7909.2005.00089.x.

    Google Scholar 

  31. Lou J, Chen L, Yue G, Lou Q, Mei H, Xiong L, Luo L: QTL mapping of grain quality traits in rice. J Cereal Sci. 2009, 50: 145-151. 10.1016/j.jcs.2009.04.005.

    CAS  Google Scholar 

  32. Ball SG, van de Wal MHBJ, Visser RGF: Progress in understanding the biosynthesis of amylose. Trends Plant Sci. 1998, 3: 462-467. 10.1016/S1360-1385(98)01342-9.

    Google Scholar 

  33. Myers AM, Morell MK, James MG, Ball SG: Recent progress towards understanding biosynthesis of the amylopectin crystal. Plant Physiol. 2000, 122: 989-997. 10.1104/pp.122.4.989.

    CAS  PubMed  PubMed Central  Google Scholar 

  34. James MG, Denyer K, Myers AM: Starch synthesis in the cereal endosperm. Curr Opin Plant Biol. 2003, 6: 215-222. 10.1016/S1369-5266(03)00042-6.

    CAS  PubMed  Google Scholar 

  35. Kim WT, Li X, Okita TW: Expression of storage protein multigene families in developing rice. Plant Cell Phyisol. 1993, 34: 595-603.

    CAS  Google Scholar 

  36. Muntz K: Deposition of storage protein. Plant Mol Biol. 1998, 38: 77-99. 10.1023/A:1006020208380.

    CAS  PubMed  Google Scholar 

  37. Choi SB, Wang C, Muench DG, Ozawa K, Franceschi VR, Wu L, Okita TW: Messenger RNA targeting of rice seed storage proteins to specific ER subdomains. Nature. 2000, 407: 765-767. 10.1038/35037633.

    CAS  PubMed  Google Scholar 

  38. Shewry PR, Halford NG: Cereal seed storage proteins: structures, properties and role in grain utilization. J Exp Bot. 2002, 53: 947-958. 10.1093/jexbot/53.370.947.

    CAS  PubMed  Google Scholar 

  39. Zhou ZK, Robards K, Helliwell S, Blanchard C: Composition and functional properties of rice. Intl J Food Technol. 2002, 37: 849-868. 10.1046/j.1365-2621.2002.00625.x.

    CAS  Google Scholar 

  40. Azevedo RA, Arruda P, Turner WL, Lea PJ: The biosynthesis and metabolism of the aspartate derived amino acids in higher plants. Phytochem. 1997, 46: 395-419. 10.1016/S0031-9422(97)00319-1.

    CAS  Google Scholar 

  41. Tian Z, Qian Q, Liu Q, Yan M, Liu X, Yan C, Liu G, Gao Z, Tang S, Zeng D, Wang Y, Yu J, Gu M, Li J: Allelic diversities in rice starch biosynthesis lead to a diverse array of rice eating and cooking qualities. Proc Natl Acad Sci, USA. 2009, 106 (51): 21760-21765. 10.1073/pnas.0912396106.

    CAS  PubMed  Google Scholar 

  42. Vega-Sanchez ME, Gowda M, Wang G-L: Tag-based approaches for deep transcriptome analysis in plants. Plant Science. 2007, 173: 371-380. 10.1016/j.plantsci.2007.07.005.

    CAS  Google Scholar 

  43. Venu RC, Sheshu Madhav M, Sreerekha MV, Nobuta K, Zhang Y, Carswell P, Boehm MJ, Meyers BC, Korth KL, Wang G-L: Deep and Comparative Transcriptome Analysis of Rice Plants Infested by the Beet Armyworm (Spodoptera exigua) and Water Weevil (Lissorhoptrus oryzophilus). Rice. 2010, 3: 22-35. 10.1007/s12284-010-9037-8.

    Google Scholar 

  44. Simon SA, Zhai J, Nandety RS, McCormick , et al: Short-Read Sequencing Technologies for Transcriptional Analyses. Annu Rev Plant Biol. 2009, 60: 305-33. 10.1146/annurev.arplant.043008.092032.

    CAS  PubMed  Google Scholar 

  45. Meyers BC, Tej SS, Vu TH, Haudenschild CD, Agrawal V, Edberg SB, et al: The use of MPSS for whole-genome transcriptional analysis in Arabidopsis. Genome Res. 2004, 14: 1641-53. 10.1101/gr.2275604.

    CAS  PubMed  PubMed Central  Google Scholar 

  46. Shirsat A, Wilford N, Croy R, Boulter D: Sequences responsible for the tissue specific promoter activity of a pea legumin gene in tobacco. Mol Gen Genet. 1989, 215: 326-331. 10.1007/BF00339737.

    CAS  PubMed  Google Scholar 

  47. Rogers HJ, Bate N, Combe J, Sullivan J, Sweetman J, Swan C, Lonsdale DM, Twell D: Functional analysis of cis-regulatory elements within the promoter of the tobacco late pollen gene g10. Plant Mol Biol. 2001, 45: 577-585. 10.1023/A:1010695226241.

    CAS  PubMed  Google Scholar 

  48. Zhang ZL, Xie Z, Zou X, Casaretto J, Ho TH, Shen QJ: A rice WRKY gene encodes a transcriptional repressor of the gibberellin signaling pathway in aleurone cells. Plant Physiol. 2004, 134: 1500-1513. 10.1104/pp.103.034967.

    CAS  PubMed  PubMed Central  Google Scholar 

  49. Rubio-Somoza I, Martinez M, Abraham Z, Diaz I, Carbonero P: Ternary complex formation between HvMYBS3 and other factors involved in transcriptional control in barley seeds. Plant J. 2006, 47: 269-281. 10.1111/j.1365-313X.2006.02777.x.

    CAS  PubMed  Google Scholar 

  50. Stalberg K, Ellerstom M, Ezcurra I, Ablov S, Rask L: Disruption of an overlapping E-box/ABRE motif abolished high transcription of the napA storage-protein promoter in transgenic Brassica napus seeds. Planta. 1996, 199: 515-519. 10.1007/BF00195181.

    CAS  PubMed  Google Scholar 

  51. Filichkin SA, Leonard JM, Monteros A, Liu PP, Nonogaki H: A novel endo-beta-mannanase gene in tomato LeMAN5 is associated with anther and pollen development. Plant Physiol. 2004, 134: 1080-1087. 10.1104/pp.103.035998.

    CAS  PubMed  PubMed Central  Google Scholar 

  52. O'Neill SD, Kumagai MH, Majumdar A, Huang N, Sutliff TD, Rodriguez RL: The alpha-amylase genes in Oryza sativa: Characterization of cDNA clones and mRNA expression during seed germination. Mol Gen Genet. 1990, 221: 235-244.

    CAS  PubMed  Google Scholar 

  53. Lessard PA, Allen RD, Bernier F, Crispino JD, Fujiwara T, Beachy RN: Multiple nuclear factors interact with upstream sequences of differentially regulated beta-conglycinin genes. Plant Mol Biol. 1991, 16: 397-413. 10.1007/BF00023991.

    CAS  PubMed  Google Scholar 

  54. Hwang YS, Karrer EE, Thomas BR, Chen L, Rodriguez RL: Three cis-elements required for rice alpha-amylase Amy3D expression during sugar starvation. Plant Mol Biol. 1998, 36: 331-341. 10.1023/A:1005956104636.

    CAS  PubMed  Google Scholar 

  55. Sun C, Palmqvist S, Olsson H, Boren M, Ahlandsberg S, Jansson C: A novel WRKY transcription factor, SUSIBA2, participates in sugar signaling in barley by binding to the sugar-responsive elements of the iso1 promoter. Plant Cell. 2003, 15: 2076-2092. 10.1105/tpc.014597.

    CAS  PubMed  PubMed Central  Google Scholar 

  56. Nakashima K, Fujita Y, Katsura K, Maruyama K, Narusaka Y, Seki M, Shinozaki RAK, Yamaguchi-Shinozaki K: Transcriptional regulation of ABI3- and ABA-responsive genes including RD29B and RD29A in seeds, germinating embryos, and seedlings of Arabidopsis. Plant Mol Biol. 2006, 60: 51-68. 10.1007/s11103-005-2418-5.

    CAS  PubMed  Google Scholar 

  57. Thomas MS, Flavell RB: Identification of an enhancer element for the endosperm-specific expression of high molecular weight glutenin. Plant Cell. 1990, 2: 1171-1180. 10.1105/tpc.2.12.1171.

    CAS  PubMed  PubMed Central  Google Scholar 

  58. Sutoh K, Yamauchi D: Two cis-acting elements necessary and sufficient for gibberellin-upregulated proteinase expression in rice seeds. Plant J. 2003, 34: 635-645. 10.1046/j.1365-313X.2003.01753.x.

    CAS  PubMed  Google Scholar 

  59. Ellerstrom M, Stalberg K, Ezcurra I, Rask L: Functional dissection of a napin gene promoter: identification of promoter elements required for embryo and endosperm-specific transcription. Plant Mol Biol. 1996, 32: 1019-1027. 10.1007/BF00041385.

    CAS  PubMed  Google Scholar 

  60. Huang N, Sutliff TD, Litts JC, Rodriguez RL: Classification and characterization of the rice alpha-amylase multigene family. Plant Mol Biol. 1990, 14: 655-668. 10.1007/BF00016499.

    CAS  PubMed  Google Scholar 

  61. Wu C, Washida H, Onodera Y, Harada K, Takaiwa F: Quantitative nature of the Prolamin-box, ACGT and AACA motifs in a rice glutelin gene promoter: minimal cis-element requirements for endosperm-specific gene expression. Plant J. 2000, 23: 415-421. 10.1046/j.1365-313x.2000.00797.x.

    CAS  PubMed  Google Scholar 

  62. Nakashima K, Fujita Y, Katsura K, Maruyama K, Narusaka Y, Seki M, Shinozaki K, Yamaguchi-Shinozaki K: Transcriptional regulation of ABI3- and ABA-responsive genes including RD29B and RD29A in seeds, germinating embryos, and seedlings of Arabidopsis. Plant Mol Biol. 2006, 60: 51-68. 10.1007/s11103-005-2418-5.

    CAS  PubMed  Google Scholar 

  63. Stalberg K, Ellerstom M, Ezcurra I, Ablov S, Rask L: Disruption of an overlapping E-box/ABRE motif abolished high transcription of the napA storage-protein promoter in transgenic Brassica napus seeds. Planta. 1996, 199: 515-519. 10.1007/BF00195181.

    CAS  PubMed  Google Scholar 

  64. Izawa T, Foster R, Nakajima M, Shimamoto K, Chua N-H: The rice bZIP transcriptional activator RITA-1 is highly expressed during seed development. Plant Cell. 1994, 6: 1277-1287. 10.1105/tpc.6.9.1277.

    CAS  PubMed  PubMed Central  Google Scholar 

  65. Huang N, Sutliff TD, Litts JC, Rodriguez RL: Classification and characterization of the rice alpha-amylase multigene family. Plant Mol Biol. 1990, 14: 655-668. 10.1007/BF00016499.

    CAS  PubMed  Google Scholar 

  66. Forde BG, Heyworth A, Pywell J, Kreis M: Nucleotide sequence of a B1 hordein gene and the identification of possible upstream regulatory elements in endosperm storage protein genes from barley, wheat and maize. Nucleic Acids Res. 1985, 13: 7327-7339. 10.1093/nar/13.20.7327.

    CAS  PubMed  PubMed Central  Google Scholar 

  67. Izawa T, Foster R, Nakajima M, Shimamoto K, Chua N-H: The rice bZIP transcriptional activator RITA-1 is highly expressed during seed development. Plant Cell. 1994, 6: 1277-1287. 10.1105/tpc.6.9.1277.

    CAS  PubMed  PubMed Central  Google Scholar 

  68. Washida H, Wu CY, Suzuki A, Yamanouchi U, Akihama T, Harada K, Takaiwa F: Identification of cis-regulatory elements required for endosperm expression of the rice storage protein glutelin gene GluB-1. Plant Mol Biol. 1999, 40: 1-12. 10.1023/A:1026459229671.

    CAS  PubMed  Google Scholar 

  69. Shirsat A, Wilford N, Croy R, Boulter D: Sequences responsible for the tissue specific promoter activity of a pea legumin gene in tobacco. Mol Gen Genet. 1989, 215: 326-331. 10.1007/BF00339737.

    CAS  PubMed  Google Scholar 

  70. Ogawa M, Hanada A, Yamauchi Y, Kuwahara A, Kamiya Y, Yamaguchi S: Gibberellin biosynthesis and response during Arabidopsis seed germination. Plant Cell. 2003, 15: 1591-1604. 10.1105/tpc.011650.

    CAS  PubMed  PubMed Central  Google Scholar 

  71. Hwang JW, Kim SK, Lee JS, Kim IS: Gene expression of the biosynthetic enzymes and biosynthesis of starch during Rice Grain development. J Plant Biol. 2005, 48 (4): 448-455. 10.1007/BF03030587.

    CAS  Google Scholar 

  72. Thorbjornsen T, Villand P, Kleczkowski LA, Olsen OA: A single gene encodes two different transcripts for the ADP-glucose pyrophosphorylase small subunit from barley (Hordeum vulgare). Biochem J. 1996, 313 (1): 149-154.

    PubMed  PubMed Central  Google Scholar 

  73. Ayres NM, McClung AM, Larkin PD, Bligh HFJ, Jones CA, Park WD: Microsatellites and a single-nucleotide polymorphism differentiate apparent amylose classes in an extended pedigree of US rice germplasm. Theor Appl Genet. 1997, 04: 773-781. 10.1007/s001220050477.

    Google Scholar 

  74. Bergman CJ, Fjellstrom R, McClung A: Association between amylose content and a microsatellite across exotic rice germplasm. International Rice Genetics Symposium. 2001, IRRI: Manila, The Philippines, 4

    Google Scholar 

  75. Wang ZY, Zheng FQ, Shen GZ, Gao JP, Snustad DP, Li MG, Zhang JL, Hong MM: The amylose content in rice endosperm is related to the post-transcriptional regulation of the waxy gene. Plant J. 1995, 7 (4): 613-22. 10.1046/j.1365-313X.1995.7040613.x.

    CAS  PubMed  Google Scholar 

  76. Fergason VL, Zuber MS: Influence of environments on amylose content of maize endosperm. Crop Sci. 1962, 6: 209-10.2135/cropsci1962.0011183X000200030010x.

    Google Scholar 

  77. Nobuta K, Venu RC, Lu C, Belo A, Vemaraju K, Kulkarni K, et al: An expression atlas of rice mRNAs and small RNAs. Nature Biotechnol. 2007, 25: 473-7. 10.1038/nbt1291.

    CAS  Google Scholar 

  78. Venu RC, Jia Y, Gowda M, Jia MH, Jantasuriyarat C, Stahlberg E, et al: RL-SAGE and microarray analysis of the rice transcriptome after Rhizoctonia solani infection. Mol Genet Genomics. 2007, 278: 421-31. 10.1007/s00438-007-0260-y.

    CAS  PubMed  Google Scholar 

  79. Gowda M, Venu RC, Raghupathy MB, Nobuta K, Li H, Stahlberg E, et al: Deep and comparative analysis of the mycelium and appressorium transcriptomes of Magnaporthe grisea using MPSS, RL-SAGE, and oligoarray methods. BMC Genomics. 2006, 8 (7): 310-10.1186/1471-2164-7-310.

    Google Scholar 

  80. Counce PA, Bryant RJ, Bergman CJ, Bautista RC, Wang YA, Siebenmorgen TJ, Moldenhauer KK, Meullenet JC: Rice Milling Quality, Grain Dimensions, and Starch Branching as Affected by High Night Temperatures. Cereal Chem. 2005, 82 (6): 645-648. 10.1094/CC-82-0645.

    CAS  Google Scholar 

  81. Fan J, Siebenmorgen TJ, Yang W: A study of head rice yield reduction of long- and medium-grain rice varieties in relation to various harvest and drying conditions. Transactions of the ASAE. 2000, 43 (6): 1709-1714.

    Google Scholar 

  82. Kepiro JL, Fjellstrom RG, McClung AM: Studying the inheritance of high milling yield in Cypress. Texas Rice, Highlighting Research in. 2006, VII-IX.

    Google Scholar 

  83. Kumar R, Qiu J, Joshi T, Valliyodan B, Xu D, Nguyen HT: Single Feature Polymorphism Discovery in Rice. PLoS ONE. 2007, 2 (3): e284-10.1371/journal.pone.0000284.

    PubMed  PubMed Central  Google Scholar 

  84. Oard JH, Moldenhauer KK, Fjellstrom RG, Nelson J, Linscombe SD, Silva J, May GD: Registration of the MY2 Cypress/LaGrue rice recombinant inbred line mapping population. Journal of Plant Registrations. 2010, 4 (3): 1-5. 10.3198/jpr2009.11.0668crmp.

    Google Scholar 

  85. Kim KS, Kang HJ, Hwang IK, Hwang HG, Kim TY, Choi HC: Comparative ultrastructure of Ilpumbyeo, a high-quality japonica rice, and its mutant, Suweon 464: scanning and transmission electron microscopy studies. J Agric Food Chem. 2004, 52 (12): 3876-83. 10.1021/jf049767r.

    CAS  PubMed  Google Scholar 

  86. Lee JS, Ha WG, Chang JK, Ryu KL, Cho JH, Song YC, Kwon OK, Yang SJ, Kim HY, Suh HS: QTL Analysis for Grain Quality Properties in a Japonica Rice Combination. New directions for a diverse planet. Proceedings of the 4th International Crop Science Congress, Brisbane, Australia, 26 Sep - 1. 2004, Oct

    Google Scholar 

  87. Kim KS, Hwang HG, Kang HJ, Hwang IK, Lee YT, Choi HC: Ultrastructure of individual and compound starch granules in isolation preparation from a high-quality, low-amylose rice, ilpumbyeo, and its mutant, G2, a high-dietary fiber, high-amylose rice. J Agric Food Chem. 2005, 53 (22): 8745-51. 10.1021/jf051194a.

    CAS  PubMed  Google Scholar 

  88. Ware DH, Jaiswal P, Ni J, Yap IV, Pan X, Clark KY, Teytelman L, Schmidt SC, Zhao W, Chang K, Cartinhour S, Stein LD, McCouch SR: Gramene, a tool for grass genomics. Plant Physiol. 2002, 130 (4): 1606-13. 10.1104/pp.015248.

    CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

This work was supported by US National Science Foundation grants (#0321437 and 0701745) to B.C.M. and G.L.W, a USDA RiceCAP grant to G.L.W. and funding from Kyung Hee University to G.A.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Guo-Liang Wang.

Additional information

Authors' contributions

RCV and MVS grew the plants, collected developing seeds, extracted RNA and performed RTPCR experiments, constructed SBS libraries, bioinformatic analysis of MPSS and SBS data and wrote the manuscript, KN and AB analyzed the data, GA, BCM and G-LW designed the experimental plan and revised the manuscript. The authors agreed on the contents of the paper.

R C Venu, M V Sreerekha contributed equally to this work.

Electronic supplementary material

12864_2010_10082_MOESM1_ESM.PPT

Additional file 1:Filter results of the five MPSS and SBS libraries. A) A total of 39,288 distinct 17-base expressed signatures from the five MPSS libraries were processed according to three filters: significance, reliability, and genomic match. B) Similarly, 397,543 signatures from the five SBS libraries were also processed using these same filters as previously described by Meyers et al. [45]. (PPT 134 KB)

12864_2010_10082_MOESM2_ESM.DOC

Additional file 2:Classification of the MPSS and SBS signatures from the five libraries based on their location on the annotated gene (hits = 1) (See Meyers et al. 2004[45]for details).(DOC 60 KB)

12864_2010_10082_MOESM3_ESM.XLS

Additional file 3:List of expressed grain quality related genes identified in 6 days old developing seeds by MPSS and SBS technologies. (XLS 682 KB)

12864_2010_10082_MOESM4_ESM.ZIP

Additional file 4:List of five fold up and down-regulated genes, antisense and alternate transcripts. A: List of genes commonly identified by MPSS and SBS technologies. Five fold up- and down-regulated genes, antisense and alternate transcripts are presented. B: Genes identified by SBS technology. Five fold up- and down-regulated genes, antisense and alternate transcripts are listed. C: Genes identified by MPSS technology. Five fold up- and down-regulated genes, antisense and alternate transcripts are listed. (ZIP 3 MB)

12864_2010_10082_MOESM5_ESM.PPT

Additional file 5:Network of lysine and aspartate family amino acid biosynthesis and degradation. http://www.gramene.org. Only the genes with 5-fold up- or down-regulation in Cypress (PSC) or Ilpumbyeo (PSI) compared with that in LaGrue or YR15965 are shown. The positive number in parenthesis indicates up-regulation and the negative number in parenthesis indicates down-regulation. The first value in parenthesis shows the fold change in expression either in LaGrue or YR15965, and the second value shows the fold change in expression in Nipponbare. The italicized and underlined bold number before the parenthesis shows the MPSS/SBS signature class [45]. Green indicates that the gene was identified by SBS only. Red indicates that the gene was identified by MPSS only. Blue indicates that the gene was identified by both MPSS and SBS. (PPT 261 KB)

Additional file 6:Five fold up and down regulated transcription factors identified by MPSS, SBS and both. (XLS 296 KB)

12864_2010_10082_MOESM7_ESM.DOC

Additional file 7:Conserved cis elements in the promoter region of the highly induced genes (≥50 fold) in Cypress (compared to LaGrue and Nipponbare) and Ilpumbyeo (compared to YR15965 and Nipponbare) that are involved in seed development. (DOC 46 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Venu, R.C., Sreerekha, M.V., Nobuta, K. et al. Deep sequencing reveals the complex and coordinated transcriptional regulation of genes related to grain quality in rice cultivars. BMC Genomics 12, 190 (2011). https://doi.org/10.1186/1471-2164-12-190

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/1471-2164-12-190

Keywords