- Research article
- Open Access
Association of variation in the sugarcane transcriptome with sugar content
BMC Genomics volume 18, Article number: 909 (2017)
Sugarcane is a major crop of the tropics cultivated mainly for its high sucrose content. The crop is genetically less explored due to its complex polyploid genome. Sucrose synthesis and accumulation are complex processes influenced by physiological, biochemical and genetic factors, and the growth environment. The recent focus on the crop for fibre and biofuel has led to a renewed interest on understanding the molecular basis of sucrose and biomass traits. This transcriptome study aimed to identify genes that are associated with and differentially regulated during sucrose synthesis and accumulation in the mature stage of sugarcane. Patterns of gene expression in high and low sugar genotypes as well as mature and immature culm tissues were studied using RNA-Seq of culm transcriptomes.
In this study, 28 RNA-Seq libraries from 14 genotypes of sugarcane differing in their sucrose content were used for studying the transcriptional basis of sucrose accumulation. Differential gene expression studies were performed using SoGI (Saccharum officinarum Gene Index, 3.0), SAS (sugarcane assembled sequences) of sugarcane EST database (SUCEST) and SUGIT, a sugarcane Iso-Seq transcriptome database. In total, about 34,476 genes were found to be differentially expressed between high and low sugar genotypes with the SoGI database, 20,487 genes with the SAS database and 18,543 genes with the SUGIT database at FDR < 0.01, using the Baggerley’s test. Further, differential gene expression analyses were conducted between immature (top) and mature (bottom) tissues of the culm. The DEGs were functionally annotated using GO classification and the genes consistently associated with sucrose accumulation were identified.
The large number of DEGs may be due to the large number of genes that influence sucrose content or are regulated by sucrose content. These results indicate that apart from being a primary metabolite and storage and transport sugar, sucrose may serve as a signalling molecule that regulates many aspects of growth and development in sugarcane. Further studies are needed to confirm if sucrose regulates the expression of the identified DEGs or vice versa. The DEGs identified in this study may lead to identification of genes/pathways regulating sucrose accumulation and/or regulated by sucrose levels in sugarcane. We propose identifying the master regulators of sucrose if any in the future.
Among the domesticated grasses, sugarcane and sweet sorghum have undergone extensive selection for high accumulation of sucrose that serves as the primary sources of sugars for human and animal consumption, as well as ethanol production for fuel .The maturing sugarcane culm represents both an economically important and physiologically interesting experimental system to study the dynamics of carbohydrate partitioning and metabolism associated with the accumulation of high concentrations of sucrose. A distinctive feature of sugarcane is that high levels of sucrose storage occurs only in the culm parenchyma cells as against in other plants where storage of sugar or other storage molecule/s occurs in terminal sink organs such as tubers, grains, or fleshy fruits. Sucrose concentration that peaks in the sugarcane culm during the end of the vegetative cycle (called ripening) is utilized for the sexual reproductive phase and the remaining reserve is re-mobilized to produce new vegetative structures unlike the pattern in monocarpic annuals where there is a single cycle of storage and utilization for the reproductive phase . In addition, sucrose is the only major form in which reduced carbon is exported from the source and hence all cellular processes outside the source are dependent on the mobilisation and utilisation of sucrose. Sucrose is the dominant storage reserve in sugarcane in contrast to most other plant stems that store polysaccharides such as starch or fructans with a low concentration of sucrose. As sugarcane matures, there is a shift in carbon partitioning from that of insoluble and respiratory components towards the osmotically active sucrose .
Although sugarcane stores the highest concentration (reaching about 0.7 M) of sucrose in the plant kingdom, studies on the physiological, biochemical and genetic basis of sucrose synthesis and accumulation have been limited compared to those in model plants like Arabidopsis or rice that do not accumulate high levels of sucrose. There are very few studies of sucrose accumulation primarily focusing on the sugarcane culm. Often these studies in sugarcane have reported a network of genes related to cell wall metabolism, carbohydrate metabolism, stress responses and regulatory processes [4,5,6,7,8,9,10,11]. Microarray analysis of sugarcane genotypes that varied in sucrose content revealed that many of the genes associated with high sucrose content showed overlap with drought data sets, but appeared to be mostly independent from abscisic acid signalling . A large expressed sequence tag (EST) study of the sugarcane transcriptome and physiological, developmental and tissue-specific gene regulation was initiated in Brazil . Sugarcane cultivars differing in both maximum sucrose accumulation (in Brix) capacity and accumulation dynamics during growth and culm maturation were studied cDNA microarrays and developmentally regulated genes related to hormone signalling, stress response, sugar transport, lignin biosynthesis and fibre content were identified . An expression profiling of a set of genes associated with sucrose accumulation was studied using quantitative real time reverse transcription PCR (qRT-PCR) in 13 genotypes of sugarcane and its progenitor species including S. officinarum, S. spontaneum and related genera Erianthus arundinaceus . High brix genotypes exhibited increased expression of sucrose non-fermenting related kinases and cellulose synthases in an expression study comparing high and low brix genotypes of sugarcane using qRT-PCR . In another transcriptome study using next generation sequencing (NGS)  enrichment of transcripts involved in a network of sucrose synthesis, accumulation, storage and retention in relation to the agronomic characteristics of the genotypes contrasting for rust resistance was observed. Casu et al.  proposed that sucrose accumulation may be regulated by a network of genes induced during culm maturation which included clusters of genes with roles that contribute to key physiological processes including sugar translocation and transport, fibre synthesis, membrane transport, vacuole development and function, and abiotic stress tolerance. These studies show that the sugarcane culm is a composite organ associated with numerous diverse functions other than sucrose storage. A gene networking pattern involving genes associated with culm maturation and sucrose accumulation, sugar transport, vacuole development, lignification, suberisation and abiotic-stress tolerance can be inferred from these studies. The present study aimed to identifying transcripts that were associated with sucrose accumulation using a set of seven high sugar and seven low sugar genotypes by expression profiling of mature and immature culm tissues and bioinformatic analyses of culm transcriptomes. The upregulation of several thousands of transcripts associated with sucrose biosynthesis was demonstrated in the high sugar, and maturing culm of sugarcane. This is the first transcriptome study showing the association of expression of a large number of genes with sucrose synthesis and accumulation in the sugarcane culm tissue.
Plant material and phenotypic data collection
Fully grown, disease free 12 months old plants grown in the field in a randomized complete block design were selected for analysis. The genotypes were derived from a sugarcane population provided by Sugar Research Australia (SRA), Brisbane, Australia, previously described in . Sugar content measured as Brix (a measure of the soluble solids in sugarcane juice) was used for classifying the genotypes as high and low sugar genotypes (Table 1). The low sugar genotypes had a Brix range of 17–18.4 while the high sugar genotypes had a Brix range of 19.4–21.4. The Brix at the point of collection was used for defining the high and low sugar groupings. These genotypes may have high or low sugar content in other environments. A wide variation in sugar content was not obtained as these genotypes were commercial cultivars and introgression lines in the breeding pipeline with a sugar content above 16 and fibre content below 15 on a fresh weight basis. Culm samples (from both top and bottom tissues, the 4th internode from top and 3rd internode from the bottom of the cane) were collected from four representative stalks and pooled for each internode sample. All samples were collected between 10 am to 2 pm to limit the diurnal fluctuations in the transcriptome. After collection, the samples were immediately flash frozen in liquid nitrogen and stored at −80 °C until RNA extraction. In addition, HPLC (high performance liquid chromatography) and NIR (near infrared spectroscopy) was used to measure the sugar composition and fibre content on a fresh weight basis. A sub-sample of each genotype was processed through a mechanical grinder, a component of the SpectraCane system (Biolab, Australia) and scanned by NIR for fibre content, Brix and sugar content (commercial cane sugar - CCS). For details see Additional file 1: Tables S1a, S2, S3; Figure S1.
Sample collection and preparation for RNA-Seq
The frozen sugarcane samples were pulverized using a Retsch TissueLyser (Retsch, Haan, Germany) at a frequency of 30/S for 1 min 30 s and about 1 g of ground sample powder was used for RNA extraction. RNA extractions were conducted as described by Furtado et al. (2014)  employing a Trizol kit (Invitrogen) and a Qiagen RNeasy Plant minikit (#74134, Qiagen, Valencia, CA, USA). For RNA quality and quantity assessment, a NanoDrop8000 spectrophotometer (ThermoFisher Scientific, Wilmington, DE, USA), and an Agilent Bioanalyser 2100 with the Agilent RNA 6000 Nano kit (Agilent Technologies, Santa Clara, CA, USA) were used. Only RNA samples with a RIN value of >7.5 were chosen for library preparations. About 3 μg each of 28 internodal RNA samples was used for indexed-library preparation (average insert size of 200 bp) with a TruSeq stranded with Ribo-Zero Plant Library Prep Kit for preparing total RNA library (Illumina Inc.) as described in . The library was subjected to sequencing in two lanes (equimolar) using an Illumina HiSeq4000 instrument to obtain paired-end (PE) read of 150 bp. The library preparation and sequencing was conducted at the Translational Research Institute, The University of Queensland, Australia.
RNA-Seq data processing
Read adapter and quality trimming were performed in CLC Genomics Workbench v9.0 (CLC-GWB, CLC Bio-Qiagen, Aarhus, Denmark) with a quality score limit of <0.01 (equivalent to Phred Q score ≥ 20), and allowing a maximum of two ambiguous nucleotides. Only PE reads with a length ≥ 35 bp were kept for further analyses. Further information on the RNA Bioanalyser profiles, raw RNA-Seq reads, trimming, quality parameters including size distribution and GC content is described in detail in  and in Additional file 1; Table S1b. Table 2 gives the details of reads from each genotype (top and bottom internode tissues) after quality trimming.
Differential gene expression (DGE) analyses
Using the CLC-GWB v9.5.1 software, RNA-Seq experiments were performed with a minimum length fraction of 0.9 and a minimum similarity fraction of 0.8. The number of reads per kilobase per million mapped reads (RPKM) was used for normalization . The CLC-GWB provides a comprehensive RNA-Seq tool for differential gene expression accompanied by statistical analyses. The Baggerley’s test that is used in this case  is the proportion-based statistical analysis that uses raw count data (un-transformed, not-previously-normalized) as input for setting up the experiment and uses total or unique gene/exon reads for calculating the differentially expressed genes. This test compares counts by considering the proportions that the counts for each gene make-up of the total sum of counts in each group. That is, it takes into account the proportion of every genotype in a group for a gene to be considered as differentially expressed. When Edge test  in CLC-GWB (an equivalent tool to EdgeR available in R Package v3.4.0) was used, this consistency (of differential expression across all genotypes in a group) was not observed. Similarly, the Differential expression for RNA-Seq tool available in the recent version of CLC GWB 10.1.1 gave a different set of results for the DGE experiments (data not shown here) and hence was not included for further analyses. As sugarcane genotypes differ genetically among and between each other, the criterion was to select only those genes that were differentially expressed despite the genetic differences inherent to the genotypes. For example, a gene was considered differentially expressed only when it is consistently differentially expressed in all the seven genotypes in one group in comparison with all the seven genotypes in the other group. Further the Baggerley test also corrects for the differences in the sample sizes (within and between library variations) by comparing the expression levels at the level of proportions rather than raw counts ; CLC manual).
The reads for each genotype in the high and low sugar groups were separately mapped against reference databases, the Saccharum officinarum gene indices (SoGI), the sugarcane Iso-Seq transcriptome database (SUGIT, TSA accession number GFHJ01000000) and the sugarcane assembled sequences (SAS) from the sugarcane expressed sequence tags database (SUCEST). The SoGI database was downloaded from the DFCI gene indices  which had adequate gene or protein function descriptions. In the present case, the SoGI dataset represented 282,683 ESTs that resulted in 121,342 unique sequences after clustering. A collection of ∼240,000 ESTs generated by the SUCEST project from 26 cDNA libraries from different sugarcane tissues sampled at various developmental stages  were assembled into 43,141 distinct contigs using CAP3 . This set of 43,141 contigs make up the SAS database. The SAS database was not annotated and the annotation was performed using the BLASTX against the nr protein database with an e value of 10–5, for 100 hits using the high-performance computing facility (HPC), at The University of Queensland, Australia. In addition, we used a newly constructed SUGIT, sugarcane long reads database described in . In brief, the database was derived from a pooled RNA sample collection including those genotypes used in this study, plus leaf and root tissue samples of 22 commercial and introgressed sugarcane genotypes. The basic descriptions of the databases are given in Table 3 and the methodology is summarised in Fig. 1.
Identification of differentially expressed transcripts
For all the RNA-Seq experiments, involving high and low sugar groups, low sugar samples were used as the references, for comparing top and bottom (immature and mature culm tissues respectively), bottom internode sample was used as the reference for identifying DEGs that were upregulated or down regulated. This means, if one transcript was up-regulated in the reference group, it was down-regulated in the group being compared, and vice versa. Proportion based statistical analysis (Baggerley’s Test) and a Volcano plot were used to compare gene expression levels in the two groups that were considered for differential gene expression (high and low sugar, top and bottom internode samples) in terms of the log2 fold change (at FDR 0.01). The DEGs were further sorted and selected at three different fold change levels, i.e., above and equal to 2, above and equal to 10 fold and below 2 fold change to identify highly expressed and those expressed at low levels.
Functional annotation of identified differentially expressed transcripts
Functional annotation of the transcripts was performed using MapMan categories  using BlastX (e-value ≤10−5, with a cut off value of 80% similarity) against Arabidopsis thaliana and Oryza sp. and SwissProt/UniProt Plant Proteins. In addition Blast2GO  followed by KEGG pathway mapping analyses were performed for the DEGs.
Validation of gene expression using quantitative real-time PCR (qPCR)
In addition, a correlation analysis was performed to validate the expression levels of eight selected transcripts from the RNA-Seq analyses in this study using qPCR expression values of the same transcripts extracted from a separate study . The RPKM values obtained for four tissue samples (two top and two bottom internodes) of two genotypes (QC02–402 and QN05–803) were correlated against the respective qPCR expression values (Cq qPCR normalised gene expression), using Microsoft Excel 2013.
RNA-Seq analyses and identification of differentially expressed genes
The mapping of reads to each reference database is shown in the Table 4. The results of the different RNA-Seq experiments (hereafter SoGI-DGE, SUGIT-DGE and SAS-DGE) are listed in Table 5 and the differential gene expression patterns are depicted as Volcano plots in the Fig. 2. For all DEGs, FDR 0.01 and a fold change of ≥2 were used as cut off values. In SoGI-DGE, with high and low sugar bottom internode samples (HSB vs LSB), out of the total 121,342 transcripts, 34,375 showed upregulation and 101 transcripts showed down regulation in high sugar genotypes when compared to low sugar genotypes. When low sugar top and low sugar bottom internode samples (LST vs LSB) were compared, 30,723 transcripts were differentially expressed, upregulated in the low sugar top internode sample, while 86 transcripts were down regulated. When high sugar top and high sugar bottom intermodal samples (HST vs HSB) were considered, 31 transcripts were found to be upregulated in high sugar bottom internode sample compared to the corresponding top. In SUGIT-DGE, out of 107,598 transcripts, 18,411 transcripts were upregulated while 132 transcripts were down regulated in high sugar bottom internode sample compared to low sugar bottom internode sample (HSB vs LSB). 11,713 transcripts were differentially expressed between low sugar top and low sugar bottom intermodal samples (LST vs LSB), wherein 11,599 transcripts were upregulated and 114 transcripts were down regulated. In the SAS-DGE, 19,808 transcripts showed differential expression (19,782 upregulated, 26 down regulated) out of 43,141 transcripts of the SAS reference database in the HSB vs LSB comparison and 20,487 transcripts were differentially expressed (20,449 up regulated, 38 down regulated) in the LST vs LSB comparison (see Table 5 for details). However, in the SAS-DGE, there were more DEGs in the HST vs HSB comparison, with 2826 DEGs. This comparison resulted in only 21 and 31 DEGs with the SUGIT and SoGI DGEs respectively (Fig. 3). In addition, the common and unique transcripts among the three comparisons in three different DGEs were found (Figs. 4 and 5). For additional information on experimental set up and statistical analyses, see Additional files 2: Table S4-S6 (complete list of DEGs in the SOGI-DGE), Additional files 3: Table S7-S9 (complete list of DEGs in the SUGIT-DGE), and Additional files 4: Table S10-S12 (complete list of DEGs in the SAS-DGE). The results of the qPCR expression values were found to be significantly correlated with the RPKM values for selected genes (r = 0.629, p < 0.001, n = 32, df = 30). The details of qPCR validation analysis are provided in Additional file 5: Table S13 and Figure S2.
Identification of consistently differentially expressed transcripts between high and low sugar genotypes
The results of the DGE analyses are given in the Table 5. The DEGs at different fold change cut off values, i.e. ≥2, ≥10 and <2 fold changes were identified. This resulted in the identification of DEGs that are expressed at high levels (10 fold and above), low levels (<2) apart from the cut off of 2 and above (Table 6). In addition, to check for specific sucrose/sugar related transcripts, filtering was done for “sucrose” and “sugar” as key words in the DGE experiment files as the DEGs were in large numbers. Although some transcripts related to sucrose/sugar may have been missed, this approach helped screening the large number of DEGs. At the fold change value of 2 and above, the sucrose and sugar related genes were 63, 68 and 49 in HSB vs LSB and 75, 74, and 60 in in LST vs LSB using SoGI-DGE, SUGIT-DGE and SAS-DGE respectively. These transcripts are listed in the Additional file 6: Tables S14–22 and some are listed in the Tables 7, 8 and 9. At the fold change value of 10 and above, the sucrose/sugar related transcripts were very few in number and included sucrose synthase (SuSy), sucrose transporter (SuT), sucrose phosphate synthase (SPS) and a SWEET transporter (Table 6). Further, SuSy2 and SuT3 were consistently present in all three sets of DEGs for LST vs LSB, at the maximum fold change value of 10 and above, showing upregulation in LST. In HSB vs LSB, SuSy 2 and SuT 2 were observed in SoGI-DGE and SUGIT-DGE, upregulated in HSB, whereas no sucrose/sugar related transcripts were present in SAS-DGE at this fold change. At the fold change value of below 2, there were no sucrose/sugar related transcripts for these two comparisons in any of the DGEs. In HST vs HSB, sucrose/sugar related transcripts were not found in SoGI- and SUGIT-DGEs, however, at the fold change cut off value of below <2, sucrose phosphate phosphatase (SPP) 2, SuSy, SWEET 16 like transporter and a sugar phosphate phosphate translocator were found in SAS-DGE, showing upregulation in HSB. Interestingly, the DEGs at fold change ≥10 in HST vs HSB were related to phenyl propanoid pathway genes like terpene cyclase (TC), phenyl ammonia lyase (PAL), chalcone synthase (CHS), cinnamoyl CoA reductase (CCoAR), ferruloyl esterase (FE), laccase 7-like (LAC), β-expansin (BE) 1a and ethylene responsive transcripts etc., in SoGI, SUGIT and SAS-DGEs (Table 6). Overall, genes specific to sucrose synthesis and accumulation were enriched in the HSB vs LSB and LST vs LSB experiments, while genes for secondary metabolites, were found to be enriched in the case of the HST vs HSB comparison. There were no DEGs in HST vs LST experiment in the three DGE analyses.
Gene ontology annotation
The gene ontology annotation using MapMan resulted in grouping and classification of the DEGs into different functional categories. The DGE analysis between LST and LSB was almost similar in number and composition to the DEGs obtained between HSB vs LSB (Additional file 7: Figure S3, Table S23).
Upregulated transcripts in high sugar genotypes when compared with low sugar genotypes
The DGE analyses of HSB vs LSB and LST vs LSB were showing a similar trend and the two sets of DEGs had an extensive overlap (see Additional file 7: Figure S3). About 89.3% (with SoGI) of the transcripts differentially expressed were similar in both the comparisons (63% in SUGIT and 96.8% in case of SAS). Hence only the DEGs of HSB vs LSB is considered for further discussion. Only a few transcripts are discussed here. For a complete list DEGs of all the three DGE analyses, refer Additional files 2, 3 and 4: Tables S4-S12. In addition, a list of unique and commonly expressed transcripts in each group was prepared (Additional File 8: Tables S24–28) for all the DGEs. The description below gives an overview of the DEGs obtained in the three DGE analyses at FDR 0.01 without any filtering.
Sucrose, starch and other sugar derivatives
In the SoGI-DGE, there were 71 sucrose related transcripts consisting of sucrose synthases 2 and 3, sucrose phosphate synthase (SPS) 2 and 3, sucrose phosphate phosphatase (SPP), sucrose non-fermenting related protein kinases; impaired sucrose induction 1-like protein and sucrose transporters (SuT) 2 and 4. About 22 transcripts were sugar related including transport, efflux, and glycosyltransferases. Ten transcripts were related to alkaline/neutral invertases and three transcripts with homology to sucrase from Oryza sativa were found. There were ten high-glucose regulated protein 8-like transcripts. Forty six transcripts, were related to intermediary metabolism of fructose phosphates, the most expressed being fructose-bisphosphate aldolase cytoplasmic isozyme. Sixteen transcripts were related to xylose metabolism and β-glucosidase related transcripts were observed. Fifteen hexose related transcripts were transporters, while 18 transcripts were related to triose phosphates metabolism. Fifty three UDP-related transcripts were found, out of which six were UDP-glucose-dehydrogenases. There were also UDP-sugar, arabinose, xylose, galactose transporters, −epimerases and -pyrophosphorylase related transcripts. Fucoses are hexose sugars and nine transcripts associated with them include fucosidases and fucosyltransferases. Thirteen mannose, trehalose and sorbitol related transcripts were found. Glucans metabolizing genes were another prominent group found to be highly expressed with 45 transcripts including β-1, 4 glucan synthases and endoglucanases. Nine transcripts related to alpha amylases were also upregulated in high sugar genotypes. In addition, 85 transcripts were found to be related to kinases including hexokinases, fructokinases (1, 2 and 3), phosphofructokinases, carbohydrate kinases and galactokinases. In the SUGIT-DGE, there were 208 sucrose related transcripts. In addition to the transcripts observed in the SoGI-DGE, sugar transport 5 and 7, sugar transporter ERD6 like, bidirectional sugar transporter SWEET1 and 4 like, and an abundance of ABC transporters B, C, D, E, G, F, and I for sugar were found in SUGIT-DGE. In the SAS-DGE, 75 transcripts were related to sucrose consisting of galactinol-sucrose galactosyltransferase 1,2 and 6, sucrose transporters SUC3 and its isoform X2, SUC4, SPP 2 and SPS 1, 3 and 4, bidirectional sugar transporter2a, 4, 14 and 16, sugar transporter ERD6-like 5, 6 and 16, sugar transport 5 and its isoform X1, 7 and 9 transcripts for sugar phosphate phosphate translocator. Interestingly one transcript for invertase inhibitor and one transcript for sulfofructose kinase like transcript which were not detected in the other two DGEs were found. Starch synthases II b and c, III, IV and starch branching and debranching (pullulanase and isoamylase) enzymes were found to be upregulated. The KEGG pathway map for starch and sucrose related DEGs are shown in the Additional file 9: Figure S4.
Vacuole and transporters
Transcripts related to transporters comprising of sucrose, sugar, sugar efflux, sugar phosphate exchanger, hexose, nitrate, GDP-mannose, aquaporins, vacuolar ATP synthase subunit C, vacuolar H+ ATP synthase subunit C, vacuolar H+ pyrophosphatase, vacuolar proton pumps, vacuolar targeting receptors, vacuolar protein sorting proteins (1, 13, 13A, 22, 25, 33, 36, 41, 55, DUF1162) and vacuolar H+- inorganic pyrophosphatase were found to be upregulated. A transcript was found to match the bacterial sugar transport system probably due to contaminating sequences. An abundance of ABC transporters could be observed in all DGEs.
Auxin related transcripts were auxin response factors1, 3, 4, 5, 7, 9, 13, 15, 16, 17, 22, 23, 26, 27 and 31, auxin responsive proteins, auxin influx/efflux carriers, auxin transporters 1, 2, and auxin binding protein 4 were found. With respect to ethylene, 43 transcripts including ethylene over-producer like proteins, ethylene responsive transcription factors, elongation factors, calmodulin binding factors, element binding factors, small GTP binding proteins, ethylene receptors, and ethylene insensitive 2, and 3 proteins were found. Transcripts related to abscisic acid (ABA) and gibberellic acid (GA) and very few jasmonate and brassinosteroid related transcripts were found in the DEGs.
Transcripts related to the chloroplast, notably chloroplastic group IIB intron splicing facilitator CRS2, alpha-glucan water dikinase, rubisco large subunit alpha binding, chloroplast post-illumination chlorophyll fluorescence increase protein, starch synthases II b and c, III, IV and starch branching and debranching (pullulanase and isoamylase) enzymes to name a few from the three DGEs. The ribosomal proteins were one of the most upregulated transcripts in all the DGEs comprising of nuclear, cytoplasmic, chloroplast and mitochondrial ribosome related functions especially of 30S, 40S, 50S, and 60S and acidic ribosomal transcripts.
Transcripts related to senescence including senescence-inducible chloroplast stay-green protein and leaf senescence proteins, senescence-inducible chloroplast stay-green protein, heat shock related transcripts of DNA and chloroplast, wound inducible protein, ripening ABA induced, autophagy, programmed cell death, cell death related protein, and defender against cell death, vascular death associated transcript were found. Transcripts were related to stress (light, water, heat, salt, ozone-responsive, bio-stress) and pathogenesis related transcripts, hypersensitive induced response proteins, 22 kDa drought inducible proteins, dehydrins and transcripts related to proline were found to be upregulated.
Flowering related transcripts including pistil, pollen, immature pollen, flowering-time protein isoforms, phytochrome and flowering time, flowering locus, GIGANTEA, OVA4 ovule abortion 4, and fertilization independent were upregulated in high sugar genotypes. Proteins related to the egg apparatus, seed maturation, shrunken seed and seed starch branching enzyme related transcripts were upregulated. HASTY 1 flower development, agamous-like MADS box AGL12, photoperiod-independent early flowering 1, early flowering 3, flowering time control FY, luminidependens are some of the flowering related transcripts found across the DEGs.
Transcripts of signalling related to DNA damage, signal recognition, pollen, and integral membrane, 14–3-3 like proteins. Out of a large number of kinases, serine/threonine phosphatases, appeared to have a dominant role during sucrose accumulation. Also, it was observed that several signalling events can be inter related with others from the pattern of gene expression observed to be upregulated in high sugar genotypes (Additional file 9: Figure S5).
In the SoGI-DGE, transcripts matching with fibre proteins 11, 12, 15, 19 and 34 of cotton and Hyacinthus sp. were found. There was a transcript weakly similar to cement protein 3b from the marine worm Phragmatopoma californica. Vegetative and secondary cell wall proteins, cell wall hydrolases, cell envelope and cell shape, cell wall beta 1,3, endoglucanase cellulose synthases, bundle sheath cell specific proteins, 50 transcripts for cellulose synthases 2, 3, 4, 5, 6, E6, D3, A and 7, cellulose 1,4, beta-cellobiosidase were upregulated in high sugar genotypes. Also, transcripts of phenyl ammonia lyase (PAL), 6 caffeic acid-o-methyl transferase (COMT), caffeoyl CoA 3-O-methyl transferases (CCoAOMT), glutathione S-transferase, 6 dihydroxyacetone kinase, and transcripts related to chorismate, succinyl, cinnamoyl alcohol of shikimate pathway, caffeoylshikimate esterase, expansins A2 and A13, transcripts for vegetative cell wall, and secondary cell wall related transcripts were found.
Transcripts related to light/photosystem including light induced, light responsive proteins. De-etiolated 1, phytochrome, rubisco sub unit binding proteins, chloroplast post-illumination chlorophyll fluorescence increase protein, cryptochrome, photosystems I 700 and II 680 chlorophyll A apoprotein, photosystem reaction centre subunits II, III, VIII, IX, XI, 23 are few to mention. Interestingly, there were eight non-photosynthetic NADP-malic enzymes transcripts from Zea mays in SoGI-DGE. In SUGIT-DGE, transcripts of CIRCADIAN TIMEKEEPER, blue light photoreceptor PHR2, negatively light regulated, light-stress responsive one helix like, light inducible CPRF2 and WEAK CHLOROPLAST MOVEMENTUNDER BLUE LIGHT 1 like. Transcripts related to photosynthetic NDH subunit of subcomplex B chloroplastic, light dependant short hypocotyls 4 like, high light induced chloroplastic like, blue light photoreceptor PHR2 etc. were found in SAS-DGE. Nitrogen (N) related transcripts comprising of nitrogen utilization substrate protein, nitrogenase, nitrilase, nitrate extrusion proteins and nitrate reductase, bifunctional nitrilase nitrile hydratase NIT4A were up regulated in high sugar genotypes.
Interestingly, in SoGI-DGE, about 6552 transcripts were found to match the chromosomal regions of Vitis vinifera (SoGI annotation) which are whole genome shotgun sequences. In SUGIT-DGE, 243 transcripts were uncharacterized and in SAS-DGE, 320 transcripts were found to be uncharacterized.
Down regulated transcripts in high sugar genotypes
The transcripts down regulated in high sugar genotypes included 17S, 18S, 26S, ribosomal RNA genes, cytochrome P450, and photosystem I 700, a stem specific transcript and leaf specific transcript from Saccharum hybrid cultivar, rRNA intron encoded homing endonuclease, zinc finger protein and uncharacterized transcripts in the three DGEs.
Two groups of genotypes, high sugar and low sugar, were formed based on the sugar content in terms of Brix as in sugarcane most of the soluble solids in the juice (70–91%) correspond to sucrose [12, 30]. Differential expression of genes was studied between the two groups and between top and bottom internodal samples (immature and mature) of the two groups. Therefore, gene expression changes were studied among high sugar top internode (HST), high sugar bottom internode (HSB), low sugar top internode (LST) and low sugar bottom internode (LSB)samples in various comparisons. Thus, the HST vs LST and HSB vs LSB were comparisons between the high and low sugar genotypes, whereas HST vs HSB and LST vs LSB were comparisons between top and bottom intermodal samples. For the DGE analyses, three databases were used as references individually wherein a large number of DEGs were identified from each. The databases were chosen to be specific for sugarcane. SoGI and SAS are derived from 26 different cDNA libraries  as a result, a large number of DEGs where obtained. The SUCEST database which encompasses SoGI and SAS is reported to cover >90% of the sugarcane genes . The SUGIT database is essentially a long reads database sequenced using the latest Iso-Seq technology  which can further be used for refining the DEGs for isoform/allelic information. This database covers approximately 71% of the total predicted genes in sugarcane . The common and unique transcripts from each database are not discussed further as the main objective of this paper was to find the DGE for sugar content. A subset of sucrose /sugar related DEGs were derived, which is interesting as several other studies on sucrose accumulation in sugarcane reported that sucrose related genes were less abundant or not expressed during the maturation stage [6, 12, 32]. There were approximately 70 transcripts related to sugar/sucrose in each DGE. Sucrose synthase (SuSy) and sucrose transporters (SuTs) were consistently found to be highly expressed in high sugar genotypes. Similar association was reported in [6, 8, 14]. The identity of the exact isoform of these two genes could not be found due to the varying annotations of the three databases used, which needs further studies. SuSy is reported to contribute to increasing the sink capacity, building cell wall materials and starch while sucrose transporters facilitate transportation of sucrose that leads to steady increase in sucrose content . Further work on the isoforms/allelic expression of these genes would certainly be useful for understanding the finer details of their regulatory roles. The functioning of the two sucrose synthesis enzymes, SuSy and SPS and their regulation, has not yet been well demonstrated in sugarcane. SPS, sucrose non-fermenting related kinases, bidirectional sugar transporter SWEET, UDP-sugar pyrophosphorylase, impaired sucrose induction 1 -like proteins were the other genes that were consistently present at lower fold changes. Interestingly, an invertase inhibitor gene was found to be highly expressed in LST (13 folds) in LST vs LSB in SAS-DGE. Invertase inhibitors have been previously reported to be highly expressed during the sucrose accumulation stages in sugarcane .
In addition to the above genes, the gene expression pattern in our study reveals a clear association between different gene networks during sucrose accumulation similar to earlier reports [9, 12]. It is possible to make a direct parallel between sucrose content and gene expression levels for almost all the DEGs though the difference in the sugar content between the two groups is very narrow. Sucrose is a carbohydrate compound and was originally recognized only as an energy source for metabolism in plants but was recently shown to also function as a signalling molecule involved in regulation of various physiological processes in plants such as root growth, fruit development and ripening, and hypocotyl elongation . Sugars serve as key components reflecting the plant’s energy status and, therefore, the ability to continuously sense sugar levels and control energy status is a key to survival and therefore transcript levels of thousands of genes respond to changing sugar levels . Further, different sugars can have different regulatory roles in physiological processes, and the developmental stage of the plant further determines the response to sugars [35,36,37]. Recently, it was observed that glucose facilitates the juvenile to adult phase change in Arabidopsis by repressing microRNA (miRNA) 156 expression [38,39,40]. Consequently, mutants in sugar signalling or starch metabolism display an altered juvenile phase . At high concentrations, sugars can induce meristem quiescence as observed in the arrest of development of seedlings germinated on high sugar levels . Sugar induced quiescence of the stem can be seen through the expression of several transcripts for no apical meristem and indeterminate spikelet transcripts in addition to senescence related transcripts. Transcripts related to less abundant and lignocellulosic sugars identified in this study included xylose, trehalose, galactose, arabinose, fucose, mannose, taurine and the sugar alcohols inositol and sorbitol. The constant synthesis and breakdown of sucrose into its hexose components helps regulating various physiological events associated with these less abundant sugars and maintain a reserve for tackling any stimuli including the accumulation of sugar in the form of sucrose. It could be possible that breeding programmes for high sucrose genotypes have resulted in selection for these sugars (total sugars, in addition to sucrose) and gene expression changes of certain regulatory genes . Therefore, diverse phenotypes may stem from multiple effects of sucrose and other sugars as signal and storage compounds when accumulated in various developmental and compartmental patterns resulting from differential gene expression and regulation.
The vacuole occupies as much as 90% of most mature cells and can accumulate and store sucrose, glucose and fructose and serves as a primary pool of free calcium ions in plant cells. Furthermore, the space-filling function of the vacuole is essential for cell growth, as the cell enlargement is mainly through the expansion of the vacuole rather than of the cytoplasm . A vast majority of the differentially expressed gene transcripts were vacuole related including aquaporins, glucans related, aspartic proteinases, endopeptidases, ABC transporters, TIPs, V-ATP synthases, vacuolar protein sorting proteins, proton pumps, Ca2+ ATPases, calmodulins, that showed higher expression levels in high sugar genotypes. Further molecular characterization of vacuolar and tonoplast sugar transporters should advance our understanding of vacuole function, sugar transport and sugar accumulation in sugarcane.
Several transcripts related to plant defense, wounding, and disease were upregulated in high sugar genotypes together with the ripening and senescence related transcripts. Further, water stress and dehydration related gene transcripts were upregulated in the high sugar genotypes. Apart from the ripening and senescence related transcripts which indicate the physiological state of the stem, the up regulation of transcripts encoding plant disease resistance proteins suggests that the defense system of sugarcane was activated by high sugar levels which might contribute to protecting from the extreme stress caused by the high sucrose levels during the maturity stages. It may also create steep osmotic gradients between compartments with varying sucrose concentrations (more negative than −2.0 MPa during sucrose accumulation). The increased commitment to fibre synthesis in the maturing stem is evident in the upregulation of several fibre and cellulose related transcripts in the high sugar genotypes highlights the need to maintain the structure of the stem in conjunction with sucrose accumulation. These may act to restrict apoplastic movement of solutes between the vascular bundles and the sucrose-storage parenchyma cells . Transcripts related to proline and glyoxalase were highly expressed in high sugar genotypes. The differential expression of genes related to fibre, cellulose and lignin synthesis shows that the osmotic regulation and structural maintenance as directed by the sugar levels. Though sucrose content in the sugarcane culm ranges from 14 to 42% of the culm dry weight , the majority of carbohydrates in sugarcane is lignocellulose, a major component in the cell wall. As cell elongation and sucrose accumulation ceases in the maturing sugarcane internodes, there is a major increase in cell wall thickening and lignification . Cellulose accounts for about 42–43% in sugarcane and energy cane cultivars  and can be a prominent competing sink for carbon in sugarcane. Cellulose synthases 1, 2, 3, 4, 5, 6, 7, and 9 along with a novel transcript matching for a cement protein like gene that is upregulated in high sugar genotypes indicates that there are several aspects of sugarcane cell wall composition remain to be explored . S-adenosylmethionine (SAM) produced by SAM-synthase is required as the methylation donor in lignin and suberin biosynthesis and secondary metabolism. It is also required as a precursor for SAM-decarboxylase, which is also up-regulated and important in polyamine synthesis, a response to osmotic stress. Elevated SPS activity is consistently correlated with high rates of cellulose synthesis and secondary wall deposition . UDP-Glucose, apart from being the precursor for sucrose synthesis, is a nucleotide sugar central to diverse pathways of polysaccharide biosynthesis, leading to starch and cellulose, hemicellulose and callose synthesis. About 10 major monosaccharides in cell wall polymers are converted from glucose through UDP-Glucose related interconversion pathways. All UDP-Glucose related transcripts including UDP-Glucose dehydrogenases, pyrophosphorylases were upregulated in high sugar genotypes indicating a high correlation with sugar contents. Ethylene is often related to the lignification of plant tissues by increasing the expression of genes involved in the phenylpropanoid pathway . This explains the parallel upregulation of cellulose synthases, ethylene related transcripts, as well as SPS in the DGEs. The mechanisms regulating cell wall biosynthesis and source-sink relations in sugarcane will be crucial constituents of any efforts to alter carbon partitioning between fibre and sugar in the culm. In addition, the alteration of cell wall biosynthesis genes in association with sucrose (Brix) content is an interesting indication of a correlation between these processes. Silencing or over-expression of some of these genes may lead to altered cell wall or increased sucrose content. Interestingly, when comparing two genotypes contrasting for lignin content, Vicentini et al.  found that a simple correlation between lignin content and differential expression of lignin genes is not always straightforward and most of the lignin biosynthetic genes did not show increased transcript levels in the high lignin genotype.
Sugar signals and the circadian clock are part of a complex network that controls floral transition. In sugarcane, sugar levels peak just before flowering induction. The signalling for senescence, arrest of apical growth, high sucrose levels and flowering induction are well coordinated. The upregulation of several flowering related genes like flowering locus D, pollen and pistil related transcripts in the high sugar genotypes clearly shows that the crop has attained its maximum sugar levels and was in a transition state to flowering though many are commercial cultivars that do not or flower rarely. Sugarcane has been selected for higher sugar content that involved strategies for delayed flowering and seed set, due to which a majority of sugarcane cultivars now are either sterile or the reproductive cycle has been delayed, or dormant for years . Trehalose and its phosphate derivative trehalose-6- phosphate have recently gained importance as signalling molecules involved in carbon partitioning and also linking sugar status and diurnal rhythm to floral transition, in plants [49, 50]. For example, high sucrose and trehalose-6-phosphate (T6P) levels signal a cellular sugar abundance status [37, 51]. In addition to other sugar forms, the role of trehalose in sugarcane sucrose metabolism needs further studies as corroborated by the upregulation of several transcripts for trehalose phosphate synthase and trehalases.
Light interception and the stay-green trait are considered as major factors influencing the level of carbohydrates in the internodes . Leaf angle is a genetic trait and higher sucrose yield in sweet sorghum can be achieved by genetic adjustment of leaf angles to optimum light interception. In addition, stay-green varieties of sweet sorghum were found to have higher stem sugar concentrations than senescing lines . This may be due to the reduced need for re-mobilizing stem sucrose in addition to prolonged photosynthetic capacity . Similarly, the upregulation of stay-green gene transcripts in the high sugar genotypes indicates an association between high sugar levels and higher photosynthetic capacity as the C4 enzymes are mainly localized in the chloroplasts. Further, high expression levels of photosynthetic, light harvesting, etiolation, starch, chlorophyll, gene transcripts were observed in high sugar genotypes. In addition, transcripts related to non- photosynthetic NADP malic enzymes  were upregulated in high sugar genotypes for which the functional significance is unknown in sugarcane. The rapid cycling of sugars in non-photosynthetic cells has been referred to as a ‘futile cycle’  because of the continuous and simultaneous synthesis and degradation of sucrose. However, it is recognised that these cycles allow cells to respond in a highly sensitive manner to small changes in the balance between the supply of sucrose and the demand for carbon for respiration and biosynthesis and thus resulting in a strong sink . This remobilisation of stored sucrose as a food supply results in rapid regrowth following stress or in germination of axillary buds of the internode . Photosynthesis, growth and yield are strongly linked to N availability especially in C4 crops .The upregulation of N related transcripts in high sugar genotypes indicate that this is an ongoing process even if the crop has reached maturity.
The general cell related functions and growth, organellar and nuclear functions, biosynthetic pathways of pigments, amino acids, metabolites, hormonal signalling, transcription factors, various other transporters, proteins of transposons, root/stem/leaf related transcripts, were upregulated in the high sugar genotypes. The functions enriched in genes that are differentially expressed between different tissues in each comparison are consistent with the physiological changes associated with the development of that tissue, mainly sucrose content (Figs. 6 and 7). The absence of DEGs in HST vs LST suggests that the top internodes are metabolically active irrespective of their sucrose contents (i.e. high or low sugar genotype). The absence of sucrose related DEGs in HST vs HSB, where the top and bottom internodes of high sugar genotypes show almost similar expression patterns, indicates homogeneity for sucrose content throughout the culm. Further, the high sugar genotypes seem to invest in more fibre and cellulose as revealed by the nature of the transcripts that are differentially expressed between top and bottom internodes (Table 6). Meanwhile, a large number sucrose related DEGs in LST vs LSB shows that a gradient for sucrose exists in the low sugar genotypes. This observation can be inferred in two possible ways. One is that the low sugar genotypes have an active top internode compared to bottom leading to sucrose futile cycling, resulting in less accumulation or the other way could be that the bottom internodes have slowed down metabolically over time, reaching their physiological threshold levels of sucrose. The former is unlikely as there are no acid invertases (cell wall/ vacuolar) expression observed in the DEGs which are involved in the sucrose breakdown. The latter is likely to be the reason and the bottom internodes play a major role in the sucrose content of the genotypes. Also, the bottom internode of high sugar genotypes shows high expression of sucrose related genes. Feedback inhibition or post translational regulation could possibly be involved in the low sugar genotypes having higher expression of the sucrose related genes in the top internode and in turn having a low sugar content. In addition, the low sugar genotypes could also be late maturing genotypes, as some of them are introgression lines (other than the commercial hybrids) not having an established sugar profile or maturity indices yet (for e.g., fibre: sugar ratio). Many factors besides Brix, like ratoonability, vigour, softness, several resistance mechanisms, secondary metabolites, starch, etc., may also differ among the genotypes taken that remain to be evaluated. There were 7814 transcripts unique to HSB vs LSB, and 3667 transcripts unique to LST vs LSB (of the 34,476 DEGs in HSB vs LSB and 30,809 DEGs in LST vs LSB). These transcripts may indicate tissue specificity of the genes or their isoforms which is to be explored. When these unique transcripts were filtered for sucrose/sugar genes, SPS, SPP, SuSy, and sugar transporter genes were more specific to HSB vs LSB whereas, only some of the sugar transporter genes were specific to LST vs LSB (Additional file 8: Tables S24–28).
It was proposed that sucrose accumulation may be regulated by a network of genes induced during culm maturation that contribute to key physiological processes including sugar translocation/transport, fibre synthesis, membrane transport, vacuole development and function, and abiotic stress tolerance [9, 12]. We found a similar trend in this profiling study, in addition to a very high number of differentially expressed sucrose and sugar related transcripts that might help bridge missing links in the interlinking of biosynthetic pathways and their regulatory factors. It is to be studied if sucrose regulates the large number of genes or large number genes is required for controlling this trait. As sucrose emerges as a signalling molecule as seen in the recent studies [34, 50], the all-pervasive nature of this sugar is likely to regulate the growth and developmental processes of the plant. It can be speculated for the presence of master switches or the major regulatory genes of this trait as further genomic information is obtained in the future. Many novel genes, like caffeoyl shikimate esterase that was recently discovered in Arabidopsis and reported to be absent in sugarcane  where found in the upregulated transcripts in our study. Further mining of the transcriptomes would certainly lead to new targets and new aspects for sucrose synthesis and accumulation in sugarcane.
The data reported here provide a comprehensive resource for sucrose related as well as culm maturation related studies in sugarcane. Further studies on a large data set with different developmental time points for genotypes contrasting for sugar content or energy canes that do not accumulate high levels of sugars should indicate targets for further biotechnological approaches. A dedicated analysis of transcription factors, and regulatory elements will further help understanding the complexity of the sugar network. Sucrose accumulation is very dynamic and unlike fruiting organs, the sugarcane culm is continuously exposed to every possible stimulus in the crop, soil and water continuum which results in a plethora of genes that are expressed at any point of time (approximately about 33,000). Although the present study identified more than 30,000 genes regulated and differentially expressed between high and low sugar genotypes, it is hard to pinpoint any particular group of genes or a gene to be responsible for the sucrose content and maintenance. Further, it is not possible for a gene to be lacking or not expressed in either of the groups as sucrose is a primary metabolite and principal transport sugar in sugarcane, which shows that the trait is quantitative and it is under transcriptional control. The machinery for sucrose synthesis is conserved across species and it is supposed that the complexity of sugarcane genome must play an important role in the sucrose levels that are observed in sugarcane. With multiple forms of each enzyme, with their own isoforms, various localizations, compartmentalized processes, the availability of large vacuoles and a unique stem morphology together contributes to the sugarcane stem sucrose content. Further, the availability of multiple isoforms or alleles gives the crop the advantage of buffering against any functional disruption which is the main reason for the instability of transformation events in sugarcane . With these challenges in the sugarcane crop, a multitude of strategies are required for any genetic manipulation or for identification of regulatory genes for important traits particularly sucrose.
Caffeoyl CoA 3-O-methyl transferases
Cinnamoyl CoA reductase
Commercial cane sugar
6 caffeic acid-o-methyl transferase
Differentially expressed genes
Differential gene expression
Expressed sequence tag
False discovery rate
Guanidine tri phosphate
High performance computing
High performance liquid chromatography
High sugar bottom internode
High sugar top internode
Kyoto Encyclopedia of Genes and Genomes
Low sugar bottom internode
Low sugar top internode
Nicotinamide adenosine diphosphate
National Center for Biotechnology Information
Next generation sequencing
- Nr database:
Open reading frame
Phenyl ammonia lyase
Quantitative real-time polymerase chain reaction
Ribonucleic acid sequencing
Reads per kilobase per million mapped reads
Sugarcane assembled sequences
Single nucleotide polymorphism
Saccharum officinarum Gene Index
Sucrose phosphate phosphatase
Sucrose phosphate synthase
Sugar phosphate translocator
Sugar Research Australia
Simple sequence repeats or microsatellites
Sugarcane expressed sequence tag database
Sugarcane Iso-Seq transcriptome database
Transcriptome shot-gun assembly
Vacuolar adenosine triphosphate
Slewinski T, Baker R, Stubert A, Braun D. Tie-dyed2 encodes a callose synthase that functions in vein development and affects symplastic trafficking within the phloem of maize leaves. Plant Physiol. 2012;160:1540–50.
Slewinski TL. Non-structural carbohydrate partitioning in grass stems: a target to increase yield stability, stress tolerance, and biofuel production. J Exp Bot. 2012;63(13):4647–70.
Whittaker A, Botha F. Carbon partitioning during sucrose accumulation in sugarcane internodal tissue. Plant Physiol. 1997;115:1651–9.
Welbaum GE, Meinzer FC. Compartmentation of solutes and water in developing sugarcane stalk tissue. Plant Physiol. 1990;3:1147–53.
Casu R, Dimmock C, Thomas M, Bower N, Knight D, Grof C, McIntyre L, Jackson P, Jordan D, Whan V. Genetic and expression profiling in sugarcane. In: Proc Int soc sugar cane Technol: 2001; 2001. p. 542–6.
Casu R, Grof C, Rae A, McIntyre C, Dimmock C, Manners J. Identification of a novel sugar transporter homologue strongly expressed in maturing stem vascular tissues of sugarcane by expressed sequence tag and microarray analysis. An Int J on Mol Biol, Mol Genet and Biochem. 2003;52(2):371–86.
Casu R, Dimmock C, Chapman S, Grof C, McIntyre C, Bonnett G, Manners J. Identification of differentially expressed transcripts from maturing stem of sugarcane by in silico analysis of stem expressed sequence tags and gene expression profiling. An Int J on Mol Biol, Mol Genet and Biochem. 2004;54(4):503–17.
Casu R, Rae A, Nielsen J, Perroux J, Bonnett G, Manners J. Tissue-specific transcriptome analysis within the maturing sugarcane stalk reveals spatial regulation in the expression of cellulose synthase and sucrose transporter gene families. An Int J on Mol Biol, Mol Genet and Biochem. 2015;89(6):607–28.
Casu RE, Manners JM, Bonnett GD, Jackson PA, McIntyre CL, Dunne R, Chapman SC, Rae AL, Grof CPL. Genomics approaches for the identification of genes determining important traits in sugarcane. Field Crops Res. 2005;92(2–3):137–47.
da Silva JA, Bressiani JA. Sucrose synthase molecular marker associated with sugar content in elite sugarcane progeny. Genet Mol Biol. 2005;28(2):294–8.
Watt DA, McCormick AJ, Govender C, Carson DL, Cramer MD, Huckett BI, Botha FC. Increasing the utility of genomics in unravelling sucrose accumulation. Field Crops Res. 2005;92(2–3):149–58.
Papini-Terzi FS, Rocha FR, Vêncio RZN, Felix JM, Branco DS, Waclawovsky AJ, Del Bem LEV, Lembke CG, Costa MDL, Nishiyama MY, et al. Sugarcane genes associated with sucrose content. BMC Genomics. 2009;10:120.
Arruda P. Sugarcane transcriptome. A landmark in plant genomics in the tropics. Genet Mol Biol. 2001;24(1–4):36.
Iskandar HM, Casu RE, Fletcher AT, Schmidt S, Xu J, Maclean DJ, Manners JM, Bonnett GD. Identification of drought-response genes and a study of their expression during sucrose accumulation and water deficit in sugarcane culms. BMC Plant Biol. 2011;11:12.
Waclawovsky AJ, Sato PM, Lembke CG, Moore PH, Souza GM. Sugarcane for bioenergy production: an assessment of yield and regulation of sucrose content.(report). Plant Biotechnol J. 2010;8:263.
Cardoso-Silva CB, Costa EA, Mancini MC, Balsalobre TWA, Canesin LEC, Pinto LR, Carneiro MS, Garcia AAF, de Souza AP, Vicentini R. De novo assembly and Transcriptome analysis of contrasting sugarcane varieties. PLoS One. 2014;9(2):e88462.
Hoang NV, Furtado A, Donnan L, Keeffe EC, Botha FC, Henry RJ. High-throughput profiling of the fiber and sugar composition of sugarcane biomass. Bioenerg Res. 2016;10(2):400–16.
Furtado A. RNA extraction from developing or mature wheat seeds. Methods Mol Biol. 2014;1099:23–8.
Hoang NV. Analysis of genes controlling biomass traits in the genome of sugarcane (Saccharum spp. hybrids) PhD Thesis, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland. 2017. doi:10.14264/uql.2017.602.
Ali M, Brian AW, Kenneth M, Lorian S, Barbara W. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008;5(7):621.
Baggerly KA, Deng L, Morris JS, Aldaz CM. Differential expression in SAGE: accounting for normal between-library variation. Bioinformatics. 2003;19(12):1477–83.
Robinson MD, McCarthy DJ, Smyth GK. edgeR: a bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–40.
DFCI. 2016. ftp://occams.dfci.harvard.edu/pub/bio/tgi/data/. Accessed 27 Feb 2016. In.
Vettore AL, Da Silva FR, Kemper EL, Arruda P. The libraries that made SUCEST. Genet Mol Biol. 2001;24(1–4):1–7.
Huang X, Madan A. CAP3: a DNA sequence assembly program. Genome Res. 1999;9(9):868.
Hoang NV, Furtado A, Mason PJ, Marquardt A, Kasirajan L, Thirugnanasambandam PP, Botha FC, Henry RJ. A survey of the complex transcriptome from the highly polyploid sugarcane genome using full-length isoform sequencing and de novo assembly from short read sequencing. BMC Genomics. 2017;18(1):395.
Thimm O, Bläsing O, Gibon Y, Nagel A, Meyer S, Krüger P, Selbig J, Müller LA, Rhee SY, Stitt M. MAPMAN: a user-driven tool to display genomics data sets onto diagrams of metabolic pathways and other biological processes. Plant J. 2004;37(6):914.
Conesa A, Götz S. Blast2GO: a comprehensive suite for functional analysis in plant genomics. Int J Plant Genomics. 2008;2008:619832.
Hoang NV, Furtado A, O’Keeffe AJ, Botha FC, Henry RJ. Association of gene expression with biomass content and composition in sugarcane. PLoS One. 2017;12(8):e0183417.
Moore PH. Temporal and spatial regulation of sucrose accumulation in the sugarcane stem. Aust J Plant Physiol. 1995;22(4):661–79.
Vettore AL, Da Silva FR, Kemper EL, Souza GM, Da Silva AM, Ferro MIT, Henrique-Silva F, Giglioti EA, Lemos MVF, Coutinho LL, et al. Analysis and functional annotation of an expressed sequence tag collection for tropical crop sugarcane. Genome Res. 2003;13(12):2725.
Ferreira S, Hotta C, Poelking V, Leite D, Buckeridge M, Loureiro M, Barbosa M, Carneiro M, Souza G. Co-expression network analysis reveals transcription factors associated to cell wall biosynthesis in sugarcane. An Int J on Mol Biol, Mol Genet and Biochem. 2016;91(1):15–35.
Prathima P, Suparna T, Anishma S, Punnya R, Ramalashmi K. Cloning and characterization of a differentially regulated invertase inhibitor gene during sucrose accumulation in sugarcane. J Sugarcane Res. 2014;4:21–8.
Lastdrager J, Hanson J, Smeekens S. Sugar signals and the control of plant growth and development. J Exp Bot. 2014;65(3):799–807.
Eveland AL, Jackson DP. Sugars, signalling, and plant development. J Exp Bot. 2012;63(9):3367–77.
Rolland F, Baena-Gonzalez E, Sheen J. SUGAR SENSING AND SIGNALING IN PLANTS: conserved and novel mechanisms. Annu Rev Plant Biol. 2006;57:675–709.
Tognetti JA, Pontis HG, Martínez-Noël GMA. Sucrose signaling in plants: a world yet to be explored. Plant Signal Behav. 2013;8(3):e23316.
Matsoukas IG, Massiah AJ, Thomas B. Starch metabolism and antiflorigenic signals modulate the juvenile-to-adult phase transition in a rabidopsis. Plant Cell Environ. 2013;36(10):1802–11.
Yang L, Xu M, Koo Y, He J, Poethig RS. Sugar promotes vegetative phase change in Arabidopsis Thaliana by repressing the expression of MIR156A and MIR156C. elife. 2013;2:e00260.
Yu S, Cao L, Zhou C-M, Zhang T-Q, Lian H, Sun Y, Wu J, Huang J, Wang G, Wang J-W. Sugar is an endogenous cue for juvenile-to-adult phase transition in plants. elife. 2013;2:e00269.
Maeshima M. TONOPLAST TRANSPORTERS: organization and function. Annu Rev Plant Physiol Plant Mol Biol. 2001;52:469–97.
Jacobsen KR, Fisher DG, Maretzki A, Moore PH. Developmental changes in the anatomy of the sugarcane stem in relation to phloem unloading and sucrose storage. Botanica Acta. 1992;105(1):70–80.
Botha FC and Black KG. Sucrose phosphate synthase and sucrose synthase activity during maturation of internodal tissue in sugarcane. Funct Plant Biol. 2000;27(1):81–5.
Kim M, Day D. Composition of sugar cane, energy cane, and sweet sorghum suitable for ethanol production at Louisiana sugar mills. Official J Soc Ind Microbiol. 2011;38(7):803–7.
Vicentini R, Bottcher A, Brito M, Santos A, Creste S, Landell G, Cesarino I, Mazzafera P. Large-scale Transcriptome analysis of two sugarcane genotypes contrasting for lignin content. PLoS One. 2015;10(8):e0134909.
Babb VM, Haigler CH. Sucrose phosphate synthase activity rises in correlation with high-rate cellulose synthesis in three heterotrophic systems (1). Plant Physiol. 2001;127(3):1234.
PDC S, Palhares AC, Taniguti LM, Peters LP, Creste S, Aitken KS, Van Sluys M-A, Kitajima JP, MLC V, Monteiro-Vitorello CB. RNAseq transcriptional profiling following whip development in sugarcane smut disease.(research article)(report). PLoS One. 2016;11(9):e0162237.
Rae A, Perroux J, Grof C. Sucrose partitioning between vascular bundles and storage parenchyma in the sugarcane stem: a potential role for the ShSUT1 sucrose transporter. An Int J Plant Biol. 2005;220(6):817–25.
Eastmond PJ, van Dijken AJH, Spielman M, Kerr A, Tissier AF, Dickinson HG, Jones JDG, Smeekens SC, Graham IA. Trehalose-6-phosphate synthase 1, which catalyses the first step in trehalose synthesis, is essential for Arabidopsis embryo maturation. Plant J. 2002;29(2):225.
Griffiths CA, Sagar R, Geng Y, Primavesi LF, Patel MK, Passarelli MK, Gilmore IS, Steven RT, Bunch J, Paul MJ, et al. Chemical intervention in plant sugar signalling increases yield and resilience. Nature. 2016;540(7634):574–8.
Lunn JE, Feil R, Hendriks JHM, Gibon Y, Morcuende R, Osuna D, Scheible W-R, Carillo P, Hajirezaei M-R, Stitt M. Sugar-induced increases in trehalose 6-phosphate are correlated with redox activation of ADPglucose pyrophosphorylase and higher rates of starch synthesis in Arabidopsis Thaliana. The Biochem J. 2006;397(1):139.
Cook MG and Evans LT. The roles of sink size and location in the partitioning of assimilates in wheat ears. Func Plant Biol. 1983;10(3):313–27.
Borrell AK, Hammer GL. Nitrogen dynamics and the physiological basis of stay-green in sorghum. Crop Sci. 2000;40(5):1295.
Duncan RR, Bockholt AJ, Miller FR. Descriptive comparison of senescent and nonsenescent sorghum genotypes. Agronomy Journal. 1981;73(5):849–53.
Maurino VG, Saigo M, Andreo CS, Drincovich MF. Non-photosynthetic ‘malic enzyme’ from maize: a constituvely expressed enzyme that responds to plant defence inducers. Plant Mol Biol. 2001;45(4):409.
Dancer J, Hatzfeld W-D, Stitt M. Cytosolic cycles regulate the turnover of sucrose in heterotrophic cell-suspension cultures of Chenopodium Rubrum L. An Int J Plant Biol. 1990;182(2):223–31.
Bull TA, Glasziou K. The evolutionary significance of sugar accumulation in Saccharum. Aust J Biol Sci. 1963;16(4):737.
Birch R, Bower R, Elliott A. Highly efficient, 5′-sequence-specific Transgene silencing in a complex Polyploid. An International Journal devoted to original research in tropical plants. 2010;3(2):88–97.
We gratefully acknowledge the financial support to PPT from the Department of Biotechnology, Government of India, for the Indo-Australian Career Boosting Gold Fellowship. We are grateful to the Australian Agency for International Development (AusAID) for financial support through an Australian Awards Scholarship to NVH. We thank SRA staff in Brandon station, Burdekin, Queensland, Australia for helping with the sample collecting and processing; Ravi Nirmal for helping us in sample collection and transport.
This work was funded by the Queensland Government and Sugar Research Australia (SRA). The funders had no role in the design of the study, collection, analysis, and interpretation of data, nor in writing the manuscript.
Availability of data and materials
All raw RNA-Seq read data used in this work are available from the NCBI SRA database, under the BioProject PRJNA356226, with 56 accession numbers: SRR5258946, SRR5258947, SRR5258950–57, SRR5258960–67, SRR5258970, SRR5258971, SRR5258974- SRR5258977, SRR5258982–87, SRR5258990–97, SRR5259000- SRR5259007, SRR5259010, SRR5259011, SRR5259014-SRR5259017, SRR5259022- SRR5259025, inclusive.
Ethics approval and consent to participate
Sugarcane genotypes were collected from the field planting at Sugar Research Australia’s Brandon station, Queensland, Australia. No ethics approval was required for the conduct of experiments in this study.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file 1: Table S1a.
Sugar profile of the genotypes taken for the study. Table S1b Quality Report of RNA Seq reads of samples used in the study. Table S2 Genotypes and their transcriptome samples taken based on sugar content (Brix). Table S3 Additional information with regard to genotypes selected. Figure S1 Graphical representation of the sugar profiles of the genotypes selected for the study. (XLSX 2248 kb)
Additional file 2: Table S4.
Complete list of DEGs in the SOGI-DGE. Table S5 List of DEGs of SOGI HSB VS LSB. Table S6 List of DEGs of SOGI HST VS HSB. (XLSX 8800 kb)
Additional file 3: Table S7.
List of DEGs in the SUGIT, HST VS HSB. Table S8 List of DEGs in the SUGIT, LST VS LSB. Table S9 List of DEGs in the SUGIT, HST VS HSB. (XLSX 3817 kb)
Additional file 4: Table S10.
List of DEGs in the SAS-DGE. Table S11 List of DEGs in the SAS-DGE. Table S12 List of DEGs in SAS-DGE (Excel workbook). (XLSX 5447 kb)
Additional file 5: Table S13.
qPCR for selected transcripts and correlation analysis with RNA-Seq expression values. Figure S2 Correlation analysis of RNA- Seq and qRT-PCR expression values for selected transcripts. (XLSX 28 kb)
Additional file 6: Table S14.
DEGs of the experiment high sugar bottom vs low sugar bottom with SoGI database. Table S15 DEGs in the experiment low sugar top vs low sugar bottom with SoGI database. Table S16 DEGs in the experiment high sugar top vs high sugar bottom SoGI database. Table S17 DEGs in high sugar bottom vs low sugar bottom with SUGIT database. Table S18 DEGs obtained in High sugar top vs high sugar bottom with SUGIT database. Table S19 DEGs obtained in low sugar top vs low sugar bottom with SUGIT database. Table S20 DEGs in high sugar top vs high sugar bottom experiment with SAS database. Table S21 DEGs in low sugar top vs low sugar bottom experiment with SAS database. Table S22 DEGs in high sugar bottom vs low sugar bottom experiment with SAS database. (DOCX 45 kb)
Additional file 7: Figure S3.
Functional classification of the DEGs obtained in three different DGEs using Mapman annotation. 1. High sugar top vs high sugar bottom internode samples (HST vs HSB), 2. Low sugar top vs low sugar bottom internode samples (LST vs LSB), 3. High sugar bottom vs low sugar bottom internode samples; a, d, g) Saccharum officinarum gene indices, SoGI; b, e, h) Sugarcane long read database, SLRD; c, f, i) Sugarcane assembled sequences, SAS. Table S23 Functional classification of the DEGs obtained in three different DGEs using Mapman annotation. 1. High sugar top vs high sugar bottom internode samples (HST vs HSB), 2. Low sugar top vs low sugar bottom internode samples (LST vs LSB), 3. High sugar bottom vs low sugar bottom internode samples; a, d, g) Saccharum officinarum gene indices, SoGI; b, e, h) Sugarcane long read database, SLRD; c, f, i) Sugarcane assembled sequences, SAS. (XLSX 101 kb)
Additional file 8: Table S24.
Common and unique transcripts between LST vs LSB and HSB vs LSB (SoGI-DGE). Table S25 Common and unique transcripts between LST vs LSB and HSB vs LSB (SUGIT-DGE). Table S26 Common and unique transcripts between LST vs LSB and HSB vs LSB (SAS-DGE). Table S27 Unique transcripts (sucrose/sugar related) in HSB vs LSB (SoGI-DGE). Table S28 Unique transcripts (sucrose/sugar related) in LST vs LSB (SoGI-DGE). (XLSX 3047 kb)
Additional file 9: Figure S4.
Blast2GO and KEGG Mapping for DEGs in the SOGI-DGE with respect to starch and sucrose metabolism. Figure S5 Sucrose emerges as a signalling molecule regulating most of the inter-linked plant functions in sugarcane. The gene expression pattern in culm tissue during sucrose accumulation in the genotypes studied reveals several networks of genes regulated by sucrose, and correlating with the sucrose content of the genotypes studied. (XLSX 469 kb)
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
Thirugnanasambandam, P.P., Hoang, N.V., Furtado, A. et al. Association of variation in the sugarcane transcriptome with sugar content. BMC Genomics 18, 909 (2017). https://doi.org/10.1186/s12864-017-4302-5
- High and low sugar genotypes
- Sucrose genes
- Sugarcane transcriptome