Skip to main content

Haplotype variations of sucrose phosphate synthase B gene among sugarcane accessions with different sucrose content



Sucrose phosphate synthase B (SPSB) gene encoding an important rate-limiting enzyme for sucrose synthesis in sugarcane is mainly expressed on leaves, where its alleles control sucrose synthesis. In this study, genetic variation of SPSB gene represented by different haplotypes in sugarcane was investigated in hybrid clones with high and low sugar content and various accessory species.


A total of 39 haplotypes of SPSB gene with 2, 824 bp in size were identified from 18 sugarcane accessions. These haplotypes mainly distributed on Chr3B, Chr3C, and Chr3D according to the AP85-441 reference genome. Single nucleotide polymorphisms (SNPs) and insertion/deletion (InDels) were very dense (42 bp/sequence variation) including 44 transitional and 23 transversional SNPs among the 39 haplotypes. The sequence diversity related Hd, Eta, and Pi values were all lower in clones of high sucrose content (HS) than those in clones of low sucrose content (LS). The evolutionary network analysis showed that about half SPSB haplotypes (19 out of 39) were clustered into one group, including 6 (HAP4, HAP6, HAP7, HAP9, HAP17 and HAP20) haplotypes under positive selection in comparison to HAP26 identified in Badila (S. officinarum), an ancestry noble cane species and under purification selection (except HAP19 under neutral selection) in comparison to HAP18 identified from India1 (S. spontaneum), an ancestry species with low sugar content but high stress tolerance. The average number of haplotypes under positive selection in HS clones was twice as that in LS. Most of the SNPs and InDels sequence variation sites were positively correlated with sucrose and fiber content and negatively correlated with reducing sugar.


A total of 39 haplotypes of SPSB gene were identified in this study. Haplotypes potentially associated with high sucrose synthesis efficiency were identified. The mutations of SPSB haplotypes in HS were favorable and tended to be selected and fixed. The results of this study are informative and beneficial to the molecular assisted breeding of sucrose synthesis in sugarcane in the future.

Peer Review reports


Sucrose plays a key role in plant growth and development. It is mainly synthesized in the cytoplasm of leave cells. Carbon dioxide (CO2) from the air enters the leaves of plants and is fixed into triose phosphates. Most of the triose phosphate generated is converted to sucrose, which starts from condensation of two triose phosphates to form a fructose 1, 6 biphosphate, and then is hydrolyzed to yield fructose 6 phosphate. Sucrose 6-phosphate synthase catalyzes the reaction of fructose 6-phosphate with uridine diphosphate glucose (UDPG) to form sucrose 6-phosphate. Finally, sucrose 6-phosphate phosphatase removes the phosphate group of sucrose 6-phosphate making sucrose available to be transported to other places of the plants as energy or for storage. Sucrose phosphate synthase (SPS) is a key and irreversible rate-limiting enzyme of sucrose synthesis in plants by catalyzing the conversion of fructose 6-phosphate and UDP-glucose into sucrose-6-phosphate, the substrate for a final step of sucrose synthesis.

Sugarcane is an important economical crop primarily for sugar production. Its sucrose yield can reach up to 50% of dry weight [1]. Extensive studies have been conducted to investigate the gene pathways involved in sucrose synthesis, transportation, and storage [2,3,4,5,6,7,8,9]. Beside the sucrose phosphate synthase genes, the invertase family genes, sucrose synthase family genes, sucrose transporter family genes (SUTs) also have important influence on sucrose accumulation. Sucrose synthase can not only promote the synthesis of sucrose, but also promote the decomposition of sucrose. Sucrose invertase can promote the conversion of sucrose into fructose and starch. Sucrose transporter transport sucrose from the original source to the sink tissue. However, due to the complexity of sugarcane genome, the progress has been slow. Sugarcane is highly polyploidy with chromosome number ranged between 80 and 130 in a ploidy level up to 13x [10]. Therefore, at every single gene locus, multiple alleles on different homo(eo)logous chromosomes (up to 13) are exist, which interact together with certain dosage effects to determine the gene function, no mention about the paralogs in the gigantic genomes of sugarcane.

As a critical enzyme in sucrose synthesis, the SPS is encoded by SPS gene, which has five groups in sugarcane genomes, named SPSA, SPSB, SPSC, SPSD1 and SPSD2 [11, 12]. The expression patterns of the five SPS genes in different sugarcane tissues were different [13]. The SPSC predominantly expressed in both immature and mature leaves [14]; whereas SPSD expressed at similar levels in all tissues examined; SPSA has low expression in leaves and gradually increasing expression level from the meristem region down to internode 7 of the stem [15]. Except SPSA, all SPS genes expressed significantly higher in accessions with high sucrose content than in accessions with low sucrose content [13, 16]. Linkage analysis of a sugarcane mapping population revealed that SPS genes located in different linkage groups and SPSD genes were strongly associated with sugar-related traits [17]. The SPS activity and sucrose content were both enhanced in sugarcane with SPSA gene overexpressed, in addition to increased plant height and effective stem number [18], consistent with McIntyre's findings [17]. The SPSB predominantly expressed in both immature and mature leaves with the highest expression in the whole SPS family genes and was considered to be positively correlated with sucrose content [14]. Improving sucrose content has been one of the major ultimate goals of sugarcane breeding programs. In traditional sugarcane breeding programs, several desirable traits are targeted for selection, which are influenced by allelic (intragenic) interaction, intergenic interaction, environmental factors and interactions between genetic and environmental factors, making the precise selection extremely challenging. Dissecting the effects of different genes, alleles, and environmental factors on the traits of interest are critical for effective selection in the breeding programs. Identifying different gene haplotypes and their functions are one of the important steps in dissecting the genetic factors controlling agronomic traits.

However, there were few reports on the haplotype study of functional genes in sugarcane. Previously haplotypes of sucrose transporter genes, sucrose synthase genes were identified and compared by sequencing homologous bacterial artificial chromosomes (BACs) and further approved that every SPS gene does have many haplotypes [9, 19]. However, which haplotypes contribute to sucrose accumulation in sugarcane were largely unknown. Sucrose is mainly synthesized in the leaves. Of the whole SPS family genes, SPSB had a high expression in both immature and mature leaves and was considered to be positively correlated with sucrose content particularly at the early stage of stem sucrose accumulation [15]. This current study is aimed to investigate the genetic variation of different haplotypes or alleles of SPSB gene in different sugarcane accessions with contrastive sucrose content to identify the haplotypes potentially associated with sucrose content. The results of this study will provide valuable reference for haplotype selection in sugarcane to enable molecule assisted breeding for sucrose content improvement.


Sucrose content analyses

According to the Brix measured in the field, a total of 6 high sucrose (HS) hybrid clones including YZ02-588, DeZhe93-88, YT00-236, YZ14-401, YZ14-405, YZ14-407, and 7 low sucrose (LS) hybrid clones: GT12, YZ94-343, YZ14-402, YZ14-403, YZ14-404, YZ14-406 and YZ14-408 were classified (Table 1). The Brix difference of all the accessions ranged from 1.93 to 7.93. The average sucrose content in HS group was 15.90%, higher than that in LS, 13.04%, also higher than that of ROC22 (13.07%), and Badila (Saccharum officinarum) (15.2%). The ANOVA analysis showed that there were significant differences (P < 0.01) between HS and LS groups in terms of Brix (%), sucrose content, and reduced sucrose, but no significant differences within the group (Table 1).

Table 1 The ANOVA results between or within groups of high sucrose content and low sucrose content

The amplification of SPSB gene and sequence analysis

The SPSB gene was successfully amplified from 18 accessions with an amplificon size of 2,824 bp (Additional file 1). A total of 1080 amplicon clones (60 clones for each accession’s amplicon) were picked for Sanger sequencing. In total, 869 amplicon clones were successfully sequenced and 567 of them were assembled into contigs. The number of assemblies derived from each accession ranged from 7 to 52. Through sequence alignment and comparison, a total of 39 haplotypes were identified from the 567 assemblies (Table 2, NCBI accessions: OP615365 ~ OP615403). In all of SPSB haplotypes, the sugarcane ancestral species, S. officinarum (Badila) and S. spontaneum (India1) each contained only one haplotype, while the ancient sugarcane variety, Guangze bamboo cane (S. sinense) possessed four haplotypes, and Katha (S. barberi) had two haplotypes. In the hybrid clones, the number of identified haplotypes ranged from 1 to 13. The 6 HS and 7 LS clones had a total of 60 and 56 haplotypes, respectively, including 9 HS-specific and 5 LS-specific haplotypes, respectively. The haplotype HAP16 only existed in Guangze bamboo cane. All of the haplotypes were located on Chr3D, Chr3C, and Chr3B of S. Spontaneum genome ( (Additional file 2). Most haplotypes (38) were highly homologous compared with Chr3D (99% with no or less gap), Chr3C (98–99% with a few gaps), Chr3B (98–99% with a lot of gaps), except Hap17 has highly homologous with Chr3B. Four haplotypes (HAP6, HAP25, HAP33 and HAP38) showed partial homology on other chromosomes of S. spontaneum genome. Among these haplotypes, 18 were caused by intron mutations and 21 were caused by exon mutations. Further look into the partial homology revealed that HAP6 had a Harbinger transposon (Targer site Duplication was "TAA") inserted into the 7th intron of SPSB gene (Predict URL: and HAP25, HAP33 and HAP38 had 277 bases inserted in the third intron of the SPSB gene [20].

Table 2 The haplotype distribution of SPSB gene in 18 accessions

Genetic variation analysis of SPSB alleles

Among the 39 haplotypes, 67 SNPs and 6 InDels were identified with an average SNP density of 42 bp/SNP. Transition SNPs occurred at 44 sites, and transversion at 23 sites in the 39 SPSB haplotypes (Additional file 3). The haplotype diversity (Hd), total number of mutations (Eta), number of polymorphic (segregating) sites (S), and variance of haplotype diversity (Hv) in HS clones were all relatively lower than those in LS group (Table 3), which indicated that the mutations in HS were favorable and tended to be selected and fixed.

Table 3 The genetic diversity analysis of haplotypes from high sucrose clones and low sucrose clones

The nucleotide diversity (Pi) of 39 SPSB haplotypes showed that the nucleotide mutation positions scattered across the SPSB gene (Fig. 1a). The haplotypes in HS groups had much higher variations in the first 500 bp of the SPSB amplicon sequences while the haplotypes in LS groups had high variation at 2000–2500 bp region (Fig. 1b, c). The high variation in HS contained the SNP mutations in haplotypes, HAP1, HAP5, HAP7, and insert sequence in HAP25, HAP33, and HAP38. The average Pi value of HS was 0.00268, lower than that of LS (0.00275) (Table 3), indicating that the DNA sequences in HS group was relatively conservative comparing to LS. Through variable splicing enzyme recognition site analysis, HAP25, HAP33 and HAP38 had YNYYRAY-type recognition sites [21] in the 3rd intron, and HAP6 had two and one YNYYRAY-type recognition sites in the 7th and 8th intron, respectively. Compared with other haplotypes, HAP15 had an additional YNYYRAY recognition site in the 4th intron. According to these haplotype variable splicing recognition sites, there was no significant difference in the distribution of HAP6, HAP15, HAP25, HAP33, and HAP38 between the HS and LS groups.

Fig. 1
figure 1

Nucleotide variations along the SPSB gene among the (a) 39 haplotypes, (b) haplotypes of low sucrose content clones, and (c) haplotypes of high sucrose content clones. The ordinate represented the nucleotide diversity(Pi), the red dot represented the area of difference

With ROC22 SPSB cDNA (ID: JN584485) as a reference, the coding regions of 39 SPSB haplotypes encoded nine different protein types, of which HS contained six and LS contained eight. A total of 26 haplotypes encoded one protein type (P1 type, Table 2). Majority of the accessions (75%) had these 26 haplotypes. Particularly, HS had an average of 8.7 P1 type haplotypes, while LS had 5.7 P1 type haplotypes. Protein type P7 was the second popular protein type in all the accessions, with 15% of the total haplotypes encoding this protein type.

Haplotype network and phylogenetic analysis

According to the characteristics of haplotype evolutionary network relationship, the haplotypes were divided into four groups (Fig. 2). In Group 1 there were 7 haplotypes encoding 3 protein types, Group 2, 6 haplotypes encoding 2 protein types, Group 3, 6 haplotypes encoding 5 protein types, and Group 4, 19 haplotypes encoding 2 protein types, P1 and P7, the main protein types in all accessions. HAP9 was distributed in the center of Group 4, thus was considered as the primitive haplotype. In addition, the haplotypes of HAP18, HAP26, HAP23 and HAP39 derived from the four ancestral Saccharum clones, India1 (Saccharum spontaneum), Badila (Saccharum officinarum), Katha (Saccharum barberi J), and Guangze bamboo cane (Saccharum sinense R.), respectively, were all clustered in the group 4 centered with HAP9, which further indicated that other haplotypes in Group 4 may have evolved from HAP9. In HS clones, the six HS-specific haplotypes were all clustered in Group 4; while the five LS-specific haplotypes mostly scattered into other different groups, specifically, HAP11 and HAP12 in Group 3, HAP31 in Group 1, HAP34 in Group 2, and only HAP22 in Group 4 at a distal area.

Fig. 2
figure 2

The evolutionary network for the 39 haplotypes. Each link between haplotypes (presented in numbers) represents the minimum of mutational difference. Unlabeled nodes indicate inferred steps not found in the sampled populations. One transversal represents one SNP. The circle size represents the frequency of haplotypes in 18 accessions included in this study

Analysis of evolutionary selection effect of SPSB haplotypes

The analysis of synonymic and non-synonymic mutations between haplotypes showed that the 39 SPSB haplotypes were either under positive, neutral, or purification selection (Fig. 3). With the HAP26 haplotype of Badila (S. officinarum) as the reference, only six (HAP4, HAP6, HAP7, HAP9, HAP17, and HAP20) of 39 haplotypes were under positive selection, all encoding protein 1 and were all in Group 4 in the TCS network (Fig. 2). Interestingly, the average number of the haplotypes under positive selection was 2 in HS group (Additional file 4), which was more than that (1) in LS group. In HS (DZ93-88, YZ14-401, YZ14-405, YZ14-407 and YT00-236), each of the accession contained two or more haplotypes under positive selection. While as in LS group, except YZ14-402 contained two haplotypes under positive selection, all other accessions contained only 1 or 0 haplotypes under positive selection. Particularly, HAP7, a HS-specific haplotype, was one of those under positive selection. HAP22, the only LS-specific haplotype in Group 4 in the TCS network was under purification selection.

Fig. 3
figure 3

The heatmap of Ka/Ks of the 39 haplotypes

Correlation analysis of different SPSB haplotypes and sugar traits

Correlation analyses between sucrose, fiber, reducing sucrose and nucleotide sites with variation among SPSB haplotypes were conducted, respectively, with the HAP26 haplotype of SPSB gene of Badila as the reference (Fig. 4). Among the SNP and InDels mutation sites (a total of 73) of 39 haplotypes of SPSB gene, most of the 73 variation sites were positively correlated with sucrose content and fiber content, but negatively correlated with reducing sugar content. However, several mutation sites such as the 29th nucleotide of HAP1, 2, 40th of HAP16, 48th of HAP10, 11,12,13, and 66th of HAP22 were negatively correlated with sucrose content and positively correlated with reduced sugar content. In the sibling sugarcane hybrid clone pairs with contrastive sugar content, the sucrose content of YZ14-407 without HAP22 was higher than that of YZ14-406 with HAP22, the same as the sibling pair of YZ14-405 and YZ14-404. The haplotypes containing the nucleotide sites correlated to high sucrose and fiber contents and low reduced sugar content, which presumably will be selected in the breeding programs are 31st (HAP4, HAP5), 42nd (HAP1, HAP2, HAP3, HAP21, HAP24, HAP30, HAP31, HAP32, HAP34, HAP35, HAP36, HAP37), 60th (HAP8), and 61th (HAP6) nucleotide sites, among which, 4.6 haplotypes on average were in HS group, and 3.2 in the LS. These haplotypes will be further investigated for their potential for increasing sucrose content in sugarcane.

Fig. 4
figure 4

Sequence variations among 39 SPSB gene haplotypes and their correlations with sucrose content, fiber content and reduced sugar content. Notes: number 1–73 on the top row indicate SNP and InDels sites. Dark red box represents an extremely significant positive correlation. Dark blue represents an extremely significant negative correlation


The study of gene haplotypes provides details of genetic variation gene sequences, which allow us to deeply understand the historical characteristics of gene domestication. The results lay foundations for accelerating functional genome and molecular breeding research in sugarcane particularly with its high polypoid nature. Genetic variation in different haplotypes of functional genes can affect phenotypic or agronomic traits in plants, such as those found in studies of soybean GmST05 gene haplotypes, the natural variation in GmST05 determines transcription levels and influences seed size and quality in soybean [22]. In sugarcane, SPS gene plays a critical role in controlling sucrose synthesis and different haplotypes play different functions in sugarcane and add another layer to the function complexity of gene expression related to sucrose synthesis [23, 24]. Accurate identification of different SPSB haplotypes in sugarcane genomes are critical to understand the genetic control of the sucrose synthesis process, sucrose content difference between different accessions, which will provide referable information on sucrose content improvement in sugarcane breeding programs.

Reliability analysis of SPSB gene haplotype

Identifying gene haplotypes relies on high fidelity during gene fragment amplification, cloning, and sequencing to capture every possible and biological sequence variation among the haplotypes with no or minimum sequence errors. In our study, high fidelity KOD polymerase, which has a very low mutation frequency (0.09%), was used. Meanwhile, every single amplicon of 2,824 bp SPSB gene fragment was cloned, and 60 independent clones were picked and sequenced to capture all the possible haplotypes within each amplicon of the target gene. Extensive manual checks were conducted to ensure the reliability of the reads with no ambiguous bases allowed. Meanwhile, only the haplotype with more than two identical assemblies were considered as a true and reliable haplotype. Of the 39 SPSB haplotypes identified, almost every haplotype had mutations at multiple sites, which were also presented in more than one accession, which further validate the reliability of the haplotype calling. However, due to the high polyploid level of sugarcane, it is possible that we missed detecting all the possible haplotypes, particularly the rare alleles from certain accession. Statistically, we may need 56 successful assemblies from each accession (according to binomial distribution) to not miss any minor haplotype for a 12 × polyploid of sugarcane [25]. However, in our study, we generated 31.5 assemblies per accession in average, ranged from 7 to 52 assemblies across the 18 sugarcane accessions. Some minor haplotypes might not be detected due to low number of successful assemblies. The 39 SPSB haplotypes identified in this study should represent the most dominant haplotypes. Majority of the haplotypes appeared multiple times in different accessions. The most sugarcane accessions had more than one haplotype and some had high number of haplotypes closing to their ploidy level. The downstream genetic variation and correlation analyses of these dominant haplotypes should be informative and valuable for our research goal.

Genetic variation analysis of different haplotypes of SPSB gene

The 39 SPSB gene haplotypes were highly variable with not only frequent SNPs, but also some InDels. For example, HAP15 had a 6-bp insertion in the 4th intron and formed inverted repeats with the existing intron sequence. In HAP17 sequence, a 11-bp was missing from the 7th intron and a 19-bp was inserted in the 4th intron as a direct repeat sequence with its downstream sequence. Due to transposon insertion, HAP6 may be de-methylated to achieve higher expression efficiency [22]. The insertion sequence in HAP25, HAP33 and HAP38 need to be further validated for their relationship with sucrose content. Among the haplotypes produced by intron mutation, HAP2 and HAP1 has a single base mutation in the 3rd intron, HAP7 has a mutation of T-C base in the 4th intron, resulting in its separation from several other haplotypes. The sequence variation in introns among haplotypes could possibly affect the alternative splicing, thus the structure and function of SPSB proteins [26]. In the TCS network analysis, Group 4 was considered as a cluster of haplotypes under positive selection, which also contained haplotypes from ancestral accessions of S. officinarum, S. spontaneum, S.sinense and S. barberi. The results suggested that the haplotypes under positive selection were most likely evolved recently from the ancestral accessions and are contributing to sucrose accumulation in modern sugarcane cultivars. In this study, three pairs of sister lines from the same cross showing contrastive sugar content were compared. Between the pair of sister clones, YZ14-401 and YZ14-402, the HAP9 under positive selection was only present in the high sugar genotype YZ14-401 but not in the low sugar genotype, YZ14-402. The HAP38 from Group4 also exists only in YZ14-401. The results suggested that the Group 4, which contained SPSB haplotypes under positive selection may contribute to a high efficiency in sucrose synthesis. The sister pair clones, YZ14-406 and YZ14-407 had 11 (encoding 3 protein types) and 10 (encoding 2 protein types) haplotypes, respectively, with 3 of them as common ones. The high sugar clone, YZ14-407, had 3 haplotypes under positive selection and the low sugar line, YZ14-406 only had 1 under positive selection. Between sister clones YZ14-404 and YZ14-405, there were 7 common haplotypes. Again, HAP4 under positive selection only existed in high sugar clone. These results indicated that the haplotype under positive selection had important effects on the sucrose synthesis. Meanwhile, dose effect appeared to contribute to the sucrose content. It was noticed that the number of haplotypes under positive selection in high sucrose content clones was higher than that in low sucrose content clones.

Most SNPs and InDels among the 39 haplotypes were positively correlated with sucrose content and fiber content in sugarcane, but negatively correlated with reduced sugar content, which was most likely the results of the long-term domestication and artificial selection of high sugar content of sugarcane clones. Those drastic nucleotide changes such as transversion instead of transition SNPs or big insertion and deletions can change the functionality of SPSB, which may be beneficial to sucrose synthesis and accumulation and thus were selected during domestication and breeding processes.


SPSB gene showed extensive sequence variations in the sugarcane accessions in this study with a huge number of haplotypes or alleles. The number of SPSB haplotype under positive selection was more in HS clones than that in LS clones. The Hd, Eta, and Pi values were all lower in HS clones than those in LS clones. The evolutionary network of all the haplotypes demonstrated that SPSB haplotypes in Group 4 may derived from the haplotype of Badila, an old ancestry noble cane with high sucrose content. The group 4 also contained all haplotypes (HAP4, 6, 7, 9, 17, 20) under positive selection and demonstrating dosage effect. The SNPs and InDels of SPSB gene haplotype under positive selection were correlated with sucrose content and fiber content. The results in this study laid the foundation for further analysis of the functional alleles contributing to sucrose accumulation, which will allow us to develop the functional markers to assist selection of breeding accessions for sugarcane cultivar improvement.


Plant materials

A total of 18 Saccharum accessions were used in this study, including 4 Saccharum ancestry accessions: Badila, India1, Guangze bamboo cane, and Katha, belonging to S. officinarum L., S. spontaneum L., S. sinense R., and S. barberi J., respectively, 6 commercial sugarcane varieties with high or low sucrose performance in commercial production, and 8 interspecific hybrid clones from 5 different cross combinations (Table 4), among which, three pairs of lines with contrastive sucrose content according to Brix (%) in the field were selected, YZ14-401 and YZ14-402, YZ14-404 and YZ14-405, YZ14-406 and YZ14-407 with each pair derived from the same cross. The 4 Saccharum ancestry accessions and 7 commercial varieties were provided by National Nursery of Sugarcane Germplasm Resources (NNSGR) in Kaiyuan, Yunnan province of China; the 8 hybrid lines were provided by Sugarcane Research Institute, Yunnan Academy of Agricultural Sciences, Yunnan province of China (YAAS, YSRI). All the above accessions were planted in the first field station of YSRI in January 2017. Each accession was planted in 3 4-m trenches under maintenance in the same condition as the commercial sugarcane cultivars.

Table 4 Sugar measurement of all the accessions used in this study

Measuring sucrose content and statistics analysis

Brix (%) was firstly measured by extracting juice from the middle internode of mature stem in the field in Dec. 2016 using a hand refractometer Brix (BX/TDS, ATAGO CO., LTD) as a reference for accession selection for the experiment. The sucrose content was measured by collecting five randomly selected mature stalks of each accession in the field by the second optical rotation test of sucrose in November, December of 2017 and January of 2018 for a total of three times [27]. Meanwhile, the remaining bagasse was soaked in boiling water for 30 min, and then the water was removed, the bagasse was put into a drying oven for drying treatment, after the water was completely evaporated, the fiber content ratio of dry matter weight to fresh weight was calculated for accession. The reduced sugar content was determined by tetramethyl salt volumetric analysis method [28]. The average sucrose content, fiber content and reducing sucrose content data were analyzed by using ANOVA. The significance level (P-value) of sucrose difference within and between groups were analyzed by F test in SPSS software.

DNA extraction, SPSB gene amplification, and sequencing

The genomic DNA of the 18 accessions was extracted from freshly collected newly emerged leaf samples by using EasyPure® genomic DNA Kit (TransGen Inc.). Primers for SPSB amplification were designed by using the Primer5.0 according to the DNA sequence of SPSB gene (GenBank ID: JN584485). The forward primer was designed from the third exon and the reverse primer on the 10th exon of SPSB gene. The expected length of the amplified fragment was 2,824 bp, containing eight exons and seven introns (Fig. 5), including an important function domain, glycosyl transferase group 1.

Fig. 5
figure 5

SPSB gene structure and the genomic region amplified by primers, spsB-GR1F and spsB-GT2R. The blue boxes represent exon sequence of SPSB gene, the black lines represent intron sequence, the blue triangle arrows represent primer sites for the amplification. spsB-GT1F: GTGTGCATGGTCTTGTTCGTG; spsB-GT2R: CGCAACCCGTTTCTCCG

The PCR amplification reaction was set by using High-fidelity KOD polymerase (Thermo Scientific Inc., Cat: 11,304,102). PCR products were purified by using EasyPure® PCR purification kit (TransGen Inc., Cat: EP101-02) and then cloned into cloning vectors by using Blunt Zero cloning kit (TransGen Inc., Cat: CB501-01) following the manufacturer’s instructions. For each amplicon from one accession, 60 clones generated from the cloning process were randomly selected for Sanger sequencing in Shanghai Sangon Biotechnology Co., Ltd. (China), and a total of 1080 clones would be generated from 18 accessions.

Haplotype analyses of SPSB gene

The Sanger sequences of each clone were trimmed and assembled by Sequencher (Gene Codes Co., Ann Arbor, MI) with 100% identify and a minimum of six clean nucleotides overlap. At least two or more assemblies with identical sequences were considered as a confirmed haplotype. Multi-alignment of the haplotypes was conducted by using Clustal W [29], and the comparison between haplotypes and Saccharum spontaneum AP85-411 genome was performed by BLASTN 2.6.0 + [30]. MEGA 6 software for protein sequence translation and evolutionary analysis [31]. The number of haplotype (Hn), total number of mutations (Eta), number of polymorphic (segregating) sites (S), haplotype diversity (Hd), variance of haplotype diversity (Hv), nucleotide diversity (Pi) were conducted by using DNAsp [32]. The haplotypes from Badila (S. officinarum) and India1 (S. spontaneum) were used as references for calculating the synonymous (Ks) and non-synonymous (Ka) site by DNAsp. The haplotype network based on the method of Templeton (TCS) was analyzed by using PopART software with default settings [33, 34].

The correlation analysis of sugar content and variation sites along SPSB gene haplotypes

The correlation analysis of sucrose trait and SPSB gene haplotypes was conducted by SPSS. The SPSB gene haplotype from Badila was set as reference, SNPs and insertion/deletion (InDels) between a haplotype and the reference were represented by arbitrary values,but can distinguish the type of mutation. The transition between C—T is 0.5, A—G is 0.555, the transversion between C—A is 7, C—G is 7.777, T—A is 70, and T—G is 70.777. Insertion was set to 900, and deletion to 900.999 for the correlation analysis. The P-value < 0.01 represents an extremely significant correlation between SNPs (InDels) of haplotype and sucrose content. According to the correlation coefficient matrix, a heat map was drawn to compare the correlation between the different SNPs or InDels of SPSB gene haplotype and sucrose, fiber, and reduced sugar content, respectively.

Availability of data and materials

The dataset of the 39 haplotype sequences supporting the conclusions of this article is available in the National Center of Biotechnology Information (NCBI) GenBank with accession numbers of OP615365 ~ OP615403. The rest intermediate analysis data and the plant materials are available upon request from the first author.



Yunnan Academy of Agricultural Sciences, China


Sugarcane Research Institute of Yunnan


National Nursery of Sugarcane Germplasm Resources, China


Single nucleotide polymorphism


Insert and delete sequence


Non-synonymous substitution




The number of haplotype


Total number of mutations


Number of polymorphic (segregating) sites


Haplotype diversity


Variance of haplotype diversity


Nucleotide diversity


The method of Templeton, Crandall and Sing


Analysis of Variance




Sucrose phosphate synthase


High sucrose content accession


Low sucrose content accession


Bacterial artificial chromosomes


Uridine diphosphate glucose




Degrees of freedom


Mean Squared Error


  1. Moore P. Temporal and spatial regulation of sucrose accumulation in the sugarcane stem. Funct Plant Biol. 1995;22(4):661–79.

    Article  CAS  Google Scholar 

  2. Carson D. Botha F Genes expressed in sugarcane maturing internodal tissue. Plant Cell Rep. 2002;20(11):1075–81.

    Article  CAS  Google Scholar 

  3. Cai G, Faleri C, Del Casino C, Emons AM, Cresti M. Distribution of callose synthase, cellulose synthase, and sucrose synthase in tobacco pollen tube is controlled in dissimilar ways by actin filaments and microtubules. Plant Physiol. 2011;155(3):1169–90.

    Article  CAS  Google Scholar 

  4. Coleman HD, Yan J, Mansfield SD. Sucrose synthase affects carbon partitioning to increase cellulose production and altered cell wall ultrastructure. Proc Natl Acad Sci USA. 2009;106(31):13118–23.

    Article  CAS  Google Scholar 

  5. Milne RJ, Perroux JM, Rae AL, Reinders A, Ward JM, Offler CE, Patrick JW, Grof CP. Sucrose transporter localization and function in phloem unloading in developing stems. Plant Physiol. 2017;173(2):1330–41.

    Article  CAS  Google Scholar 

  6. Niu JQ, Huang JL, Phan TT, Pan YB, Yang LT, Li YR. Molecular cloning and expressional analysis of five sucrose transporter (SUT) genes in sugarcane. Sugar Tech. 2019;21(1):47–54.

    Article  CAS  Google Scholar 

  7. Reinders A, Sivitz AB, Hsi A, Grof CP, Perroux JM, Ward JM. Sugarcane ShSUT1: analysis of sucrose transport activity and inhibition by sucralose. Plant Cell Environ. 2006;29(10):1871–80.

    Article  CAS  Google Scholar 

  8. Sachdeva M, Mann APS, Batta SK. Multiple forms of soluble invertases in sugarcane juice: Kinetic and thermodynamic analysis. Sugar Tech. 2003;5(1):31–5.

    Article  CAS  Google Scholar 

  9. Zhang J, Arro J, Chen Y, Ming R. Haplotype analysis of sucrose synthase gene family in three Saccharum species. BMC Genomics. 2013;10(14):314.

    Article  Google Scholar 

  10. Grivet L, Arruda P. Sugarcane genomics: depicting the complex genome of an important tropical crop. Curr Opin Plant Biol. 2002;5(2):122–7.

    Article  CAS  Google Scholar 

  11. Lutfifiyya LL, Xu N, Robert L, D’Ordine Morrell JA, Miller PW, Duff SM. Phylogenetic and expression analysis of sucrose phosphate synthase isozymes in plants. J of Plant Physiol. 2007;7:923–33.

    Article  Google Scholar 

  12. Qin CX, Zhao LH, Huang DL, Gui YY, Sun Y. Role of the SPS Gene Families in the Regulation of Sucrose Accumulation in Sugarcane. Sugar Tech. 2017;19(2):117–24.

    Article  Google Scholar 

  13. Chen LP, Chen YQ, Fang JP, Song XM, Ye BY, Chen RK, Zhang JS. Analysis of relation between SPS gene family expression and sugar accumulation in Sugarcane. Chin J Trop Crops. 2014;35(7):1354–61.

    Google Scholar 

  14. Ma PP, Zhang X, Chen L, Zhao Q, Zhang Q, Hua X, Wang Z, Tang H, Yu Q, Zhang M, Ming R, Zhang J. Comparative analysis of sucrose phosphate synthase (SPS) gene family between Saccharum officinarum and Saccharum spontaneum. BMC Plant Biol. 2020;20(1):422.

    Article  CAS  Google Scholar 

  15. Grof CPL, So CTE, Perroux JM, Bonnett GD, Forrester RI. Research Note: The five families of sucrose-phosphate synthase genes in Saccharum spp. are differentially expressed in leaves and stem. Funct Plant Biol. 2006;6:605–10.

    Article  Google Scholar 

  16. Chen Z, Qin C, Wang M, Liao F, Liao Q, Liu X, Li Y, Lakshmanan P, Long M, Huang D. Ethylene-mediated improvement in sucrose accumulation in ripening sugarcane involves increased sink strength. BMC Plant Biol. 2019;19(1):285.

    Article  Google Scholar 

  17. McIntyre CL, Goode ML, Cordeiro G, Bundock P, Eliott F, Henry RJ, Casu RE, Bonnett GD, Aitken KS. Characterisation of alleles of the sucrose phosphate synthase gene family in sugarcane and their association with sugar-related traits. Mol Breeding. 2015;35(3):286.

    Article  Google Scholar 

  18. Anur RM, Mufithah N, Sawitri WD, Sakakibara H, Sugiharto B. Overexpression of sucrose phosphate synthase enhanced sucrose content and biomass production in transgenic sugarcane. Plants. 2020;9(2):200.

    Article  CAS  Google Scholar 

  19. Zhang Q, Hu W, Zhu F, Wang L, Yu Q, Ming R, Zhang J. Structure, phylogeny, allelic haplotypes and expression of sucrose transporter gene families in Saccharum. BMC Genomics. 2016;17:88.

    Article  Google Scholar 

  20. Kapitonov VV, Jurka J. Harbinger transposons and an ancient HARBI1 gene derived from a transposase. DNA Cell Biol. 2004;23(5):311–24.

    Article  CAS  Google Scholar 

  21. Clancy S. RNA splicing: introns, exons and spliceosome. Nature Education. 2008;1(1):31.

    Google Scholar 

  22. Duan CG, Wang X, Xie S, Pan L, Miki D, Tang K, Hsu CC, Lei M, Zhong Y, Hou YJ, Wang Z, Zhang Z, Mangrauthia SK, Xu H, Zhang H, Dilkes B, Tao WA, Zhu JK. A pair of transposon-derived proteins function in a histone acetyltransferase complex for active DNA demethylation. Cell Res. 2017;27(2):226–40.

    Article  CAS  Google Scholar 

  23. Cai M, Lin J, Li Z, Lin Z, Ma Y, Wang Y, Ming R. Allele specific expression of Dof genes responding to hormones and abiotic stresses in sugarcane. PLoS One. 2020, 16;15(1):e0227716.

  24. Vilela MM, Del Bem LE, Van Sluys MA, de Setta N, Kitajima JP, Cruz GM, Sforça DA, de Souza AP, Ferreira PC, Grativol C, Cardoso-Silva CB, Vicentini R, Vincentz M. Analysis of three sugarcane homo/homeologous regions suggests independent polyploidization events of Saccharum officinarum and Saccharum spontaneum. Genome Biol Evol. 2017;9(2):266–78.

    CAS  Google Scholar 

  25. Yang X, Song J, You Q, Paudel DR, Zhang J, Wang J. Mining sequence variations in representative polyploid sugarcane germplasm accessions. BMC Genomics. 2017;18(1):594.

    Article  Google Scholar 

  26. Calvin K, Li H. RNA-splicing endonuclease structure and function. Cell Mol Life. 2008;65(7–8):1176–85.

    Article  CAS  Google Scholar 

  27. Norton N. A photometric adaptation of the Somogyi method for the determination of glucose. J Biol Chem. 1944;153(2):375–80.

    Article  Google Scholar 

  28. GB/T 5009.7–2016. Determination of reducing sugar in foods. 2016.

  29. Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994;22(22):4673–80.

    Article  CAS  Google Scholar 

  30. Zhang Z, Schwartz S, Wagner L, Miller W. A greedy algorithm for aligning DNA sequences. JComput Biol. 2000;7(1–2):203–14.

    Article  CAS  Google Scholar 

  31. Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: Molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 2013;12:2725–9.

    Article  Google Scholar 

  32. Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009;25(11):1451–2.

    Article  CAS  Google Scholar 

  33. Clement M, Snell Q, Walke P, Posada D, Crandall K. TCS: estimating gene genealogies. in: Parallel and distributed processing symposium, International. IEEE. Computer Society. 2002;3:18.

  34. Leigh JW, Bryant D. POPART: full-feature software for haplotype network construction. Methods Ecol Evol. 2015;6(9):1110–6.

    Article  Google Scholar 

Download references


We would like to thank Dr. Yue-bin, Zhang at Sugarcane Research Institute, Yunnan Academy of Agricultural Sciences for his support and encouragement in the early stages of this project.


This work was supported by grants from the National Key R&D Program of China (2018YFD1000503), NSFC (31760411), Joint Special Project of Basic Agricultural Research of Yunnan Province (202101BD070001-025, 2017FG001-069), The key research plan of Yunnan Province (202203AK140029), and Florida Sugarcane League.

Author information

Authors and Affiliations



HB Liu and JP Wang conceived the study, designed the experiments, and contained the analyzed methods. HB Liu, XQ Lin, XJ Li, ZL Luo, Xin Lu, Q You, XP Yang, CH Xu carried out the experiments and analyzed the data, XL Liu, JY Liu and CW Wu selected and maintained the experimental materials. HB Liu and JP Wang wrote the manuscript. All authors read and approved the final paper.

Authors’ information

Not applicable

Corresponding author

Correspondence to Jianping Wang.

Ethics declarations

Ethics approval and consent to participate

Experimental research and field studies on plants in this study, including the collection of plant material complied with the relevant institutional, national, and international guidelines and legislation.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

The amplification of SPSB gene for 18 germplasms, M: markers, the size is 5000bp, 3000bp, 2000bp, 1500bp, 1000bp, 750bp, 500bp, 250bp, 100bp;1: Badila, 2: India1, 3: Guangze bamboo cane, 4: katha, 5: YZ02-588, 6: Dezhe93-88, 7: YZ14-401, 8: YZ14-405, 9: YT00-236, 10: YZ14-407, 11: GT12, 12: YZ94-343, 13: YZ14-402, 14: YZ14-403, 15: YZ14-404, 16: YZ14-406, 17: YZ14-408, 18: ROC22.

Additional file 2.

The comparative information between the 39 haplotypes of SPSB gene and the reference sequence of AP85-411 genome.

Additional file 3.

Variation sites of single nucleotide polymorphism (SNP) and InDels in the 39 SPSB haplotypes.

Additional file 4.

The sequence variations of positive selection haplotype in clones of high sucrose content (HS) and low sucrose content (LS).

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, H., Lin, X., Li, X. et al. Haplotype variations of sucrose phosphate synthase B gene among sugarcane accessions with different sucrose content. BMC Genomics 24, 42 (2023).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: