Skip to main content

Genomic analysis of SBP gene family in Saccharum spontaneum reveals their association with vegetative and reproductive development



SQUAMOSA promoter binding proteins (SBPs) genes encode a family of plant-specific transcription factors involved in various growth and development processes, including flower and fruit development, leaf initiation, phase transition, and embryonic development. The SBP gene family has been identified and characterized in many species, but no systematic analysis of the SBP gene family has been carried out in sugarcane.


In the present study, a total of 50 sequences for 30 SBP genes were identified by the genome-wide analysis and designated SsSBP1 to SsSBP30 based on their chromosomal distribution. According to the phylogenetic tree, gene structure and motif features, the SsSBP genes were classified into eight groups (I to VIII). By synteny analysis, 27 homologous gene pairs existed in SsSBP genes, and 37 orthologous gene pairs between sugarcane and sorghum were found. Expression analysis in different tissues, including vegetative and reproductive organs, showed differential expression patterns of SsSBP genes, indicating their functional diversity in the various developmental processes. Additionally, 22 SsSBP genes were predicted as the potential targets of miR156. The differential expression pattern of miR156 exhibited a negative correlation of transcription levels between miR156 and the SsSBP gene in different tissues.


The sugarcane genome possesses 30 SsSBP genes, and they shared similar gene structures and motif features in their subfamily. Based on the transcriptional and qRT-PCR analysis, most SsSBP genes were found to regulate the leaf initial and female reproductive development. The present study comprehensively and systematically analyzed SBP genes in sugarcane and provided a foundation for further studies on the functional characteristics of SsSBP genes during different development processes.

Peer Review reports


Various transcription factors have revealed their critical roles in organism-specific function by activating or suppressing the expression of target genes [1]. The SQUAMOSA promoter binding (like) proteins (SBPs/SPLs) represent a major family of plant-specific transcription factors. SBPs/SPLs proteins share a highly conserved 76 amino acids in length DNA binding domain, also known as SBP binding domain [2]. The first SBP/SPL protein was identified in Antirrhinum majus, and this protein could interact with the promoter sequence of the floral meristem gene SQUAMOSA [3]. As a multigene family, SBP/SPL genes have been characterized from different species ranging from single-cell green algae to multicellular angiosperm [4, 5]. There are 16 SBP/SPL genes identified in Arabidopsis [6], 19 in rice [7], and 41 in soybean [8]. SBP transcription factors play central roles in various aspects of plant development including [2, 9, 10], flower development [11], leaf development [12], plant hormone signaling transduction [13], vegetative to reproductive phase transition [14, 15]. For example, AtSPL3 participates in regulating flowering under long photoperiod, and constitutively expressed SPL3 shows early flowering [6]. AtSPL8 is a central regulator involved in the regulation of microsporogenesis and megasporogenesis. spl8 mutant shows pollen sac development defects, and overexpression SPL8 affects plant fertility by GA-dependent signaling pathway [16]. Moreover, SPL8 and other SPL genes influence gynoecium patterning through mediating auxin homeostasis [17]. In monocot plants, such as rice and maize, SBP genes are also reported to modulate essential developmental processes. Overexpression of OsSPL14 during the reproductive stage significantly promotes panicle branching and increased grain yield [18]. OsSPL16 is also a regulator of grain size, shape, and quality [19]. OsSPL3 regulates crown root development [20]. For maize, SBP proteins encoding genes, unbranched2 and unbranched3, affect plant architecture and yield traits by regulating the lateral primordia initiation [21].

Numerous studies have revealed that many development processes mediated by SBP proteins are closely related to miR156. It is reported that miR156 in Arabidopsis can complementarily bind to the 3′ UTR of SPL3 mRNA, and reduce its expression level through translation repression or transcript cleavage [10, 11]. In rice, overexpression of OsmiR156 decreased the expression of SPL genes, indicating the conserved interaction relationship between SPL and miR156 [22]. Similarly, miR156 targeted OsSPL16 and OsSPL13 control grain shape, size and quality in rice [19, 23]. In switchgrass, miR156/SPL4 module controls aerial axillary bud formation and biomass yield [24].

Sugarcane (Saccharum spontaneum), one of the most economically valuable plant, is a perennial tropical or subtropical crop, contributing up to about 80% of sugar production and 40% biofuel feedstock in the world [25]. Since 2000 years ago, sugarcane has been cultivated as sugar crop in China and India [26]. This domesticated sugarcane cultivar is a cross between species S. officinarum and S. spontaneum and accounts for the major genome information to modern sugarcane cultivars [27]. Although the genome information of S. spontaneum L. is available [28], little progress has been made in sugarcane germplasm improvement through sexual propagation due to the degeneration of sugarcane reproductive organs [29]. Therefore, unveiling the fundamental mechanism of the sugarcane reproductive developmental process is necessary to develop improved varieties [30].

Concerning recent findings of SBPs roles in Arabidopsis, rice, and other plants, analysis of SBP gene function in sugarcane will undoubtedly accelerate sugarcane germplasm improvement. In this present study, we systematically analyzed the SBP gene family of sugarcane for their gene structure, phylogeny, motif and domain composition, miR156 target site, and expression pattern in various tissues and organs. Besides, the interaction between the SBP genes and miR156 was critically examined to study their functional relationship during the reproductive stage in sugarcane.


Identification and characterization of SBP genes in S. spontaneum

To identify of SBP genes in sugarcane, the HMM profile of the SBP domain was used as a query to search the sugarcane genome database and BLASTP program. Initially, 66 putative SBP proteins were identified from the sugarcane genome database. All the resulting sequences were further checked by SMART and pfam tools to confirm SBP domain. Sixteen proteins without SBP (Cys-Cys-His-Cys, Zn2) motif or with incomplete SBP domain were removed. Finally, 50 SBP proteins were identified and used for further analysis. Among them, 13 SBP genes had 2, 3 or 4 alleles, including 7 SsSBPs with 2 allelic genes, 5 SsSBPs with 3 allelic genes and 1 SsSBP with 4 allelic genes. We named these SsSBPs as SsSBP1 to SsSBP30 based on their chromosomal locations and added − 1 to − 4 for their alleles (Table 1). To futher investigate the conserved status of the SBP domain, 30 SBP protein sequences from sugarcane were aligned to predict conserved domains. The alignment results showed that all SsSBP proteins contained the complete SBP domain and possessed the typical characteristics of SBP domain with two Zinc motifs (Zn1 and Zn2) and one nuclear localization signal (NLS) (Fig. 1).

Table 1 The characteristics of identified SBP genes in sugarcane
Fig. 1
figure 1

Multiple alignment of the highly conserved SBP domains

The detailed information about the SsSBPs was deduced by ExPASy server, including protein length, molecular weight (MW), theoretical isoelectric point (pI) and the grand average of hydropathicity (GRAVY). The length of the SsSBPs ORF region varied from 570 bp (SsSBP5) to 3195 bp (SsSBP3–3) and the protein lengths ranged from 189 to 1064 amino acids. The MW of the proteins ranged from 19.3214 to 116.09958 kDa. The pI ranged from 5.37 to 10.32, and the values of GRAVY were all negative, suggesting that all SsSBPs are hydrophilic. Moreover, the subcellular localization of 50 SsSBP proteins was predicted by ProtComp software and found that all SsSBP proteins localized in the nucleus except SsSBP4 and SsSBP12 proteins, which have no NLS signal, and localize in the cell membrane (Fig. S1; Table 1).

Phylogenetic analysis of the SBP gene families

We selected a total of 293 SBP homologs from 17 representative species from 7 green plant families, including chlorophytes, bryophytes, lycophytes, gymnosperms, basal magnoliophytes, eudicots and monocots, for phylogenetic analysis of SBP. Among them, both Ostreococcus sp. RCC809 and Ostreococcus Lucimarinus in green algae had only one SBP gene; 46 SBP genes exist in soybean. In comparison to the number of genes in these species, SBP genes in S. spontaneum showed an obvious expansion in the number of genes (Fig. 2). To gain further insight into the phylogenetic relationship of SsSBP genes, a phylogenetic tree was constructed using SBP proteins from Arabidopsis, Vitis vinifera, Ananas comosus, Sorghum bicolor and Oryza sativa (Fig. 3). SBP genes from these different species could be classified into 8 groups (I to VIII), and SBP proteins also tend to cluster the similar group. As expected, SsSBPs exhibited a closer relationship with the SBP proteins from S. bicolor and O. sativa. Group V and VII contained maximum SsSBP genes, where SBP genes from S. bicolor and O. sativa were also grouped. While the group I contained only 2 members of SsSBP genes formed the smallest group. This result was in agreement with the conservation analysis of the SBP proteins in other plants like Arabidopsis, grape, rice and sorghum. For example, a relatively high homologous genes, AtSPL6 / SbSBP5 / OsSPL1 / SsSBP8 / SsSBP9 clustered in one evolutionary branch (Fig. 3A). In addition, a ML phylogenetic tree was also constructed based on gene sequence similarity of 50 SsSBP proteins. The result indicated that the alleles of each SsSBP gene cluster in the same group, indicating that their sequences have high homology (Fig. 3B).

Fig. 2
figure 2

Distribution of SBP genes in 17 species

Fig. 3
figure 3

Maximum likelihood phylogenetic tree of SBP gene family

Structure characterization of SBP genes in S. spontaneum

To better understand the genetic diversity of the SsSBP genes, the coding sequence of each gene was compared with their corresponding genomic sequence. The result revealed that the exon of SsSBP genes ranged from 2 to 13 in number (Fig. 4). SsSBP genes in group I contain 4–7 introns, group V contains 2–4 introns, group VI contains 1–2 introns, most of SsSBP members in group VIII contain 2 introns. Interestingly, alleles of SsSBP genes (SsSBP1–1/2, SsSBP3–1/2/4, SsSBP4–1/2, SsSBP15–2/3, SsSBP17–1/2, SsSBP18–1/2, SsSBP20–1/2, SsSBP21–1/2 and SsSBP24–1/2/3) had the same number of exon/introns, although the length of introns varied. Some alleles possessed different exon/intron numbers (SsSBP2–1/2/3, SsSBP7–1/2/3, SsSBP10–1/2, SsSBP11–1/2). The SsSBP3 allele gene SsSBP3–3 possessed the largest number (13) of exons, but the SsSBP3 alleles SsSBP3–1/2/4 only had 3 exons (Fig. 4). These results indicate that the number of exon and intron are diverse in different groups yet nearly consistent within the same group.

Fig. 4
figure 4

The exon/intron structure of sugarcane SBP genes

The SsSBP genes clustered into the same group exhibited similar structure and possessed a similar motif sequence. A total of 20 motifs were identified in SsSBP proteins, designated as motif 1–20 (Fig. 5, Fig. S2). The result expectedly showed that all SBP members contain Motif 1, Motif 2, Motif 3, Motif 5 and Motif 6, which was annotated as the SBP domain. Most of SBP members within the same group exhibited similar motif composition and exon-intron structure. In contrast, some motifs were found to be specific to one or two groups of SsSBP proteins. Motif 8 only appeared in group VIII. Motif 14, Motif 17 and Motif 19 were only present in group II, suggesting that proteins in these groups may have a specific biological function under a given condition. At the same time, the divergence among the different groups indicated their diverse functions.

Fig. 5
figure 5

Motif composition of sugarcane SBP proteins. The motif numbers from 1 to 20 are displayed in different colored boxes

Chromosome distribution and gene duplication of SsSBP genes

The chromosome distribution information of SsSBP genes revealed that 49 of the 50 SsSBP genes are located to the eight chromosomes of S. spontaneum, with the SsSBP30 mapped to the unanchored scaffolds (Fig. 6). On chromosomes 1 and 7, only two SsSBP genes were found. Chromosomes 3 and 5 contain five SsSBP genes. Chromosome 2 had the maximum number of SsSBP genes with 12 members, followed by chromosome 6 with 11 SsSBP genes. In addition, 27 synteny gene pairs were identified in sugarcane using MCScanX software, with 24 pairs of alleles and 3 pairs of nonalleles. It should be defined as a tandem duplication event if a chromosomal region within 200 kb containing two or more genes [31]. According to this criterion, only two tandem duplications (SsSBP4–3/SsSBP6 and SsSBP17–2/SsSBP19) were noticed (Fig. 6A, Table S3). These results indicate that segmental duplication events might significantly contribute to the SsSBP gene expansions than tandem duplication.

Fig. 6
figure 6

Gene location and synteny in sugarcane. A Synteny analysis of SBP genes within sugarcane. B Synteny analysis of SBP genes between sugarcane and sorghum

To further analyze the evolutionary process of SsSBP genes, a comparative analysis of genome synteny blocks between S. spontaneum and Sorghum bicolor was conducted. Sorghum is the closest related diploid to sugarcane, and the comparison of gene structures between these two species provided clues to the evolutionary gene events caused by polyploidization. A total of 37 syntenic gene pairs between S. spontaneum and S. bicolor were found (Fig. 6B, Table S3). To further understand the evolutionary forces on SsSBP genes, the ratio of the synonymous (Ks) and nonsynonymous (Ka) substitutions rate (Ka/Ks) was calculated for estimating the selection pressure of homologous genes, where Ka/Ks < 1 indicates purifying selection, Ka/Ks = 1 means neutral selection and Ka/Ks > 1 indicates positive selection [32]. In this study, with the exception of three gene pairs SsSBP4–1/SsSBP5 (1.0379), SsSBP3–2/SsSBP3–4 (1.98497), SsSBP24–2/SsSBP24–3 (1.00893), Ka/Ks ratios of SsSBP homologous genes were less than 1, indicating that these genes probably underwent a purifying selection (Table S2). Similarly, most Ka/Ks values of sorghum genes were also less than 1, suggested that SBP genes of these two close species underwent a strong purifying selection to reduce adverse mutations after duplication during the evolutionary process (Table S3).

miR156 target prediction, distribution and expression pattern analysis

Previous studies showed that miR156 complementarily binds to SBP genes either at the coding or 3’UTR region and reducs gene expression level through translation repression or transcript cleavage [11, 24]. In this present study, 22 SsSBP genes were found as the targets of miR156, and these genes were mainly distributed in groups V, VI, VII and VIII (Fig. 7A). Among these SsSBP genes, miR156 complementary sequences were at their coding regions except SsSBP5 where miR156 binding in the 3′-UTR (Fig. 7A; Fig. S3). Interestingly, the allelic genes SsSBP3–1, SsSBP3–2, SsSBP3–3 and SsSBP3–4 were all the targets of miR156 (Fig. 7A). Although there are 20 miRNA members of Saccharum sp., only one putative miRNA156 in S. spontaneum is represented in the miRbase database ( v 22.1). Therefore, we performed a genome-wide study and found 29 members of miR156 genes in S. spontaneum genome. These Ssp-miR156 genes were mainly distributed on chromosomes 2, 3, 4, 5, 6, 8, except for chromosomes 1 and 7 (Fig. S4; Table S4). Ssp-miR156a, Ssp-miR156f, Ssp-miR156j and Ssp-miR156k as alleles were localized on chromosome 2A. Chromosome 3A possesses 4 Ssp-miR156a alleles. Chromosome 3B contains 3 miR156 members (2 Ssp-miR156a and 1 Ssp-miR156l). Chromosome 4A has 4 miR156 members (3 Ssp-miR156a and 1 Ssp-miR156i), followed by 2 and 3 members on chromosome 4B (Ssp-miR156a and Ssp-miR156d) and 5A (2 Ssp-miR156a and 1 Ssp-miR156e), respectively. Two Ssp-miR156 were on chromosome 8A (Ssp-miR156e and Ssp-miR156k) and 8D (2 Ssp-miR156b). Only 1 Ssp-miR156 was found on chromosome 2C (Ssp-miR156b), 2D (Ssp-miR156b), 3C (Ssp-miR156a), 6A (Ssp-miR156a) and 6B (Ssp-miR156a). However, no Ssp-miR156 members was found on chromosomes 1 and 7 (Fig. S4; Table S4).

Fig. 7
figure 7

Sequence alignment and expression patterns of miR156 compare with SsSBP genes. A Sequence alignment of the miR156 complementary sequences with the target sites in SsSBP genes. B The expression patterns of miR156 and their targets in different tissue samples. C-J The confirmation of miR156s and their targets by degradome ananlysis

To further gain insight into the role of miR156 during female gametophyte development, we studied the miR156-SBP module during female gametophyte development. The results showed that the expression level of miR156 was mostly enriched in the mature stage of female reproductive development, and relatively low expression levels were found during the stages of AC (Archesporial Cell) to MMC (Megaspore Mother Cell). Generally, the expression level of miR156 increased from the initial stage to the mature stage of the female gametophyte by sRNA-seq analysis (Fig. 7B; Table S5). In addition, the expression profiles of Ssp-miR156 precursors were also quantitatively verified using RT-PCR and qRT-PCR analysis. The results for the relative expression of Ssp-miR156 were consistent with the sRNA-seq data (Fig. S5). On the contrary, the expression level of target SsSBP genes was mostly decreased during the female gametophyte development stages, such as the target SsSBP11–1, SsSBP21–2, SsSBP22 and SsSBP30 (Fig. 7B).

To verify the authenticity of miR156-SBP module in sugarcane, we performed the degradome analysis and found that miR156 family members target the SsSBPs. The miR156k could bind the site 1400 bp of its target SsSBP2–1 (Fig. 7C). Similarily, miR156a binds on the SsSBP3–2 (site 862) (Fig. 7D), SsSBP7–1 (site 850) (Fig. 7E), SsSBP7–3 (site 1110) (Fig. 7F), SsSBP10–1 (site 568) (Fig. 7G), SsSBP10–2 (site 1096) (Fig. 7H), SsSBP11–1 (site 1140) (Fig. 7I) and SsSBP21–1 (site 1065) (Fig. 7J). Taken together, these results suggest that miR156-SBP module is highly conserved, and the regulation pattern has diverged in different species.

Expression profiles analysis of SsSBP genes

To study spatiotemporal expression patterns of SsSBP genes, RNA-seq data of different organs and tissues were analyzed. The expression level of SsSBP genes of leaf development and female reproductive organs is shown by heatmap representation (Fig. 8). As illustrated in Fig. 8A, SsSBP4, SsSBP6, SsSBP13, SsSBP14, SsSBP18, SsSBP21 and SsSBP26 sustained low expression level in sugarcane leaf gradient segments, while SsSBP1 and SsSBP20 showed high expression in the leaf gradient segments. The transcript levels of SsSBP7, SsSBP10, SsSBP19, SsSBP22, SsSBP28 and SsBP29 decreased gradually from base to mature zone of leaf in sugarcane, showed that gene expression decreased following the maturing leaf (Fig. 8A, Table S6).

Fig. 8
figure 8

Expression profiling of SsSBP genes in different sugarcane tissues. The color bar shows the gene expression with log2 (FPKM+ 1). Green colors indicate low expression and red colors indicate high expression

To investigate the SsSBP genes involvement in sugarcane female reproductive organ development, the transcription level of all SsSBP genes was extracted from RNA-seq data of sugarcane female reproductive organs. The heat map represents expression levels in the lines at five developmental stages shown in Fig. 8B. Many SsSBP genes showed different expression patterns among these five development stages. SsSBP1 and SsSBP10 were highly expressed in different stages of female gametophyte development. The transcripts of 7 SsSBP genes (SsSBP13, SsSBP14, SsSBP15, SsSBP16, SsSBP17, SsSBP18, SsSBP19) were zero in all these samples. The expression level of SsSBP7 and SsSBP30 showed differential expression during the female gametophyte development, revealing that these two genes may play an important role in AC and MMC stages (Fig. 8B, Table S7).

We also performed qRT-PCR experiments to confirm the expression level of some SsSBP genes in those different female developmental stages. As shown in Fig. S6, the results of qRT-PCR data were highly consistent with the RNA-seq data for the relative expression of SsSBP genes during the female gametophyte development. Further studies may focus on the role of these genes on female reproductive development.


Sugarcane (S. spontaneum) has been widely domesticated and cultivated for thousands of years for its excellent economic values. It has become essential industrial material for sugar sources [25, 26]. The high quantity of genome data and abundance of increasing high-throughput transcriptome data make it possible to explore gene functions in non-model plants like Saccharum spp. Although the genome information of S. spontaneum L. is available, little progress has been made in sugarcane germplasm innovation and improvement due to the degeneration of sugarcane reproductive organs [28, 29]. Previous studies revealed that SBP genes play crucial roles in plant development, especially in flower development, signaling transduction, and vegetative to reproductive phase transition [13,14,15]. However, the functions of S. spontaneum SBP genes remain unknown, although 17 SPLs were identified in sugarcane without taking alleles into account [33]. As for sugarcane genomic autopolyploidization, we conducted the genome-wide identification of SBP genes and their alleles in S. spontaneum, which resulted in the identification of 30 SBP genes (Fig. 1, Table 1). The number of SBP genes in S. spontaneum was similar to that in P. trichocarpa (28), O. zativa (19), and S. bicolor (19), but smaller than that in G. max (46), indicating that SBP genes in different species underwent different gene duplication events. Based on phylogenetic and gene structure analysis, SsSBP genes could be divided into eight groups (group I-VIII), which is consistent with the results of previous studies on SBP genes [34].

In general, the members of SBP genes clustered into a subgroup shared similar gene structure and functions, suggesting these genes underwent common evolutionary origins. In other words, gene duplication events (segmental and tandem duplication) are the major driving forces for evolution and gene expansion by which many paralogous gene pairs are produced and could help organisms cope with different developmental processes [35]. In our study, a total of 27 duplication events were found in SsSBP genes, consisted of segmental duplications and tandem duplication (Fig. 6). The Ka/Ks ratio is reported as the criterion for estimation the gene duplication. The Ka/Ks ratio of a given > 1 means that the gene has experienced positive selection, = 1 suggests neutral selection and < 1 indicates purifying selection. Based on the values of Ka/Ks ratio, all the SsSBP gene pairs were duplicated under purifying selection except gene pairs SsSBP4–1/SsSBP5 (1.0379), SsSBP3–2/SsSBP3–4 (1.98497), SsSBP24–2/SsSBP24–3 (1.00893) (Table S3). The diversity of SsSBP genes is likely to be motivated by gene duplication and genomic structure variation during the evolutionary process.

Up to now, there is little functional information on the SBP genes of sugarcane. Generally, the gene functions, to a large extent, are correlated to their expression patterns. In this present study, the expression levels of 30 SsSBP genes were examined across the four different leaf gradient segments and five female gametophyte development stages (Fig. 7). Most SsSBPs were predominantly expressed in the initial developmental stages of either leaf development or female gametophyte development. These results were similar with other species in the apical meristem, including apical buds inflorescences and flower buds [9, 10, 22]. Among the SBP genes in Arabidopsis, AtSPL1 and AtSPL12 expressed highly in inflorescences and overexpression of these two genes enhanced the inflorescence thermotolerance [36]. AtSPL2, AtSPL9, AtSPL10, AtSPL11, AtSPL13 and AtSPL15 were reported to control the determination of leaf shape and the transformation of vegetative to reproductive stages [37]. Interestingly, the evolutionary analysis showed that AtSPL1 and AtSPL12 are highly orthologous to SsSBP genes in group II, including SsSBP1, SsSBP20 and SsSBP25. AtSPL2, AtSPL9 and AtSPL10 are orthologous to SsSBP genes in the group V with SsSBP10, SsSBP11, SsSBP12, SsSBP16, and SsSBP29. Based on the belief that homologous genes perform similar functions. SsSBP1 and SsSBP10, which were expressed highly in female gametophyte stages, would be involved in the development of female reproductive organs in sugarcane. Three genes SsSBP1, SsSBP20, SsSBP25 grouped with SsSBP16, SsSBP29 and SsSBP30, which are orthologous to AtSPL2, AtSPL9 and AtSPL10, expressed highly in the sugarcane leaves, confirming their roles in the regulation of leaf development. Certainly, additional studies need to be performed to confirm the potential roles in female gametophyte development (for SsSBP1 and SsSBP10) and leaf development (for SsSBP1, SsSBP20, SsSBP25, SsSBP16, SsSBP29 and SsSBP30).

In addition, miR156/SBP module has been reported to govern many aspects of plant growth and development [10, 17, 24, 38]. Overexpression of miR156 in Arabidopsis significantly repressed the SPL transcription and resulted in the loss of apical dominance, leading to dwarfism, an increase in total leaf number, and plant biomass [39]. Meanwhile, the expression levels of the target SBP genes of miR156 were suppressed in the miR156 overexpressing plants [10, 37]. In the present study, the transcript level of miR156 was abundant in the mature stage of female reproductive development (Fig. 7B). In contrast, most putative target SsSBP genes predicted miR156 target sites showed lower expression levels in these tissues (Fig. 7B). These results suggested that the transcript of miR156 is negatively correlated with the expression of most SsSBP genes (Fig. 8). All together, our results revealed that miR156/SBP module could be used as an important tool to genetically improve crop architecture and productivity.


A total of 30 SBP genes were identified in sugarcane (S. spontaneum) by genome-wide analysis. These SsSBP genes were comprehensively characterized and classified into eight groups. The phylogenetic analysis showed that these genes shared orthologous relationships of SBP members from Arabidopsis and rice. The spatiotemporal expression patterns of these SsSBP genes in different tissues indicate that SsSBP genes may regulate the leaf and female gametophyte development. Our results also showed that miR156 targeted many SsSBP genes. The expression level of miR156 was enriched in the female reproductive mature stages. The different expression levels between the miR156 and SsSBP genes in diverse tissues suggested that miR156/SBP module plays a crucial role in the leaf and female gametophyte development processes (Fig. 9). Taken together, our study provides the foundation for future in-depth elaboration of the potential functions of the SBP genes in the growth and development of sugarcane.

Fig. 9
figure 9

A summary work model of this study for SBP genes in sugarcane


Identification and annotation of SBP genes in sugarcane

Sugarcane genome data, CDS, protein sequence and annotation data were downloaded from the sugarcane Genome database ( [28]. Arabidopsis and other species sequences were searched and downloaded from Phytozome v13 ( [40]. To identify the SBP genes in sugarcane, the HMM profile of the SBP domain (PF03110) was downloaded from the Pfam database ( [41] and used as the query to search the sugarcane genome database. SBP homologs were obtained by running a local BLASTP search using the Arabidopsis and rice SBP sequence as a query against the given protein database with an E-value cutoff of 10− 5. The candidate genes were further confirmed by SMART server ( Sequences without the complete SBP domain were deleted. Finally, all the candidates were confirmed by multiple sequence alignments using DNAMAN software to ensure they contained the SBP domain. ExPASy ( [42] server was used to calculate the detailed information about the SsSBPs in sugarcane, such as molecular weights (MW), isoionic point (pI), and the grand average of hydropathicity (GRAVY). The subcellular localization of the SBP proteins identified was obtained using the ProtComp (v.9.0) software (

Gene structure, sequence alignments and phylogenetic analysis of SsSBP genes

The exon/intron structure of SBP genes was analyzed using Gene Structure Display Server ( [43] by comparing their coding and genomic sequences. Using BLASTP program to search homologous gene pairs among sugarcane and sorghum with the parameter of e-value = 1e-10. The estimation of selection and substitution rates, the non-synonymous (Ka), synonymous (Ks) and Ka/Ks substitution ratios of the homologous gene pairs of sugarcane and sorghum were calculated by the easy Ka/Ks calculation program. MCScanX software [44] was used to detect the gene synteny and collinearity in sugarcane, and the SBPs locations were shown using Circos software [45]. Multiple sequence alignment of SBP protein sequence from Arabidopsis, rice, and sorghum was conducted using the MUSCLE in MEGA (v.6.0) [46]. A phylogenetic tree was constructed using RAxML software ( using the maximum likelihood (ML) method with bootstrap 1000 replications. The phylogenetic tree was displayed and manipulated using the Interactive tree of life (iTOL, [47,48,49].

Conserved motif identification, miR156 target site prediction and distribution

The conserved motifs of SsSBP proteins were identified using the online program MEME ( [50] with the default setting parameters: maximum number of motifs to find was 20; minimum width of motif was 6 and maximum width of motif was 50. The sequence logos of the SsSBP domain were showed by TBtools [51]. To predict the putative target sites of miR156, the cDNA sequences of SsSBP genes were analyzed using psRNATarget tool ( The chromosome location information of the Ssp-miR156s and SBPs were searched in sugarcane genome databases, and MapInspect software was used to generate chromosomal distribution information.

Plant material and sample preparation

The sugarcane (S. spontaneum L.) cultivar Yuetang 91–976 was grown and collected by State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources (Guangxi, China), and all samples from this cultivar was adopted for all experiment. When the plants reached the age of florescence stage, five different stages of the sugarcane female gametophyte development (i.e., AC, MMC, Meiosis, Mitosis and Mature) were collected. All samples were harvested as three biological replicates, which were quick-freeze with liquid nitrogen and stored at ultra-low temperature to facilitate the extraction of RNA.

RNA extraction, expression profiles and qRT-PCR analysis

Total RNA was isolated by the Omega Total RNA kit II (R6934–02, USA). The evaluation of RNA quality was performed by the gel electrophoresis and 2000 spectrophotometer assessment at 260 nm (NanoDrop, Thermo Fisher Scientific), and Illumina sequencing was done using the method of Zhao et al. (2018).

For qRT-PCR analysis, the cDNA was synthesized using the ThermoScript RT-PCR kit (Life Technologies) in a 20 μL volume reaction under the program: 42 °C for 15 min and 85 °C for 15 s. According to the SYBR Premix RT reagent kit system (TaKaRa, Dalian, China), the reaction contains 1 μg RNA prior to qRT-PCR.

To understand the expression profiles of SBP genes, the RNA-seq data of leaf development were downloaded from the Saccharum Genome database ( The RNA-seq data of female reproductive development have been deposited in the European Nucleotide Archive (ENA, accession number PRJEB44944). Different leaf developmental stages, including basal zone, a transitional zone, a maturing zone, a mature zone [52, 53], and the female reproductive development stages, AC, MMC, Meiosis, Mitosis and Mature, were used for the study. The RNA-seq raw reads were filtered by Trimmomatic software with default parameters to obtain clean reads. The clean reads were mapped to the reference genome using Hisat2 [54]. Gene expression was calculated by Cufflinks software [55]. The log2-transformed RPKM value of the expression patterns of SsSBP genes was used to generate the heatmap using the pheatmap package in R software. The expression pattern of miR156 was calculated by count values according to the miRNA-seq data.

To further confirm the expression profiles of the SsSBP genes, qRT-PCR assays were performed in different female reproductive development stages. qRT-PCR was conducted in CFX96 Real-Time System (Bio-Rad) using SYBR Green (TaKaRa) according to the instructions. Each reaction contains 12.5 μL SYBR mixture, 1.0 μL specific primer and 1 μg sample template. Three replicate reactions were performed for each sample under the following program: 95 °C for 30s; 40 cycles of 95 °C for 5 s; 60 °C for 30 s. The primers used in this study are listed in Table S1.

Availability of data and materials

All data generated or analyzed during this study are included in this published article and its supplementary information files. The sequencing data of miRNA that support the findings of this study have been deposited in the NCBI SRA database with BioProject accession no. PRJNA723681 (, and the RNA-seq data of female reproductive development have been deposited in the EMBL Nucleotide Sequence database (ENA) with accession no. PRJEB44944 (, which will be available publicly upon acceptance of the article. The RNA-seq data of leaf development were downloaded from the Saccharum Genome database (


  1. Gong W, Shen YP, Ma LG, Pan Y, Du YL, Wang DH, et al. Genome-wide ORFeome cloning and analysis of Arabidopsis transcription factor genes. Plant Physiol. 2004;135(2):773–82.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  2. Yamasaki K, Kigawa T, Inoue M, Tateno M, Yamasaki T, Yabuki T, et al. A novel zinc-binding motif revealed by solution structures of DNA-binding domains of Arabidopsis SBP-family transcription factors. J Mol Biol. 2004;337(1):49–63.

    Article  CAS  PubMed  Google Scholar 

  3. Klein J, Saedler H, Huijser P. A new family of DNA binding proteins includes putative transcriptional regulators of the Antirrhinum majus floral meristem identity gene SQUAMOSA. Mol Gen Genet. 1996;250(1):7–16.

    Article  CAS  PubMed  Google Scholar 

  4. Riese M, Hohmann S, Saedler H, Munster T, Huijser P. Comparative analysis of the SBP-box gene families in P. patens and seed plants. Gene. 2007;401(1–2):28–37.

    Article  CAS  PubMed  Google Scholar 

  5. Zhang SD, Ling LZ, Yi TS. Evolution and divergence of SBP-box genes in land plants. BMC Genomics. 2015;16(1):787.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Cardon GH, Hohmann S, Nettesheim K, Saedler H, Huijser P. Functional analysis of the Arabidopsis thaliana SBP-box gene SPL3: a novel gene involved in the floral transition. Plant J. 1997;12(2):367–77.

    Article  CAS  PubMed  Google Scholar 

  7. Yang Z, Wang X, Gu S, Hu Z, Xu H, Xu C. Comparative study of SBP-box gene family in Arabidopsis and rice. Gene. 2008;407(1–2):1–11.

    Article  CAS  PubMed  Google Scholar 

  8. Tripathi RK, Goel R, Kumari S, Dahuja A. Genomic organization, phylogenetic comparison, and expression profiles of the SPL family genes and their regulation in soybean. Dev Genes Evol. 2017;227(2):101–19.

    Article  CAS  PubMed  Google Scholar 

  9. Shikata M, Koyama T, Mitsuda N, Ohme-Takagi M. Arabidopsis SBP-box genes SPL10, SPL11 and SPL2 control morphological change in association with shoot maturation in the reproductive phase. Plant Cell Physiol. 2009;50(12):2133–45.

    Article  CAS  PubMed  Google Scholar 

  10. Wu G, Poethig RS. Temporal regulation of shoot development in Arabidopsis thaliana by miR156 and its target SPL3. Development. 2006;133(18):3539–47.

    Article  CAS  PubMed  Google Scholar 

  11. Gandikota M, Birkenbihl RP, Hohmann S, Cardon GH, Saedler H, Huijser P. The miRNA156/157 recognition element in the 3′ UTR of the Arabidopsis SBP box gene SPL3 prevents early flowering by translational inhibition in seedlings. Plant J. 2007;49(4):683–93.

    Article  CAS  PubMed  Google Scholar 

  12. Usami T, Horiguchi G, Yano S, Tsukaya H. The more and smaller cells mutants of Arabidopsis thaliana identify novel roles for SQUAMOSA PROMOTER BINDING PROTEIN-LIKE genes in the control of heteroblasty. Development. 2009;136(6):955–64.

    Article  CAS  PubMed  Google Scholar 

  13. Zhang Y, Schwarz S, Saedler H, Huijser P. SPL8, a local regulator in a subset of gibberellin-mediated developmental processes in Arabidopsis. Plant Mol Biol. 2007;63(3):429–39.

    Article  CAS  PubMed  Google Scholar 

  14. Jung JH, Seo PJ, Kang SK, Park CM. miR172 signals are incorporated into the miR156 signaling pathway at the SPL3/4/5 genes in Arabidopsis developmental transitions. Plant Mol Biol. 2011;76(1–2):35–45.

    Article  CAS  PubMed  Google Scholar 

  15. Yu S, Lian H, Wang JW. Plant developmental transitions: the role of microRNAs and sugars. Curr Opin Plant Biol. 2015;27:1–7.

    Article  CAS  PubMed  Google Scholar 

  16. Unte, U. SJPC. SPL8, an SBP-box gene that affects pollen sac development in arabidopsis. Plant Cell. 2003;15(4):1009–19.

    Article  Google Scholar 

  17. Xing S, Salinas M, Garcia-Molina A, Hohmann S, Berndtgen R, Huijser P. SPL8 and miR156-targeted SPL genes redundantly regulate Arabidopsis gynoecium differential patterning. Plant J. 2013;75(4):566–77.

    Article  CAS  PubMed  Google Scholar 

  18. Miura K, Ikeda M, Matsubara A, Song XJ, Ito M, Asano K, et al. OsSPL14 promotes panicle branching and higher grain productivity in rice. Nat Genet. 2010;42(6):545–9.

    Article  CAS  PubMed  Google Scholar 

  19. Wang S, Wu K, Yuan Q, Liu X, Liu Z, Lin X, et al. Control of grain size, shape and quality by OsSPL16 in rice. Nat Genet. 2012;44(8):950–4.

    Article  CAS  PubMed  Google Scholar 

  20. Shao Y, Zhou HZ, Wu Y, Zhang H, Lin J, Jiang X, et al. OsSPL3, an SBP-domain protein, regulates crown root development in Rice. Plant Cell. 2019;31(6):1257–75.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Chuck GS, Brown PJ, Meeley R, Hake S. Maize SBP-box transcription factors unbranched2 and unbranched3 affect yield traits by regulating the rate of lateral primordia initiation. Proc Natl Acad Sci U S A. 2014;111(52):18775–80.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Xie K, Wu C, Xiong L. Genomic organization, differential expression, and interaction of SQUAMOSA promoter-binding-like transcription factors and microRNA156 in rice. Plant Physiol. 2006;142(1):280–93.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Si L, Chen J, Huang X, Gong H, Luo J, Hou Q, et al. OsSPL13 controls grain size in cultivated rice. Nat Genet. 2016;48(4):447–56.

    Article  CAS  PubMed  Google Scholar 

  24. Gou J, Fu C, Liu S, Tang C, Debnath S, Flanagan A, et al. The miR156-SPL4 module predominantly regulates aerial axillary bud formation and controls shoot architecture. New Phytol. 2017;216(3):829–40.

    Article  CAS  PubMed  Google Scholar 

  25. Zhang J, Arro J, Chen Y, Ming R. Haplotype analysis of sucrose synthase gene family in three Saccharum species. BMC Genomics. 2013;14(1):314.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Henry RJ, Henry R, Kole CJG, genomics, sugarcane bo. Basic information on the sugarcane plant; 2010. p. 1–7.

    Google Scholar 

  27. D'Hont A. Unraveling the genome structure of polyploids using FISH and GISH; examples of sugarcane and banana. Cytogenet Genome Res. 2005;109(1–3):27–33.

    Article  CAS  PubMed  Google Scholar 

  28. Zhang J, Zhang X, Tang H, Zhang Q, Hua X, Ma X, et al. Allele-defined genome of the autopolyploid sugarcane Saccharum spontaneum L. Nat Genet. 2018;50(11):1565–73.

    Article  CAS  PubMed  Google Scholar 

  29. Wang LP, Jackson PA, Lu X, Fan YH, Foreman JW, Chen XK, et al. Evaluation of sugarcane x Saccharum spontaneum progeny for biomass composition and yield components. Crop Sci. 2008;48(3):951–61.

    Article  Google Scholar 

  30. Hoang NV, Furtado A, Mason PJ, Marquardt A, Kasirajan L, Thirugnanasambandam PP, et al. A survey of the complex transcriptome from the highly polyploid sugarcane genome using full-length isoform sequencing and de novo assembly from short read sequencing. BMC Genomics. 2017;18(1).

  31. Holub EBJNRG. The arms race is ancient history in Arabidopsis, the wildflower. Nat Rev Genet. 2001;2(7):516–27.

    Article  Google Scholar 

  32. Yang Z, Bielawski JP. Statistical methods for detecting molecular adaptation. Trends Ecol Evol. 2000;15(12):496–503.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Feng X, Wang Y, Zhang N, Zhang X, Wu J, Huang Y, et al. Systematic identification, evolution and expression analysis of the SPL gene family in sugarcane (Saccharum spontaneum). Trop Plant Biol. 2021;14(3):313–28.

    Article  CAS  Google Scholar 

  34. Liu M, Sun W, Ma Z, Huang L, Wu Q, Tang Z, et al. Genome-wide identification of the SPL gene family in Tartary Buckwheat (Fagopyrum tataricum) and expression analysis during fruit development stages. BMC Plant Biol. 2019;19(1):299.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Du H, Feng BR, Yang SS, Huang YB, Tang YX. The R2R3-MYB transcription factor gene family in maize. PLoS One. 2012;7(6):e37463.

    Article  PubMed  PubMed Central  Google Scholar 

  36. Chao LM, Liu YQ, Chen DY, Xue XY, Mao YB, Chen XY. Arabidopsis transcription factors SPL1 and SPL12 confer plant thermotolerance at reproductive stage. Mol Plant. 2017;10(5):735–48.

    Article  CAS  PubMed  Google Scholar 

  37. Xu M, Hu T, Zhao J, Park MY, Earley KW, Wu G, et al. Developmental functions of miR156-regulated SQUAMOSA PROMOTER BINDING PROTEIN-LIKE (SPL) genes in Arabidopsis thaliana. PLoS Genet. 2016;12(8):e1006263.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. Zhu T, Liu Y, Ma L, Wang X, Zhang D, Han Y, et al. Genome-wide identification, phylogeny and expression analysis of the SPL gene family in wheat. BMC Plant Biol. 2020;20(1):420.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Schwab R, Palatnik JF, Riester M, Schommer C, Schmid M, Weigel D. Specific effects of microRNAs on the plant transcriptome. Dev Cell. 2005;8(4):517–27.

    Article  CAS  PubMed  Google Scholar 

  40. Goodstein DM, Shu S, Howson R, Neupane R, Hayes RD, Fazo J, et al. Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 2012;40(Database issue):D1178–86.

    Article  CAS  PubMed  Google Scholar 

  41. El-Gebali S, Mistry J, Bateman A, Eddy SR, Luciani A, Potter SC, et al. The Pfam protein families database in 2019. Nucleic Acids Res. 2019;47(D1):D427–32.

    Article  CAS  PubMed  Google Scholar 

  42. Artimo P, Jonnalagedda M, Arnold K, Baratin D, Csardi G, de Castro E, et al. ExPASy: SIB bioinformatics resource portal. Nucleic Acids Res. 2012;40(Web Server issue):W597–603.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Hu B, Jin J, Guo AY, Zhang H, Luo J, Gao G. GSDS 2.0: an upgraded gene feature visualization server. Bioinformatics. 2015;31(8):1296–7.

    Article  PubMed  Google Scholar 

  44. Wang Y, Tang H, Debarry JD, Tan X, Li J, Wang X, et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012;40(7):e49.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, et al. Circos: an information aesthetic for comparative genomics. Genome Res. 2009;19(9):1639–45.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  46. Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 2013;30(12):2725–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  47. Letunic I, Bork P. Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res. 2016;44(W1):W242–5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  48. Letunic I, Bork P. Interactive Tree Of Life (iTOL) v4: recent updates and new developments. Nucleic Acids Res. 2019;47(W1):W256–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  49. Letunic I, Bork P. Interactive Tree Of Life (iTOL): an online tool for phylogenetic tree display and annotation. Bioinformatics. 2007;23(1):127–8.

    Article  CAS  PubMed  Google Scholar 

  50. Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, et al. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res. 2009;37(Web Server issue):W202–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  51. Chen C, Chen H, Zhang Y, Thomas HR, Frank MH, He Y, et al. TBtools: an integrative toolkit developed for interactive analyses of big biological data. Mol Plant. 2020;13(8):1194–202.

    Article  CAS  PubMed  Google Scholar 

  52. Zhang Q, Hu W, Zhu F, Wang L, Yu Q, Ming R, et al. Structure, phylogeny, allelic haplotypes and expression of sucrose transporter gene families in Saccharum. BMC Genomics. 2016;17(1):88.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  53. Chen Y, Zhang Q, Hu W, Zhang X, Wang L, Hua X, et al. Evolution and expression of the fructokinase gene family in Saccharum. BMC Genomics. 2017;18(1):197.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  54. Kim D, Paggi JM, Park C, Bennett C, Salzberg SL. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat Biotechnol. 2019;37(8):907–15.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  55. Trapnell C, Roberts A, Goff L, Pertea G, Kim D, Kelley DR, et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat Protoc. 2012;7(3):562–78.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references


We want to thank all the colleagues in our lab for providing valuable discussions and technical assistance. We are grateful to the reviewers for their critical and constructive comments on our manuscript.


This work was supported by Science and Technology Major Project of Guangxi (Gui Ke 2018–266-Z01), National Natural Science Foundation of China (31800262; U1605212; 31761130074; 31600249; 31700279), China Postdoctoral Science Foundation (2018 M632564), Weng Hongwu Academic Innovation Research Fund of Peking University, and a Guangxi Distinguished Experts Fellowship.

Author information

Authors and Affiliations



YQ and XN designed the research and wrote the manuscript. YL and LW performed phylogenetic analysis and annotated the genes on chromosomes and conducted the evolution analysis. MA performed the sRNA-seq and degradome-seq. MA, MZ and YH analyzed data. LY, LW and HC performed qRT-PCR analysis. MA, XN and YQ revised the manuscript. All authors have read and approved the manuscript.

Corresponding authors

Correspondence to Yuan Qin or Xiaoping Niu.

Ethics declarations

Ethics approval and consent to participate

The sugarcane (S. spontaneum L.) cultivar Yuetang 91–976 was grown and collected by State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources (Guangxi, China), and all samples from this cultivar was adopted for all experiment. These plant materials don’t include any wild species at risk of extinction. No specific permits are required for sample collection in this study. We comply with relevant institutional, national, and international guidelines and legislation for plant study.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, Y., Aslam, M., Yao, LA. et al. Genomic analysis of SBP gene family in Saccharum spontaneum reveals their association with vegetative and reproductive development. BMC Genomics 22, 767 (2021).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: