- Open Access
Genome-wide analysis of R2R3-MYB transcription factors family in the autopolyploid Saccharum spontaneum: an exploration of dominance expression and stress response
BMC Genomics volume 22, Article number: 622 (2021)
Sugarcane (Saccharum) is the most critical sugar crop worldwide. As one of the most enriched transcription factor families in plants, MYB genes display a great potential to contribute to sugarcane improvement by trait modification. We have identified the sugarcane MYB gene family at a whole-genome level through systematic evolution analyses and expression profiling. R2R3-MYB is a large subfamily involved in many plant-specific processes.
A total of 202 R2R3-MYB genes (356 alleles) were identified in the polyploid Saccharum spontaneum genomic sequence and classified into 15 subgroups by phylogenetic analysis. The sugarcane MYB family had more members by a comparative analysis in sorghum and significant advantages among most plants, especially grasses. Collinearity analysis revealed that 70% of the SsR2R3-MYB genes had experienced duplication events, logically suggesting the contributors to the MYB gene family expansion. Functional characterization was performed to identify 56 SsR2R3-MYB genes involved in various plant bioprocesses with expression profiling analysis on 60 RNA-seq databases. We identified 22 MYB genes specifically expressed in the stem, of which RT-qPCR validated MYB43, MYB53, MYB65, MYB78, and MYB99. Allelic expression dominance analysis implied the differential expression of alleles might be responsible for the high expression of MYB in the stem. MYB169, MYB181, MYB192 were identified as candidate C4 photosynthetic regulators by C4 expression pattern and robust circadian oscillations. Furthermore, stress expression analysis showed that MYB36, MYB48, MYB54, MYB61 actively responded to drought treatment; 19 and 10 MYB genes were involved in response to the sugarcane pokkah boeng and mosaic disease, respectively.
This is the first report on genome-wide analysis of the MYB gene family in sugarcane. SsMYBs probably played an essential role in stem development and the adaptation of various stress conditions. The results will provide detailed insights and rich resources to understand the functional diversity of MYB transcription factors and facilitate the breeding of essential traits in sugarcane.
Modern cultivated sugarcane (Saccharum spp.) is the primary source of sugar for the world. It is the topmost crop concerning total biomass production and is listed among the 10 most valuable crops . Sugarcane, having a complex genetic background resulting from polyploid interspecific hybrids, was first domesticated approximately 10,000 years ago in New Guinea. Saccharum spontaneum contributes to 10–15% chromosomes in modern sugarcane cultivars, endowing the characteristics such as disease resistance and ratooning capacity . The genome of haploid S. spontaneum has been assembled to the chromosome level and used as the reference genome of sugarcane . Because of the development of multiple transcriptome models in recent times, including those for different tissues, developmental stages, and under various stress treatments, massive RNA-seq data are available, which can provide detailed insights and rich resources for studying sugarcane genes functions.
Transcription factors recognize specific DNA motifs in upstream regions of the genes to regulate their expression. MYB genes constitute one of the most prominent families of plant transcription factors and characteristically possess highly conserved Myb DNA-binding domains, forming a helix-turn-helix structure of about 52 amino acids . MYB genes can be divided into four categories, including MYB-related, R2R3-MYB, R1R2R3-MYB, and atypical MYB, depending on the number of adjacent MYB repeats (R). Proteins with a single or a partial MYB repeat, generally located at either ends or middle of the peptide chain, are MYB-related.
MYB-related proteins include critical telomere binding proteins in maintaining the integrity of the chromosome structure . Moreover, they also play an essential role in regulating gene transcription, e.g., the GARP family of plant Myb-related DNA binding motifs is involved in organ polarity in Arabidopsis . Further, CIRCADIAN CLOCK ASSOCIATED1 (CCA1) and LATE ELONGATED HYPOCOTYL (LHY) genes regulate the plant circadian clock . A small number of members of R1R2R3-MYB genes are found in higher plants. Interestingly, plant R1R2R3-MYB genes share a similar function of regulating the cell cycle control with the animals . R1R2R3-MYB has also been involved in cell differentiation  and plant stress tolerance .
Atypical MYB proteins contain four or more adjacent MYB repeats (R). These proteins have been found to encode in a few plants, e.g., Arabidopsis thaliana, Oryza sativa, Vitis vinifera, Glycine max, Physcomitrella patens (data sources displayed in Materials and Methods 2.1), as shown in Fig. 1. Only a few reports have been published about atypical MYB proteins by now, and the role of these proteins in the plant bioprocesses is mainly unknown. MYB transcription factors binding specific DNA sequence (CAACG/TG) result from domain structure that is formed by two closely packed amino acid sequence repeats(R) . When the MYB gene contains at least two MYB repeats (R), it has transcription factor characteristics and explicitly recognizes the DNA motifs to regulate the gene transcription. R2R3-MYB proteins are the largest subfamily of MYB transcription factors in plants, as well as in S. spontaneum (Fig. 1). R2R3-MYB is characterized by two MYB repeats and the presence of a single amino acid (Leu) in the first (R2) repeat . R2R3-MYB has two MYB repeats and a single amino acid (Leu) inserted in the first (R2) repeat. The R2R3-MYB family’s expansion originated from the R1R2R3-MYB gene ancestor when losing the R1 repeat sequences during evolution  and benefiting from gene duplication events .
MYB genes are widely involved in plant-specific processes, such as differentiation , hormone response , secondary metabolism , environmental stress tolerance , and diseases resistance [19, 20]. At least four MYB genes are involved in lignin biosynthesis in Arabidopsis by activating key regulator genes related to secondary cell wall formation . Under environmental stress, MYB genes have been reported to function in response to adverse stress in Arabidopsis. Moreover, AtMYB2 and AtMYB96 function as transcriptional activators in ABA-inducible gene expression under drought stress . AtMYB96 mediates abscisic acid signaling, induces pathogen resistance response by promoting salicylic acid biosynthesis, and provides drought tolerance via controlling the cuticular wax biosynthesis [20, 23].
This study focused on the R2R3-MYB gene family in the S. spontaneum published sugarcane genome. We provided a detailed overview of phylogenetic relationship, gene structure, regulatory elements, expression profiles, allelic evolution, and functional characterization based on abundant transcriptome data. Our study systematically explored the evolutionary dynamics and functional diversification of SsR2R3-MYB genes and could facilitate future research on sugarcane MYB transcription factors.
Genome-wide identification of R2R3-MYB genes and classification in S. spontaneum genome
Based on the functional annotation of the Myb_DNA-binding domain (PF00249), a total of 418 MYB genes (695 alleles) were identified in the S. spontaneum genome by combining the HMMER program and NCBI-CDD database (Fig. 1). The SsMYB gene family was classified into four distinct subfamilies, including 207 MYB-related (329 alleles), 202 R2R3-MYB (356 alleles), 3 R1R2R3-MYB (3 alleles), and 5 Atypical MYB (7 alleles) genes (detailed data presented in supplementary Table S1). Total 122 SbMYB-related, 125 SbR2R3-MYB, three SbR1R2R3-MYB, and two atypical SbMYB genes were also identified to increase the understanding of the SsR2R3-MYB genes in sorghum (Table S3).
To analyze the plant MYB genes thoroughly, 20 species representing 11 lineages were screened to construct a plant phylogenetic tree with S. spontaneum, including Green algae, Bryophyta, Gramineae, Cruciferous, Leguminous, Rosaceae, Solanaceae, and others. The tree topology reflected the phylogenetic relationship of these species and divergence time (Fig. 1). Plant phylogeny showed that the higher plants possessed more MYB genes than the lower plants, such as green algae (e.g., Ostreococcus lucimarinus, Volvox carteri, and Chlamydomonas reinhardtii). A significant expansion of MYB genes was observed after the Cambrian (about 540 ~ 480MYA), demonstrating an explosive biological diversification episode near the early period . Most of the phylogenetic nodes of plant species were observed in the Cretaceous, a geological period when a typical global warming climate contributed to the terrestrial species diversity . Compared with the other four kinds of grasses, S. spontaneum had one of the most significant MYB genes as predicted by PlantTFDB. One reason is the tetraploid nature of the autopolyploid S. spontaneum (octoploid). However, when corrected for ploidy level, the number of SsR2R3-MYB genes in S. spontaneum was still higher than most of the species, including Arabidopsis thaliana and other grass species. The number of MYB genes with a phylogenetic tree of the plant species indicated the expansion of MYB genes from alga to land plants, consistent with previous reports .
Maximum likelihood phylogenetic tree of R2R3-MYB genes from O. sativa and S. spontaneum showed that the sugarcane genome contained 15 subgroups (G1-G15) OsR2R3-MYB genes (Fig. 2, Table S2) with each distributing rice MYBs. Sugarcane and rice diverged in the Paleogene (67-26MYA) (Fig. 1); the short divergence time indicated relative conservatism of the ortholog genes. As expected, two species of R2R3-MYB genes were evenly distributed in the tree, and most genes in rice clustered with sugarcane, except for G8. However, the number of genes in each clade varied greatly; for instance, the biggest group, G3, contained 38 genes while the group G8 and G13 comprised just two genes. Twenty SsMYB genes from three unique subgroups, G2, and G10, did not contain rice genes, indicating the species’ genetic divergence. Besides, the clusters depicted that the sugarcane MYB family exhibited a more significant number of genes than that in rice, showing a significant expansion of the SsMYB family. In addition, we also constructed an ML phylogenetic tree with Arabidopsis (Figure S1). Comparative analysis of two trees showed some differences because of different species, but overall high similarity implied the topology of reliability.
Analysis of genomic location, gene structure, and regulatory elements
A total of 202 SsR2R3-MYB genes were named in turn according to their physical position on the chromosomes. MYB genes were distributed throughout all 32 chromosomes (Fig. 3c); the autopolyploid S. spontaneum genome comprised of eight homologous groups of four members each . The chromosome distribution map showed that the location of the MYB genes was not evenly distributed. Most of the SsMYB genes were located on Chr3A and Chr7A, encompassing 19 and 16 genes, respectively. About 11 enrichment clusters, tiny fragments on genomic regions containing three MYB genes, were detected, and half of these genes had MYB-binding sites (MBS) depicting potential interaction among each cluster. However, some chromosomes only contained a few MYB genes. For instance, five chromosomes, including Chr2C, Chr2D, Chr6C, Chr8B, and Chr8D, had only one MYB gene.
S. bicolor is one of the closest lineages of sugarcane, possessing relatively perfect genomic sequence data [27, 28]. Total 125 SbR2R3-MYB genes were identified from the available sorghum genome using a similar method (Fig. 1, Table S3). The diversity of the gene structure might be a shred of evidence regarding the evolution of gene families. The Neighbor-Joining method performed the phylogenetical gene structure analysis using diverse gene information (Fig. 3a and Figure S2). The distribution of the tree branches was consistent with the structural features of the genes. Various sorghum genes were clustered with highly similar SsR2R3-MYB genes, e.g., SbMYB92 clustered with SsMYB149 and SsMYB156 while SbMYB27 was clustered with SsMYB30 and SsMYB44. These results sharpened our understanding of the evolution of gene events during sugarcane polyploidization. A total of 19 SsR2R3-MYB genes did not show the presence of intron, including SsMYB154, SsMYB188, SsMYB194, SsMYB170, SsMYB122, SsMYB182, and SsMYB189. Many MYB genes demonstrated a domain with a cross-intron structure.
Cis-elements in promoter regions play an essential role in controlling transcription and expression, which can deepen the understanding of the regulatory function of MYB genes. Total 2000 bp upstream of transcription initiation site (ATG) was regarded as MYB gene promoters and submitted to the PlantCARE for predicting the motifs. Various motifs from 202 SsR2R3-MYB gene promoters were involved in various plant bioprocesses (Fig. 3b). These diversified cis-regulatory elements could be divided into four main categories in terms of function: stress response, hormone response, light response, and plant growth and metabolism. A high percentage of MYB genes in the anaerobic induction (92%) and drought elements (58.9%) indicated that the MYB genes were more likely to function under these stresses. Moreover, a notable gene, MYB88, was found to have 10 LTR motifs, which is a cis-acting element involved in low-temperature responsiveness. The significantly enriched LTR elements (5′-CCG AAA-3′) suggested that the MYB88 gene might be involved in plant metabolic response to cold stress. Many of the MYB genes regulate the plant hormone response, especially methyl jasmonate (MeJA) and abscisic acid (ABA) responsiveness. A total of 75 gene promoters had enriched regulatory elements TGACG-motif (5′-TGACG-3′) and CGTCA-motif (5′-CGTCA-3′) involved in MeJA-responsiveness, while 38 gene promoters enriched regulatory elements ABRE involved in abscisic acid responsiveness. These MYB genes were predicted to regulate MeJA and ABA signaling in plants and function in plant defense and leaf abscission. Furthermore, more than 30 light response-related elements were predicted; for instance, conservative light element G-box was widely present in the upstream sequence of genes. Several regulatory elements were also associated with other plant growth and development functions and regulation of seed growth and meristem development. Genes involved in seed-specific regulation contained the same RY-element (5′-CATGCATG-3′), and the elements involved in meristem expression demonstrated CAT-box (5′-GCC ACT-3′) and NON-box (5′-AGATCGACG-3′) in promoter regions. Finally, 119 genes were scattered on MYB binding sites, and 49 genes showed more than one binding site, suggesting that these genes probably interacted with other MYB genes. Four MYB binding elements were found in 202 SsR2R3-MYB promoters, including CCAAT-box (5′-CAACGG-3′), MBS (5′-CAACTG-3′), MBSI (5′-aaaAaaC(G/C)GTTA-3′), and MRE (5′-AACCTAA-3′). There was only one base difference between the former two elements, which accounted for 80% of the total MYB binding elements, suggesting the conservative nature of the sequence CAACG/TG of the MYB binding site. The autoregulation of plant transcription factors is common in one family, which showed sequence-specific interactions of the family [29, 30]. Dof1 binds the PEPC1 promoter, but Dof2 blocks the transactivation of Dof1 . Some identified MYB genes were co-expressed on the STRING network (http://string-db.org/), such as MYB114, MYB 168, and MYB167; MYB109, MYB108, and MYB47. These MYB genes with MYB binding site indicated the potential interaction effects. Co-expression analysis further supports the hypothesis.
Pervasive gene duplications
Duplication is a striking feature of the plant genome. Gene duplication in the R2R3-MYB gene family occurred during earlier evolution in land plants and contributed to its amplification . We estimated gene duplication events in the S. spontaneum genome by collinearity analysis. A total of 274 collinearity pairs of SsR2R3-MYB genes were identified by Blastp for all protein sequences and evaluated with MCScanX, including 144 allelic pairs and 130 non-allelic pairs (Fig. 4, Table S4). The collinearity relationships revealed that over half of the collinearity genes were concentrated in Chr 3 and Chr 7. The duplication events for MYB genes were predicted. Total 91 (25.84%) genes were tandem repeats, of which one-quarter of genes were located on Chr 7. Furthermore, 146 (39.88%) genes were identified to derive from segmental duplication events; 28.1% genes on Chr 2 and 33.5% on Chr 3 evolved from segmental duplication (Fig. 4, Table S5). Segmental duplication played a critical role in the MYB gene evolution of S. spontaneum, similar to other species. About 66.5% of the R2R3-MYB genes derived from gene duplication events drove the MYB gene family expansion.
Temporal and spatial expression of the R2R3-MYB gene family
To characterize the expression profile of MYB transcription factors, the temporally and spatially expression profiles of 202 SsR2R3-MYB genes were analyzed using a total of 50 RNA-seq data among three transcriptome models, including tissue and developmental stages, leaf developmental gradient, and circadian rhythm. The expression heatmap showed that most of the MYB genes had low expression levels, but 71% of gene expression values were greater than 1 (FPKM) in at least one RNA-seq sample (Fig. 5a, Table S6). Expression values of 15 MYB groups were presented in Table S6, and group G6 genes seemed to be expressed greater than that in the other groups.
Five different expression patterns, i.e., C1-C5, were investigated on the tissue and developmental stages transcriptome by K-means (Fig. 5b). A total of 85 SsR2R3-MYB genes belonging to the C1 and C3 clusters had low expression value, particularly C1 genes with almost no expression. On the contrary, C2 cluster genes displayed a relatively higher expression level in all developmental periods of leaf and stem. Interestingly, 37 genes of the C4 cluster were highly expressed in the stem during the seedling stage, the early stage of the stem formation (Figure S3A). Moreover, in the C5 cluster, 35 genes were highly expressed in the stem during each period, probably playing a regulatory role in the stem development (Figure S3B). The clusters indicated that the global gene expression levels in the stem were significantly higher than those in the leaves, suggesting SsR2R3-MYB genes might play an essential role in stem tissue. The relative expression of SsMYB43, SsMYB52, SsMYB65, SsMYB78, and SsMYB99 were quantified by RT-qPCR, verifying the results of RNA-seq data (Figure S4A); additionally, SsMYB3, SsMYB15, and SsMYB157 predominant expressed in the early stage of stem formation depicted as prophase of the stem (Pro-stem), which was much higher than those in other stem nodes and leaf tissues (Figure S4B).
Sugarcane is a typical C4 plant with high light use efficiency. The developmental gradient model of grass leaves could be used to study C4 photosynthesis and its regulatory factors [33, 34]. The regulatory role of SsMYB genes on C4 photosynthesis was investigated on the developmental dynamical transcriptome of sugarcane leaf. As suggested by the C4 photosynthetic development model, leaves are gradually differentiated for active photosynthesis . A total of 27 differentially expressed SsR2R3-MYB genes were detected by the leaf developmental gradient. Most of the genes (Class I) showed an expression profile, illustrating high value in the early stage of leaf development (Figure S5). Only three genes SsMYB169, SsMYB181, and SsMYB192 in Class II (Fig. 5c, Figure S5), were identified as putative C4-related transcription factors using the method that associated the co-expression pattern with the photosynthetic activity . The expression increased with the development of C4 photosynthesis and displayed the highest accumulation at the leaf mature zone. SsMYB181 and SsMYB192 shared one haplotype gene Sspon.07G0015250 with SsMYB169, as the tandem genes SsMYB181 and SsMYB192 derived from a gene duplication event. Circadian rhythm is another module to study photosynthesis, in which previously identified C4-related regulators could also be verified. Nine SsR2R3-MYB genes showed a significant association of expression profile with the light-dark cycle (Fig. 5d). Their expression patterns were divided into three modules, each module containing three genes. The expression level of SsMYB169, SsMYB159, and SsMYB153 tailed off during the daytime until around 6:00 pm and then gradually recovered till the next cycle. However, SsMYB48, SsMYB57, and SsMYB158 raised their expressions during the day and fell at night. Unexpected but reasonable, the preliminary identification of three C4-related regulators, SsMYB169, SsMYB181, and SsMYB192, also showed daylight expression pattern, hinting at their involvement in the regulation of circadian rhythm, indicating that the three candidate MYB transcription factors were associated with C4 photosynthesis.
MYB genes involved in response to drought and disease-induced stress
The expression patterns of SsR2R3-MYB genes were evaluated under environmental stress (biotic and abiotic stress). Six SsR2R3-MYB genes with significantly differential expression were responsive to drought induction (Fig. 6a, Table S7). The transcripts of four genes, SsMYB54, SsMYB36, SsMYB61, and SsMYB48, rapidly accumulated after drought treatment, but their expression reduced normal level after rehydration. On the other hand, SsMYB29 and SsMYB166 showed the opposite trend. Further, the upstream regulatory elements of these six genes contained the MBS element (5′-CAACTG-3′), which was identified as MYB binding site involved in drought-inducibility. Half of these genes retained more than one MBS.
Pokkah boeng disease of sugarcane (PBD) is one of the most severe and devastating diseases caused by the Fusarium species complex, a fungal pathogen [36, 37]. Nineteen different MYBs were associated with sugarcane PBD-infection and response (Fig. 6b, Table S7). According to the gene expression profiles, these genes included 14 genes with increased expression in defense response and five genes with reduced expression.
Sugarcane mosaic disease is a highly transmissible viral disease present in cane-growing regions worldwide. Sugarcane mosaic virus (SCMV), belonging to the positive-sense single-stranded RNA viruses, reduces yields by damaging chloroplast and blocking photosynthesis [38, 39]. After SCMV infection, 10 SsR2R3-MYB gene expression increased, and one gene, MYB176, decreased, suggesting that these MYB genes were involved in defense against SCMV infection (Fig. 6c, Table S7). We discovered that these MYB genes were unique to sugarcane diseases, indicating the defense specificity of MYB genes for conferring the resistance of sugarcane pokkah boeng and mosaic disease.
The potential function of SsR2R3-MYB genes was predicted on the identified genes with significantly specific expression. Fifty-six SsMYB genes were involved in seven plant bioprocesses (Fig. 6d). Six MYB genes only expressed during seeding stem and were possibly involved in stem differentiation and formation (Fig. 6a). Three MYB genes were identified as candidate C4 photosynthesis regulators, and nine genes responded in the circadian clock. Under diverse stresses, it was seen that 6, 19, and 10 SsR2R3-MYB genes responded to drought, pokkah boeng disease, and mosaic disease, respectively. Notably, SsMYB51 and SsMYB162 illustrated different expression changes between two sugarcane diseases (pokkah boeng and mosaic disease). SsMYB162 significantly accumulated, actively responding to the infection of two diseases (Table S7). However, SsMYB51 showed a different expression pattern, negatively responding to pokkah boeng but positively answering SCMV. Moreover, 13 MYB genes had more than one putative function, indicating their role in diverse plant bioprocesses (Fig. 6d).
Allelic expression dominance drove SsMYB to function in stem
The transcriptional levels of R2R3-MYB allelic genes were compared among different tissues and different developmental stages to investigate the transcriptome dynamics of R2R3-MYB genes in the allopolyploid across eight homoeologous chromosome pairs, of which 25% of the R2R3-MYB genes displayed allelic expression dominance in all samples. The number of expression dominant genes in the A, B, C, and D genomes was 84, 93, 82, and 79, respectively. Further, the allelic genes were compared in pairs, including A-B, A-C, A-D, B-C, B-D, and C-D (Fig. 7a). Both the number of dominant genes in a single set of homoeologous chromosomes and the pairwise comparison of alleles showed no significant allelic dominance. Captivatingly, the number of dominant genes in the stem was more than that in the leaf in each allelic pair comparison. For four sets of homoeologous chromosomes, the percentages increase was 46.5, 90.6, 10.2, 143.4%, corresponding to A, B, C, and D genomes, respectively, and the overall average rise was 64.5%. The transcriptional expression of allelic genes in the stem tissues showed significant differences among different alleles than those in the leaf tissues. Allelic expression dominant genes derived predominantly from stem transcriptomes. Selective pressure analysis demonstrated that the Ka/Ks median values of dominant allelic pairs in the stem and leaf were mainly in the range of 0.3–0.5 (Fig. 7b). The number of dominant genes in the stem with positive selection (Ka/Ks > 1) was twice that in the leaf, indicating MYB dominant genes evolved faster. MYB allelic genes with median Ka/Ks in the range from 0.4 to 0.6 were divided into three categories: dominant, subordinate, and neutral alleles, of which neutral genes were the majority (Fig. 7c). The differentially expressed dominant and subordinate genes might contribute to allelic variation that affected gene expression, function, and phenotype.
Gene duplication played an essential role in gene expansion and functional diversification in the genetic revolution and phenotypic evolution . A total of 202 SsR2R3-MYB genes were identified, the second-highest number of these genes among the 21 essential plant species (displayed in Fig. 1). The number of R2R3-MYB in sugarcane was far more than the other members of the grass family. Nevertheless, sugarcane with octoploid nature had a higher number of MYB genes compared with the other species. The significant enrichment of SsMYB genes probably was affected by two rounds of whole-genome duplication, including allopolyploidization followed by autopolyploidization , or two rounds of autopolyploidization . In grasses, 11 (7.09%) genes in O. sativa were derived from tandem duplications, 26 (21.31%) in B. distachyon, and 24 (15%) in Z. mays, while 44 (28.38%) segmental gene pairs were derived from segmental duplications in O. sativa, 34 (45.08%) in B. distachyon, and 19 (24%) in Z. mays, respectively . The duplication of genes distribution indicated that the MYB genes family expansion in S. spontaneum could be attributed to these duplication events.
The large R2R3-MYB gene family resulted from duplication events, and autopolyploidization demonstrated diverse functions in plant-specific processes. Some genes specially expressed in stem tissues were concentrated in the stem prophase, indicating that these MYB genes might regulate biological processes related to stem development. Stem morphogenesis is tightly associated with the formation and lignification of the secondary wall (the central mechanical tissue in the stems of grass species) . Indeed, some MYB transcription factors are identified to be involved in sugarcane stem development. A previous study revealed that 7 ScMYB genes were correlated with lignin content and biosynthesis . ShMYB78 has been recognized as an activator of suberin biosynthesis and regulates suberin deposition . In Arabidopsis, the asymmetric leaves1 (as1) gene encoding an MYB protein-mediated stem cell function interacted with meristematic genes to regulate the shoot morphogenesis . Furthermore, a group of rice and maize MYB genes (OsMYB46 and ZmMYB46) activated the transcription of secondary cell wall biosynthesis and probably interacted with secondary wall-associated NAC genes . The role of these SsMYB genes in stem development might provide potential genetic resources for sugarcane breeding.
MYB genes also play an important role in leaf development in grasses. In maize, a group of MYB was recognized to be involved in leaf development, as indicated by expression gradients. Myb-ZmRS2, MYB60, and MYB61 influence adaxial/abaxial polarity and stomata patterning . Moreover, some ZmMYBs are highly expressed in the transition zone, affecting secondary cell wall and lignin production . The Class I containing 24 SsR2R3-MYB genes were also inferred to have similar functions. A few MYB genes were identified as C4 regulators and correlated with C4 photosynthetic cell type-specific gene expression. An MYB gene encoding GRMZM2G130149, which regulates the transcription of phosphoenolpyruvate carboxykinase (PEPCK) in Z. mays, was categorized as a C4 transcription factor . Three putative C4 transcription factors (SsMYB169, SsMYB181, and SsMYB192) identified in this study might play a potential role in forming photosynthetic organs and regulating the C4 photosynthetic pathway. The LATE ELONGATED HYPOCOTYL (LHY) gene, encoding an MYB transcription factor, regulated circadian rhythms in Arabidopsis, and MYB-LHY was involved in circadian photoperiod . In sugarcane, nine candidate MYB genes with high expression were associated with the circadian cycle and therefore performed similar functions. The genes associated with leaf development showed relatively low expression levels (FPKM< 10) than those linked with the stem tissues, hinting at a stem-related expression dominance for most SsMYB genes.
Drought is one of the main factors restricting sugarcane growth and sugar production . Identifying particular and novel candidate genes is a great strategy to improve stress tolerance in sugarcane in this context. Certain MYB transcription factors, for instance, MYB_2 , SoMYB18 , ScMYB2S1 & 2 , and ScMYBAS1 , have been associated with the response to drought-induced stress in sugarcane. Six MYB genes were identified as a differential expression in this study, helping understand the sugarcane drought tolerance mechanism.
Following pathogen invasions, plants turn on a series of plant defense mechanisms. MYB transcription factors play a facilitating role in disease resistance by regulating plant hormone metabolism and mediating systemic resistance . AtMYB30 , AtMYB96 , and SpMYB  have already been reported to be involved in disease resistance. Several defense-related MYB candidate genes were identified against pokkah boeng disease and mosaic disease of sugarcane. Hence, MYB genes are a component of plant defense mechanisms against fungal and viral pathogens.
Polyploids are widely distributed among plants, and about 70% of the angiosperms have experienced one or more polyploidization events during their evolution. As a plant genome evolutionary force, polyploidization plays an essential role in speciation and genomic plasticity. In this study, homologous expression dominant genes Ka/Ks of autopolyploid sugarcane were higher than those of neutral genes, consistent with the report of allopolyploid of B. juncea . The asymmetric evolution of alleles facilitated differential expression, then affected plant biological processes. No significant differences were detected between the four unichromosomal genomes A, B, C, and D for MYB dominance. However, MYB homologous expression of dominant genes was more remarkable in number in the stem tissues than that in the leaf, and the former were subject to selection pressures with more powerful, implying that MYB stems dominant genes intensified selection in sugarcane. Not surprisingly, the transcription level of MYB genes more significantly enriched in the stem. The transcriptional advantages of these MYB homologous expression dominant genes in stem tissues might provide new insights for facilitating polyploid crop breeding for sugarcane.
It is the first time deciphering the phylogeny, gene structure, and expression of the MYB family in S. spontaneum. Genome-wide expression analysis demonstrated that SsMYB genes were involved in stem development and stress response. The MYB genes might be engineered to adjust important sugarcane traits, and therefore, these genes would be a promising target for sugarcane genetic improvement.
Materials and methods
Obtainment of MYB genes
The autopolyploid sugarcane S. spontaneum L. genomic sequence was published in 2018 and available online (http://www.life.illinois.edu/ming/downloads/Spontaneum_genome/). The Hidden Markov Model (HMM) profile of the MYB DNA-binding domain (PF00249) downloaded from Pfam database (http://pfam.xfam.org/)  was used to search protein sequences containing MYB domain by hmm search program (HMM3.0) . Then, putative MYB proteins were further screened through the NCBI-CDD database to investigate the former protein sequences and delete the proteins with incomplete domains. SbR2R3-MYB genes were obtained by performing the same sugarcane method without publicly available data for sorghum MYB genes. Data for sorghum protein sequences (the newest version of Sbicolor_454_v3.1.1.) were downloaded from the plant genome website Phytozome (https://phytozome.jgi.doe.gov/). Finally, we identified 418 SsMYB genes and 252 SbMYB genes (Table S1), including 202 SsR2R3-MYB genes and 125 SbR2R3-MYB genes, belonging to haplotype genes. All SsMYBs and SbMYBs sequences were placed in the Supplementary FASTA file. A plant phylogeny tree was constructed by the TimeTree Database . The distribution of MYB family genes in 19 plant species were demonstrated on the previously published reports: Ostreococcus lucimarinus, Volvox carteri and Chlamydomonas reinhardtii from PlantTFDB (http://planttfdb.cbi.pku.edu.cn/), and a public plant transcription factor database , including Physcomitrella patens , Oryza sativa , Brachypodium distachyon , Zea mays , Ananas comosus , Vitis vinifera , Arabidopsis thaliana , Brassica napus , Glycine max , Medicago truncatula , Pyrus bretschneideri , Rosa chinensis , Populus trichocarpa , Beta vulgaris , Solanum tuberosum , and Solanum lycopersicum .
To generate the phylogenetic trees of MYB transcription factor family genes, multiple protein sequence alignment was performed through the MATTF program (https://mafft.cbrc.jp/alignment/server/index.html)  with the reported 88 rice and 138 Arabidopsis R2R3-MYB proteins , respectively. Moreover, different phylogenetic trees were constructed via the maximum likelihood method using software FastTree2 . Neighbor-joining phylogenetic trees of sugarcane and sorghum were performed using MEGA7.0 .
Naming R2R3-MYB genes and gene structure
Because of the autopolyploid nature of sugarcane (S. spontaneum), the identified SsR2R3-MYB genes partly possessed several alleles. The representative gene models for different alleles were screened by comparing the phylogenetic relationship and protein identity with sorghum homology protein and paralogs. Tandem replication genes and paralogs were regarded as new, which gene IDs were followed by P and T, respectively. The 202 representative SsR2R3-MYB genes were named from SsMYB1 to SsMYB202 according to their physical position on the chromosomes. Subsequently, allele names were supplemented with numbers (e.g., The Sspon.01G0002470-1A gene located at the top of chromosome 1A is MYB1–1, and Sspon.01G0002470-2D is named as MYB1–2). In general, MYB1–1 as a representative gene model was directly regarded as MYB1. The naming method of sorghum MYB genes was also treated like that of S. spontaneum.
SsR2R3-MYB genes and CDS sequences come from the newest version of Sspon.v20190103. The domain location was derived from the previous hmm search results. Gene structures were displayed using the Gene Structure Display Server (GSDS2.0) , consisting of the CDS region, intron region, and MYB domain. Each gene structure was arranged according to the phylogenetic location.
Utilizing MCScanX analysis , collinearity relationships of SsR2R3-MYB genes and classifier program were used to sort gene duplication types. The identified collinear gene pairs were mapped to their respective locus in the S. spontaneum genome in a circular diagram using Circos 0.69 .
Regulatory element of upstream sequences
The 2000 bp upstream sequences were extracted from SsR2R3-MYB genes to the PlantCARE website, plant promoter, and cis-element database . Then, we used them to predict regulatory motifs and estimate potentially related functions.
Abundant RNA-seq data showing gene expression
To analyze SsR2R3-MYB gene expression profiles thoroughly, 60 RNA-seq data were conducted to decipher their expressions from our lab and cooperative labs. Tissue and development transcriptome contained RNA-seq data of 16 samples, including leaf, stem, three different development stages viz. seeding (35-day-old), pre-maturity (9-month-old), and maturity (12-month-old) stages in S. spontaneum . The leaf development transcriptome was derived from the second leaf alone, the ligule on 11-day-old seedlings; 15 cm leaves were selected and cut into 15 pieces with one segment per centimeter . Mature leaves corresponding to ligule in S. spontaneum, over 12-month-old, were selected to supply circadian rhythm transcriptome using 19-time points, i.e., 2 h intervals apart from 6:00 am to the second day 4:00 am, and 4 h apart from 6:00 am to the third day 6:00 am.
RNA-seq were extracted from the drought-treatment sugarcane of FN95–1702, a new sugarcane variety for both sugar and energy, bred by Fujian Agriculture and Forestry University. Sugarcane grown to 4–5 leaves was subjected to the natural drought stress treatment in the greenhouse. The mild drought was characterized by soil relative water content of about 55% ~ 60% after 6 days, and severe drought by 25% ~ 30% after 12 days. After a severe drought, rehydration was done, and relative water content was kept around 75% ~ 85%, and then leave samples were retaken (5 days later). The R2R3-MYB gene expression profiles were obtained by Blast mapping to express data with transcripts of unreferenced genomes. RNA-seq for pokkah boeng disease was extracted from hybrid sugarcane ZZ1, which is highly resistant to smut disease but highly susceptible to pokkah boeng disease. According to the severity of the diseased leaves, pokkah boeng disease was divided into five grades from 0 to 5. The mildly diseased leaves (1 or 2 grades) and severely diseased leaves (4 or 5 grades) were selected for analysis, while healthy leaves were used as control (CK). Three samples were extracted for RNA-seq of sugarcane mosaic disease. For the infection experiment, sugarcane grown through virus-free tissue culture was used, and then leaves corresponding to ligule were collected 1 month after the infection, while the control plants were not infected. An expression ratio > 2 (adjusted p-value< 0.05) was considered statistically significant for evaluating differentially expressed genes.
S. spontaneum was planted in Multifunctional Specimen Garden, Institute of Agriculture, Guangxi University. The stem-3 at the third internode and mature leaves were collected for comparing the difference of relative expression between stem and leaf. Pro-stem is short for prophase stem, in which the samples were taken from the stem precursor tissue wrapped in the leaf sheath and located on the upper part of the stem with prominent stem nodes. Combining with stem-3, stem-6, stem-9, and mature leaves was used to verify the expression during the prophase of stem formation. Total RNA was carried out using TRIZOL reagent (Takara), employing the corresponding protocol. The qualified RNA was reverse transcribed to produce cDNA using PrimeScript™ RT reagent Kit with gDNA Eraser reagent (Takara, Japan). Primers were designed by qPCR-PrimerQuest Tool, and qPCR primers were shown in Table S8. Glyceraldehyde-3-phosphate dehydrogenase gene (GAPDH) was selected as a reference gene . The real-time qPCR with three biological replications was performed with SYBR green on Roche Lightcyler® 480 instrument using 2 × TB Green Mix (Takara). The reaction profile was as follows: 95 °C for 30s, followed by 40 cycles of 95 °C for 10s, 60 °C for 30s, and 95 °C for 10s. The relative expression levels were calculated by the 2-△△CT method.
Availability of data and materials
All data generated or analyzed during this study were included in supplementary information files. Genomic data of sugarcane and sorghum for testing were obtained from the autopolyploid S. spontaneum L. genome (http://www.life.illinois.edu/ming/downloads/Spontaneum_genome/) and S. bicolor genome (https://phytozome-next.jgi.doe.gov/info/Sbicolor_v3_1_1). The domain architecture of the MYB genes was downloaded from the Pfam database (http://pfam.xfam.org/family/PF00249/hmm). The sequencing data of sugarcane pokkah boeng disease: SRP127969 (https://www.ncbi.nlm.nih.gov/sra/SRP127969); and sugarcane mosaic virus disease: SRR10058145, SRR10058144 in the GenBank database. RNA-seq of tissues and development stage, leaf segments, and circadian rhythms were downloaded from sugarcane public database (http://sugarcane.zhangjisenlab.cn/sgd/html/mRNA.html).
FAO F. Food and Agriculture Organization of the United Nations - Statistic Division https://www.fao.org/faostat/en/#data/QC. 2019.
D’Hont A, Grivet L, Feldmann P, Rao S, Berding N, Glaszmann JC. Characterisation of the double genome structure of modern sugarcane cultivars (Saccharum spp.) by molecular cytogenetics. MGG. Mol Gen Genet. 1996;250(4):405–13. https://doi.org/10.1007/s004380050092.
Saccharum L, Zhang J, Zhang X, Tang H, Zhang Q, Hua X, et al. Allele-defined genome of the autopolyploid. Nat Genet. 2018;50:1565–73.
Stracke R, Werber M, Weisshaar B. The R2R3-MYB gene family in Arabidopsis thaliana. Curr Opin Plant Biol. 2001;4(5):447–56. https://doi.org/10.1016/S1369-5266(00)00199-0.
Bilaud T, Koering CE, Binet-Brasselet E, Ancelin K, Pollice A, Gasser SM, et al. The Telobox, a Myb-related Telomeric DNA binding motif found in proteins from yeast, Plants and Human. Nucleic Acids Res. 1996;24(7):1294–303. https://doi.org/10.1093/nar/24.7.1294.
Kerstetter RA, Bollman K, Taylor RA, Bomblies K, Poethig RS. KANADI regulates organ polarity in Arabidopsis. Nature. 2001;411(6838):706–9. https://doi.org/10.1038/35079629.
Yakir E, Hilman D, Kron I, Hassidim M, Melamed-Book N, Green RM. Posttranslational regulation of CIRCADIAN CLOCK ASSOCIATED1 in the circadian oscillator of Arabidopsis. Plant Physiol. 2009;150(2):844–57. https://doi.org/10.1104/pp.109.137414.
Ito M, Araki S, Matsunaga S, Itoh T, Nishihama R, Machida Y, et al. G2/M-phase-specific transcription during the plant cell cycle is mediated by c-Myb-like transcription factors. Plant Cell. 2001;13(8):1891–905. https://doi.org/10.1105/tpc.010102.
Haga N, Kato K, Murase M, Araki S, Kubo M, Demura T, et al. R1R2R3-Myb proteins positively regulate cytokinesis through activation of KNOLLE transcription in Arabidopsis thaliana. Development. 2007;134:1101 LP–110.
Dai X, Xu Y, Ma Q, Xu W, Wang T, Xue Y, et al. Overexpression of an R1R2R3 MYB gene, OsMYB3R-2, increases tolerance to freezing, drought, and salt stress in transgenic Arabidopsis. Plant Physiol. 2007;143(4):1739–51. https://doi.org/10.1104/pp.106.094532.
Dubos C, Stracke R, Grotewold E, Weisshaar B, Martin C, Lepiniec L. MYB transcription factors in Arabidopsis. Trends Plant Sci. 2010;15(10):573–81. https://doi.org/10.1016/j.tplants.2010.06.005.
Dias AP, Braun EL, McMullen MD, Grotewold E. Recently duplicated MAIZE R2R3 MYB genes provide evidence for distinct mechanisms of evolutionary divergence after duplication. Plant Physiol. 2003;131:610 LP–620.
Rosinski JA, Atchley WR. Molecular evolution of the Myb family of transcription factors: evidence for polyphyletic origin. J Mol Evol. 1998;46(1):74–83. https://doi.org/10.1007/PL00006285.
Zhang P, Chopra S, Peterson T. A segmental gene duplication generated differentially expressed myb-homologous genes in maize. Plant Cell. 2000;12(12):2311–22. https://doi.org/10.1105/tpc.12.12.2311.
Waites R, Selvadurai HRN, Oliver IR, Hudson A. The PHANTASTICA gene encodes a MYB transcription factor involved in growth and Dorsoventrality of lateral organs in Antirrhinum. Cell. 1998;93(5):779–89. https://doi.org/10.1016/S0092-8674(00)81439-7.
Jin H, Martin C. Multifunctionality and diversity within the plant MYB-gene family. Plant Mol Biol. 1999;41(5):577–85. https://doi.org/10.1023/A:1006319732410.
Bender J, Fink GR. A Myb homologue, ATR1, activates tryptophan gene expression in Arabidopsis. Proc Natl Acad Sci U S A. 1998;95(10):5655–60. https://doi.org/10.1073/pnas.95.10.5655.
Urao T, Yamaguchi-Shinozaki K, Urao S, Shinozaki K. An Arabidopsis myb homolog is induced by dehydration stress and its gene product binds to the conserved MYB recognition sequence. Plant Cell. 1993;5:1529 LP–1539.
He Q, Jones DC, Li W, Xie F, Ma J, Sun R, et al. Genome-wide identification of R2R3-MYB genes and expression analyses during abiotic stress in gossypium raimondii. Sci Rep. 2016;6:1–14.
Seo PJ, Park C-M. MYB96-mediated abscisic acid signals induce pathogen resistance response by promoting salicylic acid biosynthesis in Arabidopsis. New Phytol. 2010;186(2):471–83. https://doi.org/10.1111/j.1469-8137.2010.03183.x.
McCarthy RL, Zhong R, Ye Z-H. MYB83 is a direct target of SND1 and acts redundantly with MYB46 in the regulation of secondary Cell Wall biosynthesis in Arabidopsis. Plant Cell Physiol. 2009;50(11):1950–64. https://doi.org/10.1093/pcp/pcp139.
Abe H, Urao T, Ito T, Seki M, Shinozaki K, Yamaguchi-Shinozaki K. Arabidopsis AtMYC2 (bHLH) and AtMYB2 (MYB) function as transcriptional activators in abscisic acid signaling. Plant Cell. 2003;15:63 LP–78.
Seo PJ, Lee SB, Suh MC, Park M-J, Go YS, Park C-M. The MYB96 transcription factor regulates cuticular wax biosynthesis under drought conditions in Arabidopsis. Plant Cell. 2011;23:1138 LP–1152.
Bowring SA, Grotzinger JP, Isachsen CE, Knoll AH, Pelechaty SM, Kolosov P. Calibrating rates of early Cambrian evolution. Science. 1993;261(5126):1293–8. https://doi.org/10.1126/science.11539488.
McInerney FA, Wing SL. The Paleocene-Eocene thermal maximum: a perturbation of carbon cycle, climate, and biosphere with implications for the future. Annu Rev Earth Planet Sci. 2011;39(1):489–516. https://doi.org/10.1146/annurev-earth-040610-133431.
Chen S, Niu X, Guan Y, Li H. Genome-wide analysis and expression profiles of the MYB genes in Brachypodium distachyon. Plant Cell Physiol. 2017;58(10):1777–88. https://doi.org/10.1093/pcp/pcx115.
Paterson AH, Bowers JE, Bruggmann R, Dubchak I, Grimwood J, Gundlach H, et al. The Sorghum bicolor genome and the diversification of grasses. Nature. 2009;457(7229):551–6. https://doi.org/10.1038/nature07723.
McCormick RF, Truong SK, Sreedasyam A, Jenkins J, Shu S, Sims D, et al. The Sorghum bicolor reference genome: improved assembly and annotations, a transcriptome atlas, and signatures of genome organization. Plant J. 2018;93(2):338–54. https://doi.org/10.1111/tpj.13781.
Kieffer M, Stern Y, Cook H, Clerici E, Maulbetsch C, Laux T, et al. Analysis of the transcription factor WUSCHEL and its functional homologue in Antirrhinum reveals a potential mechanism for their roles in meristem maintenance. Plant Cell. 2006;18(3):560–73. https://doi.org/10.1105/tpc.105.039107.
Sharma P, Lin T, Grandellis C, Yu M, Hannapel DJ. The BEL1-like family of transcription factors in potato. J Exp Bot. 2014;65(2):709–23. https://doi.org/10.1093/jxb/ert432.
Yanagisawa S, Sheen J. Involvement of maize Dof zinc finger proteins in tissue-specific and light-regulated gene expression. Plant Cell. 1998;10(1):75–89. https://doi.org/10.1105/tpc.10.1.75.
Rabinowicz PD, Braun EL, Wolfe AD, Bowen B, Grotewold E. Maize R2R3 Myb genes: sequence analysis reveals amplification in the higher plants. Genetics. 1999;153(1):427–44. https://doi.org/10.1093/genetics/153.1.427.
Li P, Ponnala L, Gandotra N, Wang L, Si Y, Tausta SL, et al. The developmental dynamics of the maize leaf transcriptome. Nat Genet. 2010;42(12):1060–7. https://doi.org/10.1038/ng.703.
Studer AJ, Schnable JC, Weissmann S, Kolbe AR, McKain MR, Shao Y, et al. The draft genome of the C3 panicoid grass species Dichanthelium oligosanthes. Genome Biol. 2016;17(1):223. https://doi.org/10.1186/s13059-016-1080-3.
Wang L, Czedik-Eysenberg A, Mertz RA, Si Y, Tohge T, Nunes-Nesi A, et al. Comparative analyses of C4 and C3 photosynthesis in developing leaves of maize and rice. Nat Biotechnol. 2014;32(11):1158–65. https://doi.org/10.1038/nbt.3019.
Rott R, Bailey RA. JC Comstock BC and AS. A guide to sugarcane diseases. France: CIRAD; 2000.
Singh A, Chauhan SS, Singh A, Singh SB. Deterioration in sugarcane due to pokkah boeng disease. Sugar Tech. 2006;8(2-3):187–90. https://doi.org/10.1007/BF02943659.
Xu D-L, Park J-W, Mirkov TE, Zhou G-H. Viruses causing mosaic disease in sugarcane and their genetic diversity in southern China. Arch Virol. 2008;153:1031.
Chauhan RP, Rajakaruna P, Verchot J. Complete genome sequence of nine isolates of canna yellow streak virus reveals its relationship to the sugarcane mosaic virus (SCMV) subgroup of potyviruses. Arch Virol. 2015;160(3):837–44. https://doi.org/10.1007/s00705-014-2327-5.
Flagel LE, Wendel JF. Gene duplication and evolutionary novelty in plants. New Phytol. 2009;183(3):557–64. https://doi.org/10.1111/j.1469-8137.2009.02923.x.
Kim C, Wang X, Lee T-H, Jakob K, Lee G-J, Paterson AH. Comparative analysis of Miscanthus and Saccharum reveals a shared whole-genome duplication but different evolutionary fates. Plant Cell. 2014;26(6):2420–9. https://doi.org/10.1105/tpc.114.125583.
Du H, Feng B-R, Yang S-S, Huang Y-B, Tang Y-X. The R2R3-MYB transcription factor gene family in maize. PLoS One. 2012;7(6):e37463. https://doi.org/10.1371/journal.pone.0037463.
Zhong R, Lee C, McCarthy RL, Reeves CK, Jones EG, Ye Z-H. Transcriptional activation of secondary wall biosynthesis by rice and maize NAC and MYB transcription factors. Plant Cell Physiol. 2011;52(10):1856–71. https://doi.org/10.1093/pcp/pcr123.
dos Santos Brito M. Unraveling two ways of lignin biosynthesis in sugarcane culm: an overview of transcriptional regulation. Plant Biology Dept. 2012:P0738 Brazil: Ribeirao Preto, IB-UNICAMP.
Figueiredo R, Portilla Llerena JP, Kiyota E, Ferreira SS, Cardeli BR, de Souza SCR, et al. The sugarcane ShMYB78 transcription factor activates suberin biosynthesis in Nicotiana benthamiana. Plant Mol Biol. 2020;104(4-5):411–27. https://doi.org/10.1007/s11103-020-01048-1.
Byrne ME, Barley R, Curtis M, Arroyo JM, Dunham M, Hudson A, et al. Asymmetric leaves1 mediates leaf patterning and stem cell function in Arabidopsis. Nature. 2000;408(6815):967–71. https://doi.org/10.1038/35050091.
Husbands AY, Chitwood DH, Plavskin Y, Timmermans MCP. Signals and prepatterns: new insights into organ polarity in plants. Genes Dev. 2009;23(17):1986–97. https://doi.org/10.1101/gad.1819909.
Carré IA, Kim J-Y. MYB transcription factors in the Arabidopsis circadian clock. J Exp Bot. 2002;53(374):1551–7. https://doi.org/10.1093/jxb/erf027.
Prabu G, Kawar PG, Pagariya MC, Prasad DT. Identification of water deficit stress Upregulated genes in sugarcane. Plant Mol Biol Report. 2011;29(2):291–304. https://doi.org/10.1007/s11105-010-0230-0.
Souza SC. Characterization of transcription factors differentially expressed under drought conditions in sugarcane (Saccharum spp). Plant Animal Genome. 2015.
Shingote PR, Kawar PG, Pagariya MC, Kuhikar RS, Thorat AS, Babu KH. SoMYB18, a sugarcane MYB transcription factor improves salt and dehydration tolerance in tobacco. Acta Physiol Plant. 2015;37(10):217. https://doi.org/10.1007/s11738-015-1961-1.
Guo J, Ling H, Ma J, Chen Y, Su Y, Lin Q, et al. A sugarcane R2R3-MYB transcription factor gene is alternatively spliced during drought stress. Sci Rep. 2017;7(1):41922. https://doi.org/10.1038/srep41922.
Fávero Peixoto-Junior R, Mara de Andrade L, dos Santos Brito M, Macedo Nobile P, Palma Boer Martins A, Domingues Carlin S, et al. Overexpression of ScMYBAS1 alternative splicing transcripts differentially impacts biomass accumulation and drought tolerance in rice transgenic plants. PLoS One. 2018;13(12):e0207534. https://doi.org/10.1371/journal.pone.0207534.
Raffaele S, Rivas S, Roby D. An essential role for salicylic acid in AtMYB30-mediated control of the hypersensitive cell death program in Arabidopsis. FEBS Lett. 2006;580(14):3498–504. https://doi.org/10.1016/j.febslet.2006.05.027.
Vailleau F, Daniel X, Tronchet M, Montillet J-L, Triantaphylidès C, Roby D. A R2R3-MYB gene, AtMYB30, acts as a positive regulator of the hypersensitive cell death program in plants in response to pathogen attack. Proc Natl Acad Sci U S A. 2002;99(15):10179–84. https://doi.org/10.1073/pnas.152047199.
Liu Z, Luan Y, Li J, Yin Y. Expression of a tomato MYB gene in transgenic tobacco increases resistance to Fusarium oxysporum and Botrytis cinerea. Eur J Plant Pathol. 2016;144(3):607–17. https://doi.org/10.1007/s10658-015-0799-0.
Yang J, Liu D, Wang X, Ji C, Cheng F, Liu B, et al. The genome sequence of allopolyploid Brassica juncea and analysis of differential homoeolog gene expression influencing selection. Nat Genet. 2016;48(10):1225–32. https://doi.org/10.1038/ng.3657.
Finn RD, Coggill P, Eberhardt RY, Eddy SR, Mistry J, Mitchell AL, et al. The Pfam protein families database: towards a more sustainable future. Nucleic Acids Res. 2016;44(D1):D279–85. https://doi.org/10.1093/nar/gkv1344.
Wheeler TJ, Eddy SR. Nhmmer: DNA homology search with profile HMMs. Bioinformatics. 2013;29(19):2487–9. https://doi.org/10.1093/bioinformatics/btt403.
Hedges SB, Dudley J, Kumar S. TimeTree: a public knowledge-base of divergence times among organisms. Bioinformatics. 2006;22(23):2971–2. https://doi.org/10.1093/bioinformatics/btl505.
Guo A-Y, Chen X, Gao G, Zhang H, Zhu Q-H, Liu X-C, et al. PlantTFDB: a comprehensive plant transcription factor database. Nucleic Acids Res. 2008;36(Database issue):D966–9. https://doi.org/10.1093/nar/gkm841.
Pu X, Yang L, Liu L, Dong X, Chen S, Chen Z, et al. Genome-wide analysis of the MYB transcription factor superfamily in Physcomitrella patens. Int J Mol Sci. 2020;21(3). https://doi.org/10.3390/ijms21030975.
Katiyar A, Smita S, Lenka SK, Rajwanshi R, Chinnusamy V, Bansal KC. Genome-wide classification and expression analysis of MYB transcription factor families in rice and Arabidopsis. BMC Genomics. 2012;13(1):544. https://doi.org/10.1186/1471-2164-13-544.
Liu C, Xie T, Chen C, Luan A, Long J, Li C, et al. Genome-wide organization and expression profiling of the R2R3-MYB transcription factor family in pineapple (Ananas comosus). BMC Genomics. 2017;18(1):503.
Matus JT, Aquea F, Arce-Johnson P. Analysis of the grape MYB R2R3 subfamily reveals expanded wine quality-related clades and conserved gene structure organization across Vitis and Arabidopsis genomes. BMC Plant Biol. 2008;8(1):83. https://doi.org/10.1186/1471-2229-8-83.
Chen B, Niu F, Liu W-Z, Yang B, Zhang J, Ma J, et al. Identification, cloning and characterization of R2R3-MYB gene family in canola (Brassica napus L.) identify a novel member modulating ROS accumulation and hypersensitive-like cell death. DNA Res. 2016;23(2):101–14. https://doi.org/10.1093/dnares/dsv040.
Du H, Yang S-S, Liang Z, Feng B-R, Liu L, Huang Y-B, et al. Genome-wide analysis of the MYB transcription factor superfamily in soybean. BMC Plant Biol. 2012;12(1):106. https://doi.org/10.1186/1471-2229-12-106.
ZHENG X, YI D, SHAO L, LI C. In silico genome-wide identification, phylogeny and expression analysis of the R2R3-MYB gene family in Medicago truncatula. J Integr Agric. 2017;16(7):1576–91. https://doi.org/10.1016/S2095-3119(16)61521-6.
Feng S, Xu Y, Yang L, Sun S, Wang D, Chen X. Genome-wide identification and characterization of R2R3-MYB transcription factors in pear. Sci Hortic (Amsterdam). 2015;197:176–82. https://doi.org/10.1016/j.scienta.2015.09.033.
Han Y, Yu J, Zhao T, Cheng T, Wang J, Yang W, et al. Dissecting the Genome-Wide Evolution and Function of R2R3-MYB Transcription Factor Family in Rosa chinensis. Genes. 2019;10:823.
Wilkins O, Nahal H, Foong J, Provart NJ, Campbell MM. Expansion and diversification of the Populus R2R3-MYB Family of Transcription Factors. Plant Physiol. 2009;149:981 LP–993.
Stracke R, Holtgräwe D, Schneider J, Pucker B, Rosleff Sörensen T, Weisshaar B. Genome-wide identification and characterisation of R2R3-MYB genes in sugar beet (Beta vulgaris). BMC Plant Biol. 2014;14(1):249. https://doi.org/10.1186/s12870-014-0249-8.
Sun W, Ma Z, Chen H, Liu M. MYB gene family in potato (Solanum tuberosum L.): genome-wide identification of hormone-responsive reveals their potential functions in growth and development. Int J Mol Sci. 2019;20(19). https://doi.org/10.3390/ijms20194847.
Zhao P, Li Q, Li J, Wang L, Ren Z. Genome-wide identification and characterization of R2R3MYB family in Solanum lycopersicum. Mol Gen Genomics. 2014;289(6):1183–207. https://doi.org/10.1007/s00438-014-0879-4.
Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30(4):772–80. https://doi.org/10.1093/molbev/mst010.
Price MN, Dehal PS, Arkin AP. FastTree 2 – approximately maximum-likelihood trees for large alignments. PLoS One. 2010;5(3):e9490. https://doi.org/10.1371/journal.pone.0009490.
Kumar S, Stecher G, Tamura K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016;33(7):1870–4. https://doi.org/10.1093/molbev/msw054.
Hu B, Jin J, Guo A-Y, Zhang H, Luo J, Gao G. GSDS 2.0: an upgraded gene feature visualization server. Bioinformatics. 2015;31:1296–7.
Thompson JD, Gibson TJ, Higgins DG. Multiple sequence alignment using ClustalW and ClustalX. Curr Protoc Bioinform. 2003;00:2.3.1–2.3.22.
Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, et al. Circos: an information aesthetic for comparative genomics. Genome Res. 2009;19(9):1639–45. https://doi.org/10.1101/gr.092759.109.
Lescot M, Déhais P, Thijs G, Marchal K, Moreau Y, Van de Peer Y, et al. PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res. 2002;30(1):325–7. https://doi.org/10.1093/nar/30.1.325.
Chen Y, Zhang Q, Hu W, Zhang X, Wang L, Hua X, et al. Evolution and expression of the fructokinase gene family in Saccharum. BMC Genomics. 2017;18(1):197. https://doi.org/10.1186/s12864-017-3535-7.
Li Z, Hua X, Zhong W, Yuan Y, Wang Y, Wang Z, et al. Genome-wide identification and expression profile analysis of WRKY family genes in the autopolyploid Saccharum spontaneum. Plant Cell Physiol. 2020;61(3):616–30. https://doi.org/10.1093/pcp/pcz227.
Ling H, Wu Q, Guo J, Xu L, Que Y. Comprehensive selection of reference genes for gene expression normalization in sugarcane by real time quantitative rt-PCR. PLoS One. 2014;9(5):e97469. https://doi.org/10.1371/journal.pone.0097469.
The authors thank all editors and reviewers for their comments on this manuscript.
This work was funded by the National Natural Science Foundation of China (31660420), the Key Project of Science and Technology of Guangxi (AA17202042–7), and the earmarked fund for the Modern Agriculture Technology of China (CARS-170190), and the Innovation Project of Guangxi Graduate Education (YCBZ2020031). The funding body only provided the funds and did not have any role in the study’s design, sample collection, data analysis and interpretation, and manuscript writing.
Ethics approval and consent to participate
The sugarcane materials (Saccharum spontaneum) used in the experiment were supplied by the sugarcane clonal germplasm repository of Guangxi University. These plant materials are widely used worldwide, and no permits were required to collect plant samples. This article did not contain any studies with human participants or animals and did not involve any endangered or protected species.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Members of identified MYB family in S. spontaneum.
The classification of the SsR2R3-MYB family by phylogenetic tree. The 15 subgroups of the SsR2R3-MYB family by phylogenetic classification with rice.
Members of identified MYB family in Sorghum.
Collinear gene pairs in the SsR2R3-MYB gene family.
Tandem duplication genes and segmental duplication genes in the SsR2R3-MYB gene family.
The expression value of SsR2R3-MYB genes in both temporal and spatial models. FPKM values of MYB genes in tissue and developmental stages, leaf developmental gradient, and circadian rhythm.
The expression value of DEGs of SsR2R3-MYB genes in stress response.
SsMYB and SbMYB sequences.
Gene primers of expression quantified by qPCR.
Phylogenetic tree of R2R3-MYB subgroup members from S. spontaneum and Arabidopsis.
Comparison of phylogeny and gene structure of R2R3-MYB gene between S. spontaneum and S. bicolor. (A), (B), (C), (D) continue to supplement Fig. 3a in turn.
Heat map of significant DEGs with tissue specificity. Highly expressed in prophase of stem formation (A) and whole stem development period (B).
Relative expression level quantified by RT-qPCR. High transcripts of MYB gene in prophase of stem formation (A) and whole stem development period (B).
Sample diagram for RT-qPCR.
About this article
Cite this article
Yuan, Y., Yang, X., Feng, M. et al. Genome-wide analysis of R2R3-MYB transcription factors family in the autopolyploid Saccharum spontaneum: an exploration of dominance expression and stress response. BMC Genomics 22, 622 (2021). https://doi.org/10.1186/s12864-021-07689-w
- Expression analysis of stress
- Allelic diversity