Genome-wide identification and expression analysis of expansin gene family in common wheat (Triticum aestivum L.)
BMC Genomics volume 20, Article number: 101 (2019)
Expansin loosens plant cell walls and involves in cell enlargement and various abiotic stresses. Plant expansin superfamily contains four subfamilies: α-expansin (EXPA), β-expansin (EXPB), expansin-like A (EXLA), and expansin-like B (EXLB). In this work, we performed a comprehensive study on the molecular characterization, phylogenetic relationship and expression profiling of common wheat (Triticum aestivum) expansin gene family using the recently released wheat genome database (IWGSC RefSeq v1.1 with a coverage rate of 94%).
Genome-wide analysis identified 241 expansin genes in the wheat genome, which were grouped into three subfamilies (EXPA, EXPB and EXLA) by phylogenetic tree. Molecular structure analysis showed that wheat expansin gene family showed high evolutionary conservation although some differences were present in different subfamilies. Some key amino acid sites that contribute to functional divergence, positive selection, and coevolution were detected. Evolutionary analysis revealed that wheat expansin gene superfamily underwent strong positive selection. The transcriptome map and qRT-PCR analysis found that wheat expansin genes had tissue/organ expression specificity and preference, and generally highly expressed in the roots. The expression levels of some expansin genes were significantly induced by NaCl and polyethylene glycol stresses, which was consistent with the differential distribution of the cis-elements in the promoter region.
Wheat expansin gene family showed high evolutionary conservation and wide range of functional divergence. Different selection constraints may influence the evolution of the three expansin subfamilies. The different expression patterns demonstrated that expansin genes could play important roles in plant growth and abiotic stress responses. This study provides new insights into the structures, evolution and functions of wheat expansin gene family.
Common wheat (Triticum aestivum L.) is one of the three most important grain crops worldwide, accounting for about 35% of the world's staple food. Meanwhile, wheat also serves as an important protein source of human food. Hexaploid wheat contains A, B and D genomes with about 17 Gb in genome size, which was five times larger than that of humans . The growth of plant cells is closely related to cell wall that must not only withstand the internal turgor pressure, but also ensure the extension of the cells during plant growth [2, 3]. Expansins are plant cell-wall loosening proteins that involved in cell enlargement and a variety of other developmental processes as well as various abiotic stresses , and thus, expansins play important roles in crop growth and development as well as ultimate yield formation.
Expansins were first found in the study of acid-induced cell wall elongation in cucumber hypocotyls, and then identified, isolated and purified from the hypocotyls of cucumber seedlings . Subsequently, expansin genes were widely investigated in different plants, including oat coleoptiles , rice , cotton fiber , and soybean . Expansin proteins normally contain 250-275 amino acid residues, which consists of two domains: N-terminal domain I with 120 to 135 amino acid residues and C-terminal domain II with 90 to 120 amino acids. A signal peptide of 20-30 amino acid residues is present at the N-terminus. Domain I, rich in cysteine, is considered as an important catalytic domain. This region shares some homology with the catalytic domain of the β-1,4-D-endo-glucanase from Trichoderma . However, subsequent study showed that β-1,4-D-endo-glucanase cannot induce cell wall elongation although it has obvious catalytic activity, thus the expansin family does not have the activity of hydrolyzing β-glucon sugar . Domain II is speculated to be a polysaccharide binding domain with about 50% similarity to Group-II pollen allergen protein (G2A family) , and contains a series of conserved tryptophans . The G2A family proteins consist of two superimposed β-sheets, similar to immunoglobulin. When pollens are secreted onto the stigma, the pollen allergen relaxes the cell wall, so expansins may have similar function. In addition to the G2A family, no other proteins containing domain II homologues are found .
According to the nomenclature of Kende et al. , plant expansin superfamily is divided into four subfamilies: α-expansin (EXPA), β-expansin (EXPB), expansin-like A (EXLA) and expansin-like B (EXLB). The α-expansin is mainly found in both dicotyledonous plants and monocotyledonous plants of non-Poaceae, while β-expansin is predominantly present in monocotyledonous plants. Although EXLA and EXLB have two typical domains of expansin proteins, few experimental reports showed the cell relaxation activity of these two family members . Studies showed that the α- and β-expansin gene subfamilies already existed before the disorganization of vascular plants and bryophytes, and the recent ancestral era of expansin-like A and expansin-like B subfamily can be traced back to gymnosperms and angiosperms [15, 16].
Stretching and relaxation of plant cell walls are achieved by a process of the slow microfibril creep, in which parallel microfibrils and their assemble sugars are separated from each other . The pressure of cell wall provides energy to overcome the viscous resistance of intertwining between cell wall multimers. In living plants, the pressure of cell wall comes from intracellular turgor pressure. This molecular motility occurs only when the relaxation of the cell wall is caused by expansin or some other factors. Otherwise, the cellulose fibrils on the cell wall are tightly tangled with the stromal polysaccharide in situ .
The expression of plant expansin genes has obvious tissue specificity [7, 19]. Functional studies have shown that expansins are involved in many developmental processes such as plant growth , root hair growth [9, 21], leaf growth , fruit softening and ripening [23, 24], seed size and germination , pollen tube growth [26, 27] and salinity stress response . In wheat, Lin et al. isolated 18 wheat expansin genes and found that the expression of expansin gene is closely related to the growth and development process ; Zhang et al. identified 128 wheat expansin genes using the previous wheat genome database (TGACv1), and found that some of them participated in cold stress response . Carolina et al. found that expansins expression is associated with grain size dynamics in wheat . The over-expression of TaEXPA2 gene in tobacco improved salt stress tolerance , drought stress tolerance  and Cd stress tolerance . The over-expression of TaEXPA2 gene in Arabidopsis plants improved oxidative stress tolerance . The over-expression of TaEXPB23 gene in tobacco improved both oxidative stress tolerance  and salt stress tolerance . Expansins may involve in increasing phosphorus availability by regulating the growth and development of plant roots ; Expansin also plays important roles in dealing with drought stresses in wheat [38,39,40,41]. However, in-depth studies on the structural features, molecular evolution and functional properties of wheat expansin gene family are still needed.
This work aims to carry out a comprehensive study on the molecular characterization, phylogenetic relationship and expression profiling of wheat expansin gene family using the recently released wheat genome database (IWGSC RefSeq v1.1 with a coverage rate of 94%). Our results provide new evidence for further understanding the structure, evolution and function of plant expansin genes.
Genome-wide identification and phylogenetic relationship of wheat expansin genes
Through blast search against the Triticum aestivum genome database from GRAMENE (http://ensembl.gramene.org/), a total of 241 wheat expansin genes were obtained. To obtain more information of the expansin superfamily, genome-wide identification of the expansin genes from Brachypodium distachyon, Sorghum bicolor, Solanum lycopersicum and Gossypium raimondii genome database was performed. Based on the multiple alignments of the full-length sequences of expansins, two softwares MEGA 5.0 and MrBayes 3.2 were used to construct the phylogenetic trees, including neighbor-joining (NJ) phylogenetic tree (Additional file 1: Figure S1) and Bayesian phylogenetic tree (Fig. 1).
Comparative analysis showed that the topological structures of the two phylogenic trees were generally similar, with only minor differences. Bayesian tree constructed had more advantages in accuracy than the other tree-building methods, which was considered as more credible results for subsequent analysis [42, 43]. The seven plant species were used in this study, including four monocotyledonous plants (Triticum aestivum, Oryza sativa, Brachypodium distachyun and Sorghum bicolor) and three dicotyledonous plants (Arabidopsis thaliana, Solanum lycopersicum and Gossypium raimondii). The results from Bayesian tree showed that the four subfamilies (EXPA, EXPB, EXLA and EXLB) of plant expansin genes were classified while wheat expansin genes had no EXLB subfamily (Fig. 1). The size of each subfamily in seven plant species shown in Additional file 2: Table S1 demonstrated that the distribution of expansin genes in the four subfamilies was quite uneven and EXPA was the largest subfamily. EXLA and EXLB were the least-abundant subfamily, only one member of the EXLB family was present in Arabidopsis and Oryza sativa. In addition, the EXPB family members in the monocotyledonous Poaceae were much greater than those in the other three dicotyledonous plants (Fig. 1).
Molecular characterization of wheat expansin genes
All wheat expansin genes identified were named according to the standard principles for consistency , and their information were listed in Additional file 3: Table S2. Among them, TaEXPA1-29, TaEXPB1-24 and TaEXLA1-4 were named previously [4, 34, 44,45,46], so the newly identified expansins were named as TaEXPA30-48, TaEXPB27-49, and TaEXLA5-7. The isoelectric points of the expansin family members ranged from 4.64 to 9.78 with an average of 7.95, showing a weakly alkaline. The average molecular weight was 28427.76 Da, ranging from 16305.76 to 39105.36 Da. The number of amino acid residues varied between 150 and 372, and the signal peptide of expansin family members ranged from 10 to 68 amino acids in length. The number of the predicted DPBB_1 domains was 37–91 amino acids in TaEXPAs, 72–92 in TaEXPBs and 71–80 in TaEXLAs. The number of the predicted Pollen_allerg_1 domains was 32–79 amino acids in TaEXPAs, 56–93 in TaEXPBs and 83–100 in TaEXLAs. In general, the expansin members from same subfamily showed similar properties, but those from different subfamilies had significant differences. These results indicate that plant expansins may adapt to different functional requirements by changing the length of its amino acids and physical/ chemical properties.
Based on Triticum aestivum (IWGSC) genome database, the physical positions of the expansin genes to corresponding chromosomes were shown in Fig. 2. All expansin genes identified could be mapped on the chromosomes from 1A to 7D. Obviously, the distribution of expansin genes on the different chromosomes was uneven. Particularly, the chromosome 3B with 26 expansin members had the highest density, but the chromosomes 7A and 7B contained no more than three expansin genes. Almost all of the expansin protein genes had three copies from chromosomes A, B and D such as TaEXPA1, TaEXPA4, TaEXPA5, TaEXPB1, TaEXPB2, TaEXPB5, TaEXLA1, TaEXLA2 and TaEXLA3.
A sequence alignment of wheat expansin proteins was performed, and a Bayesian phylogenetic tree was constructed (Fig. 3a), the intron-exon distribution of the expansin genes in wheat was analyzed by comparing the CDS sequences and the complete gene sequences. The results showed that the members in the same subfamily had similar structures, and most of them had the same number of exons (Fig. 3b). For example, most members of the EXPB subfamily had four exons, and most members in the EXPA subfamily had three exons and the others generally had two or four exons. These differences may be resulted from the absence or gain of exons during long-term evolutionary processes.
Twenty different motifs were detected in wheat expansins (Fig. 3c), and their structural features were showed in Additional file 4: Figure S2. In the same subfamily, the order, type and number of motifs in expansins were similar, but divergence occurred between different subfamilies. In EXPA subfamily, more than 90% (109/121) of the members had motif 1, 2, 4, 5, 6, 7, 8, 9, 12 and 16 with the same order. This is in marked contrast to the other two subfamilies in which motifs 2, 8, 14 and 17 were absent. The other subfamilies contained motifs 3 and 10, and almost all of the EXPA and EXPB subfamilies had the motifs 1, 4, 5, 6, 7, 9, 12 and 18. These results suggest that EXPA and EXPB subfamilies have a more recent evolution and close phylogenetic relationships. The motif distribution of the entire family members showed that all the subfamilies were highly conservative, but some divergences among different subfamilies still occurred. The high similarity between the sequences and intron-exon structures suggest that wheat expansin genes may have undergone gene duplication events during evolution process.
Functional divergence analysis of wheat expansins
In this study, type-I and type-II functional divergences were estimated based on Bayesian phylogenetic tree constructed from 241 members of wheat expansin gene family, and the results were showed in Table 1. Type-I functional divergence refers to the evolution of amino acid sites in the evolutionary process occurred after the evolution of the rate of change while type-II functional divergence refers to changes in the physical and chemical properties of amino acid sites [47, 48]. The coefficients of Type-I functional divergence between subfamily pairs EXPA/EXPB, EXPA/EXLA and EXPB/EXLA were statistically significant (θ > 0, likelihood ratio test statistic > 4.96; p < 0.01), indicating that significantly different site-specific shifts in evolutionary rate may have taken place at certain amino acid sites between these pairs. Critical amino acid sites were identified in three groups of expansin subfamilies for the analysis of Type-I functional divergence. In order to reduce the occurrence of false positives, Qk > 0.8 is taken as the threshold of important amino acid sites, and the site of Qk < 0.8 is not considered. As shown in Table 1, six type-I functional divergence amino acid sites were detected between EXPA and EXLA subfamilies: 131E, 139Q, 144I, 146P, 161R and 164I, while subfamily pairs EXPA/EXLA and EXPB/EXLA had 45 and 46 sites, respectively (amino acid reference sequence from TaEXPA1-A). These data indicate that a significant change in the rate of evolution may have occurred at these amino acid sites. Besides, compared with EXPA/EXPB, EXPA/EXLA and EXPB/EXLA had relatively larger coefficients of functional divergence (θI) and more sites that were related to functional divergence. Therefore, the functional divergence that exists in EXPA/EXLA and EXPB/EXLA is more significant compared with that present in EXPA/EXPB. No type-II functional divergence sites were identified in this study, because the coefficients of type-II functional divergence between subfamily pairs EXPA/EXPB, EXPA/EXLA and EXPB/EXLA were not statistically significant (θII < 0).
Positive selection and coevolution analysis of wheat expansin superfamily
To detect whether expansin gene family has evolved adaptively, we used the CODEML program in PAML v4.4 package  and selected the site models and branch-site models to perform positive selection analysis of wheat expansin superfamily. In the site models, no positive selection sites were identified (Table 2). In the branching site model, only when EXLA was chosen as the foreground branch, four sites (166G, 169Y, 170F and 182G) that had undergone positive selection were detected, and their posterior probability values were greater than 0.85 (Table 3). Positive selection often acts on a small number of sites in a relatively short evolutionary period, so positive selection signal may be annihilated by a broadly-acting negative selection . In contrast, when EXPA or EXPB was selected as the foreground branch, the ω value was low, and no sites with a posterior probability greater than 0.85 were identified. Positive selection test results showed that three subfamilies were under different selection pressure. EXPA and EXPB subfamilies were more conserved relative to the EXLA subfamily. When EXLA was selected as the foreground branch, EXPA and EXPB were selected as background branches, more positive selection sites were identified. Thus, the selection pressure of EXLA subfamily was the greatest among three subfamilies.
Coevolutionary analysis of amino acid residues does not take into account the evolutionary dependence between amino acids and can well complement the Bayesian method's defects . In this study, we detected five groups co-evolution sites (15C, 16L; 19R, 20Q; 24G, 25G; 46M, 47G; 249A, 250Q) with the CAPS software (Additional file 5: Table S3). Each group of co-evolution sites is not only adjacent, but also related in terms of molecular weight and hydrophobicity (p < 0.01).
Three-dimensional structure prediction and critical amino acid sites identification of wheat expansin proteins
According to our results, four sites (166G, 169Y, 170F and 182G) are critically important for functional divergence, which also underwent strong positive selection. Thus, these sites could play a key role in the evolution of wheat expansin gene family. The 3-D structure of the representative TaEXPA1-A was constructed by Swiss-model using the homology modeling method, and four key amino acid sites were marked on the 3-D structure (Fig. 4). The 166G, 169Y, 170F and 182G were located on the non-return curl of domain II, and all of them were non-polar amino acid (Fig. 4a). Three sites (166G, 169Y, 170F) were located on the surface of the 3-D structure (Fig. 4b-c), implying that these possible catalytic sites can react more easily with the substrate. In addition, four groups of co-evolution sites were located at the N-terminus of the 3D structure while 249A and 250Q located at the C-terminus of the 3D structure, indicating that these sites may play a key role in maintaining the stability of protein structure and function.
Analysis of cis-acting elements in wheat expansins
The cis-acting elements of the promoter region among 241 expansin members were analysed and the results were shown in Additional file 6: Table S4. The results showed that the cis-acting elements present in the promoter region of the expansin genes can be divided into seven categories: light responsive elements, development related elements, hormone responsive elements, environmental stress-related elements, promoter related element, site-binding related elements and other elements. Among them, three types of elements related to photoreaction, phytohormone and environmental stress were particularly abundant.
The photoreaction related cis-acting elements included G-box, Box 4, GT1-motif and Sp1 etc, of which G-box appeared to be the most abundant, on average, each member contained more than three G-box copies. Phytohormone regulation-related elements were also abundantly present in the expansin promoter region, mainly including ABRE (cis-acting element involved in the abscisic acid responsiveness), CGTCA-motifs and TGACG-motifs (cis-acting regulatory element involved in the MeJA-responsiveness), and each member contained more than 2.5 copies. The third type of widely distributed cis-acting elements was associated with external environmental stress response, of which ARE (cis-acting regulatory element essential for the anaerobic induction) and MBS (MYB binding site involved in drought-inducibility) were the most abundant. Other response elements related to environmental stress were also detected, including W-box, GC-motif as well as low temperature induction related LRT and WUN-motif.
Expression profiling of wheat expansin genes in different organs
The publicly available RNA-seq data generated from bread wheat (var. Chinese Spring) was used to study the expression profiling of 241 wheat expansin genes in different organs, in which 3 genes (F775_19619, F775_14978 and TRIUR3_08652) lacked RNA-Seq atlas data because these genes come from Triticum urartu (ASM34745v1) and Aegilops tauschii (ASM34733v1). The expression data of 238 genes showed that wheat expansin genes had a relatively low transcriptional abundance (Additional file 7: Table S5). Some members expressed only in one organ while some others displayed a wide expression profiling. Specifically, 34.5% (82/238) of expansin genes expressed in all five organs (grain, leaf, root, spike and steam) such as TaEXPA1-A, TaEXPA9-B, TaEXPB1-B, TaEXLA5-A and TaEXLA1-B. Interestingly, 14.2% members (34/238) specifically expressed in roots such as TaEXPA13-A, TaEXPA14-B and TaEXPB15-A; 27.3% members (65/238) had root expression preference such as TaEXPA16-A, TaEXPA10-B and TaEXPB9-A. In addition, TaEXPB18-A, TaEXPB37-B, TaEXPB39-B and TaEXPB40-D had the highest expression level in wheat spike while TaEXPB27-A, TaEXPB27-B and TaEXPB27-D expressed in grains higher than other organs. However, 14 members did not express at the different growth stages of five organs. Our results also showed that most homologous copies had similar expression patterns and widely expressed in five organs with an high expression level in roots such as TaEXPB7-A, TaEXPB7-B and TaEXPB7-D.
We further used the expression data of 238 wheat expansin genes to construct a heat map (Additional file 8: Figure S3a). The results showed that expansin genes showed clear differential expression in the different developmental stages of five organs. In general, high expression occurred at various stages of root development, especially in the seedling roots such as TaEXPB17-B, TaEXPB31-A, TaEXPB31-B, TaEXPB28-D, and TaEXPB16-A. In the grains, 18 genes had relatively high expression levels at 2 days post-anthesis (dpa), such as TaEXPB20-D, TaEXPB20-B and TaEXLA2-A. Some of them were also highly expressed at 14 dpa during grain development such as TaEXPB17-A, TaEXLA3-B and TaEXLA4-D. In the leaves, 15 genes showed high expression during leaf development (2 dpa) such as TaEXPB4-A TaEXLA5-A and TaEXPA28-D, while some genes were highly expressed in the leaves of tillering stage such as TaEXPB1-B, TaEXPB1-D and TaEXPB16-D. And in the spike, 16 genes were only highly expressed at the anthesis stage such as TaEXPB5-B, TaEXPB24-D and TaEXPB4-A. Some genes were highly expressed in the spike at both flag leaf and two nodes detectable stages such as TaEXPA7-A, TaEXPA27-B and TaEXPB33-B. In the stem, some of genes had relatively high expression levels at the early stages of stem elongation such as TaEXPA9-A, TaEXPA9-B, TaEXPA9-D, TaEXPB1-B and TaEXPB1-D, while TaEXPB20-D, TaEXPB36-A and TaEXLA2-A were only highly expressed in the stem of anthesis stage.
To further detect the expression level of wheat expansin genes in different organs (seed, leaf and root), we selected 20 representative genes from EXPA and EXPB subfamily members for quantitative real-time polymerase chain reaction (qRT-PCR) analysis (Fig. 5). The results showed that five genes (TaEXPA4-A, TaEXPA5-A, TaEXPA6-A, TaEXPA8-A and TaEXPB8-A) specifically expressed in roots while six genes (TaEXPA7-A, TaEXPB7-A, TaEXPA9-A, TaEXPB9-A, TaEXPB10-A and TaEXPB1-D) expressed in all three organs, but had a high expression level in the roots. Four genes (TaEXPA1-A, TaEXPA1-D, TaEXPA12-A and TaEXPB1-A) showed the highest expression level in leaves, while TaEXPA2-D, TaEXPB1-B, TaEXPB2-A and TaEXPB4-A had a high expression in seeds and low expression in both leaves and roots. In addition, TaEXPA3-A highly expressed in leaves, but had a lower expression level in both seeds and roots. These results are generally consistent with the RNA-seq data described above.
Expression profiling of wheat expansin genes in response to drought and salt stresses
We also used the expression data of 238 wheat expansin genes of seedlings with PEG-simulating drought to construct a heat map (Additional file 8_ Figure S3b). The results showed that 40.3% (96/238) expansin genes were expressed in seedling leaves (9 days after germination) with PEG treatment while 25% (24/96) genes had the highest expression level after 12 h of PEG treatment such as TaEXPA1-D, TaEXPB14-D, TaEXPB32-D, TaEXPB16-A, TaEXPB47-D, TaEXPB38-B, TaEXPA23-D and TaEXPA9-A. However, 41.7% (40/96) expansin genes were downregulated after PEG treatment such as TaEXPA3-A, TaEXPA9-B, TaEXPB1-B, TaEXPB7-B, TaEXPB10-B, TaEXPB2-D and TaEXPA27-B. Particularly, eight genes (TaEXLA3-D, TaEXPA27-D, TaEXPB14-B, TaEXPB20-B, TaEXLA2-D, TaEXPB30-B, TaEXPB35-B and TaEXPB45-A) showed an upregulation in early stages of seedlings exposed to PEG stress and downregulation after 12 h of PEG treatment.
In order to investigate the expression level of expansin genes in roots and leaves of wheat seedlings with PEG-simulating drought and salt stress, we further selected the same 20 expansin genes shown in Fig. 5 for qRT-PCR analysis. The relative water content (RWC) of leaves showed a significant decrease after drought and salt stress treatments (Additional file 9: Figure S4a-b), and plant phenotypes were also significantly affected by stress treatments, including short seedlings, yellow leaf tip and drooping downwards, and fewer root branches. After 72 hours of stress treatment, the growth of wheat seedlings is severely restricted (Additional file 10: Figure S5).
qRT-PCR analysis revealed a distinct expression changes of wheat expansin genes in the roots and leaves under drought and salt stresses. In particular, TaEXPA7-A in both leaves and roots was upregulated at most of time points after PEG and NaCl stresses. In the leaves, PEG treatment induced an upregulated expression of TaEXPB2–A and TaEXPB4-A at both 24h and 72h as well as three genes (TaEXPA3-A, TaEXPB7-A and TaEXPB10-A) at 24h. TaEXPB9-A displayed an upregulated expression at 48h, and TaEXPA9-A was upregulated at all time points under PEG stress (Fig. 6). When subjected to salt stress, the expression of TaEXPA3-A, TaEXPB2-A and TaEXPB4-A increased after 24 hours of the stress treatment, and TaEXPB10-A also showed an upregulated expression at 24 h. Particularly, TaEXPA9-A was upregulated at all time points of salt stress treatment. The remaining genes generally showed a downregulated expression trend in response to salt stress (Fig. 7).
In the roots, wheat expansin genes also showed greater expression differences under PEG and salt stresses. Generally, the significant upregulation of wheat expansin genes induced by PEG treatment occurred at different time points such as TaEXPB10-A, TaEXPA3-A, TaEXPA4-A and TaEXPA8-A at 6-12 h, TaEXPB7-A and TaEXPA6-A at 12 h, 24 h and 48 h, and TaEXPB1-B, TaEXPA5-A and TaEXPB8-A at 72 h, TaEXPA7-A and TaEXPB1-A at 24 h. The other expansin genes were generally downregulated under PEG stress (Fig. 8). When suffered from salt stress, TaEXPB4-A, TaEXPB10-A and TaEXPA6-A showed a significant upregulated expression at all time points of stress treatment. The clear upregulated expression of the other genes occurred at different time points: TaEXPA12-A, TaEXPA7-A and TaEXPA3-A at 6 h, 12 h and 24 h, TaEXP7-A and TaEXPA8-A at 24 h and 72 h, and TaEXPA5-A and TaEXPB2-A at 72 h. The remaining genes generally showed a downregulated expression trend (Fig. 9). These results are generally consistent with the RNA-seq data (Additional file 8: Figure S3b), indicated that wheat expansin genes have distinct responses to abiotic stresses due to their functional differentiation. These results are generally consistent with the cis-acting elements data described above.
Recent studies have showed that about 70-80% of angiosperms have undergone duplication events [52,53,54]. Common wheat (Triticum aestivum L.) is allohexaploid species with three genomes A, B and D, and more than 85% of sequences are repeats . In this study, the number of wheat expansin genes identified by the recently released wheat genome database was much more than previous reports [4, 34, 45], which is four times of rice. We speculate that polyploidy, tandem duplications, segmental duplications and transposition events are the main mechanisms for the increase of wheat expansin gene copy. On the other hand, our results suggest that the expansin genes in seven plant species have four highly conserved subfamilies (Fig. 1). Interestingly, except for the EXLB subfamily, the other three subfamilies are present in all seven plant species, indicating that these expansins have emerged before the differentiation of these plant species. In particular, the number of EXPB subfamily members in the monocotyledonous is much greater than dicotyledonous plants, consistent with the previous report . In wheat, expansin genes in the same subfamily share similar exon-intron organization and motif composition, but greater structural differences between subfamily are present (Fig. 3), suggesting that expansin genes within subfamily are highly conservative.
Gene duplications are one of the primary driving forces in the evolution of genomes and genetic systems . Divergence of repeated intergenic functions is caused by the accumulation of frequent mutations in amino acid sites [58,59,60]. By analyzing the function divergence of wheat expansin gene family, we detected 46 sites that play a key role in the type-I functional divergence among subfamily members (Table 1), no type-II functional divergence sites were identified, indicating that a significant change in the rate of evolution may occur at these amino acid sites. Functional divergence and expression differences between repeat genes promote the retention of these genes after a complete genome-wide repeat event [61,62,63]. At the molecular level, negative selection removes amino acid mutations that impair adaptability, while positive selection retains those amino acid mutations that increase adaptability . The positive selection analysis of wheat expansin gene family detected four sites (166G, 169Y, 170F and 182G) that underwent strong positive selection and led to functional divergence. According to the 3-D structure of the representative TaEXPA1-A, all of these key sites were located on the domain II (Fig. 4), indicating that domain I is more conservative than domain II. In addition, the domain II is the putative polysaccharide binding domain, therefore, these sites may not only exert an important function, but also play a key role in the evolution of wheat expansin gene family.
Protein evolution depends on intramolecular coevolutionary networks, the complexity of which is proportional to the underlying functional and structural interactions among sites . Testing for coevolution between sites is thus an essential step to complement molecular selection analysis and to provide more biologically realistic results. Coevolution analysis of expansin proteins detected five groups of amino acid sites. Interestingly, the amino acid positions of all these groups are located at the N-terminal (4/5) and C-terminal (1/5) signal peptide regions based on the 3-D structure of the representative TaEXPA1-A (Fig. 4). Thus, these sites may play a very important role in the function of the signal peptide to drive the protein into the subcellular organelles of different membrane structures within the cell. These sites may play a key role in maintaining the stability of protein structure and function.
The expression of plant expansin genes is mainly regulated by various hormones, including ABA, auxin, cytokinin, gibberellin, and ethylene. For example, ABA signal transduction may be involved in NaCl-induced accumulation of TaEXPA2-A protein in wheat [29, 31]; ethylene affects the expression of LeEXP1 gene . In this study, we analyzed the cis-acting elements of the promoter region of 241 expansin members, and detected a large number of phytohormone regulation-related elements such as ABRE, CGTCA-motifs and TGACG-motifs, each member contains more than 2.5 copies of these elements. The expression of plant expansin genes is also regulated by the development process, and has tissue/organ expression specificity and preference. For example, GmEXP1 in soybeans showed a big expression difference in different development stages of roots . RNA-seq data and qRT-PCR analysis of TaEXPA4-A, TaEXPA5-A, TaEXPA6-A, TaEXPA8-A and TaEXPB8-A were specifically expressed in roots whereas TaEXPA1-A, TaEXPA1-D, TaEXPA12-A and TaEXPB1-A had clear expression preference in leaves (Fig. 5). This suggests that expansin genes play important roles in the growth and development of different wheat organs.
Expansin genes are also regulated by various environmental factors. For example, the upregulated expression of wheat expansin genes can be induced by oxidative stress , salt stress  and drought stress . Water status of plants dominates their response to drought and salt stress . When subjected to drought and salt stresses, plants need to maintain higher relative water content and lower water potential in tissues. It may be a common mechanism for adaptability of plants to drought and salt stresses [68, 69]. On the one hand, stress-induced expression of expansin genes promotes the accumulation of expansin and plant root elongation. The growth of plant roots enhances the ability to absorb water in the stress environments. Elevated levels of NaCl in the environment affect the ion selectivity of the cell membrane, eventually affect the homeostasis of Na+/K+ in plant tissues . Our results showed that many wheat expansin genes were upregulated under high salt stress, these genes may play an important role in maintaining the balance of Na+ and K+ inside cells absorption. The high expression of expansin gene increased the content of soluble sugar and proline in plant cells, thereby reduced the internal water potential of the cells, made the cells easier to absorb water and reduced their damage under stress conditions . Meanwhile, in order to protect itself from damage under stress conditions, plants may eliminate reactive oxygen species (ROS) through some antioxidant enzymes [71,72,73]. The expression of wheat expansin genes may promote the upregulation of certain antioxidant-related genes, and reduced ROS accumulation . In this study, we also detected a large number of environmental stress-related elements in wheat expansin genes such as ARE and MBS, and each member contains more than 1.2 copies of these elements, indicated that the expression of expansin genes was affected by external environmental stress such as hypoxia and drought. Other response elements related to external environmental stress such as W-box and GC-motif, the action element related to low temperature induction LRT and WUN-motif were also found in expansin genes. These stress-related elements could confer expansins a potential function in response to abiotic stresses.
This study identified 241 expansin genes from recently released wheat genome database, which were classified into EXPA, EXPB and EXLA subfamilies. Molecular characterization showed that wheat expansin gene family showed high evolutionary conservation. The branch site model analysis revealed that there was weak selection pressure between subfamilies of expansins. Functional divergence analysis showed that type I divergence was the main cause of the function changes of wheat expansins. Five groups of coevolutionary sites identified in this study may play an important role in maintaining the stability of expansin structure and function. Analysis of wheat expansin cis-acting elements showed that the expression of plant expansin genes is regulated by various hormones and various environmental factors. RNA-seq data and qRT-PCR analysis revealed that wheat expansin genes were differentially expressed in different tissues and organs, and had tissue/organ specificity and expression preference, suggesting that wheat expansins had clear function differentiation. The expression profiling under PEG and salt stresses showed that some wheat expansin genes were significantly upregulated, indicating their important roles in response to drought and salt stresses. Our findings provide new insights into the structure, evolution and function of plant expansin gene family.
Genome-wide identification of expansin genes
Genome-wide identification of the expansin gene family from seven species of monocotyledonous and dicotyledonous plants was performed. Firstly, 35 and 58 expansin sequences from Arabidopsis thaliana and Oryza sativa were respectively obtained from EXPANSIN CENTRAL (http://www.personal.psu.edu/fsl/ExpCentral/). The sequences of the 128 previously identified TaEXP genes  were obtained from TGACv1 database (http://www.gramene.org/), and all of these sequences were used to identify new wheat expansin genes using a blast search against the recently released Triticum aestivum genome database (IWGSC RefSeq v1.1) with a coverage rate of 94% from GRAMENE (http://ensembl.gramene.org/). The sequences from Triticum urartu (ASM34745v1) and Aegilops tauschii (ASM34733v1) were used as a second set of supplementary data . We also used these sequences for BLAST searches in the following species: monocotyledonous plants Brachypodium distachyon and Sorghum bicolour, and dicotyledonous plants Solanum lycopersicum and Gossypium raimondii genome database in Phytozome v12.1 (https://phytozome.jgi.doe.gov/pz/portal.html). The online tools Pfam (http://pfam.xfam.org/) and SMART (http://smart.embl-heidelberg.de/) were used to screen expansin proteins. Those with only one of two domains or without a complete open reading frame were removed.
Exon-intron structure, conserved motif, chemical character, genic physical location on chromosomes and cis-acting elements analysis
Genomic sequences, coding sequences, protein sequences and promoter sequences of all wheat expansins identified were downloaded from the Phytozome (https://phytozome.jgi.doe.gov/pz/portal.html) and GRAMENE. All gene IDs were unified into IWGSC gene ID. The exon-intron organization of wheat expansin genes was detected by comparing the coding sequences (CDS) with their corresponding genomic sequences by the online tool GSDS (http://gsds.cbi.pku.edu.cn/) . Conserved motifs other than the canonical domain of expansins were detected by the online tool MEME (http://meme-suite.org/tools/meme). The repeat number was set to 0 or 1, the maximum number of motifs to 20, and the rest of the run parameters to system default [56, 76]. In addition, the signal peptide lengths of wheat expansin-encoding genes were predicted by SignalP 4.1 server (http://www.cbs.dtu.dk/services/SignalP/). The theoretical values of the isoelectric point, relative molecular mass and the grand average of hydrophobicity (GRAVY) of wheat expansins were predicted by the Protparam tool (http://web.expasy.org/ protparam/) [77,78,79]. Map Inspect software (http://mapinspect.software.informer.com/) was used to analyze the genic physical location on chromosomes. Cis-acting elements analysis in the 1500 bp upstream regions was performed with PlantCARE (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/).
Phylogenetic tree construction
Multiple sequence alignments of the identified expansin full-length protein sequences were performed using the MUSCLE software [80, 81]. Based on the alignment files, two softwares MEGA5.0  and MrBayes3.2  were used to construct the phylogenetic tree. When using MEGA5.0, the bootstrap of the phylogenetic tree was set to 1000, and the rest of the parameters were default. Neighbor-joining (NJ) phylogenetic tree was constructed by using adjacency method .
Functional divergence, positive selection and coevolution analysis
The software DIVERGE 3.0 can detect whether there is a significant change in evolution rate at a particular site using maximum likelihood (Type-I functional divergence) or a significant change in the physicochemical properties of the amino acid on the homologous sequence (Type-II functional divergence) . Both Type-I and Type-II functional divergence coefficients (θI, θII) between subfamilies were calculated by DIVERGE3.0. If these coefficients are significantly greater than 0, it implies that the selection constraints or the physicochemical properties of the particular amino acid residues were changed significantly between the two subfamilies after gene duplication or differentiation. In addition, the posterior probability (Qk) predicts the reliability of functional divergence amino acid sites. The higher the Qk value, the higher the probability of functional divergence of type I or type II between two subfamilies. In this study, the critical value of Qk is set to 0.8 .
The positive selection analysis for wheat expansin genes was performed using the site models and the branch site models in the Codeml program in PAML4.4 software package . The nucleotide sequence and corresponding protein sequence alignment file were submitted to PAL2NAL , the Bayesian tree and multiple sequence alignment files were submitted to the PAML software to calculate the ratio of dN (non-synonymous substitution rate) to dS (synonymous substitution rate) at each site. The magnitude of the value of dN/dS (ω) represents the types of selection: ω < 1 for negative selection, ω = 1 for neutral selection and ω > 1 for positive selection . In the site models, the likelihood ratio test (LRT) was used to test positive selection by comparing the two pairs of models (M0/M3 and M7/M8). M0 is called one-ratio model, meaning that all expansin codons evolve at a single evolutionary rate. M3 is a discrete model that allows each site to evolve at a different rate and calculates the probability of each site having a purification selection (p0), a neutral selection (p1), and a positive selection (p2). The corresponding ω values (ω1, ω2, ω3) can also be derived from the data. The Beta model (M7) is a null test for positive selection, assuming a Beta distribution with ω between 0 and 1. Finally, the Beta & ω model (M8) add one extra class with the same ratio ω1 . The test made by the comparison between M7 and M8 is the strictest of positive selections. The branching site models used each branch of the phylogenetic tree as a foreground branch, while the other branches serve as a background branch, and the sites in the sequence are classified into four categories. The first type of sites are highly conserved in each branch with smaller values of ω (0 < ω0 < 1); the second type of locus is subject to neutral selection or weaker positive selection (ω1=1 or slightly less than 1). Categories III and IV include background branches that are more conservative or neutral-selected and those under positive selection (ω2 > 1). Before the likelihood ratio test (LRT), we should also compare whether there is a significant difference between Mf model (free ratio model) based on LRT test and M0 model, if there is, abandon M0 model, Mf model is established, indicating that different branches do different rate of evolution. The BEB method was used to calculate posterior probabilities (Qks) of sites with ω2 > 1, sites with a posterior probability of > 0.8 are most likely sites of positive selection [47, 90].
To identify coevolution between amino acid sites, a Coevolution Analysis using Protein Sequences (CAPS) was performed with PERL-based software . CAPS provides a mathematically simple and computationally feasible means of comparing the correlated variance of evolutionary rates at two amino acid sites corrected by time since divergence of the protein sequences to which they belong. Blosumcorrected amino acid distance was used to identify amino acid covariation. The phylogenetic sequence relationships were used to remove phylogenetic and stochastic dependencies between sites.
Three-dimensional structure prediction
The structure of wheat expansin proteins was modeled by searching SWISS-MODEL database (http://swissmodel.expasy.org/) using the amino acid sequence , and Pymol software was applied to visualize the three-dimensional structure of expansin proteins.
Plant materials and stress treatments
Common wheat (Triticum aestivum L., AABBDD, 2n=6x=42) Chinese Spring (CS) was used as material in this study. Seeds with full size were selected and sterilized with 70% alcohol and 10% sodium hypochlorite. Then sterilized seeds were put on the wet sterile filter paper in sterilized Petri dishes and shaded for 48 h under natural conditions. Under the condition of 16/8 h light, 25/20°C temperature and 70% relative humidity, the seedlings (15 days after germination) were cultured to two leaves and then respectively transferred into a nutrient solution containing 20% polyethylene glycol (PEG6000) and 200 mM sodium chloride (NaCl) for stress treatment. Meanwhile, seedlings with normal growth condition were used as control. Roots and leaves from control and treatment were collected at 0, 6, 12, 24, 48 and 72 h, quickly frozen with liquid nitrogen and stored in a -80°C for RNA extraction. In addition, CS seeds were planted in the greenhouse, and the samples from five days after pollination (DAP) were collected for RNA extraction. The relative water content (RWC) of seedling leaves and roots was measured according to Lv et al. .
RNA-seq expression analysis
The publicly available RNA-seq data generated from bread wheat var. Chinese Spring was used to study the expression of newly identified wheat expansin genes. These data were collected from five different wheat organs (grain, leaf, root, spike and steam) during developing seedling, vegetative and reproductive stages. And we also collected dates of seedlings with PEG-simulating drought, RNA-seq data of wheat expansin genes were obtained from expVIP (http://www.wheat-expression.com/) . Cluster analysis of the RNA-seq data was performed by employing the Euclidean distance method over a complete linkage dissimilarity matrix using the Cluster 3.0 and TreeView.
Total mRNA extraction and qRT-PCR
Total RNA from the roots, leaves and seeds at each time points was isolated using TRIzol Reagent (Invitrogen). Purification of total RNA and cDNA synthesis according to the manufacturer’s instructions with PrimeScript® RT Reagent Kit with gDNA Eraser (TaKaRa, Shiga, Japan). The specific primer sequences for expansin genes were designed using Primer 5.0, shown in Additional file 11: Table S6. Cyclophilin (ADP) was used as the reference gene. The sample mixture for qRT-PCR was performed according to the procedures of the previous report , and three biological replicates were used for each sample. All data were analyzed with CFX Manager Software (Bio-Rad). The optimal performance was conducted, in which the correlation coefficient (R2) of 0.994-0.999 and PCR amplification efficiency (E) of 90–110% were controlled. Ct values were averaged. The fold change in the target gene relative to the Triticum aestivum L. constitutively expressed expansin gene is determined by: Fold Change =2-Δ (ΔCt) where ΔCt = Ct target - CtSamDC and Δ (ΔCt) = ΔCttreated - ΔCtcontrol, according to the Minimum Information for Publication of Quantitative Real-Time PCR Experiments (MIQE) guidelines.
Coevolution Analysis using Protein Sequences
Days after pollination
Non-synonymous substitution rate
Synonymous substitution rate
Group-II pollen allergen protein
the grand average of hydrophobicity
the likelihood ratio test
Minimum Information for Publication of Quantitative Real-Time PCR Experiments
quantitative real-time polymerase chain reaction
Reactive oxygen species
Relative water content
Paux E, Legeai F, Guilhot N, Adam-Blondon AF, Alaux M, Salse J, Sourdille P, Leroy P, Feuillet C. Physical mapping in large genomes: accelerating anchoring of BAC contigs to genetic maps through in silico analysis. Funct Integr Genomics. 2008;8:29–32.
Braidwood L, Breuer C, Sugimoto K. My body is a cage: mechanisms and modulation of plant cell growth. New Phytol. 2013;201(2):388–402.
Wolf S, Hématy K, Höfte H. Growth control and cell wall signaling in plants. Annu Rev Plant Biol. 2012;63(1):381–407.
Zhang JF, Xu YQ, Dong JM, Peng LN, Feng X, Wang X, Li F, Miao Y, Yao SK, Zhao QQ, Feng SS, Hu BZ, Li FL. Genome-wide identification of wheat (Triticum aestivum) expansins and expansin expression analysis in cold-tolerant and cold-sensitive wheat cultivars. PLoS One. 2018;13:e0195138.
Mcqueen-Mason S, Cosgrove DJ. Two endogenous proteins that induce cell wall extension in plants. Plant Cell. 1992;4:1425–33.
Li ZC, Cosgrove DJ. An oat coleoptile wall protein that induces wall extension in vitro and that is antigenically related to a similar protein from cucumber hypocotyls. Planta. 1993;191:349–56.
Cho HT, Kende H. Tissue localization of expansins in deepwater rice. Plant J. 2010;15:805–12.
Shimizu Y, Aotsuka S, Hasegawa O, Kawada T, Sakuno T, Sakai F, Hayashi T. Changes in levels of mRNAs for cell wall-related enzymes in growing cotton fiber Cells. Plant Cell Physiol. 1997;38:375–8.
Lee DK, Ahn JH, Song SK, Choi YD, Lee JS. Expression of an expansin gene is correlated with root elongation in soybean. Plant Physiol. 2003;131:985.
Javier S, Cosgrove DJ. The expansin superfamily. Genome Biol. 2005;6:242–2.
Cosgrove DJ. Enzymes and other agents that enhance cell wall extensibility. Annu Rev Plant Physiol Plant Mol Biol. 1999;50:391–417.
Cosgrove DJ. Relaxation in a high-stress environment: the molecular bases of extensible cell walls and cell enlargement. Plant Cell. 1997;9(7):1031.
Din N, Forsythe IJ, Burtnick LD, Gilkes NR, Miller RC Jr, Warren RA, Kilburn DG. The cellulose-binding domain of endoglucanase A (CenA) from cellulomonas fimi: evidence for the involvement of tryptophan residues in binding. Mol Microbiol. 2010;11:747–55.
Kende H, Bradford K, Brummell D, Cho HT, Cosgrove D, Fleming A, Gehring C, Lee Y, McQueen-Mason S, Rose J, Voesenek LA. Nomenclature for members of the expansin superfamily of genes and proteins. Plant Mol Biol. 2004;55:311–4.
Li Y, Darley CP, Ongaro V, Fleming A, Schipper O, Baldauf SL, McQueen-Mason SJ. Plant expansins are a complex multigene family with an ancient evolutionary origin. Plant Physiol. 2002;128:854–64.
Schipper O, Schaefer D, Reski R, Fleming A. Expansins in the bryophyte Physcomitrella patens. Plant Mol Biol. 2002;50:789–802.
Marga F, Grandbois M, Cosgrove DJ, Baskin TI. Cell wall extension results in the coordinate separation of parallel microfibrils: evidence from scanning electron microscopy and atomic force microscopy. Plant J. 2005;43:181–90.
Cosgrove DJ. Growth of the plant cell wall. Nat Rev Mol Cell Bio. 2005;6:850–61.
Cosgrove DJ. New genes and new biological roles for expansins. Curr Opin Plant Biol. 2000;3:73–8.
Choi D, Lee Y, Cho HT, Kende H. Regulation of expansin gene expression affects growth and development in transgenic rice plants. Plant Cell. 2003;15:1386–98.
Cho HT, Cosgrove DJ. Regulation of root hair Initiation and expansin gene expression in Arabidopsis. Plant Cell. 2002;14:3237–53.
Pien S, Wyrzykowska J, Mcqueenmason S, Smart C, Fleming A. Local expression of expansin induces the entire process of leaf development and modifies leaf shape. Proc Natl Acad Sci USA. 2001;98:11812–7.
Brummell DA, Harpster MH, Civello PM, Palys JM, Bennett AB, Dunsmuir P. Modification of expansin protein abundance in tomato fruit alters softening and cell wall polymer metabolism during ripening. Plant Cell. 1999;11:2203–16.
Civello PM, Powell AL, Sabehat A, Bennett AB. An expansin gene expressed in ripening strawberry fruit. Plant Physiol. 1999;121:1273–80.
Yan A, Wu M, Yan L, Hu R, Ali I, Gan Y. AtEXP2 is involved in seed germination and abiotic stress response in Arabidopsis. PLoS One. 2014;9:e85208.
Cosgrove DJ, Bedinger P, Durachko DM. Group I allergens of grass pollen as cell wall-loosening agents. Proc Natl Acad Sci USA. 1997;94:6559–64.
Pezzotti M, Feron R, Mariani C. Pollination modulates expression of the PPAL gene, a pistil-specific β-expansin. Plant Mol Biol. 2002;49:187–97.
Kaashyap M, Ford R, Kudapa H, Jain M, Edwards D, Varshney R, Mantri N. Differential regulation of genes involved in root morphogenesis and cell wall modification is associated with salinity tolerance in chickpea. Sci Rep. 2018;8:4855.
Lin Z, Ni Z, Zhang Y, Yao Y, Wu H, Sun Q. Isolation and characterization of 18 genes encoding alpha- and beta-expansins in wheat (Triticum aestivum L.). Mol Genet Genomic. 2005;274:548–56.
Carolina LX, Ricardo R, Gomez LD, Herrera J, Isla A, Mcqueenmason SJ, Calderini DF. Expansins expression is associated with grain size dynamics in wheat (Triticum aestivum L.). J Exp Bot. 2010;61:1147–57.
Chen Y, Han Y, Kong X, Ren Y, Wang W. Ectopic expression of wheat expansin gene TaEXPA2 improved the salt tolerance of transgenic tobacco by regulating Na+/K+ and antioxidant competence. Physiol Plant. 2017;159:161–77.
Chen Y, Han Y, Meng Z, Zhou S, Kong X, Wang W. Overexpression of the wheat expansin gene TaEXPA2 improved seed production and drought tolerance in transgenic tobacco plants. PLoS One. 2016;11:e0153494.
Ren Y, Chen Y, An J, Zhao Z, Zhang G, Wang Y, Wang W. Wheat expansin gene TaEXPA2, is involved in conferring plant tolerance to Cd toxicity. Plant Sci. 2018;270:245.
Chen Y, Ren Y, Zhang G, An J, Yang J, Wang Y, Wang W. Overexpression of the wheat expansin gene TaEXPA2 improves oxidative stress tolerance in transgenic Arabidopsis plants. Plant Physiol Biochem. 2018;124:190–8.
Han Y, Chen Y, Yin S, Zhang M, Wang W. Over-expression of TaEXPB23, a wheat expansin gene, improves oxidative stress tolerance in transgenic tobacco plants. J Plant Physiol. 2015;173:62.
Han YY, Li AX, Li F, Zhao MR, Wang W. Characterization of a wheat (Triticum aestivum L.) expansin gene, TaEXPB23, involved in the abiotic stress response and phytohormone regulation. Plant Physiol Biochem. 2012;54:49.
Han YY, Zhou S, Chen YH, Kong X, Xu Y, Wang W. The involvement of expansins in responses to phosphorus availability in wheat, and its potentials in improving phosphorus efficiency of plants. Plant Physiol Biochem. 2014;78(3):53.
Zhao MR, Li F, Fang Y, Gao Q, Wang W. Expansin-regulated cell elongation is involved in the drought tolerance in wheat. Protoplasma. 2011;248(2):313–23.
Zhao MR, Han YY, Feng YN, Li F, Wang W. Expansins are involved in cell growth mediated by abscisic acid and indole-3-acetic acid under drought stress in wheat. Plant Cell Rep. 2012;31(4):671–85.
Zhou S, Han YY, Chen Y, Kong X, Wang W. The involvement of expansins in response to water stress during leaf development in wheat. J Plant Physiol. 2015;183:64.
Li F, Xing S, Guo Q, Zhao M, Zhang J, Gao Q, Wang G, Wang W. Drought tolerance through over-expression of the expansin gene TaEXPB23 in transgenic tobacco. J Plant Physiol. 2011;168(9):0–966.
Hall BG. Comparison of the accuracies of several phylogenetic methods using protein and DNA sequences. Mol Biol Evol. 2005;22(3):792.
Yang ZH, Rannala B. Bayesian phylogenetic inference using DNA sequences: a Markov Chain Monte Carlo Method. Mol Biol Evol. 1997;14(7):717–24.
Breen J, Li D, Dunn DS, Békés F, Kong X, Zhang J, Jia J, Wicker T, Mago R, Ma W, Bellgard M, Appels R. Wheat beta-expansin (EXPB11) genes: Identification of the expressed gene on chromosome 3BS carrying a pollen allergen domain. BMC Plant Biol. 2010;10:1–11.
Gao Q, Zhao M, Li F, Guo Q, Xing S, Wang W. Expansins and coleoptile elongation in wheat. Protoplasma. 2008;233:73.
Liu Y, Liu D, Zhang H, Gao H, Guo X, Wang D, Zhang X, Zhang A. The alpha- and beta-expansin and xyloglucan endotransglucosylase/hydrolase gene families of wheat: molecular cloning, gene expression, and EST data mining. Genomics. 2007;90:516–29.
Liu Q, Zhu H. Molecular evolution of the MLO gene family in Oryza sativa and their functional divergence. Gene. 2008;409:1–10.
Wang M, Wang Q, Zhao H, Zhang X, Pan Y. Evolutionary selection pressure of forkhead domain and functional divergence. Gene. 2009;432:19–25.
Zhang J, Nielsen R, Yang Z. Evaluation of an improved branch-site likelihood method for detecting positive selection at the molecular level. Mol Biol Evol. 2005;22:2472–9.
Fares MA, Mcnally D. CAPS: coevolution analysis using protein sequences. Bioinform. 2006;22:2821.
Otto SP, Whitton J. Polyploid incidence and evolution. Annu Rev Genet. 2000;34:401–37.
Blanc G, Hokamp K, Wolfe KH. A recent polyploidy superimposed on older large-scale duplications in the Arabidopsis genome. Genome Res. 2003;13:137.
Bowers JE, Chapman BA, Rong J, Paterson AH. Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events. Nature. 2003;422:433–8.
Paterson AH, Bowers JE, Chapman BA. Ancient polyploidization predating divergence of the cereals, and its consequences for comparative genomics. Proc Natl Acad Sci U S A. 2004;101:9903–8.
Ling HQ, Zhao S, Liu D, Wang J, Sun H, Zhang C, Fan H, Li D, Dong L, Tao Y, Gao C, Wu H, Li Y, Cui Y, Guo X, Zheng S, Wang B, Yu K, Liang Q, Yang W, Lou X, Chen J, Feng M, Jian J, Zhang X, Luo G, Jiang Y, Liu J, Wang Z, Sha Y, Zhang B, Wu H, Tang D, Shen Q, Xue P, Zou S, Wang X, Liu X, Wang F, Yang Y, An X, Dong Z, Zhang K, Zhang X, Luo MC, Dvorak J, Tong Y, Wang J, Yang H, Li Z, Wang D, Zhang A, Wang J. Draft genome of the wheat A-genome progenitor Triticum urartu. Science Foundation in China. 2013;496:87–90.
Zhu Y, Wu N, Song W, Yin G, Qin Y, Yan Y, Hu Y. Soybean ( Glycine max ) expansin gene superfamily origins: segmental and tandem duplication events followed by divergent selection among subfamilies. BMC Plant Biol. 2014;14:1–19.
Moore RC, Purugganan MD. The early stages of duplicate gene evolution. Proc Natl Acad Sci U S A. 2003;100:15682–7.
Blanc G, Wolfe KH. Widespread paleopolyploidy in model plant species inferred from age distributions of duplicate genes. Plant Cell. 2004;16:1667–78.
Gu X. A simple statistical method for estimating type-II (cluster-specific) functional divergence of protein sequences. Mol Biol Evol. 2006;23:1937–45.
Ha M, Kim ED, Chen ZJ. Duplicate genes increase expression diversity in closely related species and allopolyploids. Proc Natl Acad Sci U S A. 2009;106:2295–300.
Lynch M, Force A. The probability of duplicate gene preservation by subfunctionalization. Genetics. 2000;154:459.
He X, Zhang J. Gene complexity and gene duplicability. Curr Biol. 2005;15:1016–21.
Schnable JC, Freeling M. Differentiation of the maize subgenomes by genome dominance and both ancient and ongoing gene loss. Proc Natl Acad Sci U S A. 2011;108:4069–74.
Fetterman CD, Rannala B, Walter MA. Identification and analysis of evolutionary selection pressures acting at the molecular level in five forkhead subfamilies. BMC Evol Biol. 2008;8:261.
Fares MA, Travers SA. A novel method for detecting intramolecular coevolution: adding a further dimension to selective constraints analyses. Genetics. 2006;173:9.
Feng C, Bradford KJ. Expression of an expansin is associated with endosperm weakening during tomato seed germination. Plant Physiol. 2000;124:1265–74.
Yeo AR, Caporn SJM, Flowers TJ. The effect of salinity upon photosynthesis in rice (Oryza sativa L.): Gas exchange by individual leaves in relation to their salt content. J exp bot. 1985;36:1240–8.
Greenway H, Munns R. Mechanisms of salt tolerance in nonhalophytes. Ann rev plant Physiol. 2003;31:149–90.
Watanabe S, Kojima KY, Sasaki S. Effects of saline and osmotic stress on proline and sugar accumulation in Populus euphratica in vitro. Plant Cell Tissue Organ Cult. 2000;63:199–206.
Gadallah MAA. Effects of proline and glycinebetaine on Vicia faba responses to salt stress. Biologia Plantarum. 1999;42:249–57.
Dionisio-Sese ML, Tobita S. Antioxidant responses of rice seedlings to salinity stress. Plant Sci. 1998;135:1–9.
Moradi F, Ismail AM. Responses of photosynthesis, chlorophyll fluorescence and ROS-scavenging systems to salt stress during seedling and reproductive stages in rice. Ann Bot. 2007;99:1161–73.
Montezano AC, Touyz RM. Reactive oxygen species, vascular Noxs, and hypertension: focus on translational and clinical research. Antioxid Redox Signal. 2014;20:164–82.
Santo SD, Vannozzi A, Tornielli GB, Fasoli M, Venturini L, Pezzotti M, Zenoni S. Genome-wide analysis of the expansin gene superfamily reveals grapevine-specific structural and functional characteristics. PLoS One. 2013;8:e62206.
Hu B, Jin J, Guo AY, Zhang H, Luo J, Gao G. GSDS 2.0: an upgraded gene feature visualization server. Bioinformatics. 2014;31:1296.
Bailey TL, Elkan C. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. 1994;2:28–36.
Bjellqvist B, Hughes GJ, Pasquali C, Paquet N, Ravier F, Sanchez JC, Hochstrasser D. The focusing positions of polypeptides in immobilized pH gradients can be predicted from their amino acid sequences. Electrophoresis. 2010;14:1023–31.
Bjellqvist B, Basse B, Olsen E, Celis JE. Reference points for comparisons of two-dimensional maps of proteins from different human cell types defined in a pH scale where isoelectric points correlate with polypeptide compositions. Electrophoresis. 1994;15:529–39.
Gasteiger E, Hoogland C, Gattiker A, Duvaud S, Wilkins MR, Appel RD, Hochstrasser DF. Protein identification and analysis tools on the expasy server. Methods Mol Biol. 1999;112:531.
Edgar RC. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics. 2004;5:113.
Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–7.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011;28:2731–9.
Ronquist F, Teslenko M, Van Der Mark P, Ayres DL, Darling A, Höhna S, Huelsenbeck JP. MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst Biol. 2012;61:539–42.
Saitou N, Nei M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987;4:406.
Gaucher EA, Gu X, Miyamoto MM, Benner SA. Predicting functional divergence in protein evolution by site-specific rate shifts. Trends Biochem Sci. 2002;27:315–21.
Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007;24:1586–91.
Suyama M, Torrents D, Bork P. PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res. 2006;34:W609.
Yang Z, Wong WSW, Nielsen R. Bayes empirical bayes inference of amino acid sites under positive selection. Mol Biol Evol. 2005;22:1107–18.
Anisimova M, Bielawski JP, Yang Z. Accuracy and power of the likelihood ratio test in detecting adaptive molecular evolution. Mol Biol Evol. 2001;18:1585–92.
Liu Q, Wang H, Zhang Z, Wu J, Feng Y, Zhu Z. Divergence in function and expression of the NOD26-like intrinsic proteins in plants. BMC Genomics. 2009;10:313.
Song W, Qin Y, Zhu Y, Yin G, Wu N, Li Y, Hu Y. Delineation of plant caleosin residues critical for functional divergence, positive selection and coevolution. BMC Evol Biol. 2014;14:1–14.
Biasini M, Bienert S, Waterhouse A, Arnold K, Studer G, Schmidt T, Kiefer F, Cassarino GT, Bertoni M, Bordoli L, Schwede T. SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information. Nucleic Acids Res. 2014;42:W252.
Lv DW, Zhu GR, Zhu D, Bian YW, Liang XN, Cheng ZW, Deng X, Yan YM. Proteomic and phosphoproteomic analysis reveals the response and defense mechanism in leaves of diploid wheat T. monococcum under salt stress and recovery. J Proteomics. 2016;143:93–105.
Kaur S, Dhugga KS, Beech R, Singh J: Genome-wide analysis of the cellulose synthase-like (Csl) gene family in bread wheat (Triticum aestivum L.). BMC Plant Biol 2017, 17:193.
This research was financially supported by grants from National Key R & D Program of China (2016YFD0100502) and the National Natural Science Foundation of China (31771773). The funding bodies didn’t include in the design of the study nor in collection, analysis, and interpretation of data, and also not in writing of the manuscript.
Availability of data and materials
All data generated or analysed during this study are included in this published article and its supplementary information files.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure S1. Neighbor-joining (NJ) phylogenetic tree of all of the expansin proteins from Triticum aestivum, Oryza sativa, Brachypodium distachyon and Sorghum bicolour, Arabidopsis thaliana, Solanum lycopersicum and Gossypium raimondii. Clade of blue branches refers to the EXPA subfamily; clade of red branches refers to the EXPB subfamily; clade of green branches refers to the EXLA subfamily; clade of pink branches refers to the EXLB subfamily. (JPG 14757 kb)
Table S1. Distribution of four subfamily members in expansin gene families from different plant species. (XLSX 11 kb)
Table S2. The information of expansin gene superfamily in wheat. (XLSX 31 kb)
Figure S2. Schematic diagram of motifs of wheat expansin proteins. The schematic diagram was derived from MEME. The order of motifs of expansin proteins in the diagram was automatically generated by MEME according to scores. (JPG 8356 kb)
Table S3. Coevolution sites in wheat expansin (All sites are located on the reference sequence TaEXPA1-A based on the multiple alignment results). (XLSX 10 kb)
Table S4 Cis-element analysis of 2000 bp nucleotide sequences data upstream of the translation initiation codon of expansin genes. (XLSX 141 kb)
Table S5. The RNA-Seq atlas data of the expansin genes. (XLSX 48 kb)
Figure S3. RNA-seq expression analysis of wheat expansin genes. The hierarchical cluster color code: the largest values are displayed as the reddest (hot), the smallest values are displayed as the greenest (cool), and the intermediate values are a lighter color of either red or green. Raw data were normalized by the following equation: reads/kilobase/million. (JPG 6660 kb)
Figure S4. Relative water content changes of wheat leaf and root under PEG and salt stress treatments. (JPG 1208 kb)
Figure S5. The wheat seeding changes under 200 mM NaCl and 20% PEG6000. (JPG 236 kb)
Table S6. Primers used for qRT-PCR of expansins in wheat (XLSX 12 kb)
About this article
Cite this article
Han, Z., Liu, Y., Deng, X. et al. Genome-wide identification and expression analysis of expansin gene family in common wheat (Triticum aestivum L.). BMC Genomics 20, 101 (2019). https://doi.org/10.1186/s12864-019-5455-1