Transcriptome analysis and identification of abscisic acid and gibberellin-related genes during seed development of alfalfa (Medicago sativa L.)
BMC Genomics volume 23, Article number: 651 (2022)
Alfalfa (Medicago sativa) is a widely cultivated plant. Unlike many crops, the main goal of breeding alfalfa is to increase its aboveground biomass rather than the biomass of its seeds. However, the low yield of alfalfa seeds limits alfalfa production. Many studies have explored the factors affecting seed development, in which phytohormones, especially ABA and GAs, play an important role in seed development.
Here, we performed a transcriptome analysis of alfalfa seeds at five development stages. A total of 16,899 differentially expressed genes (DEGs) were identified and classified into 10 clusters, and the enriched Gene Ontology (GO) terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways were determined. The contents of ABA, GA1, GA3, GA4 and GA7 in alfalfa seeds at five development stages were determined. In addition, 14 ABA-related DEGs and 20 GA-related DEGs were identified and analysed. These DEGs are involved in plant hormone pathways and play an important role in seed development. Moreover, morphological and physiological analyses revealed the dynamic changes during the development of alfalfa seeds.
Overall, our study is the first to analyse the transcriptome across various stages of seed development in alfalfa. The results of our study could be used to improve alfalfa seed yield. The key ABA and GA related-genes are potential targets for improving alfalfa seed yield via genetic engineering in the future.
The seed is not only a reproductive organ of plants but also the source of nourishment of the embryo giving rise to a new plant . The seed development process can be divided into two stages. The first is the differentiation and development of tissues and organs, and the second is the accumulation of nutrients and seed maturation. Both of these processes are highly complex . After the seed becomes fully mature, the dry matter content remains stable, the water content decreases, and the embryo enters a dormant state . During seed development, large amounts of nutrients are synthesized and stored, including various carbohydrates, proteins, and fats . The development of the seed not only affects seed quality but also affects the normal growth and development of the following generation. Many studies have evaluated the factors having the largest effects on the development of plant seeds. Although environmental factors (drought, salt, and extreme temperature) limit seed formation, the regulation of gene expression during seed maturation is the ultimate determinant of the fate of developing seeds . There is thus a need to study the mechanism and regulatory network underlying the processes of seed development.
The development of the seed requires a series of complex and dynamic changes in cell division and differentiation, as well as in the biosynthesis of hormones, carbohydrates, proteins, cell walls, lipids, amino acids, and secondary metabolites. Critical to these processes is the expression and regulation of a large number of genes. Thus, the study of the expression patterns of these genes can enhance our understanding of the molecular mechanisms underlying the accumulation of various nutrients during seed development. Previous studies have identified various genes that play key roles in seed development , including genes that control embryo generation and differentiation , endosperm formation and development , seed coat formation , and seed size . Many studies have examined the synthesis and accumulation of nutrients, such as genes encoding seed storage proteins, lipid biosynthesis catabolism and storage, early starch synthesis, and late transformation [11,12,13].
The plant hormones ABA and GAs have important roles in seed development. ABA regulates many developmental processes in plants, including seed maturation and dormancy, nodule development, and plant senescence [14, 15]. Higher plants use the carotenoid pathway initiated by β-carotene to synthesize ABA . GA is a diterpenoid plant hormone biosynthesized through complex pathways that controls all aspects of plant growth and development, including seed germination, stem elongation, leaf unfolding, and flower and seed development . Several studies have shown that ABA and GAs (mostly GA1, GA3, GA4 and GA7 in higher plant) antagonistically regulate diverse aspects of plant growth, and the dynamic balance between these hormones is key to the regulation of seed development . ABA is necessary for inducing the cessation of germ cell division and maintaining seed dormancy, and GA is necessary for ending dormancy and initiating seed germination .
Alfalfa (Medicago sativa) is a high-yield and top-quality forage with high protein content; it is thus known as ‘the queen of forage’. Alfalfa hay is the most important roughage for herbivorous livestock such as cattle and sheep . In addition, alfalfa buds are a natural alkaline food that is low in calories and rich in various amino acids and vitamins; the buds are also known to promote the circulation of blood and prevent several diseases . Research on the development of alfalfa seeds is important for increasing seed yield, the rapidity of seed propagation, and seed quality. Studies examining the role of ABA, GA1, GA3, GA4 and GA7 in seed development at the transcriptional level have been mostly conducted on plants other than alfalfa [22, 23]. Consequently, transcriptomic changes during the development of alfalfa seeds and the role of hormone-related gene regulation in the seed development process remain unclear. There is thus a need to clarify the regulatory mechanisms underlying alfalfa seed development. In this study, a transcriptome analysis was carried out on alfalfa seeds at 11, 19, 27, 35, and 43 days after pollination (DAP). The content of ABA, GA1, GA3, GA4 and GA7 were determined, and the hormone biosynthesis pathways were determined. Changes in gene expression were determined, and the transcriptome data were validated by quantitative real-time PCR (qRT-PCR). The results of this study enhance our understanding of the molecular mechanism underlying hormone-related gene regulation during seed development at the transcriptional level.
Seed development stages and morphological features
The six stages of alfalfa seed development from pollination to full maturity are shown in Fig. 1. At stage 1 (S1, 11 DAP) and stage 2 (S2, 19 DAP), the seed coat was smooth and could be easily crushed with fingertips. At stage 3 (S3, 27 DAP), the seed coat was slightly rough, and the seeds were hard but could be easily broken with fingernails. From stage 4 (S4, 35 DAP) to stage 5 (S5, 43 DAP), the surface of the seed coat was rough, and the seeds were hard and difficult to break with fingernails (Fig. 1A). During these five stages of seed development, the seed coat changed from bright green (S1, S2) to light green (S3), gradually to yellowish-green (S4), and finally to their normal colour (S5) (Fig. 1A). The seed size first increased and reached a maximum at S3; it then gradually decreased as nutrients accumulated in the late stage of seed maturation (Fig. 1A-D). The fresh weight first increased, then decreased, and reached a maximum at S4 (Fig. 1E). The dry weight increased continuously, indicating that nutrients gradually accumulated (Fig. 1F). The water content of the seeds reached a maximum at S2 and then decreased from S2 to S6 (Fig. 1G). No significant changes in morphology and physiology were observed between stage 6 (S6, 51 DAP) and S5, which suggested that full maturity was reached at S5 (Fig. 1). RNA-Seq data were generated for each of the five different stages.
Transcriptome sequencing and assembly
A total of 15 cDNA libraries from the five stages of seed development (S1, S2, S3, S4, and S5) were constructed for Illumina sequencing to comprehensively analyse the seed development and hormone biosynthesis and accumulation at the transcriptional level. There were three biological replicates at each stage. A total of 656,647,958 raw reads and 640,383,114 clean reads were obtained, which consisted of 96.08 Gb clean bases (Additional file 1: Table S1). The average content of GC was 42.22%. The average values of Q20 and Q30 were 98.28 and 94.50%, respectively (Additional file 1: Table S1). The clean reads in each library were mapped to the reference genome of alfalfa . The proportion of clean reads mapped to the reference genome ranged from 73.66 to 80.88%, and the percentage of uniquely mapped reads was over 69.00% (Additional file 1: Table S1).
Principal component analysis (PCA) was conducted on all samples across the different stages. The samples from different biological replicates were clustered in different groups corresponding to the distinct development stages of the seeds (Fig. 2A). Pearson’s correlation correlations were used to evaluate relationships between biological replicates. The correlations between replicates ranged from 0.93 to 0.97, and the correlations between different stages were between 0.35 and 0.88, indicating that the results were highly repeatable (Fig. 2B). The distribution of the gene expression levels of different samples is shown in Fig. 2C.
To validate the DEGs identified by RNA-Seq, nine DEGs were randomly selected for qRT-PCR assays. The qRT-PCR results were consistent with the RNA-Seq data, which indicated that the RNA-Seq data were reliable and accurate (Fig. 3).
DEGs during alfalfa seed development
The FPKM method was used to calculate the expression levels of the transcripts. A total of 16,899 DEGs were identified across the five stages of seed development (Additional file 2: Fig. S1). A total of 4600 (2807 up-regulated and 1793 down-regulated), 6919 (3653 up-regulated and 3266 down-regulated), 10,199 (4242 up-regulated and 5957 down-regulated), and 13,128 (4398 up-regulated and 8730 down-regulated) DEGs were identified for S2 vs. S1, S3 vs. S1, S4 vs. S1, and S5 vs. S1, respectively (Fig. 4A). Of these DEGs, 1028, 637, 755, and 3894 were specific to S2, S3, S4, and S5, respectively (Fig. 4B). A total of 1843 DEGs exhibited different expression patterns in different stages, indicating that the expression of these DEGs changed continuously during the development of alfalfa seeds (Fig. 4B). Analysis of the expression patterns of all DEGs revealed 10 expression patterns with significantly different expression levels among the 16,899 DEGs (Fig. 4C), including seven up-regulation patterns (clusters 1, 2, 3, 4, 6, 8, and 9) and three down-regulation patterns (clusters 5, 7, and 10). The expression levels of clusters 4 and 8 peaked at S2, the expression levels of clusters 3 and 6 peaked at S3, and the expression levels of cluster 2 peaked at S4. Hierarchical clustering results of all 16,899 DEGs were consistent with the clustering results of the expression patterns (Fig. 4D).
GO and KEGG pathway enrichment analysis of the DEGs
Gene Ontology (GO) enrichment analysis was performed to classify the functions of DEGs. In total, 102 GO terms were functionally classified into three main categories: biological processes (36 members), cellular components (7 members), and molecular functions (59 members). In the biological processes category, movement of cell or subcellular component (GO:0006928, padj = 0.00012), microtubule-based movement (GO:0007018, padj = 0.00012), and cellular carbohydrate metabolic process (GO:0044262, padj = 0.00017) were significantly enriched. In the cellular components category, processes related to photosynthesis were significantly enriched, including photosynthetic membrane (GO:0034357, padj = 0.00001), photosystem (GO:0009521, padj = 0.00001), and thylakoid (GO:0009579, padj = 0.00001). In the molecular function category, processes related to enzyme activity were significantly enriched, such as transferase activity, transferring hexosyl groups (GO:0016758, padj< 0.00001), serine-type peptidase activity (GO:0008236, padj< 0.00001), and serine hydrolase activity (GO:0017171, padj< 0.00001) (Fig. 5A).
An enrichment analysis was performed using the Kyoto Encyclopedia of Genes and Genomes (KEGG) database  to identify the catabolism pathways in which the DEGs were involved. Of all 123 pathways, the most representative pathways included ‘photosynthesis’ (mtr00195, 56 genes, padj< 0.00001), ‘plant hormone signal transduction’ (mtr04075, 284 genes, padj = 0.00001), and ‘starch and sucrose catabolism’ (mtr00500, 173 genes, padj = 0.00019) (Fig. 5B and Additional file 1: Table S2). ‘Cutin, suberine and wax biosynthesis’ (mtr00073, padj = 0.00480) was significantly enriched in S2 vs S1, indicating that it plays an important role in the formation of seed structure (Additional file 2: Fig. S2). Several pathways involved in nutrient mobilization were enriched in S4 vs S1 and S5 vs S1, such as ‘starch and sucrose metabolism’, ‘Linoleic acid metabolism’ (mtr00591, padj = 0.00134) and ‘galactose metabolism’ (mtr00052, padj = 0.00216). It shows that the synthesis and accumulation of nutrients begin in the later stage of seed development (Additional file 2: Fig. S2). ‘Plant hormone signal transduction’ was significantly enriched in all four comparisons, indicating that plant hormones have sustained and significant regulatory effects during alfalfa seed development (Additional file 2: Fig. S2).
DEGs involved in ABA and GA biosynthesis and catabolism
To investigate the roles of endogenous hormones in seed development, the content of ABA, GA1, GA3, GA4 and GA7 was measured at different stages (Fig. 6). Both ABA and GA1 levels increased first and then decreased. The ABA level reached a peak at S3, while the GA1 level reached a peak at S2 (Fig. 6A-B). The content of GA3 decreased continuously during seed maturation and was nearly zero in S4 and S5 (Fig. 6C). In contrast to GA1 and GA3, GA4 and GA7 levels increased significantly at late stage of seed development (S5), and GA4 even peaked at S5 (Fig. 6D-E).
We next analysed the genes proposed to be involved in the ABA and GA pathways (Fig. 7). A total of 14 DEGs related to ABA biosynthesis and catabolism pathways were identified from the library, and their expression patterns were analysed (Additional file 1: Table S3). A gene encoding β-carotene hydroxylase was highly expressed and peaked in late development stages (S5). The expression of four genes encoding zeaxanthin epoxidase (ZEP) was up-regulated, peaked at S2, and then decreased. Two genes encoding 9′-cis-epoxycarotenoid dioxygenases (NCED) were highly expressed during seed development (S2 and S3). The expression of one gene encoding abscisic aldehyde oxidase 3 (AAO3) peaked at S4 and S5. Three CYP707As were found to be involved in ABA catabolism pathways, and the expression of these genes was up-regulated in later stages of seed development (S4 or S5). One AtBG1 and two AtBG2 genes were identified, and the expression of these genes peaked in the early stages of seed development (S1 or S2).
The expression of 20 DEGs related to GA biosynthesis and catabolism pathways was analysed (Additional file 1: Table S3). The expression of genes encoding ent-copalyl diphosphate synthase (CPS), ent-kaurene synthase (KS), and ent-kaurenoic acid oxidase (KAO) was up-regulated Gibberellin 3/20 oxidase (GA3/20ox) is important for the oxidation of GA12 to other types of GAs, and GA20ox is the key enzyme in GA biosynthesis. A total of six genes encoding GA20ox were identified, and the expression of four of them (MsG0180005519.01, MsG0180004530.01, MsG0480023478.01, and MsG0480023479.01) was down-regulated and peaked in the earliest stage of seed development (S1). The expression levels of two genes (MsG0580024647.01 and MsG0880043418.01) first increased and then decreased; the highest expression level was observed at S3. Two genes encoding GA3ox were identified (MsG0180000636.01 and MsG0280011347.01), and the expression of these genes were up-regulated during seed development. Five gibberellin 2 oxidase (GA2ox) candidate genes were identified, and these genes play a role in GA catabolism.
Morphological and physiological changes in seed development
The seed development of legumes has been examined in several studies. In soybean, the seed coat changes from bright green to light green and finally becomes yellow; the fresh weight first increases, then decreases, and peaks in the middle stage of seed development . During the seed development of Medicago truncatula, the seed dry weight increases continuously and decreases slightly in the final stage of seed development . Our findings are consistent with these studies and indicate that dramatic physiological changes occur during alfalfa seed development; additional studies of these changes at the molecular level are needed.
Biosynthesis and catabolism of ABA in alfalfa seeds
The physiological effects of ABA are regulated by its endogenous levels . During the seed development of Arabidopsis and Nicotiana plumbaginifolia, the level of ABA increases during the early stage, peaks in the middle stage of seed development, and then decreases during seed desiccation [27, 28]. These findings are consistent with the results of our research.
Plants synthesize ABA through the carotenoid pathway. In the process of β-carotene turning into zeaxanthin, the β-carotene hydroxylase acts a very important function . Although the content of ABA is low in the early stage of alfalfa seed development, its precursors are synthesized during these stages, and β-carotene is one of the important precursors of ABA biosynthesis. The β-carotene hydroxylase candidate gene (MsG0680032953.01) was highest in the later stage of seed development. One possible explanation for this pattern is that alfalfa seeds need to accumulate ABA in the later stage of development.
Conversion of zeaxanthin to antheraxanthin is catalyzed by ZEP, which is the first rate-limiting step of ABA biosynthesis . Four genes encoding ZEP were highly expressed in the early stage of seed development (S2), indicating that the rate of cleavage of zeaxanthin was high, which contributed to the accumulation of xanthoxins in the early stage of seed development and generated the precursors necessary for the biosynthesis and accumulation of ABA in the middle and late stages of seed development .
The 9′-cis-neoxanthin synthesized in the previous step can be oxidized by NCED to produce xanthoxins , which is the second rate-limiting step of ABA biosynthesis . The NCED gene family is thought to play a direct role in determining the final ABA content of seeds and the response induced by ABA . The expression of NCED candidate genes (MsG0280009781.01 and MsG0280009782.01) was higher in the middle stage of seed development, which was consistent with the changes in the ABA content, indicating that NCED plays a key role in regulating ABA biosynthesis .
In the cytoplasm, short-chain alcohol dehydrogenase encoded by ABA-DEFICIENT 2 (ABA2) converts xanthoxin to abscisic aldehyde, which is finally oxidized to abscisic acid by AAO3 . The AAO3 candidate gene (MsG0580029491.01) was up-regulated, and its expression was higher in the later stage of seed development, which is consistent with the results of previous studies .
There are two main types of ABA catabolism. One is the decomposition of ABA into DPA-4-O-β-D-glucoside (DPAG), in which cytochrome P450 monooxygenase encoded by CYP707A acts as a catalyst ; the other is the glycosylation of ABA by UDP-glucosyltransferase encoded by UGT71C5 to form ABA-glucose ester (ABA-GE), which is the inactive form of ABA. β-glucosidases encoded by AtBG1/2 can convert ABA-GE to active ABA when needed by the plant . The three CYP707A candidate genes identified in this study were highly expressed in the middle and later stages of seed development, and one AtBG1 and two AtBG2 candidate genes were highly expressed in the early stage of seed development. ABA might maintain a suitable internal environment for seed development in the early stage of seed development through a flexible transformation mechanism involving a cycle of β-glucosidases encoded by AtBG1/2 and UDP-glucosyltransferase encoded by UGT71C5. However, in the later stage of seed development, the seeds mainly depend on CYP707A to completely decompose the redundant ABA, which makes the seeds mature and dormant .
Biosynthesis and catabolism of GAs in alfalfa seeds
GA1, GA3, GA4 and GA7 are considered to be the most important GA in higher plants, and seeds are major sites of GA synthesis . At the early stage of alfalfa seed development (S1-S2), the contents of four GAs were all at a high level, which was consistent with the results of Arabidopsis . Therefore, it can be considered that S1-S2 is the key period for GAs synthesis of alfalfa seeds. At the middle stage (S2-S4), the contents of the four GAs decreased continuously, indicating that S2-S4 may be the key stage of endogenous GA metabolism in alfalfa seeds. At the late stage of seed development (S5), the contents of GA1 and GA3 decreased, indicating that the requirement of these two hormones was greatly reduced when alfalfa seeds entered the stage of maturation and dehydration . On the contrary, amount of GA4 and GA7 accumulated in seeds, indicates that a certain level of GA may need to be maintained at the late stage of seed development .
Plants synthesize GA from GGDP, which is a common precursor for ABA. GGDP is cyclized to ent-kaurene under the catalysis of CPS and KS. Three CPS candidate genes (MsG0780040643.01, novel.6865, and MsG0780036356.01) and two KS candidate genes (MsG0280009515.01 and MsG0380014444.01) were highly expressed in the early and middle stages of seed development, which resulted in the accumulation of sufficient precursors for GA12 biosynthesis. This result was consistent with the observed changes in the level of four GAs. Ent-kaurene is converted to GA12 through a series of reactions catalysed by ent-kaurene oxidase (KO) and KAO, and GA12 is further oxidized by GA3/20ox to generate bioactive GA . Two candidate genes of KAO have been identified in the transcriptome, all of which belong to KAO2, indicating that KAO is a key enzyme that catalyses the conversion of ent-kaurenoic acid to GA12, which is the common precursor of bioactive GAs (including GA1, GA3, GA4 and GA7) [39, 43]. Numerous genes encoding GA20ox are downregulated during alfalfa seed development, suggesting that the synthesis of bioactive GA was reduced, which was consistent with the observation of the declined GA content .
Finally, GA2ox converts bioactive GA to inactive GA . Most of the GA2ox candidate genes were highly expressed in the middle and later development stages, indicating that amount of inactive GA was accumulated in the later stage of alfalfa seed development, which was consistent with GA1, GA3 and GA7 levels .
Dynamic regulation of ABA and GA during alfalfa seed development
ABA and GA are recognized as primary hormones that antagonistically regulate seed development. A large number of genes were upregulated during alfalfa seed development in GA biosynthesis pathway, which was not concomitant with the declination of GA content. One possible reason is that these genes were also related to ABA synthesis and contributed to ABA accumulation at the early stage of seed development. For example, isopentenyl diphosphate (IPP), farnesyl pyrophosphate (FPP), and geranylgeranyl diphosphate (GGDP) are the common precursors in the biosynthesis pathways of ABA and GA, which may provide more allocation for ABA synthesis, which also led to less allocation for GA synthesis. Therefore, although genes related to GA synthesis were up-regulated, the content of GA at the physiological level dropped in the early seed development.
The morphological and physiological characteristics of alfalfa seeds, the ABA and GA3 content, and the gene expression profiles were analysed during the development of seeds to provide information that enhances our understanding of hormone regulation in alfalfa seed development. The morphology of alfalfa seeds, seed dry and fresh weight, and seed water content were determined at the five stages of seed development (11, 19, 27, 35, and 43 DAP). Illumina sequencing technology was used for the transcriptome analysis, and a total of 16,899 DEGs were identified. All DEGs were divided into 10 clusters according to their expression patterns. The GO terms and KEGG pathways significantly enriched for these DEGs were also determined. In addition, 14 genes involved in ABA biosynthesis and 20 genes involved in GA3 biosynthesis were identified, and these might be key regulators in different stages of alfalfa seed development. To the best of our knowledge, this is the first study to characterize the transcriptional changes in alfalfa seeds during their development. The findings of this study enhance our understanding of the complex transcriptome dynamics and gene regulation mechanisms during the process of seed development and hormone regulation. Our results also have implications for improving the seed yield of alfalfa.
Seeds of alfalfa (variety Gold Empress) were collected from a germplasm resources nursery in Yuzhong, Lanzhou, China. After pollination, seeds were hand-collected at intervals of 8 days (11, 19, 27, 35 43, and 51 DAP) from the beginning of pollination until full maturity from June to August 2020. There were six biological replicates for each stage (three for transcriptome sequencing and three for hormone content determination) with 1 g of seeds per replicate. All the biological replicates were collected from the same three individuals. The collected samples were flash-frozen in liquid nitrogen and stored in a − 80 °C refrigerator.
Morphological and physiological determination
To reduce random error, the following analyses were all based on 200 seeds collected from each of the three individuals at each stage. After the pods were removed on ice, seeds were immediately photographed with a stereomicroscope (Zeiss SteREO Discovery V20, Germany), and the length, width, and projected area were measured by Digimizer Image Analysis Software (https://www.digimizer.com/). The fresh weight of seeds was measured immediately after the pods were removed, and then the seeds were dried at 85 °C until a constant weight was achieved. Each measurement was repeated three times, and the water content was calculated by subtracting the dry weight from the fresh weight.
Library construction and sequencing
One microgram of RNA was used to prepare samples. mRNA was purified from total RNA using poly-T oligo-attached magnetic beads. First-strand cDNA was synthesized using random hexamer primers and M-MuLV Reverse Transcriptase (RNase H). Second-strand cDNA synthesis was performed using DNA Polymerase I and RNase H. Remaining overhangs were converted into blunt ends via exonuclease/polymerase activities. The library fragments were purified with the AMPure XP system (Beckman Coulter, Beverly, USA). PCR was then performed using Phusion High-Fidelity DNA polymerase, Universal PCR primers, and Index (X) Primer. PCR products were purified (AMPure XP system), and library quality was assessed on the Agilent Bioanalyzer 2100 system. The clustering of the index-coded samples was performed on a cBot Cluster Generation System using the TruSeq PE Cluster Kit v3-cBot-HS (Illumia) according to the manufacturer’s instructions. After cluster generation, the library preparations were sequenced on an Illumina Novaseq platform, and 150-bp paired-end reads were generated. Clean reads were obtained by removing reads containing adapter, reads containing ploy-N and low-quality reads from raw reads in fastaq format. All the downstream analyses were based on clean data with high quality. Reference genomes were indexed using Hisat2 v2.0.5 and paired-end clean reads were aligned to the reference genome. FeatureCounts v1.5.0-p3 was used to count the reads numbers mapped to each gene. And then FPKM of each gene was calculated based on the length of the gene and reads count mapped to this gene.
Differentially expressed genes (DEGs) analysis
Differential expression analysis of five stages (three biological replicates per stage) was performed using the DESeq2 R package (1.16.1). Both the absolute values of the log2(Fold Change) ≥2 and the absolute values of the padj ≤0.05 were used as the thresholds for identifying significant DEGs. Cluster analysis and expression profile evaluation were carried out using the hierarchical clustering and K-means clustering methods in MEV4.9 software (https://sourceforge.net/projects/mev-tm4/files/mev-tm4/) . GO enrichment analysis of DEGs was conducted using the clusterProfiler R package, and gene length bias was corrected. GO terms were considered significantly enriched for DEGs at P < 0.05. KEGG pathway enrichment analysis of DEGs was conducted using KOBAS 3.0 (http://kobas.cbi.pku.edu.cn/) .
The location and annotation information of clean reads were obtained by comparing the RNA-Seq data and reference genome of ZhongMu No.1. Combined with the published hormone-related gene information, 14 ABA-related DEGs and 20 GA-related DEGs were identified.
Hormone content determination
The methods for quantitatively analysing the content of phytohormones were based on those described in a previous study . Approximately 0.5 g of seeds were ground and mixed with the extraction buffer composed of isopropanol, ultrapure water and concentrated hydrochloric acid (2:1:0.002 by volume). The extract was shaken at 4 °C for 30 min. Then, 10 mL of dichloromethane was added, and the sample was again shaken at 4 °C for 30 min. The sample was centrifuged at 13,000 rpm for 5 min at 4 °C and then the organic phase was extracted. The organic phase was dried under N2, dissolved in methanol (0.1% methanoic acid) and filtered through a 0.22 μm filter membrane. The standard solutions with gradients of 0.1 ng/mL, 0.2 ng/mL, 0.5 ng/mL, 2 ng/mL, 5 ng/mL, 20 ng/mL, 50 ng/mL and 200 ng/mL were prepared with methanol (0.1% methanoic acid) as solvent. The purified product was subjected to high-performance liquid chromatography-tandem mass spectrometry (HPLC-MS/MS) analysis. The mobile phase A solvents consisted of methanol/0.1% methanoic acid, and the mobile phase B solvents consisted of ultrapure water/0.1% methanoic acid. The elution gradient was 0–1 min, A = 20%; 1–9 min, A = 80%; 9–10 min, A = 80%; 10–10.1 min, A = 20%; 10.1–15 min, A = 20%. The injection volume was 2 μL. MS conditions were as follows: the ionspray voltage was 4500 V; the pressure of the curtain gas, nebulizer, and aux gas were 15, 65, and 70 psi, respectively; and the atomizing temperature was 400 °C. The content of hormones was determined by an Agilent 1290 Infinity system combined with an AB Sciex QTRAP 6500 mass spectrometer. The reaction monitoring conditions were parent ions: 263.1 m/z for ABA, 347.4 m/z for GA1, 345.2 m/z for GA3, 331.4 m/z for GA4, 329.2 m/z for GA7; quantitative ions: 153.0 m/z for ABA, 259.2 m/z for GA1, 143.0 m/z for GA3, 213.1 m/z for GA4, 223.2 m/z for GA7; qualitative ions: 204.2 m/z for ABA, 273.1 m/z for GA1, 239.2 m/z for GA3, 243.2 m/z for GA4, 241.1 m/z for GA7. Measurements of each sample were taken three times.
Primer 6 software was used to design primers for qRT-PCR, and then the primers were synthesized by Tsingke Biotechnology (Beijing, China). Gene-specific primers are listed in Additional file 1: Table S4. cDNA was reverse-transcribed from total RNA using a FastQuant RT Kit (with gDNase) (Tiangen Biotech, Beijing, China). qRT-PCR analysis was performed using 2xSG Fast qPCR Master Mix (Sangon Biotech, Shanghai, China) on a CFX96 TOUCH Real-Time Detection System (Bio-Rad, Singapore) under the following parameters: 95 °C for 30 s and 40 cycles of 95 °C for 5 s and 60 °C for 30 s; each qRT-PCR experiment was conducted three times . An internal control, the alfalfa actin gene, was used to normalize the expression data . The 2−ΔΔCt method was used to calculate the relative expression level of genes .
Statistical analysis of data
SPSS 26.0 software was used to perform one-way analysis of variance and Duncan’s multiple range tests (P < 0.05). All data points were the averages of three biological replicates.
Availability of data and materials
The reference genome data of Zhongmu No.1 cultivated alfalfa was obtained from https://figshare.com/articles/dataset/Medicago_sativa_genome_and_annotation_files/12623960. All transcriptome sequencing reads are available in NCBI SRA (accession number PRJNA781768).
Abscisic aldehyde oxidase 3
- AtBG :
Arabidopsis thaliana beta-1,3-glucosidase
Ent-copalyl diphosphate synthase
Days after pollination
Differentially expressed gene
Gibberellin 2/3/20 oxidase
Ent-kaurenoic acid oxidase
Kyoto encyclopedia of genes and genomes
National center for biotechnology information
Principal component analysis
Quantitative real-time PCR
Sreenivasulu N, Wobus U. Seed-development programs: a systems biology-based comparison between dicots and monocots. Annu Rev Plant Biol. 2013;64:189–217.
Shu K, Liu XD, Xie Q, He ZH. Two faces of one seed: hormonal regulation of dormancy and germination. Mol Plant. 2015;9:34–45.
Graeber K, Nakabayashi K, Miatton E, Leubner-Metzger G, Soppe WJ. Molecular mechanisms of seed dormancy. Plant Cell Environ. 2012;35:1769–86.
Li J, Berger F. Endosperm: food for humankind and fodder for scientific discoveries. New Phytol. 2012;195:290–305.
Su LY, Wan SQ, Zhou JM, Shao QS, Xing BC. Transcriptional regulation of plant seed development. Physiol Plantarum. 2021. https://doi.org/10.1111/ppl.13548.
Meinke D, Muralla R, Sweeney C, Dickerman A. Identifying essential genes in Arabidopsis thaliana. Trends Plant Sci. 2008;13:483–91.
Bayer M, Slane D, Ju¨rgens G. Early plant embryogenesis - dark ages or dark matter? Curr Opin Plant Biol. 2017;35:30–6.
Olsen OA. The modular control of cereal endosperm development. Trends Plant Sci. 2020;25:279–90.
Khan D, Millar JL, Girard IJ, Belmonte MF. Transcriptional circuitry underlying seed coat development in Arabidopsis. Plant Sci. 2014;219:51–60.
Orozco-Arroyo G, Paolo D, Ezquer I, Colombo L. Networks controlling seed size in Arabidopsis. Plant Reprod. 2015;28:17–32.
Yamamoto A, Kagaya Y, Toyoshima R, Kagaya M, Takeda S, Hattori T. Arabidopsis NF-YB subunits LEC1 and LEC1-LIKE activate transcription by interacting with seed-specific ABRE-binding factors. Plant J. 2010;58:843–56.
Jolivet P, Boulard C, Bellamy A, Valot B, D’Andréa S, Zivy M, et al. Oil body proteins sequentially accumulate throughout seed development in Brassica napus. J Plant Physiol. 2011;168:2015–20.
Basnet RK, Moreno-Pachon N, Lin K, Bucher J, Visser RGF, Maliepaard C, et al. Genome-wide analysis of coordinated transcript abundance during seed development in different Brassica rapa morphotypes. BMC Genomics. 2013;14:840.
Zhu JK. Abiotic stress signaling and responses in plants. Cell. 2016;167:313–24.
Liu H, Zhang C, Yang J, Yu N, Wang E. Hormone modulation of legume-rhizobial symbiosis. J Integr Plant Biol. 2018;60:632–48.
Gong ZZ, Xiong LM, Shi HZ, Yang SH, Herrera-Estrella LR, Xu GH, et al. Plant abiotic stress response and nutrient use efficiency. Sci China Life Sci. 2020;63:40.
Wang B, Fang RQ, Chen FM, Han JL, Liu YG, Chen LT, et al. A novel CCCH-type zinc finger protein SAW1 activates OsGA20ox3 to regulate gibberellin homeostasis and anther development in rice. J Integr Plant Biol. 2020;62:1594–606.
Frey A, Effroy D, Lefebvre V, Seo M, Perreau F, Berger A, et al. Epoxycarotenoid cleavage by NCED5 fine-tunes ABA accumulation and affects seed dormancy and drought tolerance with other NCED family members. Plant J. 2012;70:501–12.
Yano R, Kanno Y, Jikumaru Y, Nakabayashi K, Kamiya Y, Nambara E. CHOTTO1, a putative double APETALA2 repeat transcription factor, is involved in abscisic acid-mediated repression of gibberellin biosynthesis during see d germination in Arabidopsis. Plant Physiol. 2009;151:641–54.
Broderick GA. Performance of lactating dairy cows fed either alfalfa silage or alfalfa hay as the sole forage. J Dairy Sci. 1995;78:0–329.
Gu YJ, Guo QH, Zhang L, Chen ZG, Han YB, Gu ZX. Physiological and biochemical metabolism of germinating broccoli seeds and sprouts. J Agric Food Chem. 2012;60:209–13.
Gallardo K, Firnhaber C, Zuber H, Héricher D, Belghazi M, Henry C, et al. A combined proteome and transcriptome analysis of developing Medicago truncatula seeds. Mol Cell Proteomics. 2007;6:2165–79.
Jones SI, Gonzalez DO, Vodkin LO. Flux of transcript patterns during soybean seed development. BMC Genomics. 2010;11:136.
Shen C, Du HL, Chen Z, Lu HW, Zhu FG, Chen H, et al. The chromosome-level genome sequence of the autotetraploid alfalfa and resequencing of core germplasms provide genomic resources for alfalfa research. Mol Plant. 2020;13(9):12.
Kanehisa M, Goto S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28:27–30.
Chauffour F, Bailly M, Perreau F, Cueff G, Suzuki H, Collet B, et al. Multi-omics analysis reveals sequential roles for ABA during seed maturation. Plant Physiol. 2019;180:1198–218.
Karssen CM, Swan DL, Breekland AE, Koornneef M. Induction of dormancy during seed development by endogenous abscisic acid: studies on abscisic acid deficient genotypes of Arabidopsis thaliana (L.) Heynh. Planta. 1983;157:158–65.
Frey A, Boutin JP, Sotta B, Mercier R, Marion-Poll A. Regulation of carotenoid and ABA accumulation during the development and germination of Nicotiana plumbaginifolia seeds. Planta. 2006;224:622–32.
Nambara E, Marion-Poll A. Abscisic acid biosynthesis and catabolism. Annu Rev Plant Biol. 2005;56:165–85.
Audran C, Liotenberg S, Gonneau M, North HM, Frey A, Tap-Waksman K, et al. Localisation and expression of zeaxanthin epoxidase mRNA in Arabidopsis in response to drought stress and during seed development. Aust J Plant Physiol. 2001;28:1161.
Gonzalez-Jorge S, Mehrshahi P, Magallanes-Lundback M, Lipka AE, Angelovici R, Gore MA, et al. ZEAXANTHIN EPOXIDASE activity potentiates carotenoid degradation in maturing seed. Plant Physiol. 2016;171:1837–51.
Schwartz SH, Tan BC, Gage DA, Zeevaart JA, McCarty DR. Specific oxidative cleavage of carotenoids by VP14 of maize. Science. 1997;276:1872–4.
Chen K, Li GJ, Bressan RA, Song CP, Zhu JK, Zhao Y. Abscisic acid dynamics, signaling, and functions in plants. J Integr Plant Biol. 2020;62(1):25–54.
Seo M, Aoki H, Koiwai H, Kamiya Y, Nambara E, Koshiba T. Comparative studies on the Arabidopsis aldehyde oxidase (AAO) gene family revealed a major role of AAO3 in ABA biosynthesis in seeds. Plant Cell Physiol. 2004;45(11):1694–703.
González-Guzmán M, Abia D, Salinas J, Serrano R, Rodríguez PL. Two new alleles of the abscisic aldehyde oxidase 3 gene reveal its role in abscisic acid biosynthesis in seeds. Plant Physiol. 2004;135:325–33.
Kushiro T, Okamoto M, Nakabayashi K, Yamagishi K, Kitamura S, Asami T, et al. The Arabidopsis cytochrome P450 CYP707A encodes ABA 8′-hydroxylases: key enzymes in ABA catabolism. EMBO J. 2004;23:1647–56.
Liu Z, Yan JP, Li DK, Luo Q, Yan QJ, Liu ZB, et al. UDP-glucosyltransferase71c5, a major glucosyltransferase, mediates abscisic acid homeostasis in Arabidopsis. Plant Physiol. 2015;167:1659–70.
Palaniyandi SA, Chung G, Kim SH, Yang SH. Molecular cloning and characterization of the ABA-specific glucosyltransferase gene from bean (Phaseolus vulgaris L.). J Plant Physiol. 2015;178:1–9.
Yamaguchi S. Gibberellin metabolism and its regulation. Annu Rev Plant Biol. 2008;59:225–51.
Kim YC, Nakajima M, Nakayama A, Yamaguchi I. Contribution of gibberellins to the formation of Arabidopsis seed coat through starch degradation. Plant Cell Physiol. 2005;46:1317.
Liu Y, Fang J, Xu F, Chu JF, Yan CY, Schläppi MR, et al. Expression patterns of ABA and GA metabolism genes and hormone levels during rice seed development and imbibition: a comparison of dormant and non-dormant rice cultivars. J Genet Genomics. 2014;41(6):327–38.
Monpara JK, Chudasama KS, Thaker VS. Role of phytohormones in soybean (Glycine max) seed development. Russ J Plant Physiol. 2019;66:992–8.
Regnault T, Davière JM, Heintz D, Lange T, Achard P. The gibberellin biosynthetic genes AtKAO1 and AtKAO2 have overlapping roles throughout Arabidopsis development. Plant J. 2014;80:462–74.
Thomas SG, Phillips AL, Hedden P. Molecular cloning and functional expression of gibberellin 2-oxidases, multifunctional enzymes involved in gibberellin deactivation. Proc Natl Acad Sci U S A. 1999;96:4698–703.
Burlat V, Oudin A, Courtois M, Rideau M, St-Pierre B. Co-expression of three MEP pathway genes and geraniol 10-hydroxylase in internal phloem parenchyma of Catharanthus roseus implicates multicellular translocation of intermediates during the biosynthesis of monoterpene indole alkaloids and isoprenoid-derived primary metabolites. Plant J. 2004;38(1):131–41.
Luo D, Wu YG, Liu J, Zhou Q, Liu WX, Wang YR, et al. Comparative transcriptomic and physiological analyses of Medicago sativa L. indicates that multiple regulatory networks are activated during continuous ABA treatment. Int J Mol Sci. 2018;20:47.
Pan XQ, Welti R, Wang XM. Quantitative analysis of major plant hormones in crude plant extracts by high-performance liquid chromatography-mass spectrometry. Nat Protoc. 2010;5:986–92.
Zhou Q, Luo D, Chai XT, Wu YG, Wang YR, Nan ZB, et al. Multiple regulatory networks are activated during cold stress in Medicago sativa L. Int J Mol Sci. 2018;19:3169.
Liu WX, Zhang ZS, Chen SY, Ma LC, Wang HC, Dong R, et al. Global transcriptome profiling analysis reveals insight into saliva-responsive genes in alfalfa. Plant Cell Rep. 2016;35:561–71.
Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2−∆∆CT method. Methods. 2001;25:402–8.
We acknowledge Wenxian Liu and Yue Cui at Lanzhou University for their valuable help of plant materials.
This research was supported by the Strategic Priority Research Program of Chinese Academy of Sciences (XDA26030103), the National Natural Science Foundation of China (31722055) and the National Natural Science Foundation of China (32071862).
Ethics approval and consent to participate
We conducted the experimental research on cultivated alfalfa in accordance with the IUCN Policy Statement on Research Involving Species at Risk of Extinction and the Convention on the Trade in Endangered Species of Wild Fauna and Flora.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Summary statistics of RNA-Seq results in seed development of Medicago sativa. Table S2. DEGs of the most representative KEGG pathways. Table S3. List of DEGs involved in biosynthetic and metabolism pathway of ABA and GAs. Table S4. qRT-PCR specific primers.
Identification of the DEGs in five development stages: S2 vs. S1 (A); S3 vs. S1 (B). S4 vs. S1 (C) and S5 vs. S1 (D). Fig. S2. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways in: S2 vs. S1 (A); S3 vs. S1 (B). S4 vs. S1 (C) and S5 vs. S1 (D). The left Y-axis indicates the KEGG pathway. The X-axis indicates the gene ratio. High padj values are indicated in purple, and low padj values are indicated in red.
About this article
Cite this article
Zhao, L., Li, M., Ma, X. et al. Transcriptome analysis and identification of abscisic acid and gibberellin-related genes during seed development of alfalfa (Medicago sativa L.). BMC Genomics 23, 651 (2022). https://doi.org/10.1186/s12864-022-08875-0