- Open Access
Combined transcriptomic and metabolomic analysis reveals the potential mechanism of seed germination and young seedling growth in Tamarix hispida
BMC Genomics volume 23, Article number: 109 (2022)
Seed germination is a series of ordered physiological and morphogenetic processes and a critical stage in plant life cycle. Tamarix hispida is one of the most salt-tolerant plant species; however, its seed germination has not been analysed using combined transcriptomics and metabolomics.
Transcriptomic sequencing and widely targeted metabolomics were used to detect the transcriptional metabolic profiles of T. hispida at different stages of seed germination and young seedling growth. Transcriptomics showed that 46,538 genes were significantly altered throughout the studied development period. Enrichment study revealed that plant hormones, such as auxin, ABA, JA and SA played differential roles at varying stages of seed germination and post-germination. Metabolomics detected 1022 metabolites, with flavonoids accounting for the highest proportion of differential metabolites. Combined analysis indicated that flavonoid biosynthesis in young seedling growth, such as rhoifolin and quercetin, may improve the plant’s adaptative ability to extreme desert environments.
The differential regulation of plant hormones and the accumulation of flavonoids may be important for the seed germination survival of T. hispida in response to salt or arid deserts. This study enhanced the understanding of the overall mechanism in seed germination and post-germination. The results provide guidance for the ecological value and young seedling growth of T. hispida.
Seed germination and post-germination (early seedling growth) are controlled by various environmental factors and are important stages for the survival of higher plants under ambiently suitable environment . Many physiological and morphological studies have been conducted on related processes, including plant pigment regulation [2,3,4], abiotic stress [5,6,7,8] and plant hormone regulation [9,10,11]. These stages require a large amount of energy and nutrients which can only be obtained from seed reserves because the germinated seeds cannot absorb minerals and produce energy through photosynthesis . Seed germination begins with the water absorption by the stationary dry seed and is completed by radicle protrusion through the surrounding germ tissues. This process involves a series of orderly physiological and morphogenetic processes, such as energy conversion, nutrient consumption and metabolite changes . After dry seeds have absorbed water, glycolysis, pentose phosphate pathways, and tricarboxylic acid (TCA) cycles are activated. Glycolysis and the TCA cycle provide most of the energy for seed germination . Seed germination and early seedling growth are regulated through a complex network of signalling and gene expression regulation. For example, seed germination can be regulated by multiple plant hormones [15, 16], such as the antagonistic action of abscisic acid (ABA) and gibberellin (GA)  and the interaction of jasmonic acid (JA), indole-3-acetic acid (IAA) and other phytohormones .
Different plants may have similar molecular mechanisms, including plant hormonal behaviour, transcription and translation activation and radicle protrusion. However, different plant species also have some unique mechanisms, especially for reserve mobilisation and metabolic activation. These differences may be attributed to their different seed stocks . With the rapid development of system biology and high-throughput sequencing, multi-omics technology has become an indispensable research tool in life science [20, 21]. This method can extensively analyse the cell life cycle in all aspects by identifying gene transcripts, metabolites and protein changes throughout growth and development from cell to tissue and the individual itself to understand the complex mechanism of plants and animals.
One of the extremely saline/alkali-tolerant plants, Tamarix hispida Willd. is a typical woody halophyte that forms a natural forest in 1% saline–alkali soil of desert environments. This species is also tolerant to drought stress and thus is an ideal material for cloning drought and saline tolerance-related genes and studying the saline tolerance mechanism of woody halophytes . ThSAP30BP may play an important physiological role in the salt tolerance of T. hispida . The 2-Cys peroxidase gene of this plant improves its tolerance to salt stress . ThNAC7 induces the transcriptional levels of genes associated with stress tolerance to enhance salt and osmotic tolerance by increasing osmotic potential and enhanced ROS scavenging capability . ThMYB13 may also play a role in salt stress tolerance in this transgenic plant . The bZIP protein Thbzip1 of T. hispida is an ACGT elemental binding factor that enhances abiotic stress signalling in transgenic Arabidopsis thaliana . However, the molecular mechanisms underlying the seed germination and post-germination of T. hispida have not been reported. Transcriptomic sequencing is an effective tool to understand complex molecular regulatory mechanisms and provide a new perspective on T. hispida seed germination. At present, this method is routinely used as an experimental platform and has made important contributions to the discovery and identification of genes involved in metabolic pathways . Metabolomics can reveal the terminal products of the signalling pathway and consequently reflect the physiological state of an organism at a specific time. Metabolomes are quite similar to phenotypes and thus can provide detailed information about the intracellular activities regulated by metabolites. Therefore, metabolomics has been widely used in model plants and crops, such as A. thaliana [28,29,30], rice [31,32,33], soybean [34, 35] and many other medical plants [36,37,38].
In this study, the mechanisms underlying the germination and post-germination of T. hispida were investigated using an integrative transcriptomic and metabolomic approach. Gene Ontology (GO) enrichment analysis of differentially expressed genes (DEGs) indicated the differential involvement of biological processes in the six stages of seed germination and post-germination. The dynamic changes of the key genes in flavonoid biosynthesis and phytohormone-related were observed also observed. The Combined transcriptomic and metabolomic analysis showed the accumulation of flavonoids in post-germination. The results provide a valuable reference for the study of seed germination and its functions and effects on saline/alkali-tolerant plants.
Stage definition and transcript assembly of seed germination and post-germination in Tamarix hispida
Seed germination can be divided into the following three stages according to Bewley’s definition : rapid water imbibition, limited water absorption and increase in water uptake accompanied by embryonic axis elongation. For T. hispida, the dry seeds were labelled as stage 1 at 0 h, followed by the rapid increase in water uptake at 0.5 h as stage 2, a slow increase in water uptake at 5 h as stage 3 and hypocotyl extension period at 24 h as stage 4 (Fig. 1a and Additional file 1: Table S1). Post-germination comprised stage 5 with cotyledon unfolding at 144 h and stage 6 with four-true-leaf unfolding at 288 h (Fig. 1b). All these defined stages are shown in Fig. 1b.
Eighteen samples from six stages (1–6, three replicates for each stage) were sampled for RNA-seq. A total of 866,487,598 high-quality reads with Q30 higher than 90% were obtained after quality control, and 75,249 unigenes were de novo assembled using Trinity software. The length was over 1000 bp for 36,531 unigenes (48.6%) and over 1500 bp for 23,089 (30.1%) unigenes. The average gene length and N50 length were 1502 and 2031 bp, respectively. Functional unigene annotation was performed and mapped to NCBI non-redundant (Nr) (48,606, 64.6%), eggNOG database (39,772, 52.9%), Swiss-Prot (33,428, 44.4%), Kyoto Encyclopedia of Genes and Genomes (KEGG) (12,510, 16.6%) and GO (26,371, 35%) (Additional file 2: Fig. S1). A summary of RNA-seq data is shown in Table 1.
Differential expression and functions of the genes involved in seed germination and post-germination
Pairwise differential expression profiling analysis was conducted with the threshold of FDR ≤ 0.05 and absolute value fold change ≥ 2.0 using DEseq2 software to investigate the molecular basis of seed germination and post-germination . Various numbers of DEGs among neighbouring stages were identified as follows: 1220 between stages 1 and 2 (736 up, 484 down), 1284 between stages 2 and 3 (830 up, 454 down), 19,439 between stages 3 and 4 (9206 up, 10,233 down), 12,728 between stages 4 and 5 (6458 up, 6270 down) and 955 between stages 5 and 6 (547 up, 408 down) (Fig. 2a, Additional file 1: Table S2). This result showed that the number of DEGs in the hypocotyl extension period (stage 3 versus stage 4) of seed germination and the cotyledon unfolding period (stage 4 vs. stage 5) of seed post-germination was higher than that in the other comparison groups.
GO and KEGG enrichment analysis for adjacent compared groups reflected the physiological response of germination and post-germination. For example, the most enriched DEGs were identified in the GO term of ‘cellular response to osmotic stress’ in the comparison between stages 1 and 2. Meanwhile, the most enriched DEGs were observed in the KEGG pathway of ‘metabolism of xenobiotics by cytochrome P450’ in the comparison between stages 1 and 2, indicating that cytochrome P450 may be important for the rapid increase in water uptake (Additional file 2: Fig. S2 and S3). Given that seed germination occurs from stage 1 to stage 4, further KEGG enrichment analysis was performed on 18,975 DEGs (stage 1 vs. stage 4). Among these enriched pathways, ‘phenylpropanoid biosynthesis’ (ko00940), ‘plant hormone signal transduction’ (ko04075) and ‘flavonoid biosynthesis’ (ko00941) were the top three significantly enriched categories (Fig. 2b, Additional file 1: Table S3) and thus have important roles in the seed germination of T. hispida. At early young seedling during post-germination, ‘phenylpropanoid biosynthesis’ and ‘flavonoid biosynthesis’ were the most significantly enriched in the periods of cotyledon unfolding (part I, stage 4 vs. stage 5) and true-leaf unfolding (part II, stage 5 vs. stage 6). Among the separate top five enriched pathways, ‘plant–pathogen interaction’ and ‘plant hormone signal transduction’ were significantly enriched in part I, and ‘flavone and flavonol biosynthesis’ and ‘anthocyanin biosynthesis’ were significantly enriched in part II (Fig. 2c and d, Additional file 1: Table S3). The enrichment of ‘anthocyanin biosynthesis’ term in true-leaf unfolding period indicated anthocyanin accumulation, which was consistent with the morphology of extremely dark red stem and root compared with those during cotyledon unfolding (Figs. 1b and 2d). These results showed that hormone signal transduction and flavonoid biosynthesis may be important for the seed germination and post-germination of T. hispida.
Dynamic regulation of key gene families in flavonoid biosynthesis during seed germination and post-germination
The phenylpropanoid pathway is the upstream branch of flavonoid biosynthesis; hence, DEGs involved in phenylpropanoid biosynthesis (ko00940) and flavonoid biosynthesis (ko00941) were identified during the seed germination and post-germination of T. hispida. A total of 158 DEGs in phenylpropanoid pathway, such as genes encoding phenylalaninammo-nialyase (PAL), 4-coumarate-CoA ligase (4CL) and trans-cinnamate 4-monooxygenase (CYP73A), were overall up-regulated after stage 3 (Additional file 2: Fig. S4). Flavonoid biosynthetic genes were divided into two categories, namely, early biosynthetic genes (EBGs) responsible for the production of common precursors and late biosynthetic genes (LBGs) for the eventual products. The former mainly encodes naringenin-chalcone synthase (CHS), chalcone isomerase (CHI), naringenin 3-dioxygenase (F3H), flavonoid 3′-monooxygenase (F3’H) and flavonoid 3′,5′-hydroxylase (F3’5’H), and the latter mainly encodes flavonol synthase (FLS), dihydrokaempferol 4-reductase (DFR), anthocyanidin synthase (ANS), anthocyanidin reductase (ANR) and leucoanthocyanidin reductase (LAR) [40, 41]. Finally, 41 DEGs were identified in flavonoid biosynthesis pathway, including 35 EBGs and six LBGs (Fig. 3a, Additional file 1: Table S4). A clustering heatmap of gene expression divided these DEGs into six clusters. Except ThOMT-2 and ThF3’H-2 of cluster3, all of these genes were up-regulated upon hypocotyl extension. The EBGs of flavonoid biosynthesis were distributed in all clusters, and the LBGs were significantly enriched in cluster6 (p-value = 0.035) (Fig. 3b). The cluster4 genes were upregulated in stage 3, and the cluster6 genes were upregulated in stages 4 and 6. Further validation of ThCHS2 (TRINITY_DN7978_c0_g1) of cluster4 and ThCHS3 (TRINITY_DN8_c1_g1) of cluster6 by real-time quantitative reverse transcriptional PCR (qRT-PCR) experiment indicated the differential regulation of CHS genes (Fig. 3c). Compared with those in seed germination, more genes were involved in the phenylpropanoid and flavonoid biosynthesis pathways of post-germination and further contributed to flavonoid accumulation. This high level of flavonoid would help young seedlings to survive from extreme drought desert environments.
Dynamic regulation of phytohormone-related DEGs during seed germination and post-germination
A total of 144 DEGs were significantly enriched in ‘plant hormone signal transduction’ (ko04075) in seed germination and post-germination, suggesting their important roles in the seed germination of T. hispida. Further classification of these hormone-related DEGs showed that 61 were involved in auxin (AUX) pathway, five in GA pathway, 45 in ABA pathway, eight in ethylene (ETH) pathway, eight in JA pathway and 17 in salicylic acid (SA) pathway (Additional file 1: Table S5). The 144 phytohormone-related DEGs were further divided into five expression clusters. Among which, cluster1 genes were upregulated in stages 1–3, cluster2 genes in stages 1–4, cluster3 genes only in stage 4, cluster4 genes in stages 4–6, and cluster5 genes in stages 5 and 6 (Fig. 4a).
The DEGs (14/61) of AUX pathway were significantly enriched in cluster3 (17 genes) (p-value = 0.0005) and upregulated in stage 4, indicating the important role of auxin signalling in hypocotyl elongation. The DEGs (24/45) of ABA pathway were significantly enriched in cluster1 (45 genes) (p-value = 0.00016) and consistently upregulated in stage 1–3, implying the regulatory function of ABA in seed germination such as water uptake. The JA (6/8) and SA (12/17) genes were enriched in cluster5 (54 genes) (JA: p-value = 0.032, SA: p-value = 0.0035) and upregulated in stages 5 and 6 of post-germination. For example, jasmonate ZIM domain-containing protein (JAZ), NONEXPRESSER OF PR GENES 1 (NPR1) and transcription factor TGA (TGA) were significantly up-regulated during cotyledon expansion and true-leaf unfolding. The expression patterns of the identified phytohormone-related DEGs were mapped in plant hormone signal transduction (Fig. 4b). Five genes [IAA (TRINITY_DN37691_c0_g1), auxin responsive GH3 gene family (TRINITY_DN38209_c0_g1), PP2C (TRINITY_DN4920_c0_g1), serine/threonine-protein kinase SnRK2 (TRINITY_DN1976_c0_g1) and ABF (TRINITY_DN7836_c2_g100)] were randomly selected for qRT-PCR confirmation to further validate their differential transcription (Fig. 4c). The results suggested that different plant hormones play varying roles in the seed germination and post-germination of T. hispida.
Co-expression network analysis with WGCNA
A global co-expression network with all DEGs (46,538 genes) was constructed by weighted gene co-expression network analysis (WGCNA) to identify additional potential genes involved in seed germination and post-gemination. Thirteen co-expressed modules were obtained on the basis of the network concept (Fig. 5a), and stage-associated modules were identified by correlation analysis between module eigengenes and each stage (Fig. 5b). Among the positive stage-correlated modules with Pearson correlation coefficient (PCC) ≥ 0.60, three modules of ‘midnightblue’, ‘darkorange’ and ‘skyblue’ were correlated with stage 1; three modules of ‘white’, ‘cyan’ and ‘purple’ with stage 2; four modules of ‘darkturquoise’, ‘salmon’, ‘black’ and ‘darkred’ with stage 4; three modules of ‘green’, ‘lightcyan’ and ‘steelblue’ with stage 5; and one module of ‘green’ with stage 6. Except for an invalid ‘grey’, no module was correlated with stage 3 (Fig. 5b). GO and KEGG enrichment analyses of the gene sets merged from specific stage-associated modules were performed to further investigate the biological processes or pathways of the stage-associated modules. Different processes or pathways were enriched in germination and post-germination stages. In seed germination, ‘transposition, RNA-mediated’ GO term, and ‘apoptosis’ KEGG pathway were enriched in stage 2. Multiple GO processes, such as ‘translation’, ‘tricarboxylic acid cycle’ and ‘chloroplast organisation’ were enriched in stage 4. GO terms of ‘carbamoyl-phosphate synthase (glutamine-hydrolysing) activity’ and ‘protein autophosphorylation’ were the most enriched in stages 5 and 6, respectively. (Additional file 2: Fig. S5, Additional file 1: Table S6). Therefore, the co-expressed genes involved in multiple biological processes during stages 2 and 4 were essential for the seed germination and early seeding growth of T. hispida.
Genes from three modules (‘purple’, ‘black’ and ‘green’) were further identified for transcriptomic analysis and visualised to understand the possible regulatory mechanism in the key developmental stages of seed germination and post-germination. GO enrichment analysis of the genes of ‘purple’ module revealed multiple terms related to phytohormone response and abiotic stress response, such as ‘response to heat’, ‘response to auxin’ and ‘response to abscisic acid’ (Additional file 2: Fig. S6, Additional file 1: Table S6). The node gene with a high connectivity within a module is defined as hub gene with an important role in different modules. In the ‘purple’ module, various transcription factors (TFs) with high connectivity were involved in hormone response and abiotic stress response. For example, DEHYDRATION-RESPONSIVE ELEMENT BINDING PROTEIN 2A (DREB2A, ThAP2–7) was involved in seed germination and induced by drought and osmotic stress , and DIVARICATA2 (DIV2, ThMYB-138) was required for ABA signalling and response to salt stress in Arabidopsis  (Fig. 6a). The ‘black’ module had the highest correlation with hypocotyl elongation (stage 4). Function analysis of high connectivity TFs showed that the hub genes of ‘black’ module were connected with plant organ development and flavonoid biosynthesis. For example, TRANSPARENT TESTA 2 (TT2, ThMYB-52) positively regulated anthocyanin accumulation in hypocotyls  and YABBY1 (YAB1, ThC2C2–44) and YABBY5 (YAB5, ThC2C2–4) controlled leaf blade development [46, 47] (Fig. 6b, Additional file 1: Table S6). Meanwhile, ‘phenylpropanoid biosynthesis’, ‘plant hormone signal transduction’ and ‘flavonoid biosynthesis’ were significantly enriched in ‘green’ module (Additional file 2: Fig. S6, Additional file 1: Table S6). Several genes in ‘green’ module encoded by WRKY DNA-BINDING PROTEIN (WRKY) TFs with high connectivity, such as WRKY DNA-BINDING PROTEIN 42 (WRKY42, ThWRKY-93), WRKY DNA-BINDING PROTEIN 33 (WRKY33, ThWRKY-2 and ThWRKY-70) and WRKY DNA-binding protein 4 (WRKY4, ThWRKY-56), were related to response to abiotic stress; their homologs are also involved in response to abiotic stress in Arabidopsis [48,49,50] (Fig. 6c).
Combination of metabolic profiling and transcriptomic analysis
Many flavonoid-biosynthesis related genes were activated and enhanced in post-germination (Fig. 3). A widely targeted metabolomics analysis was performed on three stages, namely, slow water absorption (stage 3), hypocotyl elongation (stage 4) and cotyledon expansion (stage 5) to further understand the metabolic profiles in post-germination. The samples from three stages showed good triplications according to their principal component analysis (PCA) score plots, indicating that metabolite accumulation is stage-specific (Fig. 7a). A total of 1022 metabolites were identified in all samples. Within the threshold of variable importance in projection (VIP) > 1 and absolute value fold change > 2, the numbers of up-regulated and down-regulated metabolites in stage 3 versus stage 4 and stage 4 versus stage 5 were 205 and 149, 287 and 178, respectively. A total of 651 differentially expressed metabolites (DEMs) were identified and mainly comprised of flavonoids (133), phenolic acids (123), lipids (89), amino acids and derivatives (66), alkaloids (53), organic acids (45) and others (Fig. 7b, Additional file 1: Table S7). As the most abundant differentiated metabolite, the accumulation of flavonoid was consistent with the results of gene expression and functional enrichment in flavonoid biosynthesis during seed germination and post-germination.
A combined transcriptomic and metabolomic analysis was performed in stages 3, 4 and 5 to investigate the potential regulation of metabolites. The results showed significantly different accumulation for 43 flavonoids in stage 3 versus stage 4 and stage 4 versus stage 5, 44 flavonoids in stage 3 versus stage 4 and 46 flavonoids in stage 4 versus stage 5 (Fig. 7c). With these identified differential flavonoids, the metabolic intermediates and end products of flavonoid biosynthesis pathway were mapped to a known KEGG pathway. The overall trends of flavonoid biosynthesis suggested a decrease in metabolic intermediates (such as naringenin, dihydrokaempferol, apigenin and hesperetin) but an increase in end products (such as quercetin 3-sulfonate, rhoifolin and baimaside) in seed germination and post-germination (Fig. 7d). The accumulation of end flavonoids in post-germination revealed the consistent upregulation of CHS gene regulation starting at stage 4 (Fig. 3), further indicating that chalcone synthase is a key and limited enzyme for flavonoid biosynthesis in the seed germination of T. hispida. During seed germination (stages 3 and 4), the metabolites located downstream of the flavonoid synthesis pathway, specifically hesperetin7-O-glucoside, quercetin 3-O-[beta-D-xylosyl-(1- > 2)-beta-D-glucoside] and baimaside, were significantly increased. Meanwhile, the metabolites located upstream of the flavonoid synthesis pathway, such as naringenin, dihydrokaempferol and isoquercitrin, were significantly decreased (Fig. 7d, Additional file 2: Fig. S7). The metabolite flow from naringenin to quercetin 3-sulfonate also confirms the increase and decrease in the metabolites located upstream and downstream of the flavonoid biosynthesis, respectively, in young seedling growth (stages 4 and 5). These results suggest that the metabolites located downstream of the flavonoid synthesis pathway are beneficial to seed germination and young seedling growth.
T. hispida is a perennial shrub or small tree and a woody halophyte that serves as an excellent model for studies on resistance to abiotic stress. Seeds are at the crucial stage of biodiversity and agriculture for plant survival. If environmental conditions are only suitable for plant growth but not for seed germination and young seedling growth, then plants cannot survive in the community . Therefore, the molecular mechanism of seed germination and young seedling growth in T. hispida must be explored to understand its adaptative evolution in extremely arid deserts.
Shortening the breeding cycle can help to meet the demand for the seedling supply of T. hispida. Transcriptomic and metabolomic analyses were performed to detect the changes in RNA level and metabolome levels during seed germination and young seedling growth. Differential transcriptome expression analysis revealed that the DEGs during seed germination and young seedling growth were significantly enriched in phenylpropanoid biosynthesis, plant hormone signal transduction and flavonoid biosynthesis pathway. Meanwhile, four-true-leaf turning red and anthocyanin biosynthesis pathway were significantly enriched for T. hispida seedlings. Flavones are synthesised by the flavonoid pathway, the downstream of phenylpropanoid biosynthesis . These compounds are important in plant defence against biotic and abiotic stresses, such as oxidative damage  and UV stress . Therefore, the DEGs and metabolites of flavonoid biosynthesis play an important role in the seed germination and young seedling growth of T. hispida.
In this study, the flavonoid biosynthesis genes were divided into six expression patterns. CHS is the first committed enzyme in the conserved flavonoid synthesis pathway , and three ThCHSs were identified in T. hispida. Clustering heatmap showed that two CHSs (ThCHS1 and ThCHS2) were highly expressed in stages 5 and 6, respectively, and ThCHS3 showed significantly increased expression, albeit in various levels, in stages 4 and 6. The expression model of ThCHSs is consistent with flavonoid biosynthesis in T. hispida, indicating that flavonoids are produced in large quantities during seed germination and young seedling growth. Meanwhile, the early and late genes of flavonoid biosynthesis pathway showed different expression trends. ThANR-1, ThLAR-1, ThDFR-1 and ThDFR-2 are the downstream genes of flavonoid biosynthesis pathway and were highly expressed in stages 4 and 6. However, some of the early genes of flavonoid biosynthesis pathway such as ThCHS-1, ThCHS-2, ThCHI-2 and ThF3H-2 were not expressed in stage 4. Therefore, the expression of flavonoid biosynthesis DEGs undergoes complex regulation during seed germination and young seedling growth.
Seed germination and young seedling growth are important processes affecting crop production and are influenced by a range of factors, including plant hormones .. The most important plant hormones for seed germination are ABA and GA, which have inhibitory and stimulatory effects on seed germination, respectively. In this study, ABA and GA expression significantly varied in seed germination and young seedling growth. For example, ThABFs and ThPP2Cs were down-regulated in seed germination but up- regulated in young seedling growth. Meanwhile, ThPIF and ThGID2 were up- regulated in seed germination. These findings indicate that ABA and GA play important roles in seed germination and young seedling growth. Auxin by itself is not a necessary hormone for seed germination but can interact with other hormones to affect seed germination and young seedling growth. For example, the release of ARF from repression by miRNA also affects ABA sensitivity during young seedling growth . In addition, the DEGs of auxin pathway are vital for young seedling growth. As stress-response hormones, the upregulation of JA and SA pathways can be helpful to flavonoid accumulation during post-germination .
The ‘purple’ module was the largest module in WGCNA (19,095 genes, 41.03%) and was the most relevant to seed water uptake (stage 1: dry seed, stage 2: rapid water absorption, stage 3: slow water absorption). In this module, phytohormone and abiotic stress response related terms were significantly enriched, thereby supporting the importance of phytohormone in seed germination. The ‘green’ module (12,200 genes, 26.22%) is the second largest module, and its genes were positively relevant to post-germination (stage 5: cotyledon expansion, stage 6: four true leaf). Enrichment analysis showed that ‘phenylpropanoid biosynthesis’, ‘flavonoid biosynthesis’ and ‘plant hormone signal transduction’ were significantly enriched in this module, indicating that flavonoid is important to post-germination. As a transition node from seed germination to seedling development, hypocotyl elongation (stage 4) was significantly associated with four modules (‘darkred’, ‘black’, ‘salmon’ and ‘darkturquoise’). Among these four modules, ‘black’ (5221 genes, 11.22%) was the largest and the most relevant. ‘Phenylpropanoid biosynthesis’ and ‘flavonoid biosynthesis’ were also significantly enriched in this module, and its high connectivity node functional TFs were associated with plant organ development and flavonoid biosynthesis. These results further support the importance of flavonoids and phytohormones.
In this study, the molecular regulation of seed germination and post-germination in T. hispida was investigated using an integrated transcriptional and metabolomic method. GO and KEGG analysis showed that the pathways of plant hormone signal transduction, phenylpropanoid biosynthesis, and flavonoid biosynthesis were significantly enriched and thus have important roles in the two developmental periods. The gene families involved in the plant hormone pathway that was enriched in different expression clusters showed differential regulation in seed germination and post-germination, such as ABA for early germination stages, auxin for hypocotyl elongation, JA for cotyledon expansion and SA for four-true-leaf unfolding. Metabolomics showed that the final products of flavonoids such as 3 − O−[beta−D − xylosyl−(1− > 2) − beta−D − glucoside], quercetin, baimaside, rhoifolin, hesperetin− 7 − O − glucoside and quercetin− 3 − O − sulfonate accumulated in post-germination. RNA-seq and metabolomic analysis indicated the importance of flavonoid biosynthesis pathways and identified CHS as the key enzyme for the accumulation of final flavonoid products in post-germination. In addition, organic acids were involved in seed germination. All these results provide important insights into the cellular and metabolic changes underlying T. hispida seed germination and young seedling growth.
Materials and methods
Plant materials, experimental conditions and fresh weight measurements
The seeds of T. hispida were obtained from Alar City, Xinjiang Uygur Autonomous Region of China. The hairs of selected seeds were removed, and germination test was carried out in three replicates (100 seeds per replicate). The seeds were incubated in 3 mL of ultrapure water on two sheets of absorbent paper in a covered glass petri dish at 25 °C. At ambient temperature of 25 ± 1 °C, 100 germinated seeds (± 0.0001 g) were weighed every 15 min at the stage of rapid water absorption between 0 and 1 h. After sampling at a specific time, the excess water was immediately sucked up with absorbent paper, and the samples were quickly frozen in liquid nitrogen and stored at − 80 °C in a refrigerator for transcriptomic and metabolomic analyses. We declare that the research programme complies with relevant institutional, national and international guidelines and legislation, and we have permission to collect T. hispida seeds.
RNA isolation and cDNA library construction for RNA sequencing
Total RNA of samples was isolated and purified using Trizol reagent (Invitrogen, Carlsbad, CA, USA) following the manufacturer’s procedure. The RNA was purified and quantified using NanoDrop 2000 (NanoDrop, Wilmington, DE, USA), and its integrity was assessed by Agilent 2100 with RIN number > 7.0. Total RNA was used to construction the RNA-seq libraries: mRNA was enriched from the total RNA using oligo (dT) magnetic beads, and the final average insert size for the final cDNA library was 350 bp (± 50 bp). Paired-end sequencing was performed on an Illumina Hiseq X-Ten (LC Bio, China) following the vendor’s recommended protocol.
Cutadapt  was used to remove the reads containing adaptor contamination, low quality bases and undetermined bases. Sequence quality was then verified using FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/). After the adaptor and low-quality sequences were removed, the clean reads were assembled into expressed sequence tag clusters (contigs) and de novo assembled into transcripts by using Trinity  in the paired-end method. On the basis of the similarity and length of the sequence, the longest transcript was selected as a single gene for subsequent analysis.
Functional annotation of DEGs
The function of the unigenes was annotated by alignment with the NCBI non-redundant (NR) and SwissProt databases using Blastx (v2.10.1) with a threshold e-value of 10− 5. The proteins with the highest hits to the unigenes were used to assign functional annotations. With SwissProt annotation, GO classification was performed by mapping the relation between SwissProt and GO terms. The unigenes were mapped to the KEGG database to annotate their potential metabolic pathways [42, 60, 61] using eggNOG-mapper (v2.0.1–14) .
All clean reads were aligned onto pomegranate genome. The trimmed mean of M values (TMM) method was used to calculate the unigene expression abundance. DEGs were detected by DESeq2 (v1.22.2)  with the absolute value of log2(fold change) value > 1 and a false discovery rate (FDR) < 0.05 as selection criteria. GO enrichment and KEGG pathway enrichment analyses of DEGs were performed using R (v4.0.3) package clusterProfiler (v3.18.1) .
Weighted correlation network analysis and gene network visualisation
Co-expression networks were constructed using the WGCNA (v1.69) package in R (v3.6.1)  with the soft threshold = 12 and the minModuleSize = 400. Eigengene values were calculated for each module and used to test associations with each germination stage. Networks were visualised using Cytoscape v.3.8.2 .
The sample preparation, extract analysis, metabolite identification, and quantification were performed at Wuhan Metware Biotechnology Co., Ltd. (Wuhan, China) (http://www.metware.cn/) following their standard procedures [66, 67].
Biological samples were freeze-dried by vacuum freeze-dryer (Scientz-100F) and then crushed using a mixer mill (MM 400, Retsch) with a zirconia bead for 1.5 min at 30 Hz. Briefly, 100 mg of lyophilised powder was dissolved in 1.2 mL of 70% methanol solution and vortexed for 30 s for every 30 min for six times in total. The sample was placed in a refrigerator at 4 °C overnight. Following centrifugation at 12,000 rpm for 10 min, the extracts were filtrated (SCAA-104, 0.22 μm pore size; ANPEL, Shanghai, China, http://www.anpel.com.cn/) for UPLC-MS/MS analysis.
The sample extracts were analysed using an UPLC-ESI-MS/MS system (UPLC, SHIMADZU Nexera X2, https://www.shimadzu.com.cn/; MS, Applied Biosystems 4500 Q TRAP, https://www.thermofisher.cn/cn/zh/home/brands/applied-biosystems.html) under the following analytical conditions: UPLC column, Agilent SB-C18 (1.8 μm, 2.1 mm * 100 mm); and mobile phase, solvent A of pure water with 0.1% formic acid and solvent B of acetonitrile with 0.1% formic acid. Sample measurements were performed with a gradient program as follows: initial gradient of 95% A, 5% B; a linear gradient to 5% A within 9 min; 5% A, 95% B for 1 min; and adjusted 95% A, 5.0% B within 1.1 min and kept for 2.9 min. The flow velocity was set as 0.35 mL per minute, the column oven was set to 40 °C, and the injection volume was 4 μL. The effluent was alternatively connected to an ESI-triple quadrupole-linear ion trap (QTRAP)-MS.
LIT and triple quadrupole (QQQ) scans were acquired on a triple quadrupole-linear ion trap mass spectrometer (Q TRAP), AB4500 Q TRAP UPLC/MS/MS System, equipped with an ESI Turbo Ion-Spray interface, operating in positive and negative ion modes and controlled by Analyst 1.6.3 software (AB Sciex). The ESI source operation parameters were as follows: ion source, turbo spray; source temperature 550 °C; ion spray voltage (IS) 5500 V (positive ion mode)/− 4500 V (negative ion mode); ion source gas I (GSI), gas II (GSII) and curtain gas (CUR) were set at 50, 60 and 25.0 psi, respectively; and high collision-activated dissociation (CAD). Instrument tuning and mass calibration were performed with 10 and 100 μmol/L polypropylene glycol solutions in QQQ and LIT modes, respectively. QQQ scans were acquired as MRM experiments with collision gas (nitrogen) set to medium. DP and CE for individual MRM transitions were conducted with further DP and CE optimisation . A specific set of MRM transitions were monitored for each period according to the metabolites eluted within this period.
Significant differences in relative metabolite content were tested by orthogonal partial least squares discriminant analysis (OPLS-DA) with a threshold of variable important in projection (VIP) value > 1 and an absolute value of absolute value of fold change value > 2 using R software version 4.0.3 (http://www.r-project.org/).
Seven DEGs were selected for qRT-PCR analysis with actin (β-actin) as internal reference gene. Primers were designed through NCBI official website (https://www.ncbi.nlm.nih.gov/) and are listed in Additional file 1: Table S8.
The RNA extracted from T. hispida was used to synthesise first-strand cDNA with SweScript RT I First Strand cDNA Synthesis Kit following the manufacturer’s instructions. qRT-PCR was performed with a 2*Universal Blue SYBR Green qPCR Master Mix kit in accordance with the manufacturer’s instructions. The experimental conditions were set as follows: 40 cycles at 95 °C for 30 s (predegeneration), 95 °C for 15 s, 60 °C for 10 s and 72 °C for 30 s. The mRNA expression level of the genes was calculated with the 2−ΔΔCt method. Each plant sample was analysed three times (each replicate contained three technical replicates).
Availability of data and materials
The raw sequence data from this study have been deposited in the publicly accessible National Genomics Data Center (NGDC, https://ngdc.cncb.ac.cn/) database as accession number PRJCA006871. The datasets supporting the conclusions of this article are included within the article and its additional files. The datasets used and/or analyzed during the current study are available from the authors on reasonable request (Hong Liu, email@example.com).
Kyoto Encyclopedia of Genes and Genomes
Swiss-prot protein database
Differentially expressed gene
Differentially expressed metabolite
Early biosynthetic gene
Late biosynthetic gene
Weighted gene co-expression network analysis
Pearson correlation coefficient
Variable importance in projection
Principal component analysis
Real-time quantitative reverse transcriptional PCR
Bentsink L, Koornneef M. Seed dormancy and germination Arabidopsis Book, vol. 6; 2008. p. e0119.
Song K, Choi G. Phytochrome regulation of seed germination. Methods Mol Biol. 2019;2026:149–56.
Shinomura T. Phytochrome regulation of seed germination. J Plant Res. 1997;110(1):151–61.
Appenroth KJ, et al. Tomato seed germination: regulation of different response modes by phytochrome B2 and phytochrome a. Plant Cell Environ. 2006;29(4):701–9.
Belmehdi O, et al. Effect of light, temperature, salt stress and pH on seed germination of medicinal plant Origanum elongatum (bonnet) Emb & Maire. Biocatalysis Agric Biotechnol. 2018;16:126–31.
Luan Z, et al. Effects of salinity, temperature, and polyethylene glycol on the seed germination of sunflower (Helianthus annuus L.). ScientificWorldJournal. 2014;2014:170418.
Meng L-B, et al. A systematic proteomic analysis of NaCl-stressed germinating maize seeds. Mol Biol Rep. 2014;41(5):3431–43.
Wijewardana C, et al. Drought stress has transgenerational effects on soybean seed germination and seedling vigor. PLoS One. 2019;14(9):e0214977.
Shu K, et al. Two faces of one seed: hormonal regulation of dormancy and germination. Mol Plant. 2016;9(1):34–45.
Ishibashi Y, et al. Regulation of soybean seed germination through ethylene production in response to reactive oxygen species. Ann Bot. 2013;111(1):95–102.
Kim SY, Warpeha KM, Huber SC. The brassinosteroid receptor kinase, BRI1, plays a role in seed germination and the release of dormancy by cold stratification. J Plant Physiol. 2019;241:153031.
Bewley JD. Seed germination and dormancy. Plant Cell. 1997;9(7):1055–66.
Bewley JD, Black M. In: Bewley JD, Black M, editors. Dormancy and the control of germination, in seeds: physiology of development and germination. Boston, MA: Springer US; 1994. p. 199–271.
Finch-Savage B. Seeds: physiology of development, germination and dormancy (3rd edition) - J.D. Bewley, K.J. Bradford, H.W.M. Hilhorst H. Nonogaki. 392 pp. springer, New York – Heidelberg – Dordrecht – London2013978-1-4614-4692-7. Seed Sci Res. 2013;23(4):289–9.
Miransari M, Smith DL. Plant hormones and seed germination. Environ Exp Bot. 2014;99:110–21.
Graeber K, et al. Cross-species approaches to seed dormancy and germination: conservation and biodiversity of ABA-regulated mechanisms and the Brassicaceae DOG1 genes. Plant Mol Biol. 2010;73(1):67–87.
Penfield S, et al. Arabidopsis ABA INSENSITIVE4 regulates lipid mobilization in the embryo and reveals repression of seed germination by the endosperm. Plant Cell. 2006;18(8):1887–99.
Linkies A, Leubner-Metzger G. Beyond gibberellins and abscisic acid: how ethylene and jasmonates control seed germination. Plant Cell Rep. 2012;31(2):253–70.
Han C, Yang P. Studies on the molecular mechanisms of seed germination. Proteomics. 2015;15(10):1671–9.
Kim J, Woo HR, Nam HG. Toward systems understanding of leaf senescence: an integrated multi-Omics perspective on leaf senescence research. Mol Plant. 2016;9(6):813–25.
Zhang T, et al. Sequencing of allotetraploid cotton (Gossypium hirsutum L. acc. TM-1) provides a resource for fiber improvement. Nat Biotechnol. 2015;33(5):531–7.
Zhang T, et al. Comprehensive analysis of MYB gene family and their expressions under abiotic stresses and hormone treatments in Tamarix hispida. Front Plant Sci. 2018;9:1303.
Liu Z, et al. Overexpression of ThSAP30BP from Tamarix hispida improves salt tolerance. Plant Physiol Biochem. 2020;146:124–32.
Wang Y, et al. A 2-Cys peroxiredoxin gene from Tamarix hispida improved salt stress tolerance in plants. BMC Plant Biol. 2020;20(1):360.
He Z, et al. The NAC protein from Tamarix hispida, ThNAC7, confers salt and osmotic stress tolerance by increasing reactive oxygen species scavenging capability. Plants. 2019;8(7):221.
Ji X, et al. The bZIP protein from Tamarix hispida, ThbZIP1, is ACGT elements binding factor that enhances abiotic stress signaling in transgenic Arabidopsis. BMC Plant Biol. 2013;13:151.
Teshome A, et al. Transcriptome sequencing of Festulolium accessions under salt stress. BMC Res Notes. 2019;12(1):311.
Dekkers BJ, et al. Transcriptional dynamics of two seed compartments with opposing roles in Arabidopsis seed germination. Plant Physiol. 2013;163(1):205–15.
Fait A, et al. Arabidopsis seed development and germination is associated with temporally distinct metabolic switches. Plant Physiol. 2006;142(3):839–54.
Gallardo K, et al. Proteomic analysis of Arabidopsis seed germination and priming. Plant Physiol. 2001;126(2):835–48.
Howell KA, et al. Mapping metabolic and transcript temporal switches during germination in rice highlights specific transcription factors and the role of RNA instability in the germination process. Plant Physiol. 2009;149(2):961–80.
Cho K, et al. Comparative analysis of seed transcriptomes of ambient ozone-fumigated 2 different rice cultivars. Plant Signal Behav. 2013;8(11):e26300.
Sano N, et al. Proteomic analysis of embryonic proteins synthesized from long-lived mRNAs during germination of rice seeds. Plant Cell Physiol. 2012;53(4):687–98.
Dierking EC, Bilyeu KD. Raffinose and stachyose metabolism are not required for efficient soybean seed germination. J Plant Physiol. 2009;166(12):1329–35.
Feng Z, et al. Applications of metabolomics in the research of soybean plant under abiotic stress. Food Chem. 2020;310:125914.
Zhou P, et al. Multi-omics analysis of the bioactive constituents biosynthesis of glandular trichome in Perilla frutescens. BMC Plant Biol. 2021;21(1):277.
Wang R, et al. Integrated metabolomics and Transcriptome analysis of flavonoid biosynthesis in safflower (Carthamus tinctorius L.) with different colors. Front Plant Sci. 2021;12:712038.
Cai S, et al. CannabisGDB: a comprehensive genomic database for Cannabis sativa L. Plant Biotechnol J. 2021;19(5):857–9.
Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15(12):550.
Pelletier MK, Burbulis IE, Winkel-Shirley B. Disruption of specific flavonoid genes enhances the accumulation of flavonoid enzymes and end-products in Arabidopsis seedlings. Plant Mol Biol. 1999;40(1):45–54.
Xu W, Dubos C, Lepiniec L. Transcriptional control of flavonoid biosynthesis by MYB–bHLH–WDR complexes. Trends Plant Sci. 2015;20(3):176–85.
Kanehisa M, Goto S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28(1):27–30.
Wang R, et al. Integrative analyses of metabolome and genome-wide transcriptome reveal the regulatory network governing flavor formation in kiwifruit (Actinidia chinensis). New Phytol. 2021;233(1):373–89.
Fang Q, et al. AtDIV2, an R-R-type MYB transcription factor of Arabidopsis, negatively regulates salt stress by modulating ABA signaling. Plant Cell Rep. 2018;37(11):1499–511.
Liu Y, et al. Tc-MYBPA an Arabidopsis TT2-like transcription factor and functions in the regulation of proanthocyanidin synthesis in Theobroma cacao. BMC Plant Biol. 2015;15(1):160.
Lin X, et al. LFR physically and genetically interacts with SWI/SNF component SWI3B to regulate leaf blade development in Arabidopsis. Front Plant Sci. 2021;12:717649.
Husbands AY, et al. The ASYMMETRIC LEAVES complex employs multiple modes of regulation to affect Adaxial-Abaxial patterning and leaf complexity. Plant Cell. 2015;27(12):3321–35.
Moison M, et al. The lncRNA APOLO interacts with the transcription factor WRKY42 to trigger root hair cell expansion in response to cold. Mol Plant. 2021;14(6):937–48.
Liu B, et al. The ubiquitin E3 ligase SR1 modulates the submergence response by degrading phosphorylated WRKY33 in Arabidopsis. Plant Cell. 2021;33(5):1771–89.
Li P, Li X, Jiang M. CRISPR/Cas9-mediated mutagenesis of WRKY3 and WRKY4 function decreases salt and me-JA stress tolerance in Arabidopsis thaliana. Mol Biol Rep. 2021;48(8):5821–32.
Jiménez-Alfaro B, et al. Seed germination traits can contribute better to plant community ecology. J Veg Sci. 2016;27(3):637–45.
Falcone Ferreyra ML, Rius SP, Casati P. Flavonoids: biosynthesis, biological functions, and biotechnological applications. Front Plant Sci. 2012;3:222.
Yin R, Ulm R. How plants cope with UV-B: from perception to response. Curr Opin Plant Biol. 2017;37:42–8.
Wang F, et al. Transcriptome analysis reveals complex molecular mechanisms underlying UV tolerance of wheat (Triticum aestivum, L.). J Agric Food Chem. 2019;67(2):563–77.
Ferrer JL, et al. Structure and function of enzymes involved in the biosynthesis of phenylpropanoids. Plant Physiol Biochem. 2008;46(3):356–70.
Liu P-P, et al. Repression of AUXIN RESPONSE FACTOR10 by microRNA160 is critical for seed germination and post-germination stages. Plant J. 2007;52(1):133–46.
Chen J, et al. Integrated metabolomics and transcriptome analysis on flavonoid biosynthesis in safflower (Carthamus tinctorius L.) under MeJA treatment. BMC Plant Biol. 2020;20(1):353.
Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 2011;17:10–2.
Haas BJ, et al. De novo transcript sequence reconstruction from RNA-seq using the trinity platform for reference generation and analysis. Nat Protoc. 2013;8(8):1494–512.
Kanehisa M. Toward understanding the origin and evolution of cellular organisms. Protein Sci. 2019;28(11):1947–51.
Kanehisa M, et al. KEGG: integrating viruses and cellular organisms. Nucleic Acids Res. 2021;49(D1):D545–d551.
Huerta-Cepas J, et al. Fast genome-wide functional annotation through Orthology assignment by eggNOG-mapper. Mol Biol Evol. 2017;34(8):2115–22.
Yu G, et al. clusterProfiler: an R package for comparing biological themes among gene clusters. Omics. 2012;16(5):284–7.
Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9:559.
Shannon P, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–504.
Chen L, et al. Combined De novo Transcriptome and Metabolome analysis of common bean response to Fusarium oxysporum f. sp. phaseoli infection. Int J Mol Sci. 2019;20(24):6278.
Chen W, et al. A novel integrated method for large-scale detection, identification, and quantification of widely targeted metabolites: application in the study of rice metabolomics. Mol Plant. 2013;6(6):1769–80.
We thank all the authors for their contributions to this study.
This work was supported by National Natural Science Foundation of China (No.31460103) and the Construction Plan of Hubei Province Science and Technology basic conditions platform (No.2021DFE021).
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file 1 Table S1.
Changes in the fresh weight of Tamarix hispida seeds during germination. Table S2. Expression of differentially expressed genes and the detailed information of gene annotation in the seeds of Tamarix hispida seeds during germination. Table S3. Results of KEGG enrichment of differentially expressed genes in the seeds of Tamarix hispida during germination. Table S4. KEGG enrichment of differentially expressed genes in flavonoid biosynthesis (Ko00941) during seed germination and post-germination. Table S5. KEGG enrichment of differentially expressed genes in plant hormone signal transduction (ko04075) during seed germination and post-germination. Table S6. Results of GO and KEGG enrichment of differentially expressed genes in modules. Table S7. Differentially accumulated metabolites in stages 3, 4 and 5. Table S8. List of primers used for the relative quantification of gene transcripts.
Additional file 2 Fig. S1.
Gene Ontology (GO) classification of DEGs. Fig. S2. Enriched KEGG pathways of differential expressed genes (DEGs) during the four adjacent stages of Tamarix hispida seed germination. Fig. S3. Enriched Gene Ontology (GO) terms of differential expressed genes (DEGs) during the six adjacent stages of Tamarix hispida seed germination and post- germination processes. Fig. S4. Analysis of DEGs in phenylpropanoid biosynthesis. Fig. S5. Enriched KEGG pathways and Gene Ontology (GO) terms of gene sets merging from specific stage-associated modules. Fig. S6. Enriched KEGG pathways and Gene Ontology (GO) terms of DEGs in purple, black and green modules. Fig. S7. Heatmap showing the contents of corresponding flavonoid expression at different stages.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Pang, X., Suo, J., Liu, S. et al. Combined transcriptomic and metabolomic analysis reveals the potential mechanism of seed germination and young seedling growth in Tamarix hispida. BMC Genomics 23, 109 (2022). https://doi.org/10.1186/s12864-022-08341-x
- Tamarix hispida
- Seed germination