Genome-wide characterization and expression analysis of the HD-Zip gene family in response to drought and salinity stresses in sesame
BMC Genomics volume 20, Article number: 748 (2019)
The homeodomain-leucine zipper (HD-Zip) gene family is one of the plant-specific transcription factor families, involved in plant development, growth, and in the response to diverse stresses. However, comprehensive analysis of the HD-Zip genes, especially those involved in response to drought and salinity stresses is lacking in sesame (Sesamum indicum L.), an important oil crop in tropical and subtropical areas.
In this study, 45 HD-Zip genes were identified in sesame, and denominated as SiHDZ01-SiHDZ45. Members of SiHDZ family were classified into four groups (HD-Zip I-IV) based on the phylogenetic relationship of Arabidopsis HD-Zip proteins, which was further supported by the analysis of their conserved motifs and gene structures. Expression analyses of SiHDZ genes based on transcriptome data showed that the expression patterns of these genes were varied in different tissues. Additionally, we showed that at least 75% of the SiHDZ genes were differentially expressed in responses to drought and salinity treatments, and highlighted the important role of HD-Zip I and II genes in stress responses in sesame.
This study provides important information for functional characterization of stress-responsive HD-Zip genes and may contribute to the better understanding of the molecular basis of stress tolerance in sesame.
Transcription factors (TFs) play a vital role in regulatory networks that link between the developmental program and response of genes to endogenous and environmental signals. TFs directly bind to the promoters of target genes in a sequence-specific manner to activate or repress the expression of their target genes . Homeobox proteins are a large superfamily of transcription factors (TFs) found in invertebrates, vertebrates, fungi, and plants, which contain a highly conserved DNA-binding domain known as the homeodomain (HD) . The HD is 60 amino acids in length and adopts a structure of three α-helices connected by a loop and a turn [2,3,4]. The HD binds DNA as a monomer with high affinity through the interactions established by helix III (called the recognition helix) with the major groove of the target DNA while the disordered N-terminal arm, located upstream the first helix, interacts with the DNA minor groove . Homeodomain proteins have been classified into different families according to the distinguishing features, such as HD location, association with other domains, their size and gene structures. Based on thorough analyses of homeodomain proteins from flowering plants, moss, Selaginella, unicellular green algae, and red algae, Mukherjee et al.  classified the plant HD-containing proteins into 14 families, including HD-Zip (homeodomain associated to a leucine zipper), WOX (Wuschel related homeobox) and KNOX (Knotted related homeobox).
The plant-specific HD-Zip transcription factors contain a leucine zipper motif (LZ) that acts as a dimerization motif, located in the carboxyl terminal to the homeodomain [2, 6]. The HD-ZIP family further divided into four subfamilies, HD-Zip I-IV, according to the conserved HD-Zip domain, additional conserved domain, structure features and functions . Members of HD-Zip I and II family share the conserved HD and LZ domains, and bind similar pseudo-palindromic cis-elements, CAATNATTG, where N can be A/T or G/C for HD-Zip I or II homeodomains, respectively [7, 8]. Besides HD and LZ domains, HD-ZIP II proteins present a conserved CPSCE motif (named by five conserved amino acids: Cys, Pro, Ser, Cys and Glu) downstream of the LZ . Both the HD-Zip III and HD-Zip IV subfamily proteins are characterized by the START (steroidogenic acute regulatory protein–related lipid transfer) domain and START adjacent domain (SAD) [6, 10]. HD-Zip III proteins can be distinguished from HD-Zip IV proteins by the presence of C-terminal MEKHLA domain which is absent in the HD-ZIP IV proteins . HD-Zip III proteins bind the consensus sequence GTAAT [G/C] ATTAC, while members of HD-Zip IV family recognized the sequence TAAATG [C/T] A [12, 13].
There are 48 and 49 HD-ZIP genes in Arabidopsis thaliana and rice (Oryza sativa L.), respectively [6, 14, 15]. Through genome-wide analysis, members of HD-Zip gene family have been also identified in many other plant species, including grape (Vitis vinifera) , maize (Zea mays) , soybean (Glycine max) [18, 19], cassava (Manihot esculenta) , wheat (Triticum aestivum) , tea plant (Camellia sinensis) , and potato (Solanum tuberosum) . HD-Zip proteins are known to participate in transcriptional regulation of various biological processes, and members of the different subfamilies have specific roles [2, 24, 25]. HD-Zip I proteins were found to be implicated in the regulation of abiotic stress responses, light and hormone (ABA, auxin and ethylene) signal transduction, and plant growth and development [2, 24, 26]. AtHB7 (Arabidopsis thaliana HOMEOBOX 7) and AtHB12 (Arabidopsis thaliana HOMEOBOX 12) from Arabidopsis HD-Zip I group participate in ABA sensing and transduction, playing a key role in drought and salt responses [27, 28]. MtHB1 (MEDICAGO TRUNCATULA HOMEOBOX 1), a HD-Zip I protein from M. truncatula, regulates root architecture under adverse environmental stresses by repressing LBD1 (LOB-BINDING DOMAIN 1) involving crosstalk between auxin and ABA signaling pathways . In the case of HD-Zip II proteins, they are mainly involved in development, shade avoidance and abiotic stress responses [30,31,32]. For example, members of the HD-Zip II family, including AtHB2 (Arabidopsis thaliana HOMEOBOX 2), AtHB4 (Arabidopsis thaliana HOMEOBOX 4), and HAT3 (HOMEOBOX FROM Arabidopsis thaliana 3), play crucial roles in regulation of leaf polarity and shade avoidance response [31, 33]. Two other HD-Zip II proteins, AtHB17 (Arabidopsis thaliana HOMEOBOX 17) and ABIG1 (ABA INSENSITIVE GROWTH 1), are involved in ABA-mediated stress response or growth inhibition [34, 35]. HD-Zip III proteins were reported to be involved in apical meristem formation, vascular development, organ polarity establishment, as well as auxin biosynthesis, transport and response [31, 36]. Rice LF1 (LATERAL FLORET 1) gene, encoding a class III HD-ZIP protein, induced the three-florets spikelet by directly regulating the expression of meristem maintenance gene OSH1 (ORYZA sativa HOMEOBOX 1) . HD-Zip IV proteins play critical role in the specification of the protoderm, anthocyanin accumulation, and environmental responses . For example, GhHOX3 (GOSSYPIUM HIRSUTUM HOMEOBOX 3) in this subfamily plays a central role in controlling cotton fibre elongation .
Sesame is an ancient and important oil crop, which is grown mainly in tropical and subtropical areas of the world. Sesame has been widely used in baked and confectionery products and edible oil due to its highly stable oil and high quantities of nutritious amino acids, minerals, vitamins, and lignans . However, sesame production and quality is threatened by drought, salinity and other environmental stresses [41,42,43]. A series of TFs, such as ERF, WRKY, MYB, NAC and bZIP, have been genome-wide analyzed in sesame, and some stress-responsive TFs have been identified [44,45,46,47,48]. However, response to abiotic stress of HD-Zip genes was unclear in sesame. In this study, we systematically characterized the HD-Zip gene family in sesame, and analyzed their phylogenetic relationships, conserved motifs and gene structure, as well as expression patterns in different tissues and in response to abiotic stresses. Our results provide a perspective for further investigation of the functions of stress-responsive HD-ZIPs in sesame.
Genome-wide identification of HD-zip family genes in sesame
For the genome-wide identification of HD-Zip (homeodomain-leucine zipper) family genes in sesame, the Hidden Markov Model (HMM) profile of the homeodomain (HD) (PF00046) and the leucine zipper (LZ) domain (PF02183) were employed as queries to search against the Sinbase database (http://ocri-genomics.org/Sinbase) using the program HMM3.0. In addition, the known HD-Zip protein sequences from Arabidopsis were obtained from the TAIR database based on a previous study , and these sequences were also used as queries for searches in the Sinbase database. After removing redundant sequences, the SMART database was used to examine the presence of the HD and LZ domains for each identified candidate. As a result, a total of 45 HD-Zip genes were identified in sesame, and they were designated as SiHDZ1-SiHDZ45 according to their chromosomal locations on the sesame linkage groups (LGs). The identified sesame SiHDZ genes encoded proteins ranging from a minimum of 160 (SiHDZ37) to a maximum of 847 (SiHDZ22) amino acids in length. Detail information of SiHDZs such as gene locus ID, linkage group location, proteins length, and other corresponding information are shown in Additional file 2: Table S1.
Chromosomal localization and gene duplication analysis of SiHDZ genes
The chromosomal localization of SiHDZ genes was determined to visualize their genomic position information (Fig. 1). Of the 45 SiHDZ genes, 44 genes were distributed unequally on 12 out of the 16 LGs, with the LG08 having the majority of SiHDZ genes (7), whereas the LG07 had only one gene. In addition, one SiHDZ gene (SiHDZ45) was mapped to the unanchored scaffold, and is not shown in Fig. 1.
To study the potential gene duplication within the sesame genome, we examined the segmental and tandem duplication events in SiHDZ gene family during evolution. No tandem duplication events were found, while 23 gene pairs involving 36 SiHDZ genes resulted from segment duplication (Additional file 1: Figure S1).
Phylogenetic analysis amongst the Arabidopsis and sesame HDZs
To reveal the phylogenetic relationships among the SiHDZ proteins, an unrooted phylogenetic tree was created to assess the genetic relationships between Arabidopsis and sesame HDZs. As shown in Fig. 2, these proteins can be divided into four distinct groups (HD-Zip I-IV), which is similar to that described in previous studies [2, 16]. The numbers of HD-Zip I-IV members in sesame were 16, 10, 9, and 10, respectively (Additional file 2: Table S1; Fig. 2). The results provide an important basis for functional prediction of HD-Zip proteins in sesame.
Gene structure and conserved domain analysis of SiHDZ genes
To investigate the structural diversity of SiHDZ genes, Gene Structure Display Server (http://gsds.cbi.pku.edu.cn/index.php) was employed to analyze the exon-intron structures of SiHDZ genes by comparing their coding sequences and corresponding genomic sequences. The results showed that the number of introns of SiHDZ genes varied between 0 and 17, but the genes that clustered together had similar number of introns (Fig. 3a, b). For example, all of the SiHDZ genes in group III contained the largest number of introns (17), members of SiHDZ group IV had 8–10 introns, with the exception of SiHDZ17, which contained 7 introns (Fig. 3b). Compared to group III and IV, SiHDZ genes in group I and II had much fewer introns. SiHDZ genes in group II usually contain 2–3 introns, whereas SiHDZ genes in group I feature 1–2 introns, except for SiHDZ24 and SiHDZ37, whose number of introns were 3 and 0, respectively (Fig. 3b).
To further study the origin and evolutionary pattern of SiHDZs, their protein sequences were subjected to the MEME tool and a total of 20 conserved motifs were identified. The identified motifs ranged from 24 to 50 amino acids in length. The details of the sequence logo of each motif were presented in Additional file 1: Figure S2. Amongst them, motif 1 and 2, corresponding to the homeobox domain, and motif 5, corresponding to the LZ domain, were in common among all of the SiHDZs (Fig. 3c). In general, members in the same group harbored the similar motif organizations, while the motifs were divergent among different groups. For example, motifs 3, 4, 8, and 12, which correspond to START domain, were shared in group III and group IV, while motif 9 and 16, which correspond to MEKHLA domain, were present in group III but absent in group IV. In addition, motif 9 was exclusively present in group II, motif 13 and 19 only existed in group III, while motifs 10 and 17 was only present in the group IV HDZs (Fig. 3c). These group-specific motifs may imply diverse functions of the HDZ family in sesame.
Expression profiles of SiHDZ genes in different tissues
To study the potential functions of the SiHDZ genes, we analyzed the expression profiles of SiHDZ genes in different tissues, including root, stem, flower, leaf, capsule and seed, based on the transcriptome data from SesameFG database . As shown in Fig. 4, 25 SiHDZ genes were commonly detected (TPM value > 1) in all of the tissues, suggesting that these genes might play a universal role in the tested tissues. Among them, four SiHDZ genes (SiHDZ04, 27, 29, and 40) were highly expressed (TPM value > 10) in all six tissues, while SiHDZ03 and SiHDZ19 were most highly expressed (TPM value > 90) in root. In addition, some SiHDZ genes were shown to have tissue-specific expression (Fig. 4). For example, SiHDZ11, 17, 28 and 38 displayed relatively low expression levels in root and capsule, but high expression in stem, flower, leaf and seed. SiHDZ31 and SiHDZ43 exhibited low expression levels in leaf, but relatively high expression levels in other tissues. The transcription levels of SiHDZ01 and SiHDZ07 were relatively higher in root and capsule, but low in stem, flower and leaf. Additionally, SiHDZ21 showed specific expression in seed, whereas low in other tissues. These findings indicated the SiHDZ genes play differential roles in tissue development.
Expression profiles of SiHDZ genes under drought stress
To identify the expression profiles of SiHDZ genes under drought stress, the expression pattern of each SiHDZ gene was obtained from the transcriptome data sets previously developed by our group . It was observed that most of the SiHDZ genes were differentially affected under drought stress. For example, three SiHDZ genes, SiHDZ16, SiHDZ27, and SiHDZ31, were up-regulated under drought stress, with their expression markedly increasing at 3 d, and peaking at 11 d (Fig. 5). Some SiHDZ genes, such as SiHDZ24, SiHDZ03, and SiHDZ42, were also up-regulated under stress, but their expression levels were peaked at 3 d, implying their roles in early response to drought stress. However, some SiHDZ genes, such as SiHDZ30, SiHDZ14, SiHDZ22, and SiHDZ37, showed obvious down-regulation at all of the time points (Fig. 5). Moreover, some highly homologous SiHDZ genes exhibited similar expression profiles after drought stress, such as SiHDZ16 and SiHDZ31, SiHDZ19 and SiHDZ04, SiHDZ12 and SiHDZ02 (Fig. 5).
Expression analysis of SiHDZ genes in response to osmotic and salinity stresses
Numerous evidences showed that HD-Zip genes play a critical role in plant drought and salt tolerance [28, 51,52,53]. In order to identify candidate SiHDZ genes that function in drought and salt stress responses, 20 drought-responsive SiHDZs (based on the transcriptome data) from different subfamilies were selected to analyze their expression patterns under osmotic and salinity stresses by qRT-PCR. Overall, the expression levels of all the selected genes were significantly changed in response to osmotic and salinity stresses, but some differences were present among these genes. Under osmotic stress, the expression levels of SiHDZ03, 07, 10, 13, 16, 24, 27, 31, 34, and 43 were significantly up-regulated through all the time points, while SiHDZ22, 29, 41, and 42 were induced at certain time points, with the highest expression levels at 6 h, 6 h, 12 h, and 12 h, respectively (Fig. 6 and Additional file 1: Figure S3). However, other SiHDZ genes displayed an obvious decrease in expression under osmotic stress at certain time points. Under salinity stress, half of the selected SiHDZ genes (SiHDZ03, 07, 10, 13, 16, 24, 27, 34, 42, and 43) showed strong up-regulation in expression at all of the time points, while SiHDZ26, 28, 29, 33, 37, and 41 kept low transcription levels in some time points (Fig. 7). In addition, SiHDZ26, 29, 31, and 37 showed obviously increased expression at 2 h under salinity stress, while the transcripts were dramatically decreased at 6 h and 12 h (Fig. 7). Taken together, we found that the expression of several SiHDZ genes, such as SiHDZ07, 10, 27 and 43 was highly induced under both osmotic and salt treatments. These results suggested that these genes might play a vital role in response to multiple abiotic stresses in sesame.
HD-Zip genes encode a family of plant-specific transcription factors involved in various biological processes in plants. In this study, a total of 45 SiHDZ genes were identified from the sesame genome (Additional file 2: Table S1). The number of HDZ members in sesame was comparable to that of potato (43) , wheat (46) , foxtail millet (Setaria italica) (47) , rice (48) [14, 15], and Arabidopsis (48) , more than tea plant (33) , but less than maize (55)  and cassava (57) . These findings suggested that the number of HD-Zip genes is not correlated with genome size of these plant species, which may partly result from tandem duplication and segmental duplication. For instance, over 75% of Arabidopsis HD-Zip genes are evolved from segmental duplication and are not clustered on the chromosomes [17, 55]. In cassava, ten HD-Zip genes were identified as segmentally duplicated, while none of the HD-Zip genes were resulted from tandem duplication . In the current study, 36 SiHDZ genes were found to be involved in segmental duplication events, whereas no tandem duplicated SiHDZ genes were identified, indicating that the segmental duplication was largely responsible for the expansion of HD-Zip gene family in sesame, which is consistent with the results in Arabidopsis, cassava and other species [17, 18, 20].
The phylogenetic tree constructed using HDZ proteins from sesame and Arabidopsis revealed that the sesame HDZ proteins can be classified into four groups (HD-Zip I-IV) (Fig. 2). In addition, the number of HD-Zip I, II, III, and IV was 16, 10, 9, and 10, respectively (Additional file 2: Table S1; Fig. 2), compared to 17, 10, 5, and 16 in Arabidopsis , 11, 7, 5, and 8 in grape , 17, 18, 5, and 15 in maize , and 20, 17, 4, and 5 in wheat . These results revealed that HD-Zip I was the most abundant group, but the number of HD-Zip III proteins in sesame was much higher than those in other plant species, accounting for 20%. Noteworthy, 7 out of 9 SiHDZ genes in HD-Zip III subfamily were found to be involved in segmental duplication events (Fig. 2; Additional file 1: Figure S1), suggesting that the HD-Zip III SiHDZ genes also mainly expanded during evolution and may play important roles in sesame. Besides, the number of introns in SiHDZ family genes varied widely (0–17), but members of the same group have similar exon-intron arrangements (Fig. 3a, b). For example, most SiHDZ genes in group III and IV harbored 17 and 8–10 introns, respectively, while members in group I and II had the least number of introns (1–3) (Fig. 3b), which is in accordance with the results in other plant species, such as maize , cassava , and potato . The differences in the gene structures of different groups suggested group-specific gain or loss of introns that may affect the functional divergence of the different SiHDZ group members in sesame. The motif analysis also showed that the distribution of protein motifs in the different groups was noticeably diverse, but the same group had a similar motif construction (Fig. 3a, c), which support their strong evolutionary conservation. These findings suggested that the classification and evolution of the HD-Zip gene family is quite conserved in the sesame, as well as in other plant species.
Increasing evidences have demonstrated that HD-Zip genes participate in various aspects of growth and development in plants [2, 30, 31, 56]. For example, sunflower HaHB10 (Helianthus ANNUUS HOMEOBOX 10) mediates the transition from the vegetative to the flowering stage by activating particular flowering transition genes and plant response to salicylic acid . AtHB2 regulates red/far-red light effects on the shade avoidance response and specific cell proliferation including lateral root formation . SiHDZ19 is the homologous gene of AtHB2 (Fig. 2), and was highly expressed in the root (Fig. 4), suggesting that it may also be involved in root development in sesame. It’s worth noting that some SiHDZ genes, especially in HD-Zip I, II, and III, have a broad expression in all tissues (Fig. 4), implying that they participate in the overall development of the sesame. Similar results were also observed in several plant species, such as wheat , and potato . However, most HD-Zip IV SiHDZ genes displayed distinct tissue-specific expression patterns (Fig. 4), similar to the Group IV genes in cassava . For example, SiHDZ06, 11, 17, 26, 28, and 36 have a much lower expression in roots than in other tissues, while another HD-Zip IV gene, SiHDZ23, was specifically expressed in seed (Fig. 4). In cucumber, nearly all HD-Zip IV genes (10 out of 11) showed preferential expression in reproductive organs . Tomato HD-Zip IV genes also showed tissue specific expression patterns, with higher expression in young leaves and flowers . These findings revealed that the functions of different group SiHDZ genes were diverged during evolution processes.
Numerous reports have revealed that HD-Zip genes were involved in response to a variety of abiotic stresses in different plant species, such as foxtail millet , tea plant , wheat , and potato . In the present study, the expression levels of over 75% SiHDZ genes were found to be affected by drought or salinity stresses based on transcriptome data and qRT-PCR results. Moreover, SiHDZ03, 07, 10, 13, 16, 24, 27, 31, and 43, have similar expression patterns in response to drought and salinity. A large body of evidence indicates that HD-Zip I proteins are involved in developmental reprogramming in response to environmental stresses . For example, AtHB7 and AtHB12, which are strongly induced by water limiting and ABA, act as mediators that negatively feedback effect on ABA signaling in the plant response to water deficit [26, 27]. The ectopically expression and/or overexpression of the Arabidopsis AtHB12 or AtHB7 gene, or their homologs in M. truncatula (MtHB1), sunflower (HaHB4), and maize (ZmHDZ4), enhanced tolerance to drought or salinity stress in transgenic plants [29, 61,62,63]. Some of the HD-Zip I genes in sesame, such as SiHDZ07, 10, 16, 24, 27, 31, 37, and 43, were significantly induced in response to drought or salt stress. Notably, the expression of SiHDZ07, 16, and 31, homologs of Arabidopsis AtHB12 and AtHB7 genes, was significantly up-regulated by both drought and salinity stresses, indicating that these genes may regulate drought and salt tolerance through an ABA-dependent pathway. In addition, SiHDZ43, like its homologs AtHB21 and AtHB40, was up-regulated under drought and salinity stress. Three HD-Zip II members, SiHDZ03, 13, and 42, were also significantly up-regulated under drought and/or salt treatment, thereby suggesting their possible role in abiotic stress responses. These results suggest HD-Zip I and II genes may play a vital role in regulating plant tolerance to the adverse environmental conditions.
In this study, we identified 45 HD-Zip genes from sesame at the whole genome level. Phylogenetic analysis identified four subfamilies (HD-Zip I-IV) in the HD-Zip gene family, which was further supported by the analysis of their conserved motifs and gene structures. Transcriptomic analysis revealed some constitutively or tissue-specific expressed HD-Zip genes. Expression profiles of SiHDZ genes under various abiotic stress treatments indicated that over 75% SiHDZ genes are involved in abiotic stress signaling, and members of HD-Zip I and II subfamilies may play a vital role in regulating plant response to abiotic stresses. Together, these data provide useful information for functional characterization of SiHDZ genes and extend our knowledge of abiotic stress response in sesame.
Genome-wide identification of HD-zip family genes in sesame
To identify HD-Zip protein in sesame, all proteins sequences were downloaded from the Sesamum indicum genome database (Sinbase, http://ocri-genomics.org/Sinbase/index.html) . A local protein database was constructed and searched against known HD-Zip protein sequences collected from Arabidopsis using a local protein basic local alignment search (BLASTP) program with an E-value cut-off < 10–5 and an identity of 50% as the threshold. HMM profile of the homeodomain (PF00046) and the leucine zipper domain (PF02183) were download from the PFAM database and used for local Hidden Markov Model (HMM) search by HMMER3.0 . Subsequently, all obtained protein sequences were further examined by SMART (http://smart.embl-heidelberg.de/) to confirm the presence of the HD and LZ domains. Finally, a total of 45 non-redundant sesame HD-Zip encoding genes were identified.
Phylogenetic analysis and gene duplication
The protein sequences of HD-Zip from sesame and were Arabidopsis used to construct the phylogenetic tree by MEGA 5.2, using the neighbor-joining (NJ) method with 1000 bootstrap replications . Gene duplication was analyzed with MCScanX following the method described previously .
Gene structure and protein conserved motifs analysis
The exon/intron organization of the HD-Zip genes in sesame was performed with Gene Structure Display Server (GSDS) (http://gsds.cbi.pku.edu.cn/index.php). Conserved motifs present in SiHDZs were identified using MEME (Multiple Em for Motif Elicitation) v4.11.4 (http://meme-suite.org/tools/meme).
Expression profiling of SiHDZ genes using available transcriptome data
To gain insight into the tissue-specific gene expression patterns of HD-Zip gene, transcriptome data for six tissues (root, stem, flower, leaf, capsule and seed) were extracted from Sesame Functional Genomics Database (SesameFG, http://www.ncgr.ac.cn/SesameFG) . To analysis the expression profiles of SiHDZ genes in response to drought stress, the transcriptome data (accession number SAMN06130606) were used . The hierarchical cluster analyses of gene expression were performed using Cluster 3.0 software , and heatmaps were visualized with TreeView .
Plant materials and treatments
In this study, Two-week old seedlings of Sesame cultivar Zhongzhi No. 13 were used to examine the expression patterns of SiHDZ genes under osmotic and salt stresses. Plants were grown in a growth chamber at 28 ± 2 °C with a 16-h light/8-h dark photoperiod, and exposed to different stresses as described previously . The shoots tissues were collected at 0, 2, 6 and 12 h after treatment. Three biological replications were conducted per sample.
Quantitative real-time RT-PCR
Total RNA isolation and the first-strand cDNA synthetization were performed as described previously . The gene-specific primers used in this study are listed in Additional file 3: Table S2. Quantitative real-time RT-PCR (qRT-PCR) was performed according to the protocol described previously . Significant up- and down-regulated genes were determined as p < 0.05 or p < 0.01 using t test.
Availability of data and materials
Gene sequence information of HD-Zip in sesame is available at the Sesamum indicum genome database (Sinbase, http://ocri-genomics.org/Sinbase/index.html).
Arabidopsis thaliana homeobox
Hidden Markov model
Quantitative real-time reverse transcription-polymerase chain reaction
Sesamum indicum homeodomain-leucine zipper
Inukai S, Kock KH, Bulyk ML. Transcription factor-DNA binding: beyond binding site motifs. Curr Opin Genet Dev. 2017;43:110–9.
Ariel FD, Manavella PA, Dezar CA, Chan RL. The true story of the HD-zip family. Trends Plant Sci. 2007;12(9):419–26.
Kissinger CR, Liu BS, Martin-Blanco E, Kornberg TB, Pabo CO. Crystal structure of an engrailed homeodomain-DNA complex at 2.8 a resolution: a framework for understanding homeodomain-DNA interactions. Cell. 1990;63(3):579–90.
Ottinng G, Qian YQ, Billeter M, Muller M, Affolter M, Gehring WJ, Wuthrich K. Protein-DNA contacts in the structure of a homeodomain-DNA complex determined by nuclear magnetic resonance spectroscopy in solution. EMBO J. 1990;9(10):3085–92.
Gehring WJ, Qian YQ, Billeter M, Furukubo-Tokunaga K, Schier AF, Resendez-Perez D, Affolter M, Otting G, Wüthrich K. Homeodomain-DNA recognition. Cell. 1994;78(2):211–23.
Mukherjee K, Brocchieri L, Burglin TR. A comprehensive classification and evolutionary analysis of plant homeobox genes. Mol Biol Evol. 2009;26(12):2775–94.
Sessa G, Morelli G, Ruberti I. The Athb-1 and −2 HD-zip domains homodimerize forming complexes of different DNA binding specificities. EMBO J. 1993;12(9):3507–17.
Tron AE. Combinatorial interactions of two amino acids with a single base pair define target site specificity in plant dimeric homeodomain proteins. Nucleic Acids Res. 2001;29(23):4866–72.
Chan RL, Gago GM, Palena CM, Gonzalez DH. Homeoboxes in plant development. Biochim Biophys Acta. 1998;1442(1):1–19.
Schrick K, Nguyen D, Karlowski WM, Mayer KF. START lipid/sterol-binding domains are amplified in plants and are predominantly associated with homeodomain transcription factors. Genome Biol. 2004;5(6):R41.
Mukherjee K, Burglin TR. MEKHLA, a novel domain with similarity to PAS domains, is fused to plant homeodomain-leucine zipper III proteins. Plant Physiol. 2006;140(4):1142–50.
Sessa G, Steindler C, Morelli G, Ruberti I. The Arabidopsis Athb-8, −9 and −14 genes are members of a small gene family coding for highly related HD-ZIP proteins. Plant Mol Biol. 1998;38(4):609–22.
Abe M, Takahashi T, Komeda Y. Identification of a cis-regulatory element for L1 layer-specific gene expression, which is targeted by an L1-specific homeodomain protein. Plant J. 2001;26(5):487–94.
Jain M, Tyagi AK, Khurana JP. Genome-wide identification, classification, evolutionary expansion and expression analyses of homeobox genes in rice. FEBS J. 2008;275(11):2845–61.
Agalou A, Purwantomo S, Overnas E, Johannesson H, Zhu X, Estiati A, de Kam RJ, Engstrom P, Slamet-Loedin IH, Zhu Z, et al. A genome-wide survey of HD-zip genes in rice and analysis of drought-responsive family members. Plant Mol Biol. 2008;66(1–2):87–103.
Jiang H, Jin J, Liu H, Dong Q, Yan H, Gan D, Zhang W, Zhu S. Genome-wide analysis of HD-zip genes in grape (Vitis vinifera). Tree Genet Genomes. 2015;11(1):827.
Lespinet O, Zhao Y, Zhou Y, Jiang H, Li X, Gan D, Peng X, Zhu S, Cheng B. Systematic analysis of sequences and expression patterns of drought-responsive members of the HD-zip gene family in maize. PLoS One. 2011;6(12):e28488.
Ali J, Chen X, Chen Z, Zhao H, Zhao Y, Cheng B, Xiang Y. Genome-wide analysis of soybean HD-zip gene family and expression profiling under salinity and drought treatments. PLoS One. 2014;9(2):e87156.
Belamkar V, Weeks NT, Bharti AK, Farmer AD, Graham MA, Cannon SB. Comprehensive characterization and RNA-Seq profiling of the HD-zip transcription factor family in soybean (Glycine max) during dehydration and salt stress. BMC Genomics. 2014;15(1):950.
Ding Z, Fu L, Yan Y, Tie W, Xia Z, Wang W, Peng M, Hu W, Zhang J. Genome-wide characterization and expression profiling of HD-zip gene family related to abiotic stress in cassava. PLoS One. 2017;12(3):e0173043.
Yue H, Shu D, Wang M, Xing G, Zhan H, Du X, Song W, Nie X. Genome-wide identification and expression analysis of the HD-zip gene family in wheat (Triticum aestivum L.). Genes (Basel). 2018;9(2):70.
Shen W, Li H, Teng R, Wang Y, Wang W, Zhuang J. Genomic and transcriptomic analyses of HD-zip family transcription factors and their responses to abiotic stress in tea plant (Camellia sinensis). Genomics. 2018;111:1142–1151.
Li W, Dong J, Cao M, Gao X, Wang D, Liu B, Chen Q. Genome-wide identification and characterization of HD-ZIP genes in potato. Gene. 2019;697:103–17.
Sessa G, Carabelli M, Possenti M, Morelli G, Ruberti I. Multiple links between HD-Zip proteins and hormone networks. Int J Mol Sci. 2018;19(12):4047.
Roodbarkelari F, Groot EP. Regulatory function of homeodomain-leucine zipper (HD-ZIP) family proteins during embryogenesis. New Phytol. 2017;213(1):95–104.
Gong S, Ding Y, Hu S, Ding L, Chen Z, Zhu C. The role of HD-zip class I transcription factors in plant response to abiotic stresses. Physiol Plant. 2019. https://doi.org/10.1111/ppl.12965.
Valdes AE, Overnas E, Johansson H, Rada-Iglesias A, Engstrom P. The homeodomain-leucine zipper (HD-zip) class I transcription factors ATHB7 and ATHB12 modulate abscisic acid signalling by regulating protein phosphatase 2C and abscisic acid receptor gene activities. Plant Mol Biol. 2012;80(4–5):405–18.
Re DA, Capella M, Bonaventure G, Chan RL. Arabidopsis AtHB7 and AtHB12 evolved divergently to fine tune processes associated with growth and responses to water stress. BMC Plant Biol. 2014;14:150.
Ariel F, Diet A, Verdenaud M, Gruber V, Frugier F, Chan R, Crespi M. Environmental regulation of lateral root emergence in Medicago truncatula requires the HD-zip I transcription factor HB1. Plant Cell. 2010;22(7):2171–83.
Turchi L, Carabelli M, Ruzza V, Possenti M, Sassi M, Penalosa A, Sessa G, Salvi S, Forte V, Morelli G, et al. Arabidopsis HD-zip II transcription factors control apical embryo development and meristem function. Development. 2013;140(10):2118–29.
Turchi L, Baima S, Morelli G, Ruberti I. Interplay of HD-zip II and III transcription factors in auxin-regulated plant development. J Exp Bot. 2015;66(16):5043–53.
Ciarbelli AR, Ciolfi A, Salvucci S, Ruzza V, Possenti M, Carabelli M, Fruscalzo A, Sessa G, Morelli G, Ruberti I. The Arabidopsis homeodomain-leucine zipper II gene family: diversity and redundancy. Plant Mol Biol. 2008;68(4–5):465–78.
Carabelli M, Possenti M, Sessa G, Ruzza V, Morelli G, Ruberti I. Arabidopsis HD-zip II proteins regulate the exit from proliferation during leaf development in canopy shade. J Exp Bot. 2018;69(22):5419–31.
Park MY, Kim SA, Lee SJ, Kim SY. ATHB17 is a positive regulator of abscisic acid response during early seedling growth. Mol Cells. 2013;35(2):125–33.
Liu T, Longhurst AD, Talavera-Rauh F, Hokin SA, Barton MK. The Arabidopsis transcription factor ABIG1 relays ABA signaled growth inhibition and drought induced senescence. Elife. 2016;5:e13768.
Ramachandran P, Carlsbecker A, Etchells JP. Class III HD-ZIPs govern vascular cell fate: an HD view on patterning and differentiation. J Exp Bot. 2017;68(1):55–69.
Zhang T, Li Y, Ma L, Sang X, Ling Y, Wang Y, Yu P, Zhuang H, Huang J, Wang N, et al. LATERAL FLORET 1 induced the three-florets spikelet in rice. Proc Natl Acad Sci U S A. 2017;114(37):9984–9.
Zhang F, Zuo K, Zhang J, Liu X, Zhang L, Sun X, Tang K. An L1 box binding protein, GbML1, interacts with GbMYB25 to control cotton fibre development. J Exp Bot. 2010;61(13):3599–613.
Shan CM, Shangguan XX, Zhao B, Zhang XF, Chao LM, Yang CQ, Wang LJ, Zhu HY, Zeng YD, Guo WZ, et al. Control of cotton fibre elongation by a homeodomain transcription factor GhHOX3. Nat Commun. 2014;5:5519.
Makinde FM, Akinoso R. Comparison between the nutritional quality of flour obtained from raw, roasted and fermented sesame (Sesamum indicum L.) seed grown in Nigeria. Acta Sci Pol Technol Aliment. 2014;13(3):309–19.
Dossa K, Diouf D, Wang L, Wei X, Zhang Y, Niang M, Fonceka D, Yu J, Mmadi MA, Yehouessi LW, et al. The emerging oilseed crop Sesamum indicum enters the “omics” era. Front Plant Sci. 2017;8:1154.
Wei W, Li D, Wang L, Ding X, Zhang Y, Gao Y, Zhang X. Morpho-anatomical and physiological responses to waterlogging of sesame (Sesamum indicum L.). Plant Sci. 2013;208:102–11.
Hassanzadeh M, Asghari A, Jamaati-e-Somarin S, Saeidi M, Zabihi-e-Mahmoodabad R, Hokmalipour S. Effects of water deficit on drought tolerance indices of sesame (Sesamum indicum L.) genotypes in Moghan region. Res J Environ Sci. 2009;3:116–21.
Mmadi MA, Dossa K, Wang L, Zhou R, Wang Y, Cisse N, Sy MO, Zhang X. Functional characterization of the versatile MYB gene family uncovered their important roles in plant development and responses to drought and waterlogging in sesame. Genes (Basel). 2017;8(12):362.
Zhang Y, Li D, Wang Y, Zhou R, Wang L, Zhang Y, Yu J, Gong H, You J, Zhang X. Genome-wide identification and comprehensive analysis of the NAC transcription factor family in Sesamum indicum. PLoS One. 2018;13(6):e0199262.
Wang Y, Zhang Y, Zhou R, Dossa K, Yu J, Li D, Liu A, Mmadi MA, Zhang X, You J. Identification and characterization of the bZIP transcription factor family and its expression in response to abiotic stresses in sesame. PLoS One. 2018;13(7):e0200850.
Dossa K, Wei X, Li D, Fonceka D, Zhang Y, Wang L, Yu J, Boshou L, Diouf D, Cisse N, et al. Insight into the AP2/ERF transcription factor superfamily in sesame and expression profiling of DREB subfamily under drought stress. BMC Plant Biol. 2016;16(1):171.
Li D, Liu P, Yu J, Wang L, Dossa K, Zhang Y, Zhou R, Wei X, Zhang X. Genome-wide analysis of WRKY gene family in the sesame genome and identification of the WRKY genes involved in responses to abiotic stresses. BMC Plant Biol. 2017;17(1):152.
Wei X, Gong H, Yu J, Liu P, Wang L, Zhang Y, Zhang X. SesameFG: an integrated database for the functional genomics of sesame. Sci Rep. 2017;7(1):2342.
Dossa K, Li D, Wang L, Zheng X, Liu A, Yu J, Wei X, Zhou R, Fonceka D, Diouf D, et al. Transcriptomic, biochemical and physio-anatomical investigations shed more light on responses to drought stress in two contrasting sesame genotypes. Sci Rep. 2017;7(1):8755.
Bang SW, Lee DK, Jung H, Chung PJ, Kim YS, Choi YD, Suh JW, Kim JK. Overexpression of OsTF1L, a rice HD-zip transcription factor, promotes lignin biosynthesis and stomatal closure that improves drought tolerance. Plant Biotechnol J. 2019;17(1):118–31.
Tan W, Zhang D, Zhou H, Zheng T, Yin Y, Lin H. Transcription factor HAT1 is a substrate of SnRK2.3 kinase and negatively regulates ABA synthesis and signaling in Arabidopsis responding to drought. PLoS Genet. 2018;14(4):e1007336.
Yang Y, Luang S, Harris J, Riboni M, Li Y, Bazanova N, Hrmova M, Haefele S, Kovalchuk N, Lopato S. Overexpression of the class I homeodomain transcription factor TaHDZipI-5 increases drought and frost tolerance in transgenic wheat. Plant Biotechnol J. 2018;16(6):1227–40.
Chai W, Si W, Ji W, Qin Q, Zhao M, Jiang H. Genome-wide investigation and expression profiling of HD-zip transcription factors in foxtail millet (Setaria italica L.). Biomed Res Int. 2018;2018:8457614.
Henriksson E, Olsson AS, Johannesson H, Johansson H, Hanson J, Engstrom P, Soderman E. Homeodomain leucine zipper class I genes in Arabidopsis. Expression patterns and phylogenetic relationships. Plant Physiol. 2005;139(1):509–18.
Romani F, Reinheimer R, Florent SN, Bowman JL, Moreno JE. Evolutionary history of HOMEODOMAIN LEUCINE ZIPPER transcription factors during plant transition to land. New Phytol. 2018;219(1):408–21.
Dezar CA, Giacomelli JI, Manavella PA, Re DA, Alves-Ferreira M, Baldwin IT, Bonaventure G, Chan RL. HAHB10, a sunflower HD-zip II transcription factor, participates in the induction of flowering and in the control of phytohormone-mediated responses to biotic stress. J Exp Bot. 2011;62(3):1061–76.
Steindler C, Matteucci A, Sessa G, Weimar T, Ohgishi M, Aoyama T, Morelli G, Ruberti I. Shade avoidance responses are mediated by the ATHB-2 HD-zip protein, a negative regulator of gene expression. Development. 1999;126(19):4235–45.
Fu R, Liu W, Li Q, Li J, Wang L, Ren Z. Comprehensive analysis of the homeodomain-leucine zipper IV transcription factor family in Cucumis sativus. Genome. 2013;56(7):395–405.
Gao Y, Gao S, Xiong C, Yu G, Chang J, Ye Z, Yang C. Comprehensive analysis and expression profile of the homeodomain leucine zipper IV transcription factor family in tomato. Plant Physiol Biochem. 2015;96:141–53.
Mishra KB, Iannacone R, Petrozza A, Mishra A, Armentano N, La Vecchia G, Trtilek M, Cellini F, Nedbal L. Engineered drought tolerance in tomato plants is reflected in chlorophyll fluorescence emission. Plant Sci. 2012;182:79–86.
Romani F, Ribone PA, Capella M, Miguel VN, Chan RL. A matter of quantity: common features in the drought response of transgenic plants overexpressing HD-zip I transcription factors. Plant Sci. 2016;251:139–54.
Gonzalez FG, Capella M, Ribichich KF, Curin F, Giacomelli JI, Ayala F, Watson G, Otegui ME, Chan RL. Field-grown transgenic wheat expressing the sunflower gene HaHB4 significantly outyields the wild type. J Exp Bot. 2019;70(5):1669–81.
Wang L, Yu J, Li D, Zhang X. Sinbase: an integrated database to study genomics, genetics and comparative genomics in Sesamum indicum. Plant Cell Physiol. 2015;56(1):e2.
Wheeler TJ, Eddy SR. Nhmmer: DNA homology search with profile HMMs. Bioinformatics. 2013;29(19):2487–9.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011;28(10):2731–9.
Wang Y, Tang H, Debarry JD, Tan X, Li J, Wang X, Lee TH, Jin H, Marler B, Guo H, et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012;40(7):e49.
de Hoon MJ, Imoto S, Nolan J, Miyano S. Open source clustering software. Bioinformatics. 2004;20(9):1453–4.
Saldanha AJ. Java Treeview--extensible visualization of microarray data. Bioinformatics. 2004;20(17):3246–8.
You J, Wang Y, Zhang Y, Dossa K, Li D, Zhou R, Wang L, Zhang X. Genome-wide identification and expression analyses of genes involved in raffinose accumulation in sesame. Sci Rep. 2018;8(1):4331.
This research was funded by China Agriculture Research System (CARS-14), the Agricultural Science and Technology Innovation Project of Chinese Academy of Agricultural Sciences (CAAS-ASTIP-2016-OCRI), and Central Public-interest Scientific Institution Basal Research Fund (1610172018007). The funders had no role in the experimental design, data collection and analysis or writing the manuscript.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Fig. S1. Segmental duplicated SiHDZ genes on 16 linkage groups. Red lines indicate duplicated SiHDZ gene pairs. Grey lines indicate collinear blocks in whole sesame genome. Fig. S2. The logos of 20 conserved motifs in SiHDZ proteins. Fig. S3. Expression profiles of stress marker genes under osmotic and salinity stress treatments.
Table S1. HD-ZIP transcription factor gene family in sesame.
Table S2. List of primers used for quantitative real-time RT-PCR analysis.
About this article
Cite this article
Wei, M., Liu, A., Zhang, Y. et al. Genome-wide characterization and expression analysis of the HD-Zip gene family in response to drought and salinity stresses in sesame. BMC Genomics 20, 748 (2019). https://doi.org/10.1186/s12864-019-6091-5