Characterization of cotton ARF factors and the role of GhARF2b in fiber development
BMC Genomics volume 22, Article number: 202 (2021)
Cotton fiber is a model system for studying plant cell development. The functions of many transcription factors in cotton fiber development have been elucidated; however, the roles of auxin response factor (ARF) genes in this process need to be further explored.
Here, we identify auxin response factor (ARF) genes in three cotton species: the tetraploid upland cotton G. hirsutum, which has 73 ARF genes, and its putative extant parental diploids G. arboreum and G. raimondii, which have 36 and 35 ARFs, respectively. Ka and Ks analyses revealed that in G. hirsutum ARF genes have undergone asymmetric evolution in the two subgenomes. The cotton ARFs can be classified into four phylogenetic clades and are actively expressed in young tissues. We demonstrate that GhARF2b, a homolog of the Arabidopsis AtARF2, was preferentially expressed in developing ovules and fibers. Overexpression of GhARF2b driven by a fiber-specific promoter inhibited fiber cell elongation but promoted initiation; conversely, its downregulation by RNAi resulted in fewer but longer fibers. We show that GhARF2b directly interacts with GhHOX3 and represses the transcriptional activity of GhHOX3 on target genes.
Our results uncover an important role of the ARF factor in modulating cotton fiber development at the early stage.
Cotton is the most important natural and renewable material for the textile industry in the world. The primary cultivated species, upland cotton (G. hirsutum L.), is grown in over 80 countries and accounts for more than 90% of global cotton fiber output. Cotton fibers are unusually long, single-celled epidermal seed trichomes and a model for plant cell growth research. Fiber development can be divided into four overlapping stages: initiation, elongation, secondary cell wall biosynthesis and maturation. Fiber length and density are both key traits that determine cotton quality and yield.
The study of cotton fiber development regulation provides not only valuable knowledge for understanding plant cell growth and cell wall biosynthesis, but also candidate genes for cotton molecular breeding. To date, a number of genes that function in cotton fiber cells have been identified, including the homeodomain transcription factors GaHOX1, GhHOX3 and GhHD1 [5,6,7], the bHLH transcription factor GhPRE1, the KNOX transcription factor KNL1, a sterol carrier gene, the MYB transcription factors GhMYB25, GhMYB25-like, GhMML3 and GhMML4 [11,12,13,14], the NAC transcription factor FSN1, the transcription factor gene WLIM1a, a sucrose synthase gene, the cotton ACTIN1 gene, the cotton BURP domain protein GhRDL1, ethylene pathway-related genes, the fasciclin-like arabinogalactan protein GhFLA1, and the TCP transcription factor GhTCP4. Recent progress includes the characterization of transcription factors that regulate the major events of cotton fiber development, such as the MYBs and HD-ZIP IVs involved in fiber initiation and elongation, as well as a number of other types of factors. The MIXTA-type MYB transcription factors (GhMYB25, GhMYB25-like and GhMML4_D12) are master regulators of cotton fiber initiation [11, 13, 14] and lint fiber development, whereas the HD-ZIP IV transcription factor GhHOX3, whose activity is regulated by the phytohormone gibberellin, plays a pivotal role in controlling fiber elongation. In addition, the NAC (GhFSN1) and TCP4 transcription factors positively regulate secondary cell wall biosynthesis [15, 22]. However, cotton fiber growth and development are complex processes involving cell differentiation, cytoskeleton-oriented growth, cell wall synthesis, and so on. The current picture of the regulatory network of cotton fiber development is therefore far from complete.
Auxin response factors (ARFs), a group of plant transcription factors, are composed of a conserved N-terminal DNA-binding domain (DBD), a C-terminal dimerization domain (CTD) that is conserved in most cases, and a non-conserved middle region (MR). The MR has been proposed to function as a repression or an activation domain. Arabidopsis thaliana contains 23 ARF genes and Oryza sativa has 25 [26, 27]. ARF2 has been reported to negatively modulate plant growth in A. thaliana [26, 28,29,30] and tomato. However, the functions of transcription factors can vary among tissues and may be more diversified in polyploid species, and the role of ARF2 in cotton fiber cells has not yet been explored.
In this study, we conducted a genome-wide analysis of ARF genes in three cotton species (G. hirsutum, G. arboreum and G. raimondii) and classified them into four clades. In G. hirsutum, most ARF genes were expressed in multiple tissues. Among them, GhARF2b was preferentially expressed in developing fiber cells, where it negatively affects fiber elongation but promotes fiber initiation.
ARF transcription factors in G. arboreum and G. hirsutum
The genome sequences of G. raimondii and G. arboreum provide data resources for a genome-wide screen of ARF genes in the extant diploid progenitors of the allotetraploid G. hirsutum. In a previous study, Sun et al. (2015) identified 35 ARF genes in G. raimondii. To mine more ARF transcription factors in cotton, the conserved domain (Pfam ID: PF06507) was used as the query in a HMMER search against the G. arboreum and G. hirsutum genome databases, which identified 36 and 73 genes, respectively. The 36 G. arboreum ARF genes were designated GaARF1–GaARF20, and the 73 G. hirsutum ARF genes in the A- and D-subgenomes were designated GhARF1A/D–GhARF21A/D (Table 1). Like those of Arabidopsis, cotton ARF proteins are composed of three domain regions: the DBD (DNA-binding domain), MR (middle region) and CTD (C-terminal domain) (Additional file 1: Figure S1).
Phylogenetic analysis of Gossypium ARF proteins
To illustrate the evolutionary relationships among the cotton ARFs, a phylogenetic tree was constructed using the protein sequences of 144 cotton ARFs, which clustered into four clades (I–IV). Clades III and I contain the most Gossypium ARFs, followed by clades IV and II (Fig. 1).
Overall, the expected diploid-polyploid topology is reflected in the tree for each set of orthologous/homoeologous genes, indicating general preservation during the divergence of the diploids and through polyploid formation. The number of ARF genes in G. hirsutum is approximately twice that in G. raimondii or G. arboreum, with one At or Dt homoeologous copy corresponding to one ortholog in each of the diploid cottons. Further, as shown in Fig. 1, orthologous gene pairs from the A genome (G. arboreum) and the At subgenome, or from the D genome (G. raimondii) and the Dt subgenome, tend to cluster together and share a sister relationship.
Divergence of ARF genes in allotetraploid G. hirsutum and its diploid progenitors
The ARF genes in the two diploid species were then compared with their G. hirsutum At- and Dt-subgenome homoeologs (Table 1). To explore the evolutionary relationship and possible functional divergence of ARF genes between the allotetraploid cotton and its extant diploid progenitors, the nonsynonymous substitution (Ka) and synonymous substitution (Ks) values and the Ka/Ks ratio were calculated for each gene pair (Table 1). By comparing the Ka and Ks values of 66 orthologous gene sets between the allotetraploid and its diploid progenitor genomes, we found that the Ka and Ks values are higher in the Dt subgenome than in the At subgenome (Fig. 2). These results indicate that GhARF genes in the Dt subgenome tend to have experienced faster sequence divergence than their At counterparts, suggesting asymmetric evolution of ARF genes in the two subgenomes (Fig. 2).
In addition, the Ka/Ks ratios of one Dt-subgenome gene (GhARF3b_D) and five At-subgenome genes (GhARF2e_A, GhARF3c_A, GhARF4b_A, GhARF16b_A and GhARF17b_A) are greater than 1 (Table 1), suggesting that these genes have undergone positive selection after the divergence of G. hirsutum from its diploid ancestors and may have gained new functions.
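The interpretation of Ka/Ks ratios used above can be illustrated with a minimal Python sketch. It assumes that Ka and Ks have already been estimated for each gene pair; the function names and the numeric values are hypothetical and are not taken from Table 1.

```python
def ka_ks_ratio(ka, ks):
    """Return Ka/Ks, or None when Ks is zero and the ratio is undefined."""
    if ks == 0:
        return None
    return ka / ks


def classify_selection(ka, ks):
    """Label a gene pair by the conventional reading of its Ka/Ks ratio."""
    ratio = ka_ks_ratio(ka, ks)
    if ratio is None:
        return "undefined"
    if ratio > 1:
        return "positive"   # nonsynonymous changes are in excess
    if ratio < 1:
        return "purifying"  # nonsynonymous changes are depleted
    return "neutral"


# Hypothetical (Ka, Ks) values for illustration only.
example_pairs = {
    "pair_1": (0.020, 0.010),
    "pair_2": (0.005, 0.020),
    "pair_3": (0.010, 0.000),
}
labels = {name: classify_selection(ka, ks) for name, (ka, ks) in example_pairs.items()}
```

Pairs with Ks equal to zero are reported as undefined rather than forced into a category, because the ratio cannot be computed for them.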
Expression analysis of GhARF genes in different cotton tissues
The expression profile of a gene family can provide valuable clues to the possible functions of each gene. Analysis of the 73 GhARF genes showed that most have distinct spatial expression patterns. For instance, GhARF1, GhARF2a, GhARF2b and GhARF2c were expressed in all the cotton tissues examined (Additional file 2: Figure S2), whereas GhARF3a and GhARF3c were expressed preferentially in pistils and ovules. Compared with GhARF5b, GhARF5a showed higher expression in root, pistil and ovule. Transcripts of GhARF3c and GhARF4a were most abundant in stem, and those of GhARF9a and GhARF9b in root. Over half of the GhARF genes showed relatively high transcript accumulation in leaf. Notably, more than 10 genes (including GhARF1, GhARF2a, GhARF2b, GhARF8a, GhARF9a, GhARF10b, GhARF11, GhARF16a, GhARF18 and GhARF19) were highly expressed in cotton fiber cells at the fast elongation stage (5 dpa).
Among them, the GhARF2 genes showed the highest expression in fiber (5 dpa) and were located in clade I of the phylogenetic tree (Fig. 1), suggesting that they may function in cotton fiber development. Previous studies have demonstrated that ARF2 plays a role in transcriptional regulation in auxin-mediated cell division, leaf longevity, stress response, fruit ripening and so on. As ARF2 has shown pleiotropic effects on plant development, we decided to identify the major GhARF2 genes regulating cotton fiber elongation in subsequent experiments.
GhARF2 genes are highly expressed during the fiber elongation process
There are nine ARF2 genes in G. hirsutum (GhARF2c_At is not annotated). We first examined their expression profiles in different cotton tissues (Fig. 3). Based on the RNA-seq data (Zhang et al., 2015), GhARF2a, GhARF2b and GhARF2c had higher expression levels in various tissues than GhARF2d or GhARF2e (Fig. 3a). In 5-dpa fiber, the expression of GhARF2b was 1.1- to 37-fold that of the other four GhARF2 genes, and in 0-dpa ovule it was 1.2- to 15-fold higher than that of the others. Thus, transcripts of the GhARF2b homoeologs (GhARF2b_At and GhARF2b_Dt) were abundant and enriched in cotton fiber and ovule cells (Fig. 3a). Subsequent quantitative RT-PCR (qRT-PCR) confirmed this expression pattern: GhARF2b expression was 3.6- to 9-fold higher in fiber (3 dpa) or ovule (0 dpa) than in other tissues (Fig. 3b). The strongly up-regulated expression in fiber cells suggests that GhARF2b has been recruited to act primarily in cotton fiber.
GhARF2b overexpression represses cotton fiber elongation
To test the function of GhARF2b, we constructed vectors to overexpress and down-regulate GhARF2b_Dt in G. hirsutum using the fiber-specific GhRDL1 promoter [8, 19, 36]. qRT-PCR analysis showed that GhARF2b expression was clearly elevated in the overexpression lines; for example, GhARF2b transcript abundance was about two-fold higher in OE-3 than in wild-type cotton fiber cells (Fig. 4a). However, GhARF2b overexpression did not stimulate fiber cell elongation; rather, it resulted in shorter fibers (Fig. 4b, c).
Conversely, suppressing GhARF2b expression by RNAi resulted in longer fibers (Fig. 5a, b). In the RNAi lines, GhARF2b expression was down-regulated about 3- to 5-fold in cotton fiber at 0, 6 and 12 DPA (Fig. 5c-e). Together, these data suggest that GhARF2b acts as a negative regulator of fiber cell elongation, at least when its expression exceeds a threshold. Alternatively, it may function in other aspects of cotton fiber development.
GhARF2b interacted with GhHOX3
The homeodomain-leucine zipper (HD-ZIP) transcription factor GhHOX3 plays a determinant role in controlling cotton fiber elongation. We used the yeast two-hybrid (Y2H) system to screen a cotton fiber cDNA library for GhHOX3-interacting proteins. GhARF2 was among the top five factors interacting with the target protein. In further yeast two-hybrid assays, GhARF2b and its middle region strongly interacted with GhHOX3 (Fig. 6a, b). We also used bimolecular fluorescence complementation (BiFC) assays to confirm the interaction between GhARF2b and GhHOX3 (Fig. 6c).
Interaction with GhARF2b represses the transcriptional activation of GhHOX3 target genes
Given that GhARF2b represses cotton fiber elongation, we tested whether the interaction between the two proteins affects the transcriptional activation of GhHOX3 target genes. Two genes encoding cell wall proteins [19, 36], GhRDL1 and GhEXPA1, are direct targets of GhHOX3 in promoting fiber elongation. We used a dual-luciferase assay system to study the effect of GhARF2b on the activity of the GhHOX3 protein (Fig. 7a). The luciferase activity driven by the GhRDL1 and GhEXPA1 promoters was significantly increased when GhHOX3 was expressed (Fig. 7b, c). In contrast, activation of the GhRDL1 and GhEXPA1 promoters by GhHOX3 was significantly repressed by GhARF2b (Fig. 7b, c). These results further support the conclusion that the interaction of GhARF2b with GhHOX3 strongly reduces target gene activation and thereby disturbs cotton fiber elongation.
GhARF2b overexpression enhances cotton fiber initiation
Next, we examined the effects of GhARF2b up-regulation on cotton fiber initiation. The overexpression line OE-3 and the RNAi line ds-2 were selected for analysis. Ovules of WT-R15, OE-3 and ds-2 collected at −1, 0 and 1 DPA were observed by scanning electron microscopy (SEM) at 60× magnification (Fig. 8). Fiber initiation on −1-DPA ovules did not differ among the three types of cotton; however, the 0- and 1-DPA ovules of OE-3 and ds-2 showed higher and lower densities of fiber initials, respectively, than the wild-type control (Fig. 8). We further magnified the SEM views of ovules to 500–700× (Fig. 9). At the fiber initiation stage (0 and 1 DPA), the density of fiber initials in OE-3 was increased by about 1.5-fold compared with that of the wild type, whereas that of the ds-2 line was reduced (Fig. 9a-c). These results support a role for GhARF2b in promoting cotton fiber cell initiation.
Currently, more than 20 cotton genome sequences have been assembled and released, including those of the diploids G. raimondii [37, 38], G. herbaceum and G. arboreum [39,40,41] and the tetraploids G. hirsutum, G. barbadense, G. tomentosum, G. mustelinum and G. darwinii [41,42,43,44,45,46,47,48,49]. These genome sequences provide a platform for dissecting gene functions by forward and reverse genetics and should accelerate molecular breeding in cotton. Here, based on these high-quality genome sequences, we additionally characterized 36 ARF genes in G. arboreum and 73 in G. hirsutum, adding valuable data for understanding the distribution and evolution of ARF genes in cotton plants.
After whole-genome duplication, the duplicated genes generally undergo loss of function, neofunctionalization or subfunctionalization. In this study, we found that six GhARF genes (five from the At subgenome) have experienced positive selection since divergence from their diploid progenitors. Thus, duplicated genes from the At and Dt subgenomes might have functionally diverged in the allotetraploid cotton after the merger of the two genomes. In addition, GhARF expression profiles derived from the RNA-seq data showed subgenome-biased expression, suggesting that these genes might have undergone functional divergence during evolution. For instance, unequal expression was observed between the A- and D-subgenome copies of GhARF3c, GhARF16c, GhARF18a and GhARF20. Such alterations in gene expression can lead to distinct functions and may be one of the important features emerging from polyploidy [8, 51]. During the evolution of allopolyploids, some duplicate gene pairs (homoeologs) are expressed unequally, as has also been shown in the allopolyploid cotton genome, which features asymmetric evolution. These results indicate that this suite of unequally expressed genes may be a fundamental feature of allopolyploids.
ARF family genes have been identified in many plant species, including 23 in Arabidopsis thaliana, 25 in Oryza sativa, 39 in Populus trichocarpa, 31 in Zea mays, 15 in Cucumis sativus and 35 in G. raimondii. ARFs are important in plant development, as they play crucial roles in regulating a variety of signaling pathways [24, 25]. According to their functions, ARF proteins are divided into two classes: transcriptional activators and transcriptional repressors. Many studies have revealed their roles in regulating various cellular activities [35, 55,56,57]. As a transcriptional repressor, ARF2 is involved in the regulation of K+ uptake by repressing HAK5 transcription in Arabidopsis. In addition, ARF2 is regulated by a variety of upstream factors at the transcriptional and protein levels, and participates in the auxin, gibberellin, brassinosteroid, ethylene and abscisic acid pathways [28, 29, 31, 58].
In cotton, Zhang et al. showed that expression of the IAA biosynthetic gene iaaM significantly increased IAA levels in the epidermis of cotton ovules at the fiber initiation stage, and increased the number of lint fibers and the lint percentage in a 4-year field trial. The lint percentage of the transgenic plants was increased, with a 15% increase in lint yield. Han et al. found, using haplotype analysis and transcriptomic data, that the auxin response factor gene GhARF3 was highly correlated with fiber quality. Taken together, these findings indicate that auxin signaling plays an essential role in regulating fiber development. In addition, Xiao et al. showed that G. hirsutum ARF genes promoted trichome initiation in transgenic Arabidopsis plants. They identified 56 GhARF genes, including three GhARF2 genes, and showed that GhARF2–1 was expressed exclusively in trichomes and that its overexpression in Arabidopsis enhanced trichome initiation. However, they did not perform cotton transformation to test the function of GhARF2–1 in cotton fiber cells.
In our study, we report 73 GhARF genes in the Gossypium hirsutum genome, including 9 GhARF2 genes. Among them, GhARF2b was preferentially expressed in developing fibers. Overexpression of GhARF2b represses fiber elongation, whereas RNAi silencing of GhARF2b results in longer fibers. Yeast two-hybrid and dual-LUC assays indicate that GhARF2b plays a negative role in controlling cotton fiber elongation by interacting with GhHOX3. Further, GhARF2b was shown to promote the production of fiber initials, suggesting that auxin is an important player in controlling cotton fiber development. The auxin signaling pathways in developing cotton fiber cells deserve further investigation.
Identification of Gossypium species ARF factors
The G. raimondii, G. arboreum and G. hirsutum genome sequences were acquired from the CottonGen database. We built a hidden Markov model (HMM) profile of ARF factors (Pfam ID: PF06507) with the hmmbuild program using default parameters to identify Gossypium ARF transcription factor proteins. The SMART conserved domain search tool and the Pfam database were used to identify the conserved domains.
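As an illustration of how candidate proteins might be collected after such a profile search, the following Python sketch filters a HMMER tabular output (the format written by `hmmsearch --tblout`) by full-sequence E-value. The E-value cutoff of 1e-5, the function name and the example records are assumptions for illustration; the cutoff actually used in this study is not stated in the text.

```python
def parse_hmmsearch_tblout(lines, max_evalue=1e-5):
    """Return {target_name: best full-sequence E-value} for hits passing the cutoff.

    In the tblout format, comment lines start with '#'; column 1 is the
    target name and column 5 is the full-sequence E-value.
    """
    hits = {}
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.split()
        target = fields[0]
        evalue = float(fields[4])
        if evalue > max_evalue:
            continue
        # Keep the lowest E-value if a target is reported more than once.
        if target not in hits or evalue < hits[target]:
            hits[target] = evalue
    return hits


# Made-up records in tblout layout; identifiers are placeholders.
example = """\
# target name  accession  query name  accession  E-value  score  bias ...
protein_001    -          ARF_profile -          1.2e-30  110.5  0.1  2.0e-30 109.8 0.1 1.0 1 0 0 1 1 1 1 -
protein_002    -          ARF_profile -          0.5      5.1    0.0  0.7     4.8   0.0 1.0 1 0 0 1 1 1 1 -
""".splitlines()

arf_candidates = parse_hmmsearch_tblout(example)
```

Candidates retained in this way would still need the conserved-domain check with SMART and Pfam described above.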
Sequence alignment, Ka, Ks analyses and phylogenetic analyses
The amino acid and nucleotide sequences of Gossypium ARF factors were aligned with MAFFT using the G-INS-i algorithm. Ka, Ks and Ka/Ks values for each gene pair between the diploids and the allotetraploid were calculated with DnaSP v5. The neighbor-joining (NJ) phylogenetic tree was constructed in MEGA 5.03 with 1000 bootstrap replicates, based on the full-length ARF protein sequences.
Gene expression analyses based on transcriptome
Raw RNA-Seq data were downloaded from the NCBI Sequence Read Archive (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA248163), including G. hirsutum seed, root, stem, leaf, torus, petal, stamen, ovary, calyx, ovule (−3, −1, 0, 1, 3, 5, 10, 20, 25 and 35 dpa) and fiber (5, 10, 20 and 25 dpa). Gene expression was analyzed from the transcriptome data as in our previous study. Differentially expressed genes were identified using the following criteria: a fold change greater than two and a p-value less than 0.05. Multiple Experiment Viewer (MeV) was used to display the gene expression values.
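The two criteria for differential expression can be written as a simple filter. The Python sketch below is illustrative only: it assumes that the fold change is given as a positive ratio and that up- and down-regulation are treated symmetrically, which the text does not state explicitly; the function name is hypothetical.

```python
import math


def is_differentially_expressed(fold_change, p_value, min_fold=2.0, alpha=0.05):
    """Apply the two criteria: |log2 fold change| > log2(min_fold) and p < alpha.

    fold_change is a positive ratio (condition A / condition B), so a value
    of 0.25 counts as a four-fold decrease.
    """
    if fold_change <= 0:
        raise ValueError("fold_change must be a positive ratio")
    return abs(math.log2(fold_change)) > math.log2(min_fold) and p_value < alpha
```

Working on the log2 scale means a four-fold decrease (0.25) and a four-fold increase (4.0) are handled identically, and a change of exactly two-fold is excluded, in line with "more than two-fold".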
Plant materials and growth conditions
Gossypium hirsutum cv. R15 wild-type plants were obtained from the Institute of Cotton Research, Shanxi Academy of Agricultural Sciences, Yuncheng, Shanxi, China. Upland cotton R15 plants and their transgenic lines were grown in a greenhouse or under standard farming conditions in the experimental field of the Chinese Academy of Sciences in Shanghai, according to relevant national approvals for biotechnology research (China, http://pg.natesc.gov.cn/sites/pg/). The greenhouse was maintained at 28 °C day/20 °C night with a 16-h light/8-h dark photoperiod. Cotton tissues, including roots, cotyledons, petals, stamens, styles, ovules (−3, −1, 0 and 6 dpa) and fibers (3, 6, 12 and 18 dpa), were collected for expression analyses. Fibers were collected by scraping the ovules in liquid nitrogen. All tissues were frozen in liquid nitrogen immediately after sampling and stored at −80 °C until RNA extraction. All sampling was repeated three times.
All cotton samples were ground in liquid nitrogen, and total RNA was extracted using the RNAprep pure plant kit (TIANGEN, Shanghai, China) following the manufacturer’s protocol. qRT-PCR analyses were performed as in our previous study. Gene-specific forward and reverse primers for quantitative real-time PCR (qRT-PCR) were designed using the Primer5 software (Additional file 3: Table S1). Analyses were performed with SYBR-Green PCR Mastermix (TaKaRa) on a thermal cycler (Mastercycler RealPlex; Eppendorf Ltd., Shanghai, China). G. hirsutum histone-3 (GhHIS3, AF024716) was used as the internal reference gene, and the 2^−ΔΔCt method was used to calculate the relative amount of amplified product. Relative expression levels among different organs of G. hirsutum were normalized against the WT samples.
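For readers unfamiliar with the 2^−ΔΔCt calculation, a minimal Python sketch follows. It assumes equal, near-100% amplification efficiency for the target and reference genes, as the method itself does; the function name and the Ct values are hypothetical and not data from this study.

```python
def relative_expression(ct_target_sample, ct_ref_sample,
                        ct_target_calibrator, ct_ref_calibrator):
    """Relative expression by the 2^-ddCt method.

    dCt  = Ct(target) - Ct(reference gene), computed per sample
    ddCt = dCt(sample) - dCt(calibrator)
    """
    delta_ct_sample = ct_target_sample - ct_ref_sample
    delta_ct_calibrator = ct_target_calibrator - ct_ref_calibrator
    delta_delta_ct = delta_ct_sample - delta_ct_calibrator
    return 2.0 ** (-delta_delta_ct)


# Hypothetical Ct values: target vs. reference gene in a test sample and in
# the calibrator (e.g. wild type).
fold = relative_expression(22.0, 18.0, 24.0, 18.0)  # ddCt = -2, so fold = 4.0
```

Here the reference gene would be GhHIS3 and the calibrator the WT sample, following the normalization described above.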
Cotton transformation and fiber length analysis
The open reading frame (ORF) of GhARF2b was PCR-amplified from a G. hirsutum cv. R15 fiber cDNA library with PrimeSTAR HS DNA polymerase (Takara Biomedical Technology Co. Ltd., Beijing, China) and inserted into the pCAMBIA2301 vector to construct RDL1::GhARF2b. For 35S::dsGhARF2b, sense and antisense GhARF2b fragments, separated by a 120-bp intron of the RTM1 gene from A. thaliana, were cloned into pCAMBIA2301. Primers used in this investigation are listed in Additional file 3: Table S1. The binary constructs were transferred into Agrobacterium tumefaciens. Cotton transformation was conducted as reported by Shangguan et al. Transgenic cotton plants were grown in a glasshouse or in the field. β-glucuronidase (GUS) staining and PCR amplification were performed to identify transgenic lines in the T0 and subsequent generations. Thirty seeds from each plant were harvested for fiber length measurement.
Yeast two-hybrid assay
Yeast two-hybrid analyses were carried out using the Matchmaker GAL4 Two-Hybrid System as described previously. Briefly, the full-length ORF of GhHOX3 was inserted into pGBKT7 (Clontech), and GhARF2b or its individual domains were inserted into pGADT7 (Clontech). Plasmids were co-transformed into yeast strain AH109 by the LiCl-PEG method, and SD/−Leu/−Trp/−His selective plates containing 5 mM 3-AT (3-amino-1,2,4-triazole) were used to detect protein-protein interactions. Empty pGADT7 and pGBKT7 vectors were used as controls. Three biological replicates were performed for each transformation.
BiFC and dual-luciferase (dual-LUC) assays
We performed the BiFC assays following previous reports [73, 74]. In brief, the CDSs of GhARF2b and GhHOX3 were amplified and cloned into the JW771 and JW772 vectors, respectively. Each gene was fused to the carboxyl-terminal half (cLUC-GhARF2b/GhHOX3) and the amino-terminal half (GhARF2b/GhHOX3-nLUC) of luciferase (LUC), respectively. cLUC and nLUC were used as controls. Assays were performed as described [5, 75].
The dual-LUC assay was performed as reported [5, 76]. Briefly, the promoters of GhRDL1 and GhEXPA1 containing intact L1-boxes were inserted into the pGreen-LUC vector, which carries a firefly LUC reporter gene. The constructs were then transferred into Agrobacterium tumefaciens cells together with the co-suppression repressor plasmid pSoup-P19. Transient transformation was conducted by infiltrating the A. tumefaciens cells into N. benthamiana leaves. Total protein was extracted from the infiltrated area after 3 days. The Dual-Luciferase Reporter Assay System (Promega) was used to measure the luminescence of LUC and REN with a luminometer (BG-1, GEM Biomedical Inc.). The LUC value was normalized to that of REN. Three biological replicates were measured for each experiment.
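The LUC/REN normalization can be sketched in Python as follows. This is an illustrative outline only: the function names are hypothetical, the readings are invented, and expressing the result relative to a reporter-only control is an assumption about how such data are commonly summarized.

```python
from statistics import mean


def luc_ren_ratios(luc_readings, ren_readings):
    """Normalize firefly LUC to Renilla (REN) luminescence for each replicate."""
    if len(luc_readings) != len(ren_readings):
        raise ValueError("LUC and REN readings must be paired")
    if any(r == 0 for r in ren_readings):
        raise ValueError("REN reading of zero cannot be used for normalization")
    return [luc / ren for luc, ren in zip(luc_readings, ren_readings)]


def relative_activity(treatment_ratios, control_ratios):
    """Mean LUC/REN of a treatment expressed relative to the control mean."""
    return mean(treatment_ratios) / mean(control_ratios)


# Invented readings for two replicates of a control and an effector treatment.
control = luc_ren_ratios([100.0, 100.0], [100.0, 100.0])
with_effector = luc_ren_ratios([200.0, 400.0], [100.0, 100.0])
activation = relative_activity(with_effector, control)
```

Dividing by REN corrects for differences in infiltration and expression efficiency between leaf areas, which is why the ratio rather than the raw LUC signal is compared across treatments.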
Images were generated with an optical microscope (BX51, Olympus). For scanning electron microscope images, cotton ovules (− 1, 0, 1DPA) were attached with colloidal graphite to a copper stub, frozen under vacuum and visualized with a scanning electron microscope (JSM-6360LV, JEOL).
Availability of data and materials
The genome sequences of the three cotton species and the genome annotation gff3 files were downloaded from the CottonGen database (https://www.cottongen.org/data/download). Raw RNA-Seq data for G. hirsutum seed, root, stem, leaf, torus, petal, stamen, ovary, calyx, ovule and fiber were downloaded from the NCBI Sequence Read Archive (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA248163) (NCBI Sequence Read Archive SRR1695173, SRR1695174, SRR1695175, SRR1695177, SRR1695178, SRR1695179, SRR1695181, SRR1695182, SRR1695183, SRR1695184, SRR1695185, SRR1695191, SRR1695192, SRR1695193, SRR1695194, SRR1768504, SRR1768505, SRR1768506, SRR1768507, SRR1768508, SRR1768509, SRR1768510, SRR1768511, SRR1768512, SRR1768513, SRR1768514, SRR1768515, SRR1768516, SRR1768517, SRR1768518 and SRR1768519). The sequence of the G. hirsutum histone-3 gene (GhHIS3, AF024716), which was used as the internal reference, was downloaded from the National Center for Biotechnology Information (NCBI) database. The conserved domain of ARF transcription factors (Pfam ID: PF06507) was downloaded from the Pfam database (http://pfam.xfam.org/family/PF06507#tabview=tab3). All other data generated or analyzed during this study are included in this published article and its Additional files.
ARF: Auxin response factor
DPA: Days post anthesis
FPKM: Fragments per kilobase of transcript per million mapped fragments
G. arboreum: Gossypium arboreum
G. hirsutum: Gossypium hirsutum
G. raimondii: Gossypium raimondii
qRT-PCR: Quantitative real-time polymerase chain reaction
Schell J. Cotton carrying the recombinant insect poison Bt toxin: no case to doubt the benefits of plant biotechnology. Curr Opin Biotechnol. 1997;8(2):235–6.
Kim HJ, Triplett BA. Cotton fiber growth in planta and in vitro. Models for plant cell elongation and cell wall biogenesis. Plant Physiol. 2001;127(4):1361–6.
Graves DA, Stewart JM. Chronology of the differentiation of cotton (Gossypium hirsutum L.) fiber cells. Planta. 1988;175(2):254–8.
Fang DD, Naoumkina M, Kim HJ. Unraveling cotton Fiber development using Fiber mutants in the post-genomic era. Crop Sci. 2018;58(6):2214–28.
Shan CM, Shangguan XX, Zhao B, Zhang XF, Chao LM, Yang CQ, Wang LJ, Zhu HY, Zeng YD, Guo WZ, et al. Control of cotton fibre elongation by a homeodomain transcription factor GhHOX3. Nat Commun. 2014;5:5519.
Guan XY, Li QJ, Shan CM, Wang S, Mao YB, Wang LJ, Chen XY. The HD-zip IV gene GaHOX1 from cotton is a functional homologue of the Arabidopsis GLABRA2. Physiol Plant. 2008;134(1):174–82.
Walford SA, Wu Y, Llewellyn DJ, Dennis ES. Epidermal cell differentiation in cotton mediated by the homeodomain leucine zipper gene, GhHD-1. Plant J. 2012;71(3):464–78.
Zhao B, Cao JF, Hu GJ, Chen ZW, Wang LY, Shangguan XX, Wang LJ, Mao YB, Zhang TZ, Wendel JF, et al. Core cis-element variation confers subgenome-biased expression of a transcription factor that functions in cotton fiber elongation. New Phytol. 2018;218(3):1061–75.
Gong SY, Huang GQ, Sun X, Qin LX, Li Y, Zhou L, Li XB. Cotton KNL1, encoding a class II KNOX transcription factor, is involved in regulation of fibre development. J Exp Bot. 2014;65(15):4133–47.
Zhang Z, Ruan YL, Zhou N, Wang F, Guan X, Fang L, Shang X, Guo W, Zhu S, Zhang T. Suppressing a putative sterol carrier gene reduces Plasmodesmal permeability and activates sucrose transporter genes during cotton Fiber elongation. Plant Cell. 2017;29(8):2027–46.
Wan Q, Guan X, Yang N, Wu H, Pan M, Liu B, Fang L, Yang S, Hu Y, Ye W, et al. Small interfering RNAs from bidirectional transcripts of GhMML3_A12 regulate cotton fiber development. New Phytol. 2016;210(4):1298–310.
Wu H, Tian Y, Wan Q, Fang L, Guan X, Chen J, Hu Y, Ye W, Zhang H, Guo W, et al. Genetics and evolution of MIXTA genes regulating cotton lint fiber development. New Phytol. 2018;217(2):883–95.
Machado A, Wu Y, Yang Y, Llewellyn DJ, Dennis ES. The MYB transcription factor GhMYB25 regulates early fibre and trichome development. Plant J. 2009;59(1):52–62.
Walford SA, Wu Y, Llewellyn DJ, Dennis ES. GhMYB25-like: a key factor in early cotton fibre development. Plant J. 2011;65(5):785–97.
Zhang J, Huang GQ, Zou D, Yan JQ, Li Y, Hu S, Li XB. The cotton (Gossypium hirsutum) NAC transcription factor (FSN1) as a positive regulator participates in controlling secondary cell wall biosynthesis and modification of fibers. New Phytol. 2018;217(2):625–40.
Han LB, Li YB, Wang HY, Wu XM, Li CL, Luo M, Wu SJ, Kong ZS, Pei Y, Jiao GL, et al. The dual functions of WLIM1a in cell elongation and secondary wall formation in developing cotton fibers. Plant Cell. 2013;25(11):4421–38.
Ruan YL, Llewellyn DJ, Furbank RT. Suppression of sucrose synthase gene expression represses cotton fiber cell initiation, elongation, and seed development. Plant Cell. 2003;15(4):952–64.
Li XB, Fan XP, Wang XL, Cai L, Yang WC. The cotton ACTIN1 gene is functionally expressed in fibers and participates in fiber elongation. Plant Cell. 2005;17(3):859–75.
Xu B, Gou JY, Li FG, Shangguan XX, Zhao B, Yang CQ, Wang LJ, Yuan S, Liu CJ, Chen XY. A cotton BURP domain protein interacts with alpha-expansin and their co-expression promotes plant growth and fruit production. Mol Plant. 2013;6(3):945–58.
Shi YH, Zhu SW, Mao XZ, Feng JX, Qin YM, Zhang L, Cheng J, Wei LP, Wang ZY, Zhu YX. Transcriptome profiling, molecular biological, and physiological studies reveal a major role for ethylene in cotton fiber cell elongation. Plant Cell. 2006;18(3):651–64.
Huang GQ, Gong SY, Xu WL, Li W, Li P, Zhang CJ, Li DD, Zheng Y, Li FG, Li XB. A fasciclin-like arabinogalactan protein, GhFLA1, is involved in fiber initiation and elongation of cotton. Plant Physiol. 2013;161(3):1278–90.
Cao JF, Zhao B, Huang CC, Chen ZW, Zhao T, Liu HR, Hu GJ, Shangguan XX, Shan CM, Wang LJ, et al. The miR319-targeted GhTCP4 promotes the transition from cell elongation to wall thickening in cotton fiber. Mol Plant. 2020;13(7):1063–77.
Yu Y, Wu S, Nowak J, Wang G, Han L, Feng Z, Mendrinna A, Ma Y, Wang H, Zhang X, et al. Live-cell imaging of the cytoskeleton in elongating cotton fibres. Nat Plants. 2019;5(5):498–504.
Guilfoyle TJ, Hagen G. Auxin response factors. Curr Opin Plant Biol. 2007;10(5):453–60.
Tiwari SB, Hagen G, Guilfoyle T. The roles of auxin response factor domains in auxin-responsive transcription. Plant Cell. 2003;15(2):533–43.
Okushima Y, Overvoorde PJ, Arima K, Alonso JM, Chan A, Chang C, Ecker JR, Hughes B, Lui A, Nguyen D, et al. Functional genomic analysis of the AUXIN RESPONSE FACTOR gene family members in Arabidopsis thaliana: unique and overlapping functions of ARF7 and ARF19. Plant Cell. 2005;17(2):444–63.
Wang DK, Pei KM, Fu YP, Sun ZX, Li SJ, Liu HQ, Tang K, Han B, Tao YZ. Genome-wide analysis of the auxin response factors (ARF) gene family in rice (Oryza sativa). Gene. 2007;394(1–2):13–24.
Vert G, Walcher CL, Chory J, Nemhauser JL. Integration of auxin and brassinosteroid pathways by Auxin response factor 2. Proc Natl Acad Sci U S A. 2008;105(28):9829–34.
Wang L, Hua DP, He JN, Duan Y, Chen ZZ, Hong XH, Gong ZZ. Auxin response factor2 (ARF2) and its regulated homeodomain gene HB33 mediate abscisic acid response in Arabidopsis. PLoS Genet. 2011;7(7):e1002172.
Schruff MC, Spielman M, Tiwari S, Adams S, Fenby N, Scott RJ. The AUXIN RESPONSE FACTOR 2 gene of Arabidopsis links auxin signalling, cell division, and the size of seeds and other organs. Development. 2006;133(2):251–61.
Breitel DA, Chappell-Maor L, Meir S, Panizel I, Puig CP, Hao Y, Yifhar T, Yasuor H, Zouine M, Bouzayen M, et al. AUXIN RESPONSE FACTOR 2 intersects hormonal signals in the regulation of tomato fruit ripening. PLoS Genet. 2016;12(3):e1005903.
Sun RR, Wang KB, Guo TL, Jones DC, Cobb J, Zhang BH, Wang QL. Genome-wide identification of auxin response factor (ARF) genes and its tissue-specific prominent expression in Gossypium raimondii. Funct Integr Genomics. 2015;15(4):481–93.
Lim PO, Lee IC, Kim J, Kim HJ, Ryu JS, Woo HR, Nam HG. Auxin response factor 2 (ARF2) plays a major role in regulating auxin-mediated leaf longevity. J Exp Bot. 2010;61(5):1419–30.
Zhao S, Zhang ML, Ma TL, Wang Y. Phosphorylation of ARF2 relieves its repression of transcription of the K+ transporter gene HAK5 in response to low potassium stress. Plant Cell. 2016;28(12):3005–19.
Okushima Y, Mitina I, Quach HL, Theologis A. AUXIN RESPONSE FACTOR 2 (ARF2): a pleiotropic developmental regulator. Plant J. 2005;43(1):29–46.
Wang S, Wang JW, Yu N, Li CH, Luo B, Gou JY, Wang LJ, Chen XY. Control of plant trichome development by a cotton fiber MYB gene. Plant Cell. 2004;16(9):2323–34.
Paterson AH, Wendel JF, Gundlach H, Guo H, Jenkins J, Jin D, Llewellyn D, Showmaker KC, Shu S, Udall J, et al. Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres. Nature. 2012;492(7429):423–7.
Wang K, Wang Z, Li F, Ye W, Wang J, Song G, Yue Z, Cong L, Shang H, Zhu S, et al. The draft genome of a diploid cotton Gossypium raimondii. Nat Genet. 2012;44(10):1098–103.
Li F, Fan G, Wang K, Sun F, Yuan Y, Song G, Li Q, Ma Z, Lu C, Zou C, et al. Genome sequence of the cultivated cotton Gossypium arboreum. Nat Genet. 2014;46(6):567–72.
Du X, Huang G, He S, Yang Z, Sun G, Ma X, Li N, Zhang X, Sun J, Liu M, et al. Resequencing of 243 diploid cotton accessions based on an updated A genome identifies the genetic basis of key agronomic traits. Nat Genet. 2018;50(6):796–802.
Huang G, Wu Z, Percy RG, Bai M, Li Y, Frelichowski JE, Hu J, Wang K, Yu JZ, Zhu Y. Genome sequence of Gossypium herbaceum and genome updates of Gossypium arboreum and Gossypium hirsutum provide insights into cotton A-genome evolution. Nat Genet. 2020;52(5):516–24.
Zhang T, Hu Y, Jiang W, Fang L, Guan X, Chen J, Zhang J, Saski CA, Scheffler BE, Stelly DM, et al. Sequencing of allotetraploid cotton (Gossypium hirsutum L. acc. TM-1) provides a resource for fiber improvement. Nat Biotechnol. 2015;33(5):531–7.
Chen ZJ, Sreedasyam A, Ando A, Song Q, De Santiago LM, Hulse-Kemp AM, Ding M, Ye W, Kirkbride RC, Jenkins J, et al. Genomic diversifications of five Gossypium allopolyploid species and their impact on cotton improvement. Nat Genet. 2020;52(5):525–33.
Li F, Fan G, Lu C, Xiao G, Zou C, Kohel RJ, Ma Z, Shang H, Ma X, Wu J, et al. Genome sequence of cultivated upland cotton (Gossypium hirsutum TM-1) provides insights into genome evolution. Nat Biotechnol. 2015;33(5):524–30.
Liu X, Zhao B, Zheng HJ, Hu Y, Lu G, Yang CQ, Chen JD, Chen JJ, Chen DY, Zhang L, et al. Gossypium barbadense genome sequence provides insight into the evolution of extra-long staple fiber and specialized metabolites. Sci Rep. 2015;5:14139.
Yuan D, Tang Z, Wang M, Gao W, Tu L, Jin X, Chen L, He Y, Zhang L, Zhu L, et al. The genome sequence of Sea-Island cotton (Gossypium barbadense) provides insights into the allopolyploidization and development of superior spinnable fibres. Sci Rep. 2015;5:17662.
Wang M, Tu L, Yuan D, Zhu D, Shen C, Li J, Liu F, Pei L, Wang P, Zhao G, et al. Reference genome sequences of two cultivated allotetraploid cottons, Gossypium hirsutum and Gossypium barbadense. Nat Genet. 2019;51(2):224–9.
Hu Y, Chen J, Fang L, Zhang Z, Ma W, Niu Y, Ju L, Deng J, Zhao T, Lian J, et al. Gossypium barbadense and Gossypium hirsutum genomes provide insights into the origin and evolution of allotetraploid cotton. Nat Genet. 2019;51(4):739–48.
Yang Z, Ge X, Yang Z, Qin W, Sun G, Wang Z, Li Z, Liu J, Wu J, Wang Y, et al. Extensive intraspecific gene order and gene structural variations in upland cotton cultivars. Nat Commun. 2019;10(1):2989.
Wendel JF. The wondrous cycles of polyploidy in plants. Am J Bot. 2015;102(11):1753–6.
Chen ZW, Cao JF, Zhang XF, Shangguan XX, Mao YB, Wang LJ, Chen XY. Cotton genome: challenge into the polyploidy. Sci Bull. 2017;62(24):1622–3.
Kalluri UC, Difazio SP, Brunner AM, Tuskan GA. Genome-wide analysis of Aux/IAA and ARF gene families in Populus trichocarpa. BMC Plant Biol. 2007;7:59.
Xing H, Pudake RN, Guo G, Xing G, Hu Z, Zhang Y, Sun Q, Ni Z. Genome-wide identification and expression profiling of auxin response factor (ARF) gene family in maize. BMC Genomics. 2011;12(1):178.
Liu SQ, Hu LF. Genome-wide analysis of the auxin response factor gene family in cucumber. Genet Mol Res. 2013;12(4):4317–31.
Hardtke CS, Berleth T. The Arabidopsis gene MONOPTEROS encodes a transcription factor mediating embryo axis formation and vascular development. EMBO J. 1998;17(5):1405–11.
Nemhauser JL, Feldman LJ, Zambryski PC. Auxin and ETTIN in Arabidopsis gynoecium morphogenesis. Development. 2000;127(18):3877–88.
Ellis CM, Nagpal P, Young JC, Hagen G, Guilfoyle TJ, Reed JW. AUXIN RESPONSE FACTOR1 and AUXIN RESPONSE FACTOR2 regulate senescence and floral organ abscission in Arabidopsis thaliana. Development. 2005;132(20):4563–74.
Richter R, Behringer C, Zourelidou M, Schwechheimer C. Convergence of auxin and gibberellin signaling on the regulation of the GATA transcription factors GNC and GNL in Arabidopsis thaliana. Proc Natl Acad Sci U S A. 2013;110(32):13192–7.
Zhang M, Zheng X, Song S, Zeng Q, Hou L, Li D, Zhao J, Wei Y, Li X, Luo M, et al. Spatiotemporal manipulation of auxin biosynthesis in cotton ovule epidermal cells enhances fiber yield and quality. Nat Biotechnol. 2011;29(5):453–8.
Xiao G, He P, Zhao P, Liu H, Zhang L, Pang C, Yu J. Genome-wide identification of the GhARF gene family reveals that GhARF2 and GhARF18 are involved in cotton fibre cell initiation. J Exp Bot. 2018;69(18):4323–37.
Yu J, Jung S, Cheng CH, Ficklin SP, Lee T, Zheng P, Jones D, Percy RG, Main D. CottonGen: a genomics, genetics and breeding database for cotton research. Nucleic Acids Res. 2014;42(Database issue):D1229–36.
Wong DC, Schlechter R, Vannozzi A, Holl J, Hmmam I, Bogs J, Tornielli GB, Castellarin SD, Matus JT. A systems-oriented analysis of the grapevine R2R3-MYB transcription factor family uncovers new insights into the regulation of stilbene accumulation. DNA Res. 2016;23:451–66.
Mistry J, Finn RD, Eddy SR, Bateman A, Punta M. Challenges in homology search: HMMER3 and convergent evolution of coiled-coil regions. Nucleic Acids Res. 2013;41(12):e121.
Letunic I, Bork P. 20 years of the SMART protein domain annotation resource. Nucleic Acids Res. 2018;46(D1):D493–6.
Finn RD, Coggill P, Eberhardt RY, Eddy SR, Mistry J, Mitchell AL, Potter SC, Punta M, Qureshi M, Sangrador-Vegas A, et al. The Pfam protein families database: towards a more sustainable future. Nucleic Acids Res. 2016;44(D1):D279–85.
Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30(4):772–80.
Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009;25(11):1451–2.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011;28(10):2731–9.
Cao J-F, Huang J-Q, Liu X, Huang C-C, Zheng Z-S, Zhang X-F, Shangguan X-X, Wang L-J, Zhang Y-G, Wendel JF, et al. Genome-wide characterization of the GRF family and their roles in response to salt stress in Gossypium. BMC Genomics. 2020;21(1):575.
Saeed AI, Sharov V, White J, Li J, Liang W, Bhagabati N, Braisted J, Klapa M, Currier T, Thiagarajan M, et al. TM4: a free, open-source system for microarray data management and analysis. Biotechniques. 2003;34(2):374–8.
Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2(−Delta Delta C(T)) method. Methods. 2001;25(4):402–8.
Shangguan XX, Xu B, Yu ZX, Wang LJ, Chen XY. Promoter of a cotton fibre MYB gene functional in trichomes of Arabidopsis and glandular trichomes of tobacco. J Exp Bot. 2008;59(13):3533–42.
Gou JY, Felippes FF, Liu CJ, Weigel D, Wang JW. Negative regulation of anthocyanin biosynthesis in Arabidopsis by a miR156-targeted SPL transcription factor. Plant Cell. 2011;23(4):1512–22.
Chen HM, Zou Y, Shang YL, Lin HQ, Wang YJ, Cai R, Tang XY, Zhou JM. Firefly luciferase complementation imaging assay for protein-protein interactions in plants. Plant Physiol. 2008;146(2):368–76.
Papp I, Mette MF, Aufsatz W, Daxinger L, Schauer SE, Ray A, van der Winden J, Matzke M, Matzke AJ. Evidence for nuclear processing of plant micro RNA and short interfering RNA precursors. Plant Physiol. 2003;132(3):1382–90.
Liu H, Yu X, Li K, Klejnot J, Yang H, Lisiero D, Lin C. Photoexcited CRY2 interacts with CIB1 to regulate transcription and floral initiation in Arabidopsis. Science. 2008;322(5907):1535–9.
We thank Prof. Tian-Zhen Zhang for providing the RNA-seq data and calculating the RPKM values, and Prof. Xiao-Ya Chen for participating in the discussion and revising the manuscript.
The work reported in this publication was supported by the National Natural Science Foundation of China (Award Nos. 31690092, 31571251 and 31788103), the National Key R&D Program of China (2016YFD0100500), the Ministry of Agriculture of China (2016ZX08005–003), and the China Postdoctoral Science Foundation (Award Nos. 2017M621546 and 2018T110411). The funding bodies did not participate in the design of the study; in the collection, analysis, or interpretation of data; or in writing the manuscript.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Multiple alignment of GrARF2 (Gossypium raimondii ARF2) and AtARF2 protein sequences.
Expression patterns of ARF genes in G. hirsutum based on RNA-seq data. FPKM represents fragments per kilobase of exon model per million mapped reads. DPA, days post-anthesis.
List of forward and reverse primers used for this study.
Cite this article
Zhang, X., Cao, J., Huang, C. et al. Characterization of cotton ARF factors and the role of GhARF2b in fiber development. BMC Genomics 22, 202 (2021). https://doi.org/10.1186/s12864-021-07504-6
- Fiber elongation
- Fiber initiation