Genome-wide association study reveals the genetic architecture of root hair length in maize

Background Root hair, a special type of tubular-shaped cell, outgrows from root epidermal cell and plays important roles in the acquisition of nutrients and water, as well as interactions with biotic and abiotic stress. Although many genes involved in root hair development have been identified, genetic basis of natural variation in root hair growth has never been explored. Results Here, we utilized a maize association panel including 281 inbred lines with tropical, subtropical, and temperate origins to decipher the phenotypic diversity and genetic basis of root hair length. We demonstrated significant associations of root hair length with many metabolic pathways and other agronomic traits. Combining root hair phenotypes with 1.25 million single nucleotide polymorphisms (SNPs) via genome-wide association study (GWAS) revealed several candidate genes implicated in cellular signaling, polar growth, disease resistance and various metabolic pathways. Conclusions These results illustrate the genetic basis of root hair length in maize, offering a list of candidate genes predictably contributing to root hair growth, which are invaluable resource for the future functional investigation. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-07961-z.


Background
Root hairs are special tubular-shaped outgrowth from root epidermal cells, which vastly enlarge the root surface area and assist in water and nutrient absorption such as NO 3− , Cl − , Ca 2+ , K + , Zn 2+ , and Mn 2+ , as well as the interactions with biotic and abiotic stresses [1,2]. On the other hand, the length, density and morphology of root hair are influenced by various endogenous and environmental factors, including phytohormones and mineral nutrients, especially under phosphate (Pi) limiting condition [3][4][5][6][7]. The development of root hair can be separated into three basic stages: specification of the epidermal cell fate, initiation of the root hair outgrowth, and elongation of the hair via tip growth [2,8].
As a unique single cell type in plant biology, the development, physiology, and cell biology of root hair have been intensively studied in Arabidopsis [2,6,[9][10][11]. In contrast, only a few of genes functional in root hair development have been known in monocot crop species [12]. In rice, some RSL class I and RSL Class II genes have been reported to positively regulate the development of root hairs, suggesting that the mechanism of RSL-regulated root hair development is at least partly conserved among grasses and eudicots [13,14]. In addition, several genes involved in the elongation of root hairs also have been identified in rice, including OsEXPA8 [15], OsEXPA17 [16], OsCSLD1 [17], OsFH1 [18], OsSNDP1 [19] and OsPHR [20]. So far, a total of six genes involved in root hair development have been identified in maize. Roothairless 1 (rth1), rth3, rth5 and rth6 mutants are deficient in root hair formation in all root types, and exhibit defect in different stages of root hair development [21][22][23][24]. The ZmLRL5 gene, encoding a basic helix-loop-helix (bHLH) transcription factor, was demonstrated to play a positive role in orchestrating the translational process by directly regulating the expression of translational processes/ribosomal genes during maize root hair growth. The loss-of-function of ZmLRL5 resulted in a dramatic reduction in the elongation of root hair [25]. Recently, ZmTIP1, encoding a functional S-acyltransferase, was identified to participate in drought tolerance by regulating root hair growth [26]. Although these reports have made great contribution to understand molecular regulation of root hair development in maize, it remains unclear how the root hair growth is controlled in a natural population.
In recent years, genome-wide association studies (GWAS), which is based on linkage disequilibrium (LD) in a panel, has offered high mapping resolution and could effectively benefit the exploration of the genetic basis associated with complex quantitative traits [27][28][29][30]. In maize, LD decay is rapid due to its extensive genetic diversity. Therefore, maize is recognized as an ideal model plant for conducting association studies [31][32][33][34]. To date, GWAS has successfully used to identify numerous candidate loci/ genes controlling sever-al morphological or metabolic traits in maize, such as shoot apical meristem size, husk trait, plant height, kernel weight, drought tolerance, grain drying rate and grain moisture [35][36][37][38][39][40][41]. In this study, we used a maize association panel including 281 inbred lines with tropical, subtropical, and temperate backgrounds to interpret the phenotypic diversity and the genetic basis of root hair development. Several candidate genes putatively involved in root hair development were identified, providing a useful resource for further functional studies to elucidate molecular pathways involved in maize root hair growth and development.

Phenotypic variation among root hair length
The association population in this study consists of a global collection of 281 diverse maize inbred lines [42,43]. The root hair length of primary roots was measured from 3-day-old plants (Table S1). The noticeable variation in root hair length was represented by B73 and Mo17, two inbred lines commonly used in maize biology (Fig. 1a). The measured root hair trait followed a normal distribution with a slight right skew (Fig. 1b). The trait maximum, minimum, mean, standard deviation and coefficient of variation were listed in Table 1. The ratio of root hair length of inbred lines to B73 control ranged from 0.2 to 1.42 with the mean of 0.95, indicating that the root hair length exhibits broad variations in the association population.
All 281 lines used in this study include three subpopulations and one mixed group, which are referred to TST, SS, NSS, and MIXED [42]. TST subpopulation is of tropical or subtropical origin, consisting of 112 lines. SS and NSS subpopulations are of temperate origin, consisting of 22 and 30 lines, respectively. MIXED subpopulation is inbred lines which were not accurately assigned into the above three subpopulations based on the phylogenic analysis [42,43]. To investigate the effect of population structure on root hair phenotypes, the root hair length was compared between different subpopulations. Compared with SS, NSS and MIXED subpopulations, the mean value of TST subpopulation was significantly less, suggesting that maize inbred lines from tropical/subtropical origin tend to have shorter root hairs (Fig. 1c).

Associations of root hair phenotype with agronomic traits and metabolic pathways
As root hairs play a crucial role in the plant acquisition of nutrients and water, we postulated that root hair morphology is likely coordinated with other agronomic traits and amino acid metabolism. To verify our hypothesis, the Pearson-correlations were calculated after comparing root hair length with 17 agronomic traits and 18 amino acid contents in maize kernel, which were previously measured in the same association panel [44,45]. The 17 agronomic traits include seven morphological traits, i.e. plant height (PH), ear height (EH), ear leaf width (ELW), ear leaf length (ELL), tassel maximum axis length (TMAL), tassel branch number (TBN), leaf number above ear (LNAE); seven yield-related traits, i.e. ear length (EL), ear diameter (ED), cob diameter (CD), kernel number per row (KNPR), cob grain weight (GW), cob weight (CW), kernel width (KW); three flowering-related traits, i.e. days to anthesis (DTA), days to silking (DTS) and days to heading (DTH) [44]. The amino acids measured in dry maize kernel include Ala, Arg, Asx, Glx, Gly, Lle, Leu, Lys, Met, Pro, Phe, Val, Tyr, His, Cys, Thr and Ser, and the total concentration of amino acids [45].
Of the 17 agronomic traits examined, 7 were correlated with root hair length (P ≤ 0.05), which were EH, ELW, TBN, LNAE, and all three flowering-related traits (Fig. 2a). Meanwhile, of the 18 amino acids, 3 were correlated with root hair length (P ≤ 0.05), which were Gly, Lys and Arg (Fig. 2b). The Pathway Association Study Tool (PAST) was further performed to elucidate the biochemical pathways likely contributing to root hair elongation [46][47][48]. Under the statistical threshold P < 0.01, a total of 8 and 1 metabolic pathways exhibited significant associations with increased or decreased root hair length, respectively (Table S2 and S3). Overall, the close correlations of root hair length with other agronomic or metabolic traits above-mentioned suggest that the root hair occurrence is correlated to other aspects of plant development and growth.

GWAS on maize root hair length
Using 1,253,814 SNPs covering the whole maize genome with a minor allele frequency (MAF) ≥ 0.05, we performed GWAS to explore the genetic loci underlying the root hair length. Under the mixed linear model (MLM) [49,50], which accounts for false positives arising from the population structure (Q matrix) and kinship relationship (K matrix) of the natural variation in the population, a stringent threshold of -log 10 P ≥ 5.2 was designated as the threshold for calling significantly associated SNPs. The Manhattan plots for the SNPs associated with root hair length were shown in Fig. 3. In total, 18 significant SNPs were detected, which are located on chromosomes 1, 2, 4, 5, 6, and 10, explaining the phenotypic variations ranged from 6.3 to 10.1%. Moreover, GWAS was also performed using the general linear Under the stringent threshold oflog 10 P ≥ 6.5, a total of 14 SNPs were identified, 12 out of which overlapped with MLM-derived SNPs (Fig. S1). As there were 8 MLM-derived SNPs on chromosome 6 present within the same LD region (r 2 ≥ 0.2), and the leading SNP_ 75630277 was selected as the representative of this locus (Table 2), yielding a total of 11 SNPs used for the further analysis.

Genes co-localized with significant SNPs
Protein-coding genes harboring or nearest to the significant SNPs were nominated as the candidates associated with root hair ( Table 2). The allelic effect of haplotype coordinated with significant SNPs on root hair phenotypes was assessed ( Fig. 4 and Figure S2). The most significant SNP (chr6.S_ 34221678, P-value = 1.70E-07, R 2 = 9.0%) locates in the fourth exon of GRMZM2G403003 (Fig. 4a), encoding a TON1 recruiting motif (TRM) protein. The average root hair length for A allele was substantially shorter than T allele (P ≤ 0.01, Fig. 4d). The second significant SNP (chr5.S_ 198197360, P-value = 1.80E-07, R 2 = 9.8%) locates in the third intron of GRMZM2G044851 (Fig. 4b), encoding the nitrate transporter 1.5. The average root hair length for allele with A was greatly longer than allele with C (P ≤ 0.01, Fig. 4e). The third significant SNP (chr6.S_ 75630277, P-value = 3.97E-07, R 2 = 9.7%), which is the one with other 7 SNPs within the same LD region, locates at the promoter region of GRMZM5G825276 (Fig. 4c), encoding a GDSL-like lipase. The average root hair length for allele with A was greatly longer than allele with G (P ≤ 0.01, Fig. 4f). It is noted that the haplotype analysis for the other eight genes was depicted in Fig. S2.

Discussion
Root hairs enlarge the root surface area and thereby play vital roles in plant absorbing water and nutrient, as well as coping with biotic and abiotic stress. In recent years, extensive mutation-based studies have been carried out to dissect the genetic regulatory network determining root hair development, but we lack fundamental knowledge about whether and how root hair is controlled in a plant natural variation population. In this work, we interpreted the natural variation and the associated genetic architecture of root hair length in a maize association panel, and a set of putative candidate genes controlling root hair development were revealed by performing GWAS analysis.

Phenotypic variability and coordination of root hair length with other processes of plant development and growth
Maize was domesticated from its wild ancestor, teosinte (Zea mays ssp. parviglumis) about 8000-9000 years ago. Nowadays, modern maize displays a large geographic distribution from tropical to temperate climate. Therefore, population structure is associated with significant differences in maize morphology [51]. In this study, we observed that root hair exhibits wide length variation with normal distribution. Interestingly, tropical lines displayed shorter root hairs relative to temperate lines, implying that the adaption of maize from tropical to temperate regions was probably accompanied by the character of lessening root hair growth. In our investigation, the root hair at the length dimension was negatively rather than positively correlated with several morphological and metabolic traits. This is somewhat counter-intuitive. However, it is known that the maize plant is a monocot and has a fibrous root system consisting of massive adventitious and lateral roots, the anatomy of which is markedly different from dicotyledonous model Arabidopsis [52][53][54]. In this context, it is possible that the other aspects of root system may enable maize to have adequate larger surface area to support plant growth under normal soil condition. Meanwhile, it is worthy to be mention that to facilitate high throughput phenotyping, the root hair from the emerging primary roots after 3-days post germination was assessed in this study, thereby whether the growth rate will alter distinctly in different inbred lines is uncertain at this point.

Putative genes involved in root hair morphogenesis
We identified a total of 11 candidate genes associated with root hair elongation, and the homologs of three genes have been reported to influence root hair development in Arabidopsis. GRMZM2G100288 encodes a receptor-like protein kinase, and its closest homologs in Arabidopsis is FERONIA (FER), which is well known as a key hub of cell signaling networks mediating various hormone, stress, and immune responses [55][56][57][58][59][60][61][62][63][64][65]. Root hair initiation and elongation require functional FER, and its loss-of-function caused severe root hair defects [66][67][68]. GRMZM2G064644 encodes a protein containing a CRIB motif required for its specific interaction with GTP-bound Rop1 (plant-specific Rho GTPase), promoting root hair development [69][70][71]. GRMZM2G403003 encoding a TON1 recruiting motif (TRM) proteins is able to target TON1 to cortical microtubules [72]. It has been reported that the disruption of transverse cortical microtubules is essential for root hair initiation [73,74].
Nucleotide binding site leucine-rich repeat (NBS-LRR) comprises a large class of disease resistance proteins that play a widespread role in plant protection against pathogens [75,76]. Surprisingly, out of 11 candidate genes associated with root hair elongation, 3 encode NBS-LRR protein. Although disease resistance is the only function demonstrated for NBS-LRR proteins so far, their functions in other areas of plant biology cannot be excluded [77]. For instance, Arabidopsis BNT1 encodes an atypical TIR-NBS-LRR protein and works as a regulator of the hormonal response to stress. The disruption of BNT1 could result in dramatic alteration in root hair distribution [78].
The function and morphology of root hairs in relation to the acquisition of water and nutrients have been well studied previously [79]. Root hair length could be stimulated and elongated under different nutrient deficiencies, such as phosphorus, potassium, magnesium, iron, or manganese [80][81][82][83][84]. GRMZM2G000471 and GRMZM2G044851 encode an aquaporin and a nitrate transporter, respectively. The loss-of-function of HvEXPB7 and OsWOX11 severely suppressed root hairs under drought conditions [85,86]. AtNPF7.3/NRT1.5 has been reported to alter lateral root architecture under potassium deprivation [87].
Root hair requires robust activity of various metabolism pathways to support its polar elongation [24,[88][89][90]. GRMZM2G147446 encodes a UDP-rhamnose/UDP-galactose transporter situated in the Golgi lumen where UDPgalactose is used for synthesis of noncellulosic polysaccharides and glycoproteins [91][92][93][94]. GRMZM5G825276 encodes a GDSL-like lipase involved in lipid biosynthesis [95,96]. Overall, although the involvement of these candidate genes in root hair growth and development is biologically conceivable, their biological importance waits further functional validation. Meanwhile, once functionally proved, how the natural polymorphisms in these genes regulating root hair growth are the other intriguing questions worthy of future studies. Moreover, given that the importance of root hair in plant growth, especially under water or nutrient limitation condition, understanding and manipulating the molecular regulatory network of root hair development would prospectively benefit and facilitate the crop breeding program.

Conclusions
Prior to this study, genetic architecture and genes controlling natural variation in maize root hair development remains unclear. We elucidated a broadly natural variation in the root hair length in a maize association population. Several genetic loci putatively regulating the natural variation in root hair were revealed by performing GWAS, shedding light on novel knowledge about the genetic basis of root hair development in maize. Given that the importance of root hair in plant nutrient acquisition and water uptake, candidate genes identified in this study provide a list of research targets for future functional characterization to understand how root hair growth is naturally regulate, and benefit the breeding program to improve maize varieties with proper root hair morphology assisted by molecular breeding and engineering.

Association mapping panel and genotyping
Association tests were conducted in an association panel consisting of 281 inbred lines, which are publicly available at http://maizego.org/. The maize lines in this association panel were clustered into three subpopulations, including 22 SS lines, 30 NSS lines and 112 TST lines, and the remaining 117 lines fall into a mixed subpopulation. Detailed information about the origins of these lines was described in previous studies [42,43]. The 1.25 million high-quality SNPs (MAF > 5%) and the estimated population structure and kinship were assessed using the method previously described [97,98].

Phenotypic data collection and statistical analysis
Maize seeds were surface-sterilized in a 10% NaClO solution for 5 min, rinsed several times with distilled water, and germinated in moist germination paper rolls (Anchor Paper) at 28°C under constant darkness as previously described [25]. The root hair length of primary roots from 3-day-old plants were captured using stereo microscope. Quantification of the root hair length was measured from captured images using ImageJ. Given that the limited number of inbred lines could be measured at the same time, we cultured 15 inbred lines in each batch, and selected 5 representative plants at the same developmental stage to measure the root hair length. To rule out the variation, the B73 line was used as control of each batch. Eventually, the ratio of root hair length between each individual inbred line and B73 were collected and inputted as phenotypic data.
Genome-wide association mapping and phenotypic variance contribution of significant loci A GWAS on root hair length was performed using Tassel 5.2 under both GLM and MLM. Considering the non-independence of SNPs caused by strong LD, it is usually too strict for significant association detection when the threshold is derived from the total number of markers. Thus, the effective number of independent markers for the multiple adjustment were used to obtain the P value thresholds [99,100]. The 165,248 markers in approximate linkage equilibrium with each other were found by PLINK (window size 50, step size 50, r 2 ≥ 0.2) [101]. Then, we used the uniform Bonferroni-corrected thresholds at α = 1 for MLM and α = 0.05 for GLM as the significance cutoffs as reported in the previous studies [37,44,102]. Finally, the suggestive P value was computed by 1/n and 0.05/n (n = 165,248), and we obtained the suggestive threshold 6.05 × 10 − 6 for MLM and 3.03 × 10 − 7 for GLM, respectively.
To estimate the phenotypic variance explained by each significant SNP, we used ANOVA to construct linear models of Y = αX + βP + ε (1) and Y = βP + ε (2). In this model, Y is the phenotype, X is the SNP genotype, P is the matrix of three subpopulations (NSS, SS and TST), α is the SNP effect, β is the subpopulation effects, and ε is random effects. Thus, the R 2 of each significant SNP after adjusting for the population structure effects were reported as previously described [37].

Pathway analysis
The pathway analysis was performed in https:// maizegdb.org/past. The resulting SNP-trait association data and effects data generated by TASSEL were implemented in the pathway analysis. During the process of loading data, the LD data is filtered to drop rows where the loci are not the same, and then unneeded columns from the TASSEL output are dropped [103]. Only pathways with five or more mapped genes were considered in the analysis. Significance of the enrichment score was determined by permutation analysis (1000 random permutations of the effect values).

Prediction of candidate genes
To search the candidate genes underlying associated SNPs, we selected the most significant/leading SNPs within the same LD block (R 2 < 0.2) to represent the locus, and candidate genes were nominated by the leading SNP positioned. The physical locations of the SNPs were recorded according to the B73 RefGen_v2 (www. maizesequence.org). The corresponding genes were annotated based on the literatures describing the function of their homologs in other species or the information retrieved from conserved domain database (CDD).  Table S1. Root hair length ratio (Inbreds/B73) of the 281 inbred lines from different subpopulations.
Additional files 4: Supplemental Table 2. Pathways associated with increased root hair length at statistical significance P < 0.05. Table 3. Pathways associated with decreased hair length at statistical significance P < 0.05.

Acknowledgments
We thank all members of our laboratories for the helpful assistances and discussions during the research.

Availability of data and materials
All supporting data can be found within the manuscript and its additional files.

Declarations
Ethics approval and consent to participate All the experiments on plant seeds were in accordance with guidelines of China Agricultural University. The source from the maize seeds were publicly available at Maizego.

Consent for publication
Not applicable.