- Research article
- Open Access
Genome-wide association analysis on pre-harvest sprouting resistance and grain color in U.S. winter wheat
BMC Genomics volume 17, Article number: 794 (2016)
Pre-harvest sprouting (PHS) in wheat can cause substantial reduction in grain yield and end-use quality. Grain color (GC) together with other components affect PHS resistance. Several quantitative trait loci (QTL) have been reported for PHS resistance, and two of them on chromosome 3AS (TaPHS1) and 4A have been cloned.
To determine genetic architecture of PHS and GC and genetic relationships of the two traits, a genome-wide association study (GWAS) was conducted by evaluating a panel of 185 U.S. elite breeding lines and cultivars for sprouting rates of wheat spikes and GC in both greenhouse and field experiments. The panel was genotyped using the wheat 9K and 90K single nucleotide polymorphism (SNP) arrays.
Four QTL for GC on four chromosomes and 12 QTL for PHS resistance on 10 chromosomes were identified in at least two experiments. QTL for PHS resistance showed varied effects under different environments, and those on chromosomes 3AS, 3AL, 3B, 4AL and 7A were the more frequently identified QTL. The common QTL for GC and PHS resistance were identified on the long arms of the chromosome 3A and 3D.
Wheat grain color is regulated by the three known genes on group 3 chromosomes and additional genes from other chromosomes. These grain color genes showed significant effects on PHS resistance in some environments. However, several other QTL that did not affect grain color also played a significant role on PHS resistance. Therefore, it is possible to breed PHS-resistant white wheat by pyramiding these non-color related QTL.
Pre-harvest sprouting (PHS) of wheat (Triticum aestivum L.) refers to the germination of wheat grains in matured spikes before harvest due to continuous wet weather during harvest seasons. PHS can result in a significant reduction in wheat grain yield and grain end-use quality, thus a reduction in grain sale price [1, 2]. Growing PHS-resistant cultivars is the most effective way to minimize PHS damage. PHS resistance QTL have been reported on almost all wheat chromosomes. One major QTL mapped on chromosome 3AS, designated as TaPHS1, has been cloned [3, 4]. Another major QTL on chromosome 4AL has been fine mapped with single nucleotide polymorphisms (SNPs) [5–7]. Recently, several candidate genes have been reported for the 4A QTL in different studies [6, 8]. In addition, several minor QTL have also been reported on chromosomes 2B [9–12], 3D , 4B, 4D  and many others .
Wheat grain color (GC) has long been associated with PHS, and red-grained wheats are usually more tolerant to PHS than the white-grained wheats [16–18]. The pigments, catechin and proanthocyanidins (PAs) synthesized through the flavonoid synthesis pathway, result in red GC [19, 20]. Early cytogenetic studies suggested that three genes, R-A1, R-B1 and R-D1, on homoeologous group 3 chromosomes control GC [21–23], and show a pleiotropic effect on wheat PHS resistance by accumulating catechin, a precursor of the red pigment, that inhibits grain germination [19, 24]. Flinthman  found that grain dormancy levels were increased in white-grained wheat NS-67 after adding a single GC (R) gene to one of group 3 chromosomes. Groos et al.  identified common QTL for GC and PHS resistance on chromosomes 3AL, 3BL, 3DL and 5A in a white × red wheat cross. The white-grained mutants of 'Chinese Spring' and 'AUS1490' showed increased sprouting, indicating that R genes enhanced PHS tolerance [17, 18]. Recently, Tamyb10 genes, the transcription factors of the flavonoid biosynthetic pathway, have been reported as candidate genes for the GC trait . However, how much these R genes contribute to PHS resistance remains unknown. Therefore, simultaneous genome-wide association studies (GWAS) on both traits may reveal the relationship between R genes and PHS resistance.
Genome-wide association studies have been conducted in many plant species to discover and validate QTL and genes for various traits. By taking advantages of historical recombination events and linkage disequilibrium (LD) between causal genetic variants and nearby SNPs, GWAS detects statistical associations between genetic variations and phenotypic variations throughout the genome [26–29]. Therefore, GWAS can potentially increase mapping resolution by taking advantages of historical recombinations using highly diverse populations. To date, GWAS has not been reported for GC, and only several studies have been reported for wheat PHS resistance [11, 30–32]. In the current study, we analyzed a panel of elite breeding lines and cultivars from major U.S. winter wheat breeding programs using the wheat 9K and 90K arrays to (1) study the phenotypic variance of PHS resistance in U.S. winter wheat, (2) identify genome-wide QTL for GC and PHS resistance, and (3) determine the genetic relationship between GC and PHS resistance.
A set of 185 winter wheat accessions  was assembled to include 130 hard winter wheat and 55 soft winter wheat accessions as listed in Additional file 1: Table S1. A mapping population of 155 F6 recombinant inbred lines (RILs) derived from the cross of Tutoumai A x Siyang 936 [7, 34] was used to validate the SNPs that showed significant associations with the Qphs.hwwgr-4A.
Pre-harvest sprouting evaluation
In the greenhouse experiments, five plants per accession were grown in a 13 by 13 cm Dura-pot (Hummert Int. Topeka, KS) under the growth condition listed in Additional file 2: Table S2 after vernalization for 7 weeks at 4 °C in a cold chamber. The GWAS panel was evaluated for PHS in the greenhouse experiments of fall (August-December) 2011, spring (January-May) and fall 2012, and spring 2013. All experiments were conducted in a randomized complete block design with two replications of five plants.
The GWAS panel was also planted for PHS resistance evaluation in the Kansas State University Rocky Ford Wheat Research Farm, Manhattan, KS and the Agricultural Research Center-Hays, Hays, KS, respectively, in the summers of 2013 and 2014. About 30 seeds per accession were planted in a 1.22-m-long single-row plot, and each experiment had two replications.
When wheat plants reached physiological maturity, similar to Zadoks scale 91 , spikes that lost green color  were harvested from both greenhouse and field experiments, and evaluated for PHS in the lab. Five spikes per accession that were harvested from each replicate were air-dried for 5 d in a greenhouse, and then stored at -20 °C to maintain dormancy for PHS evaluation. After all accessions had been collected, the greenhouse- harvested spikes were air-dried 9 d and field-harvested spikes for 5 d at room temperature. The additional drying days were determined based on preliminary test results of randomly selected samples from field and greenhouse experiments that maximize phenotypic differences among genotypes. After the dried spikes had been immersed in de-ionized water for 12 h, they were enclosed in a moist chamber at 22 ± 1 °C with an attached humidifier that ran twice daily at 2 h each time to maintain high moisture in the chamber. After 7 d of incubation, the germinated and non-germinated kernels were hand-threshed and counted separately to calculate the percentage of germinated kernels from all five spikes of each replication.
Evaluation of grain color
Grain color was evaluated for grains harvested from one field experiment (2009-2010 Enid Oklahoma) and the fall 2011 greenhouse experiment at Manhattan KS. For each accession, ten seeds were soaked in 1 M sodium hydroxide (NaOH) for 1 h to increase the color contrast. Grain color intensity was determined visually using a scale of 1 to 4, where 1 represents white, 2 light red, 3 red and 4 dark red.
DNA isolation and genotyping
Leaf tissue was collected at the two-leaf stage, and genomic DNA was isolated using a modified cetyltrimethyl ammonium bromide (CTAB) method . A total of 446 polymorphic SSR markers were selected to genotype the association panel based on PCR product quality, chromosome distribution in available genetic maps (http://wheat.pw.usda.gov/GG3/; verified 11 Aug. 2010), and previously reported associations with PHS resistance. One expression sequence tag (EST), ZXQ118  and three gene markers of PM19A1 and PM19A2  were used to determine the association between PHS resistance and Qphs.hwwgr-4A. Five sequence-tagged sites (STS) from three Tamyb10 genes  were analyzed to determine QTL for GC. Amplification, separation and scoring of polymorphic chain reaction (PCR) products followed Zhang et al. .
The GWAS panel was also genotyped with the Wheat 9K and 90K SNP arrays [37, 38] at USDA-ARS Cereal Crops Research Unit (Fargo, ND). SNPs with less than 5 % minor allele frequency (MAF) or with more than 15 % missing data were removed. A total of 5,921 and 21,600 SNPs were scored from the 9K and 90K SNP arrays, respectively. Association analysis was initially conducted using the 9K genotypic data, and 28 non-redundant SNPs with p < 0.001 were then selected and pooled together with the 90K data. Totally, 21,628 SNPs were used for the final analysis. Also, one SNP in the promoter region and two SNPs in the coding region of the TaPHS1 gene [3, 4] were analyzed using three Kompetitive Allele Specific PCR (KASP) assays. Sequences that harbored significant SNPs and SSR markers were searched against the W7984 reference sequence to estimate their putative chromosome positions.
Population structure and kinship
Population structure was characterized by a set of 1500 SNPs that are evenly distributed on all the 21 wheat chromosomes using the admixture model in STRUCTURE 2.3.4 . K-values ran from 2 to 20 with 10 iterations set for each k-value. The burn-in time and replication number were set at 2 × 105 and 2 × 104, respectively. For each trait, Bayesian information criterion (BIC)  was applied to determine the optimum number of subpopulations. Marker-based kinship was estimated to approximate the probability of two individuals being identical by descent through adjusting the average probability of identical in the state between random individuals . Kinship was calculated with the same set of 1,500 SNPs used for structure analysis using SPAGeDi package .
Statistical analysis and genome-wide association analysis
Best linear unbiased predictions (BLUPs) were calculated for each accession evaluated in the greenhouse and field experiments using the 'lme4' package in R 3.2.2  with year and location as random effects in the model. Genome-wide association analysis was conducted using two models: the generalized linear model (GLM) with the Q matrix as fixed effects, and the mixed linear model (MLM) with a Q matrix as fixed effects and a kinship matrix as random effects. These two models were applied to each experiment for GC and PHS resistance, and model fitness was determined based on the BIC values. Association analysis of SNP data was conducted using the genome association and prediction integrated tool (GAPIT) implemented in R , and association analysis of SSR data was conducted using PROC MIXED procedure in SAS 9.3 (SAS Institute Inc., Cary, NC). A threshold of p < 0.001 was set to claim significant associations between SSR markers and the traits (GC and PHS resistance), and p < 0.0001 was set to claim significant associations between SNPs and the traits. Linkage disequilibrium and haplotype analyses of the significant SNPs were performed with HAPLOVIEW v.4.2 (http://www.broadinstitute.org/scientific-community/science/programs/medical-and-population-genetics/haploview/haploview).
A linkage map covering the 4A QTL region was constructed for the RIL population of Tutoumai A x Siyang 936 using KASP markers converted from significant SNPs from the association study, and previously mapped SSR markers in Liu et al.  and GBS-SNPs in Lin et al.  by JoinMap version 4.0 . Recombination fractions were converted into centiMorgans (cM) using the Kosambi function . Interval mapping (IM) using sprouting data from the 2005 and 2006 greenhouse experiments and their combined mean was performed using WinQTLCart 2.5 . LOD thresholds to claim significant QTL for each dataset were determined from 1000 permutations .
Phenotypic variations in grain color and pre-harvest sprouting
Twenty-nine accessions were scored as white wheats, and 156 accessions as red wheats. The GC scores were highly consistent between greenhouse and field grown seeds (Fig. 1) with a high correlation coefficient of 0.87 (P < 0.0001), indicating a low genotype-by-environment interaction for GC.
Significant correlations for sprouting rates were observed among most of the eight experiments (Table 1). Cluster analysis showed high similarities in sprouting rates of accessions among all the field experiments, but significant differences between the field and the greenhouse experiments (Fig. 2a). The broad sense heritability across all eight experiments was high (0.83), with 0.62 in the greenhouse experiments and 0.92 in the field experiments. The population could be roughly divided into three subgroups (Fig. 2b), with average sprouting rates of 13.9 % in Group 1, 35.5 % in Group 2 and 60.3 % in Group 3, and average GC scores of 3.0 in Group 1, 2.6 in Group 2 and 1.7 in Group 3, indicating red wheats were more likely to have low sprouting rates. Most of the soft winter wheats were clustered to Group 1, as well as some hard white winter (HWW) wheat accessions from the Regional Germplasm Observation Nursery (RGON). The rest accessions from the RGON were mostly clustered to Group 2, whereas accessions from the Southern Regional Performance Nursery (SRPN) and the Northern Regional Performance Nursery (NRPN) were mainly clustered to Group 2 and Group 3.
Genome-wide association studies on grain color
According to the BIC values, the mixed model with population structure and kinship fit the best for the GC trait, thus was applied in the following analysis. GWAS detected four significant QTL on chromosomes 1B, 3A, 3B and 3D, which were represented by the gene markers for Tamyb10 genes (5 STS markers) and closely linked SSRs (6) and SNPs (12) (Table 2). Three major QTL for GC in the distal region of the long arms of group 3 chromosomes are significant for the data from both greenhouse and field experiments (Table 2). Among them, the QTL on 3DL, as indicated by significant markers Tamyb10-D1 and 3 SNPs, showed the largest effect and explained up to 23.0 % of the phenotypic variance for GC. The QTL on chromosome 3BL that was characterized by seven SNPs, two gene makers for Tamyb10-B1, and one SSR was significant in both greenhouse and field experiments, and explained up to 19.2 % of the phenotypic variance. A QTL on 3AL showed a moderate effect on GC, and explained about 11.1 % phenotypic variance. QTL on chromosomes 1B was identified in both field and greenhouse experiments and explained up to 11.7 % of the phenotypic variances. Also, three SSRs, Xwmc93, Xbarc145 and Xbarc148, were also significant for GC, but their positions cannot be determined because they were mapped to multiple chromosomes.
The association mapping population can be classified into eight genotypic groups based on the allele combinations of the gene markers of Tamyb10 genes on group 3 chromosomes. The average GC scores in each group tended to increase as the number of red color alleles increases. However, red wheat accessions T154, LA02-923, MO040192 and NC04-15533 do not contain the red alleles (abd) at any of the three loci, whereas white accessions KS05HW15-2 and OK06848W carry the red allele of Tamyb10-A1 (Abd), and white accessions KS05HW136-3 and CO03W139 carry the red allele of Tamyb10-D1 (abD) (Fig. 3), suggesting that other genes besides Tamyb10 may also contribute to GC, or the markers for Tamyb10 genes may not be diagnostic in some genetic backgrounds.
Genome-wide association studies on pre-harvest sprouting resistance
Generalized linear model with Q matrix of k = 3 was selected for GWAS on PHS resistance based on BIC values. Twelve QTL on ten chromosomes were significant for PHS resistance in at least two experiments (Table 3; Additional file 3: Table S3). Among them, QTL on chromosome 3AS, 3AL, 3B and 4AL were the most frequently identified QTL for PHS resistance. The 3AS QTL was detected in the fall 2011 greenhouse experiment and all the four field experiments, and explained 9.5 % to 15.8 % of the phenotypic variances for PHS resistance. Significant markers included one SSR, Xbarc57, five SNPs from the SNP chips and one SNP developed from the TaPHS1 gene sequence (Table 3; Additional file 3: Table S3), thus this QTL corresponds to TaPHS1. The 3AL QTL was identified by SNPs, SSR and the Tamyb10-a1 gene marker in spring 2012 and all the field experiments, and explained 6.8 % to 12.1 % of the phenotypic variances. Thus this QTL corresponds to the Tamyb10-a1 gene for GC on 3AL. The QTL on chromosome 4AL showed a wide range of effects among the experiments, and explained 9.9 % to 47.6 % phenotypic variance among the two greenhouse experiments (fall and spring 2012) and one field experiment (Manhattan, 2014). The most significant SNPs for Qphs.hwwgr-4A were Ex_c66324_1151, wsnp_Ex_c13031_20625900 and wsnp_Ex_rep_c66324_64493429. ZXQ118, an EST in the 4A QTL region (Zhang et al., 2008), and the gene markers for PM19A1 were also significant in the fall 2012 greenhouse experiment and the mean sprouting rates over all the greenhouse experiments, but explained much lower phenotypic variation than the previous three markers (Table 3; Additional file 3: Table S3). The QTL on chromosome 3B was significant in two greenhouse experiments (fall 2011 and spring 2012) and all field experiments, and explained 7.0 % to 12.3 % of the phenotypic variances for PHS resistance. Two QTL were identified on chromosome 3D with one at the distal end of the short arm (Qphs.hwwgr-3DS) and another at the distal end of the long arm (Qphs.hwwgr-3DL). Qphs.hwwgr-3DS was detected in the spring 2012 greenhouse, and two 2014 field experiments, and Qphs.hwwgr-3DL was significant in both field experiments in 2013. QTL identified on chromosome 7A was significant in the fall 2011 greenhouse and 2013 Manhattan field experiments and explained up to 13.5 % of the phenotypic variance.
Some QTL were only significant in a single environment. For example, the two QTL identified on chromosomes 2B, Qphs.hwwgr-2B.1 and Qphs.hwwgr-2B.2, were associated with PHS resistance each in one greenhouse experiment (spring 2013 and 2012, respectively), and the Qgc.hwwgr-6B.1 was identified only in one field experiment (Manhattan, 2013) (Table 4). Therefore, these QTL may be more sensitive to environmental conditions.
Relationships between grain color and pre-harvest sprouting resistance
Analysis of variance (ANOVA) was conducted by taking CG as the explanatory variable and PHS resistance as the response variable, and it showed that GC had significant effects on PHS resistance in all the field experiments (P < 0.0001), but not in any of the greenhouse experiments (Table 5). White wheat had significantly higher sprouting rates than red wheats (P < 0.0001) in the field experiments, but the difference was not significant between red-grained accessions with different color scores (data not shown).
Common QTL for GC and PHS resistance were identified on the long arms of chromosomes 3A and 3D (Table 6), but not on 3BL. The QTL on chromosome 3AL, identified by Tamyb10-A1, was significant for GC in both field and greenhouse experiments and for PHS resistance in all the field experiments. For the QTL on chromosome 3D as represented by Tamyb10-D1, one SNP was significant for GC in both experiments and also for PHS resistance in the 2013 field experiments at both Manhattan and Hays. Unlike the 3A and 3D QTL, QTL on chromosome 3B, represented by the Tamyb10-B1 as well as seven linked SNPs and one SSR, was significant for only GC, not PHS resistance in any experiments. Therefore, Tamyb10-A1 and Tamyb10-D1, but not Tamyb10-B1, were very likely to have pleiotropic effects on PHS resistance under the field conditions.
Validation of the significant SNPs for the 4A QTL in a bi-parental population
Seventeen KASP assays were designed based on the sequences of the significant SNPs identified in the 4A QTL region for PHS resistance. Four of the KASP markers (Additional file 4: Table S4) showed co-segregation among the F6 RILs of 'Tutoumai A' × 'Siyang 936', and were mapped between the two previously reported flanking GBS SNPs (GBS212432, GBS109947) for the QTL  at 1.02 cM to GBS212432 and 2.10 cM to GBS109947 (Fig. 4). These four SNPs showed the highest LOD scores in all experiments, and explained up to 31.76 % of the phenotypic variance in the population.
Linkage disequilibrium (LD) parameter r 2 was calculated to determine the linkage relationship between SNPs from different QTL and link the markers with unknown positions to known QTL. LD was calculated for the 125 SNPs that were significantly linked to nine PHS resistance QTL in at least two experiments. Strong LD was detected for SNPs within each PHS resistance QTL region, but not between different QTL (Fig. 5b), indicating that those QTL for PHS resistance were independent. Pair-wise r 2 values were also estimated for the 17 SNPs that were tightly linked to the five GC QTL. Similarly, strong LD was not detected among the SNPs linked to GC QTL in group 3 chromosomes (Fig. 5a).
Genetic positions of most significant SNPs for GC on chromosome 3D and Tamyb10-D1 could not be determined using the W7984 reference sequence, and SNPs significantly related to GC on chromosome 3D, D_GA8KES402JVT1Y_74 and BS00067163_51, were far apart from each other on the chromosome 3D. However, LD analysis suggested that these SNPs were tightly linked to Tamyb10-D1, and thus they linked to the same 3D QTL for GC (Fig. 5a).
QTL for grain color
Wheat GC has been a classic example for dissection of a quantitative trait  and three genes on wheat chromosomes group 3 have long been proposed as the genes controlling wheat GC. Several previous studies have mapped the three genes as major QTL as well as some minor QTL on chromosomes 2B, 2D, 5A and 6B for GC [1, 25, 51]. Being the first association study for wheat GC, we not only validated the effects of these three GC genes, Tamyb10-A1, Tamyb10-B1 and Tamyb10-D1, on the long arms of chromosomes group 3, but also identified a new QTL on the chromosome 1B for GC, suggesting that QTL on other chromosomes than these well-known QTL on chromosomes group 3 may also play a role in regulating GC in some wheat germplasm lines.
Groos et al.  mapped all group 3 QTL in a bi-parental population, but they did not discuss their effects of each QTL. In this study, a diverse association panel makes it possible to compare the effects of all the three QTL. Among the three genes on the chromosome group 3, Tamyb10-D1 had the largest effect on GC (R2 = 0.24) and Tamyb10-A1 the smallest (R2 = 0.11) in the association mapping panel whereas their minor allele frequencies (MAF) were similar (Table 2), indicating that the large effect of Tamyb10-D1 was not due to a higher MAF than other two genes. On the other hand, one single gene changed GC from white to red, and adding one or two additional GC genes only slightly increased redness (Fig. 3). Besides, QTL on chromosome 1B also contribute to GC, which was not reported previously, thus it is likely a new QTL for GC. That the red allele of the 1B QTL presents in the four red wheat accessions that do not carry the red alleles (abd) at any of the three Tamyb10 genes supports this assumption. Therefore, when breeding for white wheat cultivars, breeders not only need to remove the three Tamyb10 genes, but also should watch for other genes that may contribute to GC.
In this study, wheat GC was visually scored after increasing color intensities using sodium hydroxide solution. High repeatability in GC between the greenhouse and field experiments (Fig. 1) indicates that the GC scoring method used in the experiments is highly repeatable. All of the four QTL identified for GC were detected in both experiments, which provided genetic evidence that QTL for GC are relatively stable across environments.
QTL for pre-harvest sprouting resistance
QTL for PHS resistance have been mapped on almost all wheat chromosomes in previous bi-parental mapping studies. Although association studies on PHS resistance have been conducted using several types of markers [11, 30], the current study is the first report to use high density SNPs for GWAS on PHS resistance. We identified 12 QTL that were significance in at least two experiments.
For the QTL on 3AS, the causal gene (TaPHS1) has been cloned [3, 4]. One of the reported functional SNPs in the coding region  was significant in one greenhouse (fall 2011), whereas the functional SNP in the promoter region  was not significant in any of the experiments (Table 3; Additional file 3: Table S3). However, the most significant markers linked to the 3AS PHS resistance QTL were not the functional SNPs (Additional file 3: Table S3), which was probably due to environmental effects on phenotyping . We had similar result for markers in the 4A QTL region. Only one of the candidate gene markers of PM19A1 was significantly associated with PHS resistance in the fall 2012 experiment, although the 4A QTL showed an extremely large effect on PHS in that experiment (Table 3; Additional file 3: Table S3). This was probably due to the fact that the gene expression was affected by environments or the gene markers are not diagnostic.
The QTL identified at the distal end of chromosome 3DS was not reported previously. LD analysis indicated that Qgc.hwwgr-3DS is different but could be a homeologue of Qgc.hwwgr-3AS (Fig. 5b). For the QTL on chromosome 3B, the sequences of the linked SSR markers are not found in the W7984 reference sequence, thus we cannot determine whether or not the significant SSR markers and SNPs on 3B linked to the same QTL. Similarly, we cannot determine the QTL positions on chromosome 7A.
QTL identified on chromosome 1A could be the same QTL reported by Knox et al.  in durum because Xwmc183 was located near the QTL region mapped in our study based on the W7984 reference sequence. The QTL on chromosome 2D is the same QTL as QPhs.ccsu-2D.4  because of the common SSR Xgwm539. However, we cannot determine whether the QTL that were identified on chromosomes 1D, 5A, 5B, 6A and 6B were the same QTL reported in previous studies [1, 9, 30, 51, 54] due to the lack of common markers.
Variation of PHS resistance across environments
PHS is a complicated trait affected by many factors, including seed dormancy (SD) [15, 55–57], GC [1, 58], spike morphology, as well as environmental factors such as temperature, moisture and photoperiod after flowering [59, 60]. In the current study, PHS resistance of the tested accessions and QTL effects varied across environments with more variation observed among the greenhouse experiments than that among the field experiments (Fig. 2a). A total of four greenhouse experiments were conducted in the fall greenhouse cycles of 2011 and 2012 with the harvest time in winter, and the spring cycles of 2012 and 2013 with the harvest time in summer. The two seasons were highly different in growing and post-harvesting temperatures, which has been shown to influence PHS resistance [3, 6]. Meanwhile, in the field experiments at Manhattan and Hays, dry hot winds shortened maturity period, which greatly reduced environment effects on wheat PHS resistance. Therefore, PHS resistance was similar in the four field experiments.
Qphs.hwwgr-3AS and Qphs.hwwgr-4A were the major QTL for PHS resistance, and most frequently identified in all experiments. However, Qphs.hwwgr-3AS was detected more frequently in the field experiments, while Qphs.hwwgr-4A was detected more frequently in the greenhouse conditions (Table 3; Additional file 3: Table S3), which might be due to high temperatures in field conditions during late grain maturation that suppressed the expression of Qphs.hwwgr-4A .
According to the heat map derived from individual PHS ratings across all the experiments, the population can be roughly divided into three clusters (Fig. 2b). Most of the soft winter wheats had low germination rates, and were clustered to Group 1. Wheat cultivars from RGON were mostly clustered to Group 1 and Group 2, whereas accessions in SRPN and NRPN showed higher germination rates, and were mainly clustered to Group 2 and Group 3. These results indicated that the soft winter wheat accessions grown in the humid climate during harvest season had a higher selective pressure on PHS resistance than the hard winter wheat accessions from the Great Plains that grown under relatively drier climate.
Validation of the markers for the QTL on 4A
In this study, a RIL population from ‘Tutoumai A’ x ‘Siyang 936’ was used to validate the position of significant SNPs for 4A PHS resistance QTL. Four polymorphic SNPs from GWAS were successfully mapped to the QTL region, and they are more closely linked to PHS resistance than previously reported flanking markers, GBS212432 and GBS109947, for this QTL . This result indicates that GWAS provides more power to increase marker density and mapping resolution, whereas bi-parental populations can further validate the positions of new markers. Barrero et al.  proposed PM19A1 and PM19A2 as the candidate genes for the 4A QTL and identified causal deletions in PM19A1 and PM19A2. We analyzed the markers developed based on the causal variation in Tutoumai A and Siyang 936, but did not find any polymorphism between the two parents. Therefore, a different gene or different causal SNP in the gene may control the PHS resistance of 4A QTL in this population, which was also supported by the results from the GWAS that the candidate gene markers contributed much lower phenotypic variation for PHS resistance than three other SNP markers (Ex_c66324_1151, wsnp_Ex_c13031_20625900, wsnp_Ex_rep_c66324_64493429) (Table 3; Additional file 3: Table S3).
Effect of grain color QTL on pre-harvest sprouting resistance
GC has been considered as an important factor for PHS resistance, and previous studies showed that seed dormancy level of a white-grained wheat line was improved by the introgression of an R gene . In the current study, GC explained 26 % to 44 % of the phenotypic variance for PHS resistance, and Tamyb10-A1 and Tamyb10-D1 showed significant effects on both GC and PHS resistance, which agree with a previous study . Tamyb10 genes encode R2R3-type MBY transcription factors, which regulate the accumulation of PA in the biosynthesis pathways . Therefore, it is possible that these transcription factors showed pleiotropic effects by regulating more than one metabolism pathway, and had effects on improving wheat PHS resistance. However, the GC gene on 3BL, Tamyb10-B1, did not show any effect on PHS resistance in this study (Table 2; Table 6).
In this study, GC was significantly related to PHS resistance in field experiments, but had barely any effect in the greenhouse experiments (Table 5). Also, the Tamyb10-A1 gene affected PHS resistance in all of the four field experiments, and the Tamyb10-D1 gene only affected PHS resistance in the 2013 experiments. Such results suggested that environmental factors could be important triggers of pleiotropic effects of the GC genes on PHS resistance. That Tamyb10-B1 did not show any effect on PHS resistance might be due to the field environments of this study that could not trigger the expression of pleiotropic effect of the gene.
Although some GC genes contributed to wheat PHS resistance, many QTL for PHS resistance did not affect GC. Therefore, some red wheats can be highly susceptible to PHS, while some white wheats can be highly resistant [61, 62]. Breeding for PHS resistance, attention should be paid to these QTL with a major effect on PHS in most environments without a pleiotropic effect on GC, such as these on 3AS and 4AL. Pyramiding several of these genes in one cultivar should be able to avoid PHS damage in U.S. HWW.
Using genome-wide association mapping, we identified four QTL for GC and 12 QTL for PHS resistance using 9K and 90K wheat SNP arrays. Besides three Tamyb10 genes cloned from group 3 chromosomes showed significant effects on GC, a new QTL on chromosome 1B also contributed to GC. Among them, Tamby10-A1 and Tamyb10-D1 showed a pleiotropic effect on PHS resistance under field conditions. Several other QTL that did not affect GC trait, especially the QTL on chromosomes 3AS and 4AL, showed significant effects on PHS resistance, thus they can be pyramided to improve PHS resistance in white wheat cultivars.
Quantitative trait loci
Genome-wide association study
Single nucleotide polymorphism
Recombinant inbred line
Polymorphic chain reaction
Minor allele frequency
Kompetitive Allele Specific PCR
Bayesian information criterion
Best linear unbiased prediction
Genome association and prediction integrated tool
Regional Germplasm Observation Nursery
Southern Regional Performance Nursery
Northern Regional Performance Nursery
Groos C, Gay G, Perretant MR, Gervais L, Bernard M, Dedryver F, Charmet G. Study of the relationship between pre-harvest sprouting and grain color by quantitative trait loci analysis in a white × red grain bread-wheat cross. Theor Appl Genet. 2002;104:39–47.
Mares DJ, Mrva K, Cheong J, Williams K, Watson B, Storlie E, Sutherland M, Zou Y. A QTL located on chromosome 4A associated with dormancy in white- and red-grained wheats of diverse origin. Theor Appl Genet. 2005;111:1357–64.
Nakamura S, Abe F, Kawahigashi H, Nakazono K, Tagiri A, Matsumoto T, Utsugi S, Ogawa T, Handa H, Ishida H, Mori M, Kawaura K, Ogihara Y, Miura H. A wheat homolog of MOTHER OF FT AND TFL1 acts in the regulation of germination. Plant Cell. 2011;23:3215–29.
Liu S, Sehgal SK, Li J, Lin M, Trick HN, Yu J, Gill BS, Bai G. Cloning and characterization of a critical regulator for preharvest sprouting in wheat. Genetics. 2013;195:263–73.
Cabral AL, Jordan MC, McCartney CA, You FM, Humphreys DG, MacLachlan R, Pozniak CJ. Identification of candidate genes, regions and markers for pre-harvest sprouting resistance in wheat (Triticum aestivum L.). BMC Plant Biol. 2014;14:340.
Barrero JM, Cavanagh C, Verbyla KL, Tibbits JF, Verbyla AP, Huang BE, Rosewarne MG, Stephen S, Wang P, Whan A, Rigault P, Hayden JM, Gubler F. Transcriptomic analysis of wheat near-isogenic lines identifies PM19-A1 and A2 as candidates for a major dormancy QTL. Genome Biol. 2015;16:93.
Lin M, Cai S, Wang S, Liu S, Zhang G, Bai G. Genotyping-by-sequencing (GBS) identified SNP tightly linked to QTL for pre-harvest sprouting resistance. Theor Appl Genet. 2015;128:1385–1395.
Torada A, Koike M, Ogawa T, Takenouchi Y, Tadamura K, Wu J, Matsumoto T, Kawaura K, Ogihara Y. A Causal Gene for Seed Dormancy on Wheat Chromosome 4A Encodes a MAP Kinase Kinase. Curr Biol. 2016;http://dx.doi.org/10.1016/j.cub.2016.01.063.
Kulwal PL, Singh R, Balyan HS, Gupta PK. Genetic basis of pre-harvest sprouting tolerance using single-locus and two-locus QTL analyses in bread wheat. Funct Integr Genomics. 2004;4:94–101.
Munkvold JD, Tanaka J, Benscher D, Sorrells ME. Mapping quantitative trait loci for preharvest sprouting resistance in white wheat. Theor Appl Genet. 2009;119:1223–35.
Kulwal P, Ishikawa G, Benscher D, Feng Z, Yu LX, Jadhav A, Mehetre S, Sorrells ME. Association mapping for pre-harvest sprouting resistance in white winter wheat. Theor Appl Genet. 2012;125:793–805.
Zhang XQ, Li C, Tay A, Lance R, Mares D, Cheong J, Cakir M, Ma J, Appels R. A new PCR-based marker on chromosome 4AL for resistance to pre-harvest sprouting in wheat (Triticum aestivum L.). Mol Breed. 2008;22:227–36.
Imtiaz M, Ogbonnaya FC, Oman J, Ginkel MV. Characterization of quantitative trait loci controlling genetic variation for preharvest sprouting in synthetic backcross derived wheat lines. Genetics. 2008;178:1725–36.
Kato K, Nakamura W, Tabiki T, Miura H, Sawada S. Detection of loci controlling seed dormancy on group 4 chromosomes of wheat and comparative mapping with rice and barley genomes. Theor Appl Genet. 2001;102:980–5.
Anderson JA, Sorrells ME, Tanksley SD. RFLP analysis of genomic regions associated with resistance to preharvest sprouting in wheat. Crop Sci. 1993;33:453–9.
Flintham JE. Different genetic components control coatimposed and embryo-imposed dormancy in wheat. Seed Sci Res. 2000;10:43–50.
Warner RL, Kudrna DA, Spaeth SC, Jones SS. Dormancy in white-grain mutants of Chinese Spring wheat (Triticum aestivum L.). Seed Sci Res. 2000;10:51–60.
Himi E, Mares DJ, Yanagisawa A, Noda K. Effect of grain colour gene (R) on grain dormancy and sensitivity of the embryo to abscisic acid (ABA) in wheat. J Exp Bot. 2002;53(374):1569–74.
Miyamoto T, Everson EH. Biochemical and physiological studies of wheat seed pigmentation. Agron J. 1958;50:733–4.
McCallum JA, Walker JRL. Proanthocyanidins in wheat bran. Cereal Chem. 1990;67(3):282–5.
Sears ER. Cytogenetic studies with polyploid species of wheat. II. Additional chromosomal aberrations in Triticum vulgare. Genetics. 1944;29:232–46.
Allan RE, Vogel OA. Monosomic analysis of red seed color in wheat. Crop Sci. 1965;5:474–5.
Metzger RJ, Silbaugh BA. Location of genes for seed coat color in hexaploid wheat, Triticum aestivum L. Crop Sci. 1970;10:495–6.
Stoy V, Sundin K. Effects of growth regulating substances in cereal grain germination. Cereal Res Commun. 1976;4:157–63.
Himi E, Maekawa M, Miura H, Noda K. Development of PCR markers for Tamyb10 related to R-1, red grain color gene in wheat. Theor Appl Genet. 2011;122:1561–76.
Flint-Garcia SA, Thornsberry JM, Buckler IV ES. Structure of linkage disequilibrium in plants. Annu Rev Plant Biol. 2003;54:357–74.
Nordborg M, Weigel D. Next-generation genetics in plants. Nature. 2008;456:720–3.
Myles S, Peiffer J, Brown PJ, Ersoz ES, Zhang Z, Costich DE, Buckler ES. Association mapping: critical considerations shift from genotyping to experimental design. Plant Cell. 2009;21:2194–202.
Lipka AE, Kandianis CB, Hudson ME, Yu J, Drnevich J, Bradbury PJ, Gore MA. From association to prediction: statistical methods for the dissection and selection of complex traits in plants. Curr Opin Plant Biol. 2015;24:110–8.
Rehman Arif MA, Neumann K, Nagel M, Kobiljski B, Lohwasser U, Börner A. An association mapping analysis of dormancy and pre-harvest sprouting in wheat. Euphytica. 2012;188:409–17.
Jaiswal V, Mir RR, Mohan A, Balyan HS, Gupta PK. Association mapping for pre-harvest sprouting tolerance in common wheat (Triticum aestivum L.). Euphytica. 2012;188:89–102.
Albrecht T, Oberforster M, Kempf H, Ramgraber L, Schacht J, Kazman E, Zechner E, Neumayer A, Hartl L, Mohler V. Genome-wide association mapping of preharvest sprouting resistance in a diversity panel of European winter wheats. J Appl Genet. 2015;56(3):277–85. doi:10.1007/s13353-015-0286-5.
Zhang D, Bai G, Zhu C, Yu J, Carver BF. Genetic diversity, population structure, and linkage disequilibrium in U.S. elite winter wheat. Plant Genome. 2010;3:117–27.
Liu S, Cai S, Graybosch R, Chen C, Bai G. Quantitative trait loci for resistance to pre-harvest sprouting in US hard white winter wheat Rio Blanco. Theor Appl Genet. 2008;117:691–9.
Zadoks JC, Chang TT, Konzak CF. A decimal code for the growth stages of cereals. Weed Res. 1974;14(6):415–21.
Trethowan RM. Evaluation and selection of bread wheat (Triticum aestivum L) for preharvest sprouting tolerance. Aust J Agric Res. 1995;46:463–74.
Cavanagh CR, Chao S, Wang S, Huang BE, Stephen S, Kiani S, Forrest K, Saintenac C, Brown-Guedira GL, Akhunova A, et al. Genome-wide comparative diversity uncovers multiple targets of selection for improvement in hexaploid wheat landraces and cultivars. Proc Natl Acad Sci U S A. 2013;110:8057–62.
Wang S, Wong D, Forrest K, Allen A, Chao S, Huang BE, Maccaferri M, Salvi S, Milner SG, Cattivelli L, Mastrangelo AM, Whan A, Stephen S, Barker G, Wieseke R, Plieske J, International Wheat Genome Sequencing Consortium, Lillemo M, Mather D, Appels R, Dolferus R, Brown-Guedira G, Korol A, Akhunova AR, Feuillet C, Salse J, Morgante M, Pozniak C, Luo M-C, Dvorak J, et al. Characterization of polyploid wheat genomic diversity using a high-density 90 000 single nucleotide polymorphism array. Plant Biotech J. 2014;12:787–96.
Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155:945–59.
Schwarz G. Estimating the dimension of a model. Ann Stat. 1978;6:461–4.
Yu J, Pressoir G, Briggs WH, Bi IV, Yamasak M, Doeble JF, McMullen MD, Gaut BS, Nielson DM, Holland JB, Kresovich S, Kresovich S. A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat Genet. 2006;38(2):203–8.
Hardy OJ, Vekemans X. spagedi: a versatile computer program to analyze spatial genetic structure at the individual or population levels. Mol Ecol Notes. 2002;2:618–20. doi:10.1046/j.1471-8286.2002.00305.x.
Bates D, Maechler M, Bolker B, Walker S. lme4: Linear mixed-effects models using Eigen and S4. R package version 1.1-7. 2014;http://CRAN.R-project.org/package=lme4.
Lipka AE, Tian F, Wang Q, Peiffer J, Li M, Bradbury PJ, Gore MA, Buckler ES, Zhang Z. GAPIT: genome association and prediction integrated tool. Bioinformatics. 2012;28(18):2397–9.
Liu S, Bai G, Cai S, Chen C. Dissection of genetic components of preharvest sprouting resistance in white wheat. Mol Breed. 2011;27:511–23.
Van Ooijen JW. JoinMap® 4, Software for the calculation of genetic linkage maps in experimental populations. Wageningen, Netherlands: Kyazma BV; 2006.
Kosambi DD. The estimation of map distances from recombination values. Ann Eugen. 1944;12:172–5.
Wang S, Basten CJ, Zeng ZB. Windows QTL Cartographer 2.5. Raliegh: Departmentof Statistics, North Carolina State University; 2005. http://statgen.ncsu.edu/qtlcart/WQTLCart.htm.
Doerge RW, Churchill GA. Permutation tests for multiple loci affecting a quantitative character. Genetics. 1996;142:285–94.
Nilsson-Ehle H. Kreuzungsuntersuchungen an hafer und weizen. Lunds Universitets Arsskrift, N.F. Afd 2, Bd 5, Nr 2. 1909;pp 122.
Kumar A, Kumar J, Singh R, Garg T, Chhuneja P, Balyan HS, Gupta PK. QTL analysis for grain colour and pre-harvest sprouting in bread wheat. Plant Sci. 2009;177:114–22.
Knox RE, Clarke FR, Clarke JM, Fox SL. Genetic analysis of pre-harvest sprouting in a durum wheat cross. Euphytica. 2005;143:261–4.
Mohan A, Kulwal P, Singh R, Kumar V, Mir RR, Kumar J, Prasad M, Balyan HS, Gupta PK. Genome-wide QTL analysis for pre-harvest sprouting tolerance in bread wheat. Euphytica. 2009;168:319–29.
Roy JK, Prasad M, Varshney RK, Balyan HS, Blake TK. Identification of a microsatellite on chromosomes 6B and a STS on 7D of bread wheat showing an association with preharvest sprouting tolerance. Theor Appl Genet. 1999;99:336–40.
Bewley JD, Black M. Physiological and biochemistry of seeds in relation to germination, vol. 2. Heidelberg: Springer; 1982. p. 61–81.
Mares DJ, Mrva K. Mapping quantitative trait loci associated with variation in grain dormancy in Australian wheat. Crop Pasture Sci. 2001;52:1257–65.
Ogbonnaya FC, Imtiaz M, Ye G, Hearnden PR, Hernandez E, Eastwood RF, Ginkel MV, Shorter SC, Winchester JM. Genetic and QTL analyses of seed dormancy and preharvest sprouting resistance in the wheat germplasm CN10955. Theor Appl Genet. 2008;116:891–902.
Gfeller F, Svejda F. Inheritance of post-harvest seed dormancy and kernel color in spring wheat lines. Can J Plant Sci. 1960;40:1–6.
Argel PJ, Humphreys LR. Environmental effects on seed development and hardseededness in Stylosanthes hamata cv. Verano. I. Temperature. Crop Pasture Sci. 1983;34:261–70.
Ceccato DV, Daniel Bertero H, Batlla D. Environmental control of dormancy in quinoa (Chenopodium quinoa) seeds: two potential genetic resources for preharvest sprouting tolerance. Seed Sci Res. 2011;21:133–41.
Torada A, Amano Y. Effect of seed coat color on seed dormancy in different environments. Euphytica. 2002;126(1):99–105.
Bi HH, Sun YW, Xiao YG, Xia LQ. Characterization of DFR allelic variations and their associations with pre-harvest sprouting resistance in a set of red-grained Chinese wheat germplasm. Euphytica. 2014;195(2):197–207.
This is contribution number 17-001-J from the Kansas Agricultural Experiment Station. Authors thank Dr. William Bockus in Plant Pathology Department, Kansas State University for help with Manhattan field experiments. Mention of trade names or commercial products in this publication is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the US Department of Agriculture. USDA is an equal opportunity provider and employer.
This study was partially funded by the National Research Initiative Competitive Grants CAP Project (2011-68002-30029) from the USDA National Institute of Food and Agriculture.
Availability of supporting data
All supporting data were included as additional files.
GB conceived and designed this study. ML, DZ, SL, GZ carried out the experiments, ML, DZ, JY analyzed the data, and ML, GB, AF wrote the manuscript. All authors revised the manuscript and approved the final version of the manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
Table S1. Pedigree, sources, origins and preharvest sprouting phenotypic data of 185 accessions in the genome-wide association study panel. (XLSX 121 kb)
Table S2. Environmental statistics of greenhouse and field conditions in Manhattan and Hays (XLSX 44 kb)
Table S3. Quantitative trait loci identified at least twice among eight experiments using the association mapping population and the Best Linear Unbiased Prediction (BLUP) of greenhouse and field data of wheat sprouting rates in a spike. (XLSX 59 kb)
Table S4. Kompetitive Allele Specific PCR assays developed from significant SNPs for the 4A pre-harvest sprouting resistance quantitative trait locus. (XLSX 33 kb)
About this article
Cite this article
Lin, M., Zhang, D., Liu, S. et al. Genome-wide association analysis on pre-harvest sprouting resistance and grain color in U.S. winter wheat. BMC Genomics 17, 794 (2016). https://doi.org/10.1186/s12864-016-3148-6
- Triticum aestivum
- pre-harvest sprouting resistance
- grain color
- genome- wide association studies