Genome-wide association mapping of quantitative resistance to sudden death syndrome in soybean
BMC Genomics volume 15, Article number: 809 (2014)
Sudden death syndrome (SDS) is a serious threat to soybean production that can be managed with host plant resistance. To dissect the genetic architecture of quantitative resistance to the disease in soybean, two independent association panels of elite soybean cultivars, consisting of 392 and 300 unique accessions, respectively, were evaluated for SDS resistance in multiple environments and years. The two association panels were genotyped with 52,041 and 5,361 single nucleotide polymorphisms (SNPs), respectively. Genome-wide association mapping was carried out using a mixed linear model that accounted for population structure and cryptic relatedness.
A total of 20 loci underlying SDS resistance were identified in the two independent studies, including 7 loci localized in previously mapped QTL intervals and 13 novel loci. One strong peak of association on chromosome 18, associated with all disease assessment criteria across the two panels, spanned a physical region of 1.2 Mb around a previously cloned SDS resistance gene (GmRLK18-1) in locus Rfs2. An additional variant independently associated with SDS resistance was also found in this genomic region. Other peaks were within, or close to, sequences annotated as homologous to genes previously shown to be involved in plant disease resistance. The identified loci explained an average of 54.5% of the phenotypic variance measured by different disease assessment criteria.
This study identified multiple novel loci and refined the map locations of known loci related to SDS resistance. These insights into the genetic basis of SDS resistance can now be used to further enhance durable resistance to SDS in soybean. Additionally, the associations identified here provide a basis for further efforts to pinpoint causal variants and to clarify how the implicated genes affect SDS resistance in soybean.
Sudden death syndrome (SDS) of soybean [Glycine max (L.) Merr.], caused by the soil-borne fungal pathogen Fusarium virguliforme, is a considerable threat to soybean production . The fungus infects soybean root systems and produces toxins that are translocated to the leaves, resulting in premature defoliation and pod abortion [3, 4]. In recent years, SDS ranked among the top five most damaging diseases of soybean in the United States . In the Midwestern soybean producing area of the U.S., it is estimated that SDS has resulted in average losses valued at $190 million a year .
Host plant resistance is believed to be the most effective control measure for SDS . Since no soybean genotypes confer complete immunity to this disease, soybean breeders still rely on quantitative resistance to SDS [7, 8]. The wide range of variation of susceptibility to both leaf scorch and root rot also provides a great opportunity to improve SDS resistance through genetic manipulation . However, most of what we know about the genetic architecture of SDS resistance is based on traditional quantitative trait locus (QTL) linkage mapping using bi-parental populations. Fourteen QTLs dispersed throughout the genome, underlying resistance to root infection, leaf scorch or both, have been confirmed in several bi-parental populations . However, the large confidence intervals for those QTLs impair the precise identification of causative genes. To date, only one resistance gene (GmRLK18-1) has been tagged and cloned . This gene is at 1.071 kbp on chromosome 18, within the major resistance QTL (Rfs2). Association mapping, which exploits historical recombination events at the population level, has become a powerful alternative to linkage mapping in the dissection of complex trait variation at the sequence level . A more specific strategy, genome-wide association (GWA) mapping, is a powerful complementary strategy for classical bi-parental linkage mapping to dissect complex traits and has been used with success in Arabidopsis, rice  and maize [15, 16]. The use of association mapping in soybean was therefore desirable to improve the mapping of important traits in soybean. So far, only a few association mapping studies, with limited numbers of markers, have been reported for dissecting agronomic traits in soybean [17, 18]. To the best of our knowledge, GWA mapping has not yet been employed to study any traits related to soybean disease resistance. Recently, the availability of a soybean reference genome sequence and the development of high throughput SNP assays has enabled GWA mapping in soybean [19, 20]. A previous study reported that approximately 1% of the 6,037 Plant Introductions (PIs) from the USDA Soybean Germplasm Collection were partially resistant to SDS . Therefore, conducting an association study in assembled PI collections might not be feasible. Furthermore, previous research indicated that most of the PIs showing resistance to soybean cyst nematode (SCN) were also partially resistant to SDS . Therefore, association mapping with released elite cultivars is more likely to identify superior resistance alleles that have been captured and accumulated by SCN or SDS breeding practices.
The goal of this study was to investigate the genetic architecture of soybean SDS resistance in released elite soybean cultivars. Here we present the first experimental results of GWA mapping for SDS, across two independent panels of elite soybean cultivars, using a high-density customized oligonucleotide genotyping array. We detected 20 QTLs including known candidate genes (or QTLs) as well as new candidate loci in the soybean genome. The identification of these loci will increase our understanding of mechanisms underlying SDS resistance, and provide valuable markers for breeding soybean lines with SDS resistance.
Sampling and genotyping
Two independent experiments were conducted in this study. Experiment 1 was done with a mapping population of 392 diverse soybean cultivars (association panel P1), consisting of 251 varieties released between 2010 and 2012 and 141 advanced breeding lines from Michigan State University. Experiment 2 used a set of 300 diverse G. max advanced breeding lines (association panel P2) developed by public breeders. The germplasm was chosen to represent a range of materials developed for the U.S. North Central soybean production area. Further information about the P1 and P2 panels is given in Additional file 1.
Soybean genomic DNA was extracted from young leaf tissue following the previously described method . All the accessions in panel P1 were genotyped using the Illumina SoySNP50k iSelect BeadChip (Illumina, San Diego, Calif. USA) which consists of 52,041 SNPs . All the accessions in panel P2 were genotyped using the Illumina SoySNP6k iSelect BeadChip (Illumina, San Diego, Calif. USA), which consists of 5,361 SNPs . The chromosomal distributions of the SNPs of SoySNP50K and SoySNP6k BeadChip are shown in Additional file 2. Genotypes were called using the program GenomeStudio (Illumina, San Diego, Calif. USA). The SNP data were coded according to the standard codes for nucleotides derived from the International Union of Pure and Applied Chemistry (IUPAC). The quality of each SNP was checked manually as previously reported . SNPs without physical position information and with low quality (call rate < 80% and or minor allele frequency < 0.05) across all samples were removed from the dataset.
Field resistance evaluation
The association panel P1 was evaluated for SDS resistance in a naturally infested SDS disease nursery at Decatur, Michigan during the growing season (May-October) in 2011 and 2012, where consistent, natural and heavy SDS disease symptoms was observed on susceptible checks. Four replications per year were grown in a lattice design with four-row plots 6 meters long. The association panel P2 was divided into four groups based on the maturity group I to IV, and were evaluated for SDS resistance during the summers of 2011 and 2012 in 14 locations including Michigan (Decatur), Iowa (Kanawha and Ames), Minnesota (Waseca and Rosemount), Illinois (Manito, Streator, Fairbury, Beardstown, Urbana, Shawnee town and Valmeyer), Missouri (Sikeston) and Ontario, Canada (Harrow). Four replications per year were grown in a lattice design. SDS was evaluated by scoring disease incidence (DI) and disease severity (DS) at the R6 growth stage, the stage at which pods contain full-size green beans at one of the four uppermost nodes with a completely unrolled leaf . SDS leaf scorch DI was rated from 0% (no disease) to 100% (all plants symptomatic), and DS was measured on a scale from 1 to 9 as described in Additional file 3 (after Bond, J. unpublished). The SDS disease index (DX, 0–100) was calculated as DI × DS/9. In panel P1, mean values of DI, DS and DX across replicates and years were used in association analysis throughout the study. The trait distribution for DX and DI was slightly skewed towards susceptible, thus a square root transformation was used to normalize the trait distribution prior to further analysis. The association panel P2 was phenotyped in multiple environments; best linear unbiased predictors (BLUPs) were used for the overall association analysis in panel P2. The BLUPs for each line were calculated with the R package, lme4, using the equation Y ijk = Line k + Environment i + Replicate (Environment) ij + (Line × Environment) ik + ϵ ijk , where Y ijk is the observed phenotype for the kth line in the jth replicate of the ith environment; Linek is the random effect of the kth line; Environment i is the random effect of the ith environment; Replicate (Environment) ij is the random effect of the jth replicate in the ith environment; (Environment × Line) ik is the random interaction effect of the ith environment and the kth line, and ϵ ijk is the error term. Analysis of variance (ANOVA) for the phenotypic data was performed with the R package, lm(stats) and anova.lm(stats). The heritability estimates were calculated using variance components obtained by ANOVA .
Population genetic analysis
Principal components analysis and neighbor-joining (NJ) trees were applied to infer population stratification. A pairwise distance matrix derived from the Nei’s genetic distance for all polymorphic SNPs was calculated to construct Neighbor-joining trees under PowerMarker version 3.25 . Principal component analysis (PCA) was done using EIGENSTRAT  based on 5,578 SNPs and 2,587 SNPs with minor allele frequency (MAF) >20% and physical distance >60 kb for panels P1 and P2, respectively. Kinship matrices were calculated using TASSEL 4.0  to determine relatedness among individuals based on the same sets of SNPs for the two panels [see Additional file 4]. Linkage disequilibrium parameter (r2) for estimating the degree of LD between pair-wise SNPs (30,345 SNPs for panel P1 and 4,297 SNPs for panel P2 with MAF ≥5%) was calculated using the software TASSEL 4.0  with 1,000 permutations. The LD decay rate was measured as the chromosomal distance at which the average pairwise correlation coefficient (r2) dropped to half its maximum value.
Genome-wide association analysis
Two different models were used to test associations between the SNPs (MAF >5%) and disease assessment criteria. The first model was a simple model where a general linear model (GLM), containing only the SNP tested as a fixed effect, was used to test the association between the SNP and the disease assessment criteria. The second model is a mixed linear model (MLM) where, in addition to the SNP being tested, PCA matrix and relative kinship matrix were included as fixed and random effects, respectively. The GLM and MLM can be expressed as y = Xα + e and y = Xα + Pβ + Kμ + e, respectively, where y is the vector of phenotypic observations, α is the vector of SNP effects, β is the vector of population structure effects, μ is the vector of kinship background effects, e is the vector of residual effects, P is the PCA matrix relating y to β, X and K are incidence matrices of 1 s and 0 s relating y to α and μ, respectively . Top six principal components were used to build up the P matrix for population structure correction in the two panels. Analyses were performed by the software TASSEL 4.0 which implemented the EMMA and P3D algorithms to reduce computing time . False discovery rate (FDR) ≤ 0.05 was used to identify significant associations. In order to conduct conditional analyses to test for residual adjacent associations after accounting for a key SNP within the same chromosome, the key SNP was transformed to a numeric value and then added into the MLM as a covariate. A P-value threshold of 10-4, corresponding to an adjustment for 500 independent tests across the region examined, was used to declare statistical significance at secondary signals.
Results and discussion
Genetic diversity and phenotypic variation
Two independent association panels (P1 and P2) were genotyped using Illumina BeadChip containing 52,041 and 5,361 SNPs, respectively. SNPs with MAF of <0.05 and call rate <80% were excluded from further analyses to avoid problems of spurious LD. Final sets of 30,345 and 4,297 high-performing SNPs were used for all analyses. Among these SNPs, samples had an average call rate of >96.5% and between technical replicates yielded >99% pairwise concordances. From these SNPs, we observed an average nucleotide diversity (polymorphism information content or PIC) of 0.281 and 0.284 in panels P1 and P2, respectively. Compared with a previous study , these estimates showed that the overall genetic variation of the elite cultivars we studied represents about 80% diversity of soybean landraces. Less than 1.6% of heterozygous genotypes were observed in both panels, which is consistent with the highly inbred nature of cultivated soybean (Table 1). An examination of allele frequency distributions at polymorphic SNPs showed that both panels contained a large number of SNPs with a minor allele frequency (MAF) of <0.1 [see Additional file 5], reflecting the broad genetic diversity in the two association panels.
In both association panels, we observed abundant phenotypic variation in SDS resistance measured by disease incidence (DI), disease severity (DS) and calculated disease index (DX, see Additional file 6). The mean DX distribution ranged from 0 to 96.3 in panel P1 and 0 to 82.0 in panel P2. The broad-sense heritability of DX was higher within two environments (two years in one location) in panel P1 (83%) than that across multiple environments (two years across 14 locations) in panel P2 (average 65%) [see Additional file 6].
Patterns of linkage disequilibrium
To characterize the mapping resolution for genome scans and GWA mapping, we quantified the average extent of genome-wide LD decay distance in panel P1 and P2. These estimates were approximately 270 kb and 460 kb, respectively, where the r2 drops to half its maximum value (0.24 and 0.19, respectively). Given that our average inter-marker distance (density) is 35 kb and 241 kb for the panels P1 and the P2 respectively, we expect to have reasonable power to identify common large effect variants associated with SDS resistance in both association panels. Overall LD decay distance in panel P1 was smaller than that in the panel P2 (Figure 1). LD decay distance in panel P1 was also smaller than previously published values in soybean [33, 34]. This difference may be attributed to smaller sample size and lower genome coverage of markers in P2 and previous studies. Since panel P1 had larger sample size and was genotyped with more SNP markers than panel P2, estimation of LD in panel P1 is more reliable. Linkage disequilibrium decay distance varies over different chromosomes, with 410 kb in chromosome 19, 100–200 kb in chromosomes 3, 4, 9, 10, 11 and 16, and 200–300 kb in the remaining 13 chromosomes in panel P1. These LD decay estimates are slightly higher than that in rice (75-150 kb) , but much greater than in maize (1.5-10 kb) . This result is consistent with earlier estimate that LD extends to a much longer distance in self-pollinated species than in cross-pollinated species .
According to the NJ tree analysis as well as PCA, association panel P1 had 4 genetic subgroups (Figure 2a and b), whereas panel P2 had 6 subgroups (Figure 2c and d). It has previously been suggested that the photoperiod response between different maturity groups may be the primary factor driving differentiation of cultivated soybean . A Chi-square test was used to test whether the SNP-data-based clustering (NJ tree) is associated with maturity-group-based grouping in panel P1 and P2. The results showed very significant association (P < 0.0001) between the two grouping factors. Thus the photoperiod response might have driven genetic differentiation among the tested accessions in both panels [see Additional file 7]. The measure of population differentiation, FST, was estimated at 0.168 among the four subgroups of panel P1, suggesting a moderate level of differentiation within panel P1 [see Additional file 8]. The population differentiation of 6 subgroups within panel P2 was slightly less (FST = 0.135) but still similar to that between different soybean landraces (FST = 0.130) .
GWA mapping for SDS resistance
Using the GWA strategy to dissect genetic architecture of SDS resistance in the two soybean association panels (P1 and P2), we successfully identified both known associations (candidate genes or QTLs previously reported in soybean), as well as new candidate loci in the soybean genome. The results of significant SNPs discovered in both association panels are summarized in Additional file 9, 10, 11, 12, 13, 14, and Figure 3. As shown in the quantile-quantile (QQ) plots (Figure 3b and f, Additional file 13b and f, and Additional file 14b and f), the distribution of observed -log10 P-values from the simple model, which did not include population structure (Q) and familial relatedness (K), departed from the expected distribution under a model of no association with significant inflation of nominal P-values. While the MLM method, which includes Q and K, allowed us to reduce the excess low P-values for DS, DI and DX (Figure 3d and h, Additional file 13d and h, and Additional file 14d and h). In both association panels, lower inflation of nominal P-values was consistently observed when the MLM method was used than when the simple model was used. Therefore, only the results from the analysis with the MLM model are presented below.
Of the 52,041 SNPs evaluated in association panel P1, 30, 48 and 56 SNPs were significant, with FDR ≤ 0.05 for DS, DI and DX, respectively (Additional file 9, 10, 11, Figure 3a and c, Additional file 13a and c, and Additional file 14a and c). From the 5,361 SNPs evaluated in association panel P2, 6, 8 and 9 SNPs were significant, with FDR ≤ 0.05 for DS, DI and DX, respectively (Figure 3e and g, Additional file 12, Additional file 13e and g and Additional file 14e and g). To select major QTLs among all the significant SNPs, these SNPs were clumped by using LD block as a criterion , and the strongest association within each LD block was kept. After the clumping of SNPs, 20 QTLs for SDS resistance were identified and peak SNPs (strongest associations) are listed in Table 2. The peak SNPs at the identified loci explained approximately 54.5% of the phenotypic variance on average (ranging from 35.7% to 75.4% for different disease assessment criteria, Figure 4). A major QTL on chromosome 18 was found in both association panels (Figure 3).
QTL confirmation and candidate genes
We compared the positions of the significant SNPs identified in this study with the positions of the QTLs reported in bi-parental mapping studies and found considerable overlap between these SNPs and the reported genes or QTLs for SDS resistance. Of the 20 loci we detected in the two association panels, seven overlapped with previously identified QTLs (Table 2).
Notably, one of the overlaps is the QTL Rfs2/Rhg1 on chromosome 18. This locus consistently contributes more effective coinheritance of resistance to SDS and reduces infestation by SCN . Previous fine map development did not resolve Rfs2 from Rhg1, suggesting that the underlying genes were either very closely linked or pleiotropic . In this study, we did detect a cluster of associations spanning a physical region of 1.2 Mb (1.2-2.4 Mb) around three Rhg1 genes that were found to contribute to SCN resistance (Figure 5a, Additional file 9, 10, 11, 12) . The cluster of associations also explained a major part of phenotypic variation of SDS resistance in both panels (Additional file 9, 10, 11, 12). If the three Rhg1 genes were pleiotropic, the peak SNP for SDS resistance should be located either within or in the same LD block with the three Rhg1 genes. However, the peak SNP (GM18-1709751) was not only located outside of, but also belonged to a different LD block than the three Rhg1 genes (Additional file 15). One possible explanation for this is that SDS resistance mediated by Rhg1 is also conferred by copy number variation (CNV) that increases the expression of a set of dissimilar genes. Alternatively, there exist other gene/s mediating SDS resistance that are closely linked with Rhg1. In fact, we found that the peak SNP (GM18-1709751) was located at approximately 2.2 kb upstream of GmRLK18-1, a gene that encodes a receptor-like kinase, and its resistance allele is sufficient to confer nearly complete resistance to both root and leaf symptoms of SDS . Moreover, there was another significant SNP (GM18-1712832 with P-value of 1.2 × 10-8 for DX) located within an exon of GmRLK18-1. These results support previous studies with regard to the key role of GmRLK18-1. However, we cannot exclude the possibility that structural variation in the form of CNV may have functional importance and thus contribute to SDS resistance that is not captured by our SNPs.
We searched for additional independently associated SNP variants near the Rfs2/Rhg1 locus by conditioning on the peak SNP (GM18-1709751) at the Rfs2/Rhg1 loci. At 519 kb downstream of Rfs2/Rhg1, we found an independently associated SNP variant, Gm18-2228646 with P-value of 6.5 × 10-5 in the conditional model, as measured by DS. This SNP is located at 44 bp downstream of a gene encoding an aquaporin transporter (Figure 5b). Therefore, our results provide evidence for the presence of a regulating gene other than GmRLK18-1 that is associated with SDS resistance on chromosome 18.
Besides the Rfs2/Rhg1 region, we refined the mapping location with other significant SNPs within or adjacent to previously reported QTLs (Table 2). Notably, we repeatedly detected a cluster of associations, measured by DS, DI and DX, spanning a physical region of 0.7 Mb (36.5 to 37.2 Mb) near the Rzd locus (resistance to zygote death) on chromosome 7 in panel P1. The Rzd locus contributed to SCN resistance and was strictly co-inherited in phase with Rfs2/Rhg1 in an earlier study . However, not all of the QTLs detected in previous bi-parental populations were detected in our association panels. The reason for failure to detect them may be that root infection severity is not included in our disease assessment criteria, so those QTLs associated with resistance to root infection cannot be identified. Alternatively, some QTLs may segregate at low frequency or not at all in our association panels, or the SNP coverage in this study is still insufficient to capture all of the haplotypes present in the diverse soybean varieties. On the other hand, we found 13 novel QTLs. Compared with the 7 loci within intervals of known QTLs, the 13 new loci are slightly weaker in terms of average P-value (6.47 × 10-5 vs 7.54 × 10-4) and explained phenotypic variance (8.14% vs 7.06%). However, some of them explain as much as, or even more phenotypic variance than that of known QTLs (Table 2). We checked whether these new QTLs were near loci for determinacy, maturity date, leaf, stem or root morphology and found that one new QTL at 33.6 Mb on chromosome 13 was within the interval containing QTLs for plant height and stem strength. Another new QTL at 34.8 Mb on chromosome 19 was located approximately 2 Mb downstream of a locus related with flowering time and leaf morphology. No new QTLs were found near loci related to determinacy, maturity date, or root morphology. To further validate these new loci, we developed five recombinant inbred line (RIL) populations and are currently conducting a confirmation study. To date, we have conducted conventional QTL mapping in three of the five RIL populations. Five out of the 13 novel QTLs have been validated in the three RIL populations (data not shown). Undoubtedly, the major loci identified in this study can be used to improve soybean for SDS resistance.
When we checked candidate genes containing or immediately adjacent to the significant SNPs, we found that diverse types of genes are probably involved in natural variation for soybean SDS resistance (Table 2). For instance, we identified one pentatricopeptide repeat (PPR) gene, which has certain features in common with disease resistance genes (R genes) . We also identified two genes encoding leucine-rich repeat (LRR) domains, which are important in plant responses to a variety of external stimuli including pathogens (Table 2 and ref. 44). A gene with similarity to ubiquitin-like protein, which is required for host and nonhost disease resistance in plants , was also identified. Several other SNPs were within or adjacent to sequences annotated as homologous to genes previously shown to be involved in plant disease resistance (Table 2). Follow-up studies will focus on validating effects of these genes, uncovering the molecular mechanisms of complex SDS resistance in soybean and integrating this knowledge to dissect mechanisms underlying quantitative resistance to soil-borne pathogens.
In this study, GWA mapping with correction for population structure and cryptic relatedness identified multiple novel loci and refined the map locations of known loci related to SDS resistance in soybean. This information not only demonstrates that GWA mapping can be used as a powerful tool for dissecting disease resistance mechanisms in soybean, but also provides valuable markers for developing soybean cultivars with durable resistance against SDS. Moreover, the candidate genes containing these SNP loci represent promising targets for further efforts to pinpoint causal variants and to clarify how the implicated genes affect SDS resistance in soybean.
Availability of supporting data
The data sets supporting the results of this article are included within this article and its additional files.
Sudden death syndrome
Soybean cyst nematode
Single nucleotide polymorphisms
Quantitative trait locus
Simple sequence repeat
General linear model
Mixed linear model.
Roy KW, Rupe JCD, Hershman DE, Abney TS: Sudden death syndrome of soybean. Plant Dis. 1997, 81: 1100-1111. 10.1094/PDIS.19184.108.40.2060.
Wrather JA, Shannon G, Balardin R, Carregal L, Escobar R, Gupta GK, Ma Z, Morel W, Ploper D, Tenuta A: Effect of diseases on soybean yield in the top eight producing countries in 2006. Plant Health Progress. 2010, doi:10.1094/PHP-2010- 0125-01-RS
Jin H, Hartman GL, Nickell CD, Widholm JM: Characterization and purification of a phytotoxin produced by Fusarium solani, the causal agent of soybean sudden death syndrome. Phytopathology. 1996, 86: 277-282. 10.1094/Phyto-86-277.
Rupe JC, Hartman GL: Sudden death syndrome. Compendium of Soybean Diseases. Edited by: Hartman GL, Sinclair JB, Rupe JC. 1999, St. Paul: APS Press, 37-39.
Wrather JA, Koenning SR: Estimates of disease effects on soybean yields in the United States 2003 to 2005. J Nematology. 2006, 38: 173-180.
Wrather JA, Koenning SR: Effects of diseases on soybean yields in the United States 1996 to 2007. Plant Health Progr. 2009, doi:10.1094/PHP-2009-0401-01-RS
de Farias Neto AL, Hartman GL, Pedersen WL, Li S, Bollero GA, Diers BW: Mapping and confirmation of a new sudden death syndrome resistance QTL on linkage group D2 from the soybean genotypes PI567374 and ‘Ripley’. Mol Breed. 2007, 20: 53-62. 10.1007/s11032-006-9072-8.
Hartman GL, Huang YH, Nelson RL, Noel GR: Germplasm evaluation of Glycine max for resistance to Fusarium solani, the causal organism of sudden death syndrome. Plant Dis. 1997, 81: 515-551. 10.1094/PDIS.19220.127.116.115.
Njiti VN, Johnson JE, Torto TA, Gray LE, Lightfoot DA: Inoculum rate influences selection for field resistance to sudden death syndrome in the greenhouse. Crop Sci. 2001, 41: 1726-1733. 10.2135/cropsci2001.1726.
Luckew AS, Leandro LF, Bhattacharyya MK, Nordman DJ, Lightfoot DA, Cianzio SR: Usefulness of 10 genomic regions in soybean associated with sudden death syndrome resistance. Theor Appl Genet. 2013, 126: 2391-2403. 10.1007/s00122-013-2143-4.
Srour A, Afzal AJ, Blahut-Beatty L, Hemmati N, Simmonds DH, Li W, Liu M, Town CD, Sharma H, Arelli P, Lightfoot DA: The receptor like kinase at Rhg1-a/Rfs2 caused pleiotropic resistance to sudden death syndrome and soybean cyst nematode as a transgene by altering signaling responses. BMC Genomics. 2012, 13: 368-10.1186/1471-2164-13-368. doi:10.1186/1471-2164-13-368
Zhu CS, Gore M, Buckler ES, Yu JM: Status and prospects of association mapping in Plants. Plant Gen. 2008, 1: 5-20. 10.3835/plantgenome2008.02.0089.
Atwell S, Huang YS, Vilhjálmsson BJ, Willems G, Horton M, Li Y, Meng D, Platt A, Tarone AM, Hu TT, Jiang R, Muliyati NW, Zhang X, Amer MA, Baxter I, Brachi B, Chory J, Dean C, Debieu M, de Meaux J, Ecker JR, Faure N, Kniskern JM, Jones JD, Michael T, Nemri A, Roux F, Salt DE, Tang C, Todesco M, et al: Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines. Nature. 2010, 465: 627-631. 10.1038/nature08800.
Huang X, Wei X, Sang T, Zhao Q, Feng Q, Zhao Y, Li C, Zhu C, Lu T, Zhang Z, Li M, Fan D, Guo Y, Wang A, Wang L, Deng L, Li W, Lu Y, Weng Q, Liu K, Huang T, Zhou T, Jing Y, Li W, Lin Z, Buckler ES, Qian Q, Zhang QF, Li J, Han B: Genome-wide association studies of 14 agronomic traits in rice landraces. Nat Genet. 2010, 42 (11): 961-967. 10.1038/ng.695.
Kump KL, Bradbury PJ, Wisser RJ, Buckler ES, Belcher AR, Oropeza-Rosas MA, Zwonitzer JC, Kresovich S, McMullen MD, Ware D, Balint-Kurti PJ, Holland JB: Genome-wide association study of quantitative resistance to southern leaf blight in the maize nested association mapping population. Nat Genet. 2011, 43: 163-168. 10.1038/ng.747.
Tian F, Bradbury PJ, Brown PJ, Hung H, Sun Q, Flint-Garcia S, Rocheford TR, McMullen MD, Holland JB, Buckler ES: Genome-wide association study of leaf architecture in the maize nested association mapping population. Nat Genet. 2010, 43: 159-162.
Hao D, Cheng H, Yin Z, Cui S, Zhang D, Wang H, Yu D: Identification of single nucleotide polymorphisms and haplotypes associated with yield and yield components in soybean (Glycine max) landraces across multiple environments. Theor Appl Genet. 2012, 124: 447-458. 10.1007/s00122-011-1719-0.
Wang J, McClean PE, Lee R, Goos RJ, Helms T: Association mapping of iron deficiency chlorosis loci in soybean (Glycine max L. Merr.) advanced breeding lines. Theor Appl Genet. 2008, 116: 777-787. 10.1007/s00122-008-0710-x.
Schmutz J, Cannon SB, Schlueter J, Ma J, Mitros T, Nelson W, Hyten DL, Song Q, Thelen JJ, Cheng J, Xu D, Hellsten U, May GD, Yu Y, Sakurai T, Umezawa T, Bhattacharyya MK, Sandhu D, Valliyodan B, Lindquist E, Peto M, Grant D, Shu S, Goodstein D, Barry K, Futrell-Griggs M, Abernathy B, Du J, Tian Z, Zhu L, et al: Genome sequence of the palaeopolyploid soybean. Nature. 2010, 463: 178-183. 10.1038/nature08670.
Song Q, Hyten DL, Jia G, Quigley CV, Fickus EW, Nelson RL, Cregan PB: Development and evaluation of SoySNP50K, a high-density genotyping array for soybean. PLoS ONE. 2013, 8 (1): e54985-10.1371/journal.pone.0054985. doi:10.1371/journal.pone.0054985
Muller DS, Hartman GL, Nelson RL, Pedersen WL: Evaluation of Glycine max germplasm for resistance to Fusarium solani f. sp. glycines. Plant Disease. 2002, 86: 741-746. 10.1094/PDIS.2002.86.7.741.
Gelin JR, Arelli PR, Rojas-Cifuentes PR: Using independent culling to screen plant introductions for combined resistance to soybean cyst nematode and sudden death syndrome. Crop Sci. 2006, 46: 2081-2083. 10.2135/cropsci2005.12.0505.
Kisha T, Sneller CH, Diers BW: Relationship between genetic distance among parents and genetic variance in populations of soybean. Crop Sci. 1997, 37: 1317-1325. 10.2135/cropsci1997.0011183X003700040048x.
Akond M, Liu S: A SNP-based genetic linkage map of soybean using the SoySNP6K Illumina Infinium BeadChip genotyping array. J Plant Genom Sci. 2013, 1 (3): 80-89.
Yan J, Yang X, Shah T, Sánchez-Villeda H, Li J, Warburton M, Zhou Y, Crouch JH, Xu Y: High-throughput SNP genotyping with the Golden Gate assay in maize. Mol Breed. 2010, 25: 441-451. 10.1007/s11032-009-9343-2.
Fehr WR, Caviness CE, Burmood DT, Pennington JS: Stage of development descriptions for soybeans, Glycine max (L.)Merrill. Crop Sci. 1971, 11: 929-931. 10.2135/cropsci1971.0011183X001100060051x.
Nyquist WE: Estimation of heritability and prediction of selection response in plant populations. Crit Rev Plant Sci. 1991, 10: 235-322. 10.1080/07352689109382313.
Liu K, Muse SV: PowerMarker: integrated analysis environment for genetic marker data. Bioinformatics. 2005, 21: 2128-2129. 10.1093/bioinformatics/bti282.
Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D: Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet. 2006, 38: 904-909. 10.1038/ng1847.
Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES: TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics. 2007, 23: 2633-2635. 10.1093/bioinformatics/btm308.
Yu J, Pressoir G, Briggs WH, Vroh Bi I, Yamasaki M, Doebley JF, McMullen MD, Gaut BS, Nielsen DM, Holland JB, Kresovich S, Buckler ES: A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat Genet. 2006, 38: 203-208. 10.1038/ng1702.
Zhang Z, Ersoz E, Lai CQ, Todhunter RJ, Tiwari HK, Gore MA, Bradbury PJ, Yu J, Arnett DK, Ordovas JM, Buckler ES: Mixed linear model approach adapted for genome-wide association studies. Nat Genet. 2010, 42: 355-360. 10.1038/ng.546.
Li YH, Li W, Zhang C, Yang L, Chang RZ, Gaut BS, Qiu LJ: Genetic diversity in domesticated soybean (Glycine max) and its wild progenitor (Glycine soja) for simple sequence repeat and single-nucleotide polymorphism loci. New Phytologist. 2010, 188: 242-253. 10.1111/j.1469-8137.2010.03344.x.
Hyten DL, Choi IY, Song Q, Shoemaker RC, Nelson RL, Costa JM, Specht JE, Cregan PB: Highly variable patterns of linkage disequilibrium in multiple soybean populations. Genetics. 2007, 175: 1937-1944. 10.1534/genetics.106.069740.
Yan J, Shah T, Warburton ML, Buckler ES, McMullen MD, Crouch J: Genetic characterization and linkage disequilibrium estimation of a global maize collection using SNP markers. PLoS ONE. 2009, 4 (12): 8451-10.1371/journal.pone.0008451. doi:10.1371 /journal. pone.0008451
Roberts EH, Qi A, Ellis RH, Summerfield RJ, Lawn RJ, Shanmugasundaram S: Use of field observations to characterize genotypic flowering responses to photoperiod and temperature: A soybean exemplar. Theor Appl Genet. 1996, 93: 519-533. 10.1007/BF00417943.
Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel B, Higgins J, DeFelice M, Lochner A, Faggart M, Liu-Cordero SN, Rotimi C, Adeyemo A, Cooper R, Ward R, Lander ES, Daly MJ, Altshuler D: The structure of haplotype blocks in the human genome. Science. 2002, 296: 2225-2229. 10.1126/science.1069424.
Geddy R, Brown GG: Genes encoding pentatricopeptide repeat (PPR) proteins are not conserved in location in plant genomes and may be subject to diversifying selection. BMC Genomics. 2007, 8: 130-10.1186/1471-2164-8-130. doi:10.1186/1471-2164-8-130
Kassem MA, Ramos L, Leandro L, Mbofung G, Hyten DL, Kantartzi SK, Grier RL, Njiti VN, Cianzio S, Meksem K: The ‘PI 438489B’ by ‘Hamilton’ SNP-based genetic linkage map of soybean [Glycine max (L.) Merr.] identified quantitative trait loci that underlie seedling SDS resistance. J Plant Genome Sci. 2012, 1: 18-30. 10.5147/jpgs.2012.0053.
Hernández-Blanco C, Feng DX, Hu J, Sánchez-Vallet A, Deslandes L, Llorente F, Berrocal-Lobo M, Keller H, Barlet X, Sánchez-Rodríguez C, Anderson LK, Somerville S, Marco Y, Molina A: Impairment of cellulose synthases required for Arabidopsis secondary cell wall formation enhances disease resistance. Plant Cell. 2007, 19: 890-903. 10.1105/tpc.106.048058.
Yaeno T, Li H, Chaparro-Garcia A, Schornack S, Koshiba S, Watanabe S, Kigawa T, Kamoun S, Shirasu K: Phosphatidylinositol monophosphate-binding interface in the oomycete RXLR effector AVR3a is required for its stability in host cells to modulate plant immunity. Proc Natl Acad Sci USA. 2011, 108 (35): 14682-14687. 10.1073/pnas.1106002108.
Iqbal MJ, Meksem K, Njiti VN, Kassem MA, Lightfoot DA: Microsatellite markers identify three additional quantitative trait loci for resistance to soybean sudden-death syndrome (SDS) in Essex Forrest RILs. Theor Appl Genet. 2001, 102: 187-192. 10.1007/s001220051634.
Bent AF, Mackey D: Elicitors, effectors, and R genes: The new paradigm and a lifetime supply of questions. Annu Rev Phytopathol. 2007, 45: 399-436. 10.1146/annurev.phyto.45.062806.094427.
Avrova AO, Taleb N, Rokka VM, Heilbronn J, Campbell E, Hein I, Gilroy EM, Cardle L, Bradshaw JE, Stewart HE, Fakim YJ, Loake G, Birch PR: Potato oxysterol binding protein and cathepsin B are rapidly up-regulated in independent defence pathways that distinguish R gene-mediated and field resistances to Phytophthora infestans. Mol Plant Pathol. 2004, 5 (1): 45-56. 10.1111/j.1364-3703.2004.00205.x.
Peart JR, Lu R, Sadanandom A, Malcuit I, Moffett P, Brice DC, Schauser L, Jaggard DA, Xiao S, Coleman MJ, Dow M, Jones JD, Shirasu K, Baulcombe DC: Ubiquitin ligase-associated protein SGT1 is required for host and nonhost disease resistance in plants. Proc Natl Acad Sci USA. 2002, 99 (16): 10865-10869. 10.1073/pnas.152330599.
Gupta SK, Rai AK, Kanwar SS, Shama TR: Comparative analysis of zinc finger proteins involved in plant disease resistance. PLoS ONE. 2012, 7 (8): e42578-10.1371/journal.pone.0042578. doi:10.1371/ journal.pone.0042578
Kim HS, Delaney TP: Arabidopsis SON1 is an F-box protein that regulates a novel induced defense response independent of both salicylic acid and systemic acquired resistance. Plant Cell. 2002, 14 (7): 1469-1482. 10.1105/tpc.001867.
Rock CO, Park HW, Jackowski S: Role of feedback regulation of pantothenate kinase (CoaA) in control of coenzyme A levels in Escherichia coli. J Bacteriol. 2003, 185 (11): 3410-3415. 10.1128/JB.185.11.3410-3415.2003.
Liu G, Ji Y, Bhuiyan NH, Pilot G, Selvaraj G, Zou J, Wei Y: Amino acid homeostasis modulates salicylic acid-associated redox status and defense responses in Arabidopsis. Plant Cell. 2010, 22 (11): 3845-3863. 10.1105/tpc.110.079392.
Kassem MA, Shultz J, Meksem K, Cho Y, Wood AJ, Iqbal MJ, Lightfoot DA: An updated Essex by Forrest linkage map and first composite interval map of QTL underlying six soybean traits. Theor Appl Genet. 2006, 113: 1015-1026. 10.1007/s00122-006-0361-8.
Cook DE, Lee TG, Guo X, Melito S, Wang K, Bayless AM, Wang J, Hughes TJ, Willis DK, Clemente TE, Diers BW, Jiang J, Hudson ME, Bent AF: Copy number variation of multiple genes at Rhg1 mediates nematode resistance in soybean. Science. 2012, 338: 1206-1209. 10.1126/science.1228746.
Mysore KS, D'Ascenzo MD, He X, Martin GB: Overexpression of the disease resistance gene Pto in tomato induces gene expression changes similar to immune responses in human and fruitfly. Plant Physiol. 2003, 132 (4): 1901-1912. 10.1104/pp.103.022731.
Liu Y, Burch-Smith T, Schiff M, Feng S, Dinesh-Kumar SP: Molecular chaperone Hsp90 associates with resistance protein N and its signaling proteins SGT1 and Rar1 to modulate an innate immune response in plants. J Biol Chem. 2004, 279 (3): 2101-2108. 10.1074/jbc.M310029200.
Steffens B, Sauter M: G proteins as regulators in ethylene-mediated hypoxia signaling. Plant Signal Behav. 2010, 5 (4): 375-378. 10.4161/psb.5.4.10910.
Kazi S, Shultz J, Afzal J, Johnson J, Njiti VN, Lightfoot DA: Separate loci underlie resistance to root infection and leaf scorch during soybean sudden death syndrome. Theor Appl Genet. 2008, 116: 967-977. 10.1007/s00122-008-0728-0.
Lorenc-Kukuła K, Zuk M, Kulma A, Czemplik M, Kostyn K, Skala J, Starzycki M, Szopa J: Engineering flax with the GT family 1 Solanum sogarandinum glycosyltransferase SsGT1 confers increased resistance to Fusarium infection. J Agric Food Chem. 2009, 57 (15): 6698-6705. 10.1021/jf900833k.
Webb DM, Baltazar BM, Rao-Arelli AP, Schupp J, Clayton K, Keim P, Beavis WD: Genetic mapping of soybean cyst nematode race-3 resistance loci in the soybean PI 437654. Theor Appl Genet. 1995, 91: 574-581.
We thank all the participants in the 2011 and 2012 Soybean Sudden Death Syndrome Regional Tests who provided the SDS resistance data and/or the seeds of the tested soybean genotypes. The participants were J. Bond, R. Bowen, P. Chen, S. Cianzio, B. Diers, A. Dorrance, K. Gallo, S. Kantartzi, D. Kyle, D. Malvick, J.H. Orf, H. Ramasubramaniam, and T. Welacky. We also thank C. and A. Druskovich for use of their field and J. Boyse, N. Boyse, R. Laurenz, B. Serven and J. Jacobs for technical assistance. We appreciate Dr. Linda Kull for her coordination of the NCSRP SDS Research Alliance which included this study. We thank Dr. Frank Dennis and Paul Collins for editing. Funding was provided by the North Central Soybean Research Program and Michigan Soybean Promotion Committee.
The authors declare that they have no competing interests.
DW designed and supervised the research; ZW performed the SNP genotyping, analyzed the data and wrote the manuscript; RT, JY, CB, WD, SZ, MC, CS conducted the field investigation; QS and PBC developed SNP chips; All authors read and approved the manuscript.
Electronic supplementary material
Additional file 1: Soybean germplasm accessions analyzed in this study. Information is given in this file for each accession, including accession name, origin, maturity group and subpopulation ancestry based Neighbor-joining trees. (XLSX 34 KB)
Additional file 2: SNPs Distribution of each chromosome on SoySNP 50 k (a) and SoySNP 6 k (b)BeadChip used in genotyping for panel P1 and P2, respectively. This figure is a color index showing the SNP distribution and density of 20 chromosomes on SoySNP 50 k (a) and SoySNP 6 k (b) BeadChip. (DOCX 79 KB)
Additional file 3: Scale used for phenotyping sudden death syndrome disease severity (DS). Disease incidence (DI) is the percentage of plants in the plot showing leaf symptoms. Disease index (DX) = (DI × DS)/9. (DOCX 156 KB)
Additional file 4: Kinship value between individual accessions among panels P1 (a) and P2 (b). Individuals are ordered according to their order listed in Additional file 1. Pairwise kinship values are shown as color-index heat map. (DOCX 2 MB)
Additional file 5: The distributions of minor allele frequencies in P1 (a) and P2 (b) association panels. Two histograms, a for panel P1 and b for panel P2, showing the distributions of minor allele frequencies in two association panels. (DOCX 92 KB)
Additional file 6: Phenotypic variation, heritability and correlation analysis in the two association panels. Descriptive statistics information, including mean, range, standard deviation, source of variation and correlation coefficient for DS, DI and DX. (DOCX 17 KB)
Additional file 7: Distribution of accessions in each subgroup based on genetic distance in panel P1 and P2. Two-way classification of all accessions, with SNP-data-based clustering (NJ tree) at the top and the maturity-group-based grouping clusters at the left. (DOCX 15 KB)
Additional file 8: The population differentiation statistics ( F ST ) among subpopulation in panel P1 and P2. Pairwise population differentiation index (Fst) as well as corresponding significant levels are list in this table. (DOCX 16 KB)
Additional file 9: Associations (FDR < 0.05) identified by GWA mapping for DS in association panel P1. Information of significantly associated SNPs, including name, physical position and phenotypic variation explained by the SNP, is reported in this table. (DOCX 17 KB)
Additional file 10: Associations (FDR < 0.05) identified by GWAS for DI in association panel P1. Information of significantly associated SNPs, including name, physical position and phenotypic variation explained by the SNP, is reported in this table. (DOCX 19 KB)
Additional file 11: Associations (FDR < 0.05) identified by GWAS for DX in association panel P1. Information of significantly associated SNPs, including name, physical position and phenotypic variation explained by the SNP, is reported in this table. (DOCX 20 KB)
Additional file 12: Associations (FDR < 0.05) identified by GWAS in association panel P2. Information of significantly associated SNPs, including name, physical position and phenotypic variation explained by the SNP, are reported in this table. (DOCX 18 KB)
Additional file 13: Genome-wide association study of DS in the two association panels. (a) Manhattan plots of the simple model for DS in association panel P1. The - log10 P values from a genome-wide scan are plotted against the position on each of the 20 chromosomes. The horizontal red line indicates the genome-wide significance threshold (FDR < 0.05). (b) Quantile-quantile plot of simple model for DS in the association panel P1. (c) Manhattan plots of MLM for DX in association panel P2, as in a. (d) Quantile-quantile plot of MLM for DS in the panel P1. (e) Manhattan plots of the simple model for DS in association panel P2, as in a. (f) Quantile-quantile plot of simple model for DS in the panel P2. (g) Manhattan plots of MLM for DS in the panel P1, as in a. (h) Quantile-quantile plot of MLM for DS in association panel P2. (DOCX 965 KB)
Additional file 14: Genome-wide association study of DI in the two association panels. (a) Manhattan plots of the simple model for DI in association panel P1. The - log10 P values from a genome-wide scan are plotted against the position on each of the 20 chromosomes. The horizontal red line indicates the genome-wide significance threshold (FDR < 0.05). (b) Quantile-quantile plot of simple model for DI in the association panel P1. (c) Manhattan plots of MLM for DX in association panel P1, as in a. (d) Quantile-quantile plot of MLM for DI in association panel P1. (e) Manhattan plots of the simple model for DI in association panel P2, as in a. (f) Quantile-quantile plot of simple model for DI in the association panel P2. (g) Manhattan plots of MLM for DI in association panel P2, as in a. (h) Quantile-quantile plot of MLM for DI in association panel P2. (DOCX 1 MB)
Additional file 15: Regional plots showing association mapping results for SNPs located around Rfs2 / Rhg1 on chromosome 18. Negative log10-transformed P-values from the MLM are plotted on the left vertical axis for association panel P1; Negative log10-transformed P-values from the MLM are plotted on the right vertical axis for association panel P2. Blue horizontal dashed lines indicate the genome-wide significance threshold in association panel P1. Previously identified genes controlling the traits are labeled. (DOCX 94 KB)
About this article
Cite this article
Wen, Z., Tan, R., Yuan, J. et al. Genome-wide association mapping of quantitative resistance to sudden death syndrome in soybean. BMC Genomics 15, 809 (2014). https://doi.org/10.1186/1471-2164-15-809
- Genome wide association mapping
- Glycine max
- Fusarium virguliforme
- Quantitative trait loci mapping