- Research article
- Open access
- Published:
A high density SLAF-SNP genetic map and QTL detection for fibre quality traits in Gossypium hirsutum
BMC Genomics volume 19, Article number: 879 (2018)
Abstract
Background
Upland Cotton (Gossypium hirsutum) is a very important cash crop known for its high quality natural fiber. Recent advances in sequencing technologies provide powerful tools with which to explore the cotton genome for single nucleotide polymorphism marker identification and high density genetic map construction toward more reliable quantitative trait locus mapping.
Results
In the present study, a RIL population was developed by crossing a Chinese high fiber quality cultivar (Yumian 1) and an American high fiber quality line (CA3084), with distinct genetic backgrounds. Specific locus amplified fragment sequencing (SLAF-seq) technology was used to discover SNPs, and a genetic map containing 6254 SNPs was constructed, covering 3141.72 cM with an average distance of 0.5 cM between markers. A total of 95 QTL were detected for fiber quality traits in three environments, explaining 5.5-24.6% of the phenotypic variance. Fifty-five QTL found in multiple environments were considered stable QTL. Nine of the stable QTL were found in all three environments. We identified 14 QTL clusters on 13 chromosomes, each containing one or more stable QTL.
Conclusion
A high-density genetic map of Gossypium hirsutum developed by using specific locus amplified fragment sequencing technology provides detailed mapping of fiber quality QTL, and identification of ‘stable QTL’ found in multiple environments. A marker-rich genetic map provides a foundation for fine mapping, candidate gene identification and marker-assisted selection of favorable alleles at stable QTL in breeding programs.
Background
Cotton, the main source of natural textile fiber, is one of the most important cash crops. Cultivated species of cotton include two diploids (Gossypium herbaceum and G. arboreum) and two tetraploids (G. hirsutum and G. barbadense). Due to its higher yield and wide adaptability, G. hirsutum L. accounts for more than 95% of worldwide cotton production [1]. Cotton fiber yield is of primary importance for cotton growers, whereas the textile industry demands high fiber quality. In order to meet the diverse demands of cotton growers and the textile industry, it is necessary for cotton breeders to develop cultivars with both high yield and superior fiber quality. Cotton fiber quality components, important for spinning, primarily include length, strength and micronaire (a measure of fineness). These traits have genetically complex quantitative inheritance and are substantially influenced by environmental factors. Traditional breeding approaches are often not precise enough to uncover the genetic architecture of these traits. Marker assisted selection (MAS) has proven to be a robust and cost effective breeding tool to manipulate genes controlling quantitative traits [2, 3].
In the past few decades, the efficiency of QTL mapping has been improved with the advent of molecular markers. More than 1500 QTL for fiber quality traits have been reported [4]. Simple sequence repeats (SSR), a type of DNA marker, have been used for genetic mapping and QTL detection for about two decades. However, low polymorphism rate limits the use of SSRs for creating high density cotton genetic maps. Recent advances in ‘next generation sequencing (NGS)’ enabled researchers to identify DNA sequence polymorphisms i.e. SNPs, scanning large numbers of nucleotides to find sufficiently large subsets for genome mapping even between closely-related parents. Therefore, SNP markers are effective for creating high density genetic maps, mapping QTL and use in MAS [5,6,7,8], understanding population structure and studying genetic diversity [9].
The release of genome sequences of cotton species including G. raimondii [10, 11], G. arboreum [12], G. hirsutum [13, 14], and G. barbadense [15] made it easy to utilize NGS for construction of high density genetic maps and QTL identification. A variety of technologies were developed based on NGS, such as genotyping by sequencing (GBS) [16], restriction-site associated DNAÂ sequencing (RAD-seq) [17] and specific locus amplified fragment sequencing (SLAF-seq) [18]. SLAF-seq is one of the most efficient and cost effective tools for manipulating SNPs and genotyping [18]. SLAF-seq has many merits over the other advanced technologies: 1) it does not necessarily require a reference genome sequence and polymorphism information; 2) repetitive sequences can be avoided; and 3) a balance between marker density and population size can be maintained by varying the fragment size. SLAF-seq has previously been applied in many plant species for developing high density genetic maps and mapping QTL [19,20,21,22,23]. Shen et al. [23] identified 132,880 SNPs and 6,296 InDels between a reference genome (TM-1) and five tetraploid cotton species by using SLAF-seq. Zhang et al. [24] exploited SLAF-seq to construct a genetic map of 5521 SNPs covering 3259.4 cM and reveal 18 QTL for cotton boll weight, explaining 4.15-16.70% phenotypic variance, with 344 putative candidate genes identified.
In this study, a RIL population of 180 lines from a cross between a Chinese high fiber quality cultivar (Yumian 1) and an American high fiber quality line (CA3084) was investigated, with these parents selected based on their distinct genetic background. SLAF-seq was used for the discovery of SNPs, which were then used to develop a high density genetic map and identify QTL for fiber quality traits.
Results
Phenotypic analysis of fiber quality traits
Descriptive statistics for fiber quality traits i.e. fiber length (FL), fiber uniformity (FU), fiber micronaire (FM), fiber elongation (FE) and fiber strength (FS) across three environments were summarized in Table 1. In all environments, the fiber length and strength of CA3084 were higher than Yumian 1. All five traits changed according to circumstances for both parents and population. Besides, the phenotype of RIL population showed continuous variation and transgressive segregation. Skewness and kurtosis tests showed that all these traits were normally distributed (Table 1).
One-way ANOVA showed that all traits had significant genetic and environmental effects (p < 0.01) (Table 2). Correlations among all five fiber quality traits were significant except FU-FM and FM-FE (Table 3).
Analysis of SLAF-seq data and SLAF markers
After SLAF library construction and high-throughput sequencing, a total of 83.65 GB data comprising 418.26 M paired-end reads was generated. Among them, 93.80% of bases met or exceeded a quality score of 30 (i.e., Q30, indicating a 0.1 % chance of an error, and thus 99.9% confidence) and guanine-cytosine (GC) content was 38.54%. The 418.26 M paired-end reads consisted of 158985 SNP markers. A total of 33288 (20.93% of) markers were polymorphic in the RIL population. The polymorphic SNP markers were classified into four genotypes as follows: aa × bb meant that both parents were homozygous in this SNP position, the genotype of one parent was aa and the other was bb; hk × hk meant that both parents were heterozygous; and the lm × ll and nn × np meant that one parent was heterozygous and the other was homozygous. Only genotype aa × bb, consisting of 21781 SNPs, was used for further analysis in our study. Markers with low sequencing depth or more than 30% missing data were filtered out. A total of 6254 markers meeting these quality standards were used for genetic map construction.
Construction of the genetic map
From the 6254 high quality SNP markers, a genetic map covering a total distance of 3141.7 cM with an average marker interval of 0.5 cM was constructed (Table 4; Figs. 1, 2, 3, 4, 5, 6 and 7). The A sub-genome harbored 3430 markers covering 1618.4 cM whereas the D sub-genome harbored 2824 markers covering 1523.3 cM. The largest chromosome was Chr08, containing 958 markers covering 206.4 cM, with an average marker interval of 0.2 cM. The shortest chromosome was Chr06, containing 86 markers covering 94.1 cM with an average marker interval of 1.1 cM. There were only 8 gaps greater than 10.0 cM, the largest being 14.353 cM on Chr08 (Table 4).
Segregation distortion
Among 6254 mapped loci, 1561 (24.96%) showed segregation ratios that deviated significantly (p <0.05) from Mendelian expectations (Table 4). These segregation distortion markers (SDMs) were unevenly distributed over the genome, forming 56 segregation distortion regions (SDRs). A total of 768 SDMs in 28 SDRs were located on the A subgenome and 793 SDMs in 29 SDRs were located on the D subgenome. A remarkable 52.8% (506) of the total markers on Chr08 had significant segregation distortion, while Chr06 had no SDMs and Chr05 had only 3 SDMs (Table 4; Figs. 1, 2, 3, 4, 5, 6 and 7). A total of 1486 SDMs (95.19%) favored Yumian 1 alleles while 75 SDMs (4.81%) favored CA3084.
Collinearity between the genetic and the physical map
Collinearity with the physical map was studied to assess the quality of the genetic map. The physical maps of 26 chromosomes were constructed based on the positions of 6254 mapped loci on the reference genome sequence of G. hirsutum [13], and covered 96.53% of the genome. All chromosomes showed more than 90% coverage except Chr08 and Chr22 which showed 88.08% and 89.97% coverage, respectively (Additional file 1: Table S1). Most genetically mapped loci were collinear with their physical map locations (Fig. 8).
QTL mapping
A total of 95 QTL for fiber quality traits were identified in this study (Additional file 2: Table S2; Figs. 1, 2, 3, 4, 5, 6 and 7). Phenotypic variance explained (PVE) by these QTL ranged from 5.5-24.6% and LOD values ranged from 2.0-10.6. The At genome contained 50 QTL, while the Dt genome contained 45 QTL. Every chromosome had at least one QTL except Ch04, Ch15, and Ch19. Among the 95 detected QTL, Yumian 1 contributed 46 favorable alleles while CA3084 contributed 49. A total of 55 QTL were found in more than one environment with 9 found in all three environments (Table 5).
Fiber Length
Twenty QTL controlling fiber length were found on 16 chromosomes, explaining 5.7-22.7% of the phenotypic variance (Additional file 2: Table S2). Ch14, Ch17, Ch18 and Ch22 each had 2 fiber length QTL. Yumian1 contributed 8 favorable alleles whereas CA3084 contributed 12. Among all fiber QTL, 12 were found in two or more environments, but only 3 (qFL06.1, qFL21.1, qFL23.1) were found in all three (Table 5).
Fiber Strength
Twenty-one fiber strength QTL were found on 16 chromosomes, explaining 6-17.7% of the phenotypic variance (Additional file 2: Table S2). Ch03, Ch06, Ch16 and Ch22 each had 2 fiber strength QTL. Yumian1 contributed 10 favorable alleles for fiber strength whereas CA3084 contributed 11. Eleven QTL were detected in two or more environments, but only qFS12.2 was detected in all three. Five fiber strength QTL (qFS03.2, qFS06.1, qFS12.2, qFS22.1, qFS23.1) explained more than of 10% phenotypic variance (Table 5).
Fiber Micronaire
Twenty-two QTL controlling fiber micronaire were found on 18 chromosomes, explaining 5.5-17.4% of the phenotypic variance (Additional file 2: Table S2). Ch03, Ch05, Ch06 and Ch09 each had two QTL. Yumian1 contributed 10 favorable alleles for fiber micronaire while CA3084 contributed 12. Thirteen QTL were found in two or more environments, and three (qFM07.1, qFM12.1, qFM21.1) in all three (Table 5). Three QTL (qFM03.2, qFM12.1, qFM13.1) explained more than 10% of the phenotypic variance.
Fiber Elongation
Thirteen fiber elongation QTL were found on 12 chromosomes, explaining 6.1-12% of the phenotypic variance (Additional file 2: Table S2). Only two QTL (qFE06.1, qFE25.1) explained more than 10% phenotypic variance. CA3084 contributed 8 favorable alleles, while Yumian1 contributed 5. Eight QTL were found in two or more than two environments, but only qFE16.1 was found in all three, explaining 8.3% of the phenotypic variance (Table 5).
Fiber Uniformity
Nineteen QTL controlling fiber uniformity were detected on 15 chromosomes, explaining 5.6-15.7% of the phenotypic variance (Additional file 2: Table S2). Three QTL (qFU06.1, qFU20.1, qFU21.2) explained more than 10% phenotypic variance. Yumian1 contributed 8 favorable alleles for fiber uniformity, while CA3084 contributed 11. Eleven QTL were found in two or more than two environments, but only qFU12.1 was detected in all three (Table 5).
Cluster analysis
QTL clusters were defined as regions which contained multiple QTL associated with different traits within approximately 20 cM [25]. In this study 14 QTL clusters were found on 13 chromosomes (Table 6). The details of clusters have been provided in the supplementary files (Additional file 3: Table S3). Chr22 had two clusters while Chr12 had only one. Chr06-cluster-1 had 5 QTL, 4 of them stable, explaining 7.8-14.4% of the phenotypic variance. Chr21-cluster-1 had 4 stable QTL for FL, FE, FS and FM explaining 6.9-18.1% of the phenotypic variance. Stability among three environments and significant correlation of fiber quality traits make this QTL cluster particularly worthy of further studies. Chr03-cluster-1 and Chr05-cluster-1 each had 3 QTL, but only one stable QTL. Chr06-cluster-1 had four stable QTL (qFL06.1q, FE06.1, qFS06.1, qFU06.1). Chr07-cluster-1 included 4 QTL for FE, FL, FM, FS explaining 5.9-12.7% of the phenotypic variance, two of which (qFM07.1, qFS07.1) were stable. Chr09-cluster-1 had 3 QTL (qFE09.1, qFM09.2, qFS09.1), all stable, explaining 7.7% of phenotypic variance. Chr10-cluster-1 had 4 QTL, two of which (qFE10.1, qFS10.1) were stable. Ch11-cluster-1 and Chr12-cluster-1 each had 3 QTL, two of which were stable, explaining 9.4% and 11% of the phenotypic variance respectively. Ch16-cluster-1 had 3 QTL (qFE16.1, qFL16.1, qFS16.2), all stable, explaining 8.3% of the phenotypic variance. There were 4 QTL in Ch.17-cluster-1, only one being stable. Ch22-cluster-1 had 5 QTL, but only two were stable. Chr22-cluster-2 had 4 QTL but none were stable. Chr25-cluster-1 had three QTL, two (qFE25.1, qFL25.1) found across all three environments.
Discussion
Prominent features of the method SLAF-seq
Specific locus amplified fragment sequencing (SLAF-seq) is an efficient and cost effective tool that uses high-throughput sequencing for genotyping [18], and has been applied in many plant species for developing high density genetic maps and mapping QTL [19,20,21,22,23]. As compared to other high throughput genotyping technologies, such as GBS and RAD-seq, SLAF-seq provides a reliable and cost effective approach for QTL mapping [24]. High quality DNA is required for RAD-seq, with modified protocols necessary for low quality DNA [26, 27]. Shared restriction sites among genetically divergent clades may be lost progressively due to sequence polymorphism [26]. GBS is less complicated and more cost effective than RAD-seq [28]. Although GBS and SLAF-seq share some basic principles, some features distinguish SLAF-seq from GBS, i.e. 1) pre-experiment endonuclease digestion scheme development through in silico analysis of available sequencing database; 2) ability to create markers evenly distributed over a genome; 3) high quality markers due to stable sequence depth of each sample; 4) high density genetic map development by exploiting abundant SNPs.
In previous studies, usually one or a few common restriction endonucleases were used for genome digestion. The genome specificity was often ignored, which might lead to uneven distribution of the selected fragments in the whole genome. The SLAF-seq strategy remedies this defect [18, 23, 24]. Furthermore, compared with the highest-density SSR genetic map of an Upland cotton intraspecific population [29], the present map also showed some advantages due to prominent features of SLAF-seq. For example, the total number of markers in the present map is far more than the highest-density SSR map, and marker distribution in the present map is more uniform.
Genetic map
In previous studies, most cotton genetic maps were based on SSR, AFLP or RFLP. However, it is very difficult to construct a high density genetic map covering the genome sufficiently due to the low polymorphism rate of SSR and other markers which exploit length polymorphism [30,31,32]. Therefore, SNP markers are a better choice to construct high resolution maps due to their high abundance and even distribution. In the present study, we constructed a high density genetic map which harbored 6254 SNP markers with only 8 gaps larger than 10cM. The distribution of mapped SNP markers was not random throughout the genome. This uneven distribution of markers may be due to the multi-step screening of SLAF markers or selective sweeps during domestication. Good collinearity between physical and genetic maps and higher density suggests that this map is high-quality for QTL mapping.
Segregation Distortion
Segregation distortion is widely observed in plant species [33]. Many studies have reported some factors influencing this phenomenon, such as species, population type and marker type [34,35,36]. Some studies also suggest the genetic effect as the main factor controlling segregation distortion. Segregation distortion in the present study largely favored Yumian 1 alleles, in accordance with our previous studies [29, 31, 35, 37, 38] which showed that Yumian 1 plays a significant role in segregation distortion due to its complex genetic background. Previous studies suggest that segregation distortion will not affect QTL detection if the distorted marker is not significantly linked to the QTL. Large populations can help to decrease the effect of segregation distortion. Fang et al. [39] fine-mapped a QTL (qFS07.1) located in an SDR , illustrating that segregation distortion does not significantly impact QTL position. In some cases segregation distortion may contribute to higher genetic variation, which in turn benefits QTL detection [40].
QTL identification
Significant differences among the RIL population showed that the parents had diverse alleles for many fiber quality traits. Among the total of 95 detected QTL, Yumian 1 contributed 46 favorable alleles while CA3084 contributed 49. Each of the parents contributed equally for many traits. QTL diversity among phenotypically similar parent is consistent with previous reports [29, 41,42,43]. Many QTL for different traits showed overlapping regions, which suggests that these QTL may represent genes with pleiotropic effects contributing to the fiber development network and may contribute to the significant correlations among different fiber quality traits [25, 35, 41, 44,45,46]. A total of 53 detected QTL occurred in 14 clusters on 13 chromosomes. Yumian 1 contributed to all QTLs in Chr.12-cluster-1 and Chr.16-cluster-1. While CA3084 contributed to the all QTL in Chr.25-cluster-1. In rest of the QTL clusters Yumian 1 was the main contributor. Fiber micronaire alleles in all QTL clusters were contributed by CA3084 except for Chr.11-cluster-1 and Chr.12-cluster-1. This was the proof of expected segregation of the trait because of significant difference of this trait of this trait between the parents [32, 46, 47]. Some clusters had notable numbers of stable QTL. For example, Chr21-cluster-1 had 4 QTL mapped at approximately the same location. All QTL in this cluster were stable, explaining 6.9-18.1% of total phenotypic variance. These QTL may be high priorities for fine mapping studies and candidate gene identification.
Concentration of the most of QTL in 14 clusters confirmed the significant positive correlations among fiber quality traits except for fiber micronaire. Similar results were reported in previous studies [29, 35, 47,48,49,50]. This can be the result of presences of multiple genes in certain genomic region or pleiotropic effects of certain genes [25].
The fiber micronaire QTL in the clusters had no consistent correlation in additive effects, hence, showing insignificant phenotypic correlation between micronaire and other traits in the current study. Similar results have been reported in previous study using Yumian 1 as a parent [47].
Stable and common QTL
Environment plays an important role in phenotypic variation of fiber quality traits. Dissection of variability according to environmental and genetic factors enables us to identify ‘stable’ QTL, i.e., which tend to maintain their effects across multiple environments and are high-priority candidates for further studies. In the present study, 55 of 95 detected QTL were found in two or more environments.
We compared the detected QTL with CottonQTLdb (release 2.2, February 01, 2017) [4, 25] based on the physical position of the nearest DNA markers. In total, 59 QTL (Additional file 4: Table S4) corresponded closely with the physical positions on the TM-1 reference genome of previously reported QTL [13], while the rest were newly found in this study. We also compared our QTL results with recently published GWAS related to cotton fiber traits [51,52,53], comparing the physical position of our SNP markers linked with QTL to those of GWAS-identified markers (Additional file 4: Table S4). Only 5 QTL corresponded closely with the physical positions of GWAS markers. These common and stable QTL could be priorities for fine mapping, candidate gene identification and MAS to improve cotton fiber quality. Unstable QTL can be the result of imprecise coarse mapping, or unusual environmental conditions.
Conclusion
In this study, we constructed a high-density genetic map by using SLAF-seq, which covers 3141.7 cM of recombinational length. The average distance between markers was 0.5 cM. A total of 13 QTL clusters containing 29 stable QTL were identified. Some of the stable QTL (qFL06.1, qFM07.1, qFL16.1, qFL21.1) could be valuable for fine mapping and candidate gene identification and can be used in further breeding programs.
Methods
Mapping Population
G. hirsutum cultivars Yumian 1 and CA3084 were crossed to produce a segregating population at Southwest University, Chongqing, China, in the summer of 2010 [38]. Yumian 1 is a Chinese cultivar with high fiber strength, bred by our lab [35] while CA3084 is an American breeding line with high fiber quality (provided by Dr. John R. Gannaway at Texas A & M University Research Experimental Center, Lubbock, TX). Single-seed descent was used from the F2:3 to F2:7 generations to produce a RIL population with 180 lines. The RIL population was grown in three different environments (2015, 2016 in Chongqing, 2016 in Hainan).
Phenotypic data analysis
Mature bolls of the two parents and recombinant inbred lines were harvested manually and fiber quality was measured with HVI900 instruments at the Supervision Inspection and Testing Cotton Quality Center, Anyang, China. The fiber quality traits measured in our study mainly included upper half mean length (FL, mm), fiber strength (FS, cN/tex), fiber elongation (FE), fiber micronaire (FM) and fiber uniformity (FU). These data were analyzed by SPSS 17.0 (SPSS Inc. Released 2008. SPSS Statistics for Windows, Version 17.0. Chicago: SPSS Inc.).
DNA extractions, SLAF library construction and high-throughput sequencing
In the summer of 2015, the leaves of parents and 87 lines were sampled for DNA extraction. Total genomic DNA was extracted according to a modified CTAB method [54]. To assess the number of markers generated by the combination of different endonucleases, an [18, 55] in silico analysis was carried out using a G. hirsutum reference genome [13]. Based on this analysis, two endonucleases HaeIII and SspI (New England Biolabs, USA) were selected to digest the genome of mapping population. The SLAF-seq strategy with some modifications was followed for library construction.
Sequencing data grouping and genotyping
We identified and genotyped SLAF markers according to the procedures described by Sun et al [18] and Jiang et al [55]. Briefly, low-quality reads (< 20e) were filtered out and the remaining high quality reads were sorted to each progeny according to duplex barcode sequences. Then the terminal 5-bp of each high-quality read was trimmed off. Finally, 80 bp paired-end clean reads obtained from the sample were mapped to the G. hirsutum genome sequence [13] using BWA software (V 1.5) with default parameters [56]. Sequences mapped on the same position with over 95 % identity were defined as one SLAF locus [55]. SNP loci in each SLAF locus were then detected between parents using the software GATK (V 3.1.1) with default parameters [57]. All polymorphic SLAF loci with a sequence depth of more than 10 fold in parents were genotyped and the individuals were genotyped based on similarity to the parents. Genotype scoring was conducted to approve the genotyping quality by using a Bayesian approach as described [18]. Markers with more than 15% missing data were removed.
Linkage map construction
SLAF markers were arranged in specific order and genotyping errors were corrected by using the HighMap strategy and SMOOTH algorithm [58, 59]. The k-nearest neighbor algorithm was used to deal with missing genotypes [60]. Then, the genetic linkage map was constructed by Joinmap v4.0 [61]. The genetic map was constructed according to the maximum likelihood method [62]. The Kosambi mapping function was applied to estimate the genetic map distances in centimorgan (cM) [63]. Markers showing significant deviation from Mendelian expectations for segregation (p <0.05) were considered distorted. Regions containing more than three adjacent loci which showed significant segregation distortion were denoted ‘segregation distortion region(s)’ (SDR) [64].
QTL analysis
MapQTL 6.0 [65] was used to identify QTL. A threshold value of log of odds ratio (LOD) ≥2.0 was used to declare suggestive QTL [66]. Positive additive effect means that favorable alleles were contributed by Yumian 1 while negative additive effect means that favorable alleles were contributed by CA3084. QTL identified in two or more environments were considered to be potential stable QTL. MapChart 2.2 [67] was used to graphically represent the genetic map and QTL bars. QTL were named starting with ‘q’, followed by a trait abbreviation and the chromosome number, then by the number of QTL affecting the trait on the same chromosome (e.g., qFE06.1 for the first fiber elongation QTL on chromosome 6). QTL for the same trait across different environments were declared in the same QTL region when their confidence intervals (CI) overlapped.
Abbreviations
- Chr:
-
Chromosome
- CI:
-
Confidence interval
- FE:
-
Fiber elongation
- FL:
-
Fiber length
- FM:
-
Fiber micronaire
- FS:
-
Fiber strength
- FU:
-
Fiber uniformity
- GBS:
-
Genotyping by sequencing
- LOD:
-
Log of Odds
- MAS:
-
Marker assisted selection
- NGS:
-
Next generation sequencing
- PVE:
-
Phenotypic variance explained
- QTL:
-
Quantitative trait locus/loci
- RAD:
-
Restriction-site associated DNA
- RIL:
-
Recombinant inbred line
- SDM:
-
Segregation distortion marker
- SDR:
-
Segregation distortion region
- SLAF-seq:
-
Specific locus amplified fragment sequencing
- SNP:
-
Single nucleotide polymorphism
- SSR:
-
Simple sequence repeat(s)
References
Chen ZJ, Scheffler BE, Dennis E, Triplett BA, Zhang T, Guo W, Chen X, Stelly DM, Rabinowicz PD, Town CD. Toward sequencing cotton (Gossypium) genomes. Plant Physiol. 2007;145(4):1303–10.
Ashraf M, Akram NA, Mehboob Ur R, Foolad MR. Marker-assisted selection in plant breeding for salinity tolerance. Methods Mol Biol. 2012;913:305–33.
Zhang YH, Liu MF, He JB, Wang YF, Xing GN, Li Y, Yang SP, Zhao TJ, Gai JY. Marker-assisted breeding for transgressive seed protein content in soybean [Glycine max (L.) Merr]. Theor Appl Genet. 2015;128(6):1061–72.
Said JI, Knapka JA, Song M, Zhang J. Cotton QTLdb: a cotton QTL database for QTL analysis, visualization, and comparison between Gossypium hirsutum and G. hirsutum x G. barbadense populations. Mol Gen Genet. 2015;290(4):1615–25.
Hyten DL, Cannon SB, Song Q, Weeks N, Fickus EW, Shoemaker RC, Specht JE, Farmer AD, May GD, Cregan PB. High-throughput SNP discovery through deep resequencing of a reduced representation library to anchor and orient scaffolds in the soybean whole genome sequence. BMC Genomics. 2010;11(1):38.
Huo N, Garvin DF, You FM, McMahon S, Luo M-C, Gu YQ, Lazo GR, Vogel JP. Comparison of a high-density genetic linkage map to genome features in the model grass Brachypodium distachyon. Theor Appl Genet. 2011;123(3):455–64.
Bancroft I, Morgan C, Fraser F, Higgins J, Wells R, Clissold L, Baker D, Long Y, Meng J, Wang X. Dissecting the genome of the polyploid crop oilseed rape by transcriptome sequencing. Nat Biotechnol. 2011;29(8):762–6.
Ren Y, Zhao H, Kou Q, Jiang J, Guo S, Zhang H, Hou W, Zou X, Sun H, Gong G. A high resolution genetic map anchoring scaffolds of the sequenced watermelon genome. PLoS One. 2012;7(1):e29453.
Esteras C, Formisano G, Roig C, DÃaz A, Blanca J, Garcia-Mas J, Gómez-Guillamón ML, López-Sesé AI, Lázaro A, Monforte AJ. SNP genotyping in melons: genetic variation, population structure, and linkage disequilibrium. Theor Appl Genet. 2013;126(5):1285–303.
Wang K, Wang Z, Li F, Ye W, Wang J, Song G, Yue Z, Cong L, Shang H, Zhu S, et al. The draft genome of a diploid cotton Gossypium raimondii. Nat Genet. 2012;44(10):1098–103.
Paterson AH, Wendel JF, Gundlach H, Guo H, Jenkins J, Jin D, Llewellyn D, Showmaker KC, Shu S, Udall J, et al. Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres. Nature. 2012;492(7429):423–7.
Li F, Fan G, Wang K, Sun F, Yuan Y, Song G, Li Q, Ma Z, Lu C, Zou C, et al. Genome sequence of the cultivated cotton Gossypium arboreum. Nat Genet. 2014;46(6):567–72.
Zhang T, Hu Y, Jiang W, Fang L, Guan X, Chen J, Zhang J, Saski CA, Scheffler BE, Stelly DM, et al. Sequencing of allotetraploid cotton (Gossypium hirsutum L. acc. TM-1) provides a resource for fiber improvement. Nat Biotechnol. 2015;33(5):531–7.
Li F, Fan G, Lu C, Xiao G, Zou C, Kohel RJ, Ma Z, Shang H, Ma X, Wu J, et al. Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1) provides insights into genome evolution. Nat Biotechnol. 2015;33(5):524–30.
Yuan D, Tang Z, Wang M, Gao W, Tu L, Jin X, Chen L, He Y, Zhang L, Zhu L, et al. The genome sequence of Sea-Island cotton (Gossypium barbadense) provides insights into the allopolyploidization and development of superior spinnable fibres. Sci Rep. 2015;5:17662.
Jo J, Purushotham PM, Han K, Lee HR, Nah G, Kang BC. Development of a Genetic Map for Onion (Allium cepa L.) Using Reference-Free Genotyping-by-Sequencing and SNP Assays. Front Plant Sci. 2017;8:1606.
Wang H, Jin X, Zhang B, Shen C, Lin Z. Enrichment of an intraspecific genetic map of upland cotton by developing markers using parental RAD sequencing. DNA Res. 2015;22(2):147–60.
Sun X, Liu D, Zhang X, Li W, Liu H, Hong W, Jiang C, Guan N, Ma C, Zeng H, et al. SLAF-seq: an efficient method of large-scale de novo SNP discovery and genotyping using high-throughput sequencing. PLoS One. 2013;8(3):e58700.
Cai C, Cheng FY, Wu J, Zhong Y, Liu G. The First High-Density Genetic Map Construction in Tree Peony (Paeonia Sect. Moutan) using Genotyping by Specific-Locus Amplified Fragment Sequencing. PLoS One. 2015;10(5):e0128584.
Geng X, Jiang C, Yang J, Wang L, Wu X, Wei W. Rapid Identification of Candidate Genes for Seed Weight Using the SLAF-Seq Method in Brassica napus. PLoS One. 2016;11(1):e0147580.
Jia Q, Tan C, Wang J, Zhang XQ, Zhu J, Luo H, Yang J, Westcott S, Broughton S, Moody D, et al. Marker development using SLAF-seq and whole-genome shotgun strategy to fine-map the semi-dwarf gene ari-e in barley. BMC Genomics. 2016;17(1):911.
Liang D, Chen M, Qi X, Xu Q, Zhou F, Chen X. QTL Mapping by SLAF-seq and Expression Analysis of Candidate Genes for Aphid Resistance in Cucumber. Front Plant Sci. 2016;7:1000.
Shen C, Jin X, Zhu D, Lin Z. Uncovering SNP and indel variations of tetraploid cottons by SLAF-seq. BMC Genomics. 2017;18(1):247.
Zhang Z, Shang H, Shi Y, Huang L, Li J, Ge Q, Gong J, Liu A, Chen T, Wang D, et al. Construction of a high-density genetic map by specific locus amplified fragment sequencing (SLAF-seq) and its application to Quantitative Trait Loci (QTL) analysis for boll weight in upland cotton (Gossypium hirsutum.). BMC Plant Biol. 2016;16:79.
Said JI, Lin Z, Zhang X, Song M, Zhang J. A comprehensive meta QTL analysis for fiber quality, yield, yield related and morphological traits, drought tolerance, and disease resistance in tetraploid cotton. BMC Genomics. 2013;14(1):776.
Suchan T, Pitteloud C, Gerasimova NS, Kostikova A, Schmid S, Arrigo N, Pajkovic M, Ronikier M, Alvarez N. Hybridization Capture Using RAD Probes (hyRAD), a New Tool for Performing Genomic Analyses on Collection Specimens. PLoS One. 2016;11(3):e0151651.
Bhatia D, Wing R, Singh K. Genotyping by sequencing, its implications and benefits, vol. 40; 2013.
He J, Zhao X, Laroche A, Lu Z-X, Liu H, Li Z. Genotyping-by-sequencing (GBS), an ultimate marker-assisted selection (MAS) tool to accelerate plant breeding. Front Plant Sci. 2014;5:484.
Liu X, Teng Z, Wang J, Wu T, Zhang Z, Deng X, Fang X, Tan Z, Ali I, Liu D, et al. Enriching an intraspecific genetic map and identifying QTL for fiber quality and yield component traits across multiple environments in Upland cotton (Gossypium hirsutum L.). Mol Gen Genet. 2017;292(6):1281–306.
Shang L, Liang Q, Wang Y, Wang X, Wang K, Abduweli A, Ma L, Cai S, Hua J. Identification of stable QTLs controlling fiber traits properties in multi-environment using recombinant inbred lines in Upland cotton (Gossypium hirsutum L.). Euphytica. 2015;205(3):877–88.
Liu D, Liu F, Shan X, Zhang J, Tang S, Fang X, Liu X, Wang W, Tan Z, Teng Z. Construction of a high-density genetic map and lint percentage and cottonseed nutrient trait QTL identification in upland cotton (Gossypium hirsutum L.). Mol Genet Genomics. 2015;290(5):1683–700.
Sun F-D, Zhang J-H, Wang S-F, Gong W-K, Shi Y-Z, Liu A-Y, Li J-W, Gong J-W, Shang H-H, Yuan Y-L. QTL mapping for fiber quality traits across multiple generations and environments in upland cotton. Mol Breed. 2012;30(1):569–82.
Taylor DR, Ingvarsson PK. Common features of segregation distortion in plants and animals. Genetica. 2003;117(1):27–35.
Paterson AH, Lander ES, Hewitt JD, Peterson S, Lincoln SE, Tanksley SD. Resolution of quantitative traits into Mendelian factors by using a complete linkage map of restriction fragment length polymorphisms. Nature. 1988;335(6192):721–6.
Zhang Z-S, Hu M-C, Zhang J, Liu D-J, Zheng J, Zhang K, Wang W, Wan Q. Construction of a comprehensive PCR-based marker linkage map and QTL mapping for fiber quality traits in upland cotton (Gossypium hirsutum L.). Mol Breed. 2009;24(1):49–61.
Xu Y, Zhu L, Xiao J, Huang N, McCouch S. Chromosomal regions associated with segregation distortion of molecular markers in F2, backcross, doubled haploid, and recombinant inbred populations in rice (Oryza sativa L.). Mol Gen Genet. 1997;253(5):535–45.
Hu W, Zhang X, Zhang T, Guo W. Molecular tagging and source analysis of QTL for elite fiber quality in upland cotton. Acta Agron Sin. 2008;34(4):578.
Shao Q, Zhang F, Tang S, Liu Y, Fang X, Liu D, Liu D, Zhang J, Teng Z, Paterson AH. Identifying QTL for fiber quality traits with three upland cotton (Gossypium hirsutum L.) populations. Euphytica. 2014;198(1):43–58.
Fang X, Liu X, Wang X, Wang W, Liu D, Zhang J, Liu D, Teng Z, Tan Z, Liu F, et al. Fine-mapping qFS07.1 controlling fiber strength in upland cotton (Gossypium hirsutum L.). Theor Appl Genet. 2017;130(4):795–806.
Zhang L, Wang S, Li H, Deng Q, Zheng A, Li S, Li P, Li Z, Wang J. Effects of missing marker and segregation distortion on QTL mapping in F2 populations. Theor Appl Genet. 2010;121(6):1071–82.
Shen X, Guo W, Zhu X, Yuan Y, John ZY, Kohel RJ, Zhang T. Molecular mapping of QTLs for fiber qualities in three diverse lines in Upland cotton using SSR markers. Mol Breed. 2005;15(2):169–81.
Tan Z, Fang X, Tang S, Zhang J, Liu D, Teng Z. Genetic map and QTL controlling fiber quality traits in upland cotton (Gossypium hirsutum L.). Euphytica. 2014;203:615-28.
Li W, Shang H, Ge Q, Zou C, Cai J, Wang D, Fan S, Zhang Z, Deng X, Tan Y, et al. Genome-wide identification, phylogeny, and expression analysis of pectin methylesterases reveal their major role in cotton fiber development. BMC Genomics. 2016;17(1):1000.
Paterson A, Saranga Y, Menz M, Jiang C-X. Wright R. QTL analysis of genotype× environment interactions affecting cotton fiber quality. TAG Theor Appl Genet. 2003;106(3):384–96.
Lacape J-M, Nguyen T-B, Courtois B, Belot J-L, Giband M, Gourlot J-P, Gawryziak G, Roques S, Hau B. QTL analysis of cotton fiber quality using multiple× backcross generations. Crop Sci. 2005;45(1):123–40.
Shen X, Guo W, Lu Q, Zhu X, Yuan Y, Zhang T. Genetic mapping of quantitative trait loci for fiber quality and yield trait by RIL approach in Upland cotton. Euphytica. 2007;155(3):371–80.
Tan Z, Zhang Z, Sun X, Li Q, Sun Y, Yang P, Wang W, Liu X, Chen C, Liu D, et al. Genetic Map Construction and Fiber Quality QTL Mapping Using the CottonSNP80K Array in Upland Cotton. Front Plant Sci. 2018;9:225.
Du X, Huang G, He S, Yang Z, Sun G, Ma X, Li N, Zhang X, Sun J, Liu M, et al. Resequencing of 243 diploid cotton accessions based on an updated A genome identifies the genetic basis of key agronomic traits. Nat Genet. 2018;50(6):796–802.
Ma Z, He S, Wang X, Sun J, Zhang Y, Zhang G, Wu L, Li Z, Liu Z, Sun G, et al. Resequencing a core collection of upland cotton identifies genomic variation and loci influencing fiber quality and yield. Nat Genet. 2018;50(6):803–13.
Wang M, Tu L, Lin M, Lin Z, Wang P, Yang Q, Ye Z, Shen C, Li J, Zhang L, et al. Asymmetric subgenome selection and cis-regulatory divergence during cotton domestication. Nat Genet. 2017;49:579.
Diouf L, Magwanga RO, Gong W, He S, Pan Z, Jia YH, Kirungu JN, Du X. QTL Mapping of Fiber Quality and Yield-Related Traits in an Intra-Specific Upland Cotton Using Genotype by Sequencing (GBS). Int J Mol Sci. 2018;19(2):441.
Sun Z, Wang X, Liu Z, Gu Q, Zhang Y, Li Z, Ke H, Yang J, Wu J, Wu L, et al. Genome-wide association study discovered genetic variation and candidate genes of fibre quality traits in Gossypium hirsutum L. Plant Biotechnol J. 2017;15(8):982–96.
HS S, KI S, Sateesh A, JM P, LS P, RLV B. Association mapping for seed cotton yield, yield components and fibre quality traits in upland cotton (Gossypium hirsutum L.) genotypes. Plant Breed. 2017;136(6):958–68.
Zhang P, Zhu X, Guo W, Zhang T. Genetic analysis and QTLs tagging of lint percentage and its closely related yield components in upland cotton. Jiangsu J Agric Sci. 2005;21(4):264–71.
Jiang B, Liu W, Xie D, Peng Q, He X, Lin Y, Liang Z. High-density genetic map construction and gene mapping of pericarp color in wax gourd using specific-locus amplified fragment (SLAF) sequencing. BMC Genomics. 2015;16:1035.
Li H, Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009;25(14):1754–60.
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20(9):1297–303.
Liu D, Ma C, Hong W, Huang L, Liu M, Liu H, Zeng H, Deng D, Xin H, Song J, et al. Construction and analysis of high-density linkage map using high-throughput sequencing data. PLoS One. 2014;9(6):e98855.
van Os H, Stam P, Visser RG, van Eck HJ. SMOOTH: a statistical method for successful removal of genotyping errors from high-density genetic linkage data. Theor Appl Genet. 2005;112(1):187–94.
Huang X, Zhao Y, Wei X, Li C, Wang A, Zhao Q, Li W, Guo Y, Deng L, Zhu C, et al. Genome-wide association study of flowering time and grain yield traits in a worldwide collection of rice germplasm. Nature Genet. 2011;44(1):32–9.
Stam P. Construction of integrated genetic linkage maps by means of a new computer package: Join Map. Plant J. 1993;3(5):739–44.
JW VANO. Multipoint maximum likelihood mapping in a full-sib family of an outbreeding species. Genet Res. 2011;93(5):343–9.
Kosambi DD. The estimation of map distances from recombination valueS. Ann Eugenics. 1943;12(1):172–5.
Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, Jones SJ, Marra MA. Circos: an information aesthetic for comparative genomics. Genome Res. 2009;19(9):1639–45.
Van Ooijn J. MapQTL® 6.0. Sofware for the mapping of quantitative trait loci in experimental populations of diploid species Wageningen. The Netherlands: Kyazma BV; 2009.
Lander E, Kruglyak L. Genetic dissection of complex traits: guidelines for interpreting and reporting linkage results. Nat Genet. 1995;11(3):241–7.
Voorrips R. MapChart: software for the graphical presentation of linkage maps and QTLs. J Hered. 2002;93(1):77–8.
Acknowledgements
Not applicable.
Funding
Financial support for the design of the study and collection, analysis, and interpretation of data and in writing the manuscript was provided by the Natural Science Foundation of China (Grant No. 31571720 and 31371671).
Availability of data and materials
The datasets supporting the conclusions drawn in this article are included within the article and its Additional files 1, 2, 3 and 4.
Sequencing data related to this study has been uploaded to NCBI SRA database, which can be accessed through series of SRA numbers (SRR8205748 to SRR8205836).
Author information
Authors and Affiliations
Contributions
AI contributed to data analysis and manuscript writing; TZY, LXY and WWW contributed to data analysis; BYT, YQ, HYS, HJ, JYB, TLX, KK, SAYA, contributed to DNA extraction and field work; TZH and LDX, ZJ, GK, LDJ contributed to population construction; ZZS contributed to experimental design and writing. All authors have read and approved the final manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
We have all relevant rights to the materials used in this study. All materials were grown in the field in accordance with local legislation.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional files
Additional file 1:
Table S1. Genome coverage of genetic map (XLSX 11 kb)
Additional file 2:
Table S2. QTL for fiber quality in RIL population in three environments (XLSX 20 kb)
Additional file 3:
Table S3. Details of QTL Clusters for fiber quality traits (XLSX 14 kb)
Additional file 4:
Table S4. Common QTL compared with CottonQTLdb (XLSX 15 kb)
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
Ali, I., Teng, Z., Bai, Y. et al. A high density SLAF-SNP genetic map and QTL detection for fibre quality traits in Gossypium hirsutum. BMC Genomics 19, 879 (2018). https://doi.org/10.1186/s12864-018-5294-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12864-018-5294-5