- Research article
- Open Access
High-density linkage map construction and QTL analyses for fiber quality, yield and morphological traits using CottonSNP63K array in upland cotton (Gossypium hirsutum L.)
BMC Genomics volume 20, Article number: 889 (2019)
Improving fiber quality and yield are the primary research objectives in cotton breeding for enhancing the economic viability and sustainability of Upland cotton production. Identifying the quantitative trait loci (QTL) for fiber quality and yield traits using the high-density SNP-based genetic maps allows for bridging genomics with cotton breeding through marker assisted and genomic selection. In this study, a recombinant inbred line (RIL) population, derived from cross between two parental accessions, which represent broad allele diversity in Upland cotton, was used to construct high-density SNP-based linkage maps and to map the QTLs controlling important cotton traits.
Molecular genetic mapping using RIL population produced a genetic map of 3129 SNPs, mapped at a density of 1.41 cM. Genetic maps of the individual chromosomes showed good collinearity with the sequence based physical map. A total of 106 QTLs were identified which included 59 QTLs for six fiber quality traits, 38 QTLs for four yield traits and 9 QTLs for two morphological traits. Sub-genome wide, 57 QTLs were mapped in A sub-genome and 49 were mapped in D sub-genome. More than 75% of the QTLs with favorable alleles were contributed by the parental accession NC05AZ06. Forty-six mapped QTLs each explained more than 10% of the phenotypic variation. Further, we identified 21 QTL clusters where 12 QTL clusters were mapped in the A sub-genome and 9 were mapped in the D sub-genome. Candidate gene analyses of the 11 stable QTL harboring genomic regions identified 19 putative genes which had functional role in cotton fiber development.
We constructed a high-density genetic map of SNPs in Upland cotton. Collinearity between genetic and physical maps indicated no major structural changes in the genetic mapping populations. Most traits showed high broad-sense heritability. One hundred and six QTLs were identified for the fiber quality, yield and morphological traits. Majority of the QTLs with favorable alleles were contributed by improved parental accession. More than 70% of the mapped QTLs shared the similar map position with previously reported QTLs which suggest the genetic relatedness of Upland cotton germplasm. Identification of QTL clusters could explain the correlation among some fiber quality traits in cotton. Stable and major QTLs and QTL clusters of traits identified in the current study could be the targets for map-based cloning and marker assisted selection (MAS) in cotton breeding. The genomic region on D12 containing the major stable QTLs for micronaire, fiber strength and lint percentage could be potential targets for MAS and gene cloning of fiber quality traits in cotton.
The cotton genus Gossypium spp. consists of at least 51 species, with 45 diploid (2n = 2x = 26) and six allotetraploid (2n = 4x = 52, AD) [1, 2] species. Of these only four are cultivated species: G. hirsutum L. (2n = 4x, AADD), G. barbadense L. (2n = 4x, AADD), G. arboreum L. (2n = 2x, AA) and G. herbaceum L. (2n = 2x, AA). G. hirsutum L., also called Upland cotton, contributes to more than 90% of the global cotton production and acreage and G. barbadense L., known as Pima cotton, accounts for 8% of the cotton production in the world .
As the largest natural fiber source, cotton is one of the most important economic crops worldwide. In 2018/19 season, cotton was primarily grown in around 30 countries, with more than 116 million bales of fiber produced . In the United States, which is the third largest cotton fiber producing country as well as the largest cotton fiber exporting country in the world, 18.59 million bales of cotton fiber was produced with 15 million bales exported in 2018/19 season . The production, distribution and processing of cotton in the United States provide about $27 billion direct business revenue while supporting more than 200 thousand jobs . However, the world cotton fiber market is recently under a lot of pressure because of the development of synthetic fibers . In addition, the US cotton has to compete with handpicked cotton from Asia. Currently, the US cotton could compete in the international markets because of its higher fiber quality. Therefore, improving the fiber quality has been an important objective of cotton breeders in the US. Farm productivity and economic viability of cotton production directly related to the lint yields . As such, continued improvements in the fiber quality and yield are critical for the US cotton production.
Plant height, a typical quantitatively inherited trait [7,8,9], can indirectly influence the yield of cotton fiber because optimal plant height can contribute to machine harvesting and help achieve higher harvesting index . Fuzziness seed trait, an important seed trait related to the cotton yield and fiber quality , was usually considered as a binomial trait (fuzzy seed or fuzzless seed) while some reports indicated this trait was polygenically controlled [10,11,12,13].
In general, fiber quality and yield traits in cotton are known to inherit polygenically and influenced by environment [14,15,16]. Further, fiber quality traits often have negative association with some yield traits . Although, traditional breeding methods played an important role in the development of cotton cultivars [18, 19], further improvements in the trait values especially for the quantitative traits using these breeding approaches have been limited [20, 21]. With the advancement of molecular marker technology, maker-assisted selection (MAS) has been increasingly applied in the cotton breeding programs . Restriction fragment length polymorphism (RFLP) markers were the first type of the markers used in the cotton improvement  and the first linkage maps in cotton were constructed using RFLP markers in 1994 . From then on, various types of the molecular markers were used in the cotton genetics and breeding [25,26,27,28,29,30,31,32]. High-density genetic maps with broadly adaptable markers are required for improving the efficiency in detection and MAS-based transfer of quantitative trait loci (QTLs) [33,34,35,36,37,38,39]. The abundance, extensive polymorphism and compatibility to high-throughput genotyping platforms have made the single nucleotide polymorphism (SNP) markers the most popular markers used in plant translational genomics [40,41,42]. With the development of next-generation sequencing (NGS) technologies, several methods to discover large numbers of SNP-based markers are now developed for cotton [36,37,38,39,40]. This enabled the development of high-density linkage maps in cotton [36,37,38,39,40]. In the present study, we used 63K SNP array  for genotyping a recombinant inbred line (RIL) population, derived from landrace by elite germplasm line cross, to construct a high-density linkage map and to map the QTLs for cotton fiber quality, yield and morphological traits in Upland cotton.
Analyses of the phenotypic traits
A summary of the statistical analyses for the phenotypic performance of the twelve traits is presented in Table 1. Among the six fiber quality traits measured, micronaire (MIC), upper half mean length (UHM), uniformity index (UI) and fiber strength (STR) of the parental accession NC05AZ06 were significantly (P < 0.05) higher (13.0–16.9%, 34.1–36.6%, 4.4–7.6%, 7.4–8.1%, respectively) than those of the parental accession NC11–2091 while the short fiber content (SFC) of NC11–2091 was significantly (P < 0.05) greater (26.3–55.3%) than that of NC05AZ06. No significant difference was found between the two parents for the fiber elongation (ELO). All the four yield traits, boll weight (BW), lint percentage (LP), seed index (SI) and lint index (LI) were significantly (P < 0.01) higher (209.4–222.8%, 137.2–160.0%, 12.5–24.6%, 311.8–317.9%, respectively) in NC05AZ06 than in NC11–2091. For morphological traits, the plant height (PH) of NC05AZ06 was significantly (P < 0.01) lower (− 32.5%) than NC11–2091. The seed fuzziness grade (FG) of NC05AZ06 was 100% (fuzz-rich) and the FG of NC11–2091 was 0 (fuzz-free). The broad-sense heritability of the traits calculated by the ratio of total genetic variance to total phenotypic variance for all the traits is listed in Table 2. Most traits, except for PH, had high broad-sense heritability across 2 years with values ranging from 82 to 96%. The broad-sense heritability of PH was only 56%. Since we only had 1 year’s data for PH, we can just state that the trait performance of PH might be sensitive to the environment.
The results of correlation analyses for the twelve traits was described in Table 3. Among the fiber quality traits, UHM was significantly (P < 0.01) positively correlated with UI, BW, LP, LI, FG, and significantly (P < 0.01) negatively correlated with MIC, ELO and SFC. The STR was significantly positively correlated with BW (P < 0.05), SI (P < 0.01) and PH (P < 0.05), and was significantly negatively correlated with ELO (P < 0.05) and LP (P < 0.01). The SFC was significantly (P < 0.01) positively correlated to MIC, ELO and it was significantly (P < 0.01) negatively correlated to UI. The ELO was significantly (P < 0.01) positively correlated with MIC and significantly negatively related to UI (P < 0.01) and BW (P < 0.05) (Table 3). Almost all the four yield traits BW, LP, SI, and LI showed a highly positive correlation with each other, except for LP and SI, which the correlation was not significant (Table 3). The morphological trait PH had a negative correlation with yield traits BW, LP and LI, and a positive correlation with SI and STR, respectively. Another morphological trait fuzziness grade was highly positively correlated with all the four yield traits (Table 3).
Construction of linkage maps
Out of 63,058 SNPs used in the genotyping, 11,255 (17.8%) SNPs were polymorphic between the two parents. A total of 3129 SNPs were selected for linkage map construction after removing the poor quality or duplicate SNPs. All the 3129 markers were mapped on 26 linkage groups (26 chromosomes) (Figs. 1, 2, 3, 4, 5, 6 and 7, and Additional file 2: Table S2). This resulted in the genetic map length of 4422.44 cM with an average distance of 1.41 cM between markers (Table 4). Of these 3129 SNPs, 1534 SNPs were mapped to the A sub-genome while 1595 SNPs were mapped to the D sub-genome. The mapped SNPs of the A sub-genome generated a genetic map of 2236.35 cM with an average marker density of 1.46 cM while 1595 SNPs of the D sub-genome gave a genetic map of 2186.09 cM with an average marker density of 1.37 cM (Table 4). Genetic lengths of 26 linkage groups ranged from 103.9 cM to 252.5 cM. Number of markers mapped per chromosome range from 69 to 180 and average marker density ranging from 1.09 cM to 1.72 cM in each group (Table 4). Five gaps (adjacent marker distance > 10 cM) with the interval distances of 11.02 cM, 11.30 cM, 14.59 cM, 10.01 cM and 10.01 cM were identified on 5 different linkage groups Chr.03 (A3), Chr.08 (A09), Chr.09 (D5), Chr.26 (D6) and Chr.05 (D11), respectively (Table 4).
Of the 3129 mapped SNPs, 175 (5.6%) SNP markers showed segregation distortion which spanned on 22 chromosomes, with the most distorted markers (34) and highest distortion rate (25.37%) on Chr.02 (A13) (Table 4). Seventeen segregation distortions region (SDR) were identified on 13 chromosomes, with 9 of the SDRs in A sub-genome and 8 SDRs in the D sub-genome (Table 4). Hence, the sub-genomes did not show any bias for the SDRs.
Comparison of the genetically mapped SNPs with the sequence based physical map of the TM-1 (G. hirsutum) reference genome sequence  for syntenic relationships showed that the strong collinearity between the genetic map and physical map (Fig. 8). The SNP based genetic map of 4422.44 cM corresponded to 1911.76 Mb of the sequence based physical map which represented 98.8% of the total length of the sequence based physical map (Additional file 2: Table S2 and Additional file 4: Table S4). All linkage groups showed good collinearity with the physical map. Coverage of the individual chromosomes ranged from 96.4 to 99.5% of the sequence based physical map. Figure 8 shows the circos plots that describe strong collinearity between the genetic map and physical map. Finally, collinearity between genetic and physical maps suggest that the genetic mapping population used in the current study did not contain any chromosomal rearrangements.
QTL analysis for cotton fiber quality, yield and morphological traits
QTL analysis using composite interval mapping (CIM) identified a total of 106 QTLs, with 59 of QTLs for fiber quality traits, 38 for yield traits and 9 for morphological traits (Additional file 1: Table S1). Overall the phenotypic variation explained by the QTLs ranged from 3.6–48.0% (Additional file 1: Table S1). Among the 106 QTLs, 22 were stable QTLs identified in both years, 40 QTLs were identified only in 2016 and 44 QTLs were identified only in 2017. By determining that the SFC with lower value was favorable and other traits (BW, SI, LI, LP, STR, MIC, UHM and UI) with higher value were favorable, the favorable alleles of 80 QTLs were derived from NC05AZ06 (P1) with positive additive effects whereas 26 QTLs with negative additive effects were contributed by NC11–2091 (P2). Of the 106 QTLs, 57 QTLs were mapped in the A sub-genome and 49 QTLs were in the D sub-genome (Table 4). Among the 57 A sub-genome QTLs, 43 QTLs with favorable alleles were from NC05AZ06 and 14 were from NC11–2091. In the D sub-genome, 37 QTLs with favorable alleles were contributed by NC05AZ06 and the 12 were contributed by NC11–2091. Overall, of the 106 mapped QTLs, 46 QTLs were major QTLs with PVE > 10%. These included 29 QTLs for fiber quality traits (Table 5) (18 in the A sub-genome and 11 in the D sub-genome), 12 QTLs for yield traits (Table 6) (5 QTLs in the A sub-genome and 7 in the D sub-genome) and 5 QTLs for morphological traits (one in A sub-genome and 4 in D sub-genome (Table 7).
QTL for fiber quality traits
A total of 59 QTLs, including 15 stable QTLs, 23 QTLs in 2016 and 21 QTLs in 2017, were identified for six fiber quality traits with the PVE ranging from 4.1 to 25.8% (Table 5, Additional file 1: Table S1). Parental accession NC05AZ06 contributed favorable alleles for 43 QTLs while NC11–2091 donated 16 QTLs. Sub-genome wide, of the 59 fiber quality QTLs, 31 QTLs were mapped in the A sub-genome (24 QTLs with favorable alleles from NC05AZ06 and 7 from NC11–2091) and 28 QTLs were mapped on the D sub-genome (19 QTLs with favorable alleles from NC05AZ06 and 9 from NC11–2091).
For fiber micronaire, seven QTLs explaining 4.1 to 25.8% of the phenotypic variance (PV) were identified, among which 5 are major QTLs (Table 5 and Additional file 1: Table S1). Three major stable QTLs, qMIC-CH10-A5–1, qMIC-CH24-D3–1, and qMIC-CH25-D12–1 explained 16.2–16.2%, 23–25.8%, 4.1–10.0% of phenotypic variance, respectively. Two major QTLs qMIC-16-CH3-A3–1 and qMIC-16-CH6-D7–1 with the PVE 17.2 and 19.3%, respectively, were detected in the 2016 dataset. The qMIC-CH10-A5–1 was the only QTL with favorable alleles derived from parental accession NC11–2091.
Upper half mean length (UHM)
UHM is a measure of fiber length. Ten QTLs explaining 5.5 to 12.1% of PV were identified (Table 5 and Additional file 1: Table S1). Five major QTLs, including 3 QTLs (qUHM-16-CH5-D11–1, qUHM-16-CH7-A7–1, qUHM-16-CH24-D3–1) in 2016 and 2 QTLs (qUHM-17-CH7-A7–1, qUHM-17-CH23-A2–1) in 2017, with the PVE ranging from 10.1 to 12.1% were detected. Majority of the QTLs with favorable alleles were derived from the parent NC05AZ06. The qUHM-16-CH5-D11–1 was the only QTL with favorable alleles derived from NC11–2091.
Uniformity index (UI)
Ten QTLs explaining 4.9 to 21% of PV were detected and mapped for UI in the genetic maps (Table 5 and Additional file 1: Table S1). Seven QTL favorable alleles were conferred by parental accession NC05AZ06. Of these, six were major QTLs. These included 2 stable QTLs, qUI-CH3-A3–1 and qUI-CH11-A10–1 with 6.0–21.0%, 4.9–16.1%, respectively, of PVE and 4 single-year QTLs (qUI-16-CH4-A11–1, qUI-16-CH10-A5–1, qUI-17-CH21-D2–1, qUI-17-CH26-D6–1) explaining 10.0–13.1% of PV.
Fiber strength (STR)
For fiber strength, 11 QTLs explaining 4.1 to 15.6% of PV, with 7 QTLs having favorable alleles conferred by NC05AZ06 were detected (Table 5 and Additional file 1: Table S1). Of these, four were major QTLs, including three stable QTLs (qSTR-CH2-A13–1, qSTR-CH19-D1–1, qSTR-CH25-D12–1) with PVE of 7.3–11.9%, 5.2–11.8%, 11.5–15.6%, respectively, and one QTL (qSTR-17-CH14-A8–1), detected only in 2017, explaining 11.9% of PV.
Fiber elongation (ELO)
Nine QTLs explaining 5.7 to 13.5% of PV were mapped in the linkage maps (Table 5 and Additional file 1: Table S1). Of these 9 QTLs for elongation, four were major QTLs which included three stable QTLs (qELO-CH4-A11–1, qELO-CH8-A9–1, qELO-CH19-D1–1) with 7.1–13.5%, 7.5–12.3%, 8.1–12.4% of PVE and one QTL (qELO-17-CH23-A2–1) detected only in 2017, explained 11.2% of PV. Further, five of these mapped QTLs had favorable alleles from NC11–2091 for fiber elongation.
Short fiber content (SFC)
A total of 12 QTLs explaining 4.9 to 20.6% of PV were identified, including 5 major QTLs. One major QTL (qSFC-CH3-A3–1) was detected in both years with 7.9–18.4% of PVE, to which the favorable allele was contributed by NC05AZ06. Another 4 major QTLs (qSFC-17-CH2-A13–1, qSFC-16-CH4-A11–1, qSFC-17-CH5-D11–1, qSFC-17-CH18-A6–1) with the PVE ranging from 12.4 to 20.6% were detected in a single year environment. Since cotton fiber with high SFC is adverse to its quality , SFC with lower values are considered favorable. Most of the QTL favorable alleles were derived from NC05AZ06, except for qSFC-CH3-A3–1 and qSFC-17-CH26-D6–1.
QTL for yield traits
A total of 38 QTLs, including 5 stable QTLs, 16 QTLs in 2016 and 17 QTLs in 2017, were identified for yield traits (BW, LP, SI, LI), with the PVE ranging from 4.2 to 30.4% (Table 6 and Additional file 1: Table S1). Accession NC05AZ06 contributed favorable alleles to 35 QTLs while NC11–2091 only donated favorable alleles for 3 of these 38 total QTLs. Further, of the 38 yield QTLs, 22 QTLs were mapped in the A sub-genome, including 19 QTLs with favorable alleles contributed by NC05AZ06 and 3 contributed by NC11–2091 and 16 QTLs were mapped in the D sub-genome, of which the favorable alleles were all contributed by NC05AZ06.
Boll weight (BW)
For boll weight, 11 QTLs explaining 5 to 14% of PV were identified and all favorable alleles of the QTLs were derived from NC05AZ06. Two major QTLs (qBW-16-CH4-A11–1, qBW-16-CH22-D13–1) with 14.0, 12.4% of PVE, respectively were detected only in year 2016 and another major QTL qBW-17-CH4-A11–1 with 12.1% of PVE was identified in year 2017.
Lint percentage (LP)
Eight QTLs explaining 5.9 to 17.7% of PV were identified for lint percentage (LP) which included 3 major QTLs (Table 6 and Additional file 1: Table S1). Two major and stable QTLs qLP-CH24-D3–1 and qLP-CH25-D12–1 explained 8.8–16.6%, 5.9–17.7% of PV, respectively for LP. Another major QTL (qLP-17-CH14-A8–1) with 15.2% of PVE, were detected in 2017 dataset. All favorable alleles of these QTLs were derived from NC05AZ06.
Seed index (SI)
For seed index, 9 QTLs explaining 5.8 to 30.4% of PV were detected. Among them, 6 QTLs with favorable alleles were derived from NC05AZ06. Two major and stable QTLs qSI-CH12-D10–1 and qSI-CH15-A12–1 with 27.1–30.4%, 10.6–17.5% of PVE, respectively, were identified in both environments.
Lint index (LI)
Ten QTLs explaining 4.2 to 21.1% of PV for lint index were identified and all their favorable alleles were contributed by NC05AZ06. Four major QTLs, including two QTLs detected only in the year 2016 (qLI-16-CH2-A13–1, qLI-16-CH12-D10–1) and other two QTLs detected only in year 2017 (qLI-17-CH22-D13–1, qLI-17-CH24-D3–1), explained 10.8, 13.5, 21.1, 12.6% of PV, respectively.
QTL for morphological traits
A total of 9 QTLs, including 2 stable QTLs, 1 QTL in 2016 and 6 QTLs in 2017, were identified for morphological traits (plant height and fuzziness grade), with the PVE from 3.6 to 48% (Table 7 and Additional file 1: Table S1). Accession NC05AZ06 contributed favorable alleles to 2 QTLs (qFG-CH22-D13–1, qFG-16-CH25-D12–2) whereas NC11–2091 donated favorable alleles for 7 of the 9 total QTLs (4 QTLs on the A sub-genome and 5 QTLs on the D sub-genome).
Plant height (PH)
Five QTLs explaining 6.5 to 15.8% of PV were identified in year 2017 and all these QTLs with positive additive effect for plant height were derived from NC11–2091. Three major QTLs for plant height (qPH-17-CH8-A9–1, qPH-17-CH9-D5–1, qPH-17-CH19-D1–1) explained 10.3, 15.8 and 10.4% of PV, respectively (Table 7).
Seed fuzziness grade (FG)
For seed fuzziness grade, 4 QTLs explaining 3.6 to 48% of PV were identified, of which 2 are major stable QTLs. Major stable QTL (qFG-CH22-D13–1) was the only QTL with positive additive effect for seed fuzziness contributed by NC05AZ06, with 39.2–48% of PVE. Another major stable QTL (qFG-CH25-D12–1) explained 3.6–19% of PV for seed fuzziness (Table 7).
A QTL cluster is a short region (< 30 cM) on the linkage map containing multiple QTLs . In this study, 21 QTL Clusters (Tables 8 and 9) were identified on 16 different chromosomes (Chr3, Chr4, Chr7, Chr8, Chr9, Chr10, Chr11, Chr12, Chr15, Chr16, Chr19, Chr21, Chr22, Chr23, Chr24 and Chr25) (Figs. 1, 2, 3, 4, 5, 6 and 7). Twelve QTL clusters were detected in A sub-genome and 9 clusters were detected in D sub-genome. Seven QTL clusters (Q-1 to Q-7) contained multiple fiber quality trait QTLs. Cluster Q-1, Q-3, Q-4, Q-5, Q-6 were identified with QTLs from SFC and UI (Table 8). In each of these 5 clusters, the favorable alleles of SFC and UI QTLs were contributed by same parents with different signs (“+” or “-”) of additive effects. For yield traits, four QTL clusters (Y-1 to Y-4) were identified (Table 8). The favorable alleles for most of the yield QTLs in these clusters were derived from NC05AZ06.
Ten QTL clusters contained multiple QTLs from different trait categories (Table 9). The QYA-1 and QYA-2 were two clusters carrying multiple QTLs from all 3 trait categories. The QYA-1 with a region in Chr.8 from 56.8 cM to 78.78 cM, contained 4 QTLs for FG, ELO, LP and PH. QYA-2 with a region in Chr.25 from 97.24 cM to 108.63 cM, carried 4 QTLs for STR, MIC, LP and FG.
Meta QTL analysis
A total of 2884 cotton QTLs for 11 traits: MIC(442), UHM(524), UI(289), STR(470), ELO(287), SFC(58), BW(176), LP(327), LI(42), SI(147), PH(122), which were collected by the CottonQTLdb [14, 15, 45] in different interspecific or intraspecific populations from 156 previous publications (http://www2.cottonqtldb.org:8081/references), were used for meta-QTL analysis in recent study (See additional file 3: Table S3).
In the current study, 74 QTLs were found to share the similar genetic positions (genetic distance window of < 20 cM) with previous reported QTLs, including 39 QTLs in the A sub-genome and 35 QTLs in D sub-genome. All these 74 shared QTLs were separated in to 11 different traits: STR (11), UI (10), UHM (7), MIC (7), ELO (7), SFC (7), BW (7), SI (6), LP (5), LI (4) and PH (3), including 33 major QTLs. Thirteen of these shared QTLs were stable QTLs (qELO-CH8-A9–1, qSTR-CH2-A13–1, qSTR-CH19-D1–1, qSTR-CH25-D12–1, qLP-CH24-D3–1, qMIC-CH10-A5–1, qMIC-CH24-D3–1, qMIC-CH25-D12–1, qSFC-CH3-A3–1, qSI-CH12-D10–1, qSI-CH15-A12–1, qUI-CH3-A3–1, qUI-CH11-A10–1). More than 70% of the QTLs shared the similar genetic positions with previously reported fiber quality and yield QTLs, which indicating consistency between the current study and previous studies. All the QTLs for STR, UI and MIC located on the similar genetic positions with previously reported QTLs. Twenty-eighty QTLs were unique QTLs with 17 QTLs in A sub-genome and 11 QTLs in D sub-genome, including 5 for SFC, 3 for UHM, 2 for ELO, 6 for LI, 4 for BW, 3 for SI, 3 for LP, 2 for PH. Out of these 28 unique QTLs, 11 were major QTLs (qBW-17-CH4-A11–1, qBW-16-CH4-A11–1, qELO-CH4-A11–1, qELO-CH19-D1–1, qLI-17-CH24-D3–1, qLI-16-CH12-D10–1, qLP-CH25-D12–1, qPH-17-CH9-D5–1, qSFC-17-CH2-A13–1, qUHM-17-CH23-A2–1, qUHM-16-CH24-D3–1). Three of them were stable QTLs: qELO-CH4-A11–1, qELO-CH19-D1–1, qLP-CH25-D12–1, which could be good addition to the existing QTLs.
Candidate gene analysis
BLAST searching of the 22 genomic regions harboring stable QTLs in the Cotton Functional Genomics Database (https://cottonfgd.org/) identified 33 known genes as candidates genes that had been reported [46,47,48,49,50,51,52,53,54,55,56,57,58] to have functional role in cotton fiber development  (Additional file 5: Table S5). Out of these 33 candidate genes, 19 genes, reportedly have functional role in fiber development, were mapped in the 11 major and stable QTL regions which were identified in both years. These included 3 QTLs for ELO, 3 QTLs for STR, 2 QTLs for MIC, 1 for UI, 1 for SFC and 1 for LP (Table 10). Further, the 6 reported fiber related candidate genes were found in the QTL cluster QYA-2 on chromosome D12, which contained 4 major stable QTLs: qLP-CH25-D12–1, qMIC-CH25-D12–1, qSTR-CH25-D12–1 and qFG-CH25-D12–1 (Additional file 5: Table S5).
Construction of high-density linkage maps with SNP arrays
The limited quantity of the polymorphic markers available were often limitations for the construction of high-density linkage maps in cotton [60, 61]. Due to the lack of the marker polymorphism in cotton, the linkage maps built by second-generation molecular markers such as SSRs and AFLPs, usually carried some disadvantages viz low marker coverage of the cotton genome, poor marker density and large gaps [61,62,63,64]. SNPs provide abundant genetic variation and their loci distribute evenly along the whole genome. Hence, they have been the most reliable markers for building high-density linkage maps and have been widely used in the QTL studies [40, 65, 66]. Recently, two sets of cotton SNP arrays CottonSNP63K and CottonSNP80K were developed and were used in the QTL mapping [40, 65]. Several high-density cotton genetic maps constructed successfully using these SNP arrays [35, 66,67,68,69,70]. In the current study, a linkage map was constructed using SNPs from CottonSNP63K array. The genetic map spanned a total length of 4422.44 cM, which was in correspondence with the estimated size of tetraploid cotton genome (4500 cM) . The average marker density of the map was 1.41 cM. No large gaps (> 15 cM) were found and marker density and coverage was better than the SSR-based linkage maps developed previously [27, 28, 30, 62,63,64, 72]. We identified 11,255 (17.8%) polymorphic SNPs between the parents from the 63,058 SNP markers and only 3129 (5.0%) of the polymorphic SNPs were unique. Based on the previous studies, the polymorphism rate of SSRs and SNPs for a RIL population was around 3–10% [62,63,64,65,66,67,68,69]. Out of a total 3129 SNP markers mapped, distribution of SNP markers was fairly even between A (1,534 SNPs) and D (1,595 SNPs) sub-genomes. Genetic map lengths produced by these markers in A and D sub-genomes were 2236.35 cM and 2186.09 cM, respectively. Further, SNP linkage maps showed a high level of collinearity with the sequence based physical map of Upland cotton (Fig. 8) suggesting there were no chromosome rearrangements among the parents and mapping population used for mapping. Finally, the circos plot suggested the accuracy of the linkage maps in comparison to the physical maps. Circos plots further confirmed that the polymorphic SNPs detected in each of the chromosomes distributed unevenly, which support an observation that the SNPs showed uneven distribution for polymorphism-rich and polymorphism-poor regions along each chromosome .
Segregation distortion (SD), commonly observed in mapping populations [33, 35, 66, 67, 69], could be due to genetic drift, preferential fertilization by particular gametic combinations and due to environmental factors [74,75,76]. In this study, 5.6% (175) of the mapped markers, showed segregation distortion (Table 4). This was lower than the previous reports in cotton (11.4–32.8%). Wang et al.  reported that the bigger the genetic differentiation between two parents, the smaller the segregation distortion in the derived population. This may suggest lower SD in our study since the parents used in this study were expected to show maximum allele diversity. Fifty-nine percent of distorted markers (103) were on 6 chromosomes (A3, A10, A12, A13, D5, D10), which was consistent with the previous reports that the majority of distorted markers were concentrated in a few chromosomes [33, 35, 66, 67, 69, 74,75,76].
QTL mapping population
The quality of a QTL map depends on the number of polymorphic markers and the genetic mapping populations used. Tetraploid cotton, in general, show a low level of marker polymorphism [77, 78]. According to the previous cotton QTL mapping research, it was observed that the marker polymorphism rates in the interspecific mapping populations [24, 31, 32, 72, 79, 80] were higher than intraspecific mapping populations [35,36,37,38,39] on the whole. In order to potentially detect a broader array of polymorphic markers and QTL alleles, interspecific mapping populations derived from G. hirsutum and G. barbadense have been extensively used for QTL mapping in cotton [24, 31, 32, 72, 79, 80]. However, the QTL type and their mapping information from an interspecific (G. hirsutum × G. barbadense) population were inconsistent in comparison to the QTLs studied based on an intraspecific G. hirsutum population . Further, QTLs identified using the interspecific RILs could not be transferred precisely into Upland cotton due to the genetic bottlenecks associated with interspecific hybridizations during the breeding process. Hence, QTLs of the interspecific mapping studies were not utilized in Upland cotton improvement. In this study, an intraspecific G. hirsutum RIL population, developed from a cross between an improved germplasm line NC05AZ06 and a landrace accession NC11–2091 was used for QTL mapping. The CottonSNP63K array based genotyping provided good number of the candidate markers. This allowed us to obtain enough polymorphic markers to develop high density genetic maps in Upland cotton which in general suffers from low density of markers and low marker polymorphism [24, 31].
QTLs with favorable alleles identification
The identification of favorable QTLs alleles can help improving the fiber quality and yield in Upland cotton by genomic and marker assisted selection . As expected, the performance of parent NC05AZ06 was superior to those of the landrace parental accession NC11–2091 for MIC, UHM, UI, STR and 4 yield traits. Among the 106 QTLs, the favorable alleles of 80 QTLs originated from NC05AZ06 while other 26 from NC11–2091. Only a few of the QTLs with favorable alleles of a given trait were derived from the parent NC11–2091 (Table 7 and Additional file 1: Table S1). Fifteen QTLs with favorable alleles contributed by NC11–2091. Of these, 8 were major QTLs.
QTL locations and clusters
Based on the reports from previous cotton QTL studies (http://www2.cottonqtldb.org:8081/references) , the QTLs for fiber quality traits and yield traits were distributed on most chromosomes, varied from population to population (See Additional file 3: Table S3). Of the 44 major QTLs for the 11 traits in the current study, eleven QTLs were unique QTLs: 2 for ELO (qELO-CH4-A11–1, qELO-CH19-D1–1), 2 for UHM (qUHM-17-CH23-A2–1, qUHM-16-CH24-D3–1), 1 for SFC (qSFC-17-CH2-A13–1), 2 for BW (qBW-17-CH4-A11–1, qBW-16-CH4-A11–1), 2 for LI (qLI-17-CH24-D3–1, qLI-16-CH12-D10–1), 1 for LP (qLP-CH25-D12–1) and 1 for PH (qPH-17-CH9-D5–1). The presence of the unique QTLs was expected because of the type of parental accessions used and the number of the SNP markers used to detect the maximum allele diversity. Out of these unique QTLs,11 were major QTLs, 3 were stable QTLs (qELO-CH4-A11–1, qELO-CH19-D1–1, qLP-CH25-D12–1). Most of the QTLs were detected on the chromosomes that were shown to carry the QTLs for the corresponding traits (See Additional file 1: Table S1 and Additional file 3: Table S3). Only 5 major QTLs were detected on the chromosomes where there were no previously reported QTLs for the corresponding traits: qBW-16-CH4-A11–1(A11), qLI-17-CH24-D3–1(D03), qLI-16-CH12-D10–1(D10), qSFC-17-CH2-A13–1(A13), qUHM-17-CH23-A2–1(A2) (See Additional file 1: Table S1). Huang et al. 2017  reported a genome-wide association study (GWAS) in Upland cotton using the CottonSNP63K array. Twelve QTLs mapped in the current study showed similar physical position with the QTLs reported by Huang et al. 2017  for the identical traits (Table 11). Of the 4 stable QTLs identified in the current study (qUI-CH3-A3–1, qUI-CH11-A10–1, qLP-CH25-D12–1, qMIC-CH25-D12–1), the QTL for LP and MIC in the QTL cluster on D12 showed similar chromosome location as were reported by Huang et al. 2017 . Identification of this QTL cluster from independent studies involving diverse mapping populations validates and proves the QTL region on D12 for fiber quality traits. These could be potential targets for MAS and map-based cloning of major fiber quality QTLs in Upland cotton.
Many genetic studies on cotton seed fuzzless trait have been carried out previously. In 1949, Ware et al.  first studied this seed fuzzless character and reported it was controlled by a single gene. But later, other reports suggested it was not a binary trait of naked or fuzzy seed, but there existed different degrees of seed fuzziness performance which may be controlled polygenically [10, 11, 13]. Previous study reported that there were two seed fuzzless trait loci on chromosomes A12 and D13 which were controlled by major genes . Our results not only confirmed the genetic factors located on D13, but also identified a new locus on D12 for seed fuzzless trait (Table 7 and Additional file 1: Table S1). It is interestingly to note that the new locus (qFG-CH25-D12–1) was mapped on chromosome D12 which is homoeologous chromosome A12, previously reported  to carry fuzzless trait suggesting the functional conservation of orthologous genomic regions controlling the fuzzless trait in Upland cotton. Majority of the QTLs showing shared position with previous studies suggest the genetic relatedness of the elite cottons of the USA and in general narrow genetic base of cultivated cotton. This further indicates that the marker trait associations identified for quantitatively inherited cotton traits could be broadly applicable across most cotton breeding programs.
The phenotypic trait correlation analysis showed high values of positive or negative correlations between different traits, which can be partially explained by the QTL clusters we identified. For example, Q-1, Q-3, Q-4, Q-5, Q-6 contained multiple QTLs from SFC and UI (Table 8). However, the signs of additive effects of SFC and UI QTLs in each of these clusters were opposite with favorable alleles from same parent. In this case, when we choose the favorable alleles of NC05AZ06 for this QTL, SFC will decrease and UI will increase. If we choose the other alleles for the QTL, the UI will decrease and SFC will increase. This explained why UI and SFC would always show negative correlation values (− 0.93). This strong negative relationship between SFC and UI was also reported in the previous study by Ramey et al. . On the contrary, Y-1, Y-3, Y-4 clusters provided the evidence of why all the yield traits were highly positively correlated since all the favorable alleles of QTLs in these clusters were derived from same parent. These positive correlations among yield traits were also widely observed in many previous studies [31,32,33, 35, 37, 66, 67]. Similarly, Q-2, Q-7 explained a negative correlation (− 0.62) between UHM and ELO. This is consistent with previous observation by Wang et al.  who reported a negative correlation (− 0.59) between fiber length and ELO. Q-7 also explained a positive correlation (0.46) between ELO and SFC as well as a negative correlation (− 0.79) between UHM and SFC. Interestingly, previous reports suggested both positive  as well as negative correlation (− 0.349)  between ELO and SFC. In the current study, some of the clusters contained both fiber quality traits and yield traits, which provided us an efficient way to improve the quality traits and yield traits at the same time. For example, QY-3 were shared by 3 QTLs from LI, SI and MIC, of which the favorable alleles were all contributed by NC05AZ06. In this case, the QTL markers in QY-3 can help improving the MIC, LI and SI concurrently. Similarly, the QTLs in QY-4 had the potential to improve the UHM, MIC, LI and LP simultaneously. Similar type of positive correlation between LI, LP and MIC was reported by Wang et al. .
Candidate gene analysis of the QTLs
The identification of the candidate genes with known functions in cotton fiber development, located in the mapped QTL regions, could add additional validity to the fiber quality QTLs. Out of the 11 stable and major QTLs analyzed, 8 regions with QTLs qELO-CH8-A9–1, qELO-CH4-A11–1, qSTR-CH2-A13–1, qELO-CH19-D1–1, qSTR-CH19-D1–1, qSTR-CH25-D12–1, qMIC-CH25-D12–1, qLP-CH25-D12–1 showed two or more cotton fiber related candidate genes (Additional file 5: Table S5). Moreover, a fiber related gene-rich QTL cluster QYA2 was identified on chromosome D12. The presence of 6 reported fiber related candidate genes localized in this QTL region may partially explain and confirm the QTL cluster containing multiple different QTLs. The importance and validation of this QTL on chromosome D12 could also be confirmed from the previous mapping efforts. Huang et al. 2017  reported a genome-wide association study (GWAS) in Upland cotton using the CottonSNP63K array and performed the BLAST search using the SNPs underlying QTLs against the Genome NAU-NBI v1.1 database . The QTL for LP (qLP-CH25-D12–1) and MIC (qMIC-CH25-D12–1) in the QTL cluster on D12 showed similar chromosome location and candidate genes as were reported by Huang et al. 2017 . Identification of QTL clusters from independent studies involving diverse mapping populations validates and proves the QTL regions. Such QTLs could be potential targets for MAS and map-based cloning of major fiber quality QTLs in Upland cotton.
A high-density linkage map spanning 4422.44 cM length with an average marker density of 1.41 cM was developed using 3129 SNP markers. Genetic maps showed high level of collinearity with their corresponding sequence based physical maps. Forty-six major QTLs were identified with 29 QTLs for fiber quality traits, 12 for yield traits and 5 for morphological traits. More than 70 % of the mapped QTLs shared the similar linkage and physical position with previously reported QTLs. QTLs for fiber quality showed clustering on a handful of chromosomal regions indicating these are possible regions of major selective sweeps, which could help explain the strong correlation between fiber quality traits in cotton. Majority of the QTLs showing shared position with previous studies suggest that the genetic relatedness of the elite cottons of the USA and the general narrow genetic base of cultivated cotton. Candidate gene analyses of the stable QTLs identified candidate genes with functional roles in fiber development. The stable QTLs, major QTLs and the QTL clusters identified in the SNP map in the current study could be the potential targets for MAS in cotton breeding and map-based cloning of QTLs controlling fiber quality traits in cotton.
Development of the RIL population
The G. hirsutum accessions NC05AZ06 and NC11–2091 were used as parents to develop the RIL population. NC05AZ06 is a sub-okra germplasm line with improved fiber quality and yield traits released by our program . The landrace accession NC11–2091(TEX 2313; PI 607640), collected from Thailand, was obtained from the U.S. National Cotton Germplasm Collection (NCGC), USDA-ARS, College Station, Texas. As landraces tend to be heterogeneous, we inbred the land race accession NC11–2091 for three generations using manual selfing and single seed descent method of line advancement. In the summer of 2010, the inbred parental accessions were planted at the Central Crops Research Station at Clayton, North Carolina and crossed to develop F1 seeds. The F1 plants were planted in the winter nursery, Tecoman, Mexico and manually selfed using glassine bags to obtain F2 seed. The F2 plants were grown and individual plants were manually selfed to obtain F3 seed in the summer nursery of 2012. From 2013 to 2015, 107 F2:3 lines were advanced to F5 generation by single seed decent method in the greenhouses. The 107 F5:6 lines were grown in the summer nursery at Clayton, NC in 2015 and seed increased by manual self-pollination. Seed cotton samples were ginned using 10-saw gin. Seed were acid delinted and treated with fungicide and insecticide before using in the current study.
Field experiments and phenotyping
The F5:6 RIL population containing 107 RILs along with parents and four checks (UA-48, UA-222, DP-393, SG-747) were planted using an augmented randomized complete block design with seven blocks in Clayton, NC in summer 2016. Each line was planted (2.5–3 seeds per ft) in the single row 10-ft plots with 38-in row spacing and 10 ft. alleys. Standard morphological practices were followed. Fifty fully opened bolls from each plot were hand-harvested in November of each year of the trials. Four yield traits, including boll weight (BW), lint percentage (LP), seed index (SI), lint index (LT) were evaluated. Approximately 15 g (g) of fiber sample ginned from each boll sample was tested for the fiber quality parameters using high-volume instrument (HVI) system at the Cotton Incorporated, Cary, North Carolina. The fiber quality traits evaluated were fiber elongation (ELO), micronaire (MIC), short fiber content (SFC), fiber strength (STR), upper half mean length (UHM) and uniformity index (UI). MIC is an airflow measurement of fibers and indicates fiber fineness and maturity. UHM is the mean length of the longer half of the fibers in the sample, measured in hundredths of an inch. STR is the force in grams required to break a bundle of fibers one tex unit in size. ELO is the amount in percentage a fiber sample can stretch prior to breakage. UI is a ratio between the mean length and the upper half mean length of the fibers, expressed as a percentage. It indicates the uniformity of fiber length in a sample. SFC is the percentage by weight of fibers 0.5 in. (12.7 mm) long or less. BW is the average weight in grams of seedcotton in a boll. LP is a ratio between the total fiber weight and the total seedcotton weight. SI is the weight of 100 seeds in grams. LI is the weight of lint in grams obtained from 100 seeds. The morphological trait, fuzziness grade of seed (FG) was determined by rating based on four levels of the seed fuzziness (0, 33.3, 66.6 and 100%). Progressive numbers 0 to 100% indicate fuzz-free to fuzz-rich cotton seed.
In the summer of 2017, the RIL population along with parents and the same four checks were planted using a completely randomized block design (RCBD) with two replications in Clayton, NC. Each line was planted in the single row 20-ft plot with a plant density of 2.5 seeds per foot. Forty fully opened bolls from each plot were hand-harvested in December 2017. Same phenotyping methods were used for evaluating the 11 cotton traits as in year 2016. Plant height (PH) trait values was evaluated by taking the average of the manually-measured height of five randomly selected plants from each plot.
Marker genotyping and linkage map construction
Genomic DNA was extracted from 3 to 4 weeks old plant leaf tissue of the RIL population and their parents using DNeasy Plant Mini Kit (Qiagen, Hilden, Germany). One hundred and four of the 107 phenotyped RILs and the parents were genotyped with 63 K cotton SNP array  at Texas A&M Institute for Genome Sciences and Society. Candidate SNPs were filtered from the array with 63,058 SNPs based on the rules as follows: (1) SNPs with monomorphic genotypes were removed, (2) poor-quality SNPs and SNPs with missing values more than 30% were removed and (3) duplicate SNPs were removed .
The resultant candidate SNPs were used to construct the linkage map by JoinMap 4.1  using Kosambi’s mapping function  with logarithm of the odds (LOD) threshold of 7.0. The SNPs were then aligned to the TM-1 (G. hirsutum) Genome NAU-NBI Assembly v1.1 and Annotation v1.1 database  by BLAST (https://www.cottongen.org/blast). Correspondence of the linkage map groups with the physical map groups was performed with the circos plots by Circa software (http://omgenomics.com/circa/).
All the trait phenotypic values of the RILs and parents were estimated using the linear mixed models in SAS version 9.4 (SAS Institute, Cary, NC). The SAS software was also used for calculating the statistics, including the T-test of the difference between the value means of two parents, the broad-sense heritabilities of the traits, the genetic correlations between the traits and other basic statistical parameters.
Segregation of the markers from the Mendelian ratio 1:1 was tested using chi-square analysis (P < 0.05) and a segregation distortion region (SDR) was identified when at least three adjacent markers showing significant (P < 0.05) segregation distortion  using JoinMap 4.1.
All 12 traits related QTLs were detected using composite interval mapping (CIM) method  using WinQTLCart2.5 . The genotype of alleles from parental accession NC05AZ06 (P1) was coded as “AA” and the genotype of alleles from accession NC11–2091(P2) was coded as “aa”. Based on the results of a 1000-time permutation procedure, logarithm of the odds (LOD) ≥ 2.5 with at least 1 year’s phenotypic variation explained (PVE) ≥ 5.0 was used as the threshold for a QTL identified in both years with overlap region and LOD ≥ 3.0 with PVE ≥ 5.0 was the threshold to determine a QTL detected only in 1 year. The resulting linkage map with identified QTLs were drawn using MapChart version 2.32 . Further, the identified QTLs were used to detect the QTL clusters and meta QTL analysis was performed by comparing them with the QTLs reported in previous studies. Information of all the previously reported QTLs was downloaded from the CottonQTLdb database (http://www.cottonqtldb.org; Release 2.3) developed by Said et al. . The marker defined QTL regions with DNA sequence information were BLAST searched on Cotton Functional Genomics Database (https://cottonfgd.org/)  for identifying the possible candidates genes for the each of the major stable QTL.
All the QTLs were labeled based on their population, trait type, and chromosome information. For example, QTLs for micronaire in population NC06AZ06 × NC11–2091 were labeled as qMIC-CH*-A(D)*-* (detected in both year), qMIC-16-CH*- A(D)*-* (detected in year 2016) or qMIC-17-CH*- A(D)*-* (detected in year 2017).
The names of the QTL clusters are given based on the trait categories of QTLs they contained. For example, Q-* meant a cluster contained only fiber quality traits QTLs; QY-* meant a cluster contained both fiber quality and yield traits QTLs; QYA-* meant a cluster contained fiber quality, yield and morphological traits QTLs and so on.
Availability of data and materials
The datasets supporting the findings of this article are included within the article and its additional files. Additional data used or analyzed during this study is also available from the corresponding author on reasonable request.
Amplified fragment length polymorphism
AGRICULTURAL Research Service
Composite interval mapping
Seed fuzziness grade
- H2 :
The broad-sense heritability
Inter-simple sequence repeat
Logarithm of the odds
U.S. National Cotton Germplasm Collection
Phenotypic variation explained
Quantitative trait locus
Random amplified polymorphic DNA
Restriction fragment length polymorphism
Recombinant inbred line
Segregation distortion region
Short fiber content
Single nucleotide polymorphism
Sequence related amplified polymorphism
Simple sequence repeats
Target region amplified polymorphism
Upper half mean length
- Vg :
- Vp :
Fryxell PA. A revised taxonomic interpretation of Gossypium L. (Malvaceae). Rheedea 2; 1992. p. 108–65.
Grover CE, Zhu X, Grupp KK, Jareczek JJ, Gallagher JP, et al. Molecular confirmation of species status for the allopolyploid cotton species, Gossypium ekmanianum Wittmack. Genet Resour Crop Evol. 2014;62:103–14.
Shim J, Mangat PK, Angeles-Shim RB. Natural variation in wild Gossypium species as a tool to broaden the genetic base of cultivated cotton. J Plant Sci Curr. 2018;Res 2:005.
U.S. Department of Agriculture. Cotton: World Markets and Trade. USDA Foreign Agricultural Service. 2018. https://apps.fas.usda.gov/psdonline/circulars/cotton.pdf
National Cotton Council of America. Overview of the U.S. cotton industry. 2011. https://www.cotton.org/pubs/cottoncounts/upload/Cotton-Industry-Overview_Jan-19-2011.pdf
OECD/Food and Agriculture Organization of the United Nations. “Cotton”, in OECD-FAO Agricultural Outlook 2018-2027, vol. 2018. Rome: OECD Publishing, Paris/food and agriculture Organization of the United Nations. https://doi.org/10.1787/agr_outlook-2018-13-en
Shang L, Liu F, Wang Y, Abduwell A, Cai S, et al. Dynamic QTL mapping for plant height in upland cotton (Gossypium hirsutum). Plant Breed. 2015;134(6):703–12.
Lei L, Zheng HL, Wang JG, Liu HL, et al. Genetic dissection of rice (Oryza sativa L.) tiller, plant height, and grain yield based on QTL mapping and metaanalysis. Euphytica. 2018;214:109.
Su J, Li L, Zhang C, Wang C, Gu L, et al. Genome-wide association study identified genetic variations and candidate genes for plant architecture component traits in Chinese upland cotton. Theor Appl Genet. 2018;131:1299–314.
Bechere E, Turley RB, Auld DL, Zeng L. A new fuzzless seed locus in an Upland cotton (Gossypium hirsutum L.) mutant. Am J Plant Sci. 2012;3:799–804.
Turley RB, Kloth RH. Identification of a third fuzzless seed locus in upland cotton (Gossypium hirsutum L.). J Heredity. 2002;93(5):359–64.
Fang DD. Cotton Fiber genes and stable quantitative trait loci. Cotton Fiber: Physics, Chemistry and Biology; 2018. p. 151–78.
Yao Y, Zhang B, Dong CJ, Du Y, Jiang L, Liu JY. Comparative proteomic and biochemical analyses reveal different molecular events occurring in the process of fiber initiation between wild-type Allotetraploid cotton and its fuzzless-lintless mutant. PLoS One. 2014;10(2):e0117049.
Said J, Lin Z, Zhang X, Song M, Zhang J. A comprehensive meta QTL analysis for fiber quality, yield, yield related and morphological traits, drought tolerance, and disease resistance in tetraploid cotton. BMC Genomics. 2013;14(1):776.
Said J, Song M, Wang H, Lin Z, Zhang X, et al. A comparative meta-analysis of QTL between intraspecific Gossypium hirsutum and interspecific G. hirsutum × G. barbadense populations. Mol Gen Genomics. 2015;290:1003–25.
Lacape JM, Llewellyn D, Jacobs J, Arioli T, et al. Meta-analysis of cotton fiber quality QTLs across diverse environments in a Gossypium hirsutum × G. barbadense RIL population. BMC Plant Biol. 2010;10:132.
Smith CW, Coyle GG. Association of fiber quality parameters and within-boll yield components in upland cotton. Crop Sci. 1997;37:1775–9.
Bourland M. History of cotton breeding and genetics at the University of Arkansas. J Cotton Sci. 2018;22:171–82.
Zhang J. History and progress in cotton breeding, genetics, and genomics in New Mexico. J Cotton Sci. 2018;22:191–210.
Mohan M, Nair S, Bhagwat A, Krishna TG, Yano M, et al. Genome mapping, molecular markers and marker-assisted selection in crop plants. Mol Breed. 1997;3:87–103.
Boopathi NM, Thiyagu K, Urbi B, Santhoshkumar M, et al. Marker-assisted breeding as next-generation strategy for genetic improvement of productivity and quality: can it be realized in cotton? Int J of Plant Genomics. 2011;2011:16.
Ribaut JM, Hoisington D. Marker-assisted selection: new tools and strategies. Trends Plant Sci. 1998;3(6):236–8.
Meredith WR Jr. Contributions of introductions to cotton improvement, in “Use of plant introductions in cultivar development, Part 1,”. In: Shands HL, Wiesner LE, editors.Madison: Crop Science Society of America Special Publication; 1991. p. 127–46.
Reinisch AJ, Dong J, Brubaker CL, Stelly DM, Wendel JF, Paterson AH. A detailed RFLP map of cotton, Gossypium hirsutum × Gossypium barbadense: chromosome organization and evolution in a disomic polyploid genome. Genetics. 1994;138(3):829–47.
Multani DS, Lyon BR. Genetic fingerprinting of Australian cotton cultivars with RAPD markers. Genome. 1995;38(5):1005–8.
Lu H, Myers G. Genetic relationships and discrimination of ten influential upland cotton varieties using RAPD markers. Theor Appl Genet. 2002;105(2–3):325–31.
Lin Z, He D, Zhang X, Nie Y, Guo X, et al. Linkage map construction and mapping QTL for cotton fibre quality using SRAP, SSR and RAPD. Plant Breed. 2005;124:180–7.
Noormohammadi Z, Hasheminejad-Ahangarani FY, Sheidai M, Ghasemzadeh-Baraki S, Alishah O. Genetic diversity analysis in opal cotton hybrids based on SSR, ISSR, and RAPD markers. Gen Mol Res. 2013;12(1):256–69.
Abdalla AM, Reddy OUK, El-Zik KM, Pepper AE. Genetic diversity and relationships of diploid and tetraploid cottons revealed using AFLP. Theor Appl Genet. 2001;102(2–3):222–9.
Yu J, Yu S, Lu C, Wang W, Fan S, et al. High-density linkage map of cultivated allotetraploid cotton based on SSR, TRAP, SRAP and AFLP markers. J Integr Plant Biol. 2007;49(5):716–24.
Yu J, Zhang K, Li S, Yu S, Zhai H, et al. Mapping quantitative trait loci for lint yield and fiber quality across environments in a Gossypium hirsutum × Gossypium barbadense backcross inbred line population. Theor Appl Genet. 2013;126(1):275–87.
Yu JZ, Ulloa M, Hoffman SM, Kohel RJ, Pepper AE, et al. Mapping genomic loci for cotton plant architecture, yield components, and fiber properties in an interspecific (Gossypium hirsutum L. × G. barbadense L.) RIL population. Mol Gen Genomics. 2014;289(6):1347–67.
Wang H, Huang C, Guo H, Li X, Zhao W, Dai B, et al. QTL mapping for fiber and yield traits in upland cotton under multiple environments. PLoS One. 2015;10(6):e0130742.
Su J, Pang C, Wei H, Li L, Liang B, et al. Identification of favorable SNP alleles and candidate genes for traits related to early maturity via GWAS in upland cotton. BMC Genomics. 2016;17:687.
Kumar NVM, Katageri IS, Gowda SA, Adiger S, Yadava SK, et al. 63K SNP chip based linkage mapping and QTL analysis for fiber quality and yield component traits in Gossypium barbadense L. cotton. Euphytica. 2019;215:6.
Fan L, Wang L, Wang X, Zhang H, et al. A high-density genetic map of extra-long staple cotton (Gossypium barbadense) constructed using genotyping-by-sequencing based single nucleotide polymorphic markers and identification of fiber traits-related QTL in a recombinant inbred line population. BMC Genomics. 2018;19:489.
Diouf L, Magwanga RO, Gong W, He S, et al. QTL mapping of fiber quality and yield-related traits in an intra-specific upland cotton using genotype by sequencing (GBS). Int J Mol Sci. 2018;19:441.
Ulloa M, Hulse-Kemp AM, Santiago LMD, Stelly DM, Burke JJ. Insights into Upland cotton (Gossypium hirsutum L.) genetic recombination based on 3 high-density single-nucleotide polymorphism and a consensus map developed independently with common parents. Genomics Insights. 2017;10:1–15.
Qi H, Wang N, Qiao W, Xu Q, et al. Construction of a high-density genetic map using genotyping by sequencing (GBS) for quantitative trait loci (QTL) analysis of three plant morphological traits in upland cotton (Gossypium hirsutum L.). Euphytica. 2017;213(83):1–17.
Hulse-Kemp AM, Lemm J, Plieske J, Ashrafi H, Buyyarapu R, et al. Development of a 63K SNP array for cotton and high-density mapping of intraspecific and interspecific populations of Gossypium spp. G3 (Bethesda). 2015;5(6):1187–209.
Kumar S, Banks TW, Cloutier S. SNP discovery through next-generation sequencing and its applications. Int J Plant Genomics. 2012;2012:831460.
Mammadov JA, Aggarwal R, Buyyarapu R, Kumpatla S. SNP markers and their impact on plant breeding. Int J Plant Genomics. 2012;2012(3):728398.
Zhang T, Hu Y, Jiang W, Fang L, Guan X, Chen J, et al. Sequencing of allotetraploid cotton (Gossypium hirsutum L. acc. TM-1) provides a resource for fiber improvement. Nat Biotechnol. 2015;33(5):531–7.
Thibodeaux D, Senter H, Knowlton JL, McAlister D, Cui X. The impact of short fiber content on the quality of cotton ring spun yarn. J Cotton Sci. 2008;12:368–77.
Said JI, Knapka JA, Song M, Zhang J. Cotton QTLdb: a cotton QTL database for QTL analysis, visualization, and comparison between Gossypium hirsutum and G. hirsutum × G. barbadense populations. Mol Gen Genomics. 2015;290(4):1615–25.
Yang J, Ma ZY, Wang XF. Progress in studies on genes related to Fiber quality improvement of cotton. Sci Agric Sin. 2016;49(22):4310–22.
Loguercio LL, Zhang JQ, Wilkins TA. Differential regulation of six novel MYB-domain genes defines two distinct expression patterns in allotetraploid cotton (Gossypium hirsutum L.). Mol Gen Genet. 1999;261(4/5):660–71.
Li W, Li DD, Han LH, Tao M, et al. Genome-wide identification and characterization of TCP transcription factor genes in upland cotton (Gossypium hirsutum); 2017. https://doi.org/10.1038/s41598-017-10609-2.
Yang ZR, Zhang CJ, Yang XJ, Liu K, et al. PAG1, a cotton brassinosteroid catabolism gene, modulates fiber elongation. New Phytol. 2014;203(2):437–48.
Liu B, Zhu Y, Zhang T. The R3-MYB gene GhCPC negatively regulates cotton fiber elongation. PLoS One. 2015;10(2):e0116272.
Xu WL, Huang GQ, Wang XL, Wang H, Li XB. Molecular characterization and expression analysis offive novel genes encoding proline-rich proteinsin cotton (Gossypium hirsutum). Prog Biochem Biophys. 2007;34(5):509–17.
Zhou Y, Zhang ZT, Li M, Wei XZ, et al. Cotton (Gossypium hirsutum) 14-3-3 proteins participate in regulation of fibre initiation and elongation by modulating brassinosteroid signalling. Plant Biotechnol J. 2015;13(2):269–80.
Shi YH, Zhu SW, Mao XZ, Feng JX, et al. Transcriptome profiling, molecular biological, and physiological studies reveal a major role for ethylene in cotton fiber cell elongation. Plant Cell. 2006;18(3):651–64.
Huang GQ, Xu WL, Gong SY, Li B, et al. Characterization of 19 novel cotton FLA genes and their expression profiling in fiber development and in response to phytohormones and salt stress. Physiol Plant. 2008;134(2):348–59.
Pan YX, Wang XF, Liu HW, Zhang GY, Ma ZY. Molecular cloning of three UDP-glucuronate decarboxylase genes that are preferentially expressed in Gossypium fibers from elongation to secondary cell wall synthesis. J Plant Biol. 2010;53(5):367–73.
Wang HY, Wang J, Gao P, Jiao GL, et al. Down-regulation of GhADF1 gene expression affects cotton fibre properties. Plant Biotechnol J. 2009;7(1):13–23.
Li HB, Qin YM, Pang Y, et al. A cotton ascorbate peroxidase is involved in hydrogen peroxide homeostasis during fibre cell development. New Phytol. 2007;175(3):462–71.
Li DD, Ruan XM, Zhang J, Wu YJ, et al. Cotton plasma membrane intrinsic protein 2s (PIP2s) selectively interact to regulate their water channel activities and are required for fibre development. New Phytol. 2013;199(3):695–707.
Priyam et al. Sequenceserver: a modern graphical user interface for custom BLAST databases & relevant data sources. 2015. (https://cottonfgd.org/).
Iqbal MJ, Reddy OUK, El-Zik KM, Pepper AE. A genetic bottleneck in the evolution under domestication of upland cotton Gossypium hirsutum L. examined using DNA fingerprinting. Theor Appl Genet. 2001;103:547–54.
Wang B, Guo W, Zhu X, Wu Y, Huang N, Zhang T. QTL mapping of fiber quality in an elite hybrid derived-RIL population of upland cotton. Euphytica. 2006;152:367–78.
Yang X, Zhou X, Wang X, Li Z, Zhang Y, et al. Mapping QTL for cotton fiber quality traits using simple sequence repeat markers, conserved intron-scanning primers, and transcript-derived fragments. Euphytica. 2015;201:215–30.
Tang S, Teng Z, Zhai T, Fang X, Liu F, et al. Construction of genetic map and QTL analysis of fiber quality traits for upland cotton (Gossypium hirsutum L.). Euphytica. 2015;201:195–213.
Huang C, Shen C, Wen T, Gao B, Zhu D, et al. SSR-based association mapping of fiber quality in upland cotton using an eight-way MAGIC population. Mol Gen Genomics. 2018;293(4):793–805.
Cai C, Zhu G, Zhang T, Guo W. High-density 80K SNP array is a powerful tool for genotyping G. hirsutum, accessions and genome analysis. BMC Genomics. 2017;18:654.
Liu R, Gong J, Xiao X, Zhang Z, Li J, et al. GWAS analysis and QTL identification of fiber quality traits and yield components in upland cotton using enriched high-density SNP markers. Front Plant Sci. 2018;9:1067.
Li C, Dong Y, Zhao T, Li L, Li C, et al. Genome-wide SNP linkage mapping and QTL analysis for fiber quality and yield traits in the upland cotton recombinant inbred lines population. Front Plant Sci. 2016;7:1356.
Ma L, Zhao Y, Wang Y, Shang L, Hua J. QTLs analysis and validation for fiber quality traits using maternal backcross population in upland cotton. Front Plant Sci. 2017;8:2168.
Tan Z, Zhang Z, Sun X, Li Q, Sun Y, et al. Genetic map construction and fiber quality QTL mapping using the CottonSNP80K array in upland cotton. Front Plant Sci. 2018;9:225.
Li C, Zhao T, Yu H, Li C, Deng X, et al. Genetic basis of heterosis for yield and yield components explored by QTL mapping across four genetic populations in upland cotton. BMC Genomics. 2018;19:910.
Rong JK, Abbey C, Bowers JE, Brubaker CL, Chang C, et al. A 3347-locus genetic recombination map of sequence-tagged sites reveals features of genome organization, transmission and evolution of cotton (Gossypium). Genetics. 2004;166:389–417.
Lacape JM, Nguyen TB, Thibivilliers S, Bojinov B, et al. A combined RFLP–SSR–AFLP map of tetraploid cotton based on a Gossypium hirsutum × Gossypium barbadense backcross population. Genome. 2003;46(4):612–26.
Nasu S, Suzuki J, Ohta R, Hasegawa K, et al. Search for and analysis of single nucleotide polymorphisms (SNPs) in rice (Oryza sativa, Oryza rufipogon) and establishment of SNP markers. DNA Res. 2002;9:163–71.
Zhang Z, Hu M, Zhang J, Liu D, Zheng J, et al. Construction of a comprehensive PCR-based marker linkage map and QTL mapping for fiber quality traits in upland cotton (Gossypium hirsutum L.). Mol Breeding. 2009;24(1):49–61.
Shen X, Guo W, Lu Q, Zhu X, Yuan Y, Zhang T. Genetic mapping of quantitative trait loci for fiber quality and yield trait by RIL approach in upland cotton. Euphytica. 2007;155:371–80.
Kumar S, Gill BW, Faris JD. Identification and characterization of segregation distortion loci along chromosome 5B in tetraploid wheat. Mol Gen Genomics. 2007;278(2):187–96.
Rungis D, Llewellyn D, Dennis ES, Lyon BR. Simple sequence repeat (SSR) markers reveal low levels of polymorphism between cotton (Gossypium hirsutum L.) cultivars. Aust J Agric Res. 2005;56(3):301–7.
Li C, Dong Y, Zhao T, Ling L, et al. Genome-wide SNP linkage mapping and QTL analysis for fiber quality and yield traits in the Upland cotton recombinant inbred lines population. Front Plant Sci. 2016. https://doi.org/10.3389/fpls.2016.01356.
Song X, Wang K, Guo W, Zhang J, Zhang T. A comparison of genetic maps constructed from haploid and BC1 mapping populations from the same crossing between Gossypium hirsutum L. and Gossypium barbadense L. Genome. 2005;48(3):378–90.
Paterson AH, Saranga Y, Menz M, Jiang C, Wright RJ. QTL analysis of genotype × environment interactions affecting cotton fiber quality. Theor Appl Genet. 2003;106:384–96.
Mei H, Zhu X, Zhang T. Favorable QTL alleles for yield and its components identified by association mapping in Chinese upland cotton cultivars. PLoS One. 2013. https://doi.org/10.1371/journal.pone.0082193.
Huang C, Nie XH, Shen C, You CY, et al. Population structure and genetic basis of the agronomic traits of upland cotton in China revealed by a genome-wide association study using high-density SNPs. Plant Biotechnol J. 2017;15:1374–86.
Ware JO, Benedict LN, Rolfe WH. A recessive naked-seed character in upland cotton. J Hered. 1947;38(10):313–20.
Ramey HH, Beaton PG. Relationships between short fiber content and fiber length uniformity. Textile Res J. 1989;59(2):101–8.
Badigannavar A, Gerald M. Breeding and Genetics. Construction of genetic linkage map and QTL analysis for fiber traits in diploid cotton (Gossypium arboreum × Gossypium herbaceum). J Cotton Sci. 2015;19:15–26.
Andres RJ, Bowman DT, Lawrence KS, Myers G, Chee PW, et al. Effect of leaf shape on boll rot incidence in upland cotton (Gossypium hirsutum). Int J Plant Breeding Genet. 2013;7:132–8.
Ooijen JW. JoinMap 4: software for the calculation of genetic linkage maps in experimental populations. Wageningen: Kyazma BV; 2006.
Kosambi DD. The estimation of map distances from recombination values. Ann Hum Genet. 1943;12:172–5.
Paillard S, Schnurbusch T, Winzeler M, Messmer M, Sourdille P, Abderhalden O, et al. An integrative genetic linkage map of winter wheat (Triticum aestivum L.). Theor Appl Genet. 2003;107:1235–42.
Zeng ZB. Precision mapping of quantitative trait loci. Genetics. 1994;136(4):1457–68.
Wang S, Basten CJ, and Zeng ZB. Windows QTL Cartographer 2.5. Department of Statistics, North Carolina State University, Raleigh, NC. 2012. http://statgen.ncsu.edu/qtlcart/WQTLCart.htm
Voorrips RE. MapChart: software for the graphical presentation of linkage maps and QTLs. J Heredity. 2002;93(1):77–8.
We thank the Cotton Incorporated for its help with fiber quality trait measurements. We also appreciate Dr. Drew Hillhouse and Kelli Kochan of the Texas A&M Institute for Genome Sciences and Society for technical assistance on genotyping with 63 K SNPs array. We thank Cathy Herring and Travis Lassiter of the Central Crops Research Station for their excellent help with field cotton management.
This research was funded by Cotton Incorporated (CI) and NC Cotton Producers’ Association.
Ethics approval and consent to participate
Consent for publication
The authors have read and agreed for publication.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Zhang, K., Kuraparthy, V., Fang, H. et al. High-density linkage map construction and QTL analyses for fiber quality, yield and morphological traits using CottonSNP63K array in upland cotton (Gossypium hirsutum L.). BMC Genomics 20, 889 (2019) doi:10.1186/s12864-019-6214-z
- Upland cotton
- Single nucleotide polymorphism (SNP)
- Recombinant inbred lines (RILs)
- Linkage map
- Quantitative trait locus (QTL)
- QTL clusters
- Fiber quality and yield