Genetic basis of heterosis for yield and yield components explored by QTL mapping across four genetic populations in upland cotton

Li, Cong; Zhao, Tianlun; Yu, Hurong; Li, Cheng; Deng, Xiaolei; Dong, Yating; Zhang, Fan; Zhang, Yi; Mei, Lei; Chen, Jinhong; Zhu, Shuijin

doi:10.1186/s12864-018-5289-2

Research article
Open access
Published: 12 December 2018

Genetic basis of heterosis for yield and yield components explored by QTL mapping across four genetic populations in upland cotton

Cong Li¹,
Tianlun Zhao¹,
Hurong Yu¹,
Cheng Li¹,
Xiaolei Deng¹,
Yating Dong¹,
Fan Zhang¹,
Yi Zhang¹,
Lei Mei¹,
Jinhong Chen¹ &
…
Shuijin Zhu ORCID: orcid.org/0000-0001-6209-9630¹

BMC Genomics volume 19, Article number: 910 (2018) Cite this article

3558 Accesses
12 Citations
Metrics details

Abstract

Background

Quantitative trait loci (QTL) mapping provides a powerful tool to unravel the genetic bases of cotton yield and its components, as well as their heterosis. In the present study, the genetic basis underlying inbreeding depression and heterosis for yield and yield components of upland cotton was investigated in recombinant inbred line (RIL), immortalized F₂ (IF₂), and two backcross (BCF₁) populations based on a high-density SNP linkage map across four environments.

Results

Significant inbreeding depression of fruit branches per plant (FB), boll numbers per plant (BN), seed cotton yield (SY), and lint yield (LY) in RIL population and high levels of heterosis for SY, LY, and boll weight (BW) in IF₂ and two BCF₁ populations were observed. A total of 285 QTLs were identified in the four related populations using a composite interval mapping approach. In the IF₂ population, 26.60% partially dominant (PD) QTLs and 71.28% over-dominant (OD) QTLs were identified. In two BCF₁ populations, 42.41% additive QTLs, 4.19% PD QTLs, and 53.40% OD QTLs were detected. For multi-environment analysis, phenotypic variances (PV) explained by e-QTLs were higher than those by m-QTLs in each of the populations, and the average PV of m-QTLs and e-QTLs explained by QTL × environment interactions occupied a considerable proportion of total PV in all seven datasets.

Conclusions

At the single-locus level, the genetic bases of heterosis varied in different populations. Partial dominance and over-dominance were the main cause of heterosis in the IF₂ population, while additive effects and over-dominance were the main genetic bases of heterosis in two BCF₁ populations. In addition, the various genetic components to heterosis presented trait specificity. In the multi-environment model analysis, epistasis was a common feature of most loci associated with inbreeding depression and heterosis. Furthermore, the environment was a critical factor in the expression of these m-QTLs and e-QTLs. Altogether, additive effects, over-dominance, epistasis and environmental interactions all contributed to the heterosis of yield and its components in upland cotton, with over-dominance and epistasis more important than the others.

Background

Inbreeding depression, the reduced fitness of progenies arising from increased homozygosity [1, 2], and heterosis, wherein cross-fertilization hybrids between diverse varieties or different species exhibit superiority relative to parental performance [3], are fundamentally concerned with inbreeding and outbreeding. Inbreeding depression and heterosis are considered two aspects of the same phenomenon, and both have fundamental importance to applied genetics and breeding. In all cases, the reason for the inbreeding depression is that the inbreeding increases the probability of the homozygosity of deleterious recessive alleles in their progenies [4,5,6]. The vigor lost caused by consequence of inbreeding can be recovered by crossing [6]. Moreover, inbreeding depression may have a large impact on the formation of reproductive disorders between species, while heterosis may play a key role in maintaining genetic variation of populations [7].

In agriculture, the application of heterosis has contributed greatly to the production of many crops. However, the genetic basis of heterosis remains obscure. Three major hypotheses, including dominance [8, 9], over-dominance [10,11,12], and certain types of epistasis [13,14,15] have been proposed to explain heterosis. Quantitative trait locus (QTL) mapping studies in major crops have been performed to explain the genetic basis of heterosis. An appropriate experimental design for the genetic dissection of heterosis is essential. Comstock and Robinson [16] devised the North Carolina design III (Design III) mating scheme, which was the first use of backcross designs to analyze the genetic basis of heterosis. Based on two maize backcross F₃ families (BCF₃, a modified Design III scheme), Stuber et al. [12] reported that over-dominance was the major genetic basis of heterosis for grain yield. A study of Xiao et al. [9] investigated the genetic bases of heterosis in two rice BC₁F₇ populations and concluded that dominance complementation was the major cause. Li et al. [15] and Luo at al. [17] reported that epistasis and over-dominance were the main causes of inbreeding depression and heterosis of grain yield, grain yield components, and biomass in five related rice mapping populations. By re-analyzing the data of maize [12] and rice [9], Garcia et al. [18] reported that dominance was the main contributor of heterosis in maize, while additive × additive epistatic interactions could be the major genetic basis of heterosis in rice. Schön et al. [19] compared QTL mapping results of three previous Design III studies [12, 20, 21] by advanced statistical methods [22]. Their results indicated that the positive interactions of alleles from the opposite heterotic pool would lead to high heterosis for grain yield of maize. Shang et al. [23] investigated the yield heterosis of upland cotton with two BCF₁ populations, which implied partial dominance and over-dominance were the main genetic bases. To dissect the heterotic effects directly, Hua et al. [24, 25] introduced an “immortalized F₂” (IF₂) population derived from pair crosses of RILs of rice and focused on detecting heterotic loci (HL) to explain the genetic basis of heterosis instead of using traditional QTLs. Based on this design, they discovered that single-locus heterotic effects and dominance × dominance (DD) interactions could explain the genetic basis of heterosis in hybrid rice. Zhou et al. [26] detected several HLs for yield and its components in a rice IF₂ population and found that the relative contributions of the genetic components varied with traits. Based on a maize IF₂ population, Tang et al. [27] demonstrated that dominance effects of HL as well as additive × additive interactions were the major genetic bases of heterosis for grain yield and its components. Using the same material, Guo et al. [28] re-analyzed yield heterosis using a reconstructed high-density linkage map. They found that dominance was more important for heterosis than other genetic effects. Moreover, over-dominance and epistasis also contributed to heterosis.

IF₂ and BCF₁ populations are ideal materials for comprehensively dissecting the composition of heterosis. Firstly, the genotypes of IF₂ and BCF₁ populations can be clearly deduced from the original parents and RILs; secondly, these two populations permit replicated trials; lastly, most loci of IF₂ and BCF₁ populations are heterozygous. This provides an opportunity for mapping HL and studying heterosis, rather than analyzing solely based on measuring performance of the trait. It is well known that stably expressed QTLs across multiple environments are deeply favored in marker-assisted selection (MAS). Thus, identifying QTLs and exploring their expression levels and the genetic basis of heterosis under multiple environments in related populations would allow us to map stable QTLs and accelerate the process of breeding.

Upland cotton is the most important natural textile fiber source globally. Currently, it is grown on a total area of 30.9 million ha of land in more than 80 countries [29]. It is urgent to improve the yield of upland cotton cultivars to meet worldwide demand, and maintain profitability for cotton growers. Yield is a complex trait in cotton that is controlled by a large number of QTLs. Several studies have discovered that significant heterosis for yield and yield components exists in upland cotton [30,31,32]. In addition, some QTL mapping studies have been reported that dissect the composition of heterosis for yield and yield components of upland cotton [23, 32, 33], but no studies have been reported on different segregating populations from the same parental combination. In the present study, RIL, IF₂, and two BCF₁ populations were used simultaneously to perform QTL genetic analysis for yield and yield components based on a high-density SNP intraspecific genetic map under multiple environmental conditions. The results will provide meaningful hints at the underlying genetic bases of inbreeding depression and heterosis for yield and yield components, which can be used in cotton breeding.

Methods

Plant materials

Four related genetic populations were used, including a set of 188 RILs (F₈), an IF₂ population, and two BCF₁ populations (Fig. 1). The RILs were derived by single-seed procedure from a cross between two elite upland germplasms, HS46 (P₁) and MARCABUCAG8US-1-88 (P₂). According to a diallel mating design [25], the IF₂ population was produced from crosses between the RILs chosen by random permutations of the 188 RILs (Fig. 1a). This procedure was repeated two times, with each time making 188 hybrids, forming a population consisting of 376 IF₂ hybrids. Both BCF₁ populations were derived from a modified Design III [16, 21], in which two parents were used as the male parents backcrossed with the 188 RILs (Fig. 1b, c). The two BCF₁ populations each contained 188 hybrids named HSBCF₁ and MARBCF₁, referring to HS46 (HS) and MARCABUCAG8US-1-88 (MAR) backcrossed with 188 RIL lines, respectively.

Field planting and phenotypic evaluation

Two separate experiments were conducted at two locations, Yacheng (inland climate) and Baogang (coastal climate) of Sanya, Hainan Province, China, in the cotton growing seasons of 2014 and 2015. All plants of the four populations and the two parental lines were planted in a randomized block design with two replications in each location and with 5.6 m² plot areas. Finally, 29 plants were grown in each row at a spacing of 25 cm between plants. Standard cultivation, weed and insect control practices were performed as the management of the local cotton production.

Ten consecutive plants in the middle of each row were tagged for trait measurement [32, 34]. Data were collected for the following traits: seed cotton yield (SY), lint yield (LY), fruit branches per plant (FB), boll numbers per plant (BN), boll weight (BW), and lint percentage (LP). During the harvest season, twenty fully open bolls in each row were harvested for measurement of BW and LP. SY was determined as the seed cotton weight harvested from each plot and LY was determined by multiplying lint percentage with SY.

Genotype analysis and linkage maps

Young leaves were collected from RILs and two parents. Individual genomic DNA was extracted following a modified CTAB method [35].

The RIL population and two parents were genotyped with the cotton 63 K SNP array [36]. A total of 63,058 SNPs were screened for polymorphism between parents, in which a total of 2618 SNP markers were selected to genotype the RILs [37]. In the IF₂ and two BCF₁ populations, the genotypes for each F₁ were deduced from the RILs and the parental lines that were used as the parents for each cross.

Data analysis

Each year-location was treated as an independent environment. A descriptive statistics model was used to test the basic statistics of phenotypic data for RILs, IF₂s, HSBCF₁s, and MARBCF₁s. One-way analysis of variance (ANOVA) was performed to analyze the difference for yield and yield components between the two parents using SPSS 20.0. Broad-sense heritability was estimated as H² = V_G / (V_G + V_GE / e + V_ε / re), where H² is broad-sense heritability, V_G = genetic variance, V_GE = genotype × environment interaction variance, V_ε = error variance, and e and r are the numbers of environments and replicates, respectively. The V_G, V_GE, and V_ε variances were calculated using the minimum norm quadratic unbiased estimation (MINQUE) approach [38] by in QGA Station 2.0 (http://ibi.zju.edu.cn/software/qga/index.htm).

The hybrid breakdown value (HB), a component of inbreeding depression [39, 40], was calculated for individual RILs as follows: HB = RIL− MP, where MP = (HS46 + MARCABUCAG8US-1-88) / 2. The equation for calculating values of the mid-parental heterosis (MPH) of individual IF₂, HSBCF₁, and MARBCF₁ hybrids for yield traits was as follows: MPH = F₁ – MP [25], where F₁ was the mean value of each hybrid in the IF₂, HSBCF₁, and MARBCF₁ populations, and MP was the average value of the corresponding parents. The MPH datasets were used as the raw data for exploring the genetic basis of yield heterosis.

WinQTL Cartographer 2.5 [41] was used to identify single-locus QTLs with the composite interval mapping (CIM) method. The LOD threshold for declaring a significant QTL was calculated by 1000 permutation tests with a mapping step of 1.0 cM and a significance level of P < 0.05. The MPH datasets only detected the dominance effect under the genetic model of CIM, where the QTL exhibited a significant difference between the heterozygote and the mean of the two parental homozygotes. QTLs were named as: q + trait abbreviation + chromosome number + QTL number [37]. A graphical representation of the linkage map with QTLs marked was created using Map Chart 2.2 [42].

The gene actions in different datasets were estimated as follows: a = (P₁P₁ − P₂P₂) / 2, d = (P₁P₂ − (P₁P₁ + P₂P₂) / 2), BCF₁ = (a + d), where P₁ and P₂ represent the parents, P₁P₁ and P₂P₂ indicate the effects of homozygous genotypes observed in RILs, IF₂s, and BCF₁s; and P₁P₂ represents the effects of heterozygous genotypes in hybrid. The gene action mode for each QTL was calculated by the absolute value of the ratio of dominant and additive effects (|d/a|) [26, 35, 36, 43]. There were some differences between the assessment in IF₂ and BCF₁ populations. For the IF₂ population, QTLs with |d/a| ≤ 1 were considered as completely or partially dominant (D or PD) loci. If |d/a| > 1 or if it was only detectable for the MPH dataset, the QTL was referred to as an over-dominance (OD) locus. The |d/a| value was estimated in two ways; both a and d were estimated from IF₂s for a QTL which only detected in IF₂s; a was from RILs and d was from the MPH dataset for a QTL detected simultaneously in RILs and the IF₂MPH dataset. Moreover, the value of |d/a| in IF₂s was used as the criterion. For BCF₁ populations, a QTL was considered to be an OD locus in the following three cases: (1) MPH (d) times two was higher than BCF₁ performance (a + d) i.e., 2 |d| (MPH) > |a + d| (BCF₁) (equal to |d/a| > 1) for a QTL detected in BCF₁s and the MPH dataset; (2) a was estimated from RILs and d from the MPH dataset with |d/a| > 1 for a QTL detected simultaneously in the RILs and MPH dataset; (3) only detected in MPH dataset. Otherwise, the QTL was considered to be a D or PD locus. QTLs detected only in BCF₁ were referred to as additive (A) loci. When a QTL was present in all three datasets, the judgment depended on the ratio of the effects in the BCF₁s and MPH dataset.

A combined multi-environment model analysis that tests the main-effect QTLs (m-QTLs), epistatic QTLs (e-QTLs), and their environmental interactions (QTL × environment, QE) was implemented using the RILs, IF₂s, two BCF₁s, and three MPH datasets with the inclusive composite interval mapping (ICIM) method in IciMapping 4.1 [44]. The pre-set parameters Scan = 1 cM / PIN = 0.0001 and Scan = 5 cM / PIN = 0.0001 were used to conduct the additive and epistasis QTL mapping analyses, respectively. The threshold LOD for declaring m-QTLs and e-QTLs was calculated using a permutation test at a significance level of P < 0.05, n = 1000. The identified m-QTLs were named using the dataset abbreviation followed by “maq” (multi-environment additive QTL), and then suffixed with the abbreviation of trait and chromosome number, followed by the QTL number. The e-QTLs detected were named using the dataset abbreviation followed by “meq” (multi-environment epistatic QTL), and then with the abbreviation of the trait and the QTL pair number. Datasets were abbreviated was as follows: “R”, “I”, “B₁”, and “B₂” represent RILs, IF₂s, HSBCF₁s, and MARBCF₁s, respectively, and “M” was added after the last three heterozygous population abbreviation to represent their corresponding MPH datasets, i.e., “IM”, “B₁M”, and “B₂M”.

Results

Inbreeding depression and heterosis for yield and yield components

The phenotypic variation for yield and its components among the parents, RIL, IF₂, and two BCF₁ populations, as well as the estimated HB of RILs and the MPH of the IF₂s and two BCF₁s are shown in Table 1 and Table 2, respectively. The female parent, HS46, had significantly greater trait values for FB, BN, BW, SY, and LY than those of MARCABUCAG8US-1-88 in all environments (Additional file 1: Table S1). A wide range of variation was observed in yield and its components in the RILs, IF₂s, HSBCF₁s, and MARBCF₁s (Table 1). In all environments, obvious reductions of the RILs were observed as a result of hybrid breakdown in the traits of FB, BN, SY, and LY (Table 2, Additional file 2: Table S2). The mean deviation of the RILs from the midparental values for LP was found in three environments (but not in 2015Bg), while it was only detected in one environment for BW. High levels of heterosis for SY, LY, and BW were observed in the IF₂ and two BCF₁ populations. However, other yield components showed lower levels of heterosis in these three populations.

Table 1 Phenotypic variation of yield and yield components

Full size table

Table 2 Summary statistics on HB^a percentage of RILs and MPH^a percentage of IF₂s, HSBCF₁s, and MARBCF₁s

Full size table

Different levels of heterosis were found among the different populations across four environments (Additional file 2: Table S2). For SY, IF₂ and MARBCF₁ populations showed the same levels of heterosis, at 24.75 and 28.61%, respectively, which were higher than that of the HSBCF₁ population (15.42%). The mean levels of heterosis of for the LY trait showed the same trend as SY in different populations. For BW, different populations have similar levels of heterosis, although it was slightly higher in the MARBCF₁ population. For FB, the two BCF₁ populations showed the same levels of heterosis, and IF₂s exhibited lower mean heterosis. For BN, the mean levels of heterosis in the MARBCF₁ population were higher than that of the HSBCF₁ population, while the IF₂ population exhibited negative heterosis (− 2.70%). For LP, the order of the mean values in the MPH datasets was IF₂MPH > HSBCF₁MPH > MARBCF₁MPH.

There were some differences between environments (Table 2). In all three populations, the MPH (%) of SY was lower in 2015 than that in 2014. The same trend was found for LY and BW, caused by boll rot during experiments due to high rainfall in 2015 in Sanya. However, the heterosis level of FB showed the opposite trend, where higher levels of heterosis were detected in 2015. Moreover, all of the environments showed low levels of heterosis for LP in all populations except for the 2015Yc environment.

Within each population, heterosis values of individual hybrids varied considerably. Most of the trait values of extreme lines showed high MPH in all environments (Additional file 3: Table S3). For example, in 2014Yc and 2014Bg, the mean heterosis values of SY were more than 83% for the 20 highest-heterosis hybrids of the IF₂, HSBCF₁, and MARBCF₁ populations and were more than 46% in the 2015Yc and 2015Bg experiments.

The broad-sense heritability was analyzed using measurement data from four environments (Table 3). All measures of yield and its components showed moderate heritability, ranging from 56.00 to 86.31%, 46.79 to 64.05%, 39.73 to 65.81%, and 41.20 to 67.21% in the RIL, IF₂, HSBCF₁, and MARBCF₁ populations, respectively, which presented significant genetic and environmental effects. LP exhibited nearly the highest heritability and FB the lowest in all populations. Interestingly, the heritability of all traits was highly consistent between the two BCF₁ populations, which might be related to their closer genetic basis.

Table 3 Analysis of variance for yield and yield components across four populations

Full size table

The phenotypic correlations among the traits varied greatly in the RIL, IF₂, and two BCF₁ populations (Additional file 4: Table S4). This can be illustrated with LY as an example. Consistent with previous reports [45, 46], there were significant positive correlations between LY and SY in all populations, possibly because LY is derived from SY multiplied by lint percentage. Similarly, LY was positively and significantly correlated with BN and BW in all populations, indicating that variation in BN and BW contributed strongly to the variation in LY. The association between LY and LP was significant in three populations except for the RILs, and a significant positive correlation was recovered in two BCF₁ populations but was significantly negative in the IF₂ population, indicating that variation in LP contributed differently to LY variation in different populations. However, LY was only significantly positively correlated with FB in the RIL population.

QTL analysis of yield and yield components in RIL, IF₂, and two BCF₁ populations

A genetic linkage map was previously constructed based on the polymorphic loci identified [37] (Additional file 5: Figure S1). A total of 285 QTLs for yield and its components were detected using CIM in the RILs, IF₂s, two BCF₁s, and three MPH datasets (Additional file 5: Figure S1, Table 4, Additional file 6: Table S5). Among them, 107 QTLs were identified in more than two environments or datasets.

Table 4 Gene actions of QTL identified for yield and yield components by CIM^a across four environments

Full size table

Fruit branches per plant (FB)

A total of 40 QTLs were detected in seven datasets, explaining 3.15–31.66% of the total PV, and ten of them were the stable QTLs that were identified in at least two environments or datasets. Three, six, five, eight, 15, ten, and five QTLs were detected in the RILs, IF₂s, HSBCF₁s, MARBCF₁s, IF₂MPHs, HSBCF₁MPHs, and MARBCF₁MPHs, respectively. In the IF₂ population, three QTLs with PD or D effects and 16 with OD effects were observed. Two QTLs with PD effects were simultaneously detected in both IF₂s and IF₂MPHs. In the HSBCF₁ population, two QTLs with A effects and ten with OD effects were found. Four QTLs with apparent OD effects were detected in both HSBCF₁s and HSBCF₁MPHs. In the MARBCF₁ population, six QTLs with A effects, two with PD or D effects, and three with OD effects were observed. Two QTLs with OD effects were detected in MARBCF₁ and its MPH dataset.

Boll numbers per plant (BN)

Forty-two QTLs associated with BN were detected in seven datasets. Among those, 17 were detected in more than two environments or datasets. There were six, 11, ten, ten, six, seven, and seven QTLs in the RILs, IF₂s, HSBCF₁s, MARBCF₁s, IF₂MPHs, HSBCF₁MPHs, and MARBCF₁MPHs, respectively. In the IF₂ population, eight QTLs with PD effects and seven QTLs with OD effects were observed. qBN-C01–2, with PD effect, was detected in both IF₂s and IF₂MPHs. In the HSBCF₁ population, six QTLs with A effects, one QTL with PD or D effect, and six QTLs with OD effects were detected. Among them, four QTLs with OD effects were detected in HSBCF₁ and its MPH dataset. In the MARBCF₁ population, seven QTLs with A effects and seven with OD effects were observed. Three QTLs with OD effects were detected in MARBCF₁ and its MPH dataset. qBN-C06–1 with apparent A effect, and qBN-C17–3, with OD effect, were identified in the two environments of MARBCF₁s and MARBCF₁MPHs, respectively. Both of them showed favorable alleles conferred by different parent in their two environments.

Boll weight (BW)

A total of 30 QTLs were identified, explaining 11.41% of the mean total PV. Among them, 11 QTLs were identified in more than two environments or datasets. Three, 11, 14, 12, five, nine, and four QTLs were identified in the RILs, IF₂s, HSBCF₁s, MARBCF₁s, IF₂MPHs, HSBCF₁MPHs, and MARBCF₁MPHs, respectively. In the IF₂ population, three QTLs exhibited PD or D effects, while 12 QTLs with |d/a| > 1 showed apparent OD effects. qBW-C10–1, with PD effect, was detected in both IF₂s and IF₂MPHs. In the HSBCF₁ population, ten QTLs with A effects and nine with OD effects were detected. Four QTLs with apparent OD effects were detected in both HSBCF₁s and HSBCF₁MPHs. In the MARBCF₁ population, 11 QTLs with A effects and four with OD effects were observed. qBW-C13–4, with OD effect, was detected in both MARBCF₁s and MARBCF₁MPHs.

Lint percent (LP)

Among 50 identified QTLs related to LP, 27 QTLs were detected in more than two environments or populations. Thirteen, 21, nine, 15, nine, seven, and nine QTLs were detected in the RILs, IF₂s, HSBCF₁s, MARBCF₁s, IF₂MPHs, HSBCF₁MPHs, and MARBCF₁MPHs, respectively. In the IF₂ population, five QTLs with PD effects and 17 with OD effects were observed. Six QTLs were simultaneously detected in both IF₂s and IF₂MPHs. The gene action of two QTLs, qLP-C09–3 and qLP-C25–2, was uncertain because of inconsistent dominance degree in different environments. In the HSBCF₁ population, seven QTLs with A effects and seven with OD effects were found. Two QTLs with apparent OD effects were detected in both HSBCF₁s and HSBCF₁MPHs. In the MARBCF₁ population, nine QTLs with A effects, one with PD or D effect, and eight with OD effects were observed. Six QTLs were detected in both MARBCF₁ and its MPH dataset.

Seed cotton yield (SY)

Fifty QTLs were identified on 22 chromosomes in the seven datasets, explaining 13.39% (ranging from 3.40 to 34.83%) of the mean total PV. Twenty-three QTLs were identified in more than two environments or datasets. There were eight QTLs detected in IF₂s and its MPH dataset, among which two QTLs exhibited PD effects and six QTLs showed OD effects. qSY-C18–2, with PD effect, was identified in IF₂s in 2015Bg and in IF₂MPHs in 2015Yc and 2015Bg. In HSBCF₁s and its MPH dataset, five QTLs with A effects and 19 with OD effects were detected. Up to ten QTLs with apparent OD effects were detected in both HSBCF₁s and HSBCF₁MPHs. Among them, qSY-C16–1 showed favorable alleles conferred by different parents in the two environments of HSBCF₁MPHs. In the MARBCF₁ population, five QTLs with A effects, one with a PD or D effect, and 11 with OD effects were observed. Five QTLs were identified simultaneously in MARBCF₁ and its MPH dataset.

Lint yield (LY)

Forty-seven QTLs, explaining 3.06–34.06% of the total PV, were detected using the seven datasets. In the IF₂ hybrids, eight QTLs were detected. Four QTLs with PD effects and nine with OD effects were observed in a combined analysis of IF₂ and its MPH dataset. qLY-C18–1, with PD effect, was detected in both IF₂s and IF₂MPHs. In the HSBCF₁ population, four QTLs with A effects, one with PD effect, and 12 with OD effects were found. Among them, seven QTLs were detected simultaneously in HSBCF₁s and HSBCF₁MPHs. In the MARBCF₁ population, nine QTLs with A effects, two with PD effects, and six with OD effects were observed. Three QTLs were detected simultaneously in MARBCF₁s and MARBCF₁MPHs. qLY-C19–2, with PD effect, was identified in MARBCF₁MPHs in both 2014Yc and 2015Bg, as well as in one environment of the RILs, which showed favorable alleles conferred by different parents in these two environments of MARBCF₁MPHs.

Multi-environment analysis of main-effect QTL and environmental interactions

The m-QTLs and QEs detected for yield and yield components in the RILs, IF₂s, HSBCF₁s, MARBCF₁s, IF₂MPHs, HSBCF₁MPHs, and MARBCF₁MPHs are shown in Fig. 2, Additional file 7: Table S6, and Additional file 8: Table S7.

A total of 48 m-QTLs and QEs were identified in the RIL population. On average, m-QTLs explained 2.37% of the PV, and the QEs explained 0.90% of the PV. Three major m-QTLs related to LP, RmaqLP-C07–1, RmaqLP-C08–1, and RmaqLP-C09–1, were found to account for more than 10% of the total explained PV (PV (A) and PV (AE)). For the IF₂ population, 60 and 50 m-QTLs were identified in IF₂ and IF₂MPH datasets, respectively. On average, m-QTLs detected in the IF₂ and IF₂MPH datasets explained 0.92 and 1.36% of the PV (A), respectively, and the QEs explained 0.97 and 1.26% of the PV (AE), respectively. One locus, IMmaqLP-C10–1, was considered as a major m-QTL with 10.69% of the total PV explained. In the HSBCF₁ population, a total of 24 and 21 m-QTLs were detected in HSBCF₁ and HSBCF₁MPH datasets, respectively. In HSBCF₁s, the number of m-QTLs varied from two to five for different traits, with an average of 2.40% of the PV (A) and 2.53% of the PV (AE). Furthermore, in HSBCF₁MPHs, the number of m-QTLs varied from zero to seven for different traits, with an average of 2.40% of PV (A) and 1.92% of PV (AE). No m-QTL was detected for BW. Two m-QTLs, B₁MmaqSY-C14–1 and B₁MmaqLY-C14–1, were found to have major effects, and were located in the same marker interval of i28957Gh-i36740Gh. In the MARBCF₁ population, there were 28 m-QTLs in MARBCF₁s and 15 in MARBCF₁MPHs detected, on average, explaining 4.25 and 6.12% of the total PV in F₁ performance and MPH, respectively. In MARBCF₁s, B₂maqBN-C03–1 and B₂maqLY-C21–1 explained more than 10% of the total PV. In MARBCF₁MPHs, B₂MmaqBN-C21–1 was identified to be a major m-QTL, with 13.35% of the total PV explained.

Epistatic QTLs in RIL, IF₂, and two BCF₁ populations

A total of 72, 124, 126, 73, 147, 73, and 67 e-QTLs pairs were identified in the RILs, IF₂s, HSBCF₁s, MARBCF₁s, IF₂MPHs, HSBCF₁MPHs, and MARBCF₁MPHs, respectively (Fig. 2, Table 5, Additional file 9: Table S8, Additional file 10: Table S9). These e-QTLs explained most of the variation for yield traits. For example, the e-QTLs of SY in IF₂s and IF₂MPHs explained more than 80% of the total PV. The e-QTLs in HSBCF₁s and HSBCF₁MPHs explained 91.89 and 51.68% of the total PV for SY, respectively, and those in MARBCF₁s and MARBCF₁MPHs explained 47.75 and 71.12% of the total PV, respectively. In addition, the environmental interactions of these e-QTLs also accounted for considerable PV. On average, the QEs of e-QTLs for each trait explained 12.00, 19.52, 27.27, 12.40, 24.96, 15.41, and 14.45% of the PV in RILs, IF₂s, HSBCF₁s, MARBCF₁s, IF₂MPHs, HSBCF₁MPHs, and MARBCF₁MPHs, respectively.

Table 5 Type of epistatic interactions and the total phenotypic variation explaining by e-QTLs

Full size table

The e-QTLs were classified into three types: (I) two loci with m-QTL; (II) one loci with m-QTL and a locus without significant m-QTL; and (III) two loci without significant m-QTL [15]. For these e-QTLs in the RIL population, three pairs of LP QTLs were type II, and the remaining interactions were type III. No type I interactions were observed (Table 5). For these e-QTLs in the IF₂ population, five pairs in IF₂s and seven pairs in IF₂MPHs were type II and the remaining interactions were type III. Of these e-QTLs in the HSBCF₁ population, five pairs were detected between an interval with significant additive effect and other loci. The remaining interactions occurred between two complementary loci. Of these e-QTLs in the MARBCF₁ population, nine pairs in MARBCF₁s and two pairs in MPHs were type II, and all remaining interactions were type III.

Congruence analysis of the single-locus QTLs and main-effect QTLs

Comparing the identified additive QTLs, the confidence intervals of 63 single-locus QTLs identified by the CIM method overlapped with 77 m-QTLs identified by the ICIM method, of which some single-locus QTLs harbored two or more m-QTLs identified in different datasets (Additional file 5: Figure S1, Additional file 6: Table S5, Additional file 7: Table S6, Additional file 8: Table S7).

For FB, four stable single-locus QTLs, qFB-C05–2, qFB-C06–2, qFB-C06–3, and qFB-C16–1, had the same or overlapping confidence intervals of four m-QTLs, ImaqFB-C05–1, IMmaqFB-C06–1, B₁maqFB-C06–1 and B₁MmaqFB-C16–1, respectively. The six m-QTLs RmaqFB-C02–1, RmaqFB-C07–1, B₁maqFB-C13–1, IMmaqFB-C22–1, RmaqFB-C24–1, and IMmaqFB-C26–1 also had overlapping confidence intervals with the QTLs qFB-C02–1, qFB-C07–1, qFB-C13–1, qFB-C22–1, qFB-C24–1, and qFB-C26–1, respectively, which could only be detected in one environment.

For BN, the confidence interval of the stable single-locus QTL, qBN-C01–2, detected in the 2015Bg environment in IF₂s and IF₂MPHs, harbored two m-QTLs, ImaqBN-C01–1 and IMmaqBN-C01–1. The other two stable single-locus QTLs, qBN-C02–2 and qBN-C18–1, had the same or overlapping confidence intervals with two m-QTLs, B₁maqBN-C02–1 and IMmaqBN-C18–1, respectively. The remaining five m-QTLs had overlapping confidence intervals with four single-locus QTLs that could only be detected in one environment. Among them, qBN-C14–3 harbored two m-QTLs.

For BW, two stable single-locus QTLs, qBW-C08–3 and qBW-C10–1, had the same or overlapping confidence intervals with two m-QTLs, B₁maqBW-C08–1 and ImaqBW-C10–1, respectively. The confidence interval of the stable single-locus QTL qBW-C03–1 harbored two m-QTLs, IMmaqBW-C03–1 and B₂MmaqBW-C03–2. Similarly, the single-locus QTL qBW-C01–2, which was detected in one environment in IF₂s, also harbored two m-QTLs, ImaqBW-C01–2 and IMmaqBW-C01–1. The other six m-QTLs had one-to-one corresponding confidence intervals with six single-locus QTLs that could only be detected in one environment.

For LP, five stable single-locus QTLs, qLP-C10–1, qLP-C13–2, qLP-C13–3, qLP-C18–2, and qLP-C20–1, had the same or overlapping confidence intervals with five m-QTLs, B₂maqLP-C10–1, RmaqLP-C13–2, B₁maqLP-C13–1, IMmaqLP-C18–2, and ImaqLP-C20–1, respectively. The confidence interval of the stable single-locus QTL qLP-C04–2, detected in 2014Yc and 2014Bg in the RILs and 2015Bg in IF₂s, harbored two m-QTLs, ImaqLP-C04–1 and RmaqLP-C04–1. The other four m-QTLs had one-to-one corresponding confidence intervals with four single-locus QTLs, although they could only be detected in one environment.

For SY, seven stable single-locus QTLs, qSY-C02–2, qSY-C05–1, qSY-C13–4, qSY-C13–5, qSY-C18–3, qSY-C23–1, and qSY-C24–2, had the same or overlapping confidence intervals with seven m-QTLs, ImaqSY-C02–1, ImaqSY-C05–1, B₁MmaqSY-C13–2, IMmaqSY-C13–2, ImaqSY-C18–1, ImaqSY-C23–1, and ImaqSY-C24–1, respectively. The confidence interval of the stable single-locus QTL qSY-C05–2, which was simultaneously detected in HSBCF₁s and HSBCF₁MPHs, harbored two m-QTLs, B₁maqSY-C05–1 and B₁MmaqSY-C03–2. In addition, eight m-QTLs had overlapping confidence intervals with six single-locus QTLs that could only be detected in one environment. Among them, two single-locus QTLs harbored two m-QTLs, respectively.

For LY, the confidence intervals of three stable single-locus QTLs, qLY-C02–3, qLY-C24–1, and qLY-C26–2, overlapped with three m-QTLs, IMmaqLY-C02–1, ImaqLY-C24–3, and RmaqLY-C26–1, respectively. Stable QTLs qLY-C21–1 and qLY-C02–2 overlapped with three m-QTLs (B₂MmaqLY-C21–1, ImaqLY-C21–1, and IMmaqLY-C21–1) and two m-QTLs (ImaqLY-C02–1 and B₁maqLY-C02–1), respectively. In addition, eight m-QTLs overlapped with seven single-locus QTLs that could only be detected in one environment. Among them, one single-locus QTL harbored two m-QTLs.

Discussion

Application of RIL, IF₂ and BCF₁ populations

RIL and doubled haploid (DH) populations are permanent populations that can be repeated in different environments to detect valuable QTLs in multi-environments [47, 48]. BCF₁ or IF₂ populations based on RIL or DH populations have been constructed previously to conduct QTL mapping with respect to heterosis [15, 25, 27, 43, 49]. However, no one has used different segregating populations from the same parental combination to study heterosis. Our experimental schemes using related RIL, IF₂ and two BCF₁ populations were specifically designed to allow simultaneous and comprehensive mapping of loci contributing yield and yield components heterosis in upland cotton. Based on this, more heterozygous loci were uncovered, and more QTLs were detected than from a single population. Some QTLs that could not be identified in RIL population could be detected in IF₂/BCF₁ populations, and the QTLs detected in RILs could be confirmed using the IF₂/BCF₁ populations. Furthermore, through the combination of these four populations, both additive and non-additive gene actions of the detected loci were more accurately identified. For instance, the QTL main effects obtained using the F₁ mean values of the IF₂/BCF₁ populations contained both additive and dominance effects while those obtained from the MPH values were estimates of the dominant effect [50]. Similarly, for the epistatic loci, the estimated epistatic effects using the mean F₁ values contained both additive and nonadditive epistatic interactions of the epistatic QTL, while those from MPH values represent the DD interactions [50].

Detection of heterotic loci

Detection of HL using the MPH measurements enabled by the IF₂ and two BCF₁ populations represents another feature of the study. This analysis method effectively separated the single-locus effects causing heterosis from the QTL concerning the trait performance as detected in most previous QTL studies. Making use of the MPH data, we detected 47, 65 and 45 HLs for yield and its components in the IF₂, HSBCF₁, and MARBCF₁ populations, respectively. Moreover, 16 stable HLs were detected in two environments, of which five showed inconsistent parental sources of favorable alleles in different environments, which indicated a high sensitivity of the HLs to the environment. To some extent, this should be taken into account in upland cotton breeding. The remaining 11 stable HLs, including three for BN, five for SY, and two for LY could be important for the application of MAS in upland cotton breeding in the future. Here, the BN trait harbored more stable HLs than other yield components, which may be attributed to its higher average heterosis of the 20 top high-heterosis hybrids (Additional file 3: Table S3). Hua et al. [25] found that only ten of 33 HLs identified in their analysis were detected by QTL analysis using trait performance and indicated that trait performance and heterosis were controlled by different sets of loci. However, HLs detected in our study were not independent, and a subset overlapped with QTLs controlling trait phenotypes. Among all HLs, 12, 30, and 20 HLs identified in IF₂MPH, HSBCF₁MPH, and MARBCF₁MPH datasets were also detected by QTL analysis using the data of the IF₂s, HSBCF₁s, and MARBCF₁s, respectively. This result suggests that the MPH and performance per se of the hybrid might share identical genetic modes of action in upland cotton. In fact, it is impossible to demonstrate the genetic mechanism underlying yield traits without involving heterosis.

Genetic bases of inbreeding depression and heterosis of yield and yield components

In the present study, the levels of hybrid breakdown were ordered as LY > SY > BN > LP > FB > > BW, while heterosis was LY > SY > BW > FB > LP > > BN. All traits showed moderate and low heritability except for LP. This tendency toward more complex yield traits showing much greater levels of inbreeding depression and heterosis has been universally observed in many crops [17, 23]. The traits with serious inbreeding depression did not necessarily possess high heterosis in hybrids, which implied that there are differences in the mechanisms controlling these two biological phenomena. At the same time, wide variations were also observed in mid-parental heterosis of the IF₂ and two BCF₁ populations. The performances of some hybrids were better than those of the MP values of the original parents, while some other hybrids showed the opposite. Similar results were also obtained by Luo et al. [20, 43]. Together, it can be speculated that high heterosis is derived from heterozygosity at certain loci but not from genome-wide heterozygosity [17, 24, 43, 49, 51, 52].

The reduction of the RIL population from the midparental value was highly significant. This might be attributed to the homozygosity of deleterious alleles and/or less fit multilocus genotypes during the development of RILs [15, 39, 40, 53]. Theoretically, the inbreeding depression values of individual RILs and the MPH values of IF₂/BCF₁ hybrids for yield and its components have three components. The first is additive gene action, which leads to the deviation of the RILs from the midparental value and hybrid breakdown [1, 39, 40]. The genes of this group are directly detected in the RILs but are confounded in the IF₂s and BCF₁s. The second is dominant gene action, which leads to the deviation of the F₁ hybrids from their corresponding midparental value. Those genes are segregating and contribute to heterosis in the IF₂s/BCF₁s but are not directly detected in the RILs. The third is nonadditive gene action, which causes disharmonious interactions in inbreeding depression of RILs and beneficial interactions in heterosis of IF₂s and BCF₁s. In fact, there are some overlapping genes between inbreeding depression and heterosis. The overlapping genes of this type are particularly important since they contribute negatively to the mean value of the inbred RILs when homozygous (resulting in hybrid breakdown) and positively to heterosis when heterozygous.

In the IF₂ population of our present study, 25 (26.60%) PD QTLs and 67 (71.28%) OD QTLs were identified. In the HSBCF₁ population, 34 (34.34%) A QTLs, two (2.02%) PD QTLs, and 63 (63.64%) OD QTLs were detected. In the MARBCF₁ population, 47 (51.09%) A QTLs, six (6.52%) PD QTLs, and 39 (42.39%) OD QTLs were detected (Table 4). These results revealed that the genetic basis of heterosis was varied in different populations. At the single-locus level, partial dominance and over-dominance were the main causes of heterosis in the IF₂ population, and additive and over-dominance were the main genetic bases of heterosis in the two BCF₁ populations. Similar results have been discovered in previous studies [28, 33, 43]. In addition, similar to Zhou et al. [26], our results showed that the relative contributions of the various genetic components to heterosis were trait specificity. Over-dominance and additive effects were the main contributors to heterosis for SY, LY, BW, and LP. Over-dominance, partial dominance, and additive effects all had roles in heterosis of BN. Over-dominance was the most important contributor to heterosis of FB. Overall, over-dominance played an important role in the formation of heterosis in these traits.

Epistasis is a common feature of most loci associated with inbreeding depression and heterosis. First, the e-QTLs explained a much greater portion of the total PV than the m-QTLs for the yield in each of the mapping populations, but this was not true for the yield components (Fig. 2, Table 5). This was consistent with the results of Li et al. [39], which indicated that complex traits tended to be determined by a greater degree of epistasis. In a similar experimental design, Xiao et al. [9] detected a single main-effect QTL which had an R² of 6–7% for grain yield in each of the two rice BCF₁ populations. However, the majority of the phenotypic variation was unexplained. Apparently, their failure to detect epistasis was largely attributed to the unavailability of an appropriate analytical method. With a similar experimental design, Shang et al. [23] reported m-QTLs and e-QTLs that had different proportions for yield and yield components in an upland cotton BCF₁ population, but the relative importance of m-QTLs and e-QTLs was not evaluated. Second, most epistasis occurred between complementary loci with no detectable main effects (Table 5). Fewer cases of epistasis occur between m-QTLs and complementary loci. The predominance of epistasis between complementary loci indicate that yield and its component traits related e-QTLs occurred more in multilocus genotypes than in specific alleles at individual loci, which has been demonstrated by a large number of empirical studies [17, 23, 43]. In addition, the environment was a critical factor in the expression of these m-QTLs and e-QTLs. The average PV of m-QTLs and e-QTLs explained by QEs occupied a large proportion of the total PV in all seven datasets.

Implications for MAS in yield improvement of upland cotton

Numerous classical genetic studies have clearly found that the phenotypic relationships between yield and its components in crops are complex, and the genetic bases of heterosis in segregating populations remains poorly understood. The results of our study have several implications. For breaking the yield ceiling of hybrid upland cotton cultivars, simultaneous selection for all yield components, with an emphasis on increased BN and BW, should be much more efficient than selecting for only lint yield. This is because both BN and BW were significantly positively correlation with LY, and the heterosis of BW in three hybrid populations was obvious and showed the same trend with SY and LY.

Although several QTLs for cotton yield traits have been detected previously using an intraspecific map, few shared markers were used in the present research. Furthermore, it is difficult to compare our results with previous yield QTLs due to the use of different maps, population types, population structure, and environments, etc. [33]. In our study, 12 stable QTLs were identified across at least four datasets or environments. Additionally, there were six stable QTLs, qBN-C05–1, qBN-C05–2, qBN-C22–3, qSY-C16–2, qSY-C19–1, and qSY-C24–2, that are also stable HLs. These stable QTLs across multiple populations and environments should greatly promote further interest in the fine mapping of yield traits or implementation of MAS. When we compared the single-locus QTLs from CIM with m-QTLs from ICIM, 29 stable single-locus QTLs overlapped with m-QTLs. qSY-C24–2 could be an important QTL identified in this study, as it was not only identified as a stable QTL and HL across multiple environments but was also simultaneously confirmed by both CIM and ICIM. The large dominance effects of this QTL justify its potential use in genetic improvement of the yield of both inbred and hybrid cultivars through marker-assistant transfer in upland cotton breeding programs.

Conclusion

These results showed that obvious inbreeding depression was found in the RIL population and high levels of heterosis were detected in the IF₂ and BCF₁ populations for yield and its components in upland cotton. Heterosis of yield and its components definitely included the relative contributions of additive effects, partial dominance, over-dominance, and epistatic effects of multiple QTLs, which differed among populations and traits. Through integrating the results from single-locus and multi-environment QTL analysis, over-dominance and epistasis were found to be more important than the others. Furthermore, the heterosis genes can be further exploited because of the detection of significant HLs, which will greatly accelerate the hybrid breeding process of upland cotton.

References

Stebbins GL. The inviability, weakness, and sterility of interspecific hybrids. Adv Genet. 1958;9(9):147–215.
Article CAS PubMed Google Scholar
Wright S. Evolution and the genetics of populations, vol. 4. Chicago: University of Chicago Press; 1977.
Google Scholar
Shull GH. Duplicate genes for capsule-form in Bursa bursa-pastoris. Z I A V. 1914;12(1):97–149.
Google Scholar
Allard RW. Inbreeding depression and heterosis. In: Principles of plant breeding. New York: Wiley; 1960. p. 213–23.
Google Scholar
Simmonds NW. Principles of crop improvement. London and New York: Longman Group; 1979.
Google Scholar
Filho JBM. Inbreeding depression and heterosis. In: Coors JG, Pandey S, editors. The genetics and exploitation of heterosis in crops. Madison: ASA-CSSA-SSSA Societies; 1999. p. 69–80.
Google Scholar
Crow JF. Basic concepts in population, quantitative, and evolutionary genetics. New York: W. H. Freeman and Company; 1986. p. 273.
Google Scholar
Bruce AB. The mendelian theory of heredity and the augmentation of vigor. Science. 1910;32(827):627–8.
Article CAS PubMed Google Scholar
Xiao JH, Li JM, Yuan LP, Tanksley SD. Dominance is the major genetic-basis of heterosis in rice as revealed by Qtl analysis using molecular markers. Genetics. 1995;140(2):745–54.
CAS PubMed PubMed Central Google Scholar
Shull GH. The composition of a field of maize. Ann Breed Assn. 1908;4:296–301.
Google Scholar
Heterosis EEM. Genetics. 1936;21(4):375–97.
Google Scholar
Stuber CW, Lincoln SE, Wolff DW, Helentjaris T, Lander ES. Identification of genetic factors contributing to heterosis in a hybrid from two elite maize inbred lines using molecular markers. Genetics. 1992;132:832–8.
Google Scholar
Stuber CW, Williams WP, Moll RH. Epistasis in maize (Zea mays L.): III. Significance in predictions of hybrid performances. Crop Sci. 1973;13:195–200.
Article Google Scholar
Yu SB, Li JX, Tan YF, Gao YJ, Li XH, Zhang QF, Maroof MAS. Importance of epistasis as the genetic basis of heterosis in an elite rice hybrid. P Natl Acad Sci USA. 1997;94(17):9226–31.
Article CAS Google Scholar
Li ZK, Luo LJ, Mei HW, Wang DL, Shu QY, Tabien R, et al. Overdominant epistatic loci are the primary genetic basis of inbreeding depression and heterosis in rice. I Biomass and grain yield Genetics. 2001;158(4):1737–53.
CAS PubMed Google Scholar
Comstock RE, Robinson HF. Estimation of average dominance of genes. In: Gowen JW, editor. Heterosis. Ames: Iowa State College Press; 1952. p. 495–516.
Google Scholar
Luo LJ, Li ZK, Mei HW, Shu QY, Tabien R, Zhong DB, et al. Overdominant epistatic loci are the primary genetic basis of inbreeding depression and heterosis in rice. II Grain yield components. Genetics. 2001;158(4):1755–71.
CAS PubMed PubMed Central Google Scholar
Garcia AA, Wang S, Melchinger AE, Zeng ZB. Quantitative trait loci mapping and the genetic basis of heterosis in maize and rice. Genetics. 2008;180(3):1707–24.
Article PubMed PubMed Central Google Scholar
Schön CC, Dhillon BS, Utz HF, Melchinger AE. High congruency of QTL positions for heterosis of grain yield in three crosses of maize. Theor Appl Genet. 2010;120(2):321–32.
Article PubMed Google Scholar
Lu H, Romero-Severson J, Bernardo R. Genetic basis of heterosis explored by simple sequence repeat markers in a random-mated maize population. Theor Appl Genet. 2003;107(3):494–502.
Article CAS PubMed Google Scholar
Frascaroli E, Cane MA, Landi P, Pea G, Gianfranceschi L, Villa M, et al. Classical genetic and quantitative trait loci analyses of heterosis in a maize hybrid between two elite inbred lines. Genetics. 2007;176(1):625–44.
Article CAS PubMed PubMed Central Google Scholar
Melchinger AE, Utz HF, Piepho HP, Zeng ZB, Schön CC. The role of epistasis in the manifestation of heterosis: a systems-oriented approach. Genetics. 2007;177(3):1815–25.
Article CAS PubMed PubMed Central Google Scholar
Shang L, Liang Q, Wang Y, Zhao Y, Wang K, Hua J. Epistasis together with partial dominance, over-dominance and QTL by environment interactions contribute to yield heterosis in upland cotton. Theor Appl Genet. 2016;129(7):1429–46.
Article PubMed Google Scholar
Hua J, Xing YZ, Xu CG, Sun XL, Yu SB, Zhang QF. Genetic dissection of an elite rice hybrid revealed that heterozygotes are not always advantageous for performance. Genetics. 2002;162:1885–95.
CAS PubMed PubMed Central Google Scholar
Hua J, Xing Y, Wu W, Xu C, Sun X, Yu S, et al. Single-locus heterotic effects and dominance by dominance interactions can adequately explain the genetic basis of heterosis in an elite rice hybrid. Proc Natl Acad Sci U S A. 2003;100(5):2574–9.
Article CAS PubMed PubMed Central Google Scholar
Zhou G, Chen Y, Yao W, Zhang C, Xie W, Hua J, et al. Genetic composition of yield heterosis in an elite rice hybrid. Proc Natl Acad Sci U S A. 2012;109(39):15847–52.
Article CAS PubMed PubMed Central Google Scholar
Tang J, Yan J, Ma X, Teng W, Wu W, Dai J, et al. Dissection of the genetic basis of heterosis in an elite maize hybrid by QTL mapping in an immortalized F₂ population. Theor Appl Genet. 2010;120(2):333–40.
Article PubMed Google Scholar
Guo T, Yang N, Tong H, Pan Q, Yang X, Tang J, et al. Genetic basis of grain yield heterosis in an “immortalized F₂” maize population. Theor Appl Genet. 2014;127(10):2149–58.
Article PubMed Google Scholar
Fang L, Wang Q, Hu Y, Jia Y, Chen J, Liu B, et al. Genomic analyses in cotton identify signatures of selection and loci associated with fiber quality and yield traits. Nat Genet. 2017;49(7):1089–98.
Article CAS PubMed Google Scholar
Meredith WR, Bridge R. Heterosis and gene action in cotton, Gossypium hirsutum L. Crop Sci. 1972;12:304–10.
Article Google Scholar
Galanopoulou-Sendouca S, Roupakias D. Performance of cotton F₁ hybrids and its relation to the mean yield of advanced bulk populations. Eur J Agron. 1999;11:53–62.
Article Google Scholar
Liu R, Wang B, Guo W, Qin Y, Wang L, Zhang Y, et al. Quantitative trait loci mapping for yield and its components by using two immortalized populations of a heterotic hybrid in Gossypium hirsutum L. Mol Breeding. 2011;29(2):297–311.
Article Google Scholar
Guo X, Guo Y, Ma J, Wang F, Sun M, Gui L, et al. Mapping heterotic loci for yield and agronomic traits using chromosome segment introgression lines in cotton. J Integr Plant Biol. 2013;55(8):759–74.
Article CAS PubMed Google Scholar
Yu J, Zhang K, Li S, Yu S, Zhai H, Wu M, et al. Mapping quantitative trait loci for lint yield and fiber quality across environments in a Gossypium hirsutum × Gossypium barbadense backcross inbred line population. Theor Appl Genet. 2013;126(1):275–87.
Article PubMed Google Scholar
Paterson AH, Brubaker CL, Wendel JF. A rapid method for extraction of cotton (Gossypium spp.) genomic DNA suitable for RFLP or PCR analysis. Plant Mol Biol Rep. 1993;11:122–7.
Article CAS Google Scholar
Hulse-Kemp AM, Lemm J, Plieske J, Ashrafi H, Buyyarapu R, Fang DD, et al. Development of a 63K SNP array for cotton and high-density mapping of intraspecific and interspecific populations of Gossypium spp. G3 (Bethesda). 2015;5(6):1187–209.
Article Google Scholar
Li C, Dong Y, Zhao T, Li L, Li C, Yu E, et al. Genome-wide SNP linkage mapping and QTL analysis for fiber quality and yield traits in the upland cotton recombinant inbred lines population. Front Plant Sci. 2016;7:1356.
PubMed PubMed Central Google Scholar
Zhu J. Estimation of genetic variance components in the general mixed model. Ph.D. dissertation. Raleigh: North Carolina State University; 1989.
Google Scholar
Li ZK, SRM P, Park WD, Paterson AH, Stansel JW. Epistasis for three grain yield components in rice (Oryza sativa L). Genetics. 1997a;145(2):453–65.
CAS PubMed PubMed Central Google Scholar
Li ZK, SRM P, Paterson AH, Park WD, Stansel JW. Genetic of hybrid sterility and hybrid breakdown in an inter-subspecific rice (Oryza sativa L.) population. Genetics. 1997b;145:1139–48.
CAS PubMed PubMed Central Google Scholar
Wang S, Basten CJ, Zeng ZB, Windows QTL. Cartographer 2.5. Department of Statistics. Raleigh: North Carolina State University; 2012.
Google Scholar
Voorrips RE. MapChart. Software for the graphical presentation of linkage maps and QTLs. J Hered. 2002;93(1):77–8.
Article CAS PubMed Google Scholar
Luo X, Fu Y, Zhang P, Wu S, Tian F, Liu J, et al. Additive and over-dominant effects resulting from epistatic loci are the primary genetic basis of heterosis in rice. J Integr Plant Biol. 2009;51(4):393–408.
Article PubMed Google Scholar
Li H, Ye G, Wang J. A modified algorithm for the improvement of composite interval mapping. Genetics. 2006;175(1):361–74.
Article PubMed Google Scholar
Mei H, Zhu X, Zhang T. Favorable QTL alleles for yield and its components identified by association mapping in chinese upland cotton cultivars. PLoS One. 2013;8(12):e82193.
Article PubMed PubMed Central Google Scholar
Shen X, Guo W, Lu Q, Zhu X, Yuan Y, Zhang T. Genetic mapping of quantitative trait loci for fiber quality and yield trait by RIL approach in upland cotton. Euphytica. 2007;155(3):371–80.
Article CAS Google Scholar
Syed NH, Chen ZJ. Molecular marker genotypes, heterozygosity and genetic interactions explain heterosis in Arabidopsis thaliana. Heredity. 2005;94(3):295–304.
Article CAS PubMed PubMed Central Google Scholar
Ma LY, Bao J, Guo LB, Zeng DL, Li XM, Ji ZJ, et al. Quantitative trait loci for panicle layer uniformity identified in doubled haploid lines of rice in two environments. J Integr Plant Biol. 2009;51(9):818–24.
Article CAS PubMed Google Scholar
Jiang GH, Zeng J, He YQ. Analysis of quantitative trait loci affecting chlorophyll content of rice leaves in a double haploid population and two backcross populations. Gene. 2014;536(2):287–95.
Article CAS PubMed Google Scholar
Mather K, Jinks JL. Biometrical Genetics, 3. In: The study of continuous variation. London and New York: Chap man & Hall; 1982. p. 211 (4): 95–101.
Mei HW, Li ZK, Shu QY, Guo LB, Wang YP, Yu XQ, et al. Gene actions of QTLs affecting several agronomic traits resolved in a recombinant inbred rice population and two backcross populations. Theor Appl Genet. 2005;110(4):649–59.
Article CAS PubMed Google Scholar
Liang Q, Shang L, Wang Y, Hua J. Partial dominance, overdominance and epistasis as the genetic basis of heterosis in upland cotton (Gossypium hirsutum L.). PLoS One. 2015;10(11):e0143548.
Article PubMed PubMed Central Google Scholar
Li ZK, Pinson SRM, Stansel JW, Park WD. Identification of quantitative trait loci (QTL) for heading date and plant height in cultivated rice (Oryza sativa L.). Theor Appl Genet. 1995;91:374–81.
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We are grateful to Dr. Xu (Zhejiang University, China) for technical assistance and Dr. Muhammad Daud Khan (Kohat University of Science and Technology, Pakistan), an native English speaker familiar with the scientific terminology, for help with English proofreading.

Funding

This research work was funded by The National Natural Science Fund (31501342), National High Technology Research and Development Program of China (2013AA102601), The National Key Technology R&D program of China (2016YFD0101404), China Agriculture Research System (CARS-18-25), and Jiangsu Collaborative Innovation Center for Modern Crop Production.

Availability of data and materials

All relevant data are within this article and its additional files.

Author information

Authors and Affiliations

Department of Agronomy, Zhejiang University, Zhejiang, 310058, Hangzhou, China
Cong Li, Tianlun Zhao, Hurong Yu, Cheng Li, Xiaolei Deng, Yating Dong, Fan Zhang, Yi Zhang, Lei Mei, Jinhong Chen & Shuijin Zhu

Authors

Cong Li
View author publications
You can also search for this author in PubMed Google Scholar
Tianlun Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Hurong Yu
View author publications
You can also search for this author in PubMed Google Scholar
Cheng Li
View author publications
You can also search for this author in PubMed Google Scholar
Xiaolei Deng
View author publications
You can also search for this author in PubMed Google Scholar
Yating Dong
View author publications
You can also search for this author in PubMed Google Scholar
Fan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yi Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Lei Mei
View author publications
You can also search for this author in PubMed Google Scholar
Jinhong Chen
View author publications
You can also search for this author in PubMed Google Scholar
Shuijin Zhu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

CL and SZ designed the experiments and wrote the manuscript. CL and HY analyzed the data. CL, TZ, CHL and XD participated in field trials. YD, FZ, YZ and LM assisted in editing the article. SZ and JC conducted and supervised the experiments. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Shuijin Zhu.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Table S1. Phenotypic variation of yield and yield components for the parents (PDF 105 kb)

Additional file 2:

Table S2. HB of RILs and MPH percentage of yield and yield components across four environments. (PDF 100 kb)

Additional file 3:

Table S3. Average MPH of 20 top high-heterosis hybrids of yield and yield components. (PDF 100 kb)

Additional file 4:

Table S4. Correlations between yield and yield components estimated in the RIL, IF₂, and two BCF₁ populations. (PDF 61 kb)

Additional file 5:

Figure S1. Chromosomal location of QTLs for yield and yield components in RIL, IF₂, HSBCF₁, MARBCF₁, IF₂MPH, HSBCF₁MPH, and MARBCF₁MPH datasets across four environments. Map distances are given in centimorgans (cM). Solid bars with different colors represent different traits, and the legend is given at the end of figure. FB: fruit branches per plant; BN: boll numbers per plant; BW: boll weight; LP: lint percentage; SY: seed cotton yield; LY: lint yield. (PDF 686 kb)

Additional file 6:

Table S5. QTLs identified for yield and yield components in RILs, IF₂s, HSBCF₁s, MARBCF₁s and their MPH datasets by using the CIM method. (PDF 668 kb)

Additional file 7:

Table S6. Main effects and environmental interactions detected for yield and yield components in RIL, IF₂ and two BCF₁ datasets using the ICIM method. (PDF 404 kb)

Additional file 8:

Table S7. Main effects and environmental interactions detected for yield and yield components in IF₂MPH, HSBCF₁MPH, and MARBCF₁MPH datasets using the ICIM method. (PDF 261 kb)

Additional file 9:

Table S8. Epistatic effects and environmental interactions detected for yield and yield components in RIL, IF₂ and two BCF₁ datasets using the ICIM method. (PDF 648 kb)

Additional file 10:

Table S9. Epistatic effects and environmental interactions detected for yield and yield components in IF₂MPH, HSBCF₁MPH, and MARBCF₁MPH datasets using the ICIM method. (PDF 631 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Li, C., Zhao, T., Yu, H. et al. Genetic basis of heterosis for yield and yield components explored by QTL mapping across four genetic populations in upland cotton. BMC Genomics 19, 910 (2018). https://doi.org/10.1186/s12864-018-5289-2

Download citation

Received: 18 December 2017
Accepted: 20 November 2018
Published: 12 December 2018
DOI: https://doi.org/10.1186/s12864-018-5289-2

Genetic basis of heterosis for yield and yield components explored by QTL mapping across four genetic populations in upland cotton

Abstract

Background

Results

Conclusions

Background

Methods

Plant materials

Field planting and phenotypic evaluation

Genotype analysis and linkage maps

Data analysis

Results

Inbreeding depression and heterosis for yield and yield components

QTL analysis of yield and yield components in RIL, IF2, and two BCF1 populations

Fruit branches per plant (FB)

Boll numbers per plant (BN)

Boll weight (BW)

Lint percent (LP)

Seed cotton yield (SY)

Lint yield (LY)

Multi-environment analysis of main-effect QTL and environmental interactions

Epistatic QTLs in RIL, IF2, and two BCF1 populations

Congruence analysis of the single-locus QTLs and main-effect QTLs

Discussion

Application of RIL, IF2 and BCF1 populations

Detection of heterotic loci

Genetic bases of inbreeding depression and heterosis of yield and yield components

Implications for MAS in yield improvement of upland cotton

Conclusion

References

Acknowledgements

Funding

Availability of data and materials

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Publisher’s Note

Additional files

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Genomics

Contact us

QTL analysis of yield and yield components in RIL, IF₂, and two BCF₁ populations

Epistatic QTLs in RIL, IF₂, and two BCF₁ populations

Application of RIL, IF₂ and BCF₁ populations