Skip to main content
  • Research article
  • Open access
  • Published:

Genome-wide association study of agronomic traits in rice cultivated in temperate regions



Rice plants are sensitive to the agro-climate conditions, being photoperiod one of main factor contributing to their adaptation to the region where they are grown. Dissecting the genetic bases underlying diversity in rice populations adapted to specific environmental conditions is a fundamental resource for breeding. In this study we have analysed a collection of japonica varieties adapted to temperate regions to perform association studies with traits of high agronomical interest such as heading date, plant height, number of panicles, panicle length and number of grains per panicle.


We have performed a genome wide association study using a panel of 1713 SNPs that, based on previous linkage disequilibrium estimations, provides a full coverage of the whole genome. We have found a total of 43 SNPs associated with variations in the different traits. The identified SNPs were distributed across the genome except in chromosome 12, where no associated SNPs were found. The inspection of the vicinity of these markers also revealed a set of genes associated with physiological functions strongly linked to agronomic traits. Of special relevance are two genes involved in gibberellin homeostasis that are associated with plant height and panicle length. We also detected novel associated sites with heading date, panicle length and number of grain per panicle.


We have identified loci associated with important agronomic traits among cultivars adapted to temperate conditions. Some of these markers co-localized with already known genes or QTLs, but the association also provided novel molecular markers that can be of help to elucidate the complicated genetic mechanism controlling important agronomic traits, as flowering regulation in the non-dependent photoperiod pathway. The detected associated markers may provide important tools for the genetic improvement of rice cultivars in temperate regions.


One of the main challenges for rice breeders is to develop high yielding cultivars. As rice is a staple food in many countries, high yield is required to provide food to a large and rising population. It is also desirable to increase the incomes of farmers and the economic profitability of the crop.

Since domestication, rice cultivation expanded towards different agro-ecological environments by the generation of new cultivars through selection of adapted plants to the new conditions through intense and continuous breeding activities [8]. One of the barriers that rice overcame to reach temperate regions was the difference in day length that became one of the main determinants of plant adaptation to new areas [10, 11]. As a consequence, the genetic distance between cultivars from tropical regions and those cultivated in temperate regions enlarged enough to produce strong reproductive barriers that hinder the genetic flow between tropical and temperate cultivars [12] and contributing to the constriction of the genetic pool of cultivars and the emergence of different subpopulations. The two main varietal cultivated groups, indica and japonica, are characterized by adaptations to specific climates, according to the agro-ecological conditions where they were cultivated. Indica cultivars are grown in tropical latitudes, whereas japonica cultivars can be found either in tropical or temperate climates [21]. Due to the adaptation of cultivars to specific climates, in particular to the local photoperiod, the use of cultivars from different subpopulation as donors in breeding programs is challenging since it encompasses the introduction of undesirable traits that may not be appropriate for the specific climate requirements.

The area where japonica type is cultivated is wide enough to hold relevant natural diversity covering a wide spectrum of morphological and physiological variations [24]. The characterization of the genetic bases of this diversity will allow the identification of loci that underlie this phenotypic variation, especially those concerning agronomic traits, with direct application to breeding. This achievement will offer novel opportunities to select appropriated donors to incorporate new traits of interest into local cultivars while conserving those characters responsible for temperate climate adaptation.

Yield is a complex trait that involves multiple component traits and depends on several genetic and environmental factors [34]. Adaptation of cultivars to specific agro-ecological conditions is an essential condition for high yield and, in this sense, an optimal heading date is necessary. The number of grains that a plant may produce is also determinant and it can vary according to the number of grains per panicle and the number of panicles, which are quantitative traits [34]. Panicle number is a difficult trait to study as it relays not only on genetic factors but also on environmental and growth conditions as the density of plants in the fields. Other morphological factors as height and panicle length also affect final yield.

Taking advantage of the diversity of rice, based on historic recombination events and linkage disequilibrium across the genome, genome wide association studies (GWAS) have recently become popular to identify QTLs in plant populations. Several GWAS have been performed in a search for genes involved in traits affecting yield. Studies on panicle architecture, number of spikelets, grain size or heading date have been reported [1, 23]. Most of them have been conducted in indica populations or comparing indica and japonica rice or other wild rice species [9, 23, 38]. A few studies have also been performed to detect associations within japonica populations [32, 36]. Still, the genetic basis of the phenotypic variations among temperate cultivars needs further analysis.

In a previous work, we generated a collection of rice cultivars that represented the genetic diversity present in temperate regions possessing an enormous variety of agromorphological and physiological characters. The collection is composed by 193 traditional landraces and modern elite and old cultivars of rice. The study of the population structure and the genetic relationship among the cultivars evidenced a strong substructure in the temperate rice collection, predominantly based on grain type and the origin of the cultivars [24]. We observed the occurrence of relatively high gene flow and elevated rates of admixture between cultivars grown in remote regions, probably favoured by local breeding activities. The results enable genome-wide association studies of complex traits and functional gene investigations among cultivars acclimated to temperate regions preventing the generation of spurious associations due to subpopulation structure and unknown relationships between cultivars.

In this study, the genetic bases underlying complex agronomic traits contributing to high yield in japonica temperate rice were investigated with the purpose of facilitating rice breeding in temperate regions. We conducted a genome-wide association study and identified and located several QTLs for the investigated traits.


Phenotyping evaluation

Plants from a collection of 193 japonica cultivars adapted to temperate regions were cultivated and evaluated during two consecutive years in natural long day field conditions during summer season. All plants were able to flower under these photoperiod conditions. Heading date (DH) of the varieties in the collection ranged from108 to 47 days (Table 1). Plant height (H), number of panicles (PN) and panicle length (PL) were scored (Additional file 1: Table S1). Traits presented large phenotypic variation (Table 1) with coefficient of variation ranging from 0.12, in the case of heading time, to 0.41 in the number of panicles. All the measured variables showed broad sense heritability values (Table 1). Height varied from 182 to 62 cm, displaying the highest heritability value, 0.92. Panicle length ranged from 34.4 to 9.6 cm showing also a high heritability, 0.81.PN varied from 59 to 4 panicles per plants showing the lowest heritability value, 0.54. Distributions of phenotype frequencies in general displayed an approximately normal distribution (Fig. 1), although only panicle length frequencies were normally distributed. Panicle number data collected in Malta field, during 2016 were excluded from the analysis due to differences in scoring between the individual locations and years, showing significantly lower values than the other assays (Additional file 2: Figure S1). As some of the cultivars displayed dehiscence, the number of grains per panicle (GN) was scored in two batches of plants grown in a greenhouse under natural day conditions before they reached full maturity when data was collected.

Table 1 Phenotypic variations of the studied traits
Fig. 1
figure 1

Phenotypic distributions for the studied agronomic traits. (a) heading date, (b) plant height, (c) number of panicles, (d) panicle length and (e) number of grains per panicle. Histograms show data of all plants used in the assays. The resulting curves from fitting the data to a normal distribution are shown

Correlations exhibited by the agronomic traits between each other are shown in Table 2. Panicle length exhibited high correlation with plant height. Days to heading showed moderate correlation with height and panicle length. Panicle length was no significantly correlated with number of grains. Other traits showed weak or no correlation among each other.

Table 2 Correlation coefficients of paired traits

To identify the relation between traits, and to analyse the distribution of cultivars according to their agronomic characteristics, a principal component analysis (PCA) was performed using the phenotypic variables studied in this work. The first principal component (PC1), which accounted for 45.9% of the variance, separated cultivars according to panicle length, number of grains per panicle and plant height (Fig. 2). This is in accordance with the correlation observed between these traits shown in Table 2. On the other side, PC2, explaining 20.1% of the variance distinguished cultivars according to number of panicles.

Fig. 2
figure 2

Principal components analysis (PCA) plots. The proportion of the variance explained by the first (PC1) and second principal (PC2) components is indicated in parenthesis. DH, heading date; H, plant height; PN, number of panicles; PL, panicle length and GN, number of grains per panicle

Association analysis

To detect associations between SNP markers and variations in the evaluated traits in the 193 japonica cultivars in the collection, we used the genotypic dataset generated in a previous study consisting in a panel of 1713 SNPs uniformly distributed across the whole genome with a mean distance of 215 Kb between each other [24]. According to the linkage disequilibrium estimated for this collection, 368 kb in average, [24] and the rice genome size, 321 Mb [13] the number of SNPs in the panel is adequate to detect associations across the whole genome. Among several available linear models, we chose the Mixed Linear Model (MLM) as it produces fewer spurious associations than other methods [41]. It has been suggested that in the case of rice, MLM reduces the number of false positive but in contrast increases the number of false negatives, overcompensating for population structure and relatedness [43]. We have previously shown that our temperate rice collection has a strong structure displaying several subpopulations [24]. Therefore, to avoid false associations due to the population genetic structure, we used two approaches, a first analysis considering the population structure given by the STRUCTURE software [24] and a second association study based on the population structure according to a PCA analysis (Fig. 2). We set a p-value < 0.001 threshold to consider a SNP to be significantly associated with the trait variation. In addition, q-values were also estimated [28]. The quantile-quantile (QQ) plots and Manhattan plots for the analysed traits obtained using the STRUCTURE Qmatrix are shown in Fig. 3. QQ plots for heading date, panicle length and number of grains indicated that the model was well fitted to the data; the observed p-values were uniformly distributed with some deviation at high values from the expected p-values.

Fig. 3
figure 3

Genome-wide association mapping of the traits in this study. Manhattan and quantile-quantile plots for (a) plant height, (b) heading date, (c) panicle length, (d) number of grains per panicle and (e) number of panicles. The red horizontal line indicates genome- wide significant threshold

The statistical analysis revealed 43 SNPs significantly associated with the different traits (Table 3). The identified SNPs were distributed across the genome except in chromosome 12, where no associated SNPs were found (Fig. 4). The explained variance, given by R2 ranged between 5.2 and 14.3%. Some identified SNPs were corroborated by the two approaches, according to the population structure defined by STRUCTURE software or to the PCA analysis, both approaches shared 18 SNPs with a p-value lower than the threshold set for significance. Considering an association locus as a chromosomal region in which the distance between the adjacent pairs of associated SNPs is given by the estimated LD, these SNPs could be grouped in 33 loci associated with the studied traits (Fig. 4, Table 3).

Table 3 List of SNPs significantly associated with the studied traits
Fig. 4
figure 4

Physical map position of significant associated SNPs detected in the GWAS. The identified sites are labelled in dark green for heading date (DH), in red for height (H), in blue for panicle length (PL), in orange for number of grains per panicle (GN) and pink for number of panicles (PN)

Among the associated sites, nine SNPs were significantly associated to heading date (Table 3) of which seven were novel sites. Some associated markers displayed promising allele distribution among the cultivars (Additional file 3: Table S3). This is the case of DH-8. Accessions carrying different alleles of SNP DH-2, localised at position 13,373,075 in chromosome 5, flowered on average 17 days earlier than accessions carrying the alternative allele (Additional file 4: Table S2). The mean values for heading date of accessions carrying different alleles at DH-7, at position 1,366,920 in chromosome 10 was 7 days at flowering (Additional file 4: Table S2). Additionally, for the 20 earliest cultivars, 19 were confirmed to carry the G allele at DH-7 position, meanwhile for the 20 latest ones 14 carried the A allele. We looked for functional genes related to the traits in the vicinity of the SNP in an extended the interval of 368 Kb according to the estimated LD decay. DH-3 and DH-4 co-localized with previously described genes or QTLs. Two QTLs, photoperiod sensitive phase 6 (qPSP-6) and qDTH6, involved in the regulation of flowering mediated by the photoperiod [3, 5] showed positional relationship with DH-3, located at position 7,249,825. Additionally, DH-3 is relatively close, less than 1 Mb, to Su-Se1 (Table 4), a QTL that has also been implicated in the photoperiodic regulation of flowering [40]. The mean value for heading date of accessions carrying different DH-3 alleles showed a difference of 5 days at flowering (Additional file 4: Table S2). For the 20 earliest cultivars, 13 were confirmed to carry the G allele at DH-3 position meanwhile 19 cultivars corresponding to the 20 latest cultivars also carried the same allele.

Table 4 List of known genes or QTLs related to the traits within 0.36 Mb distance interval

Panicle length showed the highest number of associated markers, 14 markers that were distributed across 9 chromosomes (Fig. 4). Five of these markers co-localized with genes involved in plant height or panicle length. PL-6, in chromosome 3, co-localised with rice panicle 3 (rip-3), coding for an α-tubulin that probably participates in the suppression of the panicle elongation during water deficit [27]. Semidwarf gene g / gibberellin insensitive dwarf1 (sdg / gid1) co-localised to PL-8, in chromosome 5 (Table 4). Mutants carrying a null allele of gid1 showed dwarf phenotype [29, 31]. Allele A at PL-8 position was present in 17 cultivars of the corresponding 20 with the shortest panicles. On the other hand, allele G was present in 17 of the 20 cultivars with the longest panicles (Additional file 4: Table S2). In a similar manner, allele G at PL-9, in the same locus that PL-8, was found in 17 of the 20 cultivars with the shortest panicles; meanwhile allele A was present in all cultivars corresponding to the 20 with the longest panicles. PL-13, associated with panicle in chromosome 9, co-localised with LAGGING GROWTH AND DEVELOPMENT 1 (LGD1) implicated in growth and in the formation of panicle. The lgd1 mutant showed altered panicle architecture [30]. Finally, QTL ssd10 (spikelet setting density) is located at the same position as PL-14 in chromosome 10 [33].

We detected 16 significant markers associated with the number of grains per panicle. GN-3, in chromosome 2, co-localized with leucine-rich repeat receptor-like kinase 1 (LRK-1) (Table 4). LRK1 is a plasma membrane protein that, supposedly, regulates rice branch number by enhancing cellular proliferation [42]. One marker, in chromosome 7 was identified associated to height. H-1 co-localized with dense and erect panicle 2 (DEP2) [16]. DEP2 is involved in the elongation of rachis and primary and secondary branches in the panicles and plants carrying mutations in DEP2 exhibited smaller panicles but also reduced height [16]. Finally, two additional markers, in chromosome 2 and 3, were associated with the number of panicles (Table 3). Neither QTLs nor candidate genes were found nearby these markers.


After domestication, despite the non-favourable flowering conditions, rice cultivation area expanded across regions with temperate climate. Modulating the sensitivity photoperiod, plants were able to overcome the inhibition of flowering during long day conditions and reached Northern latitudes. High summer temperatures, accompanied by higher number of hours of light with good solar radiation, constituted an excellent growth environment allowing expansion and originating wide diversity that is reflected in different agronomic traits, as it is shown in this study. The diversity in plants already adapted to the photoperiod is a useful resource for breeders as it may provide donors suitable for their local agronomical conditions. The majority of the analyses that have been previously reported on the identification of variations in genes of agronomic interest have been performed comparing japonica and indica cultivars. Thus, factors involved in phenotypic variations among japonica subpopulation need further investigation.

In this study, we have used GWAS as an effective method to detect associations between genomic regions and phenotypes [9, 19] of cultivars that are grown across regions with temperate climate. We have used a collection of cultivars, generated previously, including cultivars from 23 countries, that represents the genetic diversity present in temperate climate regions and constitutes a useful resource for the identification of genetic factors governing variations in agronomic traits in these regions. In fact, this collection has already been successfully proved to detect polymorphism associated with cold tolerance [25]. The analysis performed in this study has taken into account the structure of the population and relatedness among cultivars, two factors that may led to false associations in GWAS. We have considered two possibilities, one that distributed the collection into four sub-populations according to STRUCTURE software [24] and a second structure generated by PCA analysis. The analyses performed considering both Q matrix gave similar results, and most of the detected SNPs were identified using any of the both possibilities in the case of height, panicle length and panicle number (Table 3). It is remarkable that for DH-7, DH-8 and DH-9, which are positioned within an interval of 505 Kb, the association was detected using either one or the other structure considerations.

Flowering is a trait of great interest for breeders. An optimum heading date, appropriated to the day length in a local region, is necessary for a maximum yield. Control of flowering by photoperiod has supposedly been one of the main factors that contributed to the expansion of rice through regions with long day conditions. It is also a key factor in the adaptation of cultivars to each local condition and it is determined by the latitude where they are grown. Regulation of flowering by the photoperiod has been extensively studied and, nowadays, it is already known that the fine-tuning regulation of Hd3a and RFT1 promotes flowering under short or long day conditions [15]. While Hd3a dictates the transition of rice from vegetative to reproductive stage under short day conditions, RFT1 has been pointed as the major floral activator under long day conditions. Winter cold temperatures in temperate climate regions constrain rice cultivation, preventing two harvests per year even in the case of early flowering. But shortening the growth duration in a few days is still desirable by farmers as it increases crop security by reducing the risk of pathogen attacks or weather adverse conditions like storms at the end of the growing season, frequent in the East coast of Spain. All cultivars in our temperate collection were able to flower under natural long day conditions and heading date varied between 47 and 106 days (Table 1). Nine loci were found to be associated with heading date. In this study we expected to find the source of heading date variation among cultivars already adapted to long days, originated in areas with little photoperiod differences. But, surprisingly, one marker in chromosome 6, DH-4, co-localised with Su-Se-1 and qDTH6, QTLs that were identified in photoperiod regulation studies (Table 4). Su-Se-1 is a dominant photoperiod-sensitive suppression gene identified in a cross between Asominori x IR24 [40]. qDTH6 was isolated in a cross between Hayamasari, an early cultivar from Japan, and Italica Livorno [5]. These authors suggested the possibility that qDTH6 may be Hd1, a key regulatory gene in the photoperiodic control of flowering [37]. Hd1 is located in chromosome 6, at a distance of 1.2 Mb from DH-4, a high distance for considering the association between both genomic regions. It has been reported that the allelic heterogeneity and complex genome structure of the Hd1 causes spurious associations and a shift in the genomic position of the signal corresponding to Hd1 has been observed in some association studies [36]. We have previously investigated the Hd1 structure in 52 cultivars included in the present collection and found 12 different variants that include non-functional alleles of Hd1 [20]. HD1 is a repressor of flowering under long day conditions through the inhibition of Hd3a expression [14] and it has been suggested that the lack of functionality of HD1 is crucial for the adaptation of rice to long day condition in temperate regions [7]. This hypothesis is reinforced by the fact that loss-of function alleles of HD1 are common in cultivated rice in Northern latitudes ([6]; Gómez-Ariza et al., 2015). However, previous studies have concluded that HD1 is not involved in the regulation of flowering under long day conditions, as many cultivars grown in these conditions carried both functional and non-functional allele of HD1 independently of the geographical origin of the cultivars [20]. The possibility of the occurrence of another factor involved in flowering in the immediacy to Hd1 should not be discarded and it needs further investigation. No significant associated markers with heading date were found in the immediacy of Hd3a (2939760–2,942,696) or RFT1 (2926823–2,928,474) located in chromosome 6.

Some novel loci associated to heading date described in this study are promising according to the difference in heading date of cultivars carrying one or the other allele. In these sense, DH-7 and DH-8, both in the same loci in chromosome 10, appeared to be good candidates as the difference in heading date of cultivars carrying one or the other allele is 7 and 5 days respectively (Additional file 4: Table S2). DH-7 and DH-8 accounted for the 7.2 and 6.7% respectively of the explained variance (Table 3). Whether these loci are associated to photoperiod regulation of flowering remains unknown and further investigation is needed.

Several loci were identified associated with panicle length. We didn’t find any functionally characterized gene in the vicinity of 7 of these loci and we could considerer that they represent novel associated sites with panicle length. Some of the detected SNPs were close to genes that participate in panicle elongation but that also affect other parts of the plant. This is the case of PL-8 that mapped at a distance of 0.5 Mb of SALT-RESPONSIVE ERF1 (SERF1), is a transcriptional factor that acts as negative regulator of grain filling. Mutants carrying a null SERF1 allele resulted in larger grains and altered panicle length [26]. OsEBS, at 0.9 Mb of distance from PL-10, produces higher spikelet number in the panicles, leading to an increase in total grain yield although also affects plant height and leaf size [4]. Introgression lines carrying the Oryza rufipogon allele of OsEBS also produce longer panicles than wild type [4]. Oryza sativa dwarf rice with overexpression of gibberellin-induced gene (OsDOG), involved in gibberellin homeostasis, was located at 0.8 Mb from PL-12 and less than 1.2 Mb from H-2. OsDOG is an A20/AN1 zinc-finger protein that negatively regulates gibberellin-mediated cell elongation in rice. Transgenic plants expressing OsDOG exhibited dwarf phenotypes and shorter panicles that did not completely emerge from the leaf sheath due to deficient cell elongation [18]. OsDOG was also found associated with panicle length in a recent study of japonica rice grown under permanent flooding conditions [32]. LAGGING GROWTH AND DEVELOPMENT 1 (LGD1) is also close to PL-13. The lgd1 mutant displays pleotropic effects in rice, showing slow growth, reduced tiller number and plant height, altered panicle architecture and reduced grain yield [30]. On the other side, PL-6 localized close to rice panicle 3 (rip-3), coding for a putative α-tubulin protein, which expression was observed in all the reproductive organs of rice panicle, but not in other parts of the plant. Interestingly, rip-3 acts, supposedly, as suppressor of panicle elongation in the regions of high growth and in periods of water deficit [27]. Panicle length is often associated with yield as long panicles may produce higher number of spikelets and, thus, higher number of grains [17]. But in our collection, no correlation could be found between panicle length and the number of grains per panicle (Table 2). This is in agreement with the fact that no significant markers were detected in common when analysing both panicle length and number of grain per panicle.


In this association study, we have identified molecular markers related to important agronomic traits among cultivars adapted to temperate photoperiod conditions and, thus, they have a direct application in breeding programs. Some of these markers co-localizing with known genes or QTLs, such as DEP2, LRK1 or LGD1, validate our methodology while the study provided novel molecular markers that can be of help to elucidate the complicated genetic mechanism controlling important agronomic traits in japonica rice, as flowering regulation under log day conditions.


Plant material and growing conditions and Phenotyping

A japonica type rice collection composed by 193 cultivars was obtained from different sources [24]: The International Rice Research Institute (IRRI, Philippines), U.S. National Plant Germplasm System (NPGS, USA), Rice Genome Resource Center (RGRC, Japan), Copsemar and Instituto Valenciano de Investigaciones Agrarias (IVIA, Spain). Seeds of different cultivars were cultivated in two locations during 2015 and 2016 summer seasons, Tancat de Malta and Finca de Raga, in Valencia rice growing area in Spain. Plantlets were germinated in pots and manually transplanted to soil in double rows of 20 plants in May and harvested in September. Fields were continuously irrigated by flooding and drained two weeks before harvesting. Two additional batches of plants were grown separately during 2017 summer season in a greenhouse under natural day condition.

Height and heading time were measured in the field. Heading time was measured as 50% of panicle emergence in each variety. Three plants per cultivar were collected from each location and panicle number and panicle length were scored on three representative panicles for each plant. The number of grains per panicle was measured in three different panicles per plant from plants grown in the greenhouse.

Statistical analysis

Phenotypic data analyses (means, coefficients of variation and histograms of frequency distribution) were performed using Statgraphics Plus. Correlations between trait values were examined by Pearson’s correlation coefficient test using statistical software R ( Broad sense heritability was calculated using linear regression between mean value for variety at 2015 and 2016 at Malta field and mean value for variety at 2015 and 2016 at Copsemar, using lm command of R. For grain number regression was performed between plants at different sides of the green house.

Marker trait association analysis

For the marker trait association analysis, a 1713 SNP panel obtained in a previous study was used [24]. This panel was generated from a custom Infinium SNP genotyping array compiling 2697 SNP representative of japonica rice cultivated under long day conditions after removing SNP not present in at least 75% of varieties of the rice core pool, or showing a MAF lower than 5%. Association between phenotype and markers was analyzed using Tassel software (version 5.2.34) [2]. Mixed linear model (MLM) was used with kinship control (K) and with structure control (Q) to avoid spurious associations [41].

K and Q matrix were obtained from a 948 SNP panel, this one was created after filtering the 1713 SNP panel according to the previously estimated linkage disequilibrium decay distance extension for japonica subset from the collection [24]. Kinship matrix was computed with Tassel and two Q matrices were created. First one was obtained using STRUCTURE 2.3.4 software [22] retaining the membership value of three of four groups, being four the optimum estimated number of groups for this collection [24]. The latter was generated after performing a Principal Component Analysis with Tassel and by screen plot where 7 principal components were retained, which accounted for 49% of total variance (Additional file 5: Table S4). The threshold value for significant association marker-phenotype was set at p < 10− 3 due to the effectivity of the use of MLM with correction to avoid spurious associations. Fdr values were calculated using “qvalue” package 1.99.1. Manhattan plots were drawn using modified “qqman” package (version 0.1.4) for R.

To search for functionally characterized genes harboring the detected SNPs or within a 1.2 Mb interval, we looked at the Nipponbare reference annotation database (Os-Nipponbare-Reference-IRGSP-1.0) and explored the OGRO database [35] and, in addition, the Q-TARO database [39] in a search for QTLs in the same trait category.



Heading date


Number of grains per panicle


Genome wide association study


Plant height


linkage disequilibrium


Mixed linear model


Principal component analysis


Panicle length


Number of panicles


  1. Begum H, Spindel JE, Lalusin A, Borromeo T, Gregorio G, Hernandez J, Virk P, Collard B, McCouch SR. Genome-wide association mapping for yield and other agronomic traits in an elite breeding population of tropical rice (Oryza sativa). PLoS One. 2015;10(3):e0119873.

    Article  PubMed Central  PubMed  Google Scholar 

  2. Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES. TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics. 2007;23(19):2633–5.

    Article  CAS  PubMed  Google Scholar 

  3. Cai HW, Morishima H. Mapping QTLs for heading behavior using RI population derived from a cross between wild and cultivated rice strains. Rice Genetics Newsletter. 1998;15:144–6.

    Google Scholar 

  4. Dong X, Wang X, Zhang L, Yang Z, Xin X, Wu S, Sun C, Liu J, Yang J, Luo X. Identification and characterization of OsEBS, a gene involved in enhanced plant biomass and spikelet number in rice. Plant Biotechnol J. 2013;11(9):1044–57.

    Article  CAS  PubMed  Google Scholar 

  5. Fujino K, Sekiguchi H. Identification of QTLs conferring genetic variation for heading date among Rice varieties at the northern-limit of Rice cultivation. Breeding Sci. 2005;55:141–6.

    Article  CAS  Google Scholar 

  6. Fujino K, Wu J, Sekiguchi H, Ito T, Izawa T, Matsumoto T. Multiple introgression events surrounding the Hd1 flowering-time gene in cultivated rice, Oryza sativa L. Mol Gen Genomics. 2010;284(2):137–46.

    Article  CAS  Google Scholar 

  7. Goretti D, Martignago D, Landini M, Brambilla V, Gomez-Ariza J, Gnesutta N, Galbiati F, Collani S, Takagi H, Terauchi R, Mantovani R, Fornara F. Transcriptional and post-transcriptional mechanisms limit heading date 1 (Hd1) function to adapt Rice to high latitudes. PLoS Genet. 2017;13(1):e1006530.

    Article  PubMed Central  PubMed  Google Scholar 

  8. Huang X, Kurata N, Wei X, Wang ZX, Wang A, Zhao Q, Zhao Y, Liu K, Lu H, Li W, Guo Y, Lu Y, Zhou C, Fan D, Weng Q, Zhu C, Huang T, Zhang L, Wang Y, Feng L, Furuumi H, Kubo T, Miyabayashi T, Yuan X, Xu Q, Dong G, Zhan Q, Li C, Fujiyama A, Toyoda A, Lu T, Feng Q, Qian Q, Li J, Han B. A map of rice genome variation reveals the origin of cultivated rice. Nature. 2012;490(7421):497–501.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Huang X, Zhao Y, Wei X, Li C, Wang A, Zhao Q, Li W, Guo Y, Deng L, Zhu C, Fan D, Lu Y, Weng Q, Liu K, Zhou T, Jing Y, Si L, Dong G, Huang T, Lu T, Feng Q, Qian Q, Li J, Han B. Genome-wide association study of flowering time and grain yield traits in a worldwide collection of rice germplasm. Nat Genet. 2011;44(1):32–9.

    Article  CAS  PubMed  Google Scholar 

  10. Itoh H, Wada KC, Sakai H, Shibasaki K, Fukuoka S, Wu J, Yonemaru JI, Yano M, Izawa T. Genomic adaptation of flowering-time genes during the expansion of rice cultivation area. Plant J. 2018;94(5):895–909.

    Article  CAS  PubMed  Google Scholar 

  11. Izawa T. Adaptation of flowering-time by natural and artificial selection in Arabidopsis and rice. J Exp Bot. 2007;58(12):3091–7.

    Article  CAS  PubMed  Google Scholar 

  12. Jeung JU, Hwang HG, Moon HP, Jena KK. Fingerprinting temperate japonica and tropical indica rice genotypes by comparative analysis of DNA markers. Euphytica. 2005;146:239–51.

    Article  CAS  Google Scholar 

  13. Kawahara Y, de la Bastide M, Hamilton JP, Kanamori H, McCombie WR, Ouyang S, Schwartz DC, Tanaka T, Wu J, Zhou S, Childs KL, Davidson RM, Lin H, Quesada-Ocampo L, Vaillancourt B, Sakai H, Lee SS, Kim J, Numa H, Itoh T, Buell CR, Matsumoto T. Improvement of the Oryza sativa Nipponbare reference genome using next generation sequence and optical map data. Rice. 2013;6(1):4.

    Article  PubMed Central  PubMed  Google Scholar 

  14. Kojima S, Takahashi Y, Kobayashi Y, Monna L, Sasaki T, Araki T, Yano M. Hd3a, a rice ortholog of the Arabidopsis FT gene, promotes transition to flowering downstream of Hd1 under short-day conditions. Plant Cell Physiol. 2002;43(10):1096–105.

    Article  CAS  PubMed  Google Scholar 

  15. Komiya R, Yokoi S, Shimamoto K. A gene network for long-day flowering activates RFT1 encoding a mobile flowering signal in rice. Development. 2009;136(20):3443–50.

    Article  CAS  PubMed  Google Scholar 

  16. Li F, Liu W, Tang J, Chen J, Tong H, Hu B, Li C, Fang J, Chen M, Chu C. Rice DENSE AND ERECT PANICLE 2 is essential for determining panicle outgrowth and elongation. Cell Res. 2010;20(7):838–49.

    Article  PubMed  Google Scholar 

  17. Li S, Qian Q, Fu Z, Zeng D, Meng X, Kyozuka J, Maekawa M, Zhu X, Zhang J, Li J, Wang Y. Short panicle1 encodes a putative PTR family transporter and determines rice panicle size. Plant J. 2009;58(4):592–605.

    Article  CAS  PubMed  Google Scholar 

  18. Liu Y, Xu Y, Xiao J, Ma Q, Li D, Xue Z, Chong K. OsDOG, a gibberellin-induced A20/AN1 zinc-finger protein, negatively regulates gibberellin-mediated cell elongation in rice. J Plant Physiol. 2011;168(10):1098–105.

    Article  CAS  PubMed  Google Scholar 

  19. McCouch SR, Wright MH, Tung CW, Maron LG, McNally KL, Fitzgerald M, Singh N, DeClerck G, Agosto-Perez F, Korniliev P, Greenberg AJ, Naredo ME, Mercado SM, Harrington SE, Shi Y, Branchini DA, Kuser-Falcao PR, Leung H, Ebana K, Yano M, Eizenga G, McClung A, Mezey J. Open access resources for genome-wide association mapping in rice. Nat Commun. 2016;7:10532.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  20. Naranjo L, Talon M, Domingo C. Diversity of floral regulatory genes of japonica rice cultivated at northern latitudes. BMC Genomics. 2014;15:101.

    Article  PubMed Central  PubMed  Google Scholar 

  21. Mackill DJ and Lei X. Genetic Variation for Traits Related to Temperate Adaptation of Rice Cultivars. Crop Sci. 1997;37:1340–1346

    Article  Google Scholar 

  22. Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155(2):945–59.

    CAS  PubMed  PubMed Central  Google Scholar 

  23. Rebolledo MC, Pena AL, Duitama J, Cruz DF, Dingkuhn M, Grenier C, Tohme J. Combining image analysis, genome wide association studies and different field trials to reveal stable genetic regions related to panicle architecture and the number of Spikelets per panicle in Rice. Front Plant Sci. 2016;7:1384.

    Article  PubMed Central  PubMed  Google Scholar 

  24. Reig-Valiente JL, Viruel J, Sales E, Marques L, Terol J, Gut M, Derdak S, Talon M, Domingo C. Genetic diversity and population structure of Rice varieties cultivated in temperate regions. Rice. 2016;9:58.

    Article  PubMed Central  PubMed  Google Scholar 

  25. Sales E, Viruel J, Domingo C, Marques L. Genome wide association analysis of cold tolerance at germination in temperate japonica rice (Oryza sativa L.) varieties. PLoS One. 2017;12(8):e0183416.

    Article  PubMed Central  PubMed  Google Scholar 

  26. Schmidt R, Schippers JH, Mieulet D, Watanabe M, Hoefgen R, Guiderdoni E, Mueller-Roeber B. SALT-RESPONSIVE ERF1 is a negative regulator of grain filling and gibberellin-mediated seedling establishment in rice. Mol Plant. 2014;7(2):404–21.

    Article  CAS  PubMed  Google Scholar 

  27. Sheoran IS, Koonjul P, Attieh J, Saini HS. Water-stress-induced inhibition of alpha-tubulin gene expression during growth, and its implications for reproductive success in rice. Plant Physiol Biochem. 2014;80:291–9.

    Article  CAS  PubMed  Google Scholar 

  28. Storey JD, Bass AJ, Dabney A, Robinson D: qvalue: Q-value estimation for false discovery rate control. R package version 2.6.0. 2015;

    Google Scholar 

  29. Sui JM, Guo BT, Wang JS, Qiao LX, Zhou Y, Zhang HG, Gu MH, Liang GH. A new GA-insensitive Semidwarf mutant of Rice (Oryza sativa L.) with a missense mutation in the SDG Gene. Plant Mol Biol Report. 2012;30:187–94.

    Article  CAS  Google Scholar 

  30. Thangasamy S, Chen PW, Lai MH, Chen J, Jauh GY. Rice LGD1 containing RNA binding activity affects growth and development through alternative promoters. Plant J. 2012;71(2):288–302.

    Article  CAS  PubMed  Google Scholar 

  31. Ueguchi-Tanaka M, Ashikari M, Nakajima M, Itoh H, Katoh E, Kobayashi M, Chow TY, Hsing YI, Kitano H, Yamaguchi I, Matsuoka M. GIBBERELLIN INSENSITIVE DWARF1 encodes a soluble receptor for gibberellin. Nature. 2005;437(7059):693–8.

    Article  CAS  PubMed  Google Scholar 

  32. Volante A, Desiderio F, Tondelli A, Perrini R, Orasen G, Biselli C, Riccardi P, Vattari A, Cavalluzzo D, Urso S, Ben Hassen M, Fricano A, Piffanelli P, Cozzi P, Biscarini F, Sacchi GA, Cattivelli L, Vale G. Genome-wide analysis of japonica Rice performance under limited water and permanent flooding conditions. Front Plant Sci. 2017;8:1862.

    Article  PubMed Central  PubMed  Google Scholar 

  33. Xiao J, Li J, Yuan L, Tanksley SD. Identification of QTLs affecting traits of agronomic importance in a recombinant inbred population derived from a subspecific rice cross. Theor Appl Genet. 1996;92(2):230–44.

    Article  CAS  PubMed  Google Scholar 

  34. Xing Y, Zhang Q. Genetic and molecular bases of rice yield. Annu Rev Plant Biol. 2010;61:421–42.

    Article  CAS  PubMed  Google Scholar 

  35. Yamamoto E, Yonemaru J, Yamamoto T, Yano M. OGRO: the overview of functionally characterized genes in Rice online database. Rice. 2012;5(1):26.

    Article  PubMed Central  PubMed  Google Scholar 

  36. Yano K, Yamamoto E, Aya K, Takeuchi H, Lo PC, Hu L, Yamasaki M, Yoshida S, Kitano H, Hirano K, Matsuoka M. Genome-wide association study using whole-genome sequencing rapidly identifies new genes influencing agronomic traits in rice. Nat Genet. 2016;48(8):927–34.

    Article  CAS  PubMed  Google Scholar 

  37. Yano M, Katayose Y, Ashikari M, Yamanouchi U, Monna L, Fuse T, Baba T, Yamamoto K, Umehara Y, Nagamura Y, Sasaki T. Hd1, a major photoperiod sensitivity quantitative trait locus in rice, is closely related to the Arabidopsis flowering time gene CONSTANS. Plant Cell. 2000;12(12):2473–84.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  38. Yonemaru J, Mizobuchi R, Kato H, Yamamoto T, Yamamoto E, Matsubara K, Hirabayashi H, Takeuchi Y, Tsunematsu H, Ishii T, Ohta H, Maeda H, Ebana K, Yano M. Genomic regions involved in yield potential detected by genome-wide association analysis in Japanese high-yielding rice cultivars. BMC Genomics. 2014;15:346.

    Article  PubMed Central  PubMed  Google Scholar 

  39. Yonemaru J, Yamamoto T, Fukuoka S, Uga Y, Hori K, Yano M. Q-TARO: QTL annotation Rice online database. Rice. 2010;3:194–203.

    Article  Google Scholar 

  40. Yu CY, Wei XJ, Chen LM, Jiang L, Zhai HQ, Wan JM. Identification of a dominant suppressor of photoperiod-sensitive gene using indica/japonica backcrossed progenies in rice (Oryza sativa L.). Rice Genetics Newsletter. 2005;22:54.

    Google Scholar 

  41. Yu J, Pressoir G, Briggs WH, Vroh Bi I, Yamasaki M, Doebley JF, McMullen MD, Gaut BS, Nielsen DM, Holland JB, Kresovich S, Buckler ES. A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat Genet. 2006;38(2):203–8.

    Article  CAS  PubMed  Google Scholar 

  42. Zha X, Luo X, Qian X, He G, Yang M, Li Y, Yang J. Over-expression of the rice LRK1 gene improves quantitative yield components. Plant Biotechnol J. 2009;7(7):611–20.

    Article  CAS  PubMed  Google Scholar 

  43. Zhao K, Tung CW, Eizenga GC, Wright MH, Ali ML, Price AH, Norton GJ, Islam MR, Reynolds A, Mezey J, McClung AM, Bustamante CD, McCouch SR. Genome-wide association mapping reveals a rich genetic architecture of complex traits in Oryza sativa. Nat Commun. 2011;2:467.

    Article  PubMed  Google Scholar 

Download references


This work was supported by the INNPACTO program of the Ministerio de Economía y Competitividad, (grant IPT-2011-1244-010000) and by the Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA) (grant RTA2014–00058-C03–00). JLRV was funded by the European Social Fund (fellowship number 01/14-FSE-0).

Availability of data and materials

All data generated or analysed during this study are included in this published article and its supplementary information files. Other datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

Author information

Authors and Affiliations



CD conceived the project, designed research, data interpretation, phenotyping and wrote the manuscript. JLR performed GWAS, analysis of the population structure and phenotyping. LM participated in phenotyping and data analysis. MT was involved in drafting and revising critically the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Concha Domingo.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests. This work has been developed in collaboration among a public research institutions (Instituto Valenciano de Investigaciones Agrarias) and one private company (COPSEMAR S. COOP.LTDA.). The commercial affiliation of one of the authors (L.M.) does not alter our adherence to BMC policies on sharing data and materials.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Table S1. Mean values of height, days to heading, panicle length, number of panicles and grains per panicle from the different assays in both locations. SD, standard deviation. (XLSX 64 kb)

Additional file 2:

Figure S1. Boxplot showing the distribution of data scored for days to heading, height, number of panicles and panicle length in both locations, Copsemar and Malta, during 2015 and 2016. (PPTX 75 kb)

Additional file 3:

Table S3. Number of alleles carried by the 20 plants with the highest and the lowest value for each trait. Rr: heterozygous; NN: indetermination. (XLSX 13 kb)

Additional file 4:

Table S2. Mean phenotypic values of cultivars carrying different alleles in the significant associated markers. (XLSX 16 kb)

Additional file 5:

Table S4. PCA. Proportion of the total and cumulative proportion of each principal component. (XLSX 16 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Reig-Valiente, J.L., Marqués, L., Talón, M. et al. Genome-wide association study of agronomic traits in rice cultivated in temperate regions. BMC Genomics 19, 706 (2018).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: