Skip to main content

Genomic selection accuracies within and between environments and small breeding groups in white spruce



Genomic selection (GS) may improve selection response over conventional pedigree-based selection if markers capture more detailed information than pedigrees in recently domesticated tree species and/or make it more cost effective. Genomic prediction accuracies using 1748 trees and 6932 SNPs representative of as many distinct gene loci were determined for growth and wood traits in white spruce, within and between environments and breeding groups (BG), each with an effective size of N e  ≈ 20. Marker subsets were also tested.


Model fits and/or cross-validation (CV) prediction accuracies for ridge regression (RR) and the least absolute shrinkage and selection operator models approached those of pedigree-based models. With strong relatedness between CV sets, prediction accuracies for RR within environment and BG were high for wood (r = 0.71–0.79) and moderately high for growth (r = 0.52–0.69) traits, in line with trends in heritabilities. For both classes of traits, these accuracies achieved between 83% and 92% of those obtained with phenotypes and pedigree information. Prediction into untested environments remained moderately high for wood (r ≥ 0.61) but dropped significantly for growth (r ≥ 0.24) traits, emphasizing the need to phenotype in all test environments and model genotype-by-environment interactions for growth traits. Removing relatedness between CV sets sharply decreased prediction accuracies for all traits and subpopulations, falling near zero between BGs with no known shared ancestry. For marker subsets, similar patterns were observed but with lower prediction accuracies.


Given the need for high relatedness between CV sets to obtain good prediction accuracies, we recommend to build GS models for prediction within the same breeding population only. Breeding groups could be merged to build genomic prediction models as long as the total effective population size does not exceed 50 individuals in order to obtain high prediction accuracy such as that obtained in the present study. A number of markers limited to a few hundred would not negatively impact prediction accuracies, but these could decrease more rapidly over generations. The most promising short-term approach for genomic selection would likely be the selection of superior individuals within large full-sib families vegetatively propagated to implement multiclonal forestry.


Genomic selection GS, [1] has been proposed as an alternative to conventional pedigree-based selection (PS), where pedigree information is replaced with dense genetic marker information to estimate the genetic worth of each individual. In GS, thousands of markers are modeled simultaneously and the marker effects are summed, in order to compute the genomic (GEBV) equivalent of estimated additive genetic (breeding) values (EBV). GS allows small-effect loci to be captured, obviating significance testing in association and linkage studies. PS has been very effective at capturing the parent average component of the EBV, but cannot capture variation in the proportion of genome shared by pairs of relatives due to Mendelian sampling in the absence of an individual’s own phenotypic record or those of its descendants [2, 3]. Conversely, GS can theoretically capture the Mendelian sampling component in the absence of recorded phenotypes because at an appropriate marker density, all loci that explain some of the phenotypic variation are presumably tagged.

Theoretical [46] and empirical studies in breeding populations of forest trees [79] and other long-generation plant species [10, 11] have shown that substantial genetic gains can be achieved through GS. It came out that the main advantage of GS over conventional breeding in long-lived species was the possibility to reduce the generation time through the selection based on marker effects before phenotypes are available. Thus, any (possible) loss of EBV precision could be compensated for by completing additional (faster) cycles of recurrent selection. However, less attention has been paid to the effect of relatedness between cross-validation (CV) training and testing sets, choice of model and/or the possibility of predicting in new, unknown environments.

The main assumption of GS is that marker coverage is sufficiently dense so that linkage disequilibrium (LD) between markers and causal loci will not be considerably broken up following recombination/meiotic events. One way to increase the chance that markers are in LD with causal loci is to reduce the effective population size (N e ), as it is easily controlled by the plant and tree breeders [4]. It has been shown that the increase in prediction accuracy with a reduction of N e is striking at lower marker densities, but minor at higher marker densities [4]. Thus, one could expect that when working with a population of small effective size, a small number of markers might be sufficient to capture the LD and obtain high prediction accuracies. However, reducing N e also increases within-population relatedness, and because markers, even at low density coverage, can capture close relatedness between individuals in the training (estimation) and testing (validation) sets, EBV predictions in CV testing sets can be non-zero despite the absence of LD [12]. Indeed, empirical studies removing relatedness between training and testing sets at the level of the population (breed) [13], subpopulation [14] or family [1517] levels, including a previous study on white spruce [18], found predictive abilities to be drastically reduced or to fall to zero. Similarly, it was suggested that predictive accuracy was driven by markers tracing family relatedness in pine [19, 20]. This stresses the importance of designing training and testing sets appropriate to the conditions under which GS might be used [17] so that correct conclusions can be drawn and, also, so that marker densities that make it possible to capture not only relatedness but also LD can be used.

Model choice may be important insofar as it must align with the genetic architecture of the trait. Ridge regression (RR) [21, 22] is a model appropriate for traits controlled by a large number of small-effect loci, such as growth and wood traits e.g. [2325], closely resembling assumptions of the infinitesimal pedigree-based (polygenic) model. Methods such as BayesA/BayesB [1] or least absolute shrinkage and selection operator (LASSO) [26] are more relevant for traits controlled by few(er) genes of large(r) effect and are supposedly better at zeroing out (shrink) loci of negligible effect [27]. The simpler, more parsimonious RR is widely used in GS as it often performs as well as more complex models e.g. [9, 28], even though RR is known to better capture relatedness [12], which complicates the interpretation of its effectiveness.

So far, GS prediction accuracy has mostly been evaluated within single environments. However, as reforestation programs take place on a variety of sites, genotype-by-environment (GE) effects can be substantial and should be considered. It has been shown that a multivariate model, where traits measured on different sites are treated as different correlated traits, could improve predictive ability (correlation between observed and predicted phenotype) for lines tested in one environment but not in the other, thus capitalizing on correlated information between environments [29]. However, this model was less powerful for a line not tested in any of the two environments, with predictions drawn solely from information on relatives [29]. Similarly, univariate prediction accuracies dropped when marker effects estimated in a given environment were used to make predictions in different environments [7, 14], although a recent study on white spruce in eastern Canada showed that prediction accuracy could only be slightly reduced in such situations [18]. Thus, given species-specific considerations and the potential variability of the reforestation landscape, it appears important to determine whether the estimated marker effects have accurate predictive value in the different reforestation environments.

The objectives of this study were 1) to evaluate how type of model, relatedness between training and testing sets, marker subsets, and across-environment predictions affect model fit or prediction accuracy for wood and growth traits, 2) to estimate the relative efficiency of marker-based versus pedigree-based models, and 3) to discuss implications for the use of GS in the context of spruce improvement and its long breeding cycles.




Genetic material was sampled from a larger test series (E952) designed to assess the genetic merit of first-generation selections, which had been subdivided into breeding groups delineated by their provenance or geographic region [30]. Crosses were made using a partial diallel mating design within each of six distinct breeding groups (BG) or sublines [31], which were designed to limit future inbreeding to within group. Each parent was used in crosses 1–3 times, giving rise to a mixture of full- and half-sib families within breeding groups. Genetic tests were established in 1997 from 2-year-old nursery-grown seedlings near Asselin Township (S1-C, located in the balsam fir–yellow birch ecological zone; cool weather), and St. Casimir (S2-D, located in the maple–basswood ecological zone; warm weather), Québec, Canada, in a randomized complete block design. Trees were assigned to row-plots of five trees/plot (2 m × 2 m spacing).

For the current study, the base population (n = 39) consisted of 19 parents in a first breeding group (BG1) and 20 parents in a second (BG2) with 27 and 32 full-sib crosses (59 total) made within each group, respectively. The sampled trees were drawn from 16 blocks over the two sites and 25–33 individuals were sampled per full-sib cross.

In 2012, wood cores were extracted from 17-year-old trees with a diameter at breast height (DBH17) ≥ 5 cm on both sites for determination of wood density (ADEN) and microfibril angle (AMFA). AMFA was measured by scanning the radial face of the last complete growth ring closest to the bark with a D8 Discover X-ray diffractometer (Bruker AXS, Madison, WI, USA). Diffraction profiles of a few samples were first calibrated by comparison with values obtained using compound light microscopy before the complete population of samples was scanned. Density was determined by scanning increment cores from pith to bark along the radial face at a resolution of 25 μm in the QTRS-01X Tree Ring Analyzer (Quintek Measurement Systems Inc., Knoxville, TN, USA). A weighted mean density trait was obtained by summing the product between annual density measures and the ratio of respective radial areas to total radial area. Before collecting wood cores, trees were measured for age 17 height (HT17) and diameter (DBH17) growth. Phenotypes outside ±3 phenotypic standard deviations following adjustment for fixed block effects were considered as outliers and set to missing. A total of 1,748 offspring, with at least one non-missing record among the four traits, were retained after cleaning the genotype data (see below); i.e. 851 and 897 from sites S1-C and S2-D, 841 and 907 from breeding groups BG1 and BG2, respectively (Table 1).

Table 1 Number of sampled trees across test sites (S) and breeding groups (BG)


Parents (n = 39) and progeny (n = 1,748) were genotyped with the white spruce Infinium SNP array PgLM3 [32] at a rate of about one SNP per target gene locus. SNP loci were cleaned by removing those loci that failed; they were monomorphic, displayed a call rate < 0.85, had a minor allele frequency (MAF) < 0.005 (less than 10 heterozygous individuals), or a mismatch with both parental genotypes. A total of 6,932 high quality SNPs with a mean MAF = 0.20 were retained for GS analyses. These SNPs were distributed among 6,918 distinct gene loci. Individual genotypes with call rates < 0.80 were removed. Missing genotypes (~ 2.7%) were imputed from the binomial distribution, with success probabilities given the observed allele frequencies.

In order to test for population structure and control for relatedness as an alternative to the numerator relationship matrix (NRM) for association testing, an identity by state (IBS), allele-sharing, realized genomic relationship matrix (GRM, K) was constructed as:

K=Q Q T /k= M P M P T /k

where M is an n × m (individuals × loci) matrix of SNPs coded as 0, 1 and 2 for genotypes 11, 12 and 22, respectively, and P is an n × m matrix with columns containing the frequency of the second allele p m (here the minor allele) at locus m. Subtracting P from M gives Q, which sets mean values of the allele effects (and thus BVs) to zero. Finally, dividing QQT by k = tr (QQT)/m (i.e. the mean of the diagonal) normalizes the GRM to a scale analogous to elements in the A matrix [33]. To test for population structure, the K matrix (= VDVT) was decomposed spectrally with the eigen() function in R v2.15.2 [34], where D is a diagonal matrix of eigenvalues and columns of V are the corresponding eigenvectors of K. Spectral decomposition of the K matrix made it possible to estimate the variation captured by the first eigenvector, which helped determine the extent of the population structure among the breeding groups.

Statistical models

Univariate polygenic model

The univariate, individual tree (“animal”) mixed model for each trait (t) was:

y t = X t b t + Z t a t + e t

where y t is a vector of phenotypic measurements at age 17, b is a vector of fixed block effects, a i is the vector of additive genetic (breeding) values (EBVs), e is the vector of residual error effects, and X and Z are incidence matrices linking phenotypic records (y) to estimates of effects. Expectation of the model is E(a), E(e) = 0 and E(y) = Xb, with additive genetic and error effects distributed as a~N 0 , A σ a 2 and e~N 0 , I σ e 2 where A is the NRM describing the additive genetic relationships among individuals and I is an identity matrix.

Bivariate polygenic model

To test the assumption that a trait measured in different environments is affected by the genotype-by-environment interaction (GEI), the univariate model can be extended to its bivariate counterpart by stacking up the vectors for the two traits as:

y 1 y 2 = X 1 0 0 X 2 b 1 b 2 + Z 1 0 0 Z 2 a 1 a 2 + e 1 e 2

where the total vectors of random effects have covariance structures var[(α1, α2)T] = G = CA and var[(e1, e2)T] = R = EI for the additive genetic and residual effects, respectively, where is the direct (kronecker) product and C and E are matrices containing the additive genetic and residual (co)variance parameters to be estimated as:

C= σ a 1 2 σ a 12 2 σ a 21 2 σ a 2 2 ,andR= σ e 1 2 0 0 σ e 2 2

which indicates that there is a genetic covariance across sites (with information flowing through A) but that the residual covariance across sites is zero because measures took place on different individuals in different environments.

Model parameters were sampled from the posterior distribution using the MCMCglmm v2.16 package [35] in R v2.15.2 [34]. Each model was run for 260,000 iterations with a burn-in of 60,000 and thinning interval of 200, resulting in 1,000 samples saved per chain. Fixed-effect priors were drawn from the default normal distribution with large (108) variance. Weak, proper priors drawn from the inverse-Wishart distribution were set to the phenotypic variance divided by the number of random effects in the model with the degree of belief parameter set to 1 and 2 for uni- and bivariate models, respectively. Mixing and stationarity of the chains was assessed with trace plots of samples from the posterior distribution using the coda package [36]. Posterior distributions of the parameters were summarized using the mode and the highest probability density (HPD) interval (95%) with the MCMCglmm and coda packages. Posterior distributions of heritability (h2) and genetic correlations between traits (r12) were calculated as h 2 = σ a 2 / σ a 2 + σ e 2 and r 12 = σ a 12 2 / σ a 1 2 x σ a 2 2 respectively. Estimated breeding values ((EBVs; â i ) obtained from the bivariate analyses were stored on the site where phenotypes were observed and were subsequently used to determine prediction accuracies with univariate SNP-based GEBVs (see next).

Following bivariate analyses, phenotypes were adjusted for fixed block effects and scaled by within-site phenotypic standard deviation in order to account for (small) differences in phenotypic variation for traits on different sites. No differences in genetic means between the breeding groups were detected. Phenotypes were then combined across sites for use in marker regressions, assuming that traits on different sites were a single trait with a genetic correlation equal to unity.

Marker regression models

In multiple-marker regression, many SNPs are simultaneously estimated as random effects in the individual tree model:

y s =1μ+ Z a + Z u +e

where y s is a vector of adjusted and standardized phenotypic measurements at age 17, 1 is a vector of ones, μ is an intercept, a i is the vector of additive genetic (breeding) values, u k is a vector of random marker effects with the n x m incidence matrix containing marker covariates coded as Z ki  = (0, 1, 2) so that the sum of marker effects approximates the individual (additive) genomic estimated breeding value (GEBV) g i = k = 1 m z i k u k , and e i is the vector of residual error effects. Additive genetic (polygenic) and error effects were assumed to be distributed as in eq. 2. The intercept was assigned a flat (uninformative) prior, whereas variance components were assigned a scaled inverse- χ2 prior density with degrees of freedom (df) and scale (S) parameters set to χ 2 σ e 2 S = 2.5 , d f = 3 , χ 2 σ α 2 S = 1.25 , d f = 3 , χ 2 σ u 2 | S = 3.7 x 10 4 , d f = 3 for the error, additive genetic and marker effects, respectively, as recommended in [37].

Two types of marker regression were performed that differ in the prior distribution of SNP effects. In ridge regression (RR), all SNP effects are assumed to have a common variance by assigning a Gaussian prior as u R R , k ~N 0 , σ u 2 . Under this assumption all marker coefficients are shrunk to the same extent, which is appropriate for traits controlled by many genes of small effect, similar to the assumptions of the infinitesimal pedigree-based (polygenic) model. In contrast, the LASSO allows for marker-specific shrinkage effects, making it appropriate for traits thought to be under control of fewer genes of moderate to large effect. In the LASSO, the distribution of SNP effects are assigned a double exponential (DE) prior as E u L , k λ Π k = 1 m N u L , k 0 , σ e 2 τ k 2 Π k = 1 m exp τ k 2 λ 2 p λ , which states that the marker-specific prior is Gaussian, with marker-specific shrinkage of effects depending on τ2, which is in turn controlled by the prior distribution of the regularization parameter (p(λ)) [37]. The regularization parameter (λ), which allows the degree of shrinkage to be estimated from the data, was assigned a prior Beta distribution as outlined in [37], with parameters shape = 0.52 and rate = 1 × 10−4 yielding λ = 57.5 for our data. Large values of λ produce more informative (sharper) priors on marker effects [26], which will shrink (truly) small effects towards zero to a greater degree than in RR. Pedigree-based (A) best linear unbiased prediction (BLUP), ridge regression (RR) BLUP, and LASSO (L) regression were performed using the BLR package [38] in R v2.15.2 [34].

Single-marker regression (SMR; association testing) was also conducted to test the hypothesis that individual marker loci were in partial or complete linkage disequilibrium with a causal gene and to delineate subsets of markers for further marker regressions. Marker effects were treated as fixed effects (unlike in eq. 5) in SMR and relatedness structure was controlled using the numerator (A) or genomic relationship (K) matrix using the EMMAX algorithm [39] in the rrBLUP (v4.1) package [40] in R (v2.15.2) [34]. Variance components are estimated only once in a base model without SNPs using EMMAX and, subsequently, each SNP is added to the model in turn, equations are solved, and a F-test is constructed for each SNP to test the hypothesis that the SNP effect u k  ≠ 0.

Traits were modeled using 1) all individuals with all SNPs and/or the pedigree and 2) subpopulations of individuals with all or subsets of SNPs and/or the pedigree in cross-validation (CV). SNP subsets for CV were retained for reevaluation using RR in five different ways. First, the SNPs with the largest absolute effects (P < 0.05) using SMR were retained, controlling for relatedness with A or K. Next, the 600 SNPs with largest absolute effects were retained from the full model (all individuals and SNPs) using RR and pedigree information (FM-RRA), which approximated the maximum number of significant SNPs from SMR. SNP effects were estimated in a subpopulation (see below) and subsequently re-evaluated using RR in the same (e.g. BG1) or complementary subpopulation (e.g. BG1 → BG2). In the former case, SNP effects treated as fixed effects in SMR may be biased when estimated and validated in the same non-independent data [41, 42]. Finally, as controls, two sets of 600 SNPs were built by selecting SNPs randomly or by selecting those with the highest observed minor allele frequencies. For each of these controls, the selected SNPs remained the same across traits and subpopulations.

For cross-validation (CV) analyses, individuals (CV1) or full-sib families (CV2) were assigned to 10 folds randomly. In each of 10 rounds of CV, 9 folds acted as the training set in which SNP effects were estimated in order to predict the genetic values of individuals in the remaining fold, whose phenotypes had been withheld. For models tested across subpopulations (BG1 → BG2 or S1 → S2), that is training the models in one subpopulation and testing them in the other one, each CV round used 9 of 10 folds as the training set from subpopulation 1, with a different fold used as the testing set for each round of CV from subpopulation 2. Prediction accuracies were calculated as the correlation (cor (ĝ, â)) between the sum of CV SNP estimates (ĝ) or GEBVs and the additive genetic (breeding) value (â) or EBVs estimated using the bivariate, pedigree-based model and all phenotypic data, which was considered the best estimate of the “true” breeding value.

Subsets of individuals with mean training and testing set sizes of 787 and 87 individuals, respectively, were used to perform marker regressions and test prediction accuracy four different ways:

  • BG1 (or BG2) – within-breeding group CV simulates a situation where related individuals have already been phenotyped on both (or similar) sites, as BG members were distributed across sites. GEI could lower prediction accuracy if present.

  • BG1 → BG2 – between-breeding group CV removes all known relatedness between CV sets, establishes baseline accuracy due to (short-range) LD and thus tests if the model is transferable beyond the study material. As members of both BGs are present in both environments, GEI could have an impact.

  • S1 (or S2) – within-site CV simulates a situation where models are trained by pooling individuals across BGs, with no known relatedness between BG. Evaluation occurs within site, thus GEI cannot affect the results.

  • S1 → S2 – between-site CV simulates a situation where no phenotypes have been observed on related individuals in the second environment, testing the assumption that the traits on different sites have a genetic correlation of one.


Univariate heritabilities (not shown) were similar to their bivariate counterparts (Table 2), suggesting homogeneous phenotypic variance across sites. Although the genetic correlation (r 12 ) was not equal to 1 across sites (Table 2), it was moderately high (for growth traits; r 12  = 0.73), to high (for wood quality traits; r12 = 0.83). This implied that, biologically, the GEI was not strong enough to warrant the more complex (bivariate) model for wood traits (less so for growth traits) and that data might be pooled across sites and considered as a single trait.

Table 2 Bivariate heritabilities ( h 2 ), genetic correlations ( r 12 ), and highest probability density interval (95%) for wood quality and growth traits considered as correlated traits when measured on different sites

Spectral decomposition of the K matrix revealed that < 5% of the variation was captured by the first eigenvector. This indicated that population structure was not important among the breeding groups and that A or K was sufficient to capture the relatedness among individuals. The lack of phenotypic differences observed between the two breeding groups, BG1 and BG2, supported this lack of structure. The K matrix captured a low level of relatedness between the two BGs (Figure 1), implying that some pedigree errors or cryptic relatedness between groups was present. The elements in K underestimated the corresponding non-zero, expected relationship values in A, with median (first and third quartiles: range) values for half- and full-sibs of 0.20 (0.15, 0.25; −0.12 – 0.90) and 0.44 (0.38, 0.49; −0.09 – 1.05), respectively.

Figure 1

The realized genomic relationship ( K ) matrix. K matrix sorted by breeding group (BG) and full-sib family, with BG1 in the upper left and BG2 in the lower right.

Full model with all SNPs

Modeling all individuals (n = 1,748) and SNPs (m = 6,932) simultaneously with ridge regression (FM-RR), which assumes a common variance across SNPs, only improved model fit (lower DIC) and reduced the error variance for ADEN, compared with the pedigree-based model (FM-A) (Table 3). For the other three traits (AMFA, HT17, DBH17), FM-RR resulted in poorer model fits and inflated error variance, signifying either that the pedigree contained valuable information not captured by the markers (despite some potential pedigree errors) and/or that markers were overfitting to spurious effects in the data. Comparing models with both pedigree and marker (FM-RRA) and pedigree-only (FM-A) information confirmed that the pedigree accounted for a large proportion of the variance, even with markers present in the model. However, both sources of information (FM-RRA) resulted in better fits and lower error variances compared with both the FM-A and the FM-RR models (Table 3), implying that markers were partially capturing non-redundant information to the pedigree.

Table 3 Full-model posterior means of variance components using all SNPs ( m= 6,932), all individuals (n = 1,748), and deviance information criterion (DIC) for wood quality and growth traits

Allowing marker-specific shrinkage with the LASSO (FM-L and FM-LA) resulted in only very minor improvement in model fit and reduced error variance over the corresponding models using a common SNP variance (FM-RR and FM-RRA) for all traits (Table 3). The variance of the λ parameter increased from the starting values, indicating a tendency to shrink marker effects more severely towards zero (Table 3). Again, pedigree information proved important in the FM-LA model, allowing λ to further increase, providing even sharper shrinkage. Nevertheless, correlations between SNP effect estimates in complementary models (FM-L vs FM-RR, FM-LA vs FM-RRA) were very high (r ≥ 0.98, data not shown) and, given the limited improvement in LASSO fits, this model was not further considered.

Cross-validation (CV1: individual assignment) with all SNPs

For CV1, prediction accuracies were high for pedigree-only (CV1-A) models because the CV sets were highly related. Marker-only models (CV1-RR) captured a high proportion (0.87 ≤ 0.92) of the CV1-A prediction accuracy for all traits, when modeled within breeding groups (BG1) or environment (S1) (Table 4). Modeling both pedigree and marker effects (CV1-RRA) led to prediction accuracies that approached but did not reach those obtained using the pedigree-only model, meaning that the previously seen better model fits for FM-RRA (Table 3) did not necessarily translate into better predictability in CV. This might have been due to the overfitting of marker effects and/or high multicollinearity between the marker and pedigree components. The same pattern was observed within BG2 and S2 (data not shown). Prediction accuracies were reduced somewhat for wood traits (ADEN, AMFA) across models when the training and testing sets were in different environments (S1 → S2) compared to evaluation within the same environment (S1, Table 4). However, this reduction in accuracy was much more pronounced for growth traits (HT17, DBH17), especially for DBH17. Prediction accuracies were near zero for all traits across models when training and testing sets were in different, genetically unrelated, breeding groups (BG1 → BG2, Table 4), as suspected from previous results with half-sib families [18].

Table 4 Prediction accuracies for wood quality and growth traits estimated with cross-validation sets built with individuals within full-sib families using pedigree (CV1-A), ridge regression (CV1-RR) or both (CV1-RRA) sources of information with all SNPs ( m= 6,932) and various subpopulations of individuals

Cross-validation (CV2: family assignment) with all SNPs

Overall in cross-validation using family assignment (CV2) where relatedness between CV sets did not exceed those of half-sibs, prediction accuracies decreased sharply (e.g. 0.12 – 0.41 for BG1 and S1 with RR) for all models, traits and subpopulations (Table 5), compared with those in CV1, i.e. cross-validation where individuals were assigned to the various sets (Table 4). However, the same trend was also observed for the pedigree-based prediction (CV2-A) and as for CV1, the models based on markers only (CV2-RR) captured between 75% and 104% of the CV2-A prediction accuracy (Table 5). As CV1 and CV2 evaluate exactly the same set of individuals for BG1 and S1, differences in prediction accuracy can only be due to lower relatedness between CV sets, not to different LD patterns across different sets of individuals. That is, if markers were in LD with causal loci, prediction accuracies for marker-based models (CV2-RR) should be superior to those of pedigree-based models (CV2-A), which was not usually observed. Part of the lower prediction accuracy in CV2-RR was captured with CV2-RRA, i.e. when combining pedigree and marker information, meaning that even the weaker pedigree structure (i.e. half-sibs; some families across the training and testing sets sharing one of the parents but not both) provided useful information not captured when using markers only. Furthermore, this lower relatedness between training and testing sets in CV2 resulted in three times greater variability in prediction accuracy among CV folds (MSE, Table 5) than in CV1. Similar to CV1, CV2 prediction accuracies were higher within the same environment (S1) than when predicting between environments (S1 → S2), where prediction accuracies for growth traits were more reduced than those for wood traits (Table 5). As suspected, by removing all known relatedness between training and testing sets (BG1 → BG2), prediction accuracies for RR models fell to around zero for all traits. The fact that the expected prediction accuracy between BGs (BG1 → BG2) for the pedigree-only model (CV2-A) was close to zero indicates that sampling error might have contributed to small, non-zero prediction accuracies for models including markers, such as for wood density (Table 5).

Table 5 Prediction accuracies for wood quality and growth traits estimated with cross-validation sets built with full-sib families, using pedigree (CV2-A), ridge regression (CV2-RR) or both (CV2-RRA) sources of information with all SNPs ( m= 6,932) and various subpopulations of individuals

Cross-validation with SNP subsets

For SNP subsets that were identified and re-evaluated within the same population (BG1 or S1), all model-based methods used to select subsets performed relatively well (Table 6). SNP subsets selected for their largest absolute effects had accuracies slightly superior to those of randomly selected (CVSSRAN) or highest minor allele frequency (CVSSHMAF) subsets for growth traits, but differences for wood traits were less clear. Prediction accuracies were slightly higher when SNP effects had been treated as fixed effects (CVSSA or CVSSK) and then re-evaluated in the same population (BG1 or S1). This trend suggests that, in some cases, completely eliminating many markers of supposedly zero effect did not hamper much the estimation of the fewer “true” marker effects, as was previously hypothesized from results obtained using the LASSO models. However, in many instances, the accuracy estimates were slightly lower than those obtained with the full set of markers. This might be due to the removal of some markers that were in LD with QTLs associated with the traits of interest.

Table 6 Prediction accuracies for wood quality and growth traits estimated with cross-validation sets built with individuals within full-sib families for wood quality and growth traits using ridge regression (RR) for subsets (SS) of SNPs from single-marker regression controlling for A or K , the largest absolute effects from the full model (RRA), the highest minor allele frequencies (HMAF), and a random selection (RAN) for various subpopulations of individuals

When SNP subsets selected in BG1 were re-evaluated in a complementary subpopulation (BG2) with no known relatedness to BG1, prediction accuracies fell to near zero for all subsets and traits (BG1 → BG2, Table 6), similarly to what was observed with the full set of markers (Table 4). For models built on site S1 and evaluated in S2 (S1 → S2), prediction accuracies were moderate for wood traits (r > 0.44), but low for growth traits (r > 0.17, Table 6); again, they were lower than those obtained with all the markers (Table 4).


Accuracy of GS models under various scenarios

It is clear from this and other studies on forest tree species e.g. [18, 19], that current moderately dense SNP panels can be used to capture additive genetic effects remarkably well under certain conditions. GS model accuracy is known to be affected strongly by N e and marker density. For instance, it was shown for a white spruce training set of about the same size as the one used in the present study, and with similar marker density, that prediction accuracy of less than 0.40 could be achieved when the effective population size was over 600 [18]. However, although the number of SNPs in [18] and in this study was about the same, the number of distinct gene loci involved in the present study was more than twofold. This higher genome coverage might have also contributed slightly to improve accuracy estimates as expected from simulation studies for similar population sizes [4]. However, such improvement in accuracy appears to be modest at best, as indicated by accuracies obtained with subsets of random markers or with those with the highest MAF, which were almost the same as those estimated using all markers (Tables 4 and 6).

The prediction accuracy is also affected, to a lesser extent, by trait heritability and the training set size, once a minimum of n ≈ 1,000 – 2,000 individuals has been reached [4]. Given the number of SNPs (m = 6,932) and distinct loci (l = 6,918) used in this study and a white spruce genome size of ~ 2,100 cM [43], one would expect ~ 3.3 SNPs/cM if marker coverage was evenly distributed across the genome. At 2 – 3 SNPs/cM, N e  = 30 – 60 and a training set with n = 1,000 individuals, one should theoretically achieve GS model accuracies of 0.55 – 0.70 [4]. This was seen in this study (and surpassed in some cases) for predictions within breeding groups (Ne ≈ 20) or for the same environment when combining both breeding groups (Ne ≈ 40) using slightly smaller (mean n = 787) training sets. However, as for pedigree-based models, prediction accuracy of marker-based models decreased and fell near zero when close (CV2, Table 5) and all known relatedness (BG1 → BG2, Tables 4 and 5) was removed between the training and testing sets, respectively, similarly to what was observed in another GS study on a white spruce population of larger effective size [18]. This suggests that short-range LD, i.e. that between SNPs and causal loci still persisting in populations after several generations of random mating, might be more limited than expected, or that only a few of the SNPs used were in LD with causal loci (QTLs). Such an observation is reinforced by the trends in accuracy values seen when using subsets of random markers or those with the highest MAF, where accuracies did not decrease much compared with those estimated using all markers (Tables 4 and 6).

Low prediction accuracy between breeding groups in our study was not due to lack of modeling a group factor, as negligible phenotypic and genotypic differences were observed among breeding groups. This is consistent with the very weak genetic differentiation seen among white spruce populations in Québec [44, 45], making the 39 base parents in our study members of one large historical population. These base parents are still mostly unrelated, as spruce pedigrees in Québec have only been documented for two generations. In cattle, 50 K SNPs were deemed adequate to capture causal loci within breed, whereas 300 K SNPs were recommended for consistent cross-breed prediction accuracies, based on simulations and historical knowledge of N e [46]. As expected, empirical cross-breed predictions in cattle fell to zero using 50 K SNPs, although combining breeds into a single training set resulted in prediction accuracies comparable to within-breed predictions [13]. At first glance, this appears analogous to predictions across and within breeding groups obtained in the present study, especially given that when we simultaneously considered both breeding groups when training the genomic prediction models, accuracies did not decrease much compared with those obtained by training models specific to each breeding group. When combining both breeding groups, a breeding group structure was introduced, although short-range LD should not differ between these breeding groups as they are believed to belong to the same ancestral population [18]. Any minor differences between breeding groups would have been absorbed by the marker effects thus biasing them, leading to prediction accuracies that may not be superior to modeling known subpopulation memberships [14].

Reducing marker density and impact on prediction accuracies

Removing uninformative loci from the models appeared to have a positive, although not statistically significant effect on cross-validation for some traits within breeding group (BG1) or site (S1). Compared with the prediction accuracies obtained with all markers, a slight increase was indeed seen when using markers found significantly associated with traits when using either pedigree information (CVSSA) or realized genomic relationships (CVSSK), or markers with the largest absolute effects (CVSSRRA) (Table 6). This increase in accuracy can certainly be accounted for, at least in part, by the fact that these subsets of significant markers were not identified in an independent dataset [41, 42], and the accuracy estimates obtained could consequently be slightly biased upwards, thus accounting for the slight increase observed in accuracy. Such an increase in prediction accuracy was not observed for subsets of random markers, nor for those of highest minor allele frequency.

In a recent study on wood and growth traits in loblolly pine full-sib families, no increase in prediction accuracy was reported for various subsets of markers selected after association studies or randomly selected markers up to a maximum number of about 3,400 markers [20]. However, in another loblolly pine population that was clonally replicated, two different patterns were observed with regard to the influence of marker density on predictive ability [9]. For some traits such as growth, development and wood traits, the maximum value was reached with a few hundred markers and this value did not vary with a larger number of markers (total of 4,825 SNPs), whereas for traits such as wood density and disease-resistance-related traits, a small number of markers associated with these traits (100 to 500 markers) made it possible to reach the maximum value, which decreased with the addition of more markers. For wood density, the predictive ability obtained using subsets of several hundred markers was somewhat higher than, but not different from, subsets of randomly selected markers of similar size, as observed in the present study on white spruce. In a cattle population, genotypes from a 54 K SNP array made it possible to increase, on average, the prediction accuracies for daughter yield deviations by about 0.22 over those obtained with the pedigree information only, which represents an increase of about 60% [27]. Although removing uninformative SNPs did not reduce predictive ability estimates obtained with a 54 K SNP array, the high density arrays used allowed up to tens of thousands of SNPs to remain in the final model. These various results suggest that higher prediction accuracy might also be obtained in trees with subsets of markers significantly associated with the traits, but likely not until much higher density arrays than the current ones are available. The BayesCπ model [47] could be useful for single-step subset selection at higher marker densities because it appears to align more closely with the distribution of marker effects for wood and growth traits. BayesCπ simultaneously performs variable selection and shrinkage of SNP effects remaining in the model with a common variance, although empirical studies in plants have not proven it superior to the simpler RR model [9, 28].

Capture of relatedness and long-range and short-range linkage disequilibrium

The observation that GS prediction accuracies sharply decreased when training and validation individuals were completely unrelated (BG1 → BG2) may be due to various causes. The first could be related to the fact that the long-range LD (co-segregating linkage blocks) generated through controlled crosses among the closely related individuals making up the training set may not be present in a non-related testing set. Linkage studies have shown that marker effects are generally limited to the family (genetic background) in which they were estimated [48] because the full variation of the causal loci in the population is not sampled. Linkage has nevertheless been found to be useful to increase genetic gain, even when no major effects were present, by increasing the precision of the relationship matrix [49]. In GS, where estimation occurs over the entire population, co-segregation of linkage blocks (long-range LD) was found to contribute significantly to prediction accuracy across full-sib families in maize [50]. In white spruce breeding groups such as those used in the present study, the contribution of long-range LD is also likely important, but it would decay more rapidly in subsequent generations than short-range LD [50]. However, combining full-sib families of several breeding groups could make it possible to develop GS models with high accuracies, as it was observed in the present study when individuals were pooled across breeding groups (scenarios S1 or S1 → S2). The persistency of that accuracy will depend on how quickly those linkage blocks decay over generations.

A complementary explanation for the loss of prediction accuracy among unrelated individuals could be that markers were largely capturing existing relationships between CV training and testing sets [15, 16]. It has been reported that genomic prediction accuracies can increase with the number of markers even in the absence of LD because markers capture additive relationships, but these accuracies will only be superior to the pedigree-based ones if markers are in LD with causal loci [12]. In line with [12], we observed in this study that the marker-based (RR) models implicating all markers or subsets of markers did not perform in most cases as well as the pedigree-based models using all the available trees or in most cases of cross-validation. Also in agreement with [12], prediction accuracies generally increased with the marker density used (m = 600 versus m = 6,932), but only slightly (Table 4: all markers versus Table 6: subsets of random markers). These observations imply that larger family sizes (n ≥ 100 individuals per family) might be needed in order to demonstrate the superiority of marker- over pedigree-based prediction accuracies [51] or that the contribution of LD to prediction accuracy was limited. For the former, more family members would allow SNP effects to be estimated more accurately, which would increase prediction accuracy [50]. The latter would mean that prediction accuracies are specific to a group of closely related individuals and may be overestimated because the larger contributions from relatedness and smaller contributions from short-range and long-range LD cannot be disentangled. This observation would argue in favor of GS models developed and implemented within the same population (or within a group of pooled populations, as discussed above) at least for the time being and considering the financial resources generally available to breeders.

One way to think about the level of LD in a population is to consider the relationship matrix estimated with the markers, which can be partitioned into 1) the expected (known) relationships obtained from pedigree information, 2) the deviations around these relationship values due to Mendelian sampling, i.e. those due to the fact that each offspring receives a different set of genes from its parents and that each offspring is different from its expected degree of kinship, and 3) the sampling error [52]. At high marker densities, the Mendelian sampling component can vary, for example, by as much as a ij  = 0.4 − 0.6 [53, 54] relative to the pedigree-based relationship coefficient (expected degree of kinship) of a ij  = 0.5 for full-sibs, and can be used to approximate the level of LD in a population [52]. Thus, to surpass pedigree-based prediction accuracies, it is clear that the precision of relatedness in the numerator relationship matrix (based on expected degree of kinship) needs to be increased. At high marker densities, genetic parameters can be estimated solely from Mendelian sampling components [53], implying that the genomic relationship matrix can be used to model the covariance among distantly related individuals [55], which establishes baseline accuracy due to LD [56]. In this case, however, large (training) population sizes would be required as the precision of quantitative genetic parameters is inversely proportional to the size of the estimated relatedness [55]. In our study, genomic relatedness likely underestimated pedigree-based relatedness, given that the scaling of the genomic relationship was appropriate. This observation suggests that many more markers would be necessary to obtain prediction accuracies superior to those using pedigree information. On the other hand, the “true” breeding values for wood and growth traits of the white spruce population considered in this study are not known and have been estimated using the pedigree information of the complete population and the bivariate individual-tree mixed model. One might think that “true” breeding values are closer to breeding values estimated with the realized genomic relationship matrix, which might influence the comparison of the pedigree-based and marker-based accuracies. As an attempt to answer this question, we re-estimated the accuracies replacing the “true” breeding values by the phenotypes, which are not influenced by the method of estimation as are the breeding values. For most of the scenarios tested, accuracies based on pedigree information were higher than either the genomic-based estimated breeding values or those estimated with combined genomic and pedigree information. Thus, this observation suggests that using the pedigree-based estimated breeding values as proxy for “true” breeding values has no significant impact on the estimation of prediction accuracies.

Genotype-by-environment interaction

The results obtained with the two CV scenarios that tested the impact of GEI (BG1 and S1 → S2) were different for wood and growth traits. Wood traits were relatively unaffected by GEI in the bivariate model (r12 = 0.83), which was reflected in high (BG1) and moderately high (S1 → S2) prediction accuracies when phenotypes obtained in the second site (S2) were included or not in the training model, respectively (Table 4). These good prediction accuracies between environments would support the pooling of wood trait observations across sites and treat them as a single trait over environments. The situation might be different in more heterogeneous environments where GEI might be more important, as generally reported for western North America for instance [57], thus necessitating a careful case-by-case analysis.

Growth traits were more sensitive to GEI than wood traits, as prediction accuracies fell to moderate (BG1) and low (S1 → S2) levels, when phenotypes measured in the second site (S2) were included or not in the training model, respectively (Table 4). Such GEI was previously observed to be modest for growth traits in white spruce, though statistically significant [58]. This trend suggests that not only could a GEI component be beneficial in the statistical model, but information on relatives (markers or pedigree) alone is insufficient to predict genetic values in an environment where phenotypes have not been assessed.

Borrowing correlated information (phenotype of related individuals assessed on all experimental sites) to make predictions across environments was found useful in wheat [29], which may have been facilitated by the evaluation of homogeneous genetic (inbred) lines as opposed to the families or populations used in forestry, which are genetically heterogeneous. For both wood and growth traits, removing information quality (closeness of relatedness, Table 5) and quantity (number of markers, Table 6) between cross-validation sets weakened prediction accuracies, emphasizing the importance of close relatedness when making predictions across environments. When looking at growth traits in particular, it is encouraging to notice that the decrease in accuracy between sites was less pronounced for height than for DBH. Height is generally the preferred selection trait for fast growth by tree breeders [59]. This observation is in line with results reported for low GEI in white spruce for this trait [58], and also with the small reduction in accuracy observed across sites in a previous application of genomic selection to white spruce half-sib families in eastern Canada [18].

The bivariate model, which simultaneously considers the same trait measured on trees tested on two sites as different correlated traits, could be extended to a multivariate or reduced rank factor analytic model [29, 60] if there was an interest in fine-tuning the regional performance of seed orchards. Of course, this does not preclude combining traits over sites if overall performance in a large target environment is sought (selection of generalists instead of specialists). More importantly, it may be of interest to generate GEI for wood traits with silvicultural treatments [61] if interactions under intensive vs extensive management are anticipated.

Implementation of genomic selection in spruce breeding programs

In practice, our results indicate that future prospects for genomic selection in spruces should take place within populations of small effective size combining several breeding groups, and thus implicating individuals of high relatedness within breeding groups. To recapitulate, this is first because marker-based model accuracies do not hold when removing relatedness between CV sets, indicating that with the current marker densities the overall short-range LD between markers and causal loci is either too low to be of practical use or not sufficiently well captured. With the proposed approach, reduction in marker coverage as shown in Table 6 would not have a significant negative impact on prediction accuracies, but as only long-range LD and relatedness are likely captured, prediction accuracy may not be persistent over many generations. Second, as expected, within-population prediction accuracies were high in this study, where effective population size was considerably constrained, and higher than those obtained in a much larger and genetically diverse spruce population with lower degree of relatedness [18]. However, it is clear that GS models could be developed for larger populations, as illustrated in the tested scenario S1, which combines breeding groups G1 and G2, without significantly affecting prediction accuracies (Table 4). Given the number of breeding groups tested, we could not determine what should be the most effective population size for a successful operational use of markers, but it is apparent that it could be at least 50, and maybe up to 100 [4], without losing too much prediction accuracy.

The obvious caveat for the proposed approach is that it is not possible to clearly separate the contributions of both relatedness and long-range LD (co-segregation) from prediction accuracy. Long-range LD may still be important if its contribution to prediction accuracy does not decay as fast as that due to relatedness, especially with recurrent selection, i.e. in a population with the same genetic background. The use of multigenerational data might be one way to disentangle the two sources of information.

A simple form of GS that does not depend on LD would consist of replacing the pedigree-based numerator relationship matrix with the genomic relationship matrix in the standard individual (animal) BLUP model [19], or to blend the two sources of information [62, 63], which is functionally equivalent to estimating and summing SNP effects using ridge regression [12, 51, 64]. We have shown that marker-based models could achieve, in most cases, almost 90% of the accuracy of pedigree-based models, and that data from breeding groups and sites could be combined without noticeably affecting the accuracy results. Based on these findings, we believe that kinship based on realized genomic relationships would be especially useful if pedigrees were unknown or incomplete (e.g. when using open or supplemental mass (pollimix) pollination in closed breeding orchards) to capture any cryptic relatedness not accounted for in the “known” pedigree (half-siblings only, for instance), to correct pedigree errors, and to overcome some of the strong assumptions of using the numerator relationship matrix in the pedigree-based BLUP [22]. This scenario would likely make it possible to obtain prediction accuracies that are higher than those obtained with incomplete pedigree information, and at the same time, to reduce breeding and testing costs. With the populations used in the present study, and ignoring knowledge on male contribution, the relative prediction accuracy of RR models over pedigree-based models (half-sib models) would have varied from 115% to 162%.

Another simple GS scenario that does not depend on our ability to separate the source of prediction accuracy would be to conduct genomic prediction within full-sib families [14]. In this case, the pedigree provides no information, LD would be high, and there would be no risk of linkage phases changing over families. The idea would be to generate large full-sib families (n = 200 and more), phenotype a fraction of the individuals (e.g. 25%), and select both phenotyped and unphenotyped individuals [14] for propagation. Although not tested in this study, the method could be useful in multiclonal forestry to identify superior genotyped seedlings derived from somatic embryos. For embryos originating from previously field-tested and genotyped members of the same cross, no additional field testing would be required. The difference between this method and genomic selection for recurrent selection is that the identified individuals would be propagated and, thus, marker effects tracing large linkage blocks would not need to persist in subsequent generations. Data could be pooled over families (and groups) as done in this study. With such a scenario and the strict assumptions under which this genetic gain potential might be realized, the breeding cycle for the set of families retained could be significantly reduced and gain per time unit more than double, i.e. the gain per year of breeding activities, which includes selection, crossing, and propagation phases (Table 7 and [18]).

Table 7 Potential genetic gain as estimated empirically from a 5% selection intensity made within families using EBVs (based on pedigree information) and GEBV (based on all SNPs [ m= 6,932]), and expected gain per time unit under the assumption that with use of markers and somatic embryogenesis, the breeding cycle could be reduced from 30 to 10 years by avoiding the 20-year field test and production phases


In the short term, genomic selection would be most useful to select superior individuals within large full-sib families. These individuals would be reproduced by vegetative propagation techniques such as somatic embryogenesis (SE), or a mix of SE and rooted cuttings, in order to develop a multivarietal forestry program. With regard to genomic selection for recurrent selection at current marker densities in spruce populations, its efficiency would depend on the presence of close relatives in order to exploit the capture of relatedness and supposedly long-range LD. Thus, a simple marker-based approach that aim to uncover relatedness when pedigrees have not been recorded or are only partially known would seem to be the most cost-effective application of this technology. For such a scenario, the total number of SNPs required would be only a few hundreds to a few thousands. In the longer term, superior marker-over pedigree-based predictions may be achievable in descendant generations of closed populations as short-range LD increases and pedigrees provide less information. However, to achieve this goal, denser marker arrays, more individuals per family, and more sophisticated experiment designs and genomic selection models [17] than those used in the current study will likely need to be considered. A step towards achieving this goal in forest genetics might also be the estimation across generations in order to better evaluate the persistency of marker effects.


  1. 1.

    Meuwissen THE, Hayes BJ, Goddard ME: Prediction of total genetic value using genome-wide dense marker maps. Genetics. 2001, 157: 1819-1829.

    CAS  PubMed Central  PubMed  Google Scholar 

  2. 2.

    Daetwyler HD, Villanueva B, Bijma P, Woolliams JA: Inbreeding in genome-wide selection. J Anim Breed Genet. 2007, 124: 369-376. 10.1111/j.1439-0388.2007.00693.x.,

    CAS  PubMed  Article  Google Scholar 

  3. 3.

    Crossa J, Perez P, Hickey J, Burgueño J, Ornella L, Ceron Rojas J, Zhang X, Dreisigacker Babu R, Li Bonnett D, Mathews K: Genomic prediction in CIMMYT maize and wheat breeding programs. Heredity. 2014, 112: 48-60. 10.1038/hdy.2013.16.,

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  4. 4.

    Grattapaglia D, Resende MDV: Genomic selection in forest tree breeding. Tree Genet Genomes. 2011, 7: 241-255. 10.1007/s11295-010-0328-4.,

    Article  Google Scholar 

  5. 5.

    Iwata H, Hayashi T, Tsumura Y: Prospects for genomic selection in conifer breeding: a simulation study of Cryptomeria japonica. Tree Genet Genomes. 2011, 7: 747-758. 10.1007/s11295-011-0371-9.,

    Article  Google Scholar 

  6. 6.

    Denis M, Bouvet JM: Efficiency of genomic selection with models including dominance effect in the context of Eucalyptus breeding. Tree Genet Genomes. 2013, 9: 37-51. 10.1007/s11295-012-0528-1.,

    Article  Google Scholar 

  7. 7.

    Resende MFR, Muñoz P, Acosta JJ, Peter GF, Davis JM, Grattapaglia D, Resende MDV, Kirst M: Accelerating the domestication of trees using genomic selection: accuracy of prediction models across ages and environments. New Phytol. 2012, 193: 617-624. 10.1111/j.1469-8137.2011.03895.x.,

    PubMed  Article  Google Scholar 

  8. 8.

    Resende MDV, Resende MFR, Sansaloni CP, Petroli CD, Missiaggia AA, Aguiar AM, Abad JM, Takahashi EK, Rosado AM, Faria DA, Pappas GJ, Kilian A, Grattapaglia D: Genomic selection for growth and wood quality in Eucalyptus: capturing the missing heritability and accelerating breeding for complex traits in forest trees. New Phytol. 2012, 194: 116-128. 10.1111/j.1469-8137.2011.04038.x.,

    PubMed  Article  Google Scholar 

  9. 9.

    Resende MFR, Muñoz P, Resende MDV, Garrick DJ, Fernando RL, Davis JM, Jokela EJ, Martin TA, Peter GF, Kirst M: Accuracy of genomic selection methods in a standard data set of loblolly pine (Pinus taeda L.). Genetics. 2012, 190: 1503-1510. 10.1534/genetics.111.137026.,

    PubMed Central  PubMed  Article  Google Scholar 

  10. 10.

    Wong CK, Bernardo R: Genomewide selection in oil palm: increasing selection gain per unit time and cost with small populations. Theor Appl Genet. 2008, 116: 815-824. 10.1007/s00122-008-0715-5.,

    CAS  PubMed  Article  Google Scholar 

  11. 11.

    Kumar S, Chagné D, Bink MCAM, Volz RK, Whitworth C, Carlisle C: Genomic selection for fruit quality traits in apple (Malus × domestica Borkh.). PLoS One. 2012, 7: e36674-10.1371/journal.pone.0036674.,

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  12. 12.

    Habier D, Fernando RL, Dekkers JCM: The impact of genetic relationship information on genome-assisted breeding values. Genetics. 2007, 177: 2389-2397.,

    CAS  PubMed Central  PubMed  Google Scholar 

  13. 13.

    Hayes BJ, Bowman PJ, Chamberlain AC, Verbyla K, Goddard ME: Accuracy of genomic breeding values in multi-breed dairy cattle populations. Genet Sel Evol. 2009, 41: 51-10.1186/1297-9686-41-51.,

    PubMed Central  PubMed  Article  Google Scholar 

  14. 14.

    Windhausen VS, Atlin GN, Hickey JM, Crossa J, Jannink J-L, Sorrells ME, Raman B, Cairns JE, Tarekegne A, Semagn K, Beyene Y, Grudloyma P, Technow F, Riedelsheimer C, Melchinger AE: Effectiveness of genomic prediction of maize hybrid performance in different breeding populations and environments. Genes Genomes Genet. 2012, 2: 1427-1436.,

    Google Scholar 

  15. 15.

    Legarra A, Robert-Granié C, Manfredi E, Elsen JM: Performance of genomic selection in mice. Genetics. 2008, 180: 611-618. 10.1534/genetics.108.088575.

    PubMed Central  PubMed  Article  Google Scholar 

  16. 16.

    Albrecht T, Wimmer V, Auinger H-J, Erbe M, Knaak C, Ouzunova M, Simianer H, Schön CC: Genome-based prediction of testcross values in maize. Theor Appl Genet. 2011, 123: 339-350. 10.1007/s00122-011-1587-7.,

    PubMed  Article  Google Scholar 

  17. 17.

    Makowsky R, Pajewski NM, Klimentidis YC, Vazquez AI, Duarte CW, Allison DB, De Los Campos G: Beyond missing heritability: prediction of complex traits. PLoS Genet. 2011, 7: e1002051-10.1371/journal.pgen.1002051.,

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  18. 18.

    Beaulieu J, Doerksen T, Clément S, MacKay J, Bousquet J: Accuracy of genomic selection models in a large population of open-pollinated families in white spruce. Heredity. 2014, 113: 343-352. 10.1038/hdy.2014.36.,

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  19. 19.

    Zapata Valuenzuela J, Whetten RW, Neale DB, McKeand SE, Isik F: Genomic estimated breeding values using genomic relationship matrices in a cloned population of loblolly pine. Genes Genomes Genet. 2103, 3: 909-916.,

    Google Scholar 

  20. 20.

    Zapata Valenzuela J, Lsik F, Maltecca C, Wegrzyn J, Neale D, McKeand S, Whetten R: SNP markers trace familial linkages in a cloned population of Pinus taeda – prospects for genomic selection. Tree Genet Genomes. 2012, 8: 1307-1318. 10.1007/s11295-012-0516-5.,

    Article  Google Scholar 

  21. 21.

    Whittaker JC, Thompson R, Denham MC: Marker-assisted selection using ridge regression. Genet Res. 2000, 75: 249-252. 10.1017/S0016672399004462.

    CAS  PubMed  Article  Google Scholar 

  22. 22.

    Piepho HP: Ridge regression and extensions for genomewide selection in maize. Crop Sci. 2009, 49: 1165-1176. 10.2135/cropsci2008.10.0595.,

    Article  Google Scholar 

  23. 23.

    Brown GR, Bassoni DL, Gill GP, Fontana JR, Wheeler NC, Megraw RA, Davis MF, Sewell MM, Tuskan GA, Neale DB: Identification of quantitative trait loci influencing wood property traits in loblolly pine (Pinus taeda L.). III. QTL verification and candidate gene mapping. Genetics. 2003, 164: 1537-1546.

    CAS  PubMed Central  PubMed  Google Scholar 

  24. 24.

    Ukrainetz NK, Ritland K, Mansfield SD: Identification of quantitative trait loci for wood quality and growth across eight full-sib coastal Douglas-fir families. Tree Genet Genomes. 2008, 4: 159-170. 10.1007/s11295-007-0097-x.,

    Article  Google Scholar 

  25. 25.

    Beaulieu J, Doerksen T, Boyle B, Clément S, Deslauriers M, Beauseigle S, Blais S, Poulin PL, Lenz P, Caron S, Rigault P, Bicho P, Bousquet J, MacKay J: Association genetics of wood physical traits in the conifer white spruce and relationships with gene expression. Genetics. 2011, 188: 197-214. 10.1534/genetics.110.125781.,

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  26. 26.

    De Los Campos G, Naya H, Gianola D, Crossa J, Legarra A, Manfredi E, Weigel K, Cotes JM: Predicting quantitative traits with regression models for dense molecular markers and pedigree. Genetics. 2009, 182: 375-385. 10.1534/genetics.109.101501.,

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  27. 27.

    Croiseau P, Legarra A, Guillaume F, Fritz S, Baur A, Colombani C, Robert-Granié C, Boichard D, Ducrocq V: Fine tuning genomic evaluations in dairy cattle through SNP pre-selection with the Elastic-Net algorithm. Genet Res. 2011, 93: 409-417. 10.1017/S0016672311000358.,

    CAS  Article  Google Scholar 

  28. 28.

    Heslot N, Yang HP, Sorrells ME, Jannink JL: Genomic selection in plant breeding: a comparison of models. Crop Sci. 2012, 52: 146-160. 10.2135/cropsci2011.06.0297.,

    Article  Google Scholar 

  29. 29.

    Burgueño J, De Los Campos G, Weigel K, Crossa J: Genomic prediction of breeding values when modeling genotype × environment interaction using pedigree and dense molecular markers. Crop Sci. 2012, 52: 707-719. 10.2135/cropsci2011.06.0299.,

    Article  Google Scholar 

  30. 30.

    Beaulieu J: Breeding program and strategy for white spruce in Quebec. Information Report LAU-X-117E Quebec, QC: Natural Resources Canada, Canadian Forest Service. 1996

    Google Scholar 

  31. 31.

    Burdon RD, Namkoong G: Short note: multiple populations and sublines. Silvae Genet. 1983, 32: 221-222.

    Google Scholar 

  32. 32.

    Pavy N, Gagnon F, Rigault P, Blais S, Deschênes A, Boyle B, Pelgas B, Deslauriers M, Clément S, Lavigne P, Lamothe M, Cooke JEK, Jaramillo Correa JP, Beaulieu J, Isabel N, MacKay J, Bousquet J: Development of high-density SNP genotyping arrays for white spruce (Picea glauca) and transferability to subtropical and nordic congeners. Mol Ecol Resour. 2013, 13: 324-336. 10.1111/1755-0998.12062.,

    CAS  PubMed  Article  Google Scholar 

  33. 33.

    Forni S, Aguilar I, Misztal I: Different genomic relationship matrices for single-step analysis using phenotypic, pedigree and genomic information. Genet Sel Evol. 2011, 43: 1-10.1186/1297-9686-43-1.,

    PubMed Central  PubMed  Article  Google Scholar 

  34. 34.

    R Development Core Team: R: a language and environment for statistical computing. 2012, Vienna, Austria: R Foundation for Statistical Computing,,

    Google Scholar 

  35. 35.

    Hadfield JD: MCMC methods for multi-response generalized linear mixed models: the MCMCglmm R package. J Stat Softw. 2010, 33: 1-22.

    Article  Google Scholar 

  36. 36.

    Plummer M, Best N, Cowles K, Vines K: CODA: convergence diagnosis and output analysis for MCMC. R News. 2006, 6: 7-11.

    Google Scholar 

  37. 37.

    Pérez P, De Los Campos G, Crossa J, Gianola D: Genomic-enabled prediction based on molecular markers and pedigree using the BLR package in R. Plant Genome. 2010, 3: 106-116. 10.3835/plantgenome2010.04.0005.,

    PubMed Central  PubMed  Article  Google Scholar 

  38. 38.

    De Los Campos G, Rodriguez PP: BLR: Bayesian linear regression. R package version 13. 2012,,

    Google Scholar 

  39. 39.

    Kang HM, Sul JH, Zaitlen NA, Kong S, Freimer NB, Sabatti C, Eskin E: Variance component model to account for sample structure in genome-wide association studies. Nat Genet. 2010, 42 (4): 348-354. 10.1038/ng.548.,

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  40. 40.

    Endelman JB: Ridge regression and other kernels for genomic selection with R package rrBLUP. Plant Genome. 2011, 4: 250-255. 10.3835/plantgenome2011.08.0024.

    Article  Google Scholar 

  41. 41.

    Goddard ME, Wray NR, Verbyla K, Visscher PM: Estimating effects and making predictions from genome-wide marker data. Stat Sci. 2009, 24: 517-529. 10.1214/09-STS306.

    Article  Google Scholar 

  42. 42.

    Wray NR, Yang J, Hayes BJ, Price AL, Goddard ME, Visscher PM: Pitfalls of predicting complex traits from SNPs. Nat Rev Genet. 2013, 14: 507-515. 10.1038/nrg3457.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  43. 43.

    Pavy N, Pelgas B, Laroche J, Rigault P, Isabel N, Bousquet J: A spruce gene map infers ancient plant genome reshuffling and subsequent slow evolution in the gymnosperm lineage leading to extant conifers. BMC Biol. 2012, 10: 84-10.1186/1741-7007-10-84.,

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  44. 44.

    Jaramillo Correa JP, Beaulieu J, Bousquet J: Contrasting evolutionary forces driving population structure at expressed sequence tag polymorphisms, allozymes and quantitative traits in white spruce. Mol Ecol. 2001, 10: 2729-2740. 10.1046/j.0962-1083.2001.01386.x.,

    CAS  PubMed  Article  Google Scholar 

  45. 45.

    Namroud M-C, Beaulieu J, Juge N, Laroche J, Bousquet J: Scanning the genome for gene single nucleotide polymorphisms involved in adaptive population differentiation in white spruce. Mol Ecol. 2008, 17: 3599-3613. 10.1111/j.1365-294X.2008.03840.x.

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  46. 46.

    De Roos APW, Hayes BJ, Spelman RJ, Goddard ME: Linkage disequilibrium and persistence of phase in Holstein-Friesian, Jersey and Angus cattle. Genetics. 2008, 179: 1503-1512. 10.1534/genetics.107.084301.,

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  47. 47.

    Kizilkaya K, Fernando RL, Garrick DJ: Genomic prediction of simulated multibreed and purebred performance using observed fifty thousand single nucleotide polymorphism genotypes. J Anim Sci. 2010, 88: 544-551. 10.2527/jas.2009-2064.,

    CAS  PubMed  Article  Google Scholar 

  48. 48.

    Jannink JL, Bink MCAM, Jansen RC: Using complex plant pedigrees to map valuable genes. Trends Plant Sci. 2001, 6: 337-342. 10.1016/S1360-1385(01)02017-9.,

    CAS  PubMed  Article  Google Scholar 

  49. 49.

    Villanueva B, Pong-Wong R, Fernandez J, Toro MA: Benefits from marker-assisted selection under an additive polygenic genetic model. J Anim Sci. 2005, 83: 1747-1752.

    CAS  PubMed  Google Scholar 

  50. 50.

    Habier D, Fernando RL, Garrick DJ: Genomic-BLUP decoded: a look into the black box of genomic prediction. Genetics. 2013, 194: 597-607. 10.1534/genetics.113.152207.,

    CAS  PubMed Central  PubMed  Article  Google Scholar 

  51. 51.

    Hayes BJ, Visscher PM, Goddard ME: Increased accuracy of artificial selection by using the realized relationship matrix. Genet Res. 2009, 91: 47-60. 10.1017/S0016672308009981.

    CAS  Article  Google Scholar 

  52. 52.

    Goddard ME, Hayes BJ, Meuwissen THE: Using the genomic relationship matrix to predict the accuracy of genomic selection. J Anim Breed Genet. 2011, 128: 409-421. 10.1111/j.1439-0388.2011.00964.x.,

    CAS  PubMed  Article  Google Scholar 

  53. 53.

    Visscher PM, Medland SE, Ferreira MAR, Morley KI, Zhu G, Cornes BK, Montgomery GW, Martin NG: Assumption-free estimation of heritability from genome-wide identity-by-descent sharing between full siblings. PLoS Genet. 2006, 2: 41-10.1371/journal.pgen.0020041.,

    Article  Google Scholar 

  54. 54.

    VanRaden PM: Genomic measures of relationship and inbreeding. Proceed Interbull Meeting Dublin Ireland. 2007, 37: 33-36.

    Google Scholar 

  55. 55.

    Visscher PM: Whole genome approaches to quantitative genetics. Genetica. 2009, 136: 351-358. 10.1007/s10709-008-9301-7.,

    PubMed  Article  Google Scholar 

  56. 56.

    Clark SA, Hickey JM, Daetwyler HD, Van Der Werf JHJ: The importance of information on relatives for the prediction of genomic breeding values and the implications for the makeup of reference data sets in livestock breeding schemes. Genet Sel Evol. 2012, 44: 4-10.1186/1297-9686-44-4.,

    PubMed Central  PubMed  Article  Google Scholar 

  57. 57.

    Campbell RK: Genotype x Environment Interaction: A Case Study for Douglas-fir in Western Oregon. 1992, Portland, OR: USDA Forest Service Research Paper PNW-RP-455. Pacific Northwest Research Station

    Google Scholar 

  58. 58.

    Li P, Beaulieu J, Bousquet J: Genetic structure and patterns of genetic variation among populations in eastern white spruce (Picea glauca). Can J For Res. 1997, 27: 189-198. 10.1139/x96-159.

    Article  Google Scholar 

  59. 59.

    Zobel BJ, Talbert J: Applied Forest Tree Improvement. 1984, New York: Wiley

    Google Scholar 

  60. 60.

    Kelly AM, Cullis BR, Gilmour AR, Eccleston JA, Thompson R: Estimation in a multiplicative mixed model involving a genetic relationship matrix. Genet Sel Evol. 2009, 41: 33-10.1186/1297-9686-41-33.,

    PubMed Central  PubMed  Article  Google Scholar 

  61. 61.

    Apiolaza LA: Improvement objectives for short rotation forestry. New Zealand J For. 2008, 52: 28-30.

    Google Scholar 

  62. 62.

    Legarra A, Aguilar I, Misztal I: A relationship matrix including full pedigree and genomic information. J Dairy Sci. 2009, 92: 4656-4663. 10.3168/jds.2009-2061.,

    CAS  PubMed  Article  Google Scholar 

  63. 63.

    Christensen OF, Lund MS: Genomic prediction when some animals are not genotyped. Genet Sel Evol. 2010, 42: 2-10.1186/1297-9686-42-2.,

    PubMed Central  PubMed  Article  Google Scholar 

  64. 64.

    VanRaden PM: Efficient methods to compute genomic predictions. J Dairy Sci. 2008, 91: 4414-4423. 10.3168/jds.2007-0980.,

    CAS  PubMed  Article  Google Scholar 

Download references


We thank D. Plourde and É. Dussault (Natural Resources Canada) for site maintenance and data collection, S. Blais, F. Gagnon (Univ. Laval) and S. Clément (Natural Resources Canada) for managing the genotype data, A. Beaumont, R. Gagné (Natural Resources Canada), M.-H. Galibois and P. Lenz (Univ. Laval) for data collection, S. Mansfield (Univ. British Columbia) for determination of density profiles and MFA data, the team of A. Montpetit (Univ. McGill and Génome Québec Innovation Centre) for performing the Infinium genotyping assay, and I. Lamarre (Natural Resources Canada) for editing work. We are also grateful to Dr. Dario Grattapaglia (EMBRAPA Genetic Resources and Biotechnology, Brazil) for reading the manuscript and providing insights. This study was made possible through funding from the Natural Resources Canada Genomics R&D Initiative and the Canadian Wood Fibre Centre to JBe and JBo, and from Génome Québec and Genome Canada to JM, JBo and JBe.

Author information



Corresponding author

Correspondence to Jean Beaulieu.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

JBe participated to conceiving the study, designing breeding populations, providing access to field experiments, analysing data, drafting the manuscript and obtaining the funding. TKD performed the bulk of statistical analyses and drafting of the manuscript. JM participated in drafting the manuscript and obtaining the funding. AR participated to designing breeding populations, providing access to field experiments and phenotyping populations. JBo participated in conceiving the study, drafting the manuscript, obtaining the funding and supervised the development of markers and genotyping assays. All authors read and approved the final manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Beaulieu, J., Doerksen, T.K., MacKay, J. et al. Genomic selection accuracies within and between environments and small breeding groups in white spruce. BMC Genomics 15, 1048 (2014).

Download citation


  • Genomic selection
  • Genotype-by-environment interactions
  • Relatedness
  • Somatic embryogenesis
  • Spruce