Inferring genetic architecture of complex traits using Bayesian integrative analysis of genome and transcriptome data
 Alireza Ehsani^{1}Email author,
 Peter Sørensen^{1},
 Daniel Pomp^{2},
 Mark Allan^{3} and
 Luc Janss^{1}
DOI: 10.1186/1471216413456
© Ehsani et al.; licensee BioMed Central Ltd. 2012
Received: 27 March 2012
Accepted: 24 August 2012
Published: 5 September 2012
Abstract
Background
To understand the genetic architecture of complex traits and bridge the genotypephenotype gap, it is useful to study intermediate omics data, e.g. the transcriptome. The present study introduces a method for simultaneous quantification of the contributions from single nucleotide polymorphisms (SNPs) and transcript abundances in explaining phenotypic variance, using Bayesian wholeomics models. Bayesian mixed models and variable selection models were used and, based on parameter samples from the model posterior distributions, explained variances were further partitioned at the level of chromosomes and genome segments.
Results
We analyzed three growthrelated traits: Body Weight (BW), Feed Intake (FI), and Feed Efficiency (FE), in an F_{2} population of 440 mice. The genomic variation was covered by 1806 tag SNPs, and transcript abundances were available from 23,698 probes measured in the liver. Explained variances were computed for models using pedigree, SNPs, transcripts, and combinations of these. Comparison of these models showed that for BW, a large part of the variation explained by SNPs could be covered by the liver transcript abundances; this was less true for FI and FE. For BW, the main quantitative trait loci (QTLs) are found on chromosomes 1, 2, 9, 10, and 11, and the QTLs on 1, 9, and 10 appear to be expression Quantitative Trait Locus (eQTLs) affecting gene expression in the liver. Chromosome 9 is the case of an apparent eQTL, showing that genomic variance disappears, and that a trimodal distribution of genomic values collapses, when gene expressions are added to the model.
Conclusions
With increased availability of various omics data, integrative approaches are promising tools for understanding the genetic architecture of complex traits. Partitioning of explained variances at the chromosome and genomesegment level clearly separated regulatory and structural genomic variation as the areas where SNP effects disappeared/remained after adding transcripts to the model. The models that include transcripts explained more phenotypic variance and were better at predicting phenotypes than a model using SNPs alone. The predictions from these Bayesian models are generally unbiased, validating the estimates of explained variances.
Keywords
Bayesian Body Weight Feed Intake Genome Transcriptome eQTL VarianceBackground
Large amounts of genomic information generated from Single Nucleotide Polymorphism (SNP) microarrays have become available in recent years for many species[1–3]. This genomic information is used to detect polymorphisms that contribute to variation in economically important traits, such as production traits in farm animals[3]. Microarray technology is also used to screen the expression levels of thousands of genes, i.e., the transcriptome[4, 5]. Studies have shown that genetic background can have a large impact on differential expression[6]. Integrating genome and transcriptome information can help to elucidate the underlying biology of the genotypephenotype map, using expression Quantitative Trait Locus (eQTL) mapping[7].
However, in the eQTL approach, associations between SNPs, transcript level, and phenotypes are analyzed individually. This is likely to lead to “missing heritability”[8], because corrections for multiple testing lead to a high false negative rate and multiple SNPs and transcript level that jointly explain the phenotype are ignored[9, 10]. Here we propose and demonstrate Bayesian models that model all SNPs and transcript level simultaneously to obtain explained variances by the whole genome and whole transcriptome. In these models, we identify eQTLs as those SNPs whose effects disappear when transcript level are added to the model. Genomic and transcriptomicexplained variances are further partitioned by chromosome and genome sections to offer a view of the genetic architecture on different aggregation levels.
The choice of Bayesian variable selection (BVS) models was due to its features to separate markers with large/moderate or small effects, and to locate the important regions in the genome or transcriptome which serves a better QTL mapping method because it produces clearer signals for QTL[11]. Furthermore the prediction based on genomic variables using BVS is more accurate even when the prior is not correct[11–14]. It is important to say that simpler methods suffer from “missing heritability” too[15, 16].
The aim of this study was to explore the contributions of various sources of variation, such as population structure, SNP variants, and gene expression levels, to a set of growth related traits (body weight, feed intake, and feed efficiency) in mice. These traits are very important, both in terms of agricultural production and for obesity in humans. Bayesian mixed models and Bayesian variable selection models were applied to model pedigree, SNPs and/or gene expressions and to derive explained variances for these components. In addition, they were used to partition of SNPs and gene expression by chromosome and genome sections. To validate the estimates of explained variances, the predictive ability of these models was studied using cross validation.
Data
An M16 × ICR F_{2} population of 440 mice was available with complete records for body weight at 8 weeks (BW) and 337 records for feed intake (FI) and feed efficiency (FE), measured during the period 3 weeks to 8 weeks[17]. An additional 89 pedigree records were available that described the family structure up to the F_{0} founder lines. Data was obtained in three batches and the sex of the animals was recorded. At the end of the experiment, the mice were sacrificed and liver tissue was extracted for genomewide expression profiling. RNA isolation, cDNA synthesis, array hybridization, normalization of probe level intensity, and annotation of data were performed as described in detail by[18]. Genotypes for 1806 highly informative single nucleotide polymorphisms (SNPs) were available for each animal. These tagSNPs were used to trace the genomic variation in this F_{2} population. Density functions of phenotypes are available in Additional file1 and the whole data were made publicly available at (http://gbi.agrsci.dk/~pso/BIG_genome_transcriptome/).
Methods
where X is the design matrix for batch and sex effects, Z is a design matrix that links polygenic effects to the observed records, W is a matrix with 1806 SNP covariates, and Q is a matrix with 23,698 gene expression covariates. The SNP and gene expression covariates were centered and scaled to unit variance.
Based on work of[19–22], the Bayesian mixed model version assigns normal prior to the vectors u, a, g, and e in (1), i.e.,$u\sim N\left(0,A{{\displaystyle \sigma}}_{u}^{2}\right),a\sim N\left(0,I{{\displaystyle \sigma}}_{s}^{2}\right),g\sim N\left(0,I{{\displaystyle \sigma}}_{g}^{2}\right),e\sim N\left(0,I{{\displaystyle \sigma}}_{s}^{2}\right)$, where${{\displaystyle \sigma}}_{e}^{2}$ is the polygenic variance and A is the numerator relationship matrix based on pedigree information,${{\displaystyle \sigma}}_{s}^{2}$ is the perSNP explained variance,${{\displaystyle \sigma}}_{g}^{2}$ is the pergene expression explained variance, and${{\displaystyle \sigma}}_{e}^{2}$ is the residual or environmental variance. These four variances are estimated in the model using flat prior distributions, i.e.,${{\displaystyle \sigma}}_{u}^{2},{{\displaystyle \sigma}}_{s}^{2},\phantom{\rule{0.5em}{0ex}}{{\displaystyle \sigma}}_{g}^{2},\phantom{\rule{0.62em}{0ex}}{{\displaystyle \sigma}}_{e}^{2}\sim \phantom{\rule{0.5em}{0ex}}\mathit{\text{Bern}}$. The remaining parameters in (1), μ and b, are assigned flat prior distributions, which is the Bayesian analog of fitting “fixed effects” (unshrunken) estimates. A Markov chain Monte Carlo (MCMC) algorithm was applied in the software bayz[23] to obtain samples from the posterior distribution of the model parameters$f\left(\mu ,b,u,a,g,{{\displaystyle \sigma}}_{u}^{2},{{\displaystyle \sigma}}_{s}^{2},{{\displaystyle \sigma}}_{g}^{2},{{\displaystyle \sigma}}_{e}^{2}y\right)$. MCMC algorithms for sampling effects and variances in mixed models have been extensively described, for a general overview see[24]. The Monte Carlo accuracy of the MCMC algorithm was evaluated by correlating repeated estimates for the parameter vectors u, a and g, requiring a correlation >0.999 from repeated MCMC runs, and by computing the effective sample sizes for the variance components using the R Coda package[25].
The explained variance in y from (1) is var(Zu) + var(Wa) + var(Qg) + var(e). To obtain posterior means (PMs) and posterior standard deviations (PSDs) on the explained variances for SNPs and gene expressions, var(Wa) and var(Qg) were evaluated based on the posterior samples for a and g from the MCMC, i.e., as the PM and PSD of var(Wa^{t}) values over MCMC cycles, where a^{t} is the posterior sample for a from MCMC cycle t. This procedure is not required for the polygenic variance, because Z is a design matrix, unlike W and Q, which are covariate matrices.
where${\tau}_{a1}^{2}$ and${\tau}_{a0}^{2}$ are the “large” and “small” variances in the mixture distribution for a,${\tau}_{g1}^{2}$ and${\tau}_{g0}^{2}$ are the “large” and “small” variances in the mixture distribution for g, and${\gamma}_{a}$ and${\gamma}_{g}$ are vectors of 0/1 indicator variables for a and g, respectively, indicating whether the i th element in a or g, respectively, comes from the distribution with large or small variance. The variances${\tau}_{a1}^{2},\phantom{\rule{0.25em}{0ex}}{\tau}_{a0}^{2},\phantom{\rule{0.25em}{0ex}}{\tau}_{g1}^{2},\phantom{\rule{0.25em}{0ex}}{\tau}_{g0}^{2}$ were all estimated from the data using unbounded flat prior distributions. The constraints${\tau}_{a1}^{2}\phantom{\rule{0.25em}{0ex}}>\phantom{\rule{0.25em}{0ex}}{\tau}_{a0}^{2}$ and${\tau}_{g1}^{2}\phantom{\rule{0.25em}{0ex}}>\phantom{\rule{0.25em}{0ex}}{\tau}_{g0}^{2}$ were applied using a rejection sampler, so that “large” and “small” effects remained identifiable. The priors for the indicator variables were taken as${\gamma}_{\mathit{ai}}\phantom{\rule{0.25em}{0ex}}\sim \phantom{\rule{0.25em}{0ex}}\mathit{\text{Bern}}\left({\pi}_{a}\right)$ and${\gamma}_{\mathit{gi}}\phantom{\rule{0.25em}{0ex}}\sim \phantom{\rule{0.25em}{0ex}}\mathit{\text{Bern}}\left({\pi}_{g}\right)$, where$\mathit{\text{Bern}}\left(\pi \right)$means a Bernoulli distribution for a 0/1 indicator with a probability π for a 1. The parameters${\pi}_{a},\phantom{\rule{0.25em}{0ex}}{\pi}_{g}$were taken as known. The MCMC implementation of this model is relatively straightforward, because conditional on the indicator variables the model remains a mixed model. The updating of the mixture indicators is described in[26]. This model is also run in the software bayz[23], and the Monte Carlo accuracy was evaluated in the same way as the mixed model version.
From the posterior samples for a and g in the variable selection model, explained variances were computed and partitioned by chromosome and by genome section. The variable selection model is more suited to make such a partitioning, because unlike the mixed model version, it allows for different variance contributions per SNP. The explained variances were evaluated in the same way as for the mixed model, by evaluating var(Wa^{t}) and var(Qg^{t}) over MCMC cycles t, except that the a and g samples are obtained under the mixture model prior assumptions. The same expressions can be straightforwardly evaluated for parts of the SNPs or gene expressions to obtain explained variances per chromosome and for small windows of SNPs within chromosomes. Variance within a chromosome was computed using a 5SNP sliding window to obtain a genomic variance profile.
It is difficult to choose an optimal windows size as it depends on extend of LD, marker density and an arbitrary cutoff for what is considered important LD. In the data analyzed here, average R^{2} between adjacent SNPs was 0.55, and average R^{2} between SNPs two apart was 0.39, which we considered sufficiently high to warrant computation of variances in a 5SNP window. To study the relative importance of family structure, SNPs, and gene expressions, six sub models and the complete model (1) were used. These were models that use only pedigree information (PED), only SNP data (SNP), only gene expression data (GEX), SNP + GEX, PED + GEX, PED + SNP, and the complete model PED + SNP + GEX. These models always included sex and batch effects.
The predictive ability of the models was evaluated using an 11fold crossvalidation. For body weight, 440 records were divided randomly in 11 groups, each with 40 individuals. Feed intake and feed efficiency, with 337 records in total, were randomly divided in 10 groups of 30 records and one group of 37 records. The complete model, including all variance parameters, was reestimated on each set of 10 folds and predictions were computed for the phenotypes in the remaining 11^{th} fold. All predictions from the 11fold cross validation were collected to compute correlations between predicted and actual phenotypes, and regressions of predicted phenotypes on actual phenotypes, using the whole data set. The slope of the regression lines of predicted phenotypes on actual phenotypes are expected to be 1 if the model produces unbiased predictions, which would validate the estimates of explained variances. The University of Nebraska Institutional Animal Care and Use Committee approved all procedures and protocols.
Results and discussion
Explained variance in different models for Body Weight (BW), Feed Intake (FI), and Feed Efficiency (FE)
Trait  Explained variances  PED  SNP  GEX  PED + SNP  PED + GEX  SNP + GEX  PED + SNP + GEX 

Body Weight  E  9.96(1.93) 58%  9.82(0.94) 64%  3.57(0.9) 21%  7.07(1.77) 41%  2.43(1.01) 14%  3.08(0.77) 19%  2.06(1) 12% 
P  7.26(3.42) 42%      5.04(3.15) 29%  2.45(1.41) 14%    2.08(1.47) 12%  
S    5.63(0.9) 36%    5.14(1.08) 30%    2.9(0.67) 18%  2.82(0.73) 17%  
G      13.45(1.57) 79%    12.37(1.56) 72%  10.29(1.6) 63%  9.93(1.44) 59%  
Total  17.22  15.45  17.02  17.25  17.25  16.27  16.89  
Feed Intake  E  155.59(42) 47%  202.89(22) 72%  151.89(27) 51%  137.63(40) 42%  95.48(36) 30%  125.91(24) 43%  80.41(34) 25% 
P  174.89(82) 53%      131.88(79) 40%  99.74(57) 31%    89.97(53) 28%  
S    79.53(22) 28%    56.32(22) 18%    56.05(19) 19%  45.09(18) 14%  
G      150.24(41) 49%    125.33(35) 39%  111.84(33) 38%  104.9(33) 33%  
Total  330.48  282.42  302.13  325.83  320.55  293.8  320.37  
Feed Efficiency (×10,000)  E  1.59(0.44) 42%  2.40(0.26) 76%  2.23(0.3) 69%  1.53(0.44) 42%  1.09(0.48) 30%  1.88(0.3) 58%  1.07(0.46) 29% 
P  2.17(0.92) 58%      1.73(0.86) 47%  1.87(0.78) 51%    1.61(0.77) 44%  
S    0.76(0.24) 24%    0.39(0.22) 11%    0.61(0.23) 19%  0.33(0.2) 9%  
G      1.01(0.34) 31%    0.71(0.28) 19%  0.73(0.32) 23%  0.66(0.27) 18%  
Total  3.76  3.16  3.24  3.65  3.67  3.22  3.67 
Overall, explained variances increase by adding gene expression information (GEX; data from liver), i.e., in the most complete model (PED + SNP + GEX) explained variances were 88%, 75%, and 71% for BW, FI, and FE respectively. This confirms the assumption that gene expressions can explain a larger part of phenotypic variance than genetic or genomic information, by capturing environmental, and possibly nonadditive, genetic effects through the gene expressions[5, 30]. Information on the genetic architecture of these traits is best judged from the relative contributions of genomic and transcriptomic data in the SNP + GEX model.
This model shows that, for these traits, the liver transcriptome contributes a larger portion of explained variance. This is most pronounced for BW, with 18% of explained variance from the genome and 63% from the liver transcriptome. Thus, in this case, the predominant model is that SNPs regulate gene expressions to exert their effect on the phenotype.
This method/approach is suitable for genelevel resolution. However, genelevel resolution is highly data dependent, i.e. it requires high marker density and a study population with LD blocks that span small genomic regions. In this work we have used F2 crosses from outbred lines, which has large LD blocks and this kind of data has limited resolution for finemapping of QTL.
One may argue that the most complete model is more interesting to investigate genetic architecture and chromosomal/subchromosomal variance but as we have shown SNPs and pedigree are largely confounded and they explain about the same variance. This confounded explained variance is getting worse in the case that both Pedigree and SNPs are in one model (PED + SNP model) which is shown in higher confidence intervals of explained variance by pedigree. The model with only omics information (SNP + GEX) is therefore simpler, more accurate and as effective as the model that also uses pedigree information. This is interesting for future applications of omics technologies, because we expect that pedigree information often will be absent.
Rank correlation (Spearman) between individual values predicted from different sources of information pedigree (PED), SNPs markers (SNP), and gene expression signals (GEX) in three traits
PED & SNP  SNP & GEX  PED & GEX  

BW  0.94  0.87  0.87 
FI  0.93  0.87  0.88 
FE  0.89  0.68  0.68 
Correlation between predicted and actual phenotypes with different sources of information
Trait  Parameter  PED  SNP  GEX  SNP + PED  GEX + PED  SNP + GEX  SNP + GEX + PED 

Body Weight  ρ  0.76  0.8  0.87  0.80  0.87  0.88  0.88 
β  0.99  0.99  1.01  0.99  1.01  1.02  1.02  
Feed Intake  ρ  0.63  0.64  0.67  0.64  0.66  0.69  0.68 
β  0.98  0.99  0.99  0.96  0.95  0.98  0.96  
Feed Efficiency  ρ  0.46  0.45  0.51  0.46  0.54  0.51  0.55 
β  0.94  0.96  0.86  0.92  0.98  1  0.96 
Conclusions
With increased availability of various omics data, integrative approaches are promising tools for understanding the genetic architecture of complex traits. We have developed a complementary approach to the univariate “eQTL” mapping, by considering Bayesian models that fit all genomewide SNPs and transcript abundances in one model, and that estimate and partition explained variances by chromosome and genome segments. Our results show that, using gene expressions, more of the phenotypic variance can be explained and phenotypes can be better predicted. Predictions were also shown to be unbiased, which validates the assessed explained variances. The improvement of phenotype predictions using gene expression data will be useful for several applications in agriculture and medicine, although it should be assessed on a casebycase basis as to whether a suitable tissue can be sampled for the gene expression measurements. Partitioning of the explained genomic variance at the level of chromosomes and genome segments showed clear examples of eQTL locations as regions where genomic variance disappears when gene expressions are added to the model. Our study used only gene expressions from the liver, and an obvious further extension is to include expressions from other tissues. The QTLs that did not disappear when transcripts are added to the model may be eQTLs that affect gene expression in a tissue other than liver. The Bayesian model is quite efficient for handling large sets of covariates, and extensions to include multiple sets of expressions will be feasible. We have not provided formal statistical tests in this model, but the Bayesian approach lends itself naturally to obtaining confidence intervals for (differences between) parameter estimates. The estimates of total explained variances from the Bayesian mixed model can also be obtained by a residual maximum likelihood (REML) approach. We verified this, and the Bayesian and REML estimates generally agree. However, using REML it is not feasible to utilize mixture priors to better discriminate between SNPs which contribute more or less variance, and to partition the variances at the subchromosome level, which is all straightforward in a Bayesian approach.
Our approach can easily allow up scaling to higherdensity arrays, even to wholegenome sequence data with the variance components analysis as it was for gene expression probes in this study.
Abbreviations
 BW:

Body Weight
 FI:

Feed Intake
 FE:

Feed Efficiency
 SNPs:

Single Nucleotide Polymorphisms
 REML:

Restricted maximum Likelihood
 QTL:

Quantitative trait loci
 eQTL:

Expression Quantitative trait loci.
Declarations
Acknowledgement
This research is supported in part by the Quantomics research project that has been cofinanced by the European commission within the 7th Framework Programme, contract No. 222664. This work is a part of PhD project scholarship from the Ministry of Science, Research and Technology of Iran.
Authors’ Affiliations
References
 Hayes B, Goddard ME: Breakeven cost of genotyping genetic mutations affecting economic traits in Australian pig enterprises. Livest Prod Sci. 2004, 89 (2–3): 235242.View Article
 Wong GKS, et al: A genetic variation map for chicken with 2.8 million singlenucleotide polymorphisms. Nature. 2004, 432 (7018): 717722. 10.1038/nature03156.View ArticlePubMed
 GonzalezRecio O, et al: Nonparametric methods for incorporating genomic information into genetic evaluations: an application to mortality in broilers. Genetics. 2008, 178 (4): 23052313. 10.1534/genetics.107.084293.PubMed CentralView ArticlePubMed
 Cui XG, et al: Improved statistical tests for differential gene expression by shrinking variance components estimates. Biostatistics. 2005, 6 (1): 5975. 10.1093/biostatistics/kxh018.View ArticlePubMed
 Chesler EJ, et al: Complex trait analysis of gene expression uncovers polygenic and pleiotropic networks that modulate nervous system function. Nat Genet. 2005, 37 (3): 233242. 10.1038/ng1518.View ArticlePubMed
 Dworkin I, et al: Genomic consequences of background effects on scalloped mutant expressivity in the wing of Drosophila melanogaster. Genetics. 2009, 181 (3): 10651076. 10.1534/genetics.108.096453.PubMed CentralView ArticlePubMed
 Schadt EE, et al: An integrative genomics approach to infer causal associations between gene expression and disease. Nat Genet. 2005, 37 (7): 710717. 10.1038/ng1589.PubMed CentralView ArticlePubMed
 Manolio TA, et al: Finding the missing heritability of complex diseases. Nature. 2009, 461 (7265): 747753. 10.1038/nature08494.PubMed CentralView ArticlePubMed
 Zuk O, et al: The mystery of missing heritability: genetic interactions create phantom heritability. Proc Natl Acad Sci U S A. 2012, 109 (4): 11931198. 10.1073/pnas.1119675109.PubMed CentralView ArticlePubMed
 Hoggart CJ, et al: Simultaneous analysis of all SNPs in genomewide and resequencing association studies. PLoS Genet. 2008, 4 (7): e100013010.1371/journal.pgen.1000130.PubMed CentralView ArticlePubMed
 Xu SZ: Estimating polygenic effects using markers of the entire genome. Genetics. 2003, 163 (2): 789801.PubMed CentralPubMed
 Meuwissen THE, Hayes BJ, Goddard ME: Prediction of total genetic value using genomewide dense marker maps. Genetics. 2001, 157 (4): 18191829.PubMed CentralPubMed
 Habier D, Fernando RL, Dekkers JCM: The impact of genetic relationship information on genomeassisted breeding values. Genetics. 2007, 177 (4): 23892397.PubMed CentralPubMed
 de los Campos G, et al: Predicting quantitative traits with regression models for dense molecular markers and pedigree. Genetics. 2009, 182 (1): 375385. 10.1534/genetics.109.101501.PubMed CentralView ArticlePubMed
 Yang JA, et al: Common SNPs explain a large proportion of the heritability for human height. Nat Genet. 2010, 42 (7): 565131. 10.1038/ng.608.PubMed CentralView ArticlePubMed
 Visscher PM, Yang JA, Goddard ME: A commentary on 'common SNPs explain a large proportion of the heritability for human height' by Yang et al. (2010). Twin Res Hum Genet. 2010, 13 (6): 517524. 10.1375/twin.13.6.517.View ArticlePubMed
 Allan MF, Eisen EJ, Pomp D: Genomic mapping of direct and correlated responses to longterm selection for rapid growth rate in mice. Genetics. 2005, 170 (4): 18631877. 10.1534/genetics.105.041319.PubMed CentralView ArticlePubMed
 Dobrin R, et al: Multitissue coexpression networks reveal unexpected subnetworks associated with disease. Genome Biol. 2009, 10 (5): R5510.1186/gb2009105r55.PubMed CentralView ArticlePubMed
 Habier D, Fernando RL, Dekkers JC: Genomic selection using lowdensity marker panels. Genetics. 2009, 182 (1): 343353. 10.1534/genetics.108.100289.PubMed CentralView ArticlePubMed
 Meuwissen TH, et al: A fast algorithm for BayesB type of prediction of genomewide estimates of genetic value. Genet Sel Evol. 2009, 41: 210.1186/12979686412.PubMed CentralView ArticlePubMed
 Luan T, et al: The accuracy of Genomic Selection in Norwegian red cattle assessed by crossvalidation. Genetics. 2009, 183 (3): 11191126. 10.1534/genetics.109.107391.PubMed CentralView ArticlePubMed
 VanRaden PM, et al: Invited review: reliability of genomic predictions for North American Holstein bulls. J Dairy Sci. 2009, 92 (1): 1624. 10.3168/jds.20081514.View ArticlePubMed
 Janss L: bayz manual. 2011, Leiden, the Netherlands: Bayesian Solutions
 Sorensen D, Gianola D: Likelihood, Bayesian and MCMC methods in quantitative genetics. 2002, New York: SpringerVerlag: Statistics for biology and health, 740View Article
 Plummer M, et al: CODA: Convergence Diagnosis and Output Analysis for MCMC, in R News. 2006, 711.
 George EI, Mcculloch RE: Variable selection via Gibbs sampling. J Am Stat Assoc. 1993, 88 (423): 881889. 10.1080/01621459.1993.10476353.View Article
 Kapell DN, et al: Efficiency of genomic selection using Bayesian multimarker models for traits selected to reflect a wide range of heritabilities and frequencies of detected quantitative traits loci in mice. BMC Genet. 2012, 13 (1): 42PubMed CentralView ArticlePubMed
 Rolf MM, et al: Impact of reduced marker set estimation of genomic relationship matrices on genomic selection for feed efficiency in Angus cattle. BMC Genet. 2010, 11: 24PubMed CentralView ArticlePubMed
 Bink MCAM, et al: Bayesian analysis of complex traits in pedigreed plant populations. Euphytica. 2008, 161 (1–2): 8596.View Article
 Chesler EJ, et al: Genetic correlates of gene expression in recombinant inbred strains  a relational model system to explore neurobehavioral phenotypes. Neuroinformatics. 2003, 1 (4): 343357. 10.1385/NI:1:4:343.View ArticlePubMed
 Wuschke S, et al: A metaanalysis of quantitative trait loci associated with body weight and adiposity in mice. Int J Obes. 2007, 31 (5): 829841.
 Keightley PD, et al: A genetic map of quantitative trait loci for body weight in the mouse. Genetics. 1996, 142 (1): 227235.PubMed CentralPubMed
 Brockmann GA, et al: Quantitative trait loci affecting body weight and fatness from a mouse line selected for extreme high growth. Genetics. 1998, 150 (1): 369381.PubMed CentralPubMed
 Thompson R: Variancecomponents and animal breeding  Vanvleck, Ld, Searle, Sr. Biometrics. 1981, 37 (1): 201202. 10.2307/2530542.View Article
Copyright
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.