Quantitative trait loci mapping for canine hip dysplasia and its related traits in UK Labrador Retrievers
BMC Genomics volume 15, Article number: 833 (2014)
Canine hip dysplasia (CHD) is characterised by a malformation of the hip joint, leading to osteoarthritis and lameness. Current breeding schemes against CHD have resulted in measurable but moderate responses. The application of marker-assisted selection, incorporating specific markers associated with the disease, or genomic selection, incorporating genome-wide markers, has the potential to dramatically improve results of breeding schemes. Our aims were to identify regions associated with hip dysplasia or its related traits using genome and chromosome-wide analysis, study the linkage disequilibrium (LD) in these regions and provide plausible gene candidates. This study is focused on the UK Labrador Retriever population, which has a high prevalence of the disease and participates in a recording program led by the British Veterinary Association (BVA) and The Kennel Club (KC).
Two genome-wide and several chromosome-wide QTLs affecting CHD and its related traits were identified, indicating regions related to hip dysplasia.
Consistent with previous studies, the genetic architecture of CHD appears to be based on many genes with small or moderate effect, suggesting that genomic selection rather than marker-assisted selection may be an appropriate strategy for reducing this disease.
Canine hip dysplasia (CHD) is a pathology characterised by the malformation of the coxo-femoral joint, leading to degeneration of the hip joint, lameness and painful osteoarthritis. It mainly affects large-sized breeds like the German Shepherd and Labrador Retriever and has an important impact due to its high prevalence in these breeds (25-40% in UK Labrador Retrievers , depending on the study). The disorder is a major health concern of dog owners, breeders and organizations (e.g. the American Kennel Club ) and cannot be cured, although it can be ameliorated by surgery and there is some suggestion that modification of certain factors (e.g. reduction of dietary calcium and vitamin D in growing dogs, ) may improve the condition. Although surgical/medical treatment can be used to improve the quality of life of the dog, it has been shown that CHD has a moderately heritable genetic basis  and due to the difficulty of treating the disease, a genetic solution should be explored.
A genetic strategy relies on selecting breeding individuals on the basis of their predicted breeding value, with greater accuracy of prediction offering faster progress. To date these predictions have relied on mass selection on a phenotype measured at a young age, and often prior to the onset of clinical disease. In many countries, this phenotype is based on a radiographic analysis of the pelvic area, and in the UK this scheme is run jointly by the British Veterinary Association (BVA) and The Kennel Club (KC) . In this scheme, animals older than one year of age are scanned, when skeletal maturity is assumed to have been achieved (although no upper age limit is imposed), and from the radiograph, hip score (HS) is calculated as the sum of nine component traits measured on categorical scales on both hips. Thus, hips with a perfect radiographic appearance will have a HS of 0, and the higher the score (up to 106), the greater the degree of CHD and/or degenerative change. Three of these component traits (Norberg Angle, Subluxation and Cranial Acetabular Edge) are measures of joint laxity. The other six are related to osteoarthritis which develops as a result of joint laxity, and therefore are more subject to detrimental age effects . Lewis et al.  also observed that the measures of laxity were more heritable than those associated with osteoarthritis.
Current breeding programmes against the disease are voluntary and recommend breeding from animals with HS (or equivalent) below a given threshold. They have resulted in measurable but moderate success [7–10]. Approaches that could enhance the performance of these programmes include selection based on phenotype-derived estimated breeding values (EBVs) [9–11] and/or marker-assisted or genomic selection [12, 13], where specific markers associated with the disease or genome-wide markers, respectively, are incorporated into breeding values. Previous studies regarding CHD, based on sample sizes of 150-800 animals and using microsatellites [14, 15] or SNPs [16–19], have shown inconsistent results for QTL position depending on the breeds, density of markers, allele frequencies and statistical methods used. These studies, usually focused on Norberg Angle or an equivalent measure to HS, have shown no evidence of a major locus controlling CHD, suggesting a complex disease driven by a number of QTL with small or moderate effect . Although a few positional candidate genes have been proposed , only one study has demonstrated a putative association between mutations in a specific gene (fibrillin 2 gene) and the disease .
The purpose of this study was to perform genome- and chromosome-wide scans to identify QTLs for hip score and the three components associated with laxity in Labrador Retrievers, with the aims of improving selection schemes against the disease and furthering understanding of the biological basis of CHD.
Animals and phenotypes
The genotyped sample comprised 1500 hip-scored Labrador Retrievers born between 2002 and 2008, which provides a representative sample of animals used for breeding. Dogs were evaluated for hip dysplasia based on radiographs according to the UK scoring method, recording the nine hip score components for each separate hip . These components are the following: Norberg Angle (NA), Subluxation (SUB), Cranial Acetabular Edge (CrAE), Dorsal Acetabular Edge (DAE), Cranial Effective Acetabular Rim (CrEAR), Acetabular Fossa (AF), Caudal Acetabular Edge (CAE), Femoral Head and Neck Exostosis (FHNE) and Femoral Head Recontouring (FHR). Owners of animals with hip scores and aged between 1 to 6 years old were contacted and requested to provide buccal DNA swabs and to fill in a questionnaire with details of body measurements (length, weight, girth and body conformation), exercise levels, lifestyle and nutrition, activity and concurrent health problems. Traits incorporated in statistical models are described below.
Radiographs were taken by veterinarians for submission to the British Veterinary Association/Kennel Club hip and elbow scoring schemes, a health screening protocol required before breeding from Kennel Club registered Labrador Retrievers. Hip and elbow score results are available to the public from the Kennel Club (https://www.thekennelclub.org.uk/services/public/mateselect/test/Default.aspx.).
Owners collected saliva samples themselves by using non-invasive buccal swabs after being provided with detailed instructions and an explanatory video. The sample was collected at home. This sampling strategy was chosen instead of involving a journey to a vet practice and collection by a veterinarian as this was deemed less stressful for the dog and of negligible risk. Advice obtained from personnel responsible for the ethical review process in The Roslin Institute (University of Edinburgh) was that no ethical approval was needed under the Animal Scientific Procedures Act (1986) because the technique was quick, non-invasive and painless and therefore was not a regulated procedure. The internal review process at the Institute also approved the research plan.
DNA extraction and SNP genotyping
Extraction of DNA from buccal swabs was performed according to a standard protocol . DNA was re-suspended in water and quantified using a Nanodrop and stored at 4°C until use. Animals were genotyped using the Illumina Canine High Density Beadchip containing 173,662 SNPs .
Quality control procedures
Quality control was performed to assure both sample quality and marker quality . A sample call rate threshold of 90% was applied, removing 275 samples with a low call rate. A further 27 animals were removed due to potential genotyping errors, detected as inconsistencies between the genomic and pedigree relatedness of individuals or between recorded sex and sex determined from the genotyping. The analysis of the age at scoring (Figure 1) showed a detrimental age effect on the hip score in animals older than 5 years. In order to remove this age-related bias, animals older than 5 years when scored (total of 19) were removed from the analysis, thus leading to a final sample size of 1179 animals.
Filtering criteria were applied to markers using Genome Studio software. A total of 59,260 markers were discarded due to low call rate (<98%), low reproducibility (GTS < 0.6), low or confounded signal (ABR mean < 0.3) and low minor allele frequency (MAF < 0.01). Further quality control on the markers was applied using PLINK , removing SNPs on the sex chromosomes and those showing deviation from Hardy-Weinberg equilibrium (threshold of 4.48E-7 applying a Bonferroni correction), finally resulting in 106,282 SNPs for further analysis, with their positions assigned according to the CanFam 2.0 assembly.
Preliminary analyses of population attributes (working status, coat colour) did not show associations with hip score, therefore, the genomic relationship matrix was fitted (as explained below) and was assumed to account for any remaining population structure.
Genome-wide association analysis (GWAS)
Association analyses were performed for total hip score (HS), transformed total hip score (THS, logarithmic transformation, ln(1 + HS), following Lewis et al. ) and for the main three components on both hip sides (left, right and total): Norberg angle (NA), Subluxation (SUB) and Cranial Acetabular Edge (CrAE), using a linear mixed model as follows:
where y is the vector of phenotypes, W is a matrix of covariates with the α vector of associated fixed effects (including the intercept), x is a vector of marker genotypes (coded as 0/1/2) with β representing the regression of the phenotype on the marker genotypes, u is a vector of random polygenic effects and ϵ is a vector of residual errors. The fixed factors considered were the following: gender (1 degree of freedom), age at scoring in years and time spent exercising per day, which was scored from 1 (up to 1 hour) to 4 (more than 4 hours). The latter two factors were fitted as linear and quadratic regressions on deviations from their means. Analyses were conducted using GEMMA , which accounts for population stratification by including the genomic relatedness matrix (GRM, G) , and assuming a model where the vectors of random effects, u, and errors, ϵ, follow multivariate normal (MVN) distributions given by u ~ MVN(0,V G G) and ϵ ~ MVN(0,V E I), where V G and V E are the genetic variance associated with G and environmental variance, respectively.
GEMMA provides regression coefficient for each marker and their statistical significance was assessed using a Wald test. In determining a genome-wide significance threshold of P < 0.05, a conservative Bonferroni correction was made for multiple testing resulting from the large number of markers but not for multiple traits, resulting in a final threshold of P < 4.705E-7. The group of traits considered all share high pairwise genetic correlations, in excess of 0.85 [4, 6], and therefore the appropriate correction for multiple traits is unclear.
Differences in allele frequencies that are artefacts of cryptic population stratification or genotyping errors may inflate test statistics (mean and median χ2 values) above their expectations under the null hypothesis. This inflation was detected and corrected by the use of the inflation factor λ, defined as the ratio of the median of the empirically observed distribution of the test statistic to the expected median (thus quantifying the extent of the inflation and the excess false positive rate), following the method suggested by Amin et al. , assuming that the inflation is roughly constant across the genome.
Genomic, chromosomal and regional variances
An improved estimate of V G , the total genetic variance, was obtained by removing the regression on marker genotype from the GWAS model. Therefore, the following mixed linear was fitted:
where the meaning and distributional assumptions for each of these terms were as described above for the GWAS model. To compare the estimates of genetic variance obtained from genomic and pedigree analyses, the model was re-fitted with u ~ MVN(0,V A A), where A is the numerator relationship matrix derived from the 5- generation pedigree of the phenotyped dogs, and V A is the associated additive genetic variance.
Using similar models the genetic variance was also partitioned among chromosomes using two different approaches: (1) a joint decomposition of the 38 canine chromosomes involving the simultaneous fitting of 38 chromosome-specific GRMs, or (2) chromosome-specific analyses in which the GRM for each chromosome was fitted together with a complementary GRM constructed from the all other SNPs to account for the remaining polygenic effect. In the first approach, the model fitted was of the form:
where y is the phenotype of each individual and, as in previous models, W and α refer to the fixed effects described above. In this approach (joint chromosomal decomposition), each u(i) (i = 1, …, 38) represents a vector of genetic values for chromosome i, such that u(i) ~ MVN(0,V C(i) G(i)) where G(i) is the GRM calculated from only those SNPs on chromosome i, and VC(i) is the associated variance. The estimate of total genetic variance is In the second approach, the total genetic variance was decomposed into a model with only two components:
where u(i) is the effect of chromosome i, as described above, and distributed as MVN(0,V C(i) G(i)) and u(‒i) is a polygenic effect for the remaining genome and distributed as MVN(0,V C(-i) G(-i)) where G(-i) is a GRM constructed from all SNPs other than those on chromosome i. Therefore in this second model, the model was fitted 38 times, once for each of the 38 chromosomes, with analysis of chromosome i providing an estimate for V C(i) .
Regional V G  for windows of 20 SNPs centred upon the GWAS-significant SNPs were evaluated considering the region plus a complementary polygenic effect. Regional V G across the entire genome was also evaluated. In the latter analysis, the genome was divided into distinct regions, each containing 20 adjacent SNPs. The analysis then proceeded similarly to the approach described above for chromosomal variances with two components. Under the first approach, the SNP windows were always centred at the GWAS-significant SNPs, whereas under the second one, the significant SNP was not necessarily at the centre of a window.
The regional variances for significant SNPs and genomic and chromosomal variances were estimated for each trait using ACTA , but the full regional exploration with 20-SNP windows was carried out using REACTA  for computational ease.
An additional analysis of the haplotype structure and linkage disequilibrium (LD) in the genome-wide significant regions was performed using Haploview 4.2 . Genome-wide significant regions were analysed for their haplotype structure and LD, and investigated for functional candidate genes showing strong LD with the significant SNPs.
Four SNPs with genome-wide significance (Table 1) and 73 SNPs with chromosome-wide significance (Additional file 1) were detected in the GWAS analyses and Manhattan plots for two of the traits are shown (Figures 2 and 3). The genomic estimates of the residual (V E ) and genetic (V G ) variances for each trait are shown in Table 2, together with the pedigree-based estimates. Quantile-Quantile (Q-Q) plots of GWAS analyses for all traits after correcting by the inflation factor are shown in the supplementary material (Additional file 2). The supplementary material also contains the estimates of the genetic variances (V C ) at the chromosome level calculated using the two different approaches described above: joint chromosomal decomposition of the 38 chromosomes (Additional file 3) and separate analyses of each chromosome with all other chromosomes included as part of the polygenic effect (Additional file 4). Additional file 5 shows both sets of results in a graphical form. Results for the chromosomal analyses (Additional files 3 and 4) are presented as percentages of the total genetic variance explained by the genomic estimates given in Table 2 and thus add up to 1. The regional heritability analysis determined percentages of genetic variance explained for each trait based on windows of 20 SNPs across the genome (Additional file 6) or only for the regions centered at the GWAS-significant SNPs (Additional file 7).
Hip score (HS) and Transformed hip score (THS)
No SNPs with genome-wide significance were detected for these traits (Table 1), although the application of chromosome-wide thresholds (Additional file 1) identified several significant SNPs for both HS (on Chr 1, 2, 11, 15, 21 and 23) and THS (on Chr 11, 24 and 38).
Chr 9 and 11 clearly explained the highest proportions of genetic variance obtained from the joint chromosomal decomposition for HS (22.6% and 19.7% of the total V C ). In contrast, for THS, V C was more evenly distributed across the genome, with Chr 1, 9, 21 and 24 explaining the highest proportions of V C , 6.6%, 15.3%, 11% and 8.3%, respectively (Additional files 3, 4 and 5).
The estimates of the residual variance (V E ) from genomic analyses were greater (and heritability lower) than their pedigree estimates (Table 2) for HS, indicating that the GRM did not explain as much of the observed phenotypic variance and thus appeared not to capture the full additive genetic variance. In contrast, for THS, the estimates of V E were very similar. The regional heritability analyses (Additional file 6) showed several peaks across the genome, and although no significant regions were detected for HS, a significant region was detected for THS on Chr 21 (41.4 Mb). This region was not concordant with the chromosome-wide significant SNP detected by GWAS for this trait (27.9 Mb) but close to significant SNPs detected for other traits (NA and CrAE) on this chromosome.
Norberg Angle (NA)
A SNP with genome-wide significance (Table 1 and Figure 2) was detected for the right hip on Chr 21 and several SNPs with chromosome-wide significance (Additional file 1) were detected for the right hip (Chr 1, 10, 12, 13, 15, 21, 26, 27, 30 and 38), the left hip (Chr 4, 14, 22, 23 and 25) and the total NA (Chr 12, 21, 23, 27 and 38).
For the right-hip NA, Chr 11 and 21 clearly explained the highest proportions of genetic variance (21.7% and 16.9% of V C ) whereas for the left-hip NA, the total V G was more evenly distributed across the genome, with several chromosomes explaining similar percentages. Results for total NA were a mixture of those for left and right hips, with Chr 3, 4, 9, 11 and 21 explaining the highest proportions of V C , 6.7%, 6.9%, 6.8%, 7.5% and 6.7%, respectively (Additional files 3, 4 and 5).
The estimates of residual variances (V E ) from genomic analyses for all three traits associated with NA were greater (with lower heritability) than the pedigree estimates (Table 2), again indicative that the GRM did not capture the full genetic variance. Regional heritability analyses showed several peaks across the genome, none of which were significant (Additional file 6). The SNP on Chr 21 detected by GWAS explained 13.1% of the genetic variance (V G ) but the standard error was very high (Table 1).
No SNPs with genome-wide significance were detected, although several SNPs with chromosome-wide significance (Additional file 1) were detected for the right hip (Chr 6, 22, 24, 31 and 38), the left hip (Chr 6, 20 and 36) and total SUB (Chr 6, 24, 27 and 35).
For the right-hip SUB, Chr 1, 7, 9, 21 and 24 explained the highest proportions of total genetic variance, 9.7%, 8.2%, 7.5%, 8.3% and 8.7% of the total V G , respectively, whereas for the left-hip SUB, the highest proportions were explained by Chr 9 and 24 (10% and 11%). In the case of SUB total, Chr 1, 9 and 24 explained the highest proportions, 7.2%, 7.2% and 7.3% of the total V G , respectively (Additional files 3, 4 and 5).
The estimates of residual variances (V E ) from genomic analyses for all three traits were again slightly higher (lower heritability) than the pedigree estimates (Table 2), and regional heritability analyses showed several non-significant peaks and one significant peak on Chr 9 for left-hip SUB (16.5 Mb), which was not concordant with SNPs detected for other traits (Additional file 6).
Cranial Acetabular Edge (CrAE)
SNPs with genome-wide significance (Table 1 and Figure 3) were detected on Chr 1 for the right and the left hips, and several SNPs with chromosome-wide significance (Additional file 1) were also detected for the right hip (Chr 13 and 21), the left hip (Chr 12, 15 and 38) and the total CrAE (Chr 12 and 38).
Unlike HS and traits associated with NA and SUB, the genomic estimates of the residual variances (V E ) for the three traits were very close to (and slightly lower than) pedigree estimates (Table 2). The regional heritability analyses did not reveal any significant peaks across the genome (Additional file 6). The SNPs detected by GWAS on Chr 1 showing genome-wide significance explained 13.5-33.2% of the genetic variance (V G ) but again the standard errors were very high (Table 1).
General observations across traits
Due to the Beavis effect, the estimates of effect sizes (Beta coefficients) and the proportion of genetic variance explained by each chromosome or region, considered separately, are expected to be overestimates . Therefore the results obtained by the joint chromosomal decomposition (fitting all the chromosomes simultaneously in the same model) are expected to be more accurate than the chromosomal variances obtained for each chromosome separately when compared with the total (GRM-based) genetic variance. The similarity of our estimates of V G and V C calculated from the joint chromosomal decomposition (results not shown), support this prediction and therefore, we subsequently focus on estimates of individual chromosomal variance from the joint chromosomal decomposition. These estimates revealed a polygenic architecture for all traits, with several chromosomes contributing less than 6% of the total genetic variance and a few chromosomes contributing 15-23%.
All animals, including those with perfect hip scores (HS = 0), were included in the analyses. However, Lewis et al.  detected an excess of zero individual scores for HS, based on non-linearity of the regression of offspring phenotype onto mid-parent phenotype, suggesting a lower precision of evaluation in the lower tail of the trait distribution. To address this, parallel genome-wide GWAS analyses were performed for all traits removing all animals with a HS = 0. Results of these analyses (not shown) were similar to the results shown previously for the full sample of animals.
The present study used a high density canine chip to identify two genome-wide QTLs affecting CHD-related traits. Chr 1 and 21 revealed consistent GWAS peaks and significant SNPs across the analyses in relation to several traits. Other suggestive QTLs were detected as significant at the chromosome-wide level, and may also indicate genomic regions related to hip dysplasia. In order to strengthen the evidence for QTLs identified in our study, we have applied two additional analyses: chromosomal and regional analyses of the explained genetic variance. These methods are designed to estimate variances associated with multiple loci contributing localised variance that may be too small individually to detect by single association tests [28, 33] and to estimate the variance explained by a QTL rather than its allelic substitution effect.
Chr 1 explained ~23% and ~13% of the total V G observed for CrAE left and CrAE total, respectively, and revealed six genome-wide significant SNPs for these traits, as well as chromosome-wide significant SNPs related to HS and NA right. These SNPs were located between two QTLs for CHD status in German Shepherds  and within 10 Mb of a QTL for NA right in Portuguese Water dogs . Genes in the region between 99.00 Mb and 110.00 Mb on Chr 1, which encompasses all of the genome- and chromosome-wide significant SNPs, were investigated. Two genome-wide significant SNPs were found within the SHC3 gene, which is not known to be associated with CHD-related traits, however, within 1 Mb of this gene, there were several functional candidate genes, associated with bone formation or mineralization (SEMA4D, OMD, OGN), cartilage formation (PHF2), and differentiation into joint versus cartilage cells (BARX1). However, none of the SNPs within or closest to these genes showed high LD (assessed by r2) with the SNPs identified by GWAS. The third genome-wide significant SNP was found in an inter-genic region, close to a zinc-finger gene (ZNF677), which again is not known to have any association with CHD-related traits, and several uncharacterised protein-coding genes. The large size of the Chr 1 region, and hence the large number of positional candidate genes, highlights a problem in the identification of causative variants from GWAS studies in species with high levels of LD such as dogs.
While this 11-Mb region on Chr 1 included the greatest number of genome-wide significant SNPs in our study, the frequency of the favourable SNP alleles (associated with low hip score) is already very high, suggesting that the average effect of the favourable allele is small  and consequently little genetic progress in CHD could be made by changes in frequencies at this QTL. This assumes that the allele frequency of the causal variant is similar to the marker, which may not be the case. However, the r2 (and power of detection) between a marker and a causal variant is sensitive to differences in allele frequency, particularly at low frequencies, suggesting that the allele frequencies of the marker and causative variant cannot be radically different.
Chr 21 explained ~17% of the total V G observed for NA right, revealing one genome-wide significant SNP for this trait, several chromosome-wide significant SNPs related to NA, CrAE and HS and a significant peak in the regional analysis for THS. These SNPs correspond to two main regions: a genome-wide significant region situated at ~43 Mb and a chromosome-wide significant region situated at ~27 Mb. For the first case, previous studies have shown the existence of a QTL related to NA in this region for a Labrador-Greyhound cross-breed pedigree  and ~5 Mb from a QTL for CHD status in German Shepherds . The genome-wide significant SNP from our study is found within the OTOG gene, which encodes an ear-specific glycoprotein that has no known association with CHD-related traits. However, as with the Chr 1 region, there are several functional candidate genes between 41.5 and 44.5 MB, the region that includes the genome- and chromosome-wide significant SNPs. These include genes associated with cartilage formation (SOX6), inflammation and osteoarthritis (SAA) and muscle cell differentiation and muscle regeneration (MYOD1, SERGEF). Again, most of the SNPs within or closest to these genes showed low LD with the SNPs identified by GWAS, however, one SNP within SERGEF was strongly correlated (r2 = 0.8) with the genome-significant SNP and two SNPs within SOX6 were moderately correlated with it (r2 = 0.4). In comparison to the region on Chr 1, there is greater potential for genetic progress based on this region as the frequency of the favourable allele is much lower for this QTL.
Chromosome-wide QTLs were detected on several chromosomes and a polygenic pattern was also observed for the distribution of the genetic variance in both the regional and chromosomal studies, with most of the chromosomes explaining less than 6% of the genetic variance but some chromosomes explaining between 10% and 23%. These results confirm a polygenic nature of the disease that could arise if the hip score and its components were the result of multiple interacting processes. It might be the case that genes affecting one or two processes have a small effect on the main phenotype (resulting in many QTLs with small effect), while only genes affecting several processes appear as QTLs with larger effects. The complex aetiology of the disease supports its polygenic nature. Both biological and environmental factors potentially contribute to an unstable hip joint  and many different genes are likely to influence the biological factors (e.g. bone and cartilage formation, muscle and other soft tissue development, inflammatory responses), which when disrupted, could contribute to disease. It is worth noting that while identification of the genes associated with the major QTLs detected in our study would contribute to an improved understanding of the biology of CHD and may help in the development of therapies, our results suggest that these QTLs will not capture most of the genetic variance related to this disease.
Another aspect of the trait complexity found in our study and several others [14, 15, 17, 36] is that some QTL effects are stronger on one side of the body than the other, despite the fact that a high genetic correlation between right and left scores has been observed in both Labrador Retriever and German Shepherd breeds [4, 37]. These findings could result from asymmetrical dogs’ gaits (even with normal hips)  or from suboptimal positioning of dogs when scoring, but also could be a statistical artefact of identification of QTL with small effects.
In order to further explore the QTLs identified in our study, the performance of the SNP-based genomic approach to characterise the genetic architecture of the trait was also compared with the pedigree approach in terms of residual variance (V E ). Residual variance was used to compare the models because in both cases, it comprises the variance component not explained by the model for the phenotypes in the data, whereas, in contrast, the genetic variances obtained using genomic and numerator (pedigree-based) relationship matrices refer to constructed base populations, which may differ. SNP-based methods are not expected to capture all the genetic variance  and our results broadly confirm this hypothesis, with our genomic estimates of V E greater than those provided by a pedigree analysis for most of the traits, but not in all cases. It is expected that an increase in the number of markers will increase the proportion of explained genetic variance [39, 40].
Although some detected QTL regions (e.g. Chr 1 and 21) correspond with those found in previous QTL studies, others have not been seen before and equally, we did not detect effects in several previously identified regions. There was no overlap between the regions showing genome- or chromosome-wide significance and those associated with CHD severity in a recent case-control study of 78 Dutch Labrador Retrievers  (the closest SNPs detected in their study, on Chr 25, were > 5Mb from a chromosome-wide significant region in our study). This inconsistency of QTLs across different studies has been previously pointed out by Zhu et al. , and in addition to the possibility that some results are false positives, might be explained by the different selection pressures applied to each breed or population, the varying LD patterns found in different breeds [42, 43], the specific details of the different studies (number of animals, pedigree structure, marker type and density) and the stochasticity associated with detecting small (but real) effects across multiple regions. Some of these factors probably also account for the limited consistency between GWAS results for the highly correlated traits within this study.
In summary, two genome-wide significant QTLs on Chr 1 and 21 and three chromosome-wide significant QTLs on Chr 11, 21 and 24 were detected in a Labrador Retriever population, associated with hip score or its components. However, none of the chromosomes explained more than 23% of the genetic variance of the traits. These findings, taken with the complex nature of the hip score as the sum of different components, possibly related to different metabolic processes, suggests a genetic architecture based on many genes with small or moderate effect and therefore, marker-assisted selection is not likely to be successful. Alternatively, a genomic selection approach against the disease should be considered .
Coopman F, Verhoeven G, Saunders J, Duchateau L, van Bree H: Prevalence of hip dysplasia, elbow dysplasia and humeral head osteochondrosis in dog breeds in Belgium. Vet Rec. 2008, 163 (22): 654-658. 10.1136/vr.163.22.654.
Fries CL, Remedios AM: The pathogenesis and diagnosis of canine hip dysplasia: A review. Can Vet J. 1995, 36: 494-502.
Lewis TW, Blott SC, Woolliams JA: Genetic evaluation of Hip Score in UK Labrador Retrievers. PLoS One. 2010, 5 (10): e12797-10.1371/journal.pone.0012797.
Willis MB: A review of the progress in canine hip-dysplasia control in Britain. J Am Vet Med Assoc. 1997, 210: 1480-1482.
Lewis TW, Woolliams JA, Blott SC: Genetic Evaluation of the Nine Component Features of Hip Score in UK Labrador Retrievers. PLoS One. 2010, 5 (10): e13610-10.1371/journal.pone.0013610.
Malm S, Fikse WF, Danell B, Strandberg E: Genetic variation and genetic trends in hip and elbow dysplasia in Swedish Rottweiler and Bernese mountain dog. J Anim Breed Genet. 2008, 125: 403-412. 10.1111/j.1439-0388.2008.00725.x.
Leppanen M, Saloniemi H: Controlling canine hip dysplasia in Finland. Prev Vet Med. 1999, 42: 121-131. 10.1016/S0167-5877(99)00059-8.
Hou Y, Wang Y, Xuemei Lu X, Zhang X, Zhao Q, Todhunter RJ, Zhang Z: Monitoring Hip and Elbow Dysplasia Achieved Modest Genetic Improvement of 74 Dog Breeds over 40 Years in USA. PLoS One. 2013, 8 (10): e76390-10.1371/journal.pone.0076390.
Wilson BJ, Nicholas FW, James JW, Wade CM, Thomson PC: Estimated Breeding Values for Canine Hip Dysplasia Radiographic Traits in a Cohort of Australian German Shepherd Dogs. Plos One. 2013, 8 (10): e77470-10.1371/journal.pone.0077470.
Lewis TW, Blott SC, Woolliams JA: Comparative analyses of genetic trends and prospects for selection against hip and elbow dysplasia in 15 UK dog breeds. BMC Genet. 2013, 14: 16-
Lande R, Thompson R: Efficiency of Marker-Assisted Selection in the improvement of quantitative traits. Genetics. 1990, 124: 743-756.
Meuwissen THE, Hayes BJ, Goddard ME: Prediction of Total Genetic Value Using Genome-Wide Dense Marker Maps. Genetics. 2001, 157: 1819-1829.
Chase K, Lawler DF, Adler FR, Ostrander EA, Lark KG: Bilaterally asymmetric effects of quantitative trait loci (QTLs): QTLs that affect laxity in the right versus left coxofemoral (hip) joints of the dog (Canis familiaris). Am J Med Genet A. 2004, 124: 239-247.
Todhunter RJ, Mateescu R, Lust G, Burton-Wurster NI, Dykes NL, Bliss SP, Williams AJ, Vernier-Singer M, Corey E, Harjes C, Quaas RL, Zhang Z, Gilbert RO, Volkman D, Casella G, Wu R, Acland GM: Quantitative trait loci for hip dysplasia in a crossbreed canine pedigree. Mamm Genome. 2005, 16: 720-730. 10.1007/s00335-005-0004-4.
Marschall Y, Distl O: Mapping quantitative trait loci for canine hip dysplasia in German Shepherd dogs. Mamm Genome. 2007, 18: 861-870. 10.1007/s00335-007-9071-z.
Zhu L, Chen S, Jiang Z, Zhang Z, Ku H, Li X, McCann M, Harris S, Lust G, Jones P, Todhunter R: Identification of quantitative trait loci for canine hip dysplasia by two sequential multipoint linkage analysis. J Appl Stat. 2012, 39: 1719-1731. 10.1080/02664763.2012.673121.
Zhu L, Zhang Z, Feng F, Schweitzer P, Phavaphutanon J, Vernier-Singer M, Corey E, Friedenberg S, Mateescu R, Williams A, Lust G, Acland G, Todhunter R: Single nucleotide polymorphisms refine QTL intervals for hip joint laxity in dogs. Anim Genet. 2008, 39: 141-146. 10.1111/j.1365-2052.2007.01691.x.
Pfahler S, Distl O: Identification of Quantitative Trait Loci (QTL) for Canine Hip Dysplasia and Canine Elbow Dysplasia in Bernese Mountain Dogs. PLoS One. 2012, 7 (11): e49782-10.1371/journal.pone.0049782.
Friedenberg SG, Zhu L, Zhang Z, Foels W, Schweitzer PA, Wang W, Fisher PJ, Dykes NL, Corey E, Vernier-Singer M, Jung SW, Sheng X, Hunter LS, McDonough SP, Lust G, Bliss SP, Krotscheck U, Gunn TM, Todhunter RJ: Evaluation of a fibrillin 2 gene haplotype associated with hip dysplasia and incipient osteoarthritis in dogs. Am J Vet Res. 2011, 72 (4): 530-540. 10.2460/ajvr.72.4.530.
Qiagen: QIAamp DNA Blood Midi/Maxi Handbook. 2012
Illumina I: Canine HD BeadChip. Data Sheet: DNA Genotyping. 2010
Turner S, Armstrong LL, Bradford Y, Carlson CS, Crawford DC, Crenshaw AT, de Andrade M, Doheny KF, Haines JL, Hayes G, Jarvik G, Jiang L, Kullo IJ, Li R, Ling H, Manolio T, Matsumoto M, McCarty CA, McDavid AN, Mirel DB, Paschall JE, Pugh EW, Rasmussen LV, Wilke RA, Zuvich RL, Ritchie MD: Quality Control Procedures for Genome Wide Association Studies. Curr Protoc Hum Genet. 2011, 68: 1.19-
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, Maller J, Sklar P, de Bakker PIW, Daly MJ, Sham PC: PLINK: a toolset for whole-genome association and population-based linkage analysis. Am J Hum Genet. 2007, 81: 559-575. 10.1086/519795.
Zhou X, Stephens M: Genome-wide Efficient Mixed Model Analysis for Association Studies. Nat Genet. 2012, 44: 821-824. 10.1038/ng.2310.
Seren U, Vilhjalmsson BJ, Horton MW, Meng D, Forai P, Huang YS, Long Q, Segura V, Nordborg M: GWAPP: A Web Application for Genome-Wide Association Mapping in Arabidopsis. Plant Cell. 2012, 24: 4793-4805. 10.1105/tpc.112.108068.
Amin N, van Duijn CM, Aulchenko YS: A Genomic Background Based Method for Association Analysis in Related Individuals. PLoS One. 2007, 2 (12): e1274-10.1371/journal.pone.0001274.
Nagamine Y, Pong-Wong R, Navarro P, Vitart V, Hayward C, Rudan I, Campbell H, Wilson J, Wild S, Hicks A, Pramstaller PP, Hastie N, Wright AF, Haley CS: Localising Loci underlying Complex Trait Variation Using Regional Genomic Relationship Mapping. PLoS One. 2012, 7 (10): e46501-10.1371/journal.pone.0046501.
Gray A, Stewart I, Tenesa A: Advanced Complex Trait Analysis. Bioinformatics. 2012, 28: 3134-3136. 10.1093/bioinformatics/bts571.
Cebamanos L, Gray A, Stewart I, Tenesa A: Regional Heritability Advanced Complex Trait Analysis for GPU and Traditional Parallel Architectures. Bioinformatics. 2014, doi:10.1093/bioinformatics/btt754
Barrett JC, Fry B, Maller J, Daly MJ: Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics. 2005, 21: 263-265. 10.1093/bioinformatics/bth457.
Xu S: Theoretical Basis of the Beavis Effect. Genetics. 2003, 165: 2259-2268.
Uemoto Y, Pong-Wong R, Navarro P, Vitart V, Hayward C, Wilson JF, Rudan I, Campbell H, Hastie ND, Wright AF, Haley CS: The power of regional heritability analysis for rare and common variant detection: simulations and application to eye biometrical traits. Front Genet. 2013, 4: 232-
Falconer DS, Mackay TFC: Introduction to Quantitative Genetics. 1996, Addison Wesley Longman: Harlow
Riser WH, Rhodes WH, Newton CD: Hip Dysplasia. Texbook of Small Animal Orthopaedics. Edited by: Newton CD, Nunamaker DM. 1985, Philadelphia (U.S.A): J. B. Lippincott Company
Todhunter RJ, Zachos TA, Gilbert RO, Erb HN, Williams AJ, Burton-Wurster NI, Lust G: Onset of epiphyseal mineralization and growth plate closure in radiographically normal and dysplastic Labrador retrievers. J Am Vet Med Assoc. 1997, 210: 1458-1462.
Wilson BJ, Nicholas FW, James JW, Wade CM, Tammen I, Raadsma HW, Castle K, Thomson PC: Symmetry of hip dysplasia traits in the German Sheperd Dog in Australia. J Anim Breed Genet. 2011, 128: 230-243. 10.1111/j.1439-0388.2010.00903.x.
Colborne GR, Good L, Cozens LE, Kirk LS: Symmetry of limb mechanics in orthopedically normal trotting Labrador Retrievers. Am J Vet Res. 2011, 72 (3): 336-344. 10.2460/ajvr.72.3.336.
Hayes BJ, Pryce J, Chamberlain AJ, Bowman PJ, Goddard ME: Genetic Architecture of Complex Traits and Accuracy of Genomic Prediction: Coat Colour, Milk-Fat Percentage, and Type in Holstein Cattle as Contrasting Model Traits. PLoS Genet. 2010, 6: e1001139-10.1371/journal.pgen.1001139.
Daetwyler HD, Pong-Wong R, Villanueva B, Woolliams JA: The Impact of Genetic Architecture on Genome-Wide Evaluation Methods. Genetics. 2010, 185 (3): 1021-1031. 10.1534/genetics.110.116855.
Lavrijsen IC, Leegwater PA, Martin AJ, Harris SJ, Tryfonidou MA, Heuven HC, Hazewinkel HA: Genome wide analysis indicates genes for basement membrane and cartilage matrix proteins as candidates for hip dysplasia in Labrador Retrievers. PLoS One. 2014, 9: e87735-10.1371/journal.pone.0087735. doi:87710.81371/journal.pone.0087735
Sutter NB, Eberle MA, Parker HG, Pullar BJ, Kirkness EF, Kruglyak L, Ostrander EA: Extensive and breed-specific linkage disequilibrium in Canis familiaris. Genome Res. 2004, 14: 2388-2396. 10.1101/gr.3147604.
McKay SD, Schnabel RD, Murdoch BM, Matukumalli LK, Aerts J, Coppieters W, Crews D, Neto ED, Gill CA, Gao C, Mannen H, Stothard P, Wang Z, Van Tassell CP, Williams JL, Taylor JF, Moore SS: Whole genome linkage disequilibrium maps in cattle. BMC Genet. 2007, 8: 74-85.
Sánchez-Molano E, Woolliams JA, Blott SC, Wiener P: Assessing the impact of genomic selection against hip dysplasia in the Labrador Retriever dog. J Anim Breed Genet. 2013, doi:10.1111/jbg.12056
The authors are grateful for funding from the BBSRC (Grant number BB/H019073/1 and Institute Strategic Grant support to The Roslin Institute) and The Kennel Club Charitable Trust (to SCB). We are also thankful to the British Veterinary Association and The Kennel Club for providing data on the Labrador Retriever population and Dr. Tom Lewis (Animal Health Trust), Dr. Sophie Eaglen (Roslin) and Alexandra Brown (SRUC) for extracting phenotype data. We also thank Melissa Rolph (AHT) for assistance with project management, Andrew Bain (Roslin) and Louise Monsey (AHT) for assistance with DNA extraction, and Prof. Dave Burt for advice. Genotyping was carried out by Edinburgh Genomics, The University of Edinburgh. Edinburgh Genomics is partly supported through core grants from NERC (R8/H10/56), MRC (MR/K001744/1) and BBSRC (BB/J004243/1).
The authors declare that they have no competing interests.
ESM participated in the study design, carried out the statistical analyses and drafted the manuscript. SCB, DNC, PW and JAW were responsible for the conception, funding, study design and implementation of the project. PW and JAW managed the data analysis and manuscript preparation. DNC and SCB contributed to the data analysis and manuscript preparation. RPW supervised some of the data analysis. All authors have read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Summary of chromosome-wide significant SNPs using a linear mixed model for canine hip dysplasia in UK Labrador Retriever: The table shows chromosome, significant SNPs, position (in base pairs according to CanFam 2.0), associated trait, minor allele and its frequency, Beta coefficient (minor allele substitution effect), P-value of the GWAS analysis and percentage of genetic variance explained by a region of 21 SNPs centred in the significant SNP. (XLSX 19 KB)
Additional file 2: Quantile-Quantile plots for GWAS analyses. Figures show the Q-Q plots after correcting by the inflation factor. (PDF 301 KB)
Additional file 3: Percentage of genetic variance explained per chromosome: The joint chromosomal decomposition model fitted the 38 chromosomes simultaneously. (Note: equivalent to the red bars in Additional file 5). Percentages are related to the total genetic variance explained by the genomic method (given in Table 2). The proportion of the genome accounted for by each chromosome (PG) is also given. (XLSX 11 KB)
Additional file 4: Percentage of genetic variance explained per chromosome: The model considers a separate analysis of each chromosome including a complementary GRM accounting for the polygenic effect. (Note: equivalent to the blue bars in Additional file 5). Percentages are related to the total genetic variance explained by the genomic method (given in Table 2). The proportion of the genome accounted for by each chromosome (PG) is also given. (XLSX 14 KB)
Additional file 5: Percentage of genetic variance explained per chromosome. Blue bars correspond to estimates analysing each chromosome separately (Additional file 4), and red bars correspond to estimates obtained through the joint chromosomal decomposition (Additional file 3). Error estimates for the first method (dashed lines) were obtained based on a Taylor series approximation. Lack of convergence of the REML analyses for the second method meant that the variance/covariance matrix, and thus the errors, could not be estimated. (PDF 897 KB)
Additional file 6: Regional heritability analysis. P-values for the significance of the genetic variance explained by each window of 20 SNPs for all traits, considering a Bonferroni-corrected threshold (red line). (PDF 1 MB)
Additional file 7: Percentage of genetic variance explained for all significant SNPs. Contains the percentage of genetic variance explained by all significant SNPs (% VG SNPs) or by the regions of 20 SNPs centered at the significant SNPs (% VG Regions). (XLSX 11 KB)
About this article
Cite this article
Sánchez-Molano, E., Woolliams, J.A., Pong-Wong, R. et al. Quantitative trait loci mapping for canine hip dysplasia and its related traits in UK Labrador Retrievers. BMC Genomics 15, 833 (2014). https://doi.org/10.1186/1471-2164-15-833