Bivariate genome-wide association study identifies novel pleiotropic loci for lipids and inflammation
BMC Genomics volume 17, Article number: 443 (2016)
Genome-wide association studies (GWAS) have identified multiple genetic loci for C-reactive protein (CRP) and lipids, of which some overlap. We aimed to identify genetic pleiotropy among CRP and lipids in order to better understand the shared biology of chronic inflammation and lipid metabolism.
In a bivariate GWAS, we combined summary statistics of published GWAS on CRP (n = 66,185) and lipids, including LDL-cholesterol, HDL-cholesterol, triglycerides, and total cholesterol (n = 100,184), using an empirical weighted linear-combined test statistic. We sought replication for novel CRP associations in an independent sample of 17,743 genotyped individuals, and performed in silico replication of novel lipid variants in 93,982 individuals. Fifty potentially pleiotropic SNPs were identified among CRP and lipids: 21 for LDL-cholesterol and CRP, 20 for HDL-cholesterol and CRP, 21 for triglycerides, and CRP and 20 for total cholesterol and CRP. We identified and significantly replicated three novel SNPs for CRP in or near CTSB/FDFT1 (rs10435719, Preplication: 2.6 × 10−5), STAG1/PCCB (rs7621025, Preplication: 1.4 × 10−3) and FTO (rs1558902, Preplication: 2.7 × 10−5). Seven pleiotropic lipid loci were replicated in the independent set of MetaboChip samples of the Global Lipids Genetics Consortium. Annotating the effect of replicated CRP SNPs to the expression of nearby genes, we observed an effect of rs10435719 on gene expression of FDFT1, and an effect of rs7621025 on PCCB.
Our large scale combined GWAS analysis identified numerous pleiotropic loci for CRP and lipids providing further insight in the genetic interrelation between lipids and inflammation. In addition, we provide evidence for FDFT1, PCCB and FTO to be associated with CRP levels.
Genome-wide association studies (GWAS) have identified hundreds of genetic loci for cardiovascular disease and it’s risk factors, including chronic inflammation and lipids [1–3]. Some of the identified genetic variants are associated with more than one phenotype, termed genetic pleiotropy . Examples are APOC1(rs4420638) and HNF1A (rs1183910), which are associated both with lipids and C-reactive protein (CRP) [2, 3]. As randomized clinical trials have shown a coextending effect of statin treatment on the lowering of LDL-cholesterol and CRP, we do expect inflammation and lipids to share certain biological pathways [5, 6]. Moreover, there is accumulating evidence that the pleiotropic effects are partially independent, although the biological mechanisms are not fully understood . The identification of further pleiotropic genes could provide insight into the biological mechanisms that link chronic inflammation to lipids.
Therefore, we aimed to identify further shared genes for lipids and CRP. In order to enhance the statistical power of genetic studies to find pleiotropic genes for the correlated phenotypes of interest, we applied a method that combines GWAS meta-analysis summary statistics allowing for mixed directions of effect, a common observed phenomenon in genetic pleitropy . In a second step we sought to replicate novel associations with lipids and CRP in an independent sample of 93,982 genotyped individuals for lipids and 17,743 genotyped individuals for CRP. We identified multiple overlapping genetic variants between CRP and lipids and confirmed novel genes implicated in the biology of chronic inflammation.
Bivariate genome-wide association analysis
We performed bivariate GWAS meta-analyses by combining summary statistics (Z test statistics) from the univariate GWAS of CRP pairing with the summary statistics of each GWAS of the lipid phenotypes, using an empirical-weighted linear-combined test statistics (eLC) . This method allows mixed genetic effects in the univariate phenotype GWAS, a phenomenon commonly observed in genetic studies.
CRP and LDL-cholesterol
Manhattan plots for the bivariate GWAS are depicted in Fig. 1. Table 1 indicates the results from the bivariate analysis combining CRP and LDL-cholesterol genetic association data. The bivariate analysis resulted in 21 potentially pleiotropic loci. We identified fourteen loci associated with CRP levels which had no genome-wide significant SNP in the original GWAS of CRP. These potential novel associations were located in or near CELSR2, IRF2BP2, ABCG8, GCNT4, HLA-DQB1, FRK, TRIB1, FADS2, ST3GAL4, BRAP, C12orf51, CARM1/LDLR, NCAN and RASIP1. The potential novel associations for LDL-cholesterol were located in or near GCKR, IL1F10, RORA, RASIP1 and in HNF4A. The SNPs identified in the bivariate GWAS near HLA-DQB1, FRK, BRAP, c12orf51 and CARM1/LDLR were not genome-wide significant in the original univariate GWAS on LDL-cholesterol, however other SNPs in their vicinity were significant in the original GWAS on LDL-cholesterol and the loci have thus been reported previously. The variants in and near PPP1R3B, HNF1A and APOC1 were already genome-wide significant in both GWAS of CRP and LDL-cholesterol.
CRP and HDL-cholesterol
We identified 20 potential pleiotropic SNPs (Table 2). The variants near CELSR2, STAG1, HLA-DRA, JMJD1C, FADS1, LIPC, CETP, LYPLA3, LIPG and MC4R were not genome-wide significant in the original CRP meta-GWAS analysis. Seven SNPs were potentially novel for both CRP and HDL-cholesterol: the SNP rs12742376 located in C1orf172 on chromosome 1 (P bivariate = 1.4 × 10−8), rs7621025 in STAG1 on chromosome 3 (P bivariate = 1.2 × 10−9), rs9378212 near HLA-DRA (P bivariate = 6.7 × 10−10), rs10761731 in JMJD1C (P bivariate = 2.2 × 10−8), rs1936797 in RSPO3 on chromosome 6 (P bivariate = 6.7 × 10−9), rs4871137 near SNTB1 (P bivariate = 3.3 × 10−8) on chromosome 8 and the FTO SNP rs1558902 (P bivariate = 5.0 × 10−9) on chromosome 16. The variants near CELSR2 and PLTP were not significant in the original GWAS on HDL-cholesterol, but these loci were identified in the original GWAS. The variants in or near PABPC4, BAZ1B, PPP1R3B, APOC1 and HNF4A were already genome-wide significant in both the CRP and HDL-cholesterol univariate GWAS.
CRP and Triglycerides
Table 3 lists the 21 potentially pleiotropic SNPs that were identified combining the GWAS results of triglycerides and CRP. For triglycerides, we identified eleven potential novel associations compared to the original GWAS located in or near PABPC4, LEPR, ADAR, CRP, IL1F10, PPP1R3B, CTSB/FDFT1, ARNTL, CABP1, MC4R and HPN. The variant near PLA2G6 was not genome-wide significant in the original GWAS, but this locus was identified in the original GWAS. The variants in and near ADAR, MSL2L1, HLA-C, CTSB/FDFT1, LPL, ARNTL, FADS1, CETP, MC4R, SF4, HPN, ZNF335/PLTP and PLA2G6 were potential novel associations with CRP level. Five loci were not genome-wide significant in either the original GWAS on CRP or triglycerides: the SNP rs1127311 within ADAR on chromosome 1 (P bivariate = 6.4 × 10−9), rs10435719 located 77Kb upstream of CTSB on chromosome 8 (P bivariate = 2.0 × 10−10), rs10832027 located in the second intron of ARNTL on chromosome 11 (P bivariate = 9.4 × 10−9), rs571312 on chromosome 18 near MC4R (P bivariate = 2.8 × 10−8), and the chromosome 19 rs1688043 in the fifth intron of HPN (P bivariate = 4.1 × 10−8). In both the original GWAS of CRP and triglycerides, GCKR and APOC1 were already genome-wide significant.
CRP and total cholesterol
Twenty potentially pleiotropic SNPs were identified combining CRP and total cholesterol (Table 4). The SNPs in or near ZNF644, SLC44A4, C7orf50 and RORA were potentially novel for total cholesterol. The variants near HLX, ABCG5, IL1F10, C7orf60 and CARM1 were not genome-wide significant in the GWAS on total cholesterol, but the loci were identified in this original GWAS. For CRP, ZNF664, CELSR2, HLX, IRF2BP2, ABCG5, GCNT4, SLC44A4, HLA-DQB1, FRK, ST3GAL4, CARM1 and NCAN were potentially novel compared to the univariate GWAS. The SNPs near ZNF644 and C7orf50 were novel pleiotropic loci for both CRP and total cholesterol.
Replication of the novel pleiotropic loci
In total, we sought replication for 36 potential novel SNPs for CRP in 17,743 genotyped individuals from three independent cohort studies. Using a Bonferroni corrected threshold for multiple testing (0.05/36 = 1.4 × 10−3), three SNPs remained significantly associated with CRP levels when we performed replication analysis (Additional file 1: Table S1). These variants included the SNPs rs10435719 in CTSB/FDFT1 (P replication = 2.6 × 10−5), rs1558902 near FTO (P replication = 2.7 × 10−5) and rs7621025 near STAG1 (P replication = 1.4 × 10−3).
We aimed replication for 23 potential novel SNPs for lipids (4 for LDL-cholesterol, 7 for HDL-cholesterol, 9 for triglycerides and 3 for total cholesterol) in an in silico analysis including 93,982 individuals. We could significantly replicate 2 variants for LDL-cholesterol (HNF4A and RASIP1), three for HDL-cholesterol (C1orf172, RSPO3 and STAG1), one for triglycerides (CTSB) and one for total cholesterol (C7orf50) (Additional file 1: Table S2).
Expression Quantitative Trait Loci (eQTL)
To annotate the effect of the replicated pleiotropic variants to the expression level of nearby genes, we investigated the association between the pleiotropic variants and gene expression levels in three different tissues relevant to CRP and lipids by use of large publicly available datasets: whole blood (N = 5311) , liver (N = 427  and 266 ) and adipose tissue  (N = 111). For the replicated pleiotropic variant rs10435719 near CTSB and FDFT1, we observed significant associations in whole blood with expression levels of two genes: CTSB itself (P = 1.67 × 10−6), and FDFT1 (P = 1.10 × 10−96). In addition, the SNP rs7621025 near STAG1 and PCCB was strongly associated with expression of the gene PCCB in whole blood (P = 1.1 × 10−40). No eQTL effect was observed in the liver and adipose tissue.
We identified fifty potential pleiotropic SNPs which affect both CRP and lipid levels, of which we replicated three novel CRP variants: rs10435719 (CTSB/FDFT1), rs7621025 (STAG1/PCCB) and rs1558902 (FTO). In silico expression analyses suggested a role for rs10435719 in the gene expression of both CTSB and FDFT1 and rs7621025 appeared to have an effect on the gene expression of PCCB.
The locus harboring rs10435719 near CTSB and FDFT1 that was identified for CRP in our study has previously been identified for triglycerides in the joint analysis of the Global Lipids Genetics Consortium combining GWAS data with Metabochip association results . We observed a significant effect of rs10435719 on the expression of both CTSB and FDFT1. The effect of the CRP increasing allele (T) was weakly associated with a decrease in the expression of CTSB, whilst we observed a strong association of the T-allele with an increase of FDFT1 gene expression. FDFT1 encodes the enzyme squalene synthase which is involved in the cholesterol biosynthesis . Apart from lipids, FDFT1 has been identified in a GWAS on fatty liver disease . Squalene Synthase Inhibitors (SSI) have been developed and are successful in the reduction of cholesterol levels as well as CRP levels . This pleiotropic effect of cholesterol synthesis blockers on both lipid levels and inflammation is thought to be the consequence of altered isoprenoids levels that may activate pro-inflammatory pathways . The observation that the CRP increasing allele is associated with an increase in FDFT1 gene expression suggests an effect of rs10435719 on serum CRP through FDFT1. However, we searched in large databases to identify robust eQTL effects of the novel variants. Therefore, we were unable to test the association between the expression and CRP and we cannot draw a firm conclusion on the causal effect of the gene expression in the association between the genetic variant and CRP.
We identified the SNP rs7621025 (STAG1/PCCB) as a pleiotropic variant for HDL-cholesterol and CRP. We confirmed the effect of rs7621025 on serum CRP in an independent set of individuals and this genomic region has been identified in a GWAS of lipids . The SNP rs7621025 is located within STAG1, but has a strong effect on the expression of PCCB, located ±300 kb downstream of rs7621025 on chromosome 3. PCCB has been identified in a GWAS of the protein fibrinogen, an acute phase response protein sharing many genes with CRP . Our results provide further evidence that the PCCB gene is involved in inflammation.
We identified the FTO gene as a pleiotropic locus for CRP and HDL-cholesterol. The A allele of rs1558902 was associated with an increase of CRP and a decrease in HDL cholesterol. In several GWAS on BMI, the A allele of rs1558902 was also associated with an increase in BMI [19, 20]. Previous studies have highlighted the causal effect of obesity on inflammation , and the effect directions are consistent with mediation of both the association with CRP and HDL-cholesterol by BMI. We have previously shown that the effect of FTO on CRP is indeed mediated through BMI . Further research is needed to demonstrate whether this is also true for HDL-cholesterol. Our results provide further evidence for the role of obesity in inflammation and highlight the pleiotropic effects of the FTO locus on both chronic inflammation and lipid metabolism.
Genetic pleiotropy can be divided in biological and mediated pleiotropy . In biological pleiotropy, the effect of the pleiotropic variant on two or more phenotypes is independent. In mediated pleiotropy, one phenotype mediates the association between the genetic variant and the second phenotype. Both biological and mediated pleiotropic effects may occur for CRP and lipids . In the current study, we did not disentangle the different subtypes of pleiotropy. Moreover, we observed pleiotropic variants with an opposite direction of effect than expected based on the phenotypical correlation in observational epidemiological studies. In biological pleiotropy, opposite directions of effect may occur. As an example, although CRP and LDL-cholesterol are positively associated in observational epidemiological studies, the A-allele of the SNP rs1183910 (HNF1A) is associated with lower CRP levels but higher LDL-cholesterol. Opposite direction of effects are often seen in genetic studies and highlight the complex interplay between correlated phenotypes, in our study CRP and lipids . We did not disentangle the different subtypes of pleiotropy, which is a limitation of the current study.
Our study has certain strengths. We add to previous studies showing that the multivariate method we applied can be effectively utilized to identify potential novel and pleiotropic loci. This method only requires GWAS summary data instead of individual level data from all participating cohorts. Thanks to close collaboration between studies across the world, researchers have performed large GWAS meta-analyses for a vast amount of phenotypes and this data is available for further research. Second, we used the largest GWAS meta-analyses that have so far been done on CRP and lipid levels to identify pleiotropic genetic loci. By doing so, we enhanced the statistical power to detect these loci considerably. Third, we provided robust evidence for three novel CRP loci by replication in an independent sample of genotyped individuals. A limitation of the bivariate meta-analysis is that very strong signals in one of the individual traits may overshadow the weak association with the other phenotype. We set a criterion for the univariate p-values <0.05 to minimize the chance of false positive findings. In many instances the effect of the pleiotropic loci on CRP or lipids is very small. We did not replicate all our pleiotropic loci. This could be due to lack of power in the replication. In concordance, we replicated a larger proportion of the lipid variants in the larger lipid replication sample compared to CRP. Also, variants closer to significance did replicate in the replication study of both CRP and lipids. Also, several variants had substantial heterogeneity I2 in the replication which lowers the power for replication. Furthermore, the replication sample size was for some variants smaller than 17,743 due to absence of the variants in one or more of the replication studies. However, we cannot rule out the possibility that bivariate p-values are driven by strong associations with one of the phenotypes and produce false positive results. In addition, for the replication of the lipid variants, we used the Metabochip results from the GLGC. Several variants selected for replication were not present on the Metabochip. Although we selected the best available proxy SNP for replication, variants in moderate LD may have limited power for replication. The method used in the current manuscript to prioritize variants with pleiotropic effects among inflammation and cholesterol are hypothesis generating and further functional work regarding the role of the identified variants in cholesterol metabolism and inflammation is necessary.
Our results provide evidence for substantial overlap in genetic susceptibility for chronic inflammation and lipid metabolism. In addition, through bivariate genome-wide association studies and replication in an independent sample of individuals we could identify novel genes for CRP.
The present study includes three stages. First, we performed a bivariate GWAS combining published GWAS data on CRP and lipids to identify pleiotropic variants for CRP and lipids. In a second step, we sought replication of novel associations in independent samples of genotyped individuals. Finally, we carried out functional analyses in a third step to point out potential underlying transcriptional mechanisms.
We used the data from the largest published GWAS on CRP as well as the publically available GWAS on lipids from GLGC to explore the genetic pleiotropy between inflammation and lipids [2, 3]. We combined summary association test statistics from the CRP GWAS separately with the GWAS on HDL-cholesterol, LDL-cholesterol, triglycerides and total cholesterol. The CRP GWAS meta-analysis included 65,000 individuals from 15 different studies in the discovery panel and after replication, 18 loci were genome-wide significantly associated with serum CRP level . The lipids GWAS comprised 100,184 individuals for total cholesterol, 95,454 for LDL-cholesterol, 99,900 for HDL-cholesterol and 96,598 for triglycerides across 46 studies. The lipid GWAS identified a total of 95 lipid loci (52 for total cholesterol, 37 for LDL-cholesterol, 47 for HDL-cholesterol and 32 for triglycerides) . The CRP and lipids GWAS used HapMap imputed data (build 36). All studies that contributed genotype data to the CRP GWAS also contributed data to the lipids GWAS. We ensured that effect alleles were harmonized across the two GWAS before applying the bivariate GWAS method. Overall, 2,501,549 common Single Nucleotide Polymorphisms (SNPs) were tested for their association with CRP and total cholesterol, 2,501,711 with CRP and triglycerides, 2,501,543 with CRP and HDL-cholesterol and 2,501,749 with CRP and LDL-cholesterol. An aggregated p-value was calculated using the method described below.
Bivariate genome-wide association study
To better understand the shared biology of CRP and lipids by further identifying shared genes between CRP and lipids, we aimed to increase power by combining the summary statistics from the CRP and lipid GWAS. We chose to use a recently introduced method that performs bivariate GWAS allowing for mixed directions of effect. The method combines summary statistics (Z test statistics) from univariate GWAS of CRP pairing with the summary statistics of each univariate GWAS meta-analysis of lipid phenotypes, using an empirical-weighted linear-combined test statistics (eLC), implemented in a C++ eLX package. We have recently used this method in the identification of pleiotropic genes for menopause and menarche and the details of the method are presented elsewhere. [8, 24]. eLC allows having opposite direction of effect on the combined phenotypes, which is common between CRP and cholesterol phenotypes [2, 3]. Briefly, eLC directly combines correlated Z test statistics (calculated as β/SE derived from the original GWAS) obtained from univariate GWAS meta-analyses with a weighted sum of univariate test statistics to empirically maximize the overall association signals and also to account for the phenotypical correlations among CRP and lipids. Our eLC approach is expressed as
where Tk is a matrix of K statistics for K phenotypes (for bivariate, K is equal to 2) and c is a given non-negative constant. The optimal weighting is estimated empirically using the Monte Carlo Simulation  and the bona-fide p-values for eLC test statistics are calculated through permutation. The sample covariance matrix of the test statistics of all SNPs from the univariate GWAS analyses is used as an approximation of the variance-covariance matrix Σ of univariate test statistics. Σ:
where Z 1 and Z 2 consist of unbiased univariate test statistics of all the SNPs for the two traits on genome-wide scale for the first (Z1) and second (Z2) trait. The null hypothesis in the bivariate analysis is β_1 = 0 AND β_2 = 0; the H1 is β_1 not equal to 0 or β_2 not equal to 0. The results were considered genome-wide significant when (1) the bivariate p-values were < 5 × 10−8 and (2) the bivariate p-value was at least one order of magnitude lower than both individual trait p-values and (3) when the individual trait p-values were at least nominally significant (p-value < 0.05). When multiple SNPs were significant in a locus, the SNP with the lowest p-value was chosen for replication. The eLC method is implemented in eLX package using C++ (see Weblinks).
The bivariate GWAS resulted in three possible scenarios. First, the pleiotropic variant or the locus harboring the pleiotropic variant (defined as ±500 MB of the pleiotropic SNP) was genome-wide significant in both the primary univariate GWAS of CRP and the lipid trait. Second, the pleiotropic signal was significant in either the CRP or the lipid univariate GWAS. Third, the pleiotropic signal was neither genome-wide significant in the CRP nor in the lipid GWAS. Per definition, a variant is considered pleiotropic when there is robust evidence for an association with two or more phenotypes. Therefore, we only selected the variants that were not genome-wide significant in the primary univariate GWAS for replication in an independent sample of genotyped samples. We intended to replicate the novel associations with CRP levels in three cohort studies that did not contribute to the original CRP GWAS. The independent cohorts were the second (n = 1943) and third (n = 2962) cohort of the Rotterdam Study and the LifeLines cohort study (n = 12,838; Additional file 1) [6, 7, 26, 27]. The total sample size for the replication of potentially novel CRP variants comprised 17,743 individuals. In an attempt to replicate the potential novel lipid variants, we performed an in silico replication in the publicly available association results from the participants of the GLGC that did not contribute to the original lipids GWAS we used for the pleiotropy analysis. This replication set comprises 93,982 individuals genotyped using the Metabochip array [13, 28]. For the SNPs that were not available on the Metabochip, we selected the best available proxy SNP on the Metabochip for replication (r 2 > 0.5). We used a Bonferroni corrected p-value of 0.05 divided by the number of SNPs tested for replication as a threshold of significance in the replication study.
Ethics, consent and permissions
All participants of the Rotterdam and Lifelines study provided written informed consent.
Expression Quantitative Trait Loci (eQTL)
In an attempt to annotate the pleiotropic variants to a pleiotropic gene, we searched in tissues related to lipids and inflammation for eQTL effects of the pleiotropic variants or reasonable proxy variants (r 2 > 0.80).
The eQTL analyses in whole blood comprised 5311 individuals from seven studies in the discovery setting with both genetic and gene expression data available . The discovery meta-analysis including the seven studies (EGCUT, InCHIANTI, Rotterdam Study, Fehrmann, HVH, SHIP-TREND and DILGOM). Results are publicly available (access URL: http://genenetwork.nl/bloodeqtlbrowser/). eQTLs were deemed cis when the distance between the SNP and the midpoint of the RNA probe was <250 kb. We only considered a significant eQTL effect of the pleiotropic SNP when the p-value exceeded the FDR corrected threshold for multiple testing.
We searched for liver eQTL effects by use of the eQTL browser provided by the university of Chicago (access URL: http://eqtl.uchicago.edu/cgi-bin/gbrowse/eqtl/). The liver tissue dataset by Schadt et al. comprised 427 individuals from European ancestry with liver specific gene expression and genotyping data available . An eQTL was deemed cis when the SNP was within 1 Mb of the annotated start or stop site of the corresponding structural gene. The authors used an FDR correction of 10 % for a significant association. The dataset by Innocenti et al. comprised 266 individuals from 2 different studies. Cis eQTL was defined as <250 kb from the gene transcription start site and the FDR for significant association was set to 5 % .
We used the GTEx adipose tissue dataset (access URL: http://www.gtexportal.org/home/eqtls/tissue?tissueName=Adipose_Subcutaneous) to search for potential eQTLs in adipose tissue. The dataset consisted of 111 individuals with both gene expression and genotype data available  Cis radius was defined as +/- 1mb from transcription start site. A eQTL was deemed significant when the FDR q-value < =5 %.
genome-wide association study
Deloukas P, Kanoni S, Willenborg C, Farrall M, Assimes TL, Thompson JR, Ingelsson E, Saleheen D, Erdmann J, Goldstein BA. Large-scale association analysis identifies new risk loci for coronary artery disease. Nat Genet. 2012. doi:10.1038/ng.2480.
Teslovich TM, Musunuru K, Smith AV, Edmondson AC, Stylianou IM, Koseki M, Pirruccello JP, Ripatti S, Chasman DI, Willer CJ. Biological, clinical and population relevance of 95 loci for blood lipids. Nature. 2010;466(7307):707–13.
Dehghan A, Dupuis J, Barbalic M, Bis JC, Eiriksdottir G, Lu C, Pellikka N, Wallaschofski H, Kettunen J, Henneman P. Meta-analysis of genome-wide association studies in > 80 000 subjects identifies multiple loci for C-reactive protein levels. Circulation. 2011;123(7):731–8.
Solovieff N, Cotsapas C, Lee PH, Purcell SM, Smoller JW. Pleiotropy in complex traits: challenges and strategies. Nat Rev Genet. 2013;14:483–95.
Albert MA, Danielson E, Rifai N, Ridker PM. Effect of statin therapy on C-reactive protein levels. JAMA. 2001;286(1):64–70.
Ridker PM, Danielson E, Fonseca FAH, Genest J, Gotto Jr AM, Kastelein JJP, Koenig W, Libby P, Lorenzatti AJ, MacFadyen JG. Reduction in C-reactive protein and LDL cholesterol and cardiovascular event rates after initiation of rosuvastatin: a prospective study of the JUPITER trial. Lancet. 2009;373(9670):1175–82.
Jain MK, Ridker PM. Anti-inflammatory effects of statins: clinical evidence and basic mechanisms. Nat Rev Drug Discov. 2005;4:977–87.
Perry JRB, Hsu Y-H, Chasman DI, Johnson AD, Elks C, Albrecht E, Andrulis IL, Beesley J, Berenson GS, Bergmann S. DNA mismatch repair gene MSH6 implicated in determining age at natural menopause. Hum Mol Genet. 2013. doi:10.1093/hmg/ddt620.
Westra H-J, Peters MJ, Esko T, Yaghootkar H, Schurmann C, Kettunen J, Christiansen MW, Fairfax BP, Schramm K, Powell JE. Systematic identification of trans eQTLs as putative drivers of known disease associations. Nat Genet. 2013;45(10):1238–43.
Schadt EE, Molony C, Chudin E, Hao K, Yang X, Lum PY, Kasarskis A, Zhang B, Wang S, Suver C. Mapping the genetic architecture of gene expression in human liver. PLoS Biol. 2008;6(5):e107.
Innocenti F, Cooper GM, Stanaway IB, Gamazon ER, Smith JD, Mirkov S, Ramirez J, Liu W, Lin YS, Moloney C. Identification, replication, and functional fine-mapping of expression quantitative trait loci in primary human liver tissue. PLoS Genet. 2011;7(5):e1002078.
Lonsdale J, Thomas J, Salvatore M, Phillips R, Lo E, Shad S, Hasz R, Walters G, Garcia F, Young N. The Genotype-Tissue Expression (GTEx) project. Nat Genet. 2013;45(6):580–5.
Global Lipids Genetics Consortium. Discovery and refinement of loci associated with lipid levels. Nat Genet. 2013. doi:10.1038/ng.2797.
Pandit J, Danley DE, Schulte GK, Mazzalupo S, Pauly TA, Hayward CM, Hamanaka ES, Thompson JF, Harwood HJ. Crystal structure of human squalene synthase. A key enzyme in cholesterol biosynthesis. J Biol Chem. 2000;275(39):30610–7.
Chalasani N, Guo X, Loomba R, Goodarzi MO, Haritunians T, Kwon S, Cui J, Taylor KD, Wilson L, Cummings OW. Genome-wide association study identifies variants associated with histologic features of nonalcoholic fatty liver disease. Gastroenterology. 2010;139(5):1567–76. e1566.
Stein EA, Bays H, O'Brien D, Pedicano J, Piper E, Spezzi A. Lapaquistat acetate development of a squalene synthase inhibitor for the treatment of hypercholesterolemia. Circulation. 2011;123(18):1974–85.
Schönbeck U, Libby P. Inflammation, immunity, and HMG-CoA reductase inhibitors statins as antiinflammatory agents? Circulation. 2004;109(21 suppl 1):II-18–26.
Dehghan A, Yang Q, Peters A, Basu S, Bis JC, Rudnicka AR, Kavousi M, Chen M-H, Baumert J, Lowe GDO. Association of novel genetic loci with circulating fibrinogen levels a genome-wide association study in 6 population-based cohorts. Circ Cardiovasc Genet. 2009;2(2):125–33.
Speliotes EK, Willer CJ, Berndt SI, Monda KL, Thorleifsson G, Jackson AU, Allen HL, Lindgren CM, Luan J, Mägi R. Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index. Nat Genet. 2010;42(11):937–48.
Locke AE, Kahali B, Berndt SI, Justice AE, Pers TH, Day FR, Powell C, Vedantam S, Buchkovich ML, Yang J. Genetic studies of body mass index yield new insights for obesity biology. Nature. 2015;518(7538):197–206.
Timpson NJ, Nordestgaard BG, Harbord RM, Zacho J, Frayling TM, Tybjærg-Hansen A, Smith GD. C-reactive protein levels and body mass index: elucidating direction of causation through reciprocal Mendelian randomization. Int J Obes. 2011;35(2):300–8.
Ligthart S, de Vries PS, Uitterlinden AG, Hofman A, Franco OH, Chasman DI, Dehghan A, CHARGE Inflammation working group. Pleiotropy among common genetic loci identified for cardiometabolic disorders and C-Reactive protein. PLoS One. 2015;10(3):e0118859.
van Diepen JA, Berbée JFP, Havekes LM, Rensen PCN. Interactions between inflammation and lipid metabolism: relevance for efficacy of anti-inflammatory drugs in the treatment of atherosclerosis. Atherosclerosis. 2013;228(2):306–15.
Hsu YH, Chen X. Identifying Pleiotropic Genetic Effects: A Two-Stage Approach Using Genome-Wide Association Meta-Analysis Data. https://sites.google.com/site/multivariateyihsianghsu/. Accessed 7 June 2016.
The Multi-Phenotype GWAS Analysis For Pleiotropic Genetic Effects [https://sites.google.com/site/multivariateyihsianghsu/]. Accessed 7 June 2016.
Hofman A, Murad SD, van Duijn CM, Franco OH, Goedegebure A, Ikram MA, Klaver CCW, Nijsten TEC, Peeters RP, Stricker BHC. The Rotterdam Study: 2014 objectives and design update. Eur J Epidemiol. 2013;28(11):889–926.
Almqvist C, Adami H-O, Franks PW, Groop L, Ingelsson E, Kere J, Lissner L, Litton J-E, Maeurer M, Michaëlsson K. LifeGene—a large prospective population-based study of global relevance. Eur J Epidemiol. 2011;26(1):67–77.
Voight BF, Kang HM, Ding J, Palmer CD, Sidore C, Chines PS, Burtt NP, Fuchsberger C, Li Y, Erdmann J. The metabochip, a custom genotyping array for genetic studies of metabolic, cardiovascular, and anthropometric traits. PLoS Genet. 2012;8(8):e1002793.
Data on lipid traits have been contributed by GLGC investigators and have been downloaded from http://www.broadinstitute.org/mpg/pubs/lipids2010/.
We thank Behrooz Z. Alizadeh, Annemieke Boesjes, Marcel Bruinenberg, Noortje Festen, Pim van der Harst, Ilja Nolte, Lude Franke, Mitra Valimohammadi for their help in creating the GWAS database, and Rob Bieringa, Joost Keers, René Oostergo, Rosalie Visser, Judith Vonk for their work related to data-collection and validation. The authors are grateful to the study participants, the staff from the LifeLines Cohort Study and Medical Biobank Northern Netherlands, and the participating general practitioners and pharmacists. Researchers interested in using the LifeLines data must obtain approval for a specific analysis plan from the scientific board of LifeLines to obtain access to the data. Researchers using the data are required to follow the terms of a signed agreement containing a number of clauses designed to ensure protection of privacy and compliance with relevant laws.
We thank Pascal Arp, Mila Jhamai, Marijn Verkerk, Lizbeth Herrera and Marjolein Peters for their help in creating the GWAS database in the Rotterdam Study, and Karol Estrada, Yurii Aulchenko and Carolina Medina-Gomez for the creation and analysis of imputed data. The authors are grateful to the study participants, the staff from the Rotterdam Study and the participating general practitioners and pharmacists.
SL carried out the pleiotropy analyses, the replication analyses in the Rotterdam cohorts and drafted the manuscript. AV analyzed the LifeLines cohort data and contributed to the manuscript writing. YHH developed the pleiotropy method, created the software and helped interpreting the results. RS, AGU, AH, BZA and OHF designed the study and critically reviewed the manuscript. AD designed the study, helped interpreting the results and critically reviewed the manuscript. All authors read and approved the final manuscript.
The authors declare that they have no competing interests.
Sources of funding
The LifeLines Cohort Study is supported by the Netherlands Organization of Scientific Research NWO (grant 175.010.2007.006), the Economic Structure Enhancing Fund (FES) of the Dutch government, the Ministry of Economic Affairs, the Ministry of Education, Culture and Science, the Ministry for Health, Welfare and Sports, the Northern Netherlands Collaboration of Provinces (SNN), the Province of Groningen, University Medical Center Groningen, the University of Groningen, Dutch Kidney Foundation and Dutch Diabetes Research Foundation. The authors wish to acknowledge the services of the LifeLines Cohort Study, the contributing research centers delivering data to LifeLines, and all the study participants.
The generation and management of GWAS genotype data for the Rotterdam Study is supported by the Netherlands Organisation of Scientific Research NWO Investments (nr. 175.010.2005.011, 911-03-012). This study is funded by the Research Institute for Diseases in the Elderly (014-93-015; RIDE2), the Netherlands Genomics Initiative (NGI)/Netherlands Organisation for Scientific Research (NWO) project nr. 050-060-810. The Rotterdam Study is funded by Erasmus Medical Center and Erasmus University, Rotterdam, Netherlands Organization for the Health Research and Development (ZonMw), the Research Institute for Diseases in the Elderly (RIDE), the Ministry of Education, Culture and Science, the Ministry for Health, Welfare and Sports, the European Commission (DG XII), and the Municipality of Rotterdam.
eLc methods in eLX package: https://sites.google.com/site/multivariateyihsianghsu/.
Study-specific Methods Section for the Replication Section. Table S1. Replication Results for C-Reactive Protein. Table S2. Replication Results for lipids. Table S3. Proxy variants for the Single Nucleotide Polymophisms not Available on the Metabochip Array. (DOCX 30 kb)
About this article
Cite this article
Ligthart, S., Vaez, A., Hsu, YH. et al. Bivariate genome-wide association study identifies novel pleiotropic loci for lipids and inflammation. BMC Genomics 17, 443 (2016). https://doi.org/10.1186/s12864-016-2712-4
- C-reactive protein
- Genome-wide association study
- Genetic pleiotropy