A meta-analysis of genome-wide association studies of follicular lymphoma
© Skibola et al.; licensee BioMed Central Ltd. 2012
Received: 12 June 2012
Accepted: 25 September 2012
Published: 1 October 2012
B-cell non-Hodgkin lymphoma represents a diverse group of hematological malignancies, of which follicular lymphoma (FL) is one of the most common subtypes. Family and epidemiological studies suggest an important genetic role in the etiology of FL. In recent genome-wide association studies (GWAS) of FL, several genetic susceptibility loci have been identified on chromosome 6p21.33 (rs6457327) and 6p21.32 (rs10484561, rs2647012) in the human leukocyte antigen class I and class II regions. To identify new genetic variants and further elucidate the genetic basis of FL, a meta-analysis was performed of the top 1000 SNPs associated with FL risk from two GWAS in the US, Denmark and Sweden (592 cases, 1541 controls), with independent validation in 107 cases and 681 controls.
rs9275517 and rs3117222 in the HLA class II region were validated and inversely associated with FL risk (rs9275517: OR = 0.63, 95% CI = 0.55-0.73, p = 4.03 × 10-11; rs3117222: OR = 0.66, 95% CI = 0.57-0.77, p = 1.45 × 10-7). rs9275517, which is in high linkage disequilibrium with rs2647012 (r2 = 0.9), was no longer associated with FL after conditioning on rs2647012. The rs3117222 association was independent of established FL SNPs, but not of the HLA-DPB1*0301 allele. Using publicly available gene expression profiles with matching genotype information, we found that rs3117222 also was significantly correlated with increased HLA-DPB1 expression.
By performing a meta-analysis of two GWAS of FL, we further validated the relevance of HLA-DPB1*0301 as a protective allele in the pathogenesis of FL. Moreover, the protective rs3117222 A allele correlated with increased levels of HLA-DPB1, suggesting a possible disease mechanism involving HLA-DPB1 expression regulation. Our results add further support to the major role of HLA genetic variation in the pathogenesis of FL.
KeywordsFollicular lymphoma (FL) Genome-wide association studies (GWAS) Human leukocyte antigen (HLA) Meta-analysis
Follicular lymphoma (FL) is a B-cell neoplasm that represents the second most common form of B-cell non-Hodgkin lymphoma (NHL). An important role for inherited genetic susceptibility for FL has been supported by recent genome-wide association studies (GWAS) where three independent susceptibility alleles have been identified in the HLA class I (rs6457327) and class II (rs10484561, rs2647012) regions[2, 3]. Follow-up HLA sequencing studies revealed that rs10484561 is in complete linkage disequilibrium (LD) with the DRB1*0101-DQA1*0101-DQB1*0501 extended haplotype, and rs2647012 is highly correlated with DRB1*15-DQA1*01-DQB1* 06. An independent inverse association with FL risk also was found for DPB1*0301. Because previous GWAS only attempted to validate the top 40 variants associated with FL[2, 3], here we conducted a meta-analysis of the top 1000 SNPs from existing GWAS data in 592 FL cases and 1541 controls from Denmark/Sweden (SCALE) and the San Francisco Bay Area (SF-NHL2) to identify new genetic variants and further elucidate the genetic basis of FL. Validation genotyping of associated SNPs was conducted in 107 FL cases and 681 controls from an independent NHL case–control study population (SF-NHL1). The effect of validated SNP genotypes on gene expression levels also was investigated using publicly available microarray data.
Results and discussion
Summary results for the validated follicular lymphoma associated single-nucleotide polymorphisms
Gene(s) within 5 kb (hg18)
Logistic OR (95% CI)
Logistic OR (95% CI)
Logistic OR (95% CI)
OR* (95% CI)
0.67 (0.53- 0.85)
0.67 (0.51- 0.88)
To determine whether rs9275517 and rs3117222 are independently associated with FL risk, we adjusted our models for the previously identified FL-risk loci, rs10484561, rs2647012, and rs6457327[1–3]. The results showed that rs9275517 was no longer associated with FL after conditioning on rs2647012 in any of the studies (p-valueSF = 0.31, p-valueSCALE = 0.63, Additional file4: Table S4). Further, rs9275517 and rs2647012 are in high LD (r2 = 0.9 in HapMap-CEU), also suggesting that the signal observed for rs9275517 is not independent of the previously validated protective rs2647012 allele. In contrast, rs3117222 maintained independent statistical significance when each SNP was fitted in the logistic regression model (p < 0.02, Additional file4: Table S4). Because rs3117222 maps 6 kb downstream of the HLA-DPB1 gene, we also tested the independence of rs3117222 with HLA-DPB1 alleles including DPB1*0301, an allele inversely associated with FL based on HLA sequencing studies of the SF-NHL2 study population. Haplotype analysis in the SF-NHL2 data showed a significant difference in the frequencies of the HLA-DPB1*0301-rs3117222 haplotype in cases versus controls (p-value = 9.00 × 10-4, Additional file5: Table S5) and the LD analysis suggested that rs3117222 was in LD with HLA-DPB1*0301 (r2 = 0.24, D' = 0.96). When DPB1*0301 was included in the model, rs3117222 was no longer associated with FL (p-value = 0.33), indicating that although rs3117222 is independent of previously reported FL-associated SNPs, its influence on FL risk cannot be delineated from the protective HLA-DPB1*0301 allelic association. Nonetheless, through rs3117222, this study provides the first validation of the HLA-DPB1 locus as protective in the pathogenesis of FL in two additional independent studies (SCALE and SF-NHL1).
In the present study, we found that rs3117222 was inversely associated with FL risk independent of previously published FL SNPs, but not independent of the HLA-DPB1*0301 allele. Elevated levels of HLA-DPB1 transcripts found in the presence of the protective rs3117222 A allele suggest that changes in HLA-DPB1 expression may be involved in the etiology of FL. These findings emphasize the important role of HLA genetic variation in FL disease etiology and the need for further studies to clarify the mechanisms through which HLA class II expression contributes to FL pathogenesis.
Description of GWAS subjects, genotyping and statistical analyses
Full details of the study design and methods, genotyping, quality control and statistical analyses have been described elsewhere. A total of 312,768 markers genotyped in 213 FL cases and 750 controls passed our quality control criteria and were used for genome-wide association analysis. Population stratification and cryptic relatedness were tested and corrected for as described previously, resulting in a final inflation factor λ = 1.04. Associations with FL were tested using a Cochran-Armitage trend test in PLINK 1.07. Odd ratios (OR) and 95% confidence intervals (95% CI) were calculated using the mid-p method from the epitools package in R.
Full details of the study design and methods, genotyping, quality control and statistical analyses have been published. The final analysis included 298,680 genetic variants available for 379 cases and 791 controls. Study subjects with evidence of cryptic family relationships or outliers in terms of population stratification were removed as described previously, resulting in a final inflation factor λ = 1.028. Wald tests, treating minor allele counts as continuous covariates were used to test for association.
Validation study (SF-NHL1)
Full details of this population-based San Francisco Bay Area case–control study of NHL (1,591 cases, 2,515 controls) have been published[12, 13]. Here, a subset of HIV-negative, non-Hispanic white individuals (107 FL cases, 681 controls) were used for validation.
The SF-NHL1 and SF-NHL2 study protocols were approved by the UCSF ethics review committee (Nos. 10–03698 and 10–00629, respectively), and the SCALE study was approved by the Ethical Review Board at Karolinska Institutet, Stockholm, Sweden (Dnr 99–154, 2007/624-32). All study participants provided informed consent.
Meta-analysis of the SF-NHL2 and SCALE GWAS
We selected for meta-analysis the top 1000 SNPs most significantly associated with FL in the SF and SCALE GWAS. Among these SNPs, 13 (rs9275572, rs10484561, rs6457617, rs2858331, rs3763313, rs7755224, rs1493202, rs2857106, rs2157051, rs12529049, rs10511017, rs587791 and rs3130617) were present in the top 1000 of both GWAS. Selected SCALE GWAS SNPs that were not genotyped or did not pass direct genotyping quality control in the SF study were imputed using BEAGLE 3.0.3 with haplotype data from HapMap phase II-CEU as a reference. Similarly, selected SF GWAS SNPs that were not genotyped in the SCALE study were imputed from the 1000 Genomes pilot1 CEU (August 2009 release) and the HapMap Phase II release 22 CEU datasets using IMPUTEv1. ORs for each individual study were estimated using unconditional logistic regression under an additive model in PLINK. Before meta-analysis, genomic control (GC) was applied to each study by multiplying the standard error of the effect estimates by the square root of the study-specific inflation factor (λ = 1.04 in SF, λ = 1.03 in SCALE). ORs and GC-corrected standard errors were then combined in a meta-analysis under fixed- and random-effects inverse variance models using the metagen function from the meta package in R. Heterogeneity across studies was tested with the Cochran’s Q test and quantified with the I2 heterogeneity index.
Validation - SNP selection and analysis
We selected for validation those SNPs with a random-effects p-value threshold <10-4 in the meta-analysis. To focus on newly associated loci not previously reported, the top 40 SNPs associated with FL from each GWAS were excluded from further validation, as these had been tested previously[2, 3]. For the remaining markers, we used PLINK to group SNPs in LD based on genotypes from HapMap-CEU r28, resulting in 20 independent loci (r2 < 0.05). Eleven of these SNPs were taken forward to the validation stage based on 1) lowest meta-analysis p-value and 2) evidence of a secondary signal at each locus in the original GWAS. Association analyses were conducted using trend and logistic regression tests in PLINK 1.07. P-values were adjusted for multiple comparisons using the Benjamini-Hochberg FDR correction from the p-adjust function in R and considered significant at a FDR adjusted p-value level = 0.05. ORs and 95% CI were calculated for the variant allele carriers using the epitools package. To estimate overall association in the discovery and validation phases, logistic regression ORs and standard errors for the validated SNPs were combined with GC-corrected estimates from the original GWAS in a meta-analysis under fixed- and random-effects models.
Statistical analysis adjusting for previously associated variants
Logistic regression analysis was conducted adjusting for the additive effects of FL-associated SNPs that were entered separately into the model as covariates using a 0,1,2 allele dosage coding. The same approach was used in the SF-NHL2 dataset to test for independent effects between SNPs and HLA alleles that were previously typed in 205 FL cases and 82 controls as part of the SF GWAS. HLA alleles were coded as binary alleles where A = present and B = absent.
Linkage disequilibrium (LD) and haplotype analysis
LD metrics between SNPs were based on European samples (CEU) from HapMap release 28. LD between rs3117222 and HLA-DPB1 alleles were based on genotype and HLA typing data from the SF GWAS study. Haplotype analyses were carried out with Haploview.
Gene expression analysis
Associations with gene expression were investigated using two publicly available datasets, one with 166 adipose, 160 skin, and 156 lymphoblastoid cell lines derived from a subset of healthy female twins of the MuTHER study, and a second dataset with three cell types (fibroblast, lymphoblastoid and T-cell) derived from umbilical cords of 75 Geneva GenCord individuals. Correlation between SNPs and gene expression levels was assessed by the Spearman rank correlation test with p-values adjusted by permutations using the Genevar application.
Diffuse large B-cell lymphoma
Genome-wide association studies
Human leukocyte antigen
Single nucleotide polymorphism.
SF-NHL1/2 : SF-NHL2 and SF-NHL1 studies were supported by National Institutes of Health grants CA122663, CA154643-01A1 and CA104682 (C.F.S.) and grants CA45614 and CA89745 from the National Cancer Institute, National Institutes of Health (P.M.B). SCALE : We are grateful to professor L Klareskog, Center for Molecular Medicine, and L Alfredsson, Institute of Environmental Medicine, at the Karolinska Institute, Stockholm, Sweden for sharing DNA from their EIRA study control population. The SCALE study is supported financially through the Agency for Science & Technology and Research of Singapore (A*STAR), the National Cancer Institute (CA69269-02), the Swedish Cancer Society (09 0659, 07 0653 to K.E.S.), the Swedish Research Council (K2008-64X-20737-01-2 to K.E.S.) and the Danish Medical Research Council.
- Skibola CF, Bracci PM, Halperin E, Conde L, Craig DW, Agana L, Iyadurai K, Becker N, Brooks-Wilson A, Curry JD, Spinelli JJ, Holly EA, Riby J, Zhang L, Nieters A, Smith MT, Brown KM: Genetic variants at 6p21.33 are associated with susceptibility to follicular lymphoma. Nat Genet. 2009, 41: 873-875. 10.1038/ng.419.PubMed CentralView ArticlePubMed
- Conde L, Halperin E, Akers NK, Brown KM, Smedby KE, Rothman N, Nieters A, Slager SL, Brooks-Wilson A, Agana L, Riby J, Liu J, Adami HO, Darabi H, Hjalgrim H, Low HQ, Humphreys K, Melbye M, Chang ET, Glimelius B, Cozen W, Davis S, Hartge P, Morton LM, Schenk M, Wang SS, Armstrong B, Kricker A, Milliken S, Purdue MP, Vajdic CM, Boyle P, Lan Q, Zahm SH, Zhang Y, Zheng T, Becker N, Benavente Y, Boffetta P, Brennan P, Butterbach K, Cocco P, Foretova L, Maynadié M, de Sanjosé S, Staines A, Spinelli JJ, Achenbach SJ, Call TG, Camp NJ, Glenn M, Caporaso NE, Cerhan JR, Cunningham JM, Goldin LR, Hanson CA, Kay NE, Lanasa MC, Leis JF, Marti GE, Rabe KG, Rassenti LZ, Spector LG, Strom SS, Vachon CM, Weinberg JB, Holly EA, Chanock S, Smith MT, Bracci PM, Skibola CF: Genome-wide association study of follicular lymphoma identifies a risk locus at 6p21.32. Nat Genet. 2010, 42: 661-664. 10.1038/ng.626.PubMed CentralView ArticlePubMed
- Smedby KE, Foo JN, Skibola CF, Darabi H, Conde L, Hjalgrim H, Kumar V, Chang ET, Rothman N, Cerhan JR, Brooks-Wilson AR, Rehnberg E, Irwan ID, Ryder LP, Brown PN, Bracci PM, Agana L, Riby J, Cozen W, Davis S, Hartge P, Morton LM, Severson RK, Wang SS, Slager SL, Fredericksen ZS, Novak AJ, Kay NE, Habermann TM, Armstrong B, Kricker A, Milliken S, Purdue MP, Vajdic CM, Boyle P, Lan Q, Zahm SH, Zhang Y, Zheng T, Leach S, Spinelli JJ, Smith MT, Chanock SJ, Padyukov L, Alfredsson L, Klareskog L, Glimelius B, Melbye M, Liu ET, Adami HO, Humphreys K, Liu J: GWAS of follicular lymphoma reveals allelic heterogeneity at 6p21.32 and suggests shared genetic susceptibility with diffuse large B-cell lymphoma. PLoS Genet. 2011, 7: e1001378-10.1371/journal.pgen.1001378.PubMed CentralView ArticlePubMed
- Skibola CF, Akers NK, Conde L, Ladner M, Hawbecker SK, Cohen F, Ribas F, Erlich HA, Goodridge D, Trachtenberg EA, Smith MT, Bracci PM: Multi-locus HLA class I and II allele and haplotype associations with follicular lymphoma. Tissue Antigens. 2012, 79: 279-286. 10.1111/j.1399-0039.2012.01845.x.PubMed CentralView ArticlePubMed
- Nica AC, Parts L, Glass D, Nisbet J, Barrett A, Sekowska M, Travers M, Potter S, Grundberg E, Small K, Hedman AK, Bataille V, Tzenova Bell J, Surdulescu G, Dimas AS, Ingle C, Nestle FO, di Meglio P, Min JL, Wilk A, Hammond CJ, Hassanali N, Yang TP, Montgomery SB, O'Rahilly S, Lindgren CM, Zondervan KT, Soranzo N, Barroso I, Durbin R, Ahmadi K, Deloukas P, McCarthy MI, Dermitzakis ET, Spector TD: MuTHER Consortium the architecture of gene regulatory variation across multiple human tissues: the MuTHER study. PLoS Genet. 2011, 7:
- Dimas AS, Deutsch S, Stranger BE, Montgomery SB, Borel C, Attar-Cohen H, Ingle C, Beazley C, Gutierrez Arcelus M, Sekowska M, Gagnebin M, Nisbett J, Deloukas P, Dermitzakis ET, Antonarakis SE: Common regulatory variation impacts gene expression in a cell type-dependent manner. Science. 2009, 325: 1246-1250. 10.1126/science.1174148.PubMed CentralView ArticlePubMed
- Diepstra A, van Imhoff GW, Karim-Kos HE, van den Berg A, te Meerman GJ, Niens M, Nolte IM, Bastiaannet E, Schaapveld M, Vellenga E, Poppema S: HLA class II expression by Hodgkin Reed-Sternberg cells is an independent prognostic factor in classical Hodgkin's lymphoma. J Clin Oncol. 2007, 25: 3101-3108. 10.1200/JCO.2006.10.0917.View ArticlePubMed
- Rimsza LM, Farinha P, Fuchs DA, Masoudi H, Connors JM, Gascoyne RD: HLA-DR protein status predicts survival in patients with diffuse large B-cell lymphoma treated on the MACOP-B chemotherapy regimen. Leuk Lymphoma. 2007, 48: 542-546. 10.1080/10428190601078605.View ArticlePubMed
- Maier LA, McGrath DS, Sato H, Lympany P, Welsh K, Du Bois R, Silveira L, Fontenot AP, Sawyer RT, Wilcox E, Newman LS: Influence of MHC class II in susceptibility to beryllium sensitization and chronic beryllium disease. J Immunol. 2003, 171: 6910-6918.View ArticlePubMed
- Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, Sham PC: PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007, 81: 559-575. 10.1086/519795.PubMed CentralView ArticlePubMed
- Package epitools, R. http://cran.r-project.org/web/packages/epitools/)
- Holly EA, Bracci PM: Population-based study of non-Hodgkin lymphoma, histology, and medical history among human immunodeficiency virus-negative participants in San Francisco. Am J Epidemiol. 2003, 158: 316-327. 10.1093/aje/kwg145.View ArticlePubMed
- Skibola CF, Bracci PM, Halperin E, Nieters A, Hubbard A, Paynter RA, Skibola DR, Agana L, Becker N, Tressler P, Forrest MS, Sankararaman S, Conde L, Holly EA, Smith MT: Polymorphisms in the estrogen receptor 1 and vitamin C and matrix metalloproteinase gene families are associated with susceptibility to lymphoma. PLoS One. 2008, 3:
- Browning BL, Browning SR: A fast, powerful method for detecting identity by descent. Am J Hum Genet. 2011, 88: 173-182. 10.1016/j.ajhg.2011.01.010.PubMed CentralView ArticlePubMed
- Marchini J, Howie B, Myers S, McVean G, Donnelly P: A new multipoint method for genome-wide association studies by imputation of genotypes. Nat Genet. 2007, 39: 906-913. 10.1038/ng2088.View ArticlePubMed
- Barrett JC, Fry B, Maller J, Daly MJ: Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics. 2005, 21: 263-265. 10.1093/bioinformatics/bth457.View ArticlePubMed
- Yang TP, Beazley C, Montgomery SB, Dimas AS, Gutierrez-Arcelus M, Stranger BE, Deloukas P, Dermitzakis ET: Genevar: a database and Java application for the analysis and visualization of SNP-gene associations in eQTL studies. Bioinformatics. 2010, 26: 2474-2476. 10.1093/bioinformatics/btq452.PubMed CentralView ArticlePubMed
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.