- Open Access
Characterization of familial breast cancer in Saudi Arabia
BMC Genomics volume 16, Article number: S3 (2015)
The contribution of genetic factors to the development of breast cancer in the admixed and consanguineous population of the western region of Saudi Arabia is thought to be significant as the disease is early onset. The current protocols of continuous clinical follow-up of relatives of such patients are costly and cause a burden on the usually over-stretched medical resources. Discovering the significant contribution of BRCA1/2 mutations to breast cancer susceptibility allowed for the design of genetic tests that allows the medical practitioner to focus the care for those who need it most. However, BRCA1/2 mutations do not account for all breast cancer susceptibility genes and there are other genetic factors, known and unknown that may play a role in the development of such disease. The advent of whole-exome sequencing is offering a unique opportunity to identify the breast cancer susceptibility genes in each family of sufferers. The polymorphisms/mutations identified will then allow for personalizing the genetic screening tests accordingly. To this end, we have performed whole-exome sequencing of seven breast cancer patients with positive family history of the disease using the Agilent SureSelect™ Whole-Exome Enrichment kit and sequencing on the SOLiD™ platform.
We have identified several coding single nucleotide variations that were either novel or rare affecting genes controlling DNA repair in the BRCA1/2 pathway.
The disruption of DNA repair pathways is very likely to contribute to breast cancer susceptibility in the Saudi population.
The discovery of the BRCA1 and BRCA2 genes as major breast cancer susceptibility genes led to great advances in the genetic screening for the disease and the understanding of its inheritance [1, 2]. Several other genes were found to play a role in increasing susceptibility to breast cancer but at a markedly lower frequency and penetrance. These genes include ATM, TP53, CHECK2, PTEN, STK11, PALB2, BRIP and the RAD51 genes [3–11]. GWAS studies led to the identification of 21 susceptibility loci that are considered only as low risk alleles [9, 12–17]. All these factors combined can account for only 35% of heritable breast cancer with the majority of cases remain with an unknown genetic etiology . This problem is confounded for the admixed and consanguineous population of the western region of Saudi Arabia where virtually no research has been done so far to elucidate the genetic background of heritable breast cancer. A remarkable characteristic of breast cancer in this population is the relatively younger age of onset of the disease where the majority of cases (sporadic or familial) are diagnosed with invasive ductal carcinoma before they are 50 years old . This early onset could be attributed, at least partly, to undetermined genetic susceptibility factors accumulating in the population due to consanguineous marriages and increased exposure to environmental insults due to life-style shifts in the past two decades.
Sanger sequencing of all known breast cancer susceptibility genes could be a daunting task. Developments in massively parallel sequencing technology and whole-exome sequencing alleviate many of the problems associated with such approach and allow for the simultaneous determination of known factors as well as the discovery of novel ones. And in the age of personalized medicine, whole-exome sequencing of each breast cancer patient is fast becoming a standard approach towards genetic diagnosis . In the present study we employed whole-exome sequencing of seven cases diagnosed with familial breast cancer and with unknown BRCA1 or BRCA2 status. We determined the BRCA1 and BRCA2 status in these cases and report the identification of several rare variants that can potentially explain breast cancer susceptibility in each case analyzed.
Materials and methods
Patients were selected for this study if they have a first-degree relative(s) diagnosed with breast cancer. Peripheral blood was obtained from the patients following obtaining their informed consent and their family history of breast cancer. Patients’ recruitment and blood sampling was all performed according to the institutional ethical procedures (Additional file 1 Figure S1). Genomic DNA was prepared using the Qiagen QIAamp DNA Blood Mini kit according to the manufacturer’s recommendations.
Whole-exome sequencing and SNP genotyping
Three micrograms of genomic DNA was sheared using the Covaris S2 system. Exome capture was performed on seven cases and six non-cancer controls using the SureSelect Whole-Exome Enrichment version 2 kit from Agilent. Fragment libraries were prepared from the captured exomes for sequencing on the SOLiD 4 platform (AB). Sequencing for each library was performed on one part of the quad slide and fragments were sequenced in in single reads of 50 bp. Sequence capture and primary analysis were performed by the instruments ICS and SETS softwares. SNP genotyping using Taqman was performed using assay ID (C___7530120_20) from Life Technologies targeting the rs1799950 SNP. Genotyping was performed on DNA from peripheral blood of breast cancer patients or non-cancer controls.
Color-space sequences in .csfasta and .qual files were exported to LifeScope software were mapping to the human genome version 19 (hg19) was performed using standard settings. Identification of single nucleotide polymorphisms was achieved by the diBayes software incorporated in the LifeScope pipeline. Variant call format (vcf) files were analyzed using the SNPs & Variation Suite 7 (SVS7) from Golden Helix where short-listing of candidate SNVs was performed by filtering the detected SNVs to include only those with more than 10x coverage and MQV of >=20. Rare variants were identified by filtering out SNVs present in the 1000genomes or NHLBI Exome sequencing data. Disease-associated SNVs were determined following filtering out rare SNVs found in the 6 non-cancer control cases from the same ethnic background. Damaging nonsynonymous variations were determined by the SIFT, PolyPhen or Mutation Taster softwares within the SVS7 suite.
Exome sequencing revealed several single nucleotide variants affecting key genes that could be involved in increased susceptibility to breast cancer. The single nucleotide variants or short indels obtained for every sample were filtered against the NHLBI Exome project and the 1000genomes project databases. Novel or rare variants (MAF of <0.01) were filtered against our in-house database of exome sequencing of non-cancer patients or healthy individuals. The statistics of each breast cancer exome sequenced are shown in Table 1. The mutational status of BRCA1 and BRCA2 in the sequenced samples was unknown. Therefore, variants affecting those genes were analyzed first. We have identified one novel frameshift mutation affecting BRCA2 caused by an -/AC insertion affecting one patient only (Table 2). Other BRCA1 or BRCA2 variants identified were previously reported in dbSNP137. However, the nonsense variant represented by SNP rs80358972 is very rare and no information about its MAF could be found. We have found this variant in one BC patient only. Other Missense single nucleotide variants affecting BRCA1 and BRCA2 were identified. However, when selection is based on rarity and degree of predicted damage to the protein, SNP rs1799950 is found in one patient. In order to determine the frequency of the rs1799950 SNP in our cohort, we performed Taqman® SNP genotyping assay on DNA obtained from the peripheral blood of 204 breast cancer patients samples as well as 120 non-cancer controls. The rs1799950 SNP was in a highly significant Hardy-Weinberg disequilibrium in the patient group (X2=133.124) compared to the control group (X2=0.108). The GG state of the rs1799950 SNP is significantly associated with breast cancer compared to the AA and AG states combined (p=0.0003, OR=22.79, CI=1.366-380.1).
Predisposition to breast cancer is often caused by genetic defects in DNA repair mechanisms. Therefore, SNVs affecting known genes with DNA repair function were examined (Table 3). In addition, SNVs were also identified affecting the APC, EGF and EGFR genes. An interesting mutation c.148G>A / p.Ala62Thr is found affecting the PARP1-interacting region of the Cockayne Syndrome group B (ERCC6) gene. Analysis of DNA from the family of the affected female revealed that this mutation segregated in the heterozygous state in one sibling affected with breast cancer as well as in the mother who also suffered from breast cancer. The father did not harbor this mutation (Figure 1). This SNV was recently reported by the 1000Genome project (rs186839348) where it was found only once in 1094 individuals. We could not detect this SNV in 228 non-cancer control samples from Saudi Arabia.
Breast cancer incidence is on the rise in the Kingdom of Saudi Arabia with a remarkable number of those affected are being diagnosed before they are 50 years old . The early-onset of the breast cancer in this population could be partly explained by the accumulation of breast cancer predisposition genetic factor(s) due to high incidence of consanguineous marriages. The effects of these genetic factor(s) is probably becoming more evident now due to the social and life-style changes brought upon by the relatively recent positive economical upheavals in the country. In order to identify such genetic factors, we performed a pilot whole-exome sequencing study on DNA obtained from the peripheral blood of seven cases suffering from hereditary breast cancer. First, the status of the known breast cancer predisposition factors, mainly BRCA1 and BRCA2, was determined. We could not identify recurrent BRCA1/2 mutations in our cohort. However, we identified a novel insertion that led to a frameshift mutation (p.Thr363fs) in BRCA2 causing the synthesis of a truncated and presumably dysfunctional protein. We identified another rare mutation in BRCA2 in one of our patients. Represented by the rs80358972 SNP, the p.Arg2494Stop affecting BRCA2 has been reported by the Breast Cancer Information Core submitted by Myriad Genetics as a direct result of their diagnostic services. Additionally, we have identified the relatively rare rs1799950 SNP in BRCA1 which is a p.Gln356Arg mutation reported by the 1000Genomes project to have an MAF of 0.026. We found this SNP in our cohort with a MAF of 0.058 (7 heterozygous cases in 120 non-cancer cases). The minor allele frequency of the rs1799950 SNP did not differ significantly from controls. However, we observed an increase in the number of breast cancer cases displaying the homozygous GG minor allele state that is not seen in the control cases. When the GG state is analyzed in comparison to the combined frequency of the AA and AG states, a highly significant association with breast cancer becomes evident. The rs1799950 SNP is one of 25 SNPs in cancer predisposition genes that were identified to confer minor but cumulatively significant risk of breast cancer . However, a later study dismissed the association of the rs1799950 SNP with breast cancer . Unfortunately, it is difficult to perform direct comparisons between our findings and reported studies due to the differences in sample size and the ethnic makeup of the cohorts analyzed.
Whole-exome sequencing revealed several candidate risk factors for breast cancer. We made the assumption that the most likely risk factor is a gene(s) involved in DNA repair, cell cycle or apoptosis . Applying this filter to the SNVs obtained reveal rare polymorphisms that could affect important genes such as WRN, APC, EGF, EGFR and ERCC6. The contribution of these SNVs towards increasing predisposition to breast cancer remains unknown. Therefore, we analyzed the segregation with breast cancer of the SNVs affecting ERCC6 (p.Ala62Thr) and WRN (p.Ala616Pro) in a family with reported breast cancer affecting three generations (case_574). The WRN p.Ala616Pro was detectable in the two siblings diagnosed with breast cancer. However, this SNV could not be found in the mother who died of breast cancer. In contrast, the ERCC6 p.Ala62Thr SNV segregated with breast cancer in the same family and it was not detectable in the father or control samples. This mutation affects the PARP1-interaction region of ERCC6, also known as Cockayne Syndrome group B (CSB) . ERCC6-dependent activation of the poly(ADP-ribose)polymerases, or PARPs is an early event in the cellular response to genotoxic stress . Carrying a variant ERCC6 therefore will cause a less-efficient DNA repair response and could therefore lead to an increased predisposition to breast cancer.
This is the first report on the breast cancer predisposition factors in the population of the Kingdom of Saudi Arabia. The high consanguinity and life-style shifts in this population are coupled to an early-onset breast cancer and the SNVs identified in this study could partly explain this phenomenon. We have identified a novel BRCA2 mutation as well as found a case with a very rare nonsense mutation truncating the BRCA2 protein. We demonstrate the potential importance of the homozygous risk allele to breast cancer predisposition in the Saudi population. We suggest that mutations in the ERCC6 gene could be considered as potential risk factors for breast cancer. Although no recurrent mutations were identified, this study validates the use of whole-exome sequencing for the determination of the “breast cancer predisposition genome”.
Miki Y, Swensen J, Shattuck-Eidens D, Futreal PA, Harshman K, Tavtigian S, Liu Q, Cochran C, Bennett LM, Ding W, et al: A strong candidate for the breast and ovarian cancer susceptibility gene BRCA1. Science. 1994, New York, NY, 266 (5182): 66-71. 10.1126/science.7545954.
Wooster R, Bignell G, Lancaster J, Swift S, Seal S, Mangion J, Collins N, Gregory S, Gumbs C, Micklem G: Identification of the breast cancer susceptibility gene BRCA2. Nature. 1995, 378 (6559): 789-792. 10.1038/378789a0.
Borresen AL, Andersen TI, Garber J, Barbier-Piraux N, Thorlacius S, Eyfjord J, Ottestad L, Smith-Sorensen B, Hovig E, Malkin D, et al: Screening for germ line TP53 mutations in breast cancer patients. Cancer research. 1992, 52 (11): 3234-3236.
Lynch ED, Ostermeyer EA, Lee MK, Arena JF, Ji H, Dann J, Swisshelm K, Suchard D, MacLeod PM, Kvinnsland S, et al: Inherited mutations in PTEN that are associated with breast cancer, cowden disease, and juvenile polyposis. American journal of human genetics. 1997, 61 (6): 1254-1260. 10.1086/301639.
Giardiello FM, Brensinger JD, Tersmette AC, Goodman SN, Petersen GM, Booker SV, Cruz-Correa M, Offerhaus JA: Very high risk of cancer in familial Peutz-Jeghers syndrome. Gastroenterology. 2000, 119 (6): 1447-1453. 10.1053/gast.2000.20228.
Meijers-Heijboer H, van den Ouweland A, Klijn J, Wasielewski M, de Snoo A, Oldenburg R, Hollestelle A, Houben M, Crepin E, van Veghel-Plandsoen M, et al: Low-penetrance susceptibility to breast cancer due to CHEK2(*)1100delC in noncarriers of BRCA1 or BRCA2 mutations. Nature genetics. 2002, 31 (1): 55-59. 10.1038/ng879.
Renwick A, Thompson D, Seal S, Kelly P, Chagtai T, Ahmed M, North B, Jayatilake H, Barfoot R, Spanova K, et al: ATM mutations that cause ataxia-telangiectasia are breast cancer susceptibility alleles. Nature genetics. 2006, 38 (8): 873-875. 10.1038/ng1837.
Seal S, Thompson D, Renwick A, Elliott A, Kelly P, Barfoot R, Chagtai T, Jayatilake H, Ahmed M, Spanova K, et al: Truncating mutations in the Fanconi anemia J gene BRIP1 are low-penetrance breast cancer susceptibility alleles. Nature genetics. 2006, 38 (11): 1239-1241. 10.1038/ng1902.
Rahman N, Seal S, Thompson D, Kelly P, Renwick A, Elliott A, Reid S, Spanova K, Barfoot R, Chagtai T, et al: PALB2, which encodes a BRCA2-interacting protein, is a breast cancer susceptibility gene. Nature genetics. 2007, 39 (2): 165-167. 10.1038/ng1959.
Meindl A, Hellebrand H, Wiek C, Erven V, Wappenschmidt B, Niederacher D, Freund M, Lichtner P, Hartmann L, Schaal H, et al: Germline mutations in breast and ovarian cancer pedigrees establish RAD51C as a human cancer susceptibility gene. Nature genetics. 2010, 42 (5): 410-414. 10.1038/ng.569.
Loveday C, Turnbull C, Ramsay E, Hughes D, Ruark E, Frankum JR, Bowden G, Kalmyrzaev B, Warren-Perry M, Snape K, et al: Germline mutations in RAD51D confer susceptibility to ovarian cancer. Nature genetics. 2011, 43 (9): 879-882. 10.1038/ng.893.
Easton DF, Pooley KA, Dunning AM, Pharoah PD, Thompson D, Ballinger DG, Struewing JP, Morrison J, Field H, Luben R, et al: Genome-wide association study identifies novel breast cancer susceptibility loci. Nature. 2007, 447 (7148): 1087-1093. 10.1038/nature05887.
Stacey SN, Manolescu A, Sulem P, Rafnar T, Gudmundsson J, Gudjonsson SA, Masson G, Jakobsdottir M, Thorlacius S, Helgason A, et al: Common variants on chromosomes 2q35 and 16q12 confer susceptibility to estrogen receptor-positive breast cancer. Nature genetics. 2007, 39 (7): 865-869. 10.1038/ng2064.
Ahmed S, Thomas G, Ghoussaini M, Healey CS, Humphreys MK, Platte R, Morrison J, Maranian M, Pooley KA, Luben R, et al: Newly discovered breast cancer susceptibility loci on 3p24 and 17q23.2. Nature genetics. 2009, 41 (5): 585-590. 10.1038/ng.354.
Thomas G, Jacobs KB, Kraft P, Yeager M, Wacholder S, Cox DG, Hankinson SE, Hutchinson A, Wang Z, Yu K, et al: A multistage genome-wide association study in breast cancer identifies two new risk alleles at 1p11.2 and 14q24.1 (RAD51L1). Nature genetics. 2009, 41 (5): 579-584. 10.1038/ng.353.
Turnbull C, Ahmed S, Morrison J, Pernet D, Renwick A, Maranian M, Seal S, Ghoussaini M, Hines S, Healey CS, et al: Genome-wide association study identifies five new breast cancer susceptibility loci. Nature genetics. 2010, 42 (6): 504-507. 10.1038/ng.586.
Ghoussaini M, Fletcher O, Michailidou K, Turnbull C, Schmidt MK, Dicks E, Dennis J, Wang Q, Humphreys MK, Luccarini C, et al: Genome-wide association analysis identifies three new breast cancer susceptibility loci. Nature genetics. 2012, 44 (3): 312-318. 10.1038/ng.1049.
Gracia-Aznarez FJ, Fernandez V, Pita G, Peterlongo P, Dominguez O, de la Hoya M, Duran M, Osorio A, Moreno L, Gonzalez-Neira A, et al: Whole exome sequencing suggests much of non-BRCA1/BRCA2 familial breast cancer is due to moderate and low penetrance susceptibility alleles. PloS one. 2013, 8 (2): e55681-10.1371/journal.pone.0055681.
Mehmood A, Te OB, Urcia JC, Khan A: Tumor Registry Annual Report. 2011, Kingdom of Saudi Arabia: King Faisal Specialist Hospital & Research Center
Chouchane L, Mamtani R, Dallol A, Sheikh JI: Personalized medicine: a patient-centered paradigm. Journal of translational medicine. 2011, 9: 206-10.1186/1479-5876-9-206.
Johnson N, Fletcher O, Palles C, Rudd M, Webb E, Sellick G, dos Santos Silva I, McCormack V, Gibson L, Fraser A, et al: Counting potentially functional variants in BRCA1, BRCA2 and ATM predicts breast cancer susceptibility. Human molecular genetics. 2007, 16 (9): 1051-1057. 10.1093/hmg/ddm050.
Baynes C, Healey CS, Pooley KA, Scollen S, Luben RN, Thompson DJ, Pharoah PD, Easton DF, Ponder BA, Dunning AM: Common variants in the ATM, BRCA1, BRCA2, CHEK2 and TP53 cancer susceptibility genes are unlikely to increase breast cancer risk. Breast cancer research : BCR. 2007, 9 (2): R27-10.1186/bcr1669.
Thorslund T, von Kobbe C, Harrigan JA, Indig FE, Christiansen M, Stevnsner T, Bohr VA: Cooperation of the Cockayne syndrome group B protein and poly(ADP-ribose) polymerase 1 in the response to oxidative stress. Molecular and cellular biology. 2005, 25 (17): 7625-7636. 10.1128/MCB.25.17.7625-7636.2005.
Flohr C, Burkle A, Radicella JP, Epe B: Poly(ADP-ribosyl)ation accelerates DNA repair in a pathway dependent on Cockayne syndrome B protein. Nucleic acids research. 2003, 31 (18): 5332-5337. 10.1093/nar/gkg715.
The authors would like to extend their gratitude to Shylu Mathew, Lubna Mira, Manal Shabat and Nada Salem for their technical support. The authors would like to thank the patients recruited in this study for their support and understanding. The authors acknowledge the generous funding received through the Kingdom of Saudi Arabia’s National Science, Technology and Innovation Plan (NSTIP) Project No. 09-BIO-818-03.
Publication charges for this article have been funded by the Center of Excellence in Genomic Medicine Research, King Abdulaziz University
This article has been published as part of BMC Genomics Volume 16 Supplement 1, 2015: Selected articles from the 2nd International Genomic Medical Conference (IGMC 2013): Genomics. The full contents of the supplement are available online at http://www.biomedcentral.com/bmcgenomics/supplements/16/S1.
The authors declare that they have no competing interests.
AM conceived the study, collected samples and interviewed patients. MG collected samples and contributed to the study design. SH performed the whole-exome analysis and Sanger sequencing. SK supported patients’ recruitment to the study. HT and FT contributed to patients’ recruitment. JM provided archival samples from patients’ relatives. IRH interviewed the patients and provided genetic counseling. TK, AGC, AMA and MHQ contributed to the study design. NS performed whole-exome sequencing, AD developed the study, performed whole-exome sequencing and bioinformatics and wrote the manuscript.