Skip to main content
  • Methodology article
  • Open access
  • Published:

Genotyping of BCL11A and HBS1L-MYB SNPs associated with fetal haemoglobin levels: a SNaPshot minisequencing approach

Abstract

Background

B-thalassaemia and sickle cell disease (SCD) are two of the most common monogenic diseases that are found in many populations worldwide. In both disorders the clinical severity is highly variable, with the persistence of fetal haemoglobin (HbF) being one of the major ameliorating factors. HbF levels are affected by, amongst other factors, single nucleotide polymorphisms (SNPs) at the BCL11A gene and the HBS1L-MYB intergenic region, which are located outside the β-globin locus. For this reason, we developed two multiplex assays that allow the genotyping of SNPs at these two genomic regions which have been shown to be associated with variable HbF levels in different populations.

Results

Two multiplex assays based on the SNaPshot minisequencing approach were developed. The two assays can be used to simultaneous genotype twelve SNPs at the BCL11A gene and sixteen SNPs at HBS1L-MYB intergenic region which were shown to modify HbF levels. The different genotypes can be determined based on the position and the fluorescent colour of the peaks in a single electropherogram. DNA sequencing and restriction fragment length polymorphism (PCR-RFLP) assays were used to verify genotyping results obtained by SNaPshot minisequencing.

Conclusions

In summary, we propose two multiplex assays based on the SNaPshot minisequencing approach for the simultaneous identification of SNPs located at the BCL11A gene and HBS1L-MYB intergenic region which have an effect on HbF levels. The assays can be easily applied for accurate, time and cost efficient genotyping of the selected SNPs in various populations.

Background

B-thalassaemia and SCD are common autosomally inherited disorders that are found in many populations around the world. B-thalassaemia is caused by mutations that lead to low or complete absence of expression of the β-globin gene while in SCD a point mutation in the β-globin gene causes changes in the protein structure [1, 2]. In β-thalassaemia the excess of unbound alpha globin chains precipitate in the red cell precursors leading to their destruction in the bone marrow resulting in ineffective erythropoiesis. Similarly, excess of unbound alpha globin chains induce membrane damage in mature thalassaemic red cells leading to haemolysis [3]. The pathogenesis of SCD initiates when the deoxy-sickle haemoglobin molecules are polymerized leading to the deformation of the structure of the red blood cell. In both disorders, the clinical phenotypes are highly variable, ranging from relatively mild to severe anaemia, due to environmental and genetic factors [2, 4].

One of the major ameliorating factors of clinical severity, in both conditions, is the persistence of HbF (α2γ2) production [5]. During fetal life, HbF is the predominant haemoglobin produced which switches to adult haemoglobin (HbA, α2β2) around birth [6]. By the end of the first year of life the levels of HbF decrease to less than 1% of total haemoglobin. In some individuals haemoglobin switching is not complete and a significant amount of HbF is produced during adulthood, resulting in a condition called hereditary persistence of fetal haemoglobin (HPFH) [7]. In SCD and β-thalassaemia, high γ-globin expression inhibits HbS polymerization and α-globin precipitation, respectively [8]. Point mutations at the promoter of the γ-globin gene and deletions within the β-globin gene cluster can result in increased levels of HbF [2]. However, HbF levels are also affected by genetic factors outside of the β-globin gene cluster.

Genetic association studies have identified SNPs in major loci that are associated with the variation of HbF levels in patients with SCD or β-thalassaemia and in healthy adults. These loci are the BCL11A gene on chromosome 2 (2p16.1) and the HBS1L-MYB intergenic region on chromosome 6 (6q23.3) [9, 10]. The polymorphisms of BCL11A gene that have been described and are associated with variable HbF levels are located within the second intron of this gene. The HBS1L-MYB intergenic polymorphisms (HMIP) are present in three linkage disequilibrium (LD) blocks with most of the effect on HbF levels and numbers of F cells contributed by the second block [913].

The BCL11A gene encodes a zinc finger transcription factor that represses HbF synthesis (γ-globin expression) [1416]. Expression of the two major isoforms (XL, L) of BCL11A protein in human adult erythroid cells is needed to repress the γ-globin expression [14, 15]. The BCL11A protein occupies sites within the locus control region (LCR) and intergenic regions of the β-globin locus but does not associate with the promoter of the γ-globin genes indicating that the regulation of γ-globin expression is a complex process [17]. Moreover, BCL11A interacts with transcription factors such as GATA1, FOG1 and SOX6 and is required for the recruitment of the NuRD and LSD1/CoREST chromatin remodeling complexes to the β-globin locus in primary erythroid cells [14, 16, 18]. MYB is a proto-oncogene that is encoding the c-MYB transcription factor, which plays an essential role in erythroid differentiation [1921]. Absence of c-myb in mice is lethal due to failure to develop haematopoiesis in fetal liver [19]. Disruption of the Hbs1l-Myb intergenic region in mice suppresses Myb levels and leads to elevated expression of embryonic globins and haematopoietic parameters similarly to human HPFH [22]. Moreover, down-regulation of MYB transcription factor through over-expression of microRNAs miR-15a and 16-1 results in elevations in HbF gene expression [23].

Thus, analysis of SNPs that are located in the BCL11A gene and the HBS1L-MYB intergenic region and have an effect on HbF levels can provide crucial genetic information enabling patient stratification and can help predict the severity of disease in newborns with β-thalassaemia and SCD. Furthermore, in the future such information could help clinicians to provide more personalized patient care according to the expected phenotype. Finally, determination of these SNPs will help to better understand the molecular mechanisms of action of modifying factors influencing the production of haemoglobin.

Routinely, identification of SNPs is based on detection by amplification-refractory mutation system (ARMS), PCR followed by PCR-RFLP and Sanger DNA sequencing. These assays are costly and time consuming in cases where several SNPs need to be studied, hence there is a need for an accurate, sensitive and low cost assay for detection of multiple polymorphisms in a single assay.

Here we report the development of two multiplex assays based on the SNaPshot-minisequencing approach for simultaneous screening of twelve and sixteen known SNPs, which have an effect on HbF levels in different populations, in the BCL11A gene and HBS1L-MYB intergenic region, respectively. In each assay we use multiplex PCR followed by multiplex single-base extension with a fluorescent dideoxynucleotide of a primer which anneals immediately adjacent to the SNP under investigation. The extended products are separated and fluorescently detected by capillary electrophoresis. This molecular tool is rapid and cost effective for the detection of multiple SNPs in the BCL11A gene and HBS1L-MYB intergenic region. It can help associate specific SNPs and/or haplotypes to the variation of HbF levels in particular populations but also to enhance our general understanding of their contribution to HbF modulation.

Methods

Biological samples

We used genomic DNA samples from one individual with high levels of HbF, and 25 anonymous confirmed β-thalassaemia patients previously genotyped for haemoglobinopathy mutations in the Molecular Genetics Thalassaemia Department, The Cyprus Institute of Neurology and Genetics. The study was conducted in accordance with the Declaration of Helsinki and was approved by the National Bioethics Committee of Cyprus (Reference number: ΕΕΒΚ/ΕΠ/2012/05). The genomic DNA was extracted from peripheral blood using the Gentra Puregene Kit (Qiagen) according to the manufacturer’s protocol and kept at -80°C for long-term storage. The DNA concentration and purity were measured using the Nanodrop ND-1000 spectrophotometer (NanoDrop Technologies).

SNP selection

A set of twelve BCL11A and sixteen HBS1L-MYB SNPs were chosen based on various published studies [913, 2433] and the new updated haemoglobinopathy sequence variation database, the IthaGenes of the ITHANET Portal at http://www.ithanet.eu/db/ithagenes. Taken together, the data suggest that these polymorphisms are associated with variable HbF levels in different ethnic populations with HPFH, β-thalassaemia, sickle cell anaemia, β-thalassaemia heterozygotes and HbE heterozygotes. The selected SNPs are listed in Table 1, where the reference SNP number is indicated, along with their HGVS name and the geographical/ethnic origin of each SNP study cohort.

Table 1 Selected BCL11A and HBS1L-MYB intergenic region SNPs

Design and optimization of PCR and minisequencing primers

The genetic regions including the selected BCL11A and HBS1L-MYB SNPs were identified by the nucleotide Basic Local Alignment Search Tool (BLAST) and a total of forty-four forward and reverse PCR oligonucleotide primers were designed with the Biomaths Calculator program at http://www.promega.com/techserv/tools/biomath/calc11.htm. All oligonucleotides were designed to have melting temperatures of 57-64°C and their lengths ranged between 17 and 29 nucleotides (nt). The primer sequences were screened to avoid homology to other loci or repetitive sequences in the genome by using the BLAST alignment tool. Specifically, the primer sequences were screened using the “Human genomic plus transcript (Human G + T)” database, the “megablast” algorithm and were checked for low complexity regions. It is highly recommended, when looking for homologies, to use the BLAST tool, which is more sensitive rather than other alignment tools such as BLAT [34]. When using the BLAST tool for primer screening, appropriate care should be taken for the presence of pseudogenes or duplicated genes that could interfere with the assay. Secondary structures and potential primer-primer interactions were predicted using the OligoAnalyzer 3.1 tool at http://eu.idtdna.com/analyzer/Applications/OligoAnalyzer/. All multiplex PCR primer sequences are listed in Table 2, along with three sequencing primers that were designed in a similar way. For each region (BCL11A or HBS1L-MYB), the amplicons encompassing the SNPs of interest were amplified in two separate multiplex PCR (mPCR) reactions, A and B. The sizes of the amplicons ranged from 69 to 649 bp for the BCL11A mPCRs and from 60 to 221 bp for the HBS1L-MYB mPCRs.

Table 2 BCL11A and HBS1L-MYB multiplex PCR amplification primers

SNPs for each of the two genetic loci were genotyped in one multiplex SNaPshot minisequencing assay. Minisequencing primers were designed with their 3′ base corresponding to the base immediately before the SNP to be investigated. Therefore, for each SNP, the base subsequent to the minisequencing primer would identify the polymorphism. Minisequencing primer design was performed according to the recommendations of Sanchez et al., 2003 [35] using the Biomaths Calculator program. The sequences of the twenty-eight primers used in these two assays are listed in Table 3. The sequences of the minisequencing primers were checked for possible hairpin structures, primer-dimer formation or homology to other loci, as described above. In order to allow an adequate distribution of the primer-extension products, non-homologous neutral sequences were incorporated at the 5′-ends of the minisequencing primers to adjust their lengths and obtain a more balanced base composition [36]. For six primers such of an addition was not necessary while for one long primer, a poly(C)-tail was also added to the 5′-end of the neutral sequence [35]. Thus, the length of the minisequencing primers ranged from 20 to 57 nt for BCL11A and from 16 to 68 nt for HBS1L-MYB. To allow sufficient electrophoretic resolution of the multiplex extension products, minisequencing oligonucleotides that are expected to give the same fluorescent signal for SNPs in neighboring positions were designed so that they differ by a minimum of 4 nt in length. All PCR oligonucleotides were purified by standard desalting while minisequencing oligonucleotides were purified by high performance liquid chromatography (Metabion International AG).

Table 3 BCL11A and HBS1L-MYB minisequencing primers

Multiplex minisequencing detection of SNPs

Genotypic detection of BCL11A and HBS1L-MYB SNPs was performed separately for each target region using two mPCRs followed by a multiplex SNaPshot minisequencing reaction [37]. In the first step, two amplification reactions were performed (mPCRA and mPCRB) using 100 ng of genomic DNA, 1X Amplitaq Gold Buffer (Applied Biosystems), 200 mΜ each dNTP (Invitrogen), 1.25 units of Amplitaq Gold DNA Polymerase (Applied Biosystems), primer mix (final concentrations ranged between 0.1-0.8 μM in mPCRA and 0.1-0.4 μM in mPCRB for BCL11A and 0.2-0.8 μM in both mPCRA and mPCRB for HBS1L-MYB, see Table 2), and double distilled water to a final volume of 25 μl. The amplification reaction was carried out in a Veriti 96-well Thermal Cycler (Applied Biosystems). The reaction was initialized at 95°C for 10 min, followed by 35 cycles for 30 sec at 95°C, 60 sec at 61°C; 90 sec at 72°C and a final extension for 7 min at 72°C. Amplified products were analyzed in a 2% and 3% w/v ethidium bromide-stained agarose gel for BCL11A and HBS1L-MYB respectively. The sizes of the amplified PCR products were 69 to 333 bp in mPCRA and 411 to 649 bp in mPCRB for BCL11A and 66 to 221 bp in mPCRA and 60 to 190 bp in mPCRB for HBS1L-MYB (see Table 2).

In order to avoid the participation of primers and unincorporated dNTPs in the subsequent minisequencing primer extension reaction, 2.5 μl of mPCRA and 2.5 μl mPCRB products were combined and treated with an Exo I/Antarctic Phosphatase mix consisting of 5 units of E. coli exonuclease I (Exo I, New England Biolabs) and 1 unit of Antarctic phosphatase (New England Biolabs). The reaction mix was incubated at 37°C for 1 hour and then the enzymes were inactivated by incubating at 75°C for 15 min. The treated PCR products were used as template in a multiplex SNaPshot minisequencing extension reaction containing a pool of polymorphism-specific primers. The reaction extends the minisequencing primers by the addition of one fluorescent ddNTP, producing different products which are specific for each SNP allele. We performed the SNaPshot minisequencing reaction in a 10 μl final volume using 1 μl of treated PCR product, 5 μl of the minisequencing primer cocktail (final primer concentrations ranged between 0.02-0.2 μM for BCL11A and 0.05-0.3 μM for HBS1L-MYB, see Table 3) and 4 μl of SNaPshot® Multiplex Ready Reaction Mix (Applied Biosystems). The primer extension conditions consisted of 96°C for 10 sec, followed by 25 cycles for 10 sec at 96°C, 5 sec at 52°C; 30 sec at 60°C. Afterwards, the samples were treated by the addition of 1X Antarctic Phosphatase Buffer and 0.5 units of Antarctic phosphatase (New England Biolabs) at 37°C for 1 hour followed by enzyme inactivation at 75°C for 15 min.

Analysis of the treated minisequencing products was performed by capillary electrophoresis on an ABI 3130xl Genetic Analyzer (Applied Biosystems) preceded by a denaturation step. A mix of 8.5 μl of HiDi™ formamide (Applied Biosystems) and 0.5 μl of GeneScan-120 LIZ size standard (Applied Biosystems) was added to 1 μl of treated single-nucleotide extension product and denatured at 95°C for 5 min. The mixture was immediately transferred on ice for 5 min and loaded for electrophoresis onto the ABI 3130xl genetic analyzer (Applied Biosystems) using a 36-cm capillary, POP-7 polymer and the DS02 matrix (Applied Biosystems) for 20 minutes with 10 seconds of injection following the manufacturer’s instructions. Multiplex extension products were visualized and analyzed automatically with GeneMapper™ v4.1 Software (Applied Biosystems).

Minisequencing genotypic profile verification

To verify the genotyping of the selected BCL11A and HBS1L-MYB polymorphisms obtained with the multiplex SNaPshot minisequencing assay we carried out Sanger DNA sequencing and/or PCR-RFLP analyses of 25 random β-thalassaemia samples. For sequencing, multiplex amplified mPCRs (A and B) products were purified using the QIAquick PCR purification kit (Qiagen). Nine cycle sequencing amplification reactions were prepared for BCL11A and thirteen for HBS1L-MYB, in each case using 20-30 ng purified mPCR product, 0.5X Big Dye Terminator v1.1 Sequencing Buffer [5X] (Applied Biosystems), 0.5X Terminator ready reaction mix v1.1 [2.5X] (Applied Biosystems), sequencing primer at a concentration of 0.5 μM (see Table 2), and double distilled water to a final volume of 10 μl. The amplification cycling profile was performed in a Veriti 96-well Thermal Cycler (Applied Biosystems) and initialized by 96°C for 1 min, followed by 25 cycles for 10 sec at 96°C, 5 sec at 50°C; 4 min at 60°C. The samples mixtures were sequenced by the ABI 3130xl genetic analyzer (Applied Biosystems) following the manufacturer’s instructions.

The PCR-RFLP method was applied for the detection of two SNPs at the BCL11A locus (rs7606173, rs6706648) and five SNPs at the HBS1L-MYB locus (rs28384513, rs11759553, rs9376090, rs4895440, rs4895441). Singleplex PCR reactions were performed in a final volume of 25 μl using 100 ng of genomic DNA, 1X Amplitaq Gold Buffer (Applied Biosystems), 200 mΜ of each dNTP (Invitrogen), 1.25 units of Amplitaq Gold DNA Polymerase (Applied Biosystems) and primers at 0.2 μM concentration as shown in Table 2. The thermocycling conditions were exactly the same as for the mPCRs. 8 μl of PCR product was digested with 1 μl restriction enzyme (5 units/ μl) (for BCL11A locus AvaII was used for rs7606173 SNP; AatII for rs6706648, and for HBS1L-MYB locus BstxI was used for rs28384513; RsaI: rs11759553; MspI: rs9376090; HinfI: rs4895440; and Hpy188III: rs4895441, all restriction enzymes were purchased from New England Biolabs) for 1 hour at 37°C and visualized in 2% w/v ethidium bromide-stained agarose gel to distinguish between the genotypes of each SNP allele on the basis of susceptibility to digestion by the enzymes.

Results

Detection of 12 BCL11A and 16 HBS1L-MYB SNPs by multiplex minisequencing assays

In order to generate the genetic profile for SNPs located in two of the most common genetic modifiers of HbF, we developed two multiplex minisequencing assays that can be applied to genotype twelve BCL11A and sixteen HBS1L-MYB polymorphisms, respectively (Table 1). Our selection of SNPs to be studied was limited to well-characterized SNPs [913, 2433] with a proven association with variable HbF levels in a number of different populations with HPFH, β-thalassaemia, sickle cell anaemia and heterozygotes for β-thalassaemia or HbE. 25 β-thalassaemia samples were utilized in our laboratory for the establishment of the genotyping protocol. No prior information was available regarding the BCL11A and HBS1L-MYB regions for the samples studied.

Multiplex SNaPshot minisequencing is achieved by mixing primers of different lengths with the SNaPshot mix and the multiplex PCR template. Products of this reaction differ notably in size and are produced by simultaneous multiplex single fluorescent base extension at the 3′-end of each primer [37]. Each fluorescent fragment is assigned a size based on its relative mobility with respect to the GeneScan-120LIZ size standard (15-120 nt) as it migrates through the POP-7 polymer during electrophoresis. The major and minor alleles are differentiated based on the emission of the fluorescent-dye-labelled ddNTP terminator. The analyzed fragments are represented as coloured peaks by the analysis software as follows: green for A; black: C; blue: G; and red: T [37]. In this way the two different polymorphic allele sites can easily be distinguished. Samples that are heterozygous for one interrogated SNP are expected to display two peaks, one for the major allele and one for the minor while homozygotes display one peak which corresponds either to the major or to the minor allele. The relative size of the multiplex minisequencing primers determines the order of the extension products (peaks) on the electropherogram; nevertheless, the mobility of the labeled extension products through the polymer is also influenced by the molecular mass of the label dye, so the observed product size can diverge from the actual size of the primer. Another factor is the nature of the polymer that is used: fragments smaller than 50 bp migrate with slightly different mobilities through the POP-7 polymer than expected resulting in a minor shift compared to what one would anticipate based on the relative size of the minisequencing primer (User Bulletin: Using the SNaPshot® Multiplex System with the POP-7™ Polymer on Applied Biosystems 3730/3730xl DNA Analyzers and 3130/3130xl Genetic Analyzer, Applied Biosystems 2005, P/N. 4367258 Rev. B). During design of the primers avoiding hairpins, primer self-complementarity and primer-dimers was taken into consideration because stable oligonucleotide secondary structures can affect signals.

To examine the feasibility of the assay design and to confirm the identity of the peaks obtained, we first tested each minisequencing primer in a singleplex SnaPshot minisequencing reaction, and then in multiplex SnaPshot minisequencing reaction for each assay by using one high HbF sample and one β-thalassaemia sample with variable genotypic profiles previously determined using sequencing. During optimization of the assays, the observed size of one single-base extension product in BCL11A and six in HBS1L-MYB were affected by strong electrophoretic mobility variation. Overlaps between these and neighbouring peaks carrying identical labels resulted in poor resolution. The length of these seven primers was subsequently altered to ensure good distribution of the assay products (Table 3 contains the final minisequencing primers). Primer concentrations for use in the minisequencing assays were adjusted to generate easily readable peak heights. These were determined following several rounds of optimization. The multiplex electropherograms obtained can depict a minimum of 12 extension product peaks to a maximum of 24 peaks for the BCL11A assay, and 16 to 32 for the HBS1L-MYB assay depending on the genotype of the samples screened. The analyzed data show that in order to have sufficient electrophoretic separation between neighbouring peaks a minimum of 3 nt difference in length is required between the primers giving rise to those peaks. Taken together, our data indicate that the detectable signal for both alleles of the SNPs under investigation are obtained with high reproducibility, indicating that the 12-plex BCL11A and 16-plex HBS1L-MYB minisequencing primer set can be used for genotyping (Figure 1, https://mynotebook.labarchives.com/share/BMC%2520Genomics%2520Minisequencing%2520Data/MC4wfDIzODI4LzAvVHJlZU5vZGUvNDEyMDU0Njc2N3wwLjA). Moreover, three unexpected non-specific peaks exist in the electropherogram (two in the BCL11A assay and one in the HBS1L-MYB assay, Figure 1), though their positions do not interfere with the genotyping of the samples.

Figure 1
figure 1

Detection of BCL11A and HBS1L-MYB SNPs by multiplex minisequencing assays. A. Electropherogram shows the multi detection of 12 BCL11A SNPs from a β-thalassaemia donor. Peaks corresponds to the fluorescence signal that is detected for each SNP. The relative fluorescence units for the detected fragments as they occurred over time are represented along y-axis and size (nt) along the x-axis. B. Peaks presented on 4 separate electropherograms according to their colour, which indicates the fluorescent ddNTP extension. In the cases where the sample is homozygous for a specific SNP, the alternative allele is shown with a dotted peak in order to provide an indication of its position, i.e. G for rs766432; A: rs1427407; A: rs6706648; T: rs4671393; and T: rs6545816 that do not exist in the genotypic profile of this sample. Asterisks indicate non-specific peaks; these do not interfere with the genotyping of the flanking SNPs. For SNPs detected with a reverse primer, the peak colours correspond to the complementary bases of the denoted genotype. C. BCL11A SNPs profile of the β-thalassaemia donor. D. Electropherogram shows the multi detection of 16 HBS1L-MYB SNPs from a β-thalassaemia donor. E. Separate coloured peaks are denoted as described in (B). Dotted peaks indicate the position of the secondary peak that represents the other allele G for rs28384513, A; rs6929404, C; rs1320963 and C; rs6904897 that do not exist in the genotypic profile of this sample. The asterisk indicates a non-specific peak which does not affect the genotyping of the flanking SNPs. F. HBS1L-MYB SNPs profile of the β-thalassaemia donor.

After the successful detection of all polymorphisms with these two multiplex SNaPshot minisequencing assays, 25 genomic DNA β-thalassaemia patients were tested each carrying an unknown genotypic SNPs profile. Screening the study cohort for the selected BCL11A SNPs by the described multiplex assay resulted in identifying all 12 heterozygote and 24 homozygote genotypes possible. For HBS1L-MYB SNPs screening, all 16 heterozygote and 30 homozygote genotypes were detected. The minor genotypes AA for rs6929404 and GG for rs1320963 were not observed in any of the samples.

Validation of the multiplex minisequencing assay

To evaluate the accuracy of the assays we screened the same cohort of 25 β-thalassaemia patients as above with sequencing and/or PCR-RFLP for all 12 BCL11A and 16 HBS1L-MYB SNPs. The proportion of correctly identified SNPs (true positives) as well as the existence of any false positive or false negative peaks was examined. Our minisequencing results showed 100% agreement with the genotypes determined by sequencing and restriction fragment analysis. No false positive or false negative minisequencing results were found. These validation data prove that the new multiplex minisequencing assays developed are highly accurate and suitable for the detection of the twelve BCL11A and sixteen HBS1L-MYB polymorphisms.

Discussion

In this study, we describe two simple multiplex assays for simultaneous detection of twelve and sixteen SNPs that are located in the BCL11A gene and the HBS1L-MYB intergenic region, respectively. These polymorphisms have been shown to have an effect on HbF levels, a major ameliorating factor of SCD and β-thalassaemia in various populations [5]. The designed assays are based on the SNaPshot minisequencing technique, which was found to be an accurate method for SNP genotyping in many biology fields including population and forensic genetics [35, 38].

The assay for genotyping the selected BCL11A SNPs is a two-step strategy based on the multiplex amplification of nine fragments where the SNPs are located, in two separate multiplex PCR reactions, followed by one multiplex SNaPshot minisequencing reaction. Similarly, genotyping the selected SNPs in the HBS1L-MYB intergenic region is based on the multiplex amplification of thirteen fragments, in two separate multiplex PCR reactions, followed by one multiplex SNaPshot minisequencing reaction.

To validate the SNP genotyping results obtained by SNaPshot minisequencing we performed Sanger DNA sequencing or PCR-RFLP analyses of 25 DNA samples. The results of Sanger DNA sequencing or PCR-RFLP were always in agreement with those of the SNaPshot minisequencing.

Using the SNaPshot minisequencing technique we were able to determine the genotypes of the samples based on the position and the fluorescent colour of the peaks on the electropherograms generated. The minor shifts observed in the position of the peaks compared to the expected size may be due to the different molecular weights of each of the four fluorescent ddNTPs which can be incorporated at the end of each primer and to the POP-7 polymer used. These shifts are consistent for all samples that were tested and did not interfere with the analysis of the polymorphisms. Neutral tails incorporated at the ends of the minisequencing primers ensured sufficient spacing of the peaks and accommodated any unexpected small shifts in peak positions.

A crucial step for the success of the minisequencing assay is the correct design of the multiplex minisequencing primers. Given that the 3′ end of the minisequencing primer must be immediately adjacent to the SNP of interest, there is little flexibility in the positioning of the primer, although parameters like the length of the primer and its orientation are more amenable to manipulation. In deciding the orientation of the primers, it is necessary to take into consideration the presence of any additional sequence variations flanking the SNP of interest as these may cause problems in the annealing and extension of the primer [39, 40]. It is for this precise reason that we did not include SNP rs7775698 in the assay for genotyping the HBS1L-MYB intergenic region even though this SNP is known to be associated with elevated levels of HbF [31, 32]. Specifically, 3 more SNPs overlap with or are immediately adjacent to SNP rs7775698. These are rs371998411 and rs66650371, which are 3 bp deletions, and rs55634702, which is a 2 bp insertion (Figure 2). Consequently, designing a single minisequencing primer either upstream or downstream of the SNP rs7775698 is not sufficient to genotype the SNPs at this position. Sanger DNA sequencing is one of the techniques that can be used to resolve this issue.

Figure 2
figure 2

Sequence variations close to SNP rs7775698. Nucleotide sequences between chromosome 6: 135460308 to 135460368 bp. The sequences shown are: the rs7775698 major allele C, the rs7775698 minor allele T, the rs371998411 (CTA) 3 bp deletion, the rs66650371 (TAC) 3 bp deletion and the rs55634702 (TA) 2 bp insertion. In red is indicated the sequence variation.

Based on our observations that the multiplex PCR reactions for both the BCL11A and HBS1L-MYB assays were well standardized and that no amplification problems were experienced for any of the samples that we genotyped, we concluded that it is possible to reduce the time necessary for the completion of the whole assay by omitting the agarose gel electrophoresis step for the detection and analysis of the initial multiplex PCR products. Moreover, the assay can be easily customized for the analysis of only a subset of the SNPs or the addition of more SNPs without major changes in the conditions for either assay.

The advantage of the minisequencing technique as a means of screening for polymorphisms is that several selected polymorphisms can be detected simultaneously in a single reaction, with the results presented in a single electropherogram. The technique can be easily performed in any laboratory that is equipped with a thermocycler and a DNA sequencer. The SNaPshot minisequencing technique is highly accurate and sensitive and is suitable for the genotyping of large numbers of clinical samples relatively quickly. The possibility of using this technique for simultaneous identification of several polymorphisms at distant genetic locations in a single reaction makes it cost and time efficient compared to other commonly used genotyping techniques such as PCR-RFLP and Sanger DNA sequencing. Although PCR-RFLP is a simple technique, its use for genotyping the large numbers of SNPs selected for the two loci under investigation is time consuming and impractical because several PCR reactions must be carried out per locus. More importantly, PCR-RFLP is not applicable for all SNPs under investigation because of the absence of informative restriction enzyme recognition sites for all SNPs or the presence of additional SNPs within the recognition site. Similarly, Sanger DNA sequencing for the selected SNPs in our assays requires carrying out nine sequencing reactions for the BCL11A SNPs and thirteen sequencing reactions for the HBS1L-MYB SNPs compared to a single SNaPshot minisequencing reaction. Although it is possible to screen large scale sequences or SNPs with next generation sequencing and arrays, respectively, these high-throughput techniques are expensive and require specialized equipment which is not commonly available in most molecular biology laboratories.

In addition, the two described multiplex minisequencing assays can be combined with multiplex ligation-dependent probe amplification (MLPA) technique or Gap-PCR for rapid detection of any deletions or duplications of one or more sequences in or near the α–globin genes [4143], since this is an important modifier of the severity of SCD and β–thalassaemia [44].

Conclusion

The primary goal of this study was to develop a simple and rapid method to genotype SNPs in the BCL11A gene and in the HBS1L-MYB intergenic region that were shown in various studies to be associated with variable HbF levels in several populations. Multiplex SNaPshot minisequencing, a powerful method that can be used for analyzing SNPs distantly located to each other in a single reaction was chosen for this purpose. Taking the advantage of this method, two multiplex SNaPshot minisequencing assays were developed for simultaneous genotyping of twelve SNPs in the BCL11A gene and sixteen SNPs in the HBS1L-MYB intergenic region. The method was applied to screen 25 different samples and the results obtained were verified by DNA sequencing and PCR-RFLP. Concluding, the assays developed can be successfully applied for accurate, time and cost efficient simultaneous genotyping of SNPs associated with HbF levels in the BCL11A gene and in the HBS1L-MYB intergenic region in HPFH, β-thalassaemia, SCD or other hemoglobinopathy populations.

References

  1. Steinberg MH: Genetic etiologies for phenotypic diversity in sickle cell anemia. Sci World J. 2009, 9: 46-67.

    Article  Google Scholar 

  2. David J, Weatherall JBC: The Thalassaemia Syndromes. 2001, Oxford, UK: Blackwell Scientific, 4

    Google Scholar 

  3. Galanello R, Origa R: Beta-thalassemia. Orphanet J Rare Dis. 2010, 5: 11-10.1186/1750-1172-5-11.

    Article  PubMed Central  PubMed  Google Scholar 

  4. Ingram VM: Separation of the peptide chains of human globin. Nature. 1959, 183: 1795-1798. 10.1038/1831795a0.

    Article  CAS  PubMed  Google Scholar 

  5. Stamatoyannopoulos G: Control of globin gene expression during development and erythroid differentiation. Exp Hematol. 2005, 33 (3): 259-271. 10.1016/j.exphem.2004.11.007.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  6. Bauer DE, Kamran SC, Orkin SH: Reawakening fetal hemoglobin: prospects for new therapies for the beta-globin disorders. Blood. 2012, 120 (15): 2945-2953. 10.1182/blood-2012-06-292078.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  7. Thein SL, Craig JE: Genetics of Hb F/F cell variance in adults and heterocellular hereditary persistence of fetal hemoglobin. Hemoglobin. 1998, 22 (5–6): 401-414.

    Article  CAS  PubMed  Google Scholar 

  8. Akinsheye I, Alsultan A, Solovieff N, Ngo D, Baldwin CT, Sebastiani P, Chui DH, Steinberg MH: Fetal hemoglobin in sickle cell anemia. Blood. 2011, 118 (1): 19-27. 10.1182/blood-2011-03-325258.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  9. Thein SL, Menzel S, Peng X, Best S, Jiang J, Close J, Silver N, Gerovasilli A, Ping C, Yamaguchi M, et al: Intergenic variants of HBS1L-MYB are responsible for a major quantitative trait locus on chromosome 6q23 influencing fetal hemoglobin levels in adults. Proc Natl Acad Sci USA. 2007, 104 (27): 11346-11351. 10.1073/pnas.0611393104.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  10. Menzel S, Garner C, Gut I, Matsuda F, Yamaguchi M, Heath S, Foglio M, Zelenika D, Boland A, Rooks H, et al: A QTL influencing F cell production maps to a gene encoding a zinc-finger protein on chromosome 2p15. Nat Genet. 2007, 39 (10): 1197-1199. 10.1038/ng2108.

    Article  CAS  PubMed  Google Scholar 

  11. Galarneau G, Palmer CD, Sankaran VG, Orkin SH, Hirschhorn JN, Lettre G: Fine-mapping at three loci known to affect fetal hemoglobin levels explains additional genetic variation. Nat Gen. 2010, 42 (12): 1049-1051. 10.1038/ng.707.

    Article  CAS  Google Scholar 

  12. Lettre G, Sankaran VG, Bezerra MA, Araujo AS, Uda M, Sanna S, Cao A, Schlessinger D, Costa FF, Hirschhorn JN, et al: DNA polymorphisms at the BCL11A, HBS1L-MYB, and beta-globin loci associate with fetal hemoglobin levels and pain crises in sickle cell disease. Proc Natl Acad Sci USA. 2008, 105 (33): 11869-11874. 10.1073/pnas.0804799105.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  13. So CC, Song YQ, Tsang ST, Tang LF, Chan AY, Ma ES, Chan LC: The HBS1L-MYB intergenic region on chromosome 6q23 is a quantitative trait locus controlling fetal haemoglobin level in carriers of beta-thalassaemia. J Med Gen. 2008, 45 (11): 745-751. 10.1136/jmg.2008.060335.

    Article  CAS  Google Scholar 

  14. Sankaran VG, Menne TF, Xu J, Akie TE, Lettre G, Van Handel B, Mikkola HK, Hirschhorn JN, Cantor AB, Orkin SH: Human fetal hemoglobin expression is regulated by the developmental stage-specific repressor BCL11A. Science. 2008, 322 (5909): 1839-1842. 10.1126/science.1165409.

    Article  CAS  PubMed  Google Scholar 

  15. Sankaran VG, Xu J, Ragoczy T, Ippolito GC, Walkley CR, Maika SD, Fujiwara Y, Ito M, Groudine M, Bender MA, et al: Developmental and species-divergent globin switching are driven by BCL11A. Nature. 2009, 460 (7259): 1093-1097.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  16. Xu J, Sankaran VG, Ni M, Menne TF, Puram RV, Kim W, Orkin SH: Transcriptional silencing of {gamma}-globin by BCL11A involves long-range interactions and cooperation with SOX6. Genes Dev. 2010, 24 (8): 783-798.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  17. Jawaid K, Wahlberg K, Thein SL, Best S: Binding patterns of BCL11A in the globin and GATA1 loci and characterization of the BCL11A fetal hemoglobin locus. Blood Cells Mol Dis. 2010, 45 (2): 140-146.

    Article  CAS  PubMed  Google Scholar 

  18. Xu J, Bauer DE, Kerenyi MA, Vo TD, Hou S, Hsu YJ, Yao H, Trowbridge JJ, Mandel G, Orkin SH: Corepressor-dependent silencing of fetal hemoglobin expression by BCL11A. Proc Natl Acad Sci USA. 2013, 110 (16): 6518-6523.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  19. Mucenski ML, McLain K, Kier AB, Swerdlow SH, Schreiner CM, Miller TA, Pietryga DW, Scott WJ, Potter SS: A functional c-myb gene is required for normal murine fetal hepatic hematopoiesis. Cell. 1991, 65 (4): 677-689.

    Article  CAS  PubMed  Google Scholar 

  20. Sandberg ML, Sutton SE, Pletcher MT, Wiltshire T, Tarantino LM, Hogenesch JB, Cooke MP: c-Myb and p300 regulate hematopoietic stem cell proliferation and differentiation. Dev Cell. 2005, 8 (2): 153-166.

    Article  CAS  PubMed  Google Scholar 

  21. Vegiopoulos A, Garcia P, Emambokus N, Frampton J: Coordination of erythropoiesis by the transcription factor c-Myb. Blood. 2006, 107 (12): 4703-4710.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  22. Suzuki M, Yamazaki H, Mukai HY, Motohashi H, Shi L, Tanabe O, Engel JD, Yamamoto M: Disruption of the Hbs1l-Myb locus causes hereditary persistence of fetal hemoglobin in a mouse model. Mol Cell Biol. 2013, 33 (8): 1687-1695.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  23. Sankaran VG, Menne TF, Scepanovic D, Vergilio JA, Ji P, Kim J, Thiru P, Orkin SH, Lander ES, Lodish HF: MicroRNA-15a and -16-1 act via MYB to elevate fetal hemoglobin expression in human trisomy 13. Proc Natl Acad Sci USA. 2011, 108 (4): 1519-1524.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  24. Uda M, Galanello R, Sanna S, Lettre G, Sankaran VG, Chen W, Usala G, Busonero F, Maschio A, Albai G, et al: Genome-wide association study shows BCL11A associated with persistent fetal hemoglobin and amelioration of the phenotype of beta-thalassemia. Proc Natl Acad Sci USA. 2008, 105 (5): 1620-1625.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  25. Badens C, Joly P, Agouti I, Thuret I, Gonnet K, Fattoum S, Francina A, Simeoni MC, Loundou A, Pissard S: Variants in genetic modifiers of beta-thalassemia can help to predict the major or intermedia type of the disease. Haematologica. 2011, 96 (11): 1712-1714.

    Article  PubMed Central  PubMed  Google Scholar 

  26. He Y, Lin W, Luo J: Influences of genetic variation on fetal hemoglobin. Pediatr Hematol Oncol. 2011, 28 (8): 708-717.

    Article  CAS  PubMed  Google Scholar 

  27. Solovieff N, Milton JN, Hartley SW, Sherva R, Sebastiani P, Dworkis DA, Klings ES, Farrer LA, Garrett ME, Ashley-Koch A, et al: Fetal hemoglobin in sickle cell anemia: genome-wide association studies suggest a regulatory region in the 5′ olfactory receptor gene cluster. Blood. 2010, 115 (9): 1815-1822.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  28. Bhatnagar P, Purvis S, Barron-Casella E, DeBaun MR, Casella JF, Arking DE, Keefer JR: Genome-wide association study identifies genetic variants influencing F-cell levels in sickle-cell patients. J Hum Genet. 2011, 56 (4): 316-323.

    Article  CAS  PubMed  Google Scholar 

  29. Sedgewick AE, Timofeev N, Sebastiani P, So JC, Ma ES, Chan LC, Fucharoen G, Fucharoen S, Barbosa CG, Vardarajan BN, et al: BCL11A is a major HbF quantitative trait locus in three different populations with beta-hemoglobinopathies. Blood Cells Mol Dis. 2008, 41 (3): 255-258.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  30. Danjou F, Anni F, Perseu L, Satta S, Dessi C, Lai ME, Fortina P, Devoto M, Galanello R: Genetic modifiers of beta-thalassemia and clinical severity as assessed by age at first transfusion. Haematologica. 2012, 97 (7): 989-993. doi: 910.3324/haematol.2011.053504. Epub 052012 Jan 053522

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  31. Nuinoon M, Makarasara W, Mushiroda T, Setianingsih I, Wahidiyat PA, Sripichai O, Kumasaka N, Takahashi A, Svasti S, Munkongdee T, et al: A genome-wide association identified the common genetic variants influence disease severity in beta0-thalassemia/hemoglobin E. Hum Genet. 2010, 127 (3): 303-314. doi: 310.1007/s00439-00009-00770-00432

    Article  CAS  PubMed  Google Scholar 

  32. Farrell JJ, Sherva RM, Chen ZY, Luo HY, Chu BF, Ha SY, Li CK, Lee AC, Li RC, Li CK, et al: A 3-bp deletion in the HBS1L-MYB intergenic region on chromosome 6q23 is associated with HbF expression. Blood. 2011, 117 (18): 4935-4945. doi:4910.1182/blood-2010-4911-317081. Epub 312011 Mar 317088

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  33. Menzel S, Jiang J, Silver N, Gallagher J, Cunningham J, Surdulescu G, Lathrop M, Farrall M, Spector TD, Thein SL: The HBS1L-MYB intergenic region on chromosome 6q23.3 influences erythrocyte, platelet, and monocyte counts in humans. Blood. 2007, 110 (10): 3624-3626. Epub 2007 Aug 3621

    Article  CAS  PubMed  Google Scholar 

  34. Kent WJ: BLAT–the BLAST-like alignment tool. Genome Res. 2002, 12 (4): 656-664.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  35. Sanchez JJ, Borsting C, Hallenberg C, Buchard A, Hernandez A, Morling N: Multiplex PCR and minisequencing of SNPs–a model with 35 Y chromosome SNPs. Forensic Sci Int. 2003, 137 (1): 74-84.

    Article  CAS  PubMed  Google Scholar 

  36. Lindblad-Toh K, Winchester E, Daly MJ, Wang DG, Hirschhorn JN, Laviolette JP, Ardlie K, Reich DE, Robinson E, Sklar P, et al: Large-scale discovery and genotyping of single-nucleotide polymorphisms in the mouse. Nat Genet. 2000, 24 (4): 381-386.

    Article  CAS  PubMed  Google Scholar 

  37. Fiorentino F, Magli MC, Podini D, Ferraretti AP, Nuccitelli A, Vitale N, Baldi M, Gianaroli L: The minisequencing method: an alternative strategy for preimplantation genetic diagnosis of single gene disorders. Mol Hum Reprod. 2003, 9 (7): 399-410.

    Article  CAS  PubMed  Google Scholar 

  38. Paneto GG, Kohnemann S, Martins JA, Cicarelli RM, Pfeiffer H: A single multiplex PCR and SNaPshot minisequencing reaction of 42 SNPs to classify admixture populations into mitochondrial DNA haplogroups. Mitochondrion. 2011, 11 (2): 296-302.

    Article  CAS  PubMed  Google Scholar 

  39. Pastinen T, Kurg A, Metspalu A, Peltonen L, Syvanen AC: Minisequencing: a specific tool for DNA analysis and diagnostics on oligonucleotide arrays. Genome Res. 1997, 7 (6): 606-614.

    CAS  PubMed  Google Scholar 

  40. Syvanen AC: From gels to chips: “minisequencing” primer extension for analysis of point mutations and single nucleotide polymorphisms. Hum Mutat. 1999, 13 (1): 1-10.

    Article  CAS  PubMed  Google Scholar 

  41. Colosimo A, Gatta V, Guida V, Leodori E, Foglietta E, Rinaldi S, Cappabianca MP, Amato A, Stuppia L, Dallapiccola B: Application of MLPA assay to characterize unsolved alpha-globin gene rearrangements. Blood Cells Mol Dis. 2011, 46 (2): 139-144.

    Article  CAS  PubMed  Google Scholar 

  42. Harteveld CL, Voskamp A, Phylipsen M, Akkermans N, den Dunnen JT, White SJ, Giordano PC: Nine unknown rearrangements in 16p13.3 and 11p15.4 causing alpha- and beta-thalassaemia characterised by high resolution multiplex ligation-dependent probe amplification. J Med Genet. 2005, 42 (12): 922-931.

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  43. Liu YT, Old JM, Miles K, Fisher CA, Weatherall DJ, Clegg JB: Rapid detection of alpha-thalassaemia deletions and alpha-globin gene triplication by multiplex polymerase chain reactions. Br J Haematol. 2000, 108 (2): 295-299.

    Article  CAS  PubMed  Google Scholar 

  44. Thein SL: Genetic modifiers of the beta-haemoglobinopathies. Br J Haematol. 2008, 141 (3): 357-366.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

The authors thank George Christopoulos at Molecular Genetics Thalassaemia Department, Τhe Cyprus Institute of Neurology and Genetics for critical help on technical aspects of protocol development. This study was supported by grants from the Cyprus Research Promotion Foundation/European Regional Development Fund (ΥΓΕΙΑ/∆ΥΓΕΙΑ/0311(ΒΙΕ)39), the European Commission (7th Framework Programme, ThalamoSS, 306201) and Τhe Cyprus Institute of Neurology and Genetics.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Marina Kleanthous.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

MK introduced the concept of the genotyping of BCL11A and HBS1L-MYB SNPs with the two SNaPshot Minisequencing Assays and provided advice on methodology development. PF designed the experiments, aided in the implementation of the experiments and drafted the manuscript. IK developed the laboratory protocols for the two SNaPshot Minisequencing assays to the level of standard screening procedure, performed the analysis of the data and drafted the manuscript. MP provided advice on methodology development and on manuscript writing. All authors read and approved the final manuscript.

Pavlos Fanis, Ioanna Kousiappa contributed equally to this work.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver ( https://creativecommons.org/publicdomain/zero/1.0/ ) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Fanis, P., Kousiappa, I., Phylactides, M. et al. Genotyping of BCL11A and HBS1L-MYB SNPs associated with fetal haemoglobin levels: a SNaPshot minisequencing approach. BMC Genomics 15, 108 (2014). https://doi.org/10.1186/1471-2164-15-108

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/1471-2164-15-108

Keywords