A simple, high throughput method to locate single copy sequences from Bacterial Artificial Chromosome (BAC) libraries using High Resolution Melt analysis
© Vu et al; licensee BioMed Central Ltd. 2010
Received: 23 January 2010
Accepted: 12 May 2010
Published: 12 May 2010
The high-throughput anchoring of genetic markers into contigs is required for many ongoing physical mapping projects. Multidimensional BAC pooling strategies for PCR-based screening of large insert libraries are a widely used alternative to high density filter hybridisation of bacterial colonies. To date, concerns over reliability have led most if not all groups engaged in high throughput physical mapping projects to favour BAC DNA isolation prior to amplification by conventional PCR.
Here, we report the first combined use of Multiplex Tandem PCR (MT-PCR) and High Resolution Melt (HRM) analysis on bacterial stocks of BAC library superpools as a means of rapidly anchoring markers to BAC colonies and thereby to integrate genetic and physical maps. We exemplify the approach using a BAC library of the model plant Arabidopsis thaliana. Super pools of twenty five 384-well plates and two-dimension matrix pools of the BAC library were prepared for marker screening. The entire procedure only requires around 3 h to anchor one marker.
A pre-amplification step during MT-PCR allows high multiplexing and increases the sensitivity and reliability of subsequent HRM discrimination. This simple gel-free protocol is more reliable, faster and far less costly than conventional PCR screening. The option to screen in parallel 3 genetic markers in one MT-PCR-HRM reaction using templates from directly pooled bacterial stocks of BAC-containing bacteria further reduces time for anchoring markers in physical maps of species with large genomes.
Whole genome sequence data is currently unavailable for the overwhelming majority of plant species, including most crops. For such cases, the integration of linkage and physical maps provides a vital information platform to accelerate processes such as positional cloning, comparative genome analysis, and clone-by-clone sequencing [1–3]. One key limiting step in compiling a comprehensive link between physical and linkage maps is the ability to locate large insert clones (e.g. from Bacterial Artificial Chromosome [BAC] or Yeast Artificial Chromosome [YAC] libraries) that contain the polymorphic markers used in linkage mapping. Traditionally, this has been achieved by colony hybridisation using (usually) radioactively-labelled cloned DNA, PCR products, or oligonucleotides [4, 5]. There are several disadvantages of using colony hybridisation as a means of identifying clones that contain target DNA; these largely centre on the difficulty in setting appropriate hybridisation conditions to minimise false positive and false negative results, but also include the need for appropriate facilities and procedures to handle radio-labelled probes. These problems can be overcome if a PCR-based screening strategy is adopted using Sequence Tagged Site (STS) markers [6, 7].
Several authors have argued that the efficiency of PCR-based screening can be greatly improved through the use of structured multidimensional BAC pooling strategies [7, 8]. Compared with hybridisation to high-density colony filters, the PCR screening of multidimensional BAC pools is less prone to the confounding effects of repetitive elements and tends to be less cumbersome than hybridisation with radiolabelled probes. Issues of error arise from both approaches, however, with the relative abundance of type 1 and type 2 errors varying according to post-hybridisation wash stringency or PCR annealing temperatures respectively. Importantly, the number of independent amplifications required to determine a clone's address can be reduced by smart pooling strategies. The use of conventional PCR for BAC pool screening to anchor genetic markers to physical maps with a high throughput retains some shortcomings, most notably the common need to: i) isolate and normalise DNA from a large number of clones (the whole library) to ensure that sufficient template is available from each BAC for reliable amplification of single copy targets by conventional PCR; ii) anchor and score each marker separately (multiplexing is generally difficult); and iii) use agarose gel electrophoresis as crude verification (by size) of the fidelity of target amplification. This last step is simple but comparatively slow and difficult to automate.
Results and discussion
Rationale of the method
Table 1. Comparison of screening a BAC library by conventional PCR and by MT-PCR-HRM.
Paired entries are given in the order: screening by conventional PCR using 3D pooling strategy; screening by MT-PCR-HRM using 2D pooling strategy.
Maximum number of 384-well plates to be pooled in one super pool: 10 384-well plates; 25 384-well plates
Number of super pools to be tested to identify a positive superpool (N: the total plate number of the BAC library)
PCR reactions (PCR/PCR-HRM) needed to identify a positive super pool
Reactions needed to identify the plate ID for one positive super pool
Reactions needed to identify the clone ID from one positive plate
Total number of reactions needed to get positive BAC clone ID from whole library: N/10 + 50; N/25 + 28
Checking on agarose gel
£0.16 per PCR reaction + cost for agarose gel electrophoresis; £0.17 per PCR-HRM reaction
Procedure duration to anchor 1 marker
Manually from gel photos; Semi-automatic (figures and summary tables can be exported from HRM rotor system)
Table 2. MT-PCR-HRM pooling strategy to accommodate variable genome size.
Columns: 1C genome (Mb); No. of 384-well plates; No. of super pools; No. of matrix pools; No. of single pools; Total required simplex MT-PCR-HRM; Total required triplex MT-PCR-HRM
The practical example
We use A. thaliana as an exemplar to demonstrate the usefulness of MT-PCR-HRM in combination with a multidimensional BAC pooling strategy for the anchoring of markers into physical maps.
The experimental design comprises three main steps: 1) pooling of BAC freezing stocks; 2) multiplex pre-amplification of BAC pools with up to 50 primer pairs; 3) localisation of positive BAC clones by identifying the positive subset of pools via selective amplification followed by HRM analysis (for each marker separately, or for up to three markers simultaneously) to confirm locus identity.
1) Pooling strategy for PCR screening of a genomic BAC library
The size of large insert libraries is normally adjusted according to the haploid genome size, the mean insert size and the expected genome coverage of the library. A typical BAC library has an average insert size of approximately 100 Kb. It is recommended to use a BAC library with at least 3-fold coverage for PCR-based screening purposes. Such genomic BAC libraries are typically stored in microtiter plates in a 384-well format.
Several pooling strategies can be applied for PCR-based screening. The number of dimensions (D) deployed in a pooling strategy defines the number of subsets of pools that may contain a specific BAC clone. Screening techniques based on conventional PCR normally require 3-D or 6-D pooling systems [5, 7–10] in order to ensure reliable amplification from a single BAC clone. With the more sensitive and accurate MT-PCR-HRM technique, a two-dimension pooling strategy (Figure 1) is sufficient. For studies aiming to anchor genetic markers into physical maps of large genomes, the minimum number of screening reactions is decisive in setting labour and consumables costs. The pooling strategy proposed here allows screening of millions of clones with as few as the number of superpools plus 28 MT-PCR-HRM reactions (Figure 1). Based on our experience, each super pool should not comprise more than twenty five 384-well plates. Using 5 μL per BAC clone, a super pool contains in total 5 × 25 × 384 μL (= 48 mL). In our illustrative example, an A. thaliana library was used that contained twenty four 384-well plates. To evaluate whether the enhanced sensitivity of this method has utility for organisms with much larger genomes, we simulated 'rare single locus' hits in a substantially larger library by creating a superpool containing 5 μL of a single positive BAC clone and 3200 ([25 × 384]/3) negative BAC clones of 15 μL each. We were able to exploit this approach to estimate the capability of the technique to detect single locus hits for genomes of much larger size (Table 2).
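The volume arithmetic in this paragraph can be checked with a short calculation. The sketch below is illustrative only: the function names are hypothetical and it uses nothing beyond the figures quoted above (5 μL per clone, 384 wells per plate, 25 plates per super pool, and the simulated pool of one 5 μL positive clone plus 3200 negative clones at 15 μL each).

```python
WELLS_PER_PLATE = 384  # standard 384-well microtiter plate


def superpool_volume_ml(plates, ul_per_clone=5.0):
    """Total super pool volume (mL) when every well contributes ul_per_clone."""
    return plates * WELLS_PER_PLATE * ul_per_clone / 1000.0


def simulated_superpool(positive_ul=5.0, negative_clones=3200, negative_ul=15.0):
    """Describe the simulated 'rare single locus' pool from the text.

    Returns the total volume and how many 5 uL clone aliquots that volume
    corresponds to, i.e. the effective dilution of the single positive clone.
    """
    total_ul = positive_ul + negative_clones * negative_ul
    return {
        "total_ml": total_ul / 1000.0,
        "positive_fraction": positive_ul / total_ul,
        "equivalent_clones": total_ul / positive_ul,
    }


if __name__ == "__main__":
    print(superpool_volume_ml(25))   # full 25-plate super pool
    print(simulated_superpool())     # one positive among 3200 triple-volume negatives
```

Under these figures the simulated pool has essentially the same volume as a full 25-plate super pool, so the single positive clone is diluted about as strongly as one clone among 25 × 384 others.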
According to the proposed pooling strategy, the required number of super pools for a typical 3× coverage BAC library with a 100 Kb average insert size is estimated for several important genomes (Table 2). For genomes larger than 1500 Mb, the number of PCRs required to anchor one genetic marker is approximately equivalent. The number of plates per superpool can be varied within a certain range (i.e., less than 25) according to the genome size of the organism under study.
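The sizing behind such estimates can be sketched as follows, assuming the stated 3× coverage, 100 Kb mean insert, 384-well plates, at most 25 plates per super pool, and 28 reactions to resolve a clone within one positive super pool. The function name and the rounding-up at each step are assumptions made for illustration. The triplex figure additionally assumes that only the super pool screen is shared between three markers; this is one reading that happens to reproduce the 45 reactions per marker quoted for wheat in the text, not a rule confirmed by the source.

```python
import math


def pooling_plan(genome_mb, coverage=3.0, insert_kb=100.0,
                 plates_per_superpool=25, reactions_within_superpool=28):
    """Estimate library size and screening effort for one marker.

    Every default comes from the text; rounding up at each step is an
    assumption (partial plates and partial super pools still need handling).
    """
    clones = math.ceil(coverage * genome_mb * 1000.0 / insert_kb)
    plates = math.ceil(clones / 384)
    superpools = math.ceil(plates / plates_per_superpool)
    simplex = superpools + reactions_within_superpool
    # Assumption: in triplex mode three markers share each super pool reaction,
    # but each marker still needs its own 28 follow-up reactions.
    triplex_per_marker = math.ceil(superpools / 3) + reactions_within_superpool
    return {
        "clones": clones,
        "plates": plates,
        "superpools": superpools,
        "simplex_reactions": simplex,
        "triplex_reactions_per_marker": triplex_per_marker,
    }


if __name__ == "__main__":
    # 16 Gbp wheat genome, the example used later in the text
    print(pooling_plan(16000))
```

For the 16 Gbp wheat genome this gives 480,000 clones, matching the clone number stated in the text.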
2) MT-PCR pre-amplification
By applying a suitable pooling strategy, the number of PCR amplifications required to identify the positive BAC clones can be reduced significantly. However, because of the low template concentration for STS sites and further dilution by E. coli DNA, conventional PCR-based screening may over-dilute single locus targets to such an extent that amplifications often fail. DNA extraction from the BACs is therefore required for better template accessibility [7, 9]. Instead, we introduced a multiplex pre-amplification step on super BAC pools (each containing twenty five 384-well plates) directly from freezing stocks, applying up to 50 primer pairs simultaneously. Before pre-amplification, each primer pair must be confirmed by PCR-HRM to work individually on total genomic DNA.
The pre-amplification increases the sensitivity and specificity of MT-PCR-HRM on pooled templates (Figure 2). Without pre-amplification a higher cycle number is generally required for conventional PCR (40 to >45 cycles), yielding an excessively high background to product ratio (Figure 2).
3) Identification of marker-containing BAC clones by high resolution melting (HRM) analysis
HRM curve analysis after PCR-HRM provides a rapid, sensitive means for BAC screening. The Rotor-Gene™ 6000 (Qiagen) with a real-time rotary analyzer prevents unwanted temperature deviation inside the thermal cycler [15, 16]. Furthermore, intercalating dye improves HRM sensitivity, allows omission of the gel electrophoresis step and enhances true positive hit rates. The positive BAC pools yield clear melting curves instead of a background line as obtained in the case of negative BAC pools (Figure 3). Because melting curve analysis can differentiate between divergent sequences, the option of a multiplex PCR for up to 3 loci is applicable to reduce consumables and labour (Figure 4). The entire MT-PCR-HRM run for a single locus (or for three loci together) can be completed within 90 minutes.
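The distinction between a clear melting curve and a background line can be illustrated with a first-derivative peak call. The sketch below is not the Rotor-Gene software's algorithm: the curves are synthetic, and the function names, the sigmoid shape and the `min_peak` threshold are all hypothetical choices made only to show the principle.

```python
import math


def melt_temperatures(start=65.0, stop=90.0, step=0.3):
    """Temperature grid for a stepped melt ramp."""
    n = int(round((stop - start) / step))
    return [start + i * step for i in range(n + 1)]


def negative_derivative(temps, fluor):
    """Central-difference estimate of -dF/dT at the interior points."""
    out = []
    for i in range(1, len(temps) - 1):
        d_f = fluor[i + 1] - fluor[i - 1]
        d_t = temps[i + 1] - temps[i - 1]
        out.append((temps[i], -d_f / d_t))
    return out


def call_pool(temps, fluor, min_peak=0.05):
    """Return (is_positive, peak_temperature) for one pool.

    min_peak is an arbitrary illustrative threshold, not an instrument setting.
    """
    peak_temp, height = max(negative_derivative(temps, fluor), key=lambda p: p[1])
    if height >= min_peak:
        return True, peak_temp
    return False, None


if __name__ == "__main__":
    temps = melt_temperatures()
    # Synthetic positive pool: sigmoid loss of fluorescence around 80 degrees C
    positive = [1.0 / (1.0 + math.exp((t - 80.0) / 0.8)) for t in temps]
    # Synthetic negative pool: flat background line
    negative = [0.02 for _ in temps]
    print(call_pool(temps, positive))
    print(call_pool(temps, negative))
```

In a multiplex reaction the same derivative trace would be expected to show one peak per amplicon, which is why the text requires markers with distinguishable melt profiles.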
The same approach is applicable to organisms of much larger genome sizes (see Table 2). For example, using the triplex MT-PCR-HRM option in which three loci exhibiting non-overlapping melt profiles are combined into a single reaction, only 45 PCRs are required to screen approximately 480,000 clones of the 16 Gbp wheat genome for each of three markers represented in (an) individual BAC(s) (Table 2). Thus, this strategy potentially opens the possibility for high-throughput, low-cost screening across even very large genomes. Besides the integration of genetic and physical maps, this protocol has the potential to support ongoing genome sequencing projects, e.g. for barley http://barleygenome.org/, wheat http://www.wheatgenome.org/, potato http://www.potatogenome.net/, swine http://piggenome.org/, cattle http://www.bovinegenome.org etc.
Here, the new MT-PCR-HRM technology has been applied to BAC matrix pools for simple, low-cost and high-throughput anchoring of genetic markers to physical maps. Using a BAC library of the model plant Arabidopsis thaliana, we were able to show that MT-PCR-HRM achieves reliable amplification of single copy targets from freezing stocks equivalent to super pools of twenty five 384-well plates. A two-dimension pooling strategy can locate a target clone within a superpool in 28 reactions. The method also allows effective multiplexing to screen, in parallel, 3 genetic markers in one MT-PCR-HRM reaction and is particularly suited for genome characterisation initiatives.
Primer sequences used to amplify A. thaliana genetic markers.
Preparation of BAC pools from freezing stock of genomic BAC library
We used the Arabidopsis BAC library Mi/P1 in this protocol as an exemplar. The library plates stored at -80°C were thawed for 30 minutes and handled carefully in a flow cabinet to avoid contamination with clones from adjoining wells of the microtiter plate or exogenous contamination. After a careful spin-down of the plates, paper wetted with 70% ethanol is used to clean the lips and walls of the plates. Multichannel pipettes or robotic pipetting systems are used to take 5 μL of each colony for pooling into single, matrix or super pools.
Pre-amplification increases the concentration of the specific DNA templates in the freezing stock pools and therefore improves screening efficiency in the next step by avoiding false negative or false positive results. Modified MT-PCR is performed in 20 μL containing 10 μL of the supplied 2× Biomix (Biomix kit, Bioline), 2 μL of freezing stock from the super pool, 5 μM of each primer (forward and reverse) for up to 50 multiplexed markers, and 4 mM MgCl2. Cycling conditions are 95°C for 10 min, then 20 cycles of 94°C for 20 s, 55°C for 30 s and 72°C for 1 min. The products are diluted in HPLC grade water to a final volume of 100 μL and stored at -20°C. To demonstrate the high multiplexing capacity of pre-amplification directly from pooled freezing stocks, 47 primer pairs (Additional file 1) from the list of 107 markers mapped to the 1.9 Mb FCA region of A. thaliana were combined with the three primer pairs shown in Table 3 in one pre-amplification reaction. Subsequent application of the latter 3 primer pairs for selective MT-PCR-HRM screening of the BAC library identified the BACs harbouring the 3 markers.
To identify super pools (Figure 1a) that contain the positive BAC for the corresponding marker, the MT-PCR-HRM reaction (10 μL) for the primer pair in question is performed in 0.1 mL tubes containing 2 μL of the diluted pre-amplification product, 5 μL of SensiMixPlus SYBR (Quantace) and 5 μM of each forward and reverse primer. The '3-step PCR with melt' programme should be set up on the Rotor-Gene 6000 (Qiagen) with the following conditions: 95°C for 10 min, followed by 35 cycles of 95°C for 20 s, 57°C for 30 s and 72°C for 50 s. To acquire fluorescence at the 72°C step of each cycle, the operator must select the 'green' channel. High resolution melting analysis is performed with a ramp from 65°C to 90°C, rising by 0.3°C per step, with a 90 s pause at the pre-melt condition before the first step and a 2 s pause at each step thereafter. The 'green' channel must also be selected to acquire the melting fluorescence. The instrument software displays the PCR run in real time, the amount of PCR product at the plateau phase and the subsequent melt curves. First order differential plots of the melt curves of the PCR product are created by the software provided with the Rotor-Gene™ 6000.
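As a rough check on run times, the programmed hold times of the two cycling programmes can be summed. The sketch below uses only the hold times stated in the protocol; temperature ramping between steps, acquisition overhead and sample handling are ignored, so real runs will take longer. The function names are hypothetical.

```python
def cycling_seconds(initial_hold_s, cycles, step_holds_s):
    """Sum of programmed hold times for an initial hold plus repeated cycles."""
    return initial_hold_s + cycles * sum(step_holds_s)


def melt_seconds(start_c, stop_c, step_c, premelt_hold_s, step_hold_s):
    """Hold time of a stepped melt: one pre-melt pause plus a pause per step."""
    steps = round((stop_c - start_c) / step_c)
    return premelt_hold_s + steps * step_hold_s


# Pre-amplification: 95 C 10 min, then 20 cycles of 20 s / 30 s / 60 s
PREAMP_S = cycling_seconds(600, 20, [20, 30, 60])
# Selective PCR: 95 C 10 min, then 35 cycles of 20 s / 30 s / 50 s
PCR_S = cycling_seconds(600, 35, [20, 30, 50])
# HRM: 65 C to 90 C in 0.3 C steps, 90 s pre-melt pause, 2 s per step
MELT_S = melt_seconds(65.0, 90.0, 0.3, 90, 2)

if __name__ == "__main__":
    for name, seconds in [("pre-amplification", PREAMP_S),
                          ("selective PCR", PCR_S),
                          ("HRM melt", MELT_S)]:
        print(f"{name}: {seconds} s ({seconds / 60:.1f} min)")
```

On hold times alone, pre-amplification comes to about 47 min and the selective PCR with melt to about 73 min, which is compatible with the run time of under 90 minutes and the overall figure of around 3 h quoted elsewhere in the article once ramping and handling are added.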
To identify an individual plate, MT-PCR-HRM screening is performed with freezing stocks of matrix pools as template (Figure 1b). Multiplex MT-PCR-HRM with up to three markers is possible as long as distinguishable melting curves can be obtained (see Figure 4). MT-PCR-HRM components and conditions are as in the super pool step. Once the positive plates have been identified for every marker, the IDs of the positive BAC clones (Figure 1d) are determined by PCR-HRM using 2-D single plate pooling (Figure 1c), in which clones are pooled across rows and across columns. Markers that yield positive signals in the same plate can be combined in a multiplex PCR. PCR-HRM components and conditions are as in the super pool step.
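The way row and column pools resolve a clone address can be sketched as a simple intersection. This example assumes the standard 16 × 24 layout of a 384-well plate (rows A to P, columns 1 to 24) with one pool per row and one per column; the exact single-plate pooling scheme used in this protocol is the one shown in Figure 1c and may group wells differently. The function name is hypothetical.

```python
import string

ROWS = string.ascii_uppercase[:16]   # rows A..P of a 384-well plate
COLUMNS = range(1, 25)               # columns 1..24


def candidate_wells(positive_rows, positive_columns):
    """Wells lying at the intersection of positive row and column pools.

    One positive row and one positive column give a unique address; more than
    one of either gives several candidates that need individual confirmation.
    """
    for row in positive_rows:
        if row not in ROWS:
            raise ValueError(f"unknown row: {row!r}")
    for col in positive_columns:
        if col not in COLUMNS:
            raise ValueError(f"unknown column: {col!r}")
    return [f"{row}{col}" for row in positive_rows for col in positive_columns]


if __name__ == "__main__":
    print(candidate_wells(["C"], [7]))            # unambiguous address
    print(candidate_wells(["C", "K"], [7, 19]))   # two clones: four candidates
```

When a plate holds more than one positive clone for a marker, the intersection is ambiguous and the candidate wells would need to be tested individually.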
With the list of positive BAC clones for the marker in question, the anchoring process is completed.
We thank Biohybrids International Ltd and Sumatra Biosciences for the funding of this work.
- Beyer A, Bandyopadhyay S, Ideker T: Integrating physical and genetic maps: from genomes to interaction networks. Nat Rev Genet. 2007, 8 (9): 699-710. 10.1038/nrg2144.
- Griffiths S, Sharp R, Foote TN, Bertin I, Wanous M, Reader S, Colas I, Moore G: Molecular characterization of Ph1 as a major chromosome pairing locus in polyploid wheat. Nature. 2006, 439 (7077): 749-752. 10.1038/nature04434.
- Feuillet C, Travella S, Stein N, Albar L, Nublat A, Keller B: Map-based isolation of the leaf rust disease resistance gene Lr10 from the hexaploid wheat (Triticum aestivum L.) genome. Proc Natl Acad Sci USA. 2003, 100 (25): 15253-15258. 10.1073/pnas.2435133100.
- Druka A, Kudrna D, Kannangara CG, von Wettstein D, Kleinhofs A: Physical and genetic mapping of barley (Hordeum vulgare) germin-like cDNAs. Proc Natl Acad Sci USA. 2002, 99 (2): 850-855. 10.1073/pnas.022627999.
- Asakawa S, Abe I, Kudoh Y, Kishi N, Wang Y, Kubota R, Kudoh J, Kawasaki K, Minoshima S, Shimizu N: Human BAC library: construction and rapid screening. Gene. 1997, 191 (1): 69-79. 10.1016/S0378-1119(97)00044-9.
- Green ED, Olson MV: Chromosomal region of the cystic fibrosis gene in yeast artificial chromosomes: a model for human genome mapping. Science. 1990, 250 (4977): 94-98. 10.1126/science.2218515.
- Yim YS, Moak P, Sanchez-Villeda H, Musket TA, Close P, Klein PE, Mullet JE, McMullen MD, Fang Z, Schaeffer ML: A BAC pooling strategy combined with PCR-based screenings in a large, highly repetitive genome enables integration of the maize genetic and physical maps. BMC Genomics. 2007, 8: 47. 10.1186/1471-2164-8-47.
- Klein PE, Klein RR, Cartinhour SW, Ulanch PE, Dong J, Obert JA, Morishige DT, Schlueter SD, Childs KL, Ale M: A high-throughput AFLP-based method for constructing integrated genetic and physical maps: progress toward a sorghum genome map. Genome Res. 2000, 10 (6): 789-807. 10.1101/gr.10.6.789.
- Farrar K, Donnison IS: Construction and screening of BAC libraries made from Brachypodium genomic DNA. Nat Protoc. 2007, 2: 1661-1674. 10.1038/nprot.2007.204.
- Stanley KK, Szewczuk E: Multiplexed tandem PCR: gene profiling from small amounts of RNA using SYBR Green detection. Nucleic Acids Res. 2005, 33 (20): e180. 10.1093/nar/gni182.
- Krause J, Dear PH, Pollack JL, Slatkin M, Spriggs H, Barnes I, Lister AM, Ebersberger I, Paabo S, Hofreiter M: Multiplex amplification of the mammoth mitochondrial genome and the evolution of Elephantidae. Nature. 2006, 439 (7077): 724-727. 10.1038/nature04432.
- Wojdacz TK, Dobrovic A, Hansen LL: Methylation-sensitive high-resolution melting. Nat Protoc. 2008, 3 (12): 1903-1908. 10.1038/nprot.2008.191.
- Gundry CN, Vandersteen JG, Reed GH, Pryor RJ, Chen J, Wittwer CT: Amplicon melting analysis with labeled primers: a closed-tube method for differentiating homozygotes and heterozygotes. Clin Chem. 2003, 49 (3): 396-406. 10.1373/49.3.396.
- Reed GH, Kent JO, Wittwer CT: High-resolution DNA melting analysis for simple and efficient molecular diagnostics. Pharmacogenomics. 2007, 8 (6): 597-608. 10.2217/14622416.8.6.597.
- Do H, Krypuy M, Mitchell PL, Fox SB, Dobrovic A: High resolution melting analysis for rapid and sensitive EGFR and KRAS mutation detection in formalin fixed paraffin embedded biopsies. BMC Cancer. 2008, 8: 142. 10.1186/1471-2407-8-142.
- Herrmann MG, Durtschi JD, Bromley LK, Wittwer CT, Voelkerding KV: Amplicon DNA melting analysis for mutation scanning and genotyping: cross-platform comparison of instruments and dyes. Clin Chem. 2006, 52 (3): 494-503. 10.1373/clinchem.2005.063438.
- Yao-Guang L, Norihiro M, Alejandro VT, Robert FW: Generation of a high-quality P1 library of Arabidopsis suitable for chromosome walking. Plant J. 1995, 7 (2): 351-358. 10.1046/j.1365-313X.1995.7020351.x.
- Thangavelu M, James AB, Bankier A, Bryan GJ, Dear PH, Waugh R: HAPPY mapping in a plant genome: reconstruction and analysis of a high-resolution physical map of a 1.9 Mbp region of Arabidopsis thaliana chromosome 4. Plant Biotechnol J. 2003, 1 (1): 23-31. 10.1046/j.1467-7652.2003.00001.x.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.