- Research article
- Open Access
BAC library resources for map-based cloning and physical map construction in barley (Hordeum vulgare L.)
BMC Genomics volume 12, Article number: 247 (2011)
Although second generation sequencing (2GS) technologies allow re-sequencing of previously gold-standard-sequenced genomes, whole genome shotgun sequencing and de novo assembly of large and complex eukaryotic genomes is still difficult. Availability of a genome-wide physical map is therefore still a prerequisite for whole genome sequencing for genomes like barley. To start such an endeavor, large insert genomic libraries, i.e. Bacterial Artificial Chromosome (BAC) libraries, which are unbiased and representing deep haploid genome coverage, need to be ready in place.
Five new BAC libraries were constructed for barley (Hordeum vulgare L.) cultivar Morex. These libraries were constructed in different cloning sites (Hind III, EcoR I, Mbo I and BstX I) of the respective vectors. In order to enhance unbiased genome representation and to minimize the number of gaps between BAC contigs, which are often due to uneven distribution of restriction sites, a mechanically sheared library was also generated. The new BAC libraries were fully characterized in depth by scrutinizing the major quality parameters such as average insert size, degree of contamination (plate wide, neighboring, and chloroplast), empty wells and off-scale clones (clones with <30 or >250 fragments). Additionally a set of gene-based probes were hybridized to high density BAC filters and showed that genome coverage of each library is between 2.4 and 6.6 X.
BAC libraries representing >20 haploid genomes are available as a new resource to the barley research community. Systematic utilization of these libraries in high-throughput BAC fingerprinting should allow developing a genome-wide physical map for the barley genome, which will be instrumental for map-based gene isolation and genome sequencing.
Bacterial artificial chromosome (BAC) libraries are the large DNA insert libraries of choice and an indispensible tool for map based cloning, physical mapping, molecular cytogenetics, comparative genomics and genome sequencing. In contrary to their name, BACs are not artificial chromosomes per se, but rather are artificial bacterial F factor derived constructs . Although BACs could carry inserts approaching 500 Kb in length, insert sizes are typically between 80 and 200 Kb in plants [2–8]. Cloning into BAC vectors rarely leads to chimeric or rearranged inserts [9–13] due to the presence of F factor genes (parA and parB) that prevent bacteria from maintaining more than one BAC simultaneously. An additional advantage of BAC clones is their easy manipulation and propagation compared to viral or yeast-vector based systems [14–16]. Consequently BACs have supplanted YACs as the dominant vector for large insert libraries and have been abundantly used in large-scale physical mapping projects [17–21]. Physical maps are pivotal for whole genome sequencing strategies of large and complex genomes. They are also instrumental to the scientific community for gene isolation [21, 22]. A genome-wide physical map of the maize genome was built as a basis for genome sequencing . A chromosome-specific BAC library strategy has been adopted for bread wheat (Triticum aestivum L.) to cope with the presence of three highly related homoeologous genomes [20, 24]. For the diploid barley (Hordeum vulgare L.) genome, the International Barley genome Sequencing Consortium (IBSC)  set out to develop a deep coverage well ordered whole genome physical map  as a platform for trait isolation and genome sequencing.
Large insert genomic libraries which are unbiased and representing few folds of the haploid genome are a key factor for successful generation of a physical map . BAC libraries with very large inserts can be readily constructed with the partial digestion method; however, unbiased large-insert BAC libraries may be built only from mechanically sheared high molecular weight genomic DNA in order to generate random fragments across the genome . A synergistic approach of combining libraries created by different methods will help in reducing gaps in the physical map that may result from uneven distribution of restriction sites of the employed restriction endonucleases. BAC maps which provided the basis for genome sequencing [18, 23, 27, 28] benefited immensely by combining multiple libraries.
Until recently, four BAC libraries of barley have been published. One was derived from a North American six-rowed malting variety 'Morex' with 313,344 gridded clones (6.3-fold haploid genome coverage . Two further libraries have been reported for the cultivars Haruna Nijo and Cebada Capa. More recently a fourth library was constructed from a doubled haploid barley line CS134 derived from a cross between the Australian malting variety 'Clipper' and the Algerian landrace Sahara 3771 . It is noteworthy that all these libraries have been extensively used for characterizing and isolating genomic regions of interest [31–34]. However, for barley in general and Morex in particular, the depth of available resources (haploid genome coverage, diverse restriction enzymes, etc.) was far too shallow to provide raw material for a genome wide physical map.
Here, we report on the development of five BAC libraries derived from cultivar 'Morex', which has been selected by IBSC as the reference genotype for genome sequencing. The aim of IBSC was to generate BAC resources by different complementing approaches in order to reach sufficient and synergistic genome coverage for a representative whole genome physical mapping. The new libraries which are publicly available are described here.
Barley seeds of the progeny "Morex 2003#9" kindly provided by Professor Patrick Hayes at Oregon State University, USA, was used in the library construction. About 200-400 seeds were grown under green house conditions. For the isolation of nuclei for high molecular weight (HMW) DNA preparation, etiolated leaves were harvested from 4-6 weeks old plants.
Construction of BAC libraries from partially restricted DNA
The libraries HVVMRXALLhB, HVVMRXALLhC, HVVMRXALLmA and HVVMRXALLeA were constructed from partially digested and size fractionated high molecular weight (HMW) DNA following previously published procedures [32, 35, 36] (Table 1). In brief, purified DNA (150 Kb-500 Kb) was ligated to the linearized and dephosphorylated vector. The ligation mixture was transformed into competent cells by electroporation. White recombinant colonies were selected on LB plates containing 12.5 - 20 μg/μl chloramphenicol, 90 μg/μl 5-Bromo-4-Chloro-3-Indolyl-Beta-D-Galacto-Pyranoside (X-Gal), 90 μg/μl Isopropyl-beta-D-Thiogalactopyranoside (IPTG), and picked robotically.
Recombinant clones were transferred into individual wells of microtiter plates, grown and then stored at -80°C. Library HVVMRXALLrA (Table 1) was produced from mechanically sheared DNA as previously described [35, 37]. Briefly, the HMW DNA (at least 20 μg) plugs were melted at 75°C, mechanically sheared and size fractionated on a Clamped Homogeneous Electrical Field (CHEF) gel. The large DNA fragments were then subjected to end-repairing and polished by Lucigen's DNA terminator kit. The "polished" blunt-ended DNA was ligated to a BstX I linker to create 5' protruding, non-complementary (CACA) ends according to manufacturer's instruction. The linker-ligated large DNA fragments were size-fractionated by pulsed-field gel electrophoresis, permitting the simultaneous removal of excess free linker and isolation of the sized genomic fragments (100~200 Kb). The size-fractionated DNA retains 5' extending ends, which can be ligated to the complementary 5' (TGTG) ends of the BstX I digested pSMART-BAC cloning vector. Electroporation of the BAC ligation mixture was performed and BAC clones were randomly picked for BAC DNA preparation, Not I digestion and insert size check. The final BAC library was assembled from transformed ligations that delivered clones with average insert size of 100 Kb or larger.
DNA isolation of plasmid-DNA
BAC-Plasmid DNA was isolated in a semi-automated approach utilizing NucleoSpin 96 Flash kit (Macherey&Nagel, Germany). Bacterial cultures were inoculated with a 96 pin replicator directly from glycerol stocks (384-well storage plates) into deep-well plates (96-well; Macherey&Nagel, Germany) containing 1.3 ml/well of either 2× YT-medium  (HVVMRXALLhA, HVVMRXALLeA, HVVMRXALLhB, HVVMRXALLhC), or 2x LB-medium  (HVVMRXALLmA, HVVMRXALLrA), respectively. In order to introduce positive and negative controls to each culture plate, two pins were removed from the replicator. After inoculating 94 clones, a positive control clone (HVVMRXALLhA0318G23) was introduced manually to the well H01 whereas well H12 was not inoculated with any clone thus serving as a negative control to monitor cross contaminations from the inoculation procedure. The cultures were grown at 37°C for 16-22 h agitated at 250 rpm on an orbital shaker (Infors AG, Switzerland). Cells were harvested by centrifugation (Heraeus Multifuge 35-R, thermo electron cooperation) of culture plates at 2,500 rpm for 15 min. The BAC DNA was isolated according to the manufacturer's instructions and eventually suspended in 50 μl molecular de-ionized water.
High information content fingerprinting
High information content fingerprinting (HICF) was essentially performed according to published procedures . In brief, 42 μl of BAC DNA was inoculated with 8 μl of a restriction mix consisting of two units of Bam HI, Eco RI, Xba I, Xho I and Hae III (New England Biolabs NEB, Germany), 1× NEB Buffer 2, 1× BSA, 0.5 μg DNAase-free RNase A and 0.02% beta-mercaptoethanol for 3 h at 37°C. Ten μl of restricted product was incubated with the labeling cocktail containing 0.3 μl SNaPshot Multiplex Reaction Mix (Applied Biosystems, Germany), 2 μl NEB-Buffer 2, 2.5 μl 100 mM Tris/HCl (pH 9.0) and 5.2 μl de-ionized water (1 h at 65°C).
Fragmented and labeled DNA was precipitated by adding 5 μl 2.5 M sodium-acetate and 100 μl 99% ethanol (-20°C) followed by incubation at -80°C for 15 min. DNA was collected by centrifugation at 4,200 rpm for 30 min. The pellet was washed with 100 μl 70% ethanol, air dried and re-suspended in 9.8 μl Hi-DiTM Formamide and 0.2 μl GS1200LIZ size standard (Applied Biosystems, USA). The samples were denatured at 95°C for 5 min before loading to the capillary sequencer ABI3730xl (Applied Biosystems). The capillary electrophoresis was performed on 50 cm capillary arrays using ABI's default run module for 108 min 3730 running-buffer with EDTA and 3730 POP-7TM polymer (Applied Biosystems, Germany).
Analysis of fingerprinting data
Peak areas, peak heights and fragment sizes of each BAC fingerprint profile were collected by ABI's data collection program. The raw data was assessed for sizing quality using GeneMapper v4.0 (Applied Biosystems, Germany). An electronic fingerprint was assigned with the software FPPipeliner v1.0 and further analyzed for organelle contamination, neighboring, and plate-wide contamination with FPMiner (BioinforSoft LLC, USA).
The software was also used for automatic elimination of vector borne fragments in all fingerprint profiles. Furthermore FPminer was used to distinguish the peaks between true fragments and those originating from background noise or 'snapshot' artifacts. The edited profiles were exported as sizes files in order to perform contig assembly with the assembly program FPC V9.0 .
Insert size determination
For insert size determination 10 μl of isolated BAC-plasmid-DNA was digested for 4 h at 37°C with 5 Units Not I (Fermentas, Germany) in 1× Buffer 3 containing 1× BSA. The digested DNA was separated together and sized with a low range Pulsed Field Gel Electrophoresis (PFGE) marker (New England Biolabs) by PFGE (CHEF DRIII, Biorad, Germany) on 1% agarose gels in 0.5 × TBE; (14°C, 6.0 V/cm, angle = 120, initial switch time 5 sec, final switch time 15 sec, run time = 16 h and ramping = linear).
Screening of BAC libraries
Screenings of all BAC libraries were performed on high density colony filters (see additional file 1). Hybridizations were performed as described previously . Membranes were prehybridized with 6× SSC, 5× Denhardt and 1 mg of denatured Salmon-sperm (Stratagene, USA) for 3 h at 68°C. Approximately 25 ng of probe was labeled separately with Megaprime kit (GE Healthcare, USA) and purified with Centrisep™ Columns (Applied Biosystems, Germany) according to manufacturer's instructions. Prior to hybridizations the probes were pooled and denatured at 95°C for 5 min followed by snap cooling on ice for another 5 min. Hybridizations were performed for at least 16 h at 68°C. Subsequently, membranes were washed once in buffer 1 (2× SSC, 0.1% SDS) followed by buffer 2 (1× SSC, 0.1% SDS) each at 68°C for 30 min. The filters were exposed for 4 h on imaging plates (Fuji film, Germany) and scanned on a FLA-3000 Phosphoimager (Fuji film, Germany). Positive BAC coordinates were identified with the software HDRF (Incogen, USA) and confirmed either by colony PCR or via colony hybridization . Barley probes were designed from EST-sequences originating from the HarvEST Assembly 35  (see additional file 2). Additionally 17 wheat probes were hybridized to the filter set of library HVVMRXALLhC. Prior to hybridization, quality and the copy number of the wheat probes was evaluated on Southern blots containing DNA from wheat nulli-tetrasomic lines as described by Pallotta et al., 2000 .
Ordering of BAC libraries and filters
The library HVVMRXALLhA was published before  and can be obtained from Clemson University Genomics Institute (CUGI) . The libraries HVVMRXALLhB, HVVRMXALLeA, HVVMRXALLmA, and HVVMRXALLrA are available from the Centre National de Ressources Génomiques Végétales (CNRGV) . The high density colony arrays are available for the respective BAC libraries from the two resources centers CUGI and CNRGV (see additional file 1). The HVVRMXALLeA library and its filters can also be ordered from CUGI . Library HVVMRXALLhC and filter sets were constructed and screened at Australian Center of Plant Functional Genomics (ACPFG, Adelaide, Australia).
Results and Discussion
BAC libraries are the foundation for map-based gene isolation and physical map construction for un-sequenced genomes. Such physical maps were instrumental for sequencing several important plant genomes like rice  and maize [6, 46]. Even for smaller plant genomes that are principally amenable for whole genome shotgun sequencing (WGS), the additional support provided by a physical map greatly facilitated ordering of the sequence contigs into scaffolds or super-scaffolds [47–49]. In crop species with genomes larger than 5 Gbp like barley, access to a physical map was proposed to be crucial to endeavor whole genome sequencing . Additionally, a physical map would facilitate tremendously the isolation of genes underlying important traits in the Triticeae species. The systematic and high-throughput characterization of libraries is a pre-requisite for developing physical maps.
Diverse BAC libraries to ensure high genome representation
Five new BAC-libraries of barley cultivar Morex were constructed (see Table 1). Of those, four libraries were constructed from partially digested high-molecular weight (HMW) DNA. Two of the libraries (HVVMRXALLhB and HVVMRXALLhC) were derived by partial digestion with enzyme Hin dIII, whereas the remaining was derived from partial digest with Eco RI (HVVMRXALLeA) or Mbo I (HVVMRXALLmA) (Table 1), respectively. The enzymes Hind III and Eco RI recognize 6 bp palindromes whereas Mbo I cleaves at a 4 bp palindromic site. A fifth library was obtained from cloning mechanically sheared HMW DNA.
The rationale behind constructing independent BAC libraries by partial digestion with different restriction endonucleases is that the frequency of occurrence of a specific palindrome in the DNA sequence is a function of the bp-composition of a species genome and of the recognition site . Selecting multiple enzymes with a different recognition sequence would limit the risk of under-representation of specific regions of the genome of interest in the resulting BAC map . The strategy of combining different BAC libraries was previously followed in other physical mapping projects such as soybean, bovine, Brassica rapa and maize [23, 50–52]. To further overcome the bias of under-represented regions in libraries made of partially digested DNA, one BAC library was generated from mechanically sheared DNA (HVVMRXALLrA, Table 1). As described for rice , gaps in physical maps may occur because of non-random distribution of cloning sites, unstable DNA structures in E. coli hosts like Z-DNA, long inverted terminal repeats and AT-rich sequences [54, 55]. Closure of such gaps is crucial to reach completion of a physical map. For example random sheared fosmid clones enabled the filling of gaps in the rice physical map in regions where there was no restriction site for BAC libraries . Interestingly these clones contained genes of agronomical importance. Furthermore, its demonstrated that megabase-size DNA lacking any restriction site can be mechanically sheared as well as the DNA from other genomic regions  resulting in evenly distributed BACs across the genome. Therefore such libraries hold a high potential of gap closure. For example the random sheared BACs of the Arabidopsis thaliana genome played a crucial role in centrometric gap closure of the Arabidopsis physical map . Therefore, generating a single random sheared BAC library with sufficient genomic coverage provides an important BAC resource and a complementing tool for a generic physical map of the barley genome.
BAC libraries provide 25-fold genome coverage
Genome representation of a given BAC library is important as it allows predicting the potential to find any given gene at least on a single BAC clone. Genome representation is a function of the overall number of unique clones and their respective insert sizes. Insert sizes of the clones were determined by Not I digestion and Pulsed Field Gel Electrophoresis (PFGE) of about 1330 clones (Table 2, Figure 1) as well as by HICF of ~10,000 BACs for each library (Figure 2).
Clones from the HVVMRXALLeA library contained the second largest average insert size of 125 Kb, but insert sizes showed more variation around the mean value as determined by HICF (Figure 3). Libraries HVVMRXALLhA and HVVMRXALLhB contributed clones with medium insert sizes between 97 Kb and 100 Kb. For these two libraries the variation of insert size and average number of fragments around the mean value was more distinct (Table 1, Figure 3). The library HVVMRXALLrA obtained from randomly sheared DNA showed the smallest average insert size of 92 Kb. Each library represented between 2.4 to 6.6-fold the haploid barley genome (Table 1). Together with the previously published BAC library of Yu et al. (2000) , more than 25-fold combined haploid genome coverage is available now in BAC libraries of the six-rowed malting barley cultivar Morex (Table 1). The probability to recover any specific sequence of interest is > 99% across all libraries .
Irrespective of the BAC cloning method (restriction enzyme, DNA-shearing; see above) the average BAC insert size has a major impact on the contribution to the physical map.
There is a positive relation between the BAC insert size and number of fragments depending on the chosen fingerprinting technique [54, 57]. During this study, for the investigated barley libraries, we observed a positive correlation between "insert size" and "number of fragments" as mentioned before (see Table 1, Figure 3). Furthermore, Meyers et al. (2004)  investigated the contribution of overall fragment numbers per clone vs reliability of clone overlap at a given suslton score, a key parameter used in FPC (F ingerp rint C ontig ). It was observed that increasing total fragments per clone in turn increases the overlapping BACs at a given Sulston score thus decreasing the occurrence of false-positives. But there is potential fragment size saturation where an increased number of bands does produce false overlaps in a contig assembly .
After assembling the BACs into contigs, the Minimal Tiling Path (MTP) selection will be the basis for BAC-by-BAC sequencing. There is a preference of selecting large insert clones  which has the advantage that less BACs must be chosen for the MTP and a maximum of sequence information could be obtained from each BAC . But also the risk of a chimeric or contaminated BAC should be kept in mind .
For the maize physical map large insert sized BACs were used as "seed" BACs in the maize MTP construction, which provided the highest information content to confirm overlaps between adjacent BACs . For some genome regions large or medium-size clones generated by different methods and or techniques (e.g. BACs from a different BAC library, fosmids) were chosen to fill gaps indicating that depending on the sequence, different type of clones were needed to cover the genome . Therefore the five BAC libraries described in this study provide an optimal resource for whole genome physical mapping of the barley genome with minimal gaps.
Quality parameters of BAC resources
During the cloning procedure of a BAC library there is a risk of over-representation of organelle DNA which is mixed in various amounts with isolated nuclei in the process of preparing high molecular weight (HMW) DNA. A random clone set of each BAC-library (10,279 -10,685 samples) was investigated by HICF (see above). This also included a BAC clone known to represent the entire chloroplast of cv. Morex . Including this clone into HICF provided a reference fingerprint which then could be compared to all other high-quality BAC fingerprints. At a threshold of higher than 50% identical fragments to the chloroplast control, BAC clones were flagged as originating mainly from chloroplast DNA. The highest percentage of chloroplast-BACs (1.85%) was found in the library HVVMRXALLhC (see Table 3). Medium-level chloroplast-contamination was observed for the libraries HVVMRXALLhB, HVVMRXALLhA and HVVMRXALLrA with 0.92%, 0.78% and 0.45%, respectively. The smallest amount of chloroplast-DNA contamination was observed in HVVMRXALLeA (0.11%) and HVVMRXALLmA (0.07%). Due to the lack of sequenced BAC clones that represent the entire mitochondria of barley, contamination of BAC libraries by mitochondrial DNA was not determined.
During the process of clone picking, plate replicating and re-arraying of clones there is a risk of introducing contaminations between BAC clones even if lab automation is used. Such contaminations maybe observed by fragment pattern identity of neighboring clones within a multi-well plate. The potential neighboring and/or plate-wide contaminations were determined by comparing HICF profiles of the ~10,000 clones fingerprinted for each library. If the overall fragment identity of two clones at neighboring position within one plate or at identical position in subsequent plates of the library is higher than 50%, these clones were flagged. The highest rate of potential neighbor (2.73%) and plate-wide (7.28%) contamination was observed in library HVVMRXALLhA. For this library no values for these two parameters were given by Yu et al., 2000 . During this study we used a copy made several years ago which in between has been extensively used for other purposes. Therefore we cannot rule out that contaminations introduced over time during plate handling.
Potential neighbor contaminations were found to be in the range between 1.01% and 2.09% for the other libraries and plate-wide contaminations were as high as 1.44% to 5.76% (Table 3).
Contaminated clones may be identified also by overall fragment number in HICF analysis. If a single glycerol stock would contain two different BACs of similar size, HICF analysis would indicate twice the number of fragments as compared to a normal clone of the same library.
Besides contaminations and clones with too few or too many fingerprint fragments, empty vector clones or non-viable clones can compromise the quality of a BAC library since such "empty" wells in BAC-library plates increases the preparation costs and increases the need for larger number of clones to be processed in fingerprinting if used for physical map construction. A very small fraction of "empty" wells was found for all libraries (> 0.35%-2.9%, Table 3).
The number of fragments after HICF, is an exclusion parameter for clones during systematic physical map construction. In contrast, BAC clones with very small inserts would provide too little information from HICF for being valuable for physical mapping. Clones with less than 30 fragments would have a very small overlap to other clones and would therefore most likely stay as singletons or overlaps would remain uncertain. If a high number of small inserts were obtained in a library, size selection of HMW DNA before cloning would probably be inefficient, because small fragments tend to co-migrate with larger fragments in highly concentrated samples  and sheared large DNA fragments are far less efficient to be cloned. Therefore both cases - too many and too few fragments compared to the average - would need to be filtered in a systematic physical mapping project. In this study the average number of fragments over all libraries was 98.6. The percentage of clones which fell into the range of <30 and >250 fragments varied among the libraries (Table 3). It cannot be ruled out that large BAC clones containing large numbers of highly conserved tandem repeats, centromeric and telomeric repetitive sequences could potentially produce less than 30 fragments by HICF (Cheng-cang Wu, unpublished data). This may explain partly the highest percentage (SHC: 14.58% in Figure 3) of clones with <30 or >250 fragments found in the sheared BAC library (HVVMRXALLrA) which is expected to cover regions underrepresented in libraries obtained by partial digest of HMW DNA. However, further experimentation is required to test this hypothesis.
Experimental validation of genome representation
Theoretical assumptions about genome coverage of newly developed BAC libraries based on clone numbers and average insert sizes of sample clones remains uncertain since such analyses do not reveal potential redundancy in libraries introduced during the cloning procedure (i.e. overgrowth of transformation assays). Therefore, high density colony arrays of all libraries were screened with a set of single- or low-copy gene probes (see additional file 3 and 4). The library HVVMRXALLhA was excluded from this screening, since it was already intensively characterized in previous studies [29, 63, 64].
The libraries HVVMRXALLeA, HVVMRXALLmA, HVVMRXALLrA, and HVVMRXALLhB were hybridized with ten RFLP-markers . These markers were tested before by Southern analysis to represent single or low-copy sequences (data not shown) and were known to be distributed on barley chromosomes 2H, 3H, 5H - 7H (additional file 3). A single colony filter per library comprising 55,296 clones was probed (additional file 1). On average, 1 to 7 BAC addresses could be identified (Figure 4A).
None of these four libraries showed any significant pattern of library amplification since the number of positive signals obtained correlated well with the expected number (Figure 4A, additional file 4). All clones identified by screening of high density colony filters were analyzed by HICF and for six of the ten GBR probes (GBR0048, GBR0605, GBR1550, GBR1597, GBR1790, GBR1823) all clones assembled into single contigs confirming the single-copy character of the probes. BACs identified by GBR1433, GBR1837, GBR1710 and GBR1610 assembled in two, three, nine or even ten contigs, respectively. Given the single copy nature of the probes in previous Southern analysis, the finding of two or three independent contigs per single copy probe may be explained by too little overlap of positive BAC clones in the area carrying the respective genes thus not allowing FPC to build single contigs. In the cases of larger number of contigs it is likely that such markers cross-hybridized to more paralogous genes than could be expected from the previous Southern evaluation (data not shown).
One obvious observation was that there are more hits than average for single-copy probes in the library HVVMRXALLhA; but the positive clone numbers are consistently corresponding to contig numbers (probably paralogous gene numbers) only in the sheared BAC library HVVMRXALLrA for all three low-copy probes: GBR1837, GBR1710, and GBR1610 (Figure 4A). The un-biasness of the sheared BAC library compared to the partial digestion BAC libraries may be apparent by screening more DNA probes including repetitive sequences.
The entire HVVMRXALLhC library was screened with a set of seventeen wheat EST-derived probes previously mapped to wheat chromosome 3D. Because wheat and barley genomes are closely related, probes from one species can easily be used against genomic filter of the other. These probes were first hybridized on wheat nulli-tetrasomic lines in order to verify the 3D location and afterwards on the HVVMRXALLhC-filters to identify the syntenic barley regions. In total 8 of 17 EST-Markers (p58, p67, p77, p84, p88, p119, p188, p195) gave exactly the expected number of BACs (coverage of the filter set = 3.4 x for the entire HVVMRXALLhC-library). The remaining probes revealed at least a single BAC address. On average the probes revealed 2.8 BAC addresses (Figure 4B). The copy number of the probes was calculated in wheat nulli-tetrasomic lines and therefore could differ in the barley genome due to sequence variations.
In this paper we report on the development and characterization of a set of five new publicly available BAC libraries of barley cultivar Morex - a cultivar selected by the International Barley Sequencing Consortium as reference genotype for genome sequencing . Altogether the libraries represent >25-fold the haploid genome of barley. The libraries were generated from HMW DNA partially digested with different restriction endonucleases or mechanical shearing in order to reduce the risk of genome regions being under-represented in the libraries - an aspect which would interfere with the aim of developing a physical map of the entire barley genome. Based on the analyzed quality parameters and the obtained experimental evidences it can be concluded that the new libraries (1) represent a comprehensive reagent for gene discovery and (11) can be utilized for developing a generic physical map of barley.
Peterson DG, Tomkins JP, Frisch DA, Wing RA, Paterson AH: Construction of plant bacterial artificial chromosomes (BAC) libraries: an illustrated guide. Journal of Agricultural genomics. 2000, 5:
Adam-Blondon AF, Bernole A, Faes G, Lamoureux D, Pateyron S, Grando M, Caboche M, Velasco R, Chalhoub B: Construction and characterization of BAC libraries from major grapevine cultivars. Theor Appl Genet. 2005, 110 (8): 1363-1371. 10.1007/s00122-005-1924-9.
Budiman M, Mao L, Wood TC, Wing RA: A deep-coverage tomato BAC library and prospects toward development of an STC framework for genome sequencing. Genome Res. 2000, 10 (1): 129-136.
Vilarinhos AD, Piffanelli P, Lagoda P, Thibivilliers S, Sabau X, Carreel F, D'Hont A: Construction and characterization of a bacterial artificial chromosome library of banana (Musa acuminata Colla). Theor Appl Genet. 2003, 106 (6): 1102-1106.
Wang CM, Lo LC, Feng F, Gong P, Li J, Zhu ZY, Lin G, Yue GH: Construction of a BAC library and mapping BAC clones to the linkage map of Barramundi, Lates calcarifer. BMC Genomics. 2008, 9 (139):
Yim YS, Davis GL, Duru NA, Musket TA, Linton EW, Messing JW, McMullen MD, Soderlund CA, Polacco ML, Gardiner JM, Coe EH: Characterization of three maize bacterial artificial chromosome libraries toward anchoring of the physical map to the genetic map using high-density bacterial artificial chromosome filter hybridization. Plant Physiol. 2002, 130 (4): 1686-1696. 10.1104/pp.013474.
Yuksel B, Paterson AH: Construction and characterization of a peanut Hind III BAC library. Theorl Appl Genet. 2005, 111 (4): 630-639. 10.1007/s00122-005-1992-x.
Ming R, Moore PH, Zee F, Abbey CA, Ma H, Paterson AH: Construction and characterization of a papaya BAC library as a foundation for molecular dissection of a tree-fruit genome. Theor Appl Genet. 2001, 102 (6-7): 892-899. 10.1007/s001220000448.
Boysen C, Simon MI, Hood L: Fluorescence-based sequencing directly from bacterial and P1-derived artificial chromosomes. Biotechniques. 1997, 23 (6): 978-982.
Cai L, Taylor JF, Wing RA, Gallagher DS, Woo SS, Davis SK: Construction and characterization of a bovine bacterial artificial chromosome library. Genomics. 1995, 29 (2): 413-425. 10.1006/geno.1995.9986.
Kim UJ, Birren BW, Slepak T, Mancino V, Boysen C, Kang HL, Simon MI, Shizuya H: Construction and characterization of a human bacterial artificial chromosome library. Genomics. 1996, 34 (2): 213-218. 10.1006/geno.1996.0268.
Woo SS, Jiang JM, Gill BS, Paterson AH, Wing RA: Construction and characterization of a bacterial artificial chromosome library of Sorghum-bicolor. Nucleic Acids Res. 1994, 22 (23): 4922-4931. 10.1093/nar/22.23.4922.
Venter JC, Adams MD, Myers EW, Li PW, Mural RJ, Sutton GG, Smith HO, Yandell M, Evans CA, Holt RA, Gocayne JD, Amanatides P, Ballew RM, Huson DH, Wortman JR, Zhang Q, Kodira CD, Zheng XQH, Chen L, Skupski M, Subramanian G, Thomas PD, Zhang JH, Miklos GLG, Nelson C, Broder S, Clark AG, Nadeau C, McKusick VA, Zinder N, et al: The sequence of the human genome. Science. 2001, 291 (5507): 1304-1351. 10.1126/science.1058040.
Burke DT, Olson MV: Preparation of clone libraries In yeast artificial-chromosome vectors. Methods Enzymol. 1991, 194: 251-270.
Paterson AH: The DNA revolution. Genome Mapping in Plants. Edited by: Paterson AH. 1996, San Diego: Academic Press, 7-21.
Marra MA, Kucaba TA, Dietrich NL, Green ED, Brownstein B, Wilson RK, McDonald KM, Hillier LW, McPherson JD, Waterston RH: High throughput fingerprint analysis of large-insert clones. Genome Res. 1997, 7 (11): 1072-1084.
Gregory SG, Sekhon M, Schein J, Zhao SY, Osoegawa K, Scott CE, Evans RS, Burridge PW, Cox TV, Fox CA, Hutton RD, Mullenger IR, Phillips KJ, Smith J, Stalker J, Threadgold GJ, Birney E, Wylie K, Chinwalla A, Wallis J, Hillier L, Carter J, Gaige T, Jaeger S, Kremitzki C, Layman D, Maas J, McGrane R, Mead K, Walker R, et al: A physical map of the mouse genome. Nature. 2002, 418 (6899): 743-750. 10.1038/nature00957.
McPherson JD, Marra M, Hillier L, Waterston RH, Chinwalla A, Wallis J, Sekhon M, Wylie K, Mardis ER, Wilson RK, Fulton R, Kucaba TA, Wagner-McPherson C, Barbazuk WB, Gregory SG, Humphray SJ, French L, Evans RS, Bethel G, Whittaker A, Holden JL, McCann OT, Dunham A, Soderlund C, Scott CE, Bentley DR, Schuler G, Chen HC, Jang WH, Green ED, et al: A physical map of the human genome. Nature. 2001, 409 (6822): 934-941. 10.1038/35057157.
Gonzalez VM, Garcia-Mas J, Arus P, Puigdomenech P: Generation of a BAC-based physical map of the melon genome. BMC Genomics. 2008, 11 (339):
Paux E, Sourdille P, Salse J, Saintenac C, Choulet F, Leroy P, Korol A, Michalak M, Kianian S, Spielmeyer W, Lagudah E, Somers D, Kilian A, Alaux M, Vautrin S, Berges H, Eversole K, Appels R, Safar J, Simkova H, Dolezel J, Bernard M, Feuillet C: A physical map of the 1-gigabase bread wheat chromosome 3B. Science. 2008, 322 (5898): 101-104. 10.1126/science.1161847.
Schulte D, Close TJ, Graner A, Langridge P, Matsumoto T, Muehlbauer G, Sato K, Schulman AH, Waugh R, Wise RP, Stein N: The International Barley Sequencing Consortium-at the threshold of efficient access to the barley genome. Plant Physiol. 2009, 149 (1): 142-147. 10.1104/pp.108.128967.
Stein N: Physical Mapping in the Triticeae. Genetics and genomics of the Triticeae. Edited by: Muehlbauer FCGJ. 2009, New York: Springer Publishers, 317-335.
Coe E, Cone K, McMullen M, Chen SS, Davis G, Gardiner J, Liscum E, Polacco M, Paterson A, Sanchez-Villeda H, Soderlund C, Wing R: Access to the maize genome: an integrated physical and genetic map. Plant Physiol. 2002, 128 (1): 9-12. 10.1104/pp.010953.
Safar J, Bartos J, Janda J, Bellec A, Kubalakova M, Valarik M, Pateyron S, Weiserova J, Tuskova R, Cihalikova J, Vrana J, Simkova H, Faivre-Rampant P, Sourdille P, Caboche M, Bernard M, Dolezel J, Chalhoub B: Dissecting large and complex genomes: flow sorting and BAC cloning of individual chromosomes from bread wheat. Plant J. 2004, 39 (6): 960-968. 10.1111/j.1365-313X.2004.02179.x.
International Barley genome Sequencing Consortium (IBSC) web page. [http://barleygenome.org]
Godiska R, Mead DA, Dhodda V, Hochstein R, Karsi A, Ravin N, Wu CC: Bias-free cloning of 'unclonable' DNA for simplified genomic finishing. 2008, Sudbury, MA: Jones and Bartlett Publishers
Tao QZ, Chang YL, Wang JZ, Chen HM, Islam-Faridi MN, Scheuring C, Wang B, Stelly DM, Zhang HB: Bacterial artificial chromosome-based physical map of the rice genome constructed by restriction fingerprint analysis. Genetics. 2001, 158 (4): 1711-1724.
Wu CC, Nimmakayala P, Santos FA, Springman R, Scheuring C, Meksem K, Lightfoot DA, Zhang HB: Construction and characterization of a soybean bacterial artificial chromosome library and use of multiple complementary libraries for genome physical mapping. Theor Appl Genet. 2004, 109 (5): 1041-1050. 10.1007/s00122-004-1712-y.
Yu Y, Tomkins JP, Waugh R, Frisch DA, Kudrna D, Kleinhofs A, Brueggeman RS, Muehlbauer GJ, Wise RP, Wing RA: A bacterial artificial chromosome library for barley (Hordeum vulgare L.) and the identification of clones containing putative resistance genes. Theor Appl Genet. 2000, 101 (7): 1093-1099. 10.1007/s001220051584.
Saisho D, Myoraku E, Kawasaki S, Sato K, Takeda K: Construction and characterization of a bacterial artificial chromosome (BAC) library from the Japanese malting barley variety 'Haruna Nijo'. Breeding Sci. 2007, 57 (1): 29-38. 10.1270/jsbbs.57.29.
Edwige IBS, Arnaud B, Karine B, Patricia FR, Robbie W, Beat K, Michel C, Catherine F, Boulos C: Direct targeting and rapid isolation of BAC clones spanning a defined chromosome region. Funct Integr Genomics. 2005, 5 (2): 97-103. 10.1007/s10142-004-0127-9.
Shi BJ, Sutton T, Collins NC, Pallotta M, Langridge P: Construction of a barley bacterial artificial chromosome library suitable for cloning genes for boron tolerance, sodium exclusion and high grain zinc content. Plant Breeding. 2010, 129 (3): 291-296. 10.1111/j.1439-0523.2009.01762.x.
Stein N, Perovic D, Kumlehn J, Pellio B, Stracke S, Streng S, Ordon F, Graner A: The eukaryotic translation initiation factor 4E confers multiallelic recessive Bymovirus resistance in Hordeum vulgare (L.). Plant J. 2005, 42 (6): 912-922. 10.1111/j.1365-313X.2005.02424.x.
Komatsuda T, Pourkheirandish M, He CF, Azhaguvel P, Kanamori H, Perovic D, Stein N, Graner A, Wicker T, Tagiri A, Lundqvist U, Fujimura T, Matsuoka M, Matsumoto T, Yano M: Six-rowed barley originated from a mutation in a homeodomain-leucine zipper I-class homeobox gene. Proc Natl Acad Sci USA. 2007, 104 (4): 1424-1429. 10.1073/pnas.0608580104.
Osoegawa K, Woon PY, Zhao BH, Frengen E, Tateno M, Catanese JJ, de Jong PJ: An improved approach for construction of bacterial artificial chromosome libraries. Genomics. 1998, 52 (1): 1-8. 10.1006/geno.1998.5423.
Luo M, Wing RA: An improved method for plant BAC library construction. Methods Mol Biol. 2003, 236 (3): 20-
Random Shear BAC Library Construction. [http://lucigen.com/store/skin1/images/pages/RS-BACwhitepaper.pdf]
Sambrook JRD: Molecular cloning - A laboratory manual. 2001, Cold Spring Harbor, New York: Cold Spring Harbor Laboratory Press, 3
Luo MC, Thomas C, You FM, Hsiao J, Shu OY, Buell CR, Malandro M, McGuire PE, Anderson OD, Dvorak J: High-throughput fingerprinting of bacterial artificial chromosomes using the SNaPshot labeling kit and sizing of restriction fragments by capillary electrophoresis. Genomics. 2003, 82 (3): 378-389. 10.1016/S0888-7543(03)00128-9.
Soderlund C, Humphray S, Dunham A, French L: Contigs built with fingerprints, markers, and FPCV4.7. Genome Res. 2000, 10 (11): 1772-1787. 10.1101/gr.GR-1375R.
HarvEST web page. [http://harvest.ucr.edu/]
Pallotta MA, Graham RD, Langridge P, Sparrow DHB, Barker SJ: RFLP mapping of manganese efficiency in barley. Theor Appl Genet. 2000, 101 (7): 1100-1108. 10.1007/s001220051585.
Clemson university genomics institute's (CUGI) online ordering web page. [https://www.genome.clemson.edu/online_orders]
Centre national de ressources genomiques vegetales (CNRGV) web page. [http://cnrgv.toulouse.inra.fr/en/library/barley]
International Rice Genome Seqeuncing Project. [http://rgp.dna.affrc.go.jp/IRGSP/]
Schnable PS, Ware D, Fulton RS, Stein JC, Wei FS, Pasternak S, Liang CZ, Zhang JW, Fulton L, Graves TA, Minx P, Reily AD, Courtney L, Kruchowski SS, Tomlinson C, Strong C, Delehaunty K, Fronick C, Courtney B, Rock SM, Belter E, Du FY, Kim K, Abbott RM, Cotton M, Marchetto P, Ochoa K, Jackson SM, Gillam B, et al: The B73 maize genome: complexity, diversity, and dynamics. Science. 2009, 326 (5956): 1112-1115. 10.1126/science.1178534.
Farrar K, Donnison IS: Construction and screening of BAC libraries made from Brachypodium genomic DNA. Nat Protoc. 2007, 2 (7): 1661-1674. 10.1038/nprot.2007.204.
Paterson AH, Bowers JE, Bruggmann R, Dubchak I, Grimwood J, Gundlach H, Haberer G, Hellsten U, Mitros T, Poliakov A, Schmutz J, Spannagl M, Tang HB, Wang XY, Wicker T, Bharti AK, Chapman J, Feltus FA, Gowik U, Grigoriev IV, Lyons E, Maher CA, Martis M, Narechania A, Otillar RP, Penning BW, Salamov AA, Wang Y, Zhang LF, Carpita NC, et al: The Sorghum bicolor genome and the diversification of grasses. Nature. 2009, 457 (7229): 551-556. 10.1038/nature07723.
Vogel JP, Garvin DF, Mockler TC, Schmutz J, Rokhsar D, Bevan MW, Barry K, Lucas S, Harmon-Smith M, Lail K, Tice H, Grimwood J, McKenzie N, Huo NX, Gu YQ, Lazo GR, Anderson OD, You FM, Luo MC, Dvorak J, Wright J, Febrer M, Idziak D, Hasterok R, Lindquist E, Wang M, Fox SE, Priest HD, Filichkin SA, Givan SA, et al: Genome sequencing and analysis of the model grass Brachypodium distachyon. Nature. 2010, 463 (7282): 763-768. 10.1038/nature08747.
Wu CC, Sun SK, Nimmakayala P, Santos FA, Meksem K, Springman R, Ding K, Lightfoot DA, Zhang HB: A BAC and BIBAC-based physical map of the soybean genome. Genome Res. 2004, 14 (2): 319-326. 10.1101/gr.1405004.
Snelling WM, Chiu R, Schein JE, Hobbs M, Abbey CA, Adelson DL, Aerts J, Bennett GL, Bosdet IE, Boussaha M, Brauning R, Caetano AR, Costa MM, Crawford AM, Dalrymple BP, Eggen A, Wind A, Floriot S, Gautier M, Gill CA, Green RD, Holt R, Jann O, Jones SJM, Kappes SM, Keele JW, de Jong PJ, Larkin DM, Lewin HA, McEwan JC: A physical map of the bovine genome. Genome Biol. 2007, 8 (8):
Mun JH, Kwon SJ, Yang TJ, Kim HS, Choi BS, Baek S, Kim JS, Jin M, Kim JA, Lim MH, Lee SI, Kim HI, Kim H, Lim YP, Park BS: The first generation of a BAC-based physical map of Brassica rapa. BMC Genomics. 2008, 9 (280):
Ammiraju JSS, Luo MZ, Goicoechea JL, Wang WM, Kudrna D, Mueller C, Talag J, Kim H, Sisneros NB, Blackmon B, Fang E, Tomkins JB, Brar D, MacKill D, McCouch S, Kurata N, Lambert G, Galbraith DW, Arumuganathan K, Rao KR, Walling JG, Gill N, Yu Y, SanMiguel P, Soderlund C, Jackson S, Wing RA: The Oryza bacterial artificial chromosome library resource: Construction and analysis of 12 deep-coverage large-insert BAC libraries that represent the 10 genome types of the genus Oryza. Genome Res. 2006, 16 (1): 140-147.
Meyers BC, Scalabrin S, Morgante M: Mapping and sequencing complex genomes: let's get physical'. Nat Rev Genet. 2004, 5 (8): 578-588. 10.1038/nrg1404.
Ravin NV, Ravin VK: Use of a linear multicopy vector based on the mini-replicon of temperate coliphage N15 for cloning DNA with abnormal secondary structures. Nucleic Acid Res. 1999, 27 (17): e13-10.1093/nar/27.17.e13.
Clake , Carbon : A colony bank containing synthetic CoI EI hybrid plasmid. Cell. 1976, 9 (1): 91-99. 10.1016/0092-8674(76)90055-6.
Katagiri T, Kidd C, Tomasino E, Davis JT, Wishon C, Stern JE, Carleton KL, Howe AE, Kocher TD: A BAC-based physical map of the Nile tilapia genome. BMC Genomics. 2005, 6 (89):
Shultz JL, Yesudas C, Yaegashi S, Afzal AJ, Kazi S, Lightfoot DA: Three minimum tile paths from bacterial artificial chromosome libraries of the soybean (Glycine max cv. 'Forrest'): tools for structural and functional genomics. J Plant Methods. 2006, 2 (9):
Wei FS, Stein JC, Liang CZ, Zhang JW, Fulton RS, Baucom RS, De Paoli E, Zhou SG, Yang LX, Han YJ, Pasternak S, Narechania A, Zhang LF, Yeh CT, Ying K, Nagel DH, Collura K, Kudrna D, Currie J, Lin JK, Kim H, Angelova A, Scara G, Wissotski M, Golser W, Courtney L, Kruchowski S, Graves TA, Rock SM, Adams S: Detailed Analysis of a Contiguous 22-Mb Region of the Maize Genome. Plos Genet. 2009, 5 (11):
Wei FS, Zhang J, Zhou S, He R, Schaeffer M, Collura K, Kudrna D, Faga BP, Wissotski M, Golser W, Rock SM, Graves TA, Fulton RS, Coe E, chnable PS, Schwartz DC, Ware D, Clifton SW, Wilson RK, Wing RA: The Physical and Genetic Framework of the Maize B73 Genome. Plos Genetics. 2009, 5 (11):
Saski C, Lee SB, Fjellheim S, Guda C, Jansen RK, Luo H, Tomkins J, Rognli OA, Daniell H, Clarke JL: Complete chloroplast genome sequences of Hordeum vulgare, Sorghum bicolor and Agrostis stolonifera, and comparative analyses with other grass genomes. Theor Appl Genet. 2007, 115 (4): 571-590. 10.1007/s00122-007-0567-4.
Frijters ACJ, Zhang Z, vanDamme M, Wang GL, Ronald PC, Michelmore RW: Construction of a bacterial artificial chromosome library containing large EcoRI and HindIII genomic fragments of lettuce. Theor Appl Genet. 1997, 94 (3-4): 390-399. 10.1007/s001220050428.
Brunner S, Keller B, Feuillet C: A large rearrangement involving genes and low-copy DNA interrupts the microcollinearity between rice and barley at the Rph7 locus. Genetics. 2003, 164 (2): 673-683.
Choi DW, Rodriguez EM, Close TJ: Barley Cbf3 gene identification, expression pattern, and map location. Plant Physiol. 2002, 129 (4): 1781-1787. 10.1104/pp.003046.
Stein N, Prasad M, Scholz U, Thiel T, Zhang HN, Wolf M, Kota R, Varshney RK, Perovic D, Grosse , Graner A: A 1,000-loci transcript map of the barley genome: new anchoring points for integrative grass genomics. Theor Appl Genet. 2007, 114 (5): 823-839. 10.1007/s00122-006-0480-2.
We greatly acknowledge the technical help of Ulrike Beyer, Anne Kusserrow, Manuela Knauft and Zahra Schoaei. Many thanks to Dr. Tatjana Sretenovic for her help in the initial steps of the project. The work was financially supported by grants of the Leibniz-Society and the Federal State of Saxony-Anhalt in frame of "WGL-Pakt für Forschung und Innovation" and the German Ministry of Education and Research (FKZ0314000, GABI-FUTURE-BARLEX) to NS. BAC libraries HVVMRXALLhB, -hC and -eA were produced by financial support from the University of Adelaide, the Australian Research Council (International Science Linkages CG120174), the Grains Research and Development Corporation and the Government of South Australia.
NS conceived the project in collaboration with AG and PL. DS and RA performed the fingerprinting, barley probe hybridization and data analysis steps. BS constructed the HVVMRXALLhB and HVVMRXALLhC libraries, MA and CS constructed the HVVMRXALLeA library. PJ constructed the HVVMRXALLmA library and library HVVMRXALLrA was constucted by CCW. DF performed the hybridisation of wheat probes to BAC membranes. The manuscript was written by RA, DS, and NS. All authors read and approved the manuscript.
Daniela Schulte, Ruvini Ariyadasa contributed equally to this work.
Electronic supplementary material
About this article
- Bacterial Artificial Chromosome
- Bacterial Artificial Chromosome Clone
- Bacterial Artificial Chromosome Library
- Minimal Tiling Path
- High Information Content Fingerprinting