Giant panda BAC library construction and assembly of a 650-kb contig spanning major histocompatibility complex class II region
- Chang-Jun Zeng†1, 2,
- Hui-Juan Pan†1, 2,
- Shao-Bin Gong1, 2,
- Jian-Qiu Yu1, 2,
- Qiu-Hong Wan1, 2 and
- Sheng-Guo Fang1, 2Email author
© Zeng et al; licensee BioMed Central Ltd. 2007
Received: 08 March 2007
Accepted: 08 September 2007
Published: 08 September 2007
Giant panda is rare and endangered species endemic to China. The low rates of reproductive success and infectious disease resistance have severely hampered the development of captive and wild populations of the giant panda. The major histocompatibility complex (MHC) plays important roles in immune response and reproductive system such as mate choice and mother-fetus bio-compatibility. It is thus essential to understand genetic details of the giant panda MHC. Construction of a bacterial artificial chromosome (BAC) library will provide a new tool for panda genome physical mapping and thus facilitate understanding of panda MHC genes.
A giant panda BAC library consisting of 205,800 clones has been constructed. The average insert size was calculated to be 97 kb based on the examination of 174 randomly selected clones, indicating that the giant panda library contained 6.8-fold genome equivalents. Screening of the library with 16 giant panda PCR primer pairs revealed 6.4 positive clones per locus, in good agreement with an expected 6.8-fold genomic coverage of the library. Based on this BAC library, we constructed a contig map of the giant panda MHC class II region from BTNL2 to DAXX spanning about 650 kb by a three-step method: (1) PCR-based screening of the BAC library with primers from homologous MHC class II gene loci, end sequences and BAC clone shotgun sequences, (2) DNA sequencing validation of positive clones, and (3) restriction digest fingerprinting verification of inter-clone overlapping.
The identifications of genes and genomic regions of interest are greatly favored by the availability of this giant panda BAC library. The giant panda BAC library thus provides a useful platform for physical mapping, genome sequencing or complex analysis of targeted genomic regions. The 650 kb sequence-ready BAC contig map of the giant panda MHC class II region from BTNL2 to DAXX, verified by the three-step method, offers a powerful tool for further studies on the giant panda MHC class II genes.
The giant panda (Ailuropoda melanoleuca), as one of most widely recognized conservation icons in the world, was once distributed over southern and eastern China and extended to northern Burma and northern Vietnam. Unfortunately, habitat loss and fragmentation, low genetic diversity and small population size all lead to current endangered status of this rare species. The estimated 1100 giant pandas survive only in a fraction of their historical range, six completely isolated mountain ranges [1, 2]. In order to protect this rare species, considerable efforts were made in different research fields. However, the panda genome still remains unknown. One goal of this study is to construct a bacterial artificial chromosome (BAC) library for the giant panda in order to provide a new tool for panda genome physical mapping.
Genes of major histocompatibility complex (MHC) form one of the most important genetic systems for infectious disease resistance in vertebrates . MHC-encoded genes have been demonstrated to be associated with susceptibility to numerous infectious , take part in mate choice of vertebrates [4, 5] and control the compatibility between mother and fetus during pregnancy , making the MHC a research field of considerable biological interest.
The giant panda MHC remains relatively little known. Furthermore, the limited references were published by our research group: (1) the giant panda MHC was located on chromosome 9q by fluorescence in situ hybridization ; (2) the levels of genetic variation for the MHC class II DRB and DQA loci in the giant panda were low, only 7 DRB alleles and 6 DQA ones survived in current populations [8, 9]. On the other hand, it has been reported that the giant pandas are particularly susceptible to infectious disease and parasites, such as 100% for ascariasis and 20% for ticks, resulting in a 66.67% mortality rate from ascariasis [10–12]. As a result, exploring genetic characteristics of multiple MHC loci in the giant panda has become more and more essential. However, the number of MHC genes in the giant panda keeps unknown all the time.
Based on an increasing number of published data from human, Horton et al. divided the human MHC into five physically adjacent subregions: extended class I (from HIST1H2AA to MOG), classical class I (from C6orf40 to MICB), class III (from PPIP9 to NOTCH4), classical class II (from C6orf10 to HCG24), and extended class II (from COL11A2 to RPL12P1) . The HLA class II cluster comprises the classical class II genes (HLA-DR, -DQ and -DP) and the non-classcial class II gene (HLA-DM and -DO) . Based on multiple suits of MHC data from different mammals, scientists have found that the mammalian MHC classical class II subregion generally contains DR, DQ, DP, DM and DO genes and the organization of the classical class II (BTNL2 ~ DR ~ DQ ~ DOB ~ DM ~ DOA ~ DP) is relatively conserved [14–18]. Additionally, some genes within extended class II subregion, such as COL11A2 and DAXX, are strongly conserved in vertebrates from bony fish to human in the evolutionary spectrum . Due to complete lack of genomic and mapping resources, the MHC studies for the giant pandas become more tedious and difficult. As a result, another goal of this study is to construct a contig spanning the whole classical class II and part of the extended class II regions (i.e. from BTNL2 to DAXX) based on the giant panda BAC library.
Results and discussion
Library screening results based on PCR amplification of 16 pairs of primers*
Primer sequences (5'→3')
PCR product size (bp)
Number of positive clones
MHC class III
MHC class II
MHC class II
MHC class II
MHC class I
MHC class III
Compared the giant panda BAC library with other mammalian BAC libraries [24–27], the average insert size of the panda BAC library was relatively small. In this study, we separated the partial digestion HMW DNA with two size-fractionated methods as described by Osoegawa et al.  and further removed the trapped small fragment by concentration with modification procedures. In general, difficulties in cloning are encountered when attempting to increase the size of the cloned fragments. Removing small restriction fragments is vital for constructing a high-quality BAC genomic library. Although the small fragments could be eliminated more efficiently by PFGE, small DNA fragments are subject to remaining trapped in or co-migrated with the desirable-size fragments . Hence, the primary reasons of large DNA molecules unable to be cloned into BAC vectors are probably due to damage to larger DNA molecules during the process of preparation and purification and subsequent preferential cloning of smaller DNA molecules .
In order to construct a BAC clone-based contig map of giant panda MHC class II region between BTNL2 and DAXX loci, we designed ten sets of MHC primers based on homologous class II MHC genes [15, 16, 31] and partial MHC sequences of giant pandas (AY973813-16), which were from BTNL2, DRA, DRB, DQA, DOB, LMP2, DMB, DMA, COL11A2, and DAXX genes, respectively. All the positive BAC clones were end-sequenced for designing primers but some ones failed to implement library screening. Two BAC clones (692B2 and 1262B6) were subject to shotgun sequencing to recruit new end sequences. All the sequences were analyzed first using RepeatMasker  in order to avoid that repetitive elements were designed as primers, which is of great importance for subsequent library screening. Similarly, only gene-specific primerpairs could be employed to verify overlapping between BAC clones. In the present study, the ten suits of gene-located primers were all from unique regions as shown in mammals such as BTNL2, DOB, LMP2, DMB, DMA, COL11A2 and DAXX, or from single loci as validated by population investigation like DRA [unpublished data], DRB  and DQA . Therefore, these specific primers will help to assemble the BAC contig map of MHC class II region without confusing.
Primer list used for constructing the final contig with minimum tiling path
Primer sequences (5'→3')
Comparisons of the MHC class II region of human HLA with dog DLA and cat FLA showed that the region from BTNL2 to DAXX was relatively conserved [15, 16, 31], having a similar organization: BTNL2 ~ DR ~ DQ ~ DOB ~ DM ~ DOA ~ DP ~ COL11A2 ~ DAXX. Nonetheless, both DLA and FLA lost some genes and shrank the DP genes into small pseudogenes [15, 31], resulting in the regions between BTNL2 and DAXX of DLA and FLA were shortened from 928 kb of HLA to 669 kb and 675 kb (data from GenBank), respectively. The giant panda MHC class II region presented not only arrangement characteristic of mammalian MHC class II loci; containing BTNL2, DR, DQ, DO, DM, COL11A2, and DAXX (Figure 3), but also shortening feature of carnivore MHC class II; being approximately 650 kb long. Worthy of pointing out, we designed multiple sets of primers for both DPA and DPB based on the homologous counterparts of human, dog and cat but all failed to amplify the DP genes (DPA and DPB), suggesting that the giant panda probably possesses highly variable DP genes (such as shrunken ones like those in the dog and cat) or completely lacks DP region. For example, cattle, goat and sheep possess ruminant-specific DY (DYA and DYB) instead of DP (DPA and DPB) [31, 32]. As a consequence, the structure, organization, amount, and exact order of giant panda MHC class II genes will be unavailable until the contig is sequenced finally.
The present study reported the construction and characterization of a 6.8-fold giant panda genomic BAC library with an average insert size of 97 kb. The library has been demonstrated to be of good quality by the isolation of multiple BAC clones containing 16 known genes. The giant panda BAC library thus provides a useful platform for physical mapping, genome sequencing or complex analysis of targeted genomic regions. The giant panda MHC class II region was relatively conserved with the counterparts of human, dog, cat and other mammalian MHC class II region. The 650 kb sequence-ready BAC contig map of the giant panda MHC class II region from BTNL2 to DAXX offers a powerful tool for further studies on the giant panda MHC class II genes. Consequently, we hope that the research works reported here could accelerate different aspects of giant panda studies in order to protect this rare species more effectively.
The BAC library was constructed following a previous protocol  using the copy-control pCC1BAC vector (Epicentre, Madison, USA). Transformation of the ligation products was performed using TransforMax EPI300 Electrocompetent E. coli cells (Epicentre, Madison, USA). A gene pulser II apparatus (Bio-Rad, Hercules, USA) was used and the applied conditions were as follows: voltage 0.9–1.7 kV, resistance 100 ohms, and impedance 25 μF for a 2.5 ms pulse in a 0.1 cm disposable cuvette (Bio-Rad, Hercules, USA). Note that difference in field strength was used for eventually enriching the library. The giant panda BAC library was arrayed into 43 superpools (49 × 96 clones) and screened using a 4D-PCR method .
Preparation high-molecular-weight (HMW) DNA
Whole blood was collected from giant pandas and stored in heparinized sterilized tubes. The lymphocytes cells were harvested by centrifugation and resuspended in ice-cold phosphate-buffered saline (PBS). An equal volume of liquefied (50°C) 2% certified low melt agarose (Bio-Rad, Hercules, USA) was mixed with the cells suspension (~1 × 108 cells/ml), and the whole mixture was poured into a disposable plug mold (Bio-Rad, Hercules, USA). The following treatments were conducted as described by Osoegawa et al. . The DNA plugs were stored in 0.5 M EDTA at 4°C for use.
Partial restriction digestion of the giant panda HMW DNA with Hind III
Five plugs were equilibrated twice at 4°C with sterilized 0.5 × TE buffer (pH 8.0), each for 1 hour. Then each plug was incubated with 400 μl Hind III (TaKaRa) reaction buffer (1 × M buffer, 100 μg/ml BSA, 4 mM spermidine), on ice for 30 minutes. Subsequently, 3.6 units of Hind III per DNA plug was added and incubated on ice for 20 minutes to diffuse completely into plug. Partial restriction digestion was carried out by incubating the reaction mixture in a 37°C water bath for 20 minutes. The reaction was stopped by the addition of 1/10 volume of 0.5 M EDTA, pH 8.0. Partially digested giant panda DNA was separated according to a previous protocol  with some modifications. Size-fractionated DNA with 150 to 300 kb size range were further concentrated at 4 V/cm, 5 second pulse time at 12°C for 10 hours and thus the trapped small fragments were removed. The sliced agarose plugs containing DNA fragments were placed into dialysis membranes (Spectrum laboratories, Rancho Dominguez, USA) and large DNA molecules were retrieved by electroelution under the condition of 6 V/cm with 30 second pulse time at 12°C for 3 hours.
Insert size analysis
To analyze the size of insert DNA fragments in this library, 174 randomly selected clones were grown in 10 ml LB containing 12.5 μg/ml chloramphenicol, and the BAC DNA was isolated by alkaline lysis . To avoid contamination of E. coli genomic DNA, the DNA was digested with plasmid-safe ATP-dependent DNase (Epicentre, Madison, USA) at 37°C for 2 hours. The purified DNA was digested overnight at 37°C with 25 units of Not I. Pulse field gel electrophoresis (PFGE) was performed at 14°C for 16 hours using 6.0 V/cm with 1 – 40 second switch time. Mid-range PFG marker (New England Biolabs) was used as DNA size marker. The gel was stained with ethidium bromide and photographed.
For evaluating and testing the coverage rate of this BAC genomic library, PCR-based library screening was performed  using 16 giant panda PCR primers (Table 1). For each superpool, seven 1D-PCR and seven 2D-PCR reactions were carried out first to ascertain which 96-well plate contains the target sequence. Then, eight 3D-PCR and twelve 4D-PCR reactions were conducted to find out which BAC clone the target sequence was located in. Finally, each positive clone pre-identified in 4D-PCR was verified individually by the second PCR using the same primer set to avoid false positive results during the process of screening superpools. All the PCR products of positive clones were sequenced three times.
Routine-, shotgun- and end-sequencing
Conventional PCR products were ligated into pMD18-T vector (Takara) and transformed with DH5α competent E. coli cells (Takara). The shotgun library was constructed from BAC DNA and mini-preparation was conducted according to previous protocols . For end sequencing, the positive clones were cultured in LB plus 12.5 μg/ml chloramphenicol and BAC DNA were extracted with Axyprep plasmid miniprep kit (Axygen, CA, USA). End sequencing of the BAC clones was performed on an ABI 377 automated DNA sequencer using pCC1BAC vector-derived sequencing primers: T7 forward (5'-TAA TAC GAC TCA CTA TAG-3') and pCC1/pEpiFOS RP-2 reverse (5'-TAC GCC AAG CTA TTT AGG TGA GA-3'). Routine- and end-sequencing were all performed on a LI-COR 4200 DNA sequencer with sequiTherm EXCEL II DNA sequencing kit (Epicentre Madison, USA).
Primer design based on homologous, end- and shotgun-sequences
Locus-specific primers were based on homologous sequences from GenBank (Table 1 and 2). The end sequences were analyzed by RepeatMasker  to exclude repetitive elements. The end-primers were designed from the BAC end sequences generated using the T7 and RP-2 vector primers. However, because (1) some ends failed to obtain nucleotide sequences; (2) some end sequences were unable to design suitable primerpairs; (3) some end primers screened BAC library unsuccessfully, two BAC clones (692B2 and 1262B6) underwent shotgun sequencing to design new primers adjacent to ends. All the end primers were utilized to amplify the BAC clones and construct the contig by defining overlapping with other BAC clones. End primers of the growing sub-contigs were designed again with primer premier software (version 5.0) and used to further screen the positive BACs to identify new overlapping clones until the gap was filled.
DNA fingerprinting and contig assembly
The BAC DNA, extracted from MHC positive clones using QIAGEN Large Construct Kit (Qiagen, CA, USA) that can avoid E. coli genomic contamination, was fingerprinted by restriction enzyme digestion with Hind III and EcoR I. DNA fragments were separated on 1% agarose gel with 2 V/cm for 18 hours in 1 × TAE buffer. Restriction fragment patterns were visually compared to ascertain the extent of overlapping between adjacent clones. Finally, 11 BAC clones with minimized overlap were chosen and manually assembled into a contig with minimal tiling path.
Special thanks go to three anonymous reviewers for comments on the earlier version of the manuscript. We thank Dr. Luping He and Qing Ye for their kindly help in preparing and arraying the library. We also give thanks to Chengdu research base of giant panda breeding, Sichuan province, P.R. China. This work was supported by grants from the National Basic Research Program of China (973 program) (No.2007CB411600), from the National Science Fund for Distinguished Young Scholars (No. 30325009), and from the office for Giant Panda of China State Forestry Administration (No. WH0418).
- Channell R, Lomolino MV: Dynamic biogeography and conservation of endangered species. Nature. 2000, 403: 84-86. 10.1038/47487.PubMedView Article
- Wan QH, Fang SG, Wu H, Fujihara T: Genetic differentiation and subspecies development of the giant panda as revealed by DNA fingerprinting. Electrophoresis. 2003, 24: 1353-1359. 10.1002/elps.200390174.PubMedView Article
- Hill AV: The immunogenetics of human infectious diseases. Annual Review of Immunology. 1998, 16: 593-617. 10.1146/annurev.immunol.16.1.593.PubMedView Article
- Grob B, Knapp LA, Martin RD, Anzenberger G: The major histocompatibility complex and mate choice: Inbreeding avoidance and selection of good genes. Experimental and Clinical Immunogenetics. 1998, 15: 119-129. 10.1159/000019063.PubMedView Article
- Penn DJ, Potts WK: The evolution of mating preferences and major histocompatibility complex genes. American Naturalist. 1999, 153: 145-164. 10.1086/303166.View Article
- Ober C: MHC class II compatibility in aborted fetuses and term infants of couples with recurrent spontaneous abortions. Journal of Reproductive Immunology. 1993, 25: 195-207. 10.1016/0165-0378(93)90063-N.PubMedView Article
- Zeng CJ, Yu JQ, Pan HJ, Wan QH, Fang SG: Assignment of giant panda MHC class II gene cluster to chromosome 9q by fluorescence in situ hybridization. Cytogenetic and Genome Research. 2005, 109: 534H-10.1159/000084222.View Article
- Wan QH, Zhu L, Wu H, Fang SG: Major histocompatibility complex class II variation in the giant panda (Ailuropoda melanoleuca). Molecular Ecology. 2006, 15 (9): 2441-2450. 10.1111/j.1365-294X.2006.02966.x.PubMedView Article
- Zhu L, Ruan XD, Ge YF, Wan QH, Fang SG: Low major histocompatibility complex class II DQA diversity in the Giant Panda (Ailuropoda melanoleuca). 2007, 8: 29-
- Feng WH, Wang RL, Zhong SM, Ye ZY, Cui ZX, Zeng JH: Analysis on the dead cause of the anatomical carcass of giant panda (Ailuropoda Melanoleuca). A study on breeding and diseases of the giant panda. Edited by: Feng WH, Zhang AJ. 2001, Sichuan scientific & technical publishers, Chengdu, China, 244-248.
- Ye ZY: The control of the diseases of giant panda in field: report of 50 cases. A study on breeding and diseases of the giant panda. Edited by: Feng WH, Zhang AJ. 2001, Sichuan scientific & technical publishers, Chengdu, China, 313-315.
- Ye ZY, Li YS, Wang Q, Zhang YF, Hu HG: Ectoparasites disease of the giant panda. A study on breeding and diseases of the giant panda. Edited by: Feng WH, Zhang AJ. 2001, Sichuan scientific & technical publishers, Chengdu, China, 313-315.
- Horton R, Wilming L, Rand V, Lovering RC, Bruford EA, Khodiyar VK, Lush MJ, Povey S, Talbot CC, Wright MW, Wain HM, Trowsdale J, Ziegler A, Beck S: Gene map of the extended human MHC. Nature Reviews Genetics. 2004, 5: 889-899. 10.1038/nrg1489.PubMedView Article
- Kulski JK, Shiina T, Anzai T, Kohara S, Inoko H: Comparative genomic analysis of the MHC: the evolution of class I duplication blocks, diversity and complexity from shark to man. Immunological Reviews. 2002, 190: 95-122. 10.1034/j.1600-065X.2002.19008.x.PubMedView Article
- Yuhki N, Beck T, Stephens RM, Nishigaki Y, Newmann K, O'Brien SJ: Comparative genome organization of human, murine, and feline MHC class II region. Genome Research. 2003, 13: 1169-1179. 10.1101/gr.976103.PubMed CentralPubMedView Article
- The MHC sequencing consortium: Complete sequences and gene map of a human major histocompatibility complex. Nature. 1999, 401: 921-923. 10.1038/44853.View Article
- Kumánovics A, Takada T, Lindahl KF: Genomic organization of the mammalian MHC. Annual Review of Immunology. 2003, 21: 629-657. 10.1146/annurev.immunol.21.090501.080116.PubMedView Article
- Wright H, Ballingall KT: Mapping and characterization of the DQ subregion of the ovine MHC. Animal Genetics. 1994, 25: 243-249.PubMedView Article
- Beck S, Trowsdale J: Sequence organization of the class II region of human MHC. Immunological Reviews. 1999, 167: 201-210. 10.1111/j.1600-065X.1999.tb01393.x.PubMedView Article
- Animal Genome Size Database. [http://www.genomesize.com]
- Asakawa S, Abe I, Kudoh Y, Kishi N, Wang Y, Kubota R, Kudoh J, Kawasaki K, Minoshima S, Shimizu N: Human BAC library: construction and rapid screening. Gene. 1997, 191: 69-79. 10.1016/S0378-1119(97)00044-9.PubMedView Article
- Zhang YP, Wang W, Su B: Microsatellite DNAs and kinship indentification of giant panda. Zoological Research. 1995, 16: 301-306. [in Chinese]
- Lü Z, Johnson WE, Menotti-raymond M, Yuhki N, Martenson JS, Mainka S, Huang SQ, Zheng ZH, Li GH, Pan WS, Mao XR, O'Brien SJ: Patterns of genetic diversity in remaining giant panda populations. Conservation Biology. 2001, 15: 1596-1607. 10.1046/j.1523-1739.2001.00086.x.View Article
- Suzuki K, Asakawa S, Iida M, Shimanuki S, Fujishima N, Hiraiwa H, Murakami Y, Shimizu N, Yasue H: Construction and evaluation of a porcine bacterial artificial chromosome library. Animal Genetics. 2000, 31: 8-12. 10.1046/j.1365-2052.2000.00588.x.PubMedView Article
- Rogel-Gaillard C, Piumi F, Billault A, Bourgeaux N, Save JC, Urien C, Salmon J, Chardon : Construction of a rabbit bacterial artificial chromosome (BAC) library: application to the mapping of the major histocompatibility complex to position 12q1.1. Mammalian Genome. 2001, 12: 253-255. 10.1007/s003350010260.PubMedView Article
- Liu HB, Liu K, Wang JF, Ma RZ: A BAC clone-based physical map of ovine major histocompatibility complex. Genomics. 2006, 88: 88-95. 10.1016/j.ygeno.2006.02.006.PubMedView Article
- Liu W, Zhao YH, Liu ZL, Zhang Y, Lian ZX, Li N: Construction of a 7-fold BAC library and cytogenetic mapping of 10 genes in the giant panda (Ailuropoda melanoleuca). BMC Genomics. 2006, 7: 294-10.1186/1471-2164-7-294.PubMed CentralPubMedView Article
- Osoegawa K, de Jong PJ, Frengen E, Ioannou PA: Construction of bacterial artificial chromosome (BAC/PAC) libraries. Current protocols in molecular biology on CD-ROM. Edited by: Ausubel FM, Kingston RE et al. 2001, John Wiley & sons, Inc, unit 5.9
- Cheng YL, Mancino V, Birren B: Transformation of E. coli with large DNA molecules by electroporation. Nucleic Acids Research. 1995, 23: 1990-1996. 10.1093/nar/23.11.1990.View Article
- Birren B, Green ED, Klapholz S, Myers RM, Riethman H, Roskams J: Bacterial Artificial Chromosomes. Genome Analysis: A laboratory manual. Vol. 3 Cloning Systems. Edited by: Birren et al. 1997, Cold Spring Harbor Laboratory Press, 242-295.
- Debenham SL, Hart EA, Ashurst JL, Howe KL, Quail MA, Ollier WER, Binns MM: Genomic sequence of the class II region of the canine MHC: comparison with the MHC of other mammalian species. Genomics. 2005, 85: 48-59. 10.1016/j.ygeno.2004.09.009.PubMedView Article
- RepeatMasker web server. [http://www.repeatmasker.org/cgi-bin/WEBRepeatMasker]
- Hess M, Goldammer T, Gelhaus A, Ried K, Rappold G, Eggen A, Bishop MD, Schwerin M, Horstmann RD: Physical assignment of the bovine MHC class IIa and class IIb genes. Cytogenetics and Cell Genetics. 1999, 85: 244-247. 10.1159/000015302.PubMedView Article
- Ballingall K, MacHugh N, Taracha E, Mertens B, McKeever D: Transcription of the unique ruminant class II major histocompatibility complex-DYA and DIB genes in dendritic cells. European Journal of Immunology. 2001, 31: 82-86. 10.1002/1521-4141(200101)31:1<82::AID-IMMU82>3.0.CO;2-X.PubMedView Article
- Sambrook J, Russell DW: Molecular Cloning: A Laboratory Manual. 2001, New York: Cold Spring Harbor Laboratory Press, 3
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.