Open Access

Comparative analysis of a BAC contig of porcine chromosome 13q31-q32 and human chromosome 3q21-q22

  • Mario Van Poucke1,
  • David Bourry1, 2,
  • François Piumi3,
  • Marc Mattheeuws1,
  • Alex Van Zeveren1,
  • Patrick Chardon3 and
  • Luc J Peelman1Email author
BMC Genomics20056:133

DOI: 10.1186/1471-2164-6-133

Received: 10 August 2005

Accepted: 21 September 2005

Published: 21 September 2005

Abstract

Background

The gene(s) encoding the ETEC F4ab/ac receptors, involved in neonatal diarrhoea in pigs (a disease not yet described in humans), is located close to the TF locus on Sscr13. In order to reveal and characterize possible candidate genes encoding these receptors, a porcine physical map of the TF region is indispensable.

Results

A contig of 33 BAC clones, covering approximately 1.35 Mb surrounding the TF locus on Sscr13q31-q32, was built by chromosome walking. A total of 22,552 bp from the BAC contig were sequenced and compared with database sequences to identify genes, ESTs and repeat sequences, and to anchor the contig to the syntenic region of the human genome sequence (Hsap3q21-q22). The contig was further annotated based on this human/porcine comparative map, and was also anchored to the Sanger porcine framework map and the integrated map of Sscr13 by RH mapping.

Conclusion

The annotated contig, containing 10 genes and 2 ESTs, showed a complete conservation of linkage (gene order and orientation) with the human genome sequence, based on 46 anchor points. This underlines the importance of the human/porcine comparative map for the identification of porcine genes associated with genetic defects and economically important traits, and for assembly of the porcine genome sequence.

Background

Neonatal diarrhoea, often caused by ETEC F4 bacteria, is a common problem in pig production. These bacteria use their fimbriae to adhere to specific receptors on the brush borders of enterocytes of their host. This adhesion is a prerequisite for infection and promotes bacterial colonization of the small intestine. The colonizing bacteria produce enterotoxins that stimulate the secretion of water and electrolytes into the lumen of the small intestine and lead to diarrhoea and often death in neonatal pigs [1]. ETEC F4 resistance, acquired by receptor phenotype differences of the host, seems to be inherited as an autosomal recessive Mendelian trait [2], whereby the gene(s) encoding the ETEC F4ab/ac receptors have been linked to several loci on Sscr13 [37]. Based on the tight linkage of the ETEC F4ab/ac receptor loci to microsatellite markers Swr926 (Locus P) and Swc22 (Locus G) by Peelman [5], a BAC contig covering this region and containing TF was built by chromosome walking. The contig was annotated by comparing BAC sequences with sequences from nucleotide databases and by comparative mapping with the human genome sequence in order to provide a basis for the identification of the ETEC F4ab/ac gene(s) by the candidate gene approach.

Results and discussion

Construction of the BAC contig

The construction of the BAC contig was started at 2 microsatellite marker loci, Swr926 and Swc22, estimated to be 1 cM apart from each other and closely linked to the ETEC F4ab/ac receptor loci, according to the porcine genetic map of Peelman [5]. From those 2 loci, 2 subcontigs were built by chromosome walking in both directions until the gap between the 2 was filled. The resulting BAC contig, comprising 33 BAC clones, is shown in Figure 1C All 66 BAC ends were sequenced and submitted to the GenBank database as GSSs [GenBank:CG993013-CG993078]. On 4 occasions, 2 BAC clones turned out to possess the same end (5'-215D7 with 5'-409C1, 129E6-3' with 225H9-3', 5'-613G8 with 5'-1002E2, and 5'-696F10 with 240G11-3'). From 52 of the 62 unique sequences, primers were designed to construct the contig and to screen for new overlapping clones. By dividing the total number of overlaps between the BAC ends and the BAC clones by the total number of BAC ends an estimated contig depth of 3.3 was calculated. Since the average length of the BAC inserts is 135,000 bp, we have covered a region of approximately (33/3.3) × 135,000 = 1.35 Mb.
Figure 1

Comparative map of the annotated BAC contig with its syntenic region on Hsap3q21-q22. The contig is drawn in part C. Black triangles represent BAC end sequences from which primers were designed to construct the contig. The black circle represents the only internal BAC sequence from which primers were designed to construct the contig. White circles show overlaps of these BAC sequences with other BAC clones. White triangles represent BESs from which it was impossible to design primers. The triangles point towards the 3'-side of the BAC clone. Encircled triangles represent BAC ends that are also present in the Sanger framework map. Black diamonds represent microsatellite positions. Black squares represent genes annotated by PCR, whereas white squares represent genes annotated by hybridization. Annotated sequences (genes are in regular, ESTs in italic) from the BAC contig are represented on a plane map (B) and their homology with the human genome sequence is illustrated with dotted lines (A). The orientation of the human genes and finished HTGs (used to assemble the human genome sequence) are represented in (A) by arrows [10]. Repeat sequences (white rectangle = LINE, black rectangle = SINE, black bow = LTR element, white bow = DNA element) are shown on a plane map in D. The RH mapping results are shown in E.

Annotation of the BAC end sequences

A total of 22,552 bp of the BAC contig (62 unique BAC ends and 1 internal BAC fragment [GenBank:CZ692943]) were sequenced and annotated by NIX [8]. The sequences had an overall GC content of 41.48%, which is less than the 46.17% for Sscr7q found in an analogous study of Barbosa and co-workers [9].

The BESs contained 2 gene fragments (MGC3040 and TF) and 2 ESTs (CA778263 and AA461333) located on the human genome (Figure 1A–C) [10]. In 35 of the 62 BESs, homologous sequences could be found within 12 consecutive finished HTGs used to assemble the Hsap3q21-q22 region of the human genome sequence (Figure 1A–C) [10]. These homologies were studied in detail by BLAST 2 sequence comparisons of the BESs with their orthologs (based on the 35.1 latest human genome build). Repeat sequences were excluded (RepeatMasker) and only single hits were taken into account. Orthologous sequences longer than 50 bp had on average a length of 150 bp, a sequence identity of 80% and an e-value of 1e-20. Smaller fragments were only considered as orthologs if at least 2 of them were located close to each other at their expected orthologous position. An extended conservation of synteny between Sscr13 and Hsap3 was already shown by the comparative map of Van Poucke and co-workers [11], based on chromosome painting results of Goureau and co-workers [12]. But taking into account the orientation of the finished HTGs and the position of the orthologous sequences within these HTGs, a perfect comparative map could be established showing even 100% conserved linkage in this region.

Based on this comparative map the BAC contig covers approximately 1.40 Mb of the human genome (from 134.075 Mb to 135.475 Mb on Hsap3) [10], which is close to the BAC contig length calculated above. The BAC sequences also contained 17 LINEs, 10 SINEs, 3 LTR elements and 1 DNA element (Figure 1D), resulting in an average density of 0.77 LINEs/kb, 0.45 SINEs/kb, 0.14 LTR elements/kb and 0.05 DNA elements/kb. Barbosa and co-workers [9] found an average density of 0.35 LINEs/kb, 0.61 SINEs/kb and 0.17 LTR+DNA elements/kb on Sscr7q.

Comparative mapping with the human syntenic region

Based on this detailed comparative map between Hsap3q21-q22 and the BAC contig, the latter could be annotated by comparative mapping. H41, TOPBP1, TF, SRPRB, RAB6B, SLCO2A1 and RYK could be found in the contig by PCR (Figure 1A–C). Also PICA, a gene not yet described in human but located on the pig EST map of the NCBI human genome map viewer [10] between TOPBP1 and TF, was found in the contig by PCR at the orthologous region (Figure 1A–C). It showed sequence homology with the finished HTG sequence AC083905. MGC3040 and BFSP2 could be found in the contig by BAC colony hybridisation (Figure 1A–C). For MGC3040, TF and SLCO2A1 two regions of the gene were annotated in the contig. Their locations showed that those genes were organised in the same orientation as in human. All the comparative mapping results confirmed the conserved linkage (gene order and orientation) based on the sequence homologies of the BESs (Figure 1A–C).

Fifteen BAC ends are also located on the Sanger porcine framework map [13]. On average, they show 98.5% sequence identity, and are located in the same order (Figure 1C). This was expected because (1) the framework map was constructed by fingerprinting and BES alignment on the human sequence, and (2) this region shows 100% conserved linkage with the human genome. So, for this region, the Sanger map assembly, based on the assumption of conservation between both species, is correct. But because of inter- and intrachromosomal rearrangements between the human and the porcine chromosomes [11, 12], the Sanger framework map contains some errors. This underlines the importance of the chromosome walking approach for the development of an exact map.

Based on the characteristics of the genes annotated in this contig, SLCO2A1 could be a candidate gene encoding the ETEC F4ab/ac receptor. It is a single copy gene encoding the prostaglandin transporter, a 12-transmembrane organic anion cell surface transporter that is expressed in the small intestine. The presence of different mRNA transcripts suggests that several functionally distinct mRNAs may arise by alternative splicing and/or alternative promotors [14]. It is also assumed that SLCO2A1 contains several different substrate binding sites, to which binding does not always result in substrate translocation across the membrane [15].

RH mapping

During chromosome walking, 4 loci were mapped with the IMpRH panel (data are submitted to the IMpRH server [16]) in order to detect possible chromosome jumping, to estimate the remaining gap between the 2 subcontigs, and to anchor the contig to the integrated comparative map [11]. Using the IMpRH server, 2-point distances were calculated between BAC ends 409C1-3', 5'-613G8, 5'-991F11 and an internal sequence of BAC 8A9, and microsatellite markers Swr926 and Swc22, that were previously mapped on the IMpRH map (Figure 1E) [17]. Based on these distances, the contig covers a region of approximately 40 cR. Thus, 1 cR equals approximately 33.750 kb in our contig. The distance between Swr926 and Swc22 was measured as 17 cR. Because the same distance was measured as 1 cM on a linkage map of Peelman [5], the cR/cM ratio in our contig is 17. Hawken and co-workers [17] measured values for Sscr13 of 59.9 kb/cR and 30.4 cR/cM with the linkage map of Rohrer and co-workers [18].

Methods

Primer design and amplicon verification

All primers, designed with Primer3 [20], were confirmed to not generate an amplicon of the same length with bacterial DNA as a template. Primers used for RH mapping were also checked not to generate an amplicon of the same length with hamster DNA as a template. The construction of the BAC contig was started with primers amplifying porcine microsatellite markers Swr926 [GenBank:AF235467] and Swc22 [GenBank:AF225193]. During the construction of the BAC contig, new primers were designed based on the BESs [GenBank:CG993013-CG993078]. Information on those primers can be found in the corresponding GenBank files. For annotation by comparative mapping with the human genome, primers were designed based on orthologous human and/or porcine sequences. Information on new primers is presented in Table 1. These PCR products were cloned in pCRII (Invitrogen, Merelbeke, Belgium), sequenced for verification with the Thermo Sequenase Primer Cycle Sequencing Kit (Amersham Biosciences, Uppsala, Sweden) and submitted to GenBank [GenBank:AY5182650-AY5182658, DQ104835, DQ104841]. Because of sequence homology, primers for PICA were confirmed to not amplify TF. Primers for RYK were described earlier [11].
Table 1

Information on new primers for genes annotated by PCR

Gene

Forward Primer (5'- 3')

Annealing temperature

Porcine Acc.No.

Reverse Primer (5'- 3')

Amplicon size

H41

GGCAAGAGTGAAGCAAATGG

60°C

AY518265

TCAAAAACATAACCCCAGCAA

395 bp

TOPBP1

CCTGAATCTCTTTATCCACATACTT

57°C

AY518266

CATTTGATGGTGCTGACTCTT

318 bp

PICA

TGGACGCGAAGCTCTAT

59°C

U36916

TCCGAGTTACAATTCAAGATG

1.286 bp

TF (exon 2)

CCAATAAGTGCTCCAGTTTC

56°C

X12386

CCCTGATGGCTTTGATG

111 bp

SRPRB

CGCCTTCCATCCCTACCT

58°C

AY518267

AACCGCCCTTTGACTGCT

756 bp

RAB6B

CATTGGGATTGACTTCTTGTC

58°C

AY518268

GATGTAGCTGGGGATCAGG

313 bp

SLCO2A1 (exon 3)

GCCGTCCTCATCATCTTTGT

60°C

DQ104835

GAAGTGCGGGAGGGTGA

117 bp

SLCO2A1 (exon 9)

CCTTGGGGATGCTGTTTG

60°C

DQ104841

TGGAGATGGTGATGATGGTG

96 bp

BAC screening and contig building by chromosome walking

The INRA porcine BAC library was screened by PCR [21]. Approximately 20 μg BAC DNA was purified from a 100 ml culture of the isolated BAC clones by using the Qiagen Plasmid Midi Kit (Westburg, Leusden, The Netherlands). The primers used to isolate the BAC clones were used to amplify the same amplicon on 20 ng BAC DNA for verification. Both ends of the isolated BAC clones were sequenced with 5 μg BAC DNA as template by using the Thermo Sequenase Primer Cycle Sequencing Kit (Amersham Biosciences, Uppsala, Sweden). Primers based on those BESs were used to construct the contig by defining overlaps with all other BAC clones. Primers at both ends of the growing subcontigs were used to screen the BAC library for new overlapping clones until the gap between Swr926 and Swc22 was filled.

Annotation

Annotation of the contig was performed by analyzing all BAC sequences on the NIX server (allowing integration and display of many gene identification programs, such as BLAST against EMBL, EST, STS and GSS databases [8], but not operational anymore), and by comparative mapping using PCR and BAC colony hybridization. These and similar sequence comparisons such as BLAST 2, can also be performed via the NCBI BLAST server [22]. Gene symbols, names and positions were based on the NCBI Gene Entrez [23] and NCBI Map viewer [10] with the latter also used for the identification of the human HTGs.

BAC colony hybridization

For annotation purposes by comparative mapping with the human genome, 2 IMAGE clones (3163990 [GenBank:BC000568] at the MGC3040 locus and 2472940 [GenBank:AI954686] at the BFSP2 locus), located in the human syntenic region, were ordered (MRC geneservice, Cambridge, UK). Inserts of these clones were used as radiolabeled probes for BAC colony hybridization.

RH mapping

During chromosome walking, 4 loci were mapped on the IMpRH panel [24] in order to detect possible chromosome jumping, to estimate the remaining gap between the 2 subcontigs, and to anchor the contig to the integrated comparative map [11]. Swr926 and Swc22 [17] and TF [25] were already located on the IMpRH map.

Conclusion

A porcine BAC contig containing 33 BAC clones and covering approximately 1.35 Mb of Sscr13q31-q32 was constructed. The annotated contig, containing 10 genes and 2 ESTs, showed a complete conservation of linkage with Hsap3q21-q22, based on 46 anchor points, providing further evidence for conservation of linkage on a fine scale. This underlines the importance of the comparative mapping strategy between human and pig, not only in the search for genes in pig but also as a basis for the assembly of the porcine genome [13, 19]. The contig also contains 15 anchor points with the Sanger porcine framework map [13], 4 anchor points (Swr926, Swc22, TF and RYK) with the integrated map of Sscr13 [11] and 2 (Swr926, Swc22) with the porcine Map Viewer [10].

List of abbreviations

BAC: 

bacterial artificial chromosome

BES: 

BAC end sequence

BFSP2: 

beaded filament structural protein 2 phakinin

bp: 

basepairs

cM: 

centiMorgan

cR: 

centiRay

EMBL: 

European Molecular Biology Laboratory

EST: 

expressed sequence tag

ETEC: 

enterotoxigenic Escherichia coli

GSS: 

genomic survey sequence

H41: 

hypothetical protein H41

Hsap: 

Homo sapiens

HTGs: 

high throughput genomic sequences

IMpRH: 

INRA-University of Minnesota porcine Radiation Hybrid

kb: 

kilobasepairs

LINE: 

long interspersed nuclear elements

LTR: 

long terminal repeat

Mb: 

megabasepairs

MGC3040: 

hypothetical protein MGC3040

PICA: 

porcine inhibitor of carbonic anhydrase

RAB6B RAB6B: 

member RAS oncogene family

RH: 

radiation hybrid

RYK RYK: 

receptor-like tyrosine kinase

SINE: 

short interspersed nuclear elements

SLCO2A1: 

solute carrier organic anion transporter family member 2A1

SRPRB: 

signal recognition particle receptor B subunit

Sscr: 

Sus scrofa

STS: 

Sequence Tag Site

TF: 

transferrin

TOPBP1: 

topoisomerase (DNA) II binding protein 1

Declarations

Acknowledgements

The authors wish to thank Dominique Vander Donckt and Linda Impe for excellent technical assistance. This work was supported by the Ministry of Trade and Agriculture Brussels (grant No. 5687A) and co-financed by Gentec and Rattlerow Seghers. We also thank Drs. Martine Yerle and Denis Milan (INRA, Castanet-Tolosan, France) for providing the IMpRH panel.

Authors’ Affiliations

(1)
Department of Animal Genetics and Breeding, Faculty of Veterinary Medicine, Ghent University
(2)
Department of Organic Chemistry, Faculty of Sciences, Ghent University
(3)
Laboratoire de Radiobiologie et d'Etude du Génome, UMR INRA-CEA

References

  1. Nagy B, Fekete PZ: Enterotoxigenic Escherichia coli (ETEC) in farm animals. Vet Res. 1999, 30: 259-284.PubMedGoogle Scholar
  2. Gibbons RA, Sellwood R, Burrows MR, Hunter PA: Inheritance of resistance to neonatal E. coli diarrhoea in the pig: examination of the genetic system. Theor Appl Genet. 1977, 51: 65-70. 10.1007/BF00299479.PubMedView ArticleGoogle Scholar
  3. Guerin G, Duval-Iflah Y, Bonneau M, Bertaud M, Guillaume P, Ollivier L: Evidence for linkage between K88ab, K88ac intestinal receptors to Escherichia coli and transferrin loci in pigs. Anim Genet. 1993, 24: 393-396.PubMedView ArticleGoogle Scholar
  4. Edfors-Lilja I, Gustafsson U, Duval-Iflah Y, Ellergren H, Johansson M, Juneja RK, Marklund L, Andersson L: The porcine intestinal receptor for Escherichia coli K88ab, K88ac: regional localization on chromosome 13 and influence of IgG response to the K88 antigen. Anim Genet. 1995, 26: 237-242.PubMedView ArticleGoogle Scholar
  5. Peelman LJ: Genetic investigation of the resistance mechanisms of the pig against diarrhea caused by E. coli. Verh K Acad Geneeskd Belg. 1999, 61: 489-515.PubMedGoogle Scholar
  6. Python P, Jorg H, Neuenschwander S, Hagger C, Stricker C, Burgi E, Bertschinger HU, Stranzinger G, Vogeli P: Fine-mapping of the intestinal receptor locus for enterotoxigenic Escherichia coli F4ac on porcine chromosome 13. Anim Genet. 2002, 33: 441-447. 10.1046/j.1365-2052.2002.00915.x.PubMedView ArticleGoogle Scholar
  7. Jorgensen CB, Cirera S, Anderson SI, Archibald AL, Raudsepp T, Chowdhary B, Edfors-Lilja I, Andersson L, Fredholm M: Linkage and comparative mapping of the locus controlling susceptibility towards E. COLI F4ab/ac diarrhoea in pigs. Cytogenet Genome Res. 2003, 102: 157-162. 10.1159/000075742.PubMedView ArticleGoogle Scholar
  8. NIX. [http://www.hgmp.mrc.ac.uk/NIX/]
  9. Barbosa A, Demeure O, Urien C, Milan D, Chardon P, Renard C: A physical map of large segments of pig chromosome 7q11-q14: comparative analysis with human chromosome 6p21. Mamm Genome. 2004, 15: 982-995.PubMedView ArticleGoogle Scholar
  10. NCBI Map viewer. [http://www.ncbi.nlm.nih.gov/mapview/]
  11. Van Poucke M, Yerle M, Chardon P, Jacobs K, Genet C, Mattheeuws M, Van Zeveren A, Peelman LJ: A refined comparative map between porcine chromosome 13 and human chromosome 3. Cytogenet Genome Res. 2003, 102: 133-138. 10.1159/000075738.PubMedView ArticleGoogle Scholar
  12. Goureau A, Yerle M, Schmitz A, Riquet J, Milan D, Pinton P, Frelat G, Gellin J: Human and porcine correspondence of chromosome segments using bidirectional chromosome painting. Genomics. 1996, 36: 252-262. 10.1006/geno.1996.0460.PubMedView ArticleGoogle Scholar
  13. Porcine Genome Physical Mapping Project. [http://www.sanger.ac.uk/Projects/S_scrofa/]
  14. Lu R, Kanai N, Bao Y, Schuster VL: Cloning, in vitro expression, and tissue distribution of a human prostaglandin transporter cDNA(hPGT). J Clin Invest. 1996, 98: 1142-1149.PubMedPubMed CentralView ArticleGoogle Scholar
  15. Pucci ML, Bao Y, Chan B, Itoh S, Lu R, Copeland NG, Gilbert DJ, Jenkins NA, Schuster VL: Cloning of mouse prostaglandin transporter PGT cDNA: species-specific substrate affinities. Am J Physiol. 1999, 277: R734-R741.PubMedGoogle Scholar
  16. Milan D, Hawken R, Cabau C, Leroux S, Genet C, Lahbib Y, Tosser G, Robic A, Hatey F, Alexander L, Beattie C, Schook L, Yerle M, Gellin J: IMpRH server: an RH mapping server available on the Web. Bioinformatics. 2000, 16: 558-559. 10.1093/bioinformatics/16.6.558.PubMedView ArticleGoogle Scholar
  17. Hawken RJ, Murtaugh J, Flickinger GH, Yerle M, Robic A, Milan D, Gellin J, Beattie CW, Schook LB, Alexander LJ: A first-generation porcine whole-genome radiation hybrid map. Mamm Genome. 1999, 10: 824-830. 10.1007/s003359901097.PubMedView ArticleGoogle Scholar
  18. Rohrer GA, Alexander LJ, Hu Z, Smith TP, Keele JW, Beattie CW: A comprehensive map of the porcine genome. Genome Res. 1996, 6: 371-391.PubMedView ArticleGoogle Scholar
  19. Wernersson R, Schierup MH, Jorgensen FG, Gorodkin J, Panitz F, Staerfeldt HH, Christensen OF, Mailund T, Hornshoj H, Klein A, Wang J, Liu B, Hu S, Dong W, Li W, Wong GK, Yu J, Wang J, Bendixen C, Fredholm M, Brunak S, Yang H, Bolund L: Pigs in sequence space: a 0.66X coverage pig genome survey based on shotgun sequencing. BMC Genomics. 2005, 6: 70-10.1186/1471-2164-6-70.PubMedPubMed CentralView ArticleGoogle Scholar
  20. Rozen S, Skaletsky H: Primer3 on the WWW for general users and for biologist programmers. Methods Mol Biol. 2000, 132: 365-386.PubMedGoogle Scholar
  21. Rogel-Gaillard C, Bourgeaux N, Billault A, Vaiman M, Chardon P: Construction of a swine BAC library: application to the characterization and mapping of porcine type C endoviral elements. Cytogenet Cell Genet. 1999, 85: 205-211. 10.1159/000015294.PubMedView ArticleGoogle Scholar
  22. NCBI BLAST. [http://www.ncbi.nih.gov/BLAST/]
  23. NCBI Entrez Gene. [http://www.ncbi.nih.gov/entrez/query.fcgi?db=gene]
  24. Yerle M, Pinton P, Robic A, Alfonso A, Palvadeau Y, Delcros C, Hawken R, Alexander L, Beattie LB, Milan D, Gellin J: Construction of a whole genome radiation hybrid panel for high-resolution gene mapping in pigs. Cytogenet Cell Genet. 1998, 82: 182-188. 10.1159/000015095.PubMedView ArticleGoogle Scholar
  25. Van Poucke M, Yerle M, Tuggle C, Piumi F, Genet C, Van Zeveren A, Peelman LJ: Integration of porcine chromosome 13 maps. Cytogenet Cell Genet. 2001, 93: 297-303. 10.1159/000057001.PubMedView ArticleGoogle Scholar

Copyright

© Van Poucke et al; licensee BioMed Central Ltd. 2005

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.