Skip to main content

Genomes of Helicobacter pylori from native Peruvians suggest admixture of ancestral and modern lineages and reveal a western type cag-pathogenicity island



Helicobacter pylori is presumed to be co-evolved with its human host and is a highly diverse gastric pathogen at genetic levels. Ancient origins of H. pylori in the New World are still debatable. It is not clear how different waves of human migrations in South America contributed to the evolution of strain diversity of H. pylori. The objective of our 'phylogeographic' study was to gain fresh insights into these issues through mapping genetic origins of H. pylori of native Peruvians (of Amerindian ancestry) and their genomic comparison with isolates from Spain, and Japan.


For this purpose, we attempted to dissect genetic identity of strains by fluorescent amplified fragment length polymorphism (FAFLP) analysis, multilocus sequence typing (MLST) of the 7 housekeeping genes (atp A, efp, ure I, ppa, mut Y, trp C, yph C) and the sequence analyses of the bab B adhesin and oip A genes. The whole cag pathogenicity-island (cag PAI) from these strains was analyzed using PCR and the geographic type of cag A phosphorylation motif EPIYA was determined by gene sequencing. We observed that while European genotype (hp-Europe) predominates in native Peruvian strains, approximately 20% of these strains represent a sub-population with an Amerindian ancestry (hsp-Amerind). All of these strains however, irrespective of their ancestral affiliation harbored a complete, 'western' type cag PAI and the motifs surrounding it. This indicates a possible acquisition of cag PAI by the hsp-Amerind strains from the European strains, during decades of co-colonization.


Our observations suggest presence of ancestral H. pylori (hsp-Amerind) in Peruvian Amerindians which possibly managed to survive and compete against the Spanish strains that arrived to the New World about 500 years ago. We suggest that this might have happened after native Peruvian H. pylori strains acquired cag PAI sequences, either by new acquisition in cag-negative strains or by recombination in cag positive Amerindian strains.


Helicobacter pylori is a Gram-negative bacterium that established itself in the human stomach possibly thousands of years ago [1]. This opportunistic pathogen infects over 50% of the worlds' population, causing no harm to most colonized people [2]. Only a small subset of infected people experience H. pylori-associated illnesses such as chronic gastritis, peptic ulcer disease, gastric carcinoma, and mucosa-associated lymphoid tissue (MALT) lymphoma. Associations of various clinical outcomes with disease-specific virulence factors remain dogmatic [3] years after the completion of genome sequences [4]. The debate has been further intensified as some studies have posed the possibility that H. pylori infection has some protective effects in esophageal diseases [3]. Also, possible symbiotic associations have been proposed based on the finding that H. pylori harbor protective, bacteriocin like effect and may therefore be beneficial to its host [5].

Subsequent to the decipherment of the potential of polymorphic DNA markers in reconstruction of human migration and phylogeography [6, 7], pathogen genotypes were successfully used in tracking and analyzing patterns of human migrations [810] in different continents. Recently, sequence variation in H. pylori has provided a window into human population migration [11] and also revealed that impact of religions on stratification of human ethnic groups can be analyzed based on H. pylori haplotypes [12].

Ancient origins and dissemination of H. pylori are quite debatable in the context of the vast South American continent that has witnessed many different waves of population migration [13], especially in view of the fact that H. pylori has been present in this continent since pre-Columbian times [14]. However, evolution of virulence and fitness in such 'ancient' strains that arrived first in the Americas and then, possibly out-competed by the influx of 'modern' strains from Europe [14] remains largely unexplored.

A landmark study based on PCR based DNA motif analysis proposed that H. pylori jumped recently from animals to humans and, therefore, the acquisition of H. pylori by humans may be a recent phenomenon [15]. This study has been the basis for the idea of 'H. pylori free New World' [15]. However, two independent studies based on large-scale analyses of candidate gene polymorphisms contrasted the idea of recent acquisition and suggest that H. pylori might have co-evolved with humans [11, 16]. In view of these intriguing ideas on ancient origin of H. pylori, additional evidences based on strains from different geographical regions (especially those with a rich history of multiple waves of human migrations such as the South Americas) are clearly needed.

We attempted to dissect gene pool diversity of Amerindian isolates of H. pylori from Peru with an objective to explore ancient and modern features of the Peruvian H. pylori strains corresponding to different waves of human migration. We also looked if it is possible to link some of the native Peruvian strains to their ancestors in Asia.


FAFLP based genotyping, candidate gene sequence analysis and multi-locus sequencing

Phylogenetic relationships assessed by FAFLP genotyping of Peruvian isolates (Figure 1A) revealed 2 different lineages corresponding to the hp-Europe and hsp-Amerind. Representative isolates from both the lineages obtained by FAFLP were subjected to reconfirmation by MLST analysis. We observed that all the isolates corresponding to the Amerind type FAFLP profile were found to be genetically closer to strains from Alaska and were therefore, accepted as genotype hsp-Amerind. All the other isolates from Peru clustered with those from Spain and therefore joined MLST genotype hp-Europe.

Figure 1

A. FAFLP analysis of H. pylori strains analyzed from native Peruvians (n = 27). The phylogenetic tree was developed based on various amplitypes generated for individual isolates after allele scoring and generation of similarity profiles in the form of binary tables. Genetic relationships in the form of a tree were deduced using MEGA 3.0 software using bootstrapping method at 10000 bootstrap trials. Two different lineages observed in the tree are colored as per the previous conventions [11, 20]. Six isolates from among those represented in the FAFLP tree (highlighted with circles) were also analyzed subsequently by MLST. B. Phylogenetic analysis of representative hsp-Amerind sequences from our isolates (SJM 83, 92) and those of other Amerind and non-Amerind (western) H. pylori sequences previously described by Ghose et al. from Caracas and Puerto Ayacucho in Venezuela [16]. SJM 83 and 92 formed a separate branch (maroon color) only with Amerind isolates (right). Both the Amerindian and western lineages (green color) are differentially colored as per previous conventions [11, 20]. Sequence alignment (left) of these Amerind sequences revealed significant conservation at the level of nucleotides.

Figure 2

MLST analysis based on concatenated gene sequences of 7 housekeeping genes of H. pylori (Kimura-2 parameter). The phylogenetic tree was based on a total of 19 sequence records (concatenated) obtained under this study (SJM, HUPB, HU, CPY) while incorporating other ~400 sequence records from pubMLST database ( which were specific to different genotypes in the world (Courtesy, Daniel Falush). Different genetic populations (Hp) and subpopulations (hsp) or genotypes are named and differentially colored after previous conventions [11, 20]. All SJM isolates, Amerindians (SJM23 and 92-highlighted) and non-Amerindians (arrowheads) analyzed by us from Peru are highlighted in bold face black fonts with green twigs indicating presence of a western type cag PAI.

We looked at the geographic signatures of oip A gene sequences in Peruvian isolates and found that 7 of the 27 isolates including the 2 hsp-Amerind isolates had an Eastern type oip A (CT < 6) signature; 4 of these had a non functional gene (frame out) (Table 1). Nineteen of the Peruvian isolates had a western type oip A (CT ≥ 6), 15 of which had a functionally intact coding frame. It appears that oip A sequence conveys the European ancestry for a majority of these isolates, including some hsp-Amerind, however, oip A is a virulence linked gene and its resolution power for lineage identification is not as robust as that of the 7 housekeeping genes we used, or the bab B gene. It is likely that oip A might have been exchanged or recombined between different lineages (like most of the other virulence genes such as the cag PAI) to give hybrid genotypes.

Table 1 Characteristics of 26 Peruvian H. pylori isolates obtained from Amerindians and distribution, presence/absence and or rearrangement of various gene loci in their genomes; 0 – region not amplified, 1 – region present.

The bab B gene sequences of our hsp-Amerind isolates were compared to the sequences previously reported for Amerindian isolates from Venezuela by Ghose et al. [16] by multiple sequence alignment and phylogenetic analysis (Figure 1B). These phylogenies, when studied in the light of MLST based inferences, revealed distinct affinities between Peruvian Amerindian strains and Venezuelan Amerind strains, pointing thus to their common Asian origins (Figure 1B).

In summary, our overall phylogenetic analyses, demonstrated that about 20% of the strains from native Peruvian patients that we examined share common genotypic patterns with Asian (Amerind) strains and therefore, Ancient Asian gene pool must have been the origin of the present day strains inhabiting the native Peruvians.

Analysis of the cagPAI and the cagA gene in isolates from Spain, Peru and Japan

Overlapping primer amplification strategy to span entire cag PAI worked very well with our hsp-Amerind isolates (Figure 3A) where all the constituent genes of the PAI were successfully amplified. This indicates that the 5 Amerind strains we looked at represent a 'chimera genome' made up of an ancient like core genome component (MLST and bab B typing) and a modern type flexible genome component (cag PAI and its right junction typing). Tyrosine phosphorylation of the immunodominant cag A protein is known to occur at the EPIYA motifs at the C-terminus by the SRC family of kinases [17, 18]. This EPIYA motif consists of 4 distinct EPIYA sites – EPIYA-A, -B, -C and -D, based on the amino acid sequence that neighbor it (Figure 3B). Based on the presence of these EPIYA motifs, cag A can be distinguished into the Western type (W) in case the -C site is present and the East Asian (EA) type in case where the -C site is replaced by the -D site. Our data for the type of EPIYA motifs present in Spanish (n = 6), Peruvian (n = 27), and Japanese (n = 16) strains revealed that the western type of cag A was predominant among all the Spanish and Peruvian strains. The Japanese isolates alone revealed East Asian type EPIYA D signatures (Figure 3B).

Figure 3

A. (Top) PCR based analysis of whole cag PAI of hsp-Amerind isolates from Peru. Overlapping primers spanning all of the constituent genes (see methods) amplified all the corresponding PCR products in the expected size range as described previously [30]. M indicates molecular weight marker (100 bp ladder). (Bottom) Amino acid signature of cag A (phosphorylation motifs – colored) – characteristic of modern cag A EPIYA (EPIYA C) as observed for all the SJM isolates from Peru. B. Pictorial depiction (right) of different types of cag A-EPIYA motif types prevalent in different H. pylori populations and their distribution in our isolates (boxes on the left).

Results of the PCR of cag PAI from 26 Peruvian isolates studied in the present context have been depicted in Table 1. Briefly, the strains from Peru and Spain did not readily amplify regions of the cag PAI as did Japanese isolates, suggesting thereby a distinct allelic diversity at the primer binding sites in the western PAI of these isolates. This observation places native Peruvian strains closer to Spanish strains and therefore hints that both the Spanish and Peruvian strains harbor a similar type of cag PAI.

The right junction of the cag PAI also revealed similar acquaintances based on PCR based insertion deletion and substitution analysis of the region spanning cag A right junction to glr, for the 6 Peruvian isolates we analyzed. Such genotypes for all the isolates we used from Spain, Japan and Peru were determined earlier by Kersulyte et al [15]. Collectively, this study and the previous observations [15] demonstrated that the cag right junction motif types were shared by Spanish and Peruvian isolates and that Japanese isolates did not share genetic affinities with Peruvian strains. This observation again places native Peruvian strains closer to Spanish strains and therefore hints that both the Spanish and Peruvian strains share cag PAI insertion sites and the regions flanking them.


H. pylori is presumed to be an ancient colonizer of the human stomach which possibly co-evolved with its host. We report that at least some of the H. pylori strains found in Peru share considerable homology with strains found in Asia. This supports the hypothesis that H. pylori was associated with its host well before Asian people crossed the Bering strait (20,000 years BP) to colonize America. We also report that the cag PAI was acquired by native Peruvian strains probably from a European source. This might have occurred as a single import, most probably during the last 500 years of the spread of H. pylori in the South American continent. It is evident from the fact that the Amerindian Peruvian strains, though of an Asian descent, do not share characteristic features of Asian cag PAIs but show homology to the PAIs of European strains. Alternatively, the Amerindian strains might have gained the PAI through a series of recombination events over a period of time. But this hypothesis appears weak when we take into account the short time span of 500 years within which the European strains spread in the Americas.

We tried to potentate these ideas and to provide evidence in favor of the ancient origin of the pathogen and the possibility that some extraneous gene cassettes might have been acquired by otherwise symbiotic H. pylori, sometimes during its natural history. To support this proposition, our methodology was targeted with a two pronged approach to i) further substantiate ancient link of the pathogen and ii) to prove that the pathogenicity island was a 'recent' addition to the genome of H. pylori. Our analyses based on FAFLP and MLST linked about 20% of the Peruvian isolates to Amerindian genotype, and conveyed that H. pylori was most probably introduced to the New World by Asian people. We, therefore, disagree with the idea of an 'H. pylori free New World' [15]. This disagreement of interpretation arose possibly because Kersulyte et al. [15] looked at only a few loci in the genome and stressed mainly on the motifs surrounding the cag PAI on its right junction.

We also looked at the cag PAI of such strains and found that the Peruvian isolates we tested carried western type (EPIYA C) cag islands (Figure 3B). We did not record any eastern type signature in the cag PAI (EPIYA D motif) in Peruvian isolates, given a distinct presence of ethnic Japanese in Peru. This inference also came from MLST data (Figure 2), where none of the Peruvian isolates clustered with Japanese genotype hp-EastAsia (Figure 2). Similarly, the absence of Asian/Amerindian type islands (or their remnants in native Peruvian isolates we analyzed, leads us to speculate that their ancestors in Asia were seemingly benign due to (natural?) absence of functional cag genes. This finding potentates the idea that cag genes in Peru mainly originated in Europe and therefore confirms the scenario proposed by Kersulyte et al. [15] as far as the cag PAI and its right junction is concerned.

Kauser et al., [19] from our group have previously described PCR analyses of the cag PAI for more than 300 strains from different parts of the world. Of these, the majority of strains from whom complete PAI was amplified were from Japan (57%) whereas only 18.6% strains from Peru and 13.3% strains from Spain could support amplification of an intact cag PAI, due mainly to allelic diversity present at the primer annealing sites. These observations were further endorsed by our present analysis that revealed that all the Peruvian isolates we analyzed carried only the European type PAI.

It has been recently reported that true Amerind strains either do not carry a cag PAI or carry only a vestigial, incomplete PAI [20]. It is already well known that H. pylori can import short stretches of DNA from strains from very different populations when they (presumably co-) infect individuals in the same location. However, it appears that the cag PAI and the region surrounding it had been exchanged by the Amerindian strains in Peru, where, the human population underwent major changes in recent history with the arrival of European conquerors and settlers. The long isolated Amerindian H. pylori strains thus came in contact with the European strains, which harbored a cag PAI. It is established that the cag PAI might give selective advantages during host colonization and therefore, "endogenous" Helicobacter, in Peru could be out-competed in a human community with newly arrived cag PAI positive strains introduced by the European conquerors. The "endogenous" Helicobacter strains could moreover acquire cag PAI during mixed infection with western cag PAI positive strains leading to the observed strains with Eastern-like core genome content and a western-type cag PAI. So this finding would really be interesting if it is clear that the entire cag-island had been exchanged. Then, there would be a very interesting question about the mechanism by which an entire island had been transferred; there is no doubt that such exchanges have happened at some point, but if they happened within the last 500 years, then, there would be much better chances of catching it in action. This is probably where the present data lead us to, and, we suggest that future efforts may be directed towards confirmation of this evolutionary mechanism.

Finally, it is possible that phylogenetic methods based on highly recombining gene loci [2127] may not be fully perfect to predict genetic relationships in terms of inheritance from different ancestral populations, especially when we use tools such as Mega 3.0 [28] which do not support admixture analysis. Partly in view of this possibility and to ensure that our conclusions did not represent shortcomings of a single approach, we adopted an integrated genotyping strategy for the cag PAI [29, 30] as well as the core genome through MLST of less recombining, neutral genes that encode cytoplasmic enzymes. Given the fact that these housekeeping genes are selectively neutral and uniform as compared to virulence associated loci such as the flagellins and vac A [31], recombinant and hybrid alleles that blur phylogeographic inferences, could be a rare occurrence rather than a rule. Nonetheless, it will be important to ascertain proportions of nearly pure and hybrid alleles among native Peruvian H. pylori through admixture analyses based on sophisticated population genetics tools [32] that reveal contemporary gene flow and proportion of different nucleotides inherited from ancestral populations on an evolutionary time-scale.


In summary, our study based on profiling of several gene loci and neutral markers revealed certain distinctive genetic features of the H. pylori gene pool in Peru. Important among these features are a demonstrated genetic link with Asian (Amerindian) strains and the presence of an intact cag PAI with western type of gene motifs that point to recent acquisition of this important pathogenicity island. This indicates possible lateral gene transfers during colonization of Peruvian Amerindians with both 'ancient' and 'modern' strains for several generations. Such an admixed gene pool could be an important source of genetic information on pathogen evolution in real time to possibly understand how gene acquisition and loss on a population wide scale shape virulence and fitness in different pathogens.

This could also possibly provide for a reasonable model of geographic evolution to understand acquisition of virulence in pathogenic bacteria over a period of time.


Bacterial strains, genomic DNA and diagnostic PCR

Genomic DNA preparations for strains originating from Peru, Spain and Japan were provided by D. E. Berg and Asish Mukhopadhyay (Washington University, St. Louis, Mo.). These DNA were isolated from patients diagnosed with gastric ulcers from Spain (HupB); gastric cancer and DU cases from Japan (CPY, Hu); and from gastritis cases alone from Peru (SJM). However, in the current study, the clinical background of the individual isolates was not taken into account. The Peruvian isolates we looked at (n = 27) were originally from Native Peruvian people mainly of Amerindian ancestry from Lima [15]. PCR based analysis of genes namely glm M, bab B [16] and oip A was carried out to ascertain the quality of DNA samples we used. Also these PCR assays served as amplification level controls for the analysis of insertion, deletion and substitution in the cag PAI.

Integrated genotyping of H. pylori based on chromosomal DNA signatures

Whole genome fingerprinting based on FAFLP genotyping was performed as described previously [2123]. Briefly, the profiling of whole genome micro-restriction fingerprints with Eco RI/Mse I enzymes using fluorescence tagged primer pairs Eco RI+A/Mse I+0 and Eco RI+G or A/Mse I+0 was performed for all the strains. The PCR amplified fragments for each of the strains were then subjected to electrophoretic separation on a 5% acrylamide gel and scoring of the fluorescent markers was done using an automated DNA analysis workstation (ABI Prism 3100 DNA sequencer). Cluster analysis of DNA profiles was conducted on the basis of fingerprint characteristics. All the data obtained through molecular genotyping and DNA profiling were deposited in the genoBASE pylori database [33]. The genoBASE pylori server was queried for comparative analyses.

Genotyping based on candidate genes oip A and bab B was carried out as described [2426]. Short stretches of oip A gene were analyzed to determine the 'geographic signature' based on CT repeat [27]. In addition, 600 bp region each from the 7 housekeeping genes spread throughout the genome [atp A, efp, ure I, ppa and mut Y, trp C, yph C] was amplified and sequenced for all the isolates exactly as described previously [11]. Sequencing was performed with both forward and reverse primers, using an ABI Prism3100 DNA sequencer (Applied Biosystems, USA). PCR and direct sequencing were performed at least twice to determine and confirm the DNA sequences for each isolate. Consensus sequence for each of the samples was generated using Genedoc (version 2.6.002). Multiple alignments of sequenced nucleotides were carried out using Clustal X (version 1.81). Neighbor joining trees were constructed in Mega 3.0 [28], using bootstrapping at 10000 bootstrap trials (FAFLP and bab B) and through Kimura-2 parameters (for MLST). For construction of phylogenetic trees based on MLST genotyping procedures, sequences of 7 housekeeping genes of strains belonging to different established genotypes were obtained from the pubMLST database [34] (courtesy, Daniel Falush).

The nucleotide sequences of the 7 housekeeping genes for the 6 hsp-Amerind isolates we analyzed have been deposited in the GenBank [Accession numbers, GenBank:DQ462362–462367 (atp A), GenBank:DQ462368–462373 (efp), GenBank:DQ462374–462379 (mut Y), GenBank:DQ462380–462385 (ppa), GenBank:DQ462386–462391 (trp C), GenBank:DQ462392–462397 (ure I), GenBank:DQ462398–462403 (yph C)]. These and other sequences can also be requested from the authors.

Profiling of the cagPAI

PCR analyses were carried out to find the status of the cag PAI using 8 sets of primers that amplified the cag A gene, its promoter region, the cag E and cag T genes and the left end of the cag PAI [25, 26, 29]. We also analyzed whole cag-PAI of the representative isolates (SJM92 and SJM23) from hspAmerind by PCR using overlapping primers as described by Blomstergren and colleagues [30]. The 3' end of the cag A gene was amplified using primers mentioned elsewhere [17] and the amplified products for strains from Spain, Peru, and Japan were sequenced with forward and reverse primers. The consensus sequences were then translated into amino acid sequences using GeneDoc software (version 2.6.002) and were then assigned to the Western or the East Asian group based on the C or D repeat present respectively in the EPIYA motif [18]. Chromosomal rearrangements are known to give rise to 5 types of insertion-deletion and substitution motifs in the region between the 3' end of cag A gene and the 3' end of the glutamate racemase (glr) gene. Although the statuses of these motifs for the Peruvian strains we analyzed were described previously by Kersulyte and colleagues [15], we re-assessed 6 of them by PCR exactly as described earlier [15].


  1. 1.

    Covacci A, Telford JL, Giudice GD, Parsonnet J, Rappuoli R: Helicobacter pylori virulence and genetic geography. Science. 1999, 284: 1328-1333. 10.1126/science.284.5418.1328.

    PubMed  CAS  Article  Google Scholar 

  2. 2.

    Montecucco C, Rappuoli R: Living dangerously: how Helicobacter pylori survives in the human stomach. Nat Rev Mol Cell Biol. 2001, 2: 457-466. 10.1038/35073084.

    PubMed  CAS  Article  Google Scholar 

  3. 3.

    Ahmed N, Sechi LA: Helicobacter pylori and gastroduodenal pathology: New threats of the old friend. Annals Clin Microbiol Antimicrobials. 2005, 4: 1-10.1186/1476-0711-4-1.

    Article  Google Scholar 

  4. 4.

    Alm RA, Ling LSL, Moir DT, King BL, Brown ED, Doig PC, Smith DR, Noonan B, Guild BC, deJonge BL, Carmel G, Tummino PJ, Caruso A, Nickelsen MU, Mills DM, Ives C, Gibson R, Merberg D, Mills SD, Jiang Q, Taylor DE, Vovis GF, Trust TJ: Genomic-sequence comparison of two unrelated isolates of the human gastric pathogen Helicobacter pylori. Nature. 1999, 397: 176-180. 10.1038/16495.

    PubMed  Article  Google Scholar 

  5. 5.

    Putsep K, Branden CI, Boman HG, Normark S: Antibacterial peptide from H. pylori. Nature. 1999, 398: 671-672. 10.1038/19439.

    PubMed  CAS  Article  Google Scholar 

  6. 6.

    Cavalli-Sforza LL: The DNA revolution in population genetics. TIG. 1998, 14: 60-65.

    PubMed  CAS  Article  Google Scholar 

  7. 7.

    Cavalli-Sforza LL, Feldman MW: The application of molecular genetic approaches to the study of human evolution. Nat Genet. 2003, 33 (Suppl): 266-275. 10.1038/ng1113.

    PubMed  CAS  Article  Google Scholar 

  8. 8.

    Holmes EC: The phylogeography of human viruses. Mol Ecol. 2004, 13: 745-756. 10.1046/j.1365-294X.2003.02051.x.

    PubMed  Article  Google Scholar 

  9. 9.

    Pavesi A: Utility of JC polyomavirus in tracing the pattern of human migrations dating to prehistoric times. J Gen Virol. 2005, 86: 1315-1326. 10.1099/vir.0.80650-0.

    PubMed  CAS  Article  Google Scholar 

  10. 10.

    Wirth T, Meyer A, Achtman M: Deciphering host migrations and origins by means of their microbes. Mol Ecol. 2005, 14: 3289-3306. 10.1111/j.1365-294X.2005.02687.x.

    PubMed  CAS  Article  Google Scholar 

  11. 11.

    Falush D, Wirth T, Linz B, Pritchard JK, Stephens M, Kidd M, Blaser MJ, Graham DY, Vacher S, Perez-Perez GI, Yamaoka Y, Me'graud F, Otto K, Reichard U, Katzowitsch E, Wang X, Achtman M, Suerbaum S: Traces of human migrations in Helicobacter pylori populations. Science. 2003, 299: 1582-1585. 10.1126/science.1080857.

    PubMed  CAS  Article  Google Scholar 

  12. 12.

    Wirth T, Wang X, Linz B, Novick RP, Lum JK, Blaser M, Morelli G, Falush D, Achtman M: Distinguishing human ethnic groups by means of sequences from Helicobacter pylori: lessons from Ladakh. Proc Natl Acad Sci USA. 2004, 101: 4746-4751. 10.1073/pnas.0306629101.

    PubMed  CAS  PubMed Central  Article  Google Scholar 

  13. 13.

    Underhill PA, Jin L, Zemans R, Oefner PJ, Cavalli-Sforza LL: A pre-Columbian Y chromosome-specific transition and its implications for human evolutionary history. Proc Natl Acad Sci USA. 1996, 93: 196-200. 10.1073/pnas.93.1.196.

    PubMed  CAS  PubMed Central  Article  Google Scholar 

  14. 14.

    Yamaoka Y, Orito E, Mizokami M, Gutierrez O, Saitou N, Kodama T, Osato MS, Kim JG, Ramirez FC, Mahachai V, Graham DY: Helicobacter pylori in north and south America before Columbus. FEBS Lett. 2002, 517: 180-184. 10.1016/S0014-5793(02)02617-0.

    PubMed  CAS  Article  Google Scholar 

  15. 15.

    Kersulyte D, Mukhopadhyay AK, Velapatino B, Su WW, Pan ZJ, Garcia C, Hernandez V, Valdez Y, Mistry RS, Gilman RH, Yuan Y, Gao H, Alarcon T, Lopez-Brea M, Nair GB, Chowdhury A, Datta S, Shirai M, Nakazawa T, Ally R, Segal I, Wong BCY, Lam SK, Olfat F, Boren T, Engstrand L, Torres O, Schneider R, Thomas JE, Czinn S, Berg DE: Differences in genotypes of Helicobacter pylori from different human populations. J Bacteriol. 2000, 182: 3210-3218. 10.1128/JB.182.11.3210-3218.2000.

    PubMed  CAS  PubMed Central  Article  Google Scholar 

  16. 16.

    Ghose C, Perez-Perez GI, Bello MGD, Pride DT, Bravi CM, Blaser MJ: East Asian genotypes of Helicobacter pylori strains in Amerindians provide evidence for its ancient human carriage. Proc Natl Acad Sci USA. 2002, 99: 15107-15111. 10.1073/pnas.242574599.

    PubMed  CAS  PubMed Central  Article  Google Scholar 

  17. 17.

    Yamaoka Y, Kodama T, Kashima K, Graham DY, Sepulveda AR: Variants of the 3' region of the cag A gene in Helicobacter pylori isolates from patients with different H. pylori-associated diseases. J Clin Microbiol. 1998, 36: 2258-2263.

    PubMed  CAS  PubMed Central  Google Scholar 

  18. 18.

    Hatakeyama M: Oncogenic mechanisms of Helicobacter pylori cag A protein. Nature Rev Cancer. 2004, 4: 688-694. 10.1038/nrc1433.

    CAS  Article  Google Scholar 

  19. 19.

    Kauser F, Khan AA, Hussain MA, Carroll IM, Ahmad N, Tiwari S, Shouche Y, Das B, Alam M, Ali SM, Habibullah CM, Sierra R, Megraud F, Sechi LA, Ahmed N: The cag pathogenicity island of Helicobacter pylori is disrupted in a majority of patient isolates from different human populations. J Clin Microbiol. 2004, 42: 5302-5308. 10.1128/JCM.42.11.5302-5308.2004.

    PubMed  CAS  PubMed Central  Article  Google Scholar 

  20. 20.

    Gressmann H, Linz B, Ghai R, Pleissner KP, Schlapbach R, Yamaoka Y, Kraft C, Suerbaum S, Meyer TF, Achtman M: Gain and loss of multiple genes during the evolution of Helicobacter pylori. PLoS Genet. 2005, 1 (4): e43-10.1371/journal.pgen.0010043.

    PubMed  PubMed Central  Article  Google Scholar 

  21. 21.

    Ahmed N, Khan AA, Alvi A, Tiwari S, Jyothirmayee CS, Kauser F, Ali M, Habibullah CM: Genomic analysis of Helicobacter pylori from Andhra Pradesh, south India: molecular evidence for three major genetic clusters. Curr Sci. 2003, 85: 101-108.

    Google Scholar 

  22. 22.

    Carroll IM, Ahmed N, Beesley SM, Khan AA, Ghousunnissa S, O'Morain CA, Smyth CJ: Fine-structure molecular typing of Irish Helicobacter pylori isolates and their genetic relatedness to strains from four different continents. J Clin Microbiol. 2003, 41: 5755-5759. 10.1128/JCM.41.12.5755-5759.2003.

    PubMed  CAS  PubMed Central  Article  Google Scholar 

  23. 23.

    Carroll IM, Ahmed N, Beesley SM, Khan AA, Ghousunnissa S, O'Morain CA, Habibullah CM, Smyth CJ: Microevolution between paired antral and paired antral and corpus Helicobacter pylori isolates recovered from individual patients. J Med Microbiol. 2004, 53: 669-677. 10.1099/jmm.0.05440-0.

    PubMed  CAS  Article  Google Scholar 

  24. 24.

    Kauser F, Hussain MA, Ahmed I, Ahmad N, Habeeb A, Khan AA, Ahmed N: Comparing genomes of Helicobacter pylori strains from the high altitude desert of Ladakh, India. J Clin Microbiol. 2005, 43: 1538-1545. 10.1128/JCM.43.4.1538-1545.2005.

    PubMed  CAS  PubMed Central  Article  Google Scholar 

  25. 25.

    Kauser F, Hussain MA, Ahmed I, Srinivas S, Devi SM, Majeed AA, Rao KR, Khan AA, Sechi LA, Ahmed N: Comparative genomics of Helicobacter pylori isolates recovered from ulcer disease patients in England. BMC Microbiol. 2005, 5: 32-10.1186/1471-2180-5-32.

    PubMed  PubMed Central  Article  Google Scholar 

  26. 26.

    Prouzet-Mauleon V, Hussain MA, Lamouliatte H, Kauser F, Megraud F, Ahmed N: Pathogen evolution in vivo: genome dynamics of two isolates obtained nine years apart from a duodenal ulcer patient infected with a single Helicobacter pylori strain. J Clin Microbiol. 2005, 43: 4237-4241. 10.1128/JCM.43.8.4237-4241.2005.

    PubMed  CAS  PubMed Central  Article  Google Scholar 

  27. 27.

    Ando T, Peek RM, Pride D, Levine SM, Takata T, Lee YC, Kusugami K, van der Ende A, Kuipers EJ, Kusters JG, Blaser MJ: Polymorphisms of Helicobacter pylori HP0638 reflect geographic origin and correlate with cagA status. J Clin Microbiol. 2002, 40: 239-246. 10.1128/JCM.40.1.239-246.2002.

    PubMed  CAS  PubMed Central  Article  Google Scholar 

  28. 28.

    Kumar S, Tamura K, Nei M: MEGA3: Integrated software for molecular evolutionary genetics analysis and sequence alignment. Brief Bioinfor. 2004, 5: 150-163. 10.1093/bib/5.2.150.

    CAS  Article  Google Scholar 

  29. 29.

    Ikenoue T, Maeda S, Gura KO, Akanuma M, Mitsuno Y, Imai Y, Yoshida H, Shiratori Y, Omata M: Determination of Helicobacter pylori virulence by simple gene analysis of the cag pathogenicity island. Clin Diag Lab Immunol. 2001, 8: 181-186. 10.1128/CDLI.8.1.181-186.2001.

    CAS  Google Scholar 

  30. 30.

    Blomstergren A, Lundin A, Nilsson C, Engstrand L, Lundeberg J: Comparative analysis of the complete cag pathogenicity island sequence in four Helicobacter pylori isolates. Gene. 2004, 328: 85-93. 10.1016/j.gene.2003.11.029.

    PubMed  CAS  Article  Google Scholar 

  31. 31.

    Achtman M, Azuma T, Berg DE, Ito Y, Morelli G, Pan ZJ, Suerbaum S, Thompson S, van der Ende A, van Doorn LJ: Recombination and clonal groupings within Helicobacter pylori from different geographical regions. Mol Microbiol. 1999, 32: 459-470. 10.1046/j.1365-2958.1999.01382.x.

    PubMed  CAS  Article  Google Scholar 

  32. 32.

    Falush D, Stephens M, Pritchard JK: Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics. 2003, 164: 1567-1587.

    PubMed  CAS  PubMed Central  Google Scholar 

  33. 33.

    GenoBASE pylori database. []

  34. 34.

    pubMLST database. []

Download references


We are thankful to Seyed E. Hasnain for his guidance and support. We are highly grateful to Asish Mukhopadhyay and Douglas E. Berg for providing us with DNA samples of Peruvian and other strains as a gift. We are thankful to Daniel Falush for support with international MLST data and advice. We are also thankful to the International Society for Genomic and Evolutionary Microbiology (ISOGEM) for supporting and endorsing the study. Financial support from the Department of Biotechnology, Government of India (CDFD core grants) is gratefully acknowledged.

Author information



Corresponding author

Correspondence to Niyaz Ahmed.

Additional information

Authors' contributions

SMD and IA performed MLST typing and phylogenetic analysis. SMD also helped in analysis of bab B and oip A genotyping. AAK and AA performed PCR based analysis of the cag PAI genes. SAR extended bioinformatics support. LAS provided expert clinical consultation and contributed to manuscript writing. NA performed FAFLP analysis, planned and supervised the study, wrote the manuscript and provided overall leadership. All the authors read and approved the final manuscript.

S Manjulata Devi, Irshad Ahmed contributed equally to this work.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Devi, S.M., Ahmed, I., Khan, A.A. et al. Genomes of Helicobacter pylori from native Peruvians suggest admixture of ancestral and modern lineages and reveal a western type cag-pathogenicity island. BMC Genomics 7, 191 (2006).

Download citation


  • Core Genome
  • Pylorus Strain
  • European Strain
  • South American Continent
  • Western Type