Skip to main content

Comparative analysis of cancer genes in the human and chimpanzee genomes



Cancer is a major medical problem in modern societies. However, the incidence of this disease in non-human primates is very low. To study whether genetic differences between human and chimpanzee could contribute to their distinct cancer susceptibility, we have examined in the chimpanzee genome the orthologous genes of a set of 333 human cancer genes.


This analysis has revealed that all examined human cancer genes are present in chimpanzee, contain intact open reading frames and show a high degree of conservation between both species. However, detailed analysis of this set of genes has shown some differences in genes of special relevance for human cancer. Thus, the chimpanzee gene encoding p53 contains a Pro residue at codon 72, while this codon is polymorphic in humans and can code for Arg or Pro, generating isoforms with different ability to induce apoptosis or interact with p73. Moreover, sequencing of the BRCA1 gene has shown an 8 Kb deletion in the chimpanzee sequence that prematurely truncates the co-regulated NBR2 gene.


These data suggest that small differences in cancer genes, as those found in tumor suppressor genes, might influence the differences in cancer susceptibility between human and chimpanzee. Nevertheless, further analysis will be required to determine the exact contribution of the genetic changes identified in this study to the different cancer incidence in non-human primates.


Cancer is a major and growing clinical problem in modern societies. Although usually referred to as a single disease, cancer represents more than 200 different pathologies, which are characterized by an uncontrolled cell growth that may derive in the invasion of surrounding tissues and the subsequent generation of metastasis in distant organs of the body [1, 2]. Tumor development is a complex process in which genetic, epigenetic and environmental factors are implicated [36]. The importance of genetic factors in cancer is now well established, as mutations in specific genes have been associated with the neoplastic transformation and development of specific cancer types [3, 7, 8]. This fact is further supported by the existence of hereditary cancer syndromes, caused by germ-line mutations in specific genes and responsible for about 5% of all diagnosed malignancies [911]. Over the last two decades, a number of studies have focused on the identification of the different genes that can contribute to cancer. These studies have led to the conclusion that alterations in three types of genes – oncogenes, tumor-suppressor genes, and stability genes – are mainly associated with the genesis of cancer [3]. These studies have alsocontributed to elucidate the molecular mechanisms through which these genes act during tumor development and progression [12]. Finally, in some cases this knowledge has resulted in the introduction of new therapeutic strategies for cancer treatment [1316].

Chimpanzees (Pan troglodytes) represent our most closely related organism. They have a more similar physiology to human than any other model organism, and the study of several human diseases in chimpanzee has led to a better understanding of pathologies such as hepatitis or AIDS [17, 18]. Interestingly, a number of works have reported that cancer incidence in non-human primates is very low. This fact is especially evident for epithelial neoplasms such as breast, prostate or lung carcinomas, which are responsible for more than 20% of human deaths but whose incidence in great apes is less than 2% [1822]. The difference in cancer incidence between human and chimpanzees can mainly derive from three facts: i) exposure to different environmental factors, including diet and habits, ii) differences in life expectancy and iii) genetic differences that might result in humans being more susceptible to cancer development.

The completion of the first draft of the chimpanzee genome sequence has opened the possibility to study whether genetic differences between human and chimpanzee could contribute to the observed differences in cancer susceptibility between both species [23]. To address this question, we have used the chimpanzee genome sequence to identify and compare the orthologous genes of a set of 333 human genes that have been previously implicated in cancer development [3, 7]. This analysis has revealed that all analyzed human cancer genes are present in the chimpanzee and exhibit a high degree of conservation between both species.

However, further detailed analysis of a series of genes of special interest in human cancer such as p53 and BRCA1, has revealed some differences with the corresponding chimpanzee counterparts. In this work, we present the results of this comparative genomic analysis and discuss the putative relevance of the observed genetic changes for explaining some aspects of the differential cancer susceptibility between human and chimpanzee.


Analysis of cancer genes in the chimpanzee genome

As an initial attempt to study cancer-associated genes in the chimpanzee, we have compared 333 human cancer genes to the chimpanzee genome and identified and analyzed the orthologous genes (see Additional data file 1). The set of human cancer genes was selected from the literature based on mutational analysis and/or roles in processes such as chromosomal stability, promoter methylation or control of mitotic checkpoints [3, 4, 7, 2428]. As can be seen in Figure 1, more than 50% of the genes belong to three main functional categories: transcription factors, phosphorylation (including kinases, kinase inhibitors and phosphatases), and DNA repair. Other functional groups include structural proteins, GTPases and GTPase regulators or proteins involved in ubiquitylation. Finally, other minor groups include tumor suppressors, the protein and RNA components of the telomerase, or proteins implicated in apoptosis among others.

Figure 1

Distribution of cancer genes in different functional categories. The percentage of genes present in each category is indicated.

To perform this study, the cDNA sequence from all human cancer genes was first compared with the chimpanzee genome in order to determine the presence or absence of the corresponding orthologs, and then to predict their complete cDNA sequences. This analysis revealed that all analyzed human cancer genes are present in the chimpanzee genome. The high sequence coverage of this set of cancer genes (96% coverage at the nucleotide level, 1,161,044 nucleotides), has allowed us to perform a detailed comparison – at the nucleotide and amino acid level – of cancer genes between both species (see Additional data file 1 for more details).

Direct comparison of human and chimpanzee cancer genes indicates that they are highly conserved, showing 99.38% identities at the protein level, and 99.19% at the nucleotide level, what is similar to the average amino acid identities between both organisms (99.38%) [23]. Interestingly, we have identified 71 cancer genes (representing 21% of the total) encoding proteins which are 100% identical to their human counterparts (Additional data file 2). The conservation of cancer genes between human and chimpanzee suggests that they perform essential cellular functions and are in agreement with previous studies [29]. This proposal is consistent with the fact that most analyzed cancer genes encode intracellular proteins highly conserved from yeast to human, and implicated in fundamental processes such as cell signalling, cell cycle control and maintenance of genomic stability [3].

Analysis of insertions and deletions in protein coding regions from thechimpanzee genome

Despite the high conservation between human and chimpanzee cancer genes described above, in the course of the present study we identified 20 chimpanzee genes (6% of all analyzed genes) that encode proteins containing or lacking specific residues due to the insertion or deletion of codons in the corresponding open reading frames (Table 1). The analysis of these genes can be important in order to identify human-specific changes in protein coding genes. A total of 9 chimpanzee genes encode proteins containing extra amino acids in their sequence, 6 of them located within trinucleotide repeats resulting in longer expansions in chimpanzee, with up to 4 extra glutamine residues in the case of MN1. On the other hand, 12 chimpanzee genes encode proteins with less amino acids than their human counterparts. Similarly to the expanded proteins in chimpanzee, 8 of the 12 expansions in human proteins were located in trinucleotide repeats. The higher proportion of longer alleles in humans than in chimpanzee is consistent with known results [30]. Interestingly, analysis of these expanded regions using information retrieved from EST databases, resulted in the finding that six human genes, ASPSCR1, DEK, FANCC, MLLT3, MYST4 and ZNF384, are polymorphic for these loci. As functional trinucleotide tandem repeats that vary in humans have been shown to be also variable in chimpanzee [30], it is likely that all these genes present variability in both species. These results suggest that these regions might be prone to genetic instability, and open the possibility to investigate if all these genes showing human-chimpanzee variability in trinucleotide repeats are polymorphic in humans and influence the risk of tumor development.

Table 1 Cancer genes with codons inserted/deleted in chimpanzee. Summary of chimpanzee cancer genes showing inserted/deleted codons with respect to the human sequence. The number of inserted/deleted codons, the amino acid change and whether the insertion/deletion occurred in trinucleotide expanded regions is shown. The status of this codon in human is also indicated.

During the course of this analysis, we also observed that 52 genes (representing 15.6% of all analyzed cancer genes) contained putative frameshifts and/or premature stop codons in the coding sequences (Additional data file 3). This high number of conflictive regions is not likely to reflect bona fide differences between these species, but the result of sequencing problems or artefacts derived from the assembly process. To evaluate these possibilities, we took advantage of the availability of two independent assemblies for the chimpanzee genome, called ARACHNE and PCAP [23]. The ARACHNE assembly, containing more coverage and fewer order conflicts, is used by most public genome browsers and was employed in the initial steps of this study. The PCAP assembly is a de novo assembly that is non-redundant with ARACHNE and provides an additional resource for this kind of sequencing conflicts. To distinguish artefacts of the assembly process from real differences or sequencing errors, we compared the conflictive regions identified in cancer genes to the PCAP assembly. This strategy resulted in the resolution of 83% of the conflicts, as they were correct in the PCAP assembly, thereby suggesting that they represented errors generated during the assembly process. Nevertheless, 9 cancer genes (BCL3, ELL, GAS7, MLL, MLLT3, NF1, RECQL4, SMARCB1 and TPR) still contained the same frameshift in both assemblies. Because these genes might represent real differences between human and chimpanzee cancer genes, we PCR-amplified all these conflictive regions, and the resulting DNA fragments were subjected to direct sequencing. This strategy allowed us to confirm that these genes do not contain frameshifts or premature stop codons, and encode functional proteins in all cases. These findings call the attention to the degree of accuracy of the chimpanzee genome despite the existence of two assemblies. Ongoing efforts to increase the shotgun coverage from ~4 fold to ~8 fold redundancy will produce a higher quality sequence necessary for a reliable ascertainment of specific interesting discrepancies.

Analysis of missense mutations in cancer genes in the chimpanzee genome

Mutations in human cancer genes can be originated by different causes, including chromosomal translocations resulting in fusion proteins, disruption of the coding sequence by frameshifts or premature stop codons, and point mutations that modify the structure and function of the corresponding protein. As mentioned above, we did not find evidence in the chimpanzee genome of gene fusions, frameshifts or premature stop codons in the set of analyzed cancer genes. However, it is interesting to study the status in the chimpanzee genome of residues that are variable in human genes and are associated with cancer. For this purpose, we extracted the information of variant alleles in human genes from the Human Gene Mutation Database (HGMD) and the Online Mendelian Inheritance in Man (OMIM) databases [31, 32], and determined the corresponding residue in the chimpanzee protein. We found two genes associated with breast cancer, BRCA2 and ERBB2, in which the chimpanzee sequence differed from the human in residues that have been reported to be polymorphic in humans. Both alleles, BRCA2 (N372H) and ERBB2 (V655I), have been associated with different risk of developing breast cancer [33, 34]. Interestingly, in both cases the chimpanzee sequence (N372 and I655) corresponds to the allele that has been associated with a reduced risk of cancer, and both are being found at high frequencies in humans, around 0.8 for N372 [34] and from 0.7 to 0.9 for I665.

We also identified at least two cases in genes associated with colon carcinomas, MLH1 (A441T) and MSH2 (S323C), and one with prostate carcinomas, PON1 (I102V), in which the susceptibility allele in human appears to be the wild-type allele in chimpanzee. However, due to the limited clinical and functional data on these variants, the biological significance of these findings is still unknown, and further studies will be required to determine the pathogenic activity of these alleles.

Analysis of the tumor suppressor gene p53 in the chimpanzee genome

A detailed analysis of the gene encoding the tumor suppressor p53, which is the most frequently mutated gene in human cancer [35], showed a single amino acid difference between human (Arg at position 72) and chimpanzee (Pro72). Interestingly, this p53 codon is polymorphic in human and the allele encoding Pro72 is frequent in some populations [36]. Analysis of chimpanzee DNA from four different geographic regions revealed a Pro/Pro genotype in all cases, what confirmed our findings using the chimpanzee genomic sequence. However, we cannot rule out the possibility that this codon might be polymorphic in chimpanzee and that the Arg72 allele could be present in some individuals, what would require the analysis of a greater number of samples. In this sense, sequencing of 14 chimpanzee samples revealed a Pro/Pro genotype, suggesting that this allele would be the most common one in chimpanzee, with a Poisson distribution maximum estimate of allele frequency of 0.1 at a 95% confidence interval.

Additionally, we have sequenced the same p53 region in different primates, including gorilla, orangutan or mandrill, and determined that codon 72 codes for Pro in all species (Fig. 2). The fact that apes and some Old World monkeys contain Pro at this position, suggests that it is the ancestral allele and the Arg72 allele is unique to the human lineage. Nevertheless, the ancestral Pro/Pro genotype is still present in humans, representing about 8.3% of the Central-European population and 45% of the African yoruba population (see dbSNP rs1042522) [37]. This finding raises interesting questions about the different susceptibility to cancer of both organisms, as several studies have shown that both isoforms are functionally different in their ability to induce apoptosis or interact with p73 [3840].

Figure 2

Amino acid alignment of the p53 proline-rich domain from different primates. Schematic representation of p53 protein (AD: activator domain, PR: proline-rich domain; DB: DNA-binding domain; TD: tetramerization domain; and RD: C-terminal regulatory domain). The proline-rich domain containing Arg72 (arrowhead) was sequenced in different primate species and aligned to the human sequence.

Analysis of the BRCA1 locus in the chimpanzee genome

The tumor suppressor BRCA1 has an unusual evolutionary history, in that ratios of replacement to silent nucleotide substitutions (KA/KS) are greater than one when comparing human and chimpanzee lineages, but not in lineages of other primates or other mammals [4143]. This observation is consistent with positive selection pressure on BRCA1 during recent hominoid evolution. We evaluated the complete human and chimpanzee genomic sequences of BRCA1 in light of this observation. In human and chimpanzee, the BRCA1 locus spans ~130 kb, including the BRCA1 gene itself and a partially duplicated 5' (telomeric) region. The duplication includes the highly conserved bidirectional promoter regulating BRCA1 and the adjacent gene NBR2, a BRCA1 partial pseudogene, and NBR1/M17S2. The number of nucleotide differences between human and chimpanzee sequences at the BRCA1 locus is not remarkable, 1.15%, and does not vary appreciably between the BRCA1 locus and the duplicated region. However, humans and chimpanzee sequences differ by an 8 Kb insertion/deletion within the partially duplicated region. The deletion on the chimpanzee sequence prematurely truncates the NBR2 gene. The consequences to BRCA1 expression in chimpanzee of the truncation of the adjacent, co-regulated gene are not known. In mouse, the region is not duplicated and the bidirectional promoter co-regulates Brca1 and Nbr1 [44].


The availability of the chimpanzee genome sequence provides an excellent opportunity to explore the genetic basis for some of the biological differences between human and our most closely related organism [23]. An striking finding in this regard is the observation that non-human primates show a lower incidence of cancer than humans, specifically for epithelial carcinomas [18, 20]. To investigate if this different susceptibility to cancer was due to genetic differences between humans and chimpanzees, we performed a comparative analysis in the chimpanzee genome of a set of 333 human genes that have been directly implicated in tumor development [3, 7]. This analysis revealed that all human cancer genes contain a clear ortholog in the chimpanzee genome, and show a percentage of identities at the protein level similar to the genome average (99.38%). Nevertheless, the strict conservation of this group of more than 330 genes contrasts with the recent analysis of other groups of genes, including a set of 560 proteases, which shows that at least 7 genes are absent in either human or chimpanzee genomes [45], or the CD33-related Siglecs, which also show specific loss of some genes in the human lineage [46]. These data suggest that despite cancer genes show a percentage of identities similar to the genome average, they are perfectly conserved between human and chimpanzee, confirming previous studies showing a higher conservation of genes implicated in essential cellular functions [29].

This study has shown that despite the high degree of identity in cancer genes, there are 1542 amino acid changes whose contribution to the different cancer susceptibility between human and chimpanzee should be further investigated. Nevertheless, additional factors contributing to the observed differences could include changes in diet, lifestyle or exposure to mutagenic agents [4749]; physiological differences in immune system or in life expectancy and aging rates [50]; variations in gene expression, alternative splicing or in DNA methylation patterns [51, 52]; or alterations in other genes not analyzed in this study. In this regard, we must emphasize that we have compiled a group of 333 genes that are causally implicated in cancer development as a result of mutational analysis or owing to their participation in processes such as control of mitotic checkpoints, chromosomal stability or promoter methylation. This set of genes includes those annotated in a recent census of human cancer genes [7], but also incorporates novel genes described to be mutated in human carcinomas [3, 4, 2528]. However, the possibility that other yet unknown cancer-related genes are responsible for the different susceptibility to this disease between human and chimpanzee, cannot be ruled out.

In relation to the existence of functional differences in the immune system of human and non-human primates, it is now clear that the immune and inflammatory responses play important roles in cancer development and progression [5356]. Accordingly, structural and functional variations in genes associated with these processes might be responsible for the species-specific susceptibility to certain tumors. Life expectancy is another factor that might contribute to the observed differences in cancer incidence between human and chimpanzee. Although age is known to represent a major risk factor for cancer development, the observation that non-human primates have a lower incidence of epithelial carcinomas but not other cancers [18], suggests that life expectancy might contribute only partially to the observed differences in cancer susceptibility. On the other hand, and despite the strong conservation in the coding regions of cancer genes from human and chimpanzee, it is possible that differences in regulatory elements outside these coding regions can result in changes in the expression levels or in the tissue distribution patterns [57]. In fact, recent reports have shown considerable differences in the expression levels of orthologous genes from different primate species [5860], supporting the idea that regulatory changes might account for some interspecies differences, including cancer susceptibility [61].

Despite the high conservation of cancer genes between both species, we identified 20 genes containing several codon insertions or deletions in their protein coding regions, although the functional significance of these differences, including their putative association with cancer, will require further studies. It is interesting to note that in 70% of these cases, the insertion/deletion event occurred within trinucleotide repeats, suggesting that these regions could be prone to genetic instability, in both human and chimpanzee genomes. Analysis of EST databases allowed us to confirm this hypothesis, as six human genes, ASPSCR1, DEK, FANCC, MLLT3, MYST4 and ZNF384, were polymorphic in the number of repeats in these regions. In the case of genes implicated in cancer, it will also be interesting to study if some of the identified haplotypes could confer a higher risk of tumor development.

The fact that most cancer genes show a high degree of conservation between human and chimpanzee, prompted us to analyze in more detail the observed changes in genes previously reported to be of special relevance in human cancer, such as the tumor suppressors p53 and BRCA1. We found that chimpanzee p53 shows a single amino acid difference with human p53, resulting in a protein containing Pro instead of Arg at position 72. Interestingly, this codon is polymorphic in human, and the Pro72 residue is common in all studied human populations [36, 62]. Sequencing of this region in other non-human primates allowed us to confirm that this Pro residue has been conserved during evolution, and suggests that the Arg72 allele, present in 55–92% of the human population, arose in the human lineage. The presence of Pro72 in chimpanzee might have physiological consequences, as several reports have found interesting functional differences between both p53 isoforms. In fact, p53-Arg72 protein has an increased ability to induce apoptosis, to translocate from the nucleus to the mitochondria, to be degraded by the E6 oncoprotein of human papillomavirus or to inactivate p73 when mutated [3840]. Different studies have provided evidence for an increasedrisk of cancer development associated with the Arg72 genotype, although this topic has been controversial and further studies will be necessary to definitely address this question [39, 40, 63]. Therefore, and although the presence of Pro72 in chimpanzee p53 might contribute to the reduced cancer susceptibility in non-human primates, further work will be required to confirm this hypothesis.

Analysis of the tumor suppressor BRCA1 has shown that it has been subjected to positive selection during recent hominoid evolution, as the overall KA/KS ratios are greater than one when comparing human and chimpanzee lineages, but not on lineages of other primates or mammals, as has been recently shown [43]. Additionally, the chimpanzee BRCA1 locus contains an 8 Kb deletion in the partially duplicated 5' region. This deletion prematurely truncates the NBR2 gene, which is regulated by a bi-directional promoter that co-regulates NBR2 and BRCA1. The consequences to BRCA1 expression in chimpanzee of the truncation of the adjacent, co-regulated gene are not known. The distinctive evolution of human and chimpanzee BRCA1 suggest that an evolutionary approach may be important to understanding selection at this, and perhaps other cancer associated genes [43, 64, 65].


In summary, in this work we have performed an analysis of a defined set of 333 genes to try to elucidate the molecular basis of human-chimpanzee differences in one aspect of significant biomedical relevance: the interspecies variability in cancer susceptibility. The overall picture emerging from this comparative analysis is one that reflects the high degree of conservation in this group of cancer genes, although specific differences in relevant genes such as p53 and BRCA1 can illuminate new functional and evolutionary aspects of these tumor-suppressor genes. Altogether, the limited genetic variability found in this human-chimpanzee comparative analysis might contribute to the different cancer susceptibility between these closely related species. However, further investigations will be necessary to determine the influence of these genetic changes in cancer, or whether additional factors, such as changes in gene regulation, immune system genes, life expectancy or environmental influences, might also contribute to this process.


Bioinformatic screening of the chimpanzee genome

A database of human cancer genes was constructed by using information from the literature, and the corresponding cDNA and protein sequences were retrieved from the GenBank database. Each single human cDNA sequence was compared against the ARACHNE and PCAP chimpanzee genome assemblies by using a combination of BLAT and BLAST algorithms [66, 67]. The corresponding chimpanzee cDNA and protein sequences were extracted and compared to the human ortholog by using the EMBOSS sequence analysis package. The full list of cancer genes, accession numbers and comparison to the human orthologs are available at in the Additional data file 1. In all cases, the primary chimpanzee cDNA sequence was obtained from the ARACHNE assembly, as this sequence is used by the Chimpanzee Genome Sequencing Consortium and by most public genome browsers (NBCI, Ensembl, or UCSC). However, in those cases where the chimpanzee gene was incomplete, the PCAP assembly was used to fill gaps in the genome, and the corresponding sequence was incorporated to the chimpanzee cDNA sequence for our analysis.

Conflictive regions were defined as those presenting frameshifts or premature stop codons in the chimpanzee coding sequence. Those regions were carefully examined in both ARACHNE and PCAP assemblies, as well as the corresponding human region in the human genome sequence and EST databases. In four cases (LMO2, MLLT7, MN1, and MYST4) the presence of a putative frameshift or premature stop codon in the chimpanzee sequence resulted from incorrect human sequence entries that contained frameshifts in the deposited entry. Modification of the human cDNA with the aid of EST and genomic sequences resulted in curated human cDNA and protein sequences, and the absence of frameshift or premature stop codon in the corresponding chimpanzee prediction. All other conflictive regions in the chimpanzee genome were solved by PCR amplification and direct sequencing of the corresponding gene using chimpanzee genomic DNA.

PCR amplification and direct sequencing of chimpanzee genes

Chimpanzee, gorilla, orangutan and mandrill genomic DNA from different geographic regions was obtained using standard phenol-chloroform procedures. To analyze conflictive regions in the chimpanzee genome, we designed specific oligonucleotides flanking the exon of interest (Additional data file 3), and the corresponding region was PCR amplified from chimpanzee total genomic DNA in a Perkin Elmer 9700 thermocycler using High Fidelity Taq DNA polymerase or GC-RICH PCR system for regions rich in GC content (Roche Diagnostics). The amplified product was purified and subjected to automatic sequencing using the 5' oligonucleotide as primer using an ABI Prism 310 DNA sequencer (Applied Biosystems).


  1. 1.

    Hahn WC, Weinberg RA: Modelling the molecular circuitry of cancer. Nat Rev Cancer. 2002, 2: 331-341. 10.1038/nrc795.

    PubMed  Article  Google Scholar 

  2. 2.

    Pantel K, Brakenhoff RH: Dissecting the metastatic cascade. Nat Rev Cancer. 2004, 4: 448-456. 10.1038/nrc1370.

    PubMed  Article  Google Scholar 

  3. 3.

    Vogelstein B, Kinzler KW: Cancer genes and the pathways they control. Nat Med. 2004, 10: 789-799. 10.1038/nm1087.

    PubMed  Article  Google Scholar 

  4. 4.

    Feinberg AP, Tycko B: The history of cancer epigenetics. Nat Rev Cancer. 2004, 4: 143-153. 10.1038/nrc1279.

    PubMed  Article  Google Scholar 

  5. 5.

    Thilly WG: Have environmental mutagens caused oncomutations in people?. Nat Genet. 2003, 34: 255-259. 10.1038/ng1205.

    PubMed  Article  Google Scholar 

  6. 6.

    Borgono CA, Diamandis EP: The emerging roles of human tissue kallikreins in cancer. Nat Rev Cancer. 2004, 4: 876-890. 10.1038/nrc1474.

    PubMed  Article  Google Scholar 

  7. 7.

    Futreal PA, Coin L, Marshall M, Down T, Hubbard T, Wooster R, Rahman N, Stratton MR: A census of human cancer genes. Nat Rev Cancer. 2004, 4: 177-183. 10.1038/nrc1299.

    PubMed  PubMed Central  Article  Google Scholar 

  8. 8.

    Sugimoto M, Tahara H, Ide T, Furuichi Y: Steps involved in immortalization and tumorigenesis in human B-lymphoblastoid cell lines transformed by Epstein-Barr virus. Cancer Res. 2004, 64: 3361-3364. 10.1158/0008-5472.CAN-04-0079.

    PubMed  Article  Google Scholar 

  9. 9.

    King MC, Marks JH, Mandell JB: Breast and ovarian cancer risks due to inherited mutations in BRCA1 and BRCA2. Science. 2003, 302: 643-646. 10.1126/science.1088759.

    PubMed  Article  Google Scholar 

  10. 10.

    Umar A, Risinger JI, Hawk ET, Barrett JC: Testing guidelines for hereditary non-polyposis colorectal cancer. Nat Rev Cancer. 2004, 4: 153-158.

    PubMed  Article  Google Scholar 

  11. 11.

    Lerman C, Shields AE: Genetic testing for cancer susceptibility: the promise and the pitfalls. Nat Rev Cancer. 2004, 4: 235-241. 10.1038/nrc1301.

    PubMed  Article  Google Scholar 

  12. 12.

    Hanahan D, Weinberg RA: The hallmarks of cancer. Cell. 2000, 100: 57-70. 10.1016/S0092-8674(00)81683-9.

    PubMed  Article  Google Scholar 

  13. 13.

    Druker BJ, Talpaz M, Resta DJ, Peng B, Buchdunger E, Ford JM, Lydon NB, Kantarjian H, Capdeville R, Ohno-Jones S, Sawyers CL: Efficacy and safety of a specific inhibitor of the BCR-ABL tyrosine kinase in chronic myeloid leukemia. N Engl J Med. 2001, 344: 1031-1037. 10.1056/NEJM200104053441401.

    PubMed  Article  Google Scholar 

  14. 14.

    Ross JS, Fletcher JA, Bloom KJ, Linette GP, Stec J, Symmans WF, Pusztai L, Hortobagyi GN: Targeted therapy in breast cancer: the HER-2/neu gene and protein. Mol Cell Proteomics. 2004, 3: 379-398. 10.1074/mcp.R400001-MCP200.

    PubMed  Article  Google Scholar 

  15. 15.

    Adams J: The proteasome: a suitable antineoplastic target. Nat Rev Cancer. 2004, 4: 349-360. 10.1038/nrc1361.

    PubMed  Article  Google Scholar 

  16. 16.

    Lynch TJ, Bell DW, Sordella R, Gurubhagavatula S, Okimoto RA, Brannigan BW, Harris PL, Haserlat SM, Supko JG, Haluska FG, Louis DN, Christiani DC, Settleman J, Haber DA: Activating mutations in the epidermal growth factor receptor underlying responsiveness of non-small-cell lung cancer to gefitinib. N Engl J Med. 2004, 350: 2129-2139. 10.1056/NEJMoa040938.

    PubMed  Article  Google Scholar 

  17. 17.

    Lanford RE, Bigger C, Bassett S, Klimpel G: The chimpanzee model of hepatitis C virus infections. Ilar J. 2001, 42: 117-126.

    PubMed  Article  Google Scholar 

  18. 18.

    Varki A: A chimpanzee genome project is a biomedical imperative. Genome Res. 2000, 10: 1065-1070. 10.1101/gr.10.8.1065.

    PubMed  Article  Google Scholar 

  19. 19.

    Waters DJ, Sakr WA, Hayden DW, Lang CM, McKinney L, Murphy GP, Radinsky R, Ramoner R, Richardson RC, Tindall DJ: Workgroup 4: spontaneous prostate carcinoma in dogs and nonhuman primates. Prostate. 1998, 36: 64-67. 10.1002/(SICI)1097-0045(19980615)36:1<64::AID-PROS12>3.0.CO;2-G.

    PubMed  Article  Google Scholar 

  20. 20.

    Beniashvili DS: An overview of the world literature on spontaneous tumors in nonhuman primates. J Med Primatol. 1989, 18: 423-437.

    PubMed  Google Scholar 

  21. 21.

    McClure HM: Tumors in nonhuman primates: observations during a six-year period in the Yerkes primate center colony. Am J Phys Anthropol. 1973, 38: 425-429. 10.1002/ajpa.1330380243.

    PubMed  Article  Google Scholar 

  22. 22.

    Seibold HR, Wolf RH: Neoplasms and proliferative lesions in 1065 nonhuman primate necropsies. Lab Anim Sci. 1973, 23: 533-539.

    PubMed  Google Scholar 

  23. 23.

    Mikkelsen TS, Hillier LW, Eichler EE, Zody MC, Jaffe DB, Yang S, Enard W, Hellmann I, Lindblad-Toh K, Altheide TK, Archidiacono N, Bork P, Butler J, Chang JL, Cheng Z, Chinwalla AT, deJong P, Delehaunty KD, Fronick CC, Fulton LL, Gilad Y, Glusman G, Gnerre S, Graves TA, Hayakawa T, Hayden KE, Huang X, Ji H, Kent WJ, King MC, KulbokasIII EJ, Lee MK, Liu G, López-Otín C, Makova KD, Man O, Mardis ER, Mauceli E, Miner TL, Nash WE, Nelson JO, Pääbo S, Patterson NJ, Pohl CS, Pollard KS, Prüfer K, Puente XS, Reich D, Rocchi M, Rosenbloom K, Ruvolo M, Richter DJ, Schaffner SF, Smit AFA, Smith SM, Suyama M, Taylor T, Torrents D, Tuzun E, Varki A, Velasco G, Ventura M, Wallis JW, Wendl MC, Wilson RK, Lander ES, Waterston RH: Initial sequence of the chimpanzee genome and comparison with the human genome. Nature. 2005, 437: 69-87. 10.1038/nature04072.

    Article  Google Scholar 

  24. 24.

    Zhou BB, Elledge SJ: The DNA damage response: putting checkpoints in perspective. Nature. 2000, 408: 433-439. 10.1038/35044005.

    PubMed  Article  Google Scholar 

  25. 25.

    Maser RS, DePinho RA: Connecting chromosomes, crisis, and cancer. Science. 2002, 297: 565-569. 10.1126/science.297.5581.565.

    PubMed  Article  Google Scholar 

  26. 26.

    Rajagopalan H, Jallepalli PV, Rago C, Velculescu VE, Kinzler KW, Vogelstein B, Lengauer C: Inactivation of hCDC4 can cause chromosomal instability. Nature. 2004, 428: 77-81. 10.1038/nature02313.

    PubMed  Article  Google Scholar 

  27. 27.

    Wang Z, Shen D, Parsons DW, Bardelli A, Sager J, Szabo S, Ptak J, Silliman N, Peters BA, van der Heijden MS, Parmigiani G, Yan H, Wang TL, Riggins G, Powell SM, Willson JK, Markowitz S, Kinzler KW, Vogelstein B, Velculescu VE: Mutational analysis of the tyrosine phosphatome in colorectal cancers. Science. 2004, 304: 1164-1166. 10.1126/science.1096096.

    PubMed  Article  Google Scholar 

  28. 28.

    Wang Z, Cummins JM, Shen D, Cahill DP, Jallepalli PV, Wang TL, Parsons DW, Traverso G, Awad M, Silliman N, Ptak J, Szabo S, Willson JK, Markowitz SD, Goldberg ML, Karess R, Kinzler KW, Vogelstein B, Velculescu VE, Lengauer C: Three classes of genes mutated in colorectal cancers with chromosomal instability. Cancer Res. 2004, 64: 2998-3001. 10.1158/0008-5472.CAN-04-0587.

    PubMed  Article  Google Scholar 

  29. 29.

    Thomas MA, Weston B, Joseph M, Wu W, Nekrutenko A, Tonellato PJ: Evolutionary dynamics of oncogenes and tumor suppressor genes: higher intensities of purifying selection than other genes. Mol Biol Evol. 2003, 20: 964-968. 10.1093/molbev/msg110.

    PubMed  Article  Google Scholar 

  30. 30.

    Andrés AM, Soldevila M, Lao O, Volpini V, Saitou N, Jacobs HT, Hayasaka I, Calafell F, Bertranpetit J: Comparative genetics of functional trinucleotide tandem repeats in humans and apes. J Mol Evol. 2004, 59: 329-339. 10.1007/s00239-004-2628-5.

    PubMed  Article  Google Scholar 

  31. 31.

    Stenson PD, Ball EV, Mort M, Phillips AD, Shiel JA, Thomas NS, Abeysinghe S, Krawczak M, Cooper DN: Human Gene Mutation Database (HGMD): 2003 update. Hum Mutat. 2003, 21: 577-581. 10.1002/humu.10212.

    PubMed  Article  Google Scholar 

  32. 32.

    Hamosh A, Scott AF, Amberger J, Bocchini C, Valle D, McKusick VA: Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res. 2002, 30: 52-55. 10.1093/nar/30.1.52.

    PubMed  PubMed Central  Article  Google Scholar 

  33. 33.

    Xie D, Shu XO, Deng Z, Wen WQ, Creek KE, Dai Q, Gao YT, Jin F, Zheng W: Population-based, case-control study of HER2 genetic polymorphism and breast cancer risk. J Natl Cancer Inst. 2000, 92: 412-417. 10.1093/jnci/92.5.412.

    PubMed  Article  Google Scholar 

  34. 34.

    Healey CS, Dunning AM, Teare MD, Chase D, Parker L, Burn J, Chang-Claude J, Mannermaa A, Kataja V, Huntsman DG, Pharoah PD, Luben RN, Easton DF, Ponder BA: A common variant in BRCA2 is associated with both breast cancer risk and prenatal viability. Nat Genet. 2000, 26: 362-364. 10.1038/81691.

    PubMed  Article  Google Scholar 

  35. 35.

    Soussi T: The p53 tumor suppressor gene: from molecular biology to clinical investigation. Ann N Y Acad Sci. 2000, 910: 121-37; discussion 137-9.

    PubMed  Article  Google Scholar 

  36. 36.

    Beckman G, Birgander R, Sjalander A, Saha N, Holmberg PA, Kivela A, Beckman L: Is p53 polymorphism maintained by natural selection?. Hum Hered. 1994, 44: 266-270.

    PubMed  Article  Google Scholar 

  37. 37.

    HapMap C: The International HapMap Project. Nature. 2003, 426: 789-796. 10.1038/nature02168.

    Article  Google Scholar 

  38. 38.

    Dumont P, Leu JI, Della Pietra AC, George DL, Murphy M: The codon 72 polymorphic variants of p53 have markedly different apoptotic potential. Nat Genet. 2003, 33: 357-365. 10.1038/ng1093.

    PubMed  Article  Google Scholar 

  39. 39.

    Marin MC, Jost CA, Brooks LA, Irwin MS, O'Nions J, Tidy JA, James N, McGregor JM, Harwood CA, Yulug IG, Vousden KH, Allday MJ, Gusterson B, Ikawa S, Hinds PW, Crook T, Kaelin WGJ: A common polymorphism acts as an intragenic modifier of mutant p53 behaviour. Nat Genet. 2000, 25: 47-54. 10.1038/75586.

    PubMed  Article  Google Scholar 

  40. 40.

    Storey A, Thomas M, Kalita A, Harwood C, Gardiol D, Mantovani F, Breuer J, Leigh IM, Matlashewski G, Banks L: Role of a p53 polymorphism in the development of human papillomavirus-associated cancer. Nature. 1998, 393: 229-234. 10.1038/30400.

    PubMed  Article  Google Scholar 

  41. 41.

    Huttley GA, Easteal S, Southey MC, Tesoriero A, Giles GG, McCredie MR, Hopper JL, Venter DJ: Adaptive evolution of the tumour suppressor BRCA1 in humans and chimpanzees. Australian Breast Cancer Family Study. Nat Genet. 2000, 25: 410-413. 10.1038/78092.

    PubMed  Article  Google Scholar 

  42. 42.

    Fleming MA, Potter JD, Ramirez CJ, Ostrander GK, Ostrander EA: Understanding missense mutations in the BRCA1 gene: an evolutionary approach. Proc Natl Acad Sci U S A. 2003, 100: 1151-1156. 10.1073/pnas.0237285100.

    PubMed  PubMed Central  Article  Google Scholar 

  43. 43.

    Pavlicek A, Noskov VN, Kouprina N, Barrett JC, Jurka J, Larionov V: Evolution of the tumor suppressor BRCA1 locus in primates: implications for cancer predisposition. Hum Mol Genet. 2004, 13: 2737-2751. 10.1093/hmg/ddh301.

    PubMed  Article  Google Scholar 

  44. 44.

    Whitehouse C, Chambers J, Catteau A, Solomon E: Brca1 expression is regulated by a bidirectional promoter that is shared by the Nbr1 gene in mouse. Gene. 2004, 326: 87-96. 10.1016/j.gene.2003.10.008.

    PubMed  Article  Google Scholar 

  45. 45.

    Puente XS, Gutierrez-Fernandez A, Ordonez GR, Hillier LW, Lopez-Otin C: Comparative genomic analysis of human and chimpanzee proteases. Genomics. 2005, 86: 638-647. 10.1016/j.ygeno.2005.07.009.

    PubMed  Article  Google Scholar 

  46. 46.

    Angata T, Margulies EH, Green ED, Varki A: Large-scale sequencing of the CD33-related Siglec gene cluster in five mammalian species reveals rapid evolution by multiple mechanisms. Proc Natl Acad Sci U S A. 2004, 101: 13251-13256. 10.1073/pnas.0404833101.

    PubMed  PubMed Central  Article  Google Scholar 

  47. 47.

    Coffey DS: Similarities of prostate and breast cancer: Evolution, diet, and estrogens. Urology. 2001, 57: 31-38. 10.1016/S0090-4295(00)00938-9.

    PubMed  Article  Google Scholar 

  48. 48.

    Bingham S, Riboli E: Diet and cancer--the European Prospective Investigation into Cancer and Nutrition. Nat Rev Cancer. 2004, 4: 206-215. 10.1038/nrc1298.

    PubMed  Article  Google Scholar 

  49. 49.

    Poirier MC: Chemical-induced DNA damage and human cancer risk. Nat Rev Cancer. 2004, 4: 630-637. 10.1038/nrc1410.

    PubMed  Article  Google Scholar 

  50. 50.

    Finch CE, Stanford CB: Meat-adaptive genes and the evolution of slower aging in humans. Q Rev Biol. 2004, 79: 3-50. 10.1086/381662.

    PubMed  Article  Google Scholar 

  51. 51.

    Enard W, Fassbender A, Model F, Adorjan P, Pääbo S, Olek A: Differences in DNA methylation patterns between humans and chimpanzees. Curr Biol. 2004, 14: R148-9. 10.1016/S0960-9822(04)00072-7.

    PubMed  Article  Google Scholar 

  52. 52.

    Kalnina Z, Zayakin P, Silina K, Line A: Alterations of pre-mRNA splicing in cancer. Genes Chromosomes Cancer. 2005, 42: 342-357. 10.1002/gcc.20156.

    PubMed  Article  Google Scholar 

  53. 53.

    Balbín M, Fueyo A, Tester AM, Pendás AM, Pitiot AS, Astudillo A, Overall CM, Shapiro SD, López-Otín C: Loss of collagenase-2 confers increased skin tumor susceptibility to male mice. Nat Genet. 2003, 35: 252-257. 10.1038/ng1249.

    PubMed  Article  Google Scholar 

  54. 54.

    Coussens LM, Werb Z: Inflammation and cancer. Nature. 2002, 420: 860-867. 10.1038/nature01322.

    PubMed  PubMed Central  Article  Google Scholar 

  55. 55.

    Balkwill F: Cancer and the chemokine network. Nat Rev Cancer. 2004, 4: 540-550. 10.1038/nrc1388.

    PubMed  Article  Google Scholar 

  56. 56.

    Vakkila J, Lotze MT: Inflammation and necrosis promote tumour growth. Nat Rev Immunol. 2004, 4: 641-648. 10.1038/nri1415.

    PubMed  Article  Google Scholar 

  57. 57.

    King MC, Wilson AC: Evolution at two levels in humans and chimpanzees. Science. 1975, 188: 107-116.

    PubMed  Article  Google Scholar 

  58. 58.

    Cáceres M, Lachuer J, Zapala MA, Redmond JC, Kudo L, Geschwind DH, Lockhart DJ, Preuss TM, Barlow C: Elevated gene expression levels distinguish human from non-human primate brains. Proc Natl Acad Sci U S A. 2003, 100: 13030-13035. 10.1073/pnas.2135499100.

    PubMed  PubMed Central  Article  Google Scholar 

  59. 59.

    Enard W, Khaitovich P, Klose J, Zollner S, Heissig F, Giavalisco P, Nieselt-Struwe K, Muchmore E, Varki A, Ravid R, Doxiadis GM, Bontrop RE, Pääbo S: Intra- and interspecific variation in primate gene expression patterns. Science. 2002, 296: 340-343. 10.1126/science.1068996.

    PubMed  Article  Google Scholar 

  60. 60.

    Khaitovich P, Muetzel B, She X, Lachmann M, Hellmann I, Dietzsch J, Steigele S, Do HH, Weiss G, Enard W, Heissig F, Arendt T, Nieselt-Struwe K, Eichler EE, Pääbo S: Regional patterns of gene expression in human and chimpanzee brains. Genome Res. 2004, 14: 1462-1473. 10.1101/gr.2538704.

    PubMed  PubMed Central  Article  Google Scholar 

  61. 61.

    Bond GL, Hu W, Bond EE, Robins H, Lutzker SG, Arva NC, Bargonetti J, Bartel F, Taubert H, Wuerl P, Onel K, Yip L, Hwang SJ, Strong LC, Lozano G, Levine AJ: A single nucleotide polymorphism in the MDM2 promoter attenuates the p53 tumor suppressor pathway and accelerates tumor formation in humans. Cell. 2004, 119: 591-602. 10.1016/j.cell.2004.11.022.

    PubMed  Article  Google Scholar 

  62. 62.

    Matlashewski GJ, Tuck S, Pim D, Lamb P, Schneider J, Crawford LV: Primary structure polymorphism at amino acid residue 72 of human p53. Mol Cell Biol. 1987, 7: 961-963.

    PubMed  PubMed Central  Article  Google Scholar 

  63. 63.

    Koushik A, Platt RW, Franco EL: p53 codon 72 polymorphism and cervical neoplasia: a meta-analysis review. Cancer Epidemiol Biomarkers Prev. 2004, 13: 11-22.

    PubMed  Article  Google Scholar 

  64. 64.

    Di Tommaso A, Soler C, Roos C, Kitzis A, Ladeveze V: The ink4a/arf locus evolution in primates: characterization of three ARF sequences. DNA Cell Biol. 2004, 23: 167-173. 10.1089/104454904322964760.

    PubMed  Article  Google Scholar 

  65. 65.

    Zhang J, Rosenberg HF: Diversifying selection of the tumor-growth promoter angiogenin in primate evolution. Mol Biol Evol. 2002, 19: 438-445.

    PubMed  Article  Google Scholar 

  66. 66.

    Kent WJ: BLAT--the BLAST-like alignment tool. Genome Res. 2002, 12: 656-664. 10.1101/gr.229202. Article published online before March 2002.

    PubMed  PubMed Central  Article  Google Scholar 

  67. 67.

    Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25: 3389-3402. 10.1093/nar/25.17.3389.

    PubMed  PubMed Central  Article  Google Scholar 

Download references


This work has been supported by grants from Comisión Interministerial de Ciencia y Tecnología-Spain; Fundación la Caixa, European Union (Cancer Degradome-FP6) and Ramón y Cajal Program (XSP). The Instituto Universitario de Oncología is supported by Obra Social Cajastur-Asturias and Red de Centros de Cancer-Instituto Carlos III, Spain. We thank Dr. W. Enard for chimpanzee DNA samples, and S. Alvarez and P. Martín Bringas for excellent technical assistance. Some samples were provided by the Barcelona Zoo under the agreement of the Primate DNA Bank with the Pompeu Fabra University.

Author information



Corresponding author

Correspondence to Xose S Puente.

Additional information

Authors' contributions

XSP conceived this project, participated in the design and bioinformatic analysis, and drafted the manuscript. CLO also conceived this project and participated in the elaboration of the gene census. GV and AGF performed the bioinformatic analysis, gene comparison and PCR amplification and sequencing. MCK performed the analysis of the BRCA1 locus. JB provided DNA samples from different primates and helped with the analysis of p53. All authors read and approved the final manuscript.

Electronic supplementary material


Additional data file 1: Human and chimpanzee cancer genes comparison. This file contains the complete list of human and chimpanzee cancer genes, and comparison at the nucleotide and protein level. (XLS 2 MB)


Additional data file 2: Chimpanzee cancer gene products identical to human orthologs. List of human genes identical to chimpanzee orthologs. Gene name, accession number, chromosomal location, protein length and functional category are indicated. (DOC 140 KB)


Additional data file 3: Frameshifts in chimpanzee cancer genes. List of chimpanzee cancer genes for which frameshifts or premature stop codons are present in the ARACHNE assembly, as well as the correct sequence obtained by direct sequencing or comparison with the PCAP assembly. (XLS 33 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Puente, X.S., Velasco, G., Gutiérrez-Fernández, A. et al. Comparative analysis of cancer genes in the human and chimpanzee genomes. BMC Genomics 7, 15 (2006).

Download citation


  • Additional Data File
  • Cancer Susceptibility
  • Cancer Gene
  • Premature Stop Codon
  • Bidirectional Promoter