Interspecies hybridization on DNA resequencing microarrays: efficiency of sequence recovery and accuracy of SNP detection in human, ape, and codfish mitochondrial DNA genomes sequenced on a human-specific MitoChip
© Flynn and Carr; licensee BioMed Central Ltd. 2007
Received: 24 January 2007
Accepted: 25 September 2007
Published: 25 September 2007
Iterative DNA "resequencing" on oligonucleotide microarrays offers a high-throughput method to measure intraspecific biodiversity, one that is especially suited to SNP-dense gene regions such as vertebrate mitochondrial (mtDNA) genomes. However, costs of single-species design and microarray fabrication are prohibitive. A cost-effective, multi-species strategy is to hybridize experimental DNAs from diverse species to a common microarray that is tiled with oligonucleotide sets from multiple, homologous reference genomes. Such a strategy requires that cross-hybridization between the experimental DNAs and reference oligos from the different species not interfere with the accurate recovery of species-specific data. To determine the pattern and limits of such interspecific hybridization, we compared the efficiency of sequence recovery and accuracy of SNP identification by a 15,452-base human-specific microarray challenged with human, chimpanzee, gorilla, and codfish mtDNA genomes.
In the human genome, 99.67% of the sequence was recovered with 100.0% accuracy. Accuracy of SNP identification declines log-linearly with sequence divergence from the reference, from 0.067 to 0.247 errors per SNP in the chimpanzee and gorilla genomes, respectively. Efficiency of sequence recovery declines with the increase of the number of interspecific SNPs in the 25b interval tiled by the reference oligonucleotides. In the gorilla genome, which differs from the human reference by 10%, and in which 46% of these 25b regions contain 3 or more SNP differences from the reference, only 88% of the sequence is recoverable. In the codfish genome, which differs from the reference by > 30%, less than 4% of the sequence is recoverable, in short islands ≥ 12b that are conserved between primates and fish.
Experimental DNAs bind inefficiently to homologous reference oligonucleotide sets on a re-sequencing microarray when their sequences differ by more than a few percent. The data suggest that interspecific cross-hybridization will not interfere with the accurate recovery of species-specific data from multispecies microarrays, provided that the species' DNA sequences differ by > 20% (mean of 5b differences per 25b oligo). Recovery of DNA sequence data from multiple, distantly-related species on a single multiplex gene chip should be a practical, highly-parallel method for investigating genomic biodiversity.
The development of DNA microarrays or "chips" has greatly increased the rate at which genomic data can be gathered. This highly-parallel technology has enabled thousands of genes or tens of thousands of single nucleotide polymorphisms (SNPs) to be recovered in a single experiment. Most such arrays have been designed to assay variation within single species, such as Drosophila melanogaster or humans, either among tissues (e.g., cancerous versus non-cancerous cell lines [3, 4]) or among individuals that differ in some biomedically significant trait (e.g., obese versus non-obese patients). But for evolutionary biologists, it is of greater interest to know how variation among species will be accommodated on species-specific microarrays. Consider two complementary questions. Can a microarray designed for the genome of one species recover accurate information on the genome of a closely-related species? Can a microarray that incorporates assays for homologous but distantly-related genes of different species successfully discriminate DNA from those species?
The answers to these questions are of practical and theoretical interest. Evolutionary biologists are often interested in genetic relationships within and among more or less closely-related taxa of non-model organisms. Current costs of novel microarray design and execution of species-specific microarray experiments are prohibitive. Population and taxonomic studies would be more practical, if a common microarray design were useful over a broader range of taxa, say species within genera or closely-related families. Alternatively, if any given design is specific to a limited range of taxa, multiplex studies of more distantly-related taxa on the same microarray may be feasible. Here, we address the former question, and its implications for the latter, by measuring the ability of a species-specific DNA re-sequencing microarray to recover information from experimental DNAs over a wide range of sequence divergences.
DNA microarrays are commonly used to measure differential gene expression in cDNA libraries synthesized from mRNA transcriptomes, so as to determine which genes are active, where, and at what levels across experimental treatments. Variant Detector Arrays (VDAs) measure not gene expression, but rather variation in single-nucleotide polymorphisms (SNPs) among samples of interest. VDAs rely on the ability of a ssDNA in the experimental sample to recognize and bind to its perfect oligonucleotide complement. A refinement of VDA microarrays is to evaluate, not just known SNPs, but all potential SNPs within a particular gene region. As developed by Affymetrix for their "GeneChip" protocols, a reference DNA sequence is represented on the microarray as a series of overlapping 25-base oligonucleotides ("oligos"), one for each position in the sequence. For each oligo, three additional variant oligos are included, each of which varies the central (13th) base. All possible SNP variants of a reference sequence of length n are thus represented on the microarray by a set of 4 × n oligonucleotides. An experimental sequence with any particular SNP variant in this quartet will hybridize with greatest fidelity to its exact complement, as indicated by the relative intensities of each of the probes bound at that position [10, 11]. This procedure has been dubbed "resequencing," since it re-reads multiple homologous sequences in comparison with the reference sequence.
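The quartet tiling described above can be sketched in a few lines. This is a minimal illustration, not the Affymetrix design procedure: the function name `tile_quartets`, the short reference string, and the choice to tile only positions that have a full 12-base flank on each side are assumptions made for the example, and the probes are written in the same sense as the reference rather than as complements.

```python
BASES = "ACGT"

def tile_quartets(reference, probe_len=25):
    """Map each tiled position (0-based) to its four probes.

    Each probe is the probe_len-base reference window centred on the
    position, with the central base set to A, C, G, or T in turn.
    Only positions with a full flank on both sides are tiled (assumption).
    """
    half = probe_len // 2
    quartets = {}
    for center in range(half, len(reference) - half):
        window = reference[center - half:center + half + 1]
        quartets[center] = [window[:half] + b + window[half + 1:] for b in BASES]
    return quartets

# Hypothetical 32-base reference, for illustration only.
reference = "ACGTTGCAAGCTTAGGCTAACCGTTAGCATCG"
quartets = tile_quartets(reference)
print(len(quartets), "tiled positions,", 4 * len(quartets), "probes")
```

Exactly one probe in each quartet reproduces the reference window; the other three differ from it only at the central (13th) base.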
Over the past 25 years, studies of mtDNA have been extraordinarily successful in clarifying evolutionary relationships within and among species, due to a number of useful properties, including maternal inheritance, high rate of sequence evolution, and lack of recombination. Gene order is broadly conserved across diverse vertebrate taxa. Recent comparative studies of multiple complete mtDNA genomes, both within [12, 13] and among species [14, 15], have demonstrated the power of genomics to investigate phenomena of intra- and interspecific population biology and evolution based on well-resolved, highly-corroborated gene trees. The mtDNA genome has also been implicated in a number of human biomedical conditions [16, 17].
One of the first applications of mtDNA to the study of evolution was an evaluation of the tempo and mode of molecular evolution of higher primates. Studies of mtDNA and other genetic macromolecules have now established that the closest relatives of humans (Homo sapiens) are chimpanzees (Pan spp., including the Common Chimpanzee (P. troglodytes) and the Pygmy Chimp or Bonobo (P. paniscus)), with which we share a common ancestor ~5 MYBP. The next closest relatives of chimps and humans are gorillas (Gorilla gorilla), from which the chimp/human lineage diverged perhaps 7 MYBP. Levels of mtDNA genome diversity vary among hominoid primate species, and are apparently lowest in Homo [21, 12], due in part to our quite recent emergence "Out of Africa." Common Chimpanzees have a more polymorphic mitochondrial genome than humans, and variability within the Mountain Gorilla is as high as that between the two Pan species. The greater diversity of apes in comparison with humans may be due to their historically more fragmented populations, differences in male and female migration, or directional selection. There is now extensive interest in comparing the genetic material of humans and their closest relatives. The nuclear genome sequences of chimps and humans are more than 98% similar, and the focus of investigation is those differences that contribute to the uniqueness of the human species.
We investigated the efficiency and accuracy of microarray resequencing where experimental and microarray reference sequences are from different species, and the influence of the degree of sequence divergence on that performance. We used a human-specific mitochondrial DNA array to resequence the homologous genomes of another human, as well as our two closest relatives, chimpanzee and gorilla, and a distant relative, Atlantic Cod (Gadus morhua). We compared these results to those obtained by conventional dideoxy sequencing. These experiments explore the limits of interspecies in silico hybridization, and in so doing contribute to the design and use of resequencing arrays for the study of intra- and interspecific population genomic evolution.
Dideoxy reference sequences
Table 1. Percentage of 25-bp regions in the chimpanzee and gorilla mtDNA genomes that contain a given number of SNPs with respect to the tiled human mtDNA reference sequence on the MitoChip microarray
For the human comparison, we used an individual whose mtDNA genome sequence was known to differ from the tiled reference sequence by 32 SNPs (0.21% sequence divergence) in the region sequenced here.
Comparison with microarray sequencing
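Table 2. Efficiency, accuracy, and error rates of microarray resequencing
Correct calls at SNP sites: High + Low confidence
Missed SNP + 'N' at SNP site
Correct calls at non-SNP sites: High + Low confidence
Miscalled SNP + 'N' at non-SNP site
Totals: correct calls; errors; low confidence N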
The content and arrangement of the cells in these tables therefore differ from conventional 2 × 2 contingency tables, so as to emphasize the computation of correct, incorrect, and 'N' calls. The inclusion of an 'N' category also makes a conventional ROC analysis problematic. It is important to appreciate that accurate identification of SNP sites ('true positives') is a more important criterion of success than the total number of correct calls, including non-SNP sites ('true negatives'), because the latter do not contribute informative data to phylogenetic analysis. For example, given 1000 sites with 10 SNPs, the correct identification of all 10 SNPs along with 890 invariant sites and 100 'N's is a more desirable outcome than correct identification of 5 SNPs along with 5 SNP errors, 985 invariant sites, and 5 'N's, even though the conventional accuracy rates are 90% and 99%, respectively.
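The 1000-site example can be checked with a short calculation. The sketch below is illustrative only; the function name `summarize` and its argument names are assumptions, and "conventional accuracy" is taken to mean correct calls divided by all sites.

```python
def summarize(snps_correct, snp_errors, invariant_correct, n_calls, total_snps):
    """Return conventional accuracy and the fraction of SNPs recovered."""
    total_sites = snps_correct + snp_errors + invariant_correct + n_calls
    return {
        "total_sites": total_sites,
        "conventional_accuracy": (snps_correct + invariant_correct) / total_sites,
        "snp_recovery": snps_correct / total_snps,
    }

# Outcome A: all 10 SNPs correct, 890 invariant sites correct, 100 'N's.
outcome_a = summarize(10, 0, 890, 100, total_snps=10)
# Outcome B: 5 SNPs correct, 5 SNP errors, 985 invariant sites correct, 5 'N's.
outcome_b = summarize(5, 5, 985, 5, total_snps=10)

for name, result in (("A", outcome_a), ("B", outcome_b)):
    print(name, f"accuracy={result['conventional_accuracy']:.0%}",
          f"SNP recovery={result['snp_recovery']:.0%}")
```

Outcome B scores higher on conventional accuracy but recovers only half of the phylogenetically informative sites, which is the point of the comparison.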
Efficiency and accuracy of microarray sequencing
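Table 3. Efficiency, accuracy, and error of microarray resequencing of the human mtDNA genome (see Table 2 for definitions)
Correct calls at SNP sites (high + low confidence): 24 + 8 = 32 (0.16 + 0.04 = 0.20%)
Errors + N: 0 + 0 = 0
Correct calls at non-SNP sites (high + low confidence): 14705 + 714 = 15419 (95.17 + 4.62 = 99.79%)
Errors + N: 0 + 1 (0 + < 0.01 = < 0.01%)
Totals: 14729 + 722 = 15451 correct (> 99.99%); 1 N (< 0.01%)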
Table 4. (a) Efficiency, accuracy, and error of microarray resequencing and (b) SNP density in the intervals ± 12 bp surrounding correct and incorrect calls of SNP and constant sites in chimpanzee mtDNA (see Table 2 for definitions)
Correct calls at SNP sites (high + low confidence): 498 + 432 = 930 (3.23 + 2.80 = 6.03%)
Errors + N: 14 + 1940 = 1954 (0.09 + 12.56 = 12.65%)
Correct calls at non-SNP sites (high + low confidence): 7290 + 4925 = 12215 (47.18% + 31.87%)
Errors + N: 17 + 336 = 353 (0.11 + 2.17 = 2.28%)
Totals: 7788 + 5357 = 13145 correct (85.07%); 31 errors (0.20%); 2276 N (14.73%)
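Table 5. (a) Efficiency, accuracy, and error of microarray resequencing and (b) SNP density in the intervals ± 12 bp surrounding correct and incorrect calls of SNP and constant sites in gorilla mtDNA (see Table 2 for definitions)
(a) Calls
Correct calls at SNP sites (high + low confidence): 744 + 331 = 1075 (4.81 + 2.14 = 6.96%)
Errors + N: 89 + 1170 = 1259 (0.58 + 7.57 = 8.15%)
Correct calls at non-SNP sites (high + low confidence): 9553 + 3040 = 12593 (61.82% + 19.67%)
Errors + N: 219 + 306 = 525 (1.42 + 1.98 = 3.40%)
Totals: 10297 + 3371 = 13668 correct (88.45%); 308 errors (1.99%); 1476 N (9.55%)
(b) SNP density (%), rows in the order of (a)
Correct calls (high confidence, low confidence): 8.64, 17.16
Errors, N: 13.66, 17.63
Correct calls (high confidence, low confidence): 7.73, 13.47
Errors, N: 15.23, 17.11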
Without reference to the known dideoxy sequences, as would be the case for de novo microarray sequencing of an unknown genome, low-confidence calls cannot be assigned a priori as either correct or incorrect. On that basis, in the chimpanzee, 50.40% of calls were correct at high confidence and 49.40% were "N"s. There were 0.20% errors as before, and only 38.81% of SNPs were identified with high confidence. In the gorilla, 66.64% of calls were correct at high confidence and 31.37% were "N"s. There were 1.99% errors, and only 46.50% of SNPs were identified with high confidence.
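The de novo percentages follow directly from the table totals once low-confidence correct calls are pooled with the 'N's. The sketch below recomputes them from the counts in the chimpanzee and gorilla tables; the dictionary layout and the function name `de_novo_rates` are assumptions made for illustration.

```python
TOTAL_POSITIONS = 15452  # tiled positions on the MitoChip

# Totals taken from the chimpanzee and gorilla tables above.
counts = {
    "chimpanzee": {"high_correct": 7788, "low_correct": 5357, "errors": 31, "n": 2276},
    "gorilla": {"high_correct": 10297, "low_correct": 3371, "errors": 308, "n": 1476},
}

def de_novo_rates(c, total=TOTAL_POSITIONS):
    """Percentages when low-confidence calls cannot be verified and count as 'N'."""
    return {
        "correct_high": 100 * c["high_correct"] / total,
        "uncalled": 100 * (c["low_correct"] + c["n"]) / total,
        "errors": 100 * c["errors"] / total,
    }

rates = {species: de_novo_rates(c) for species, c in counts.items()}
for species, r in rates.items():
    print(species, {k: round(v, 2) for k, v in r.items()})
```

For each species the three percentages sum to 100, which gives a quick consistency check on the reported values.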
Accuracy and error rate of microarray resequencing for 6299 bases called with high-confidence in both chimpanzee and gorilla
Chimpanzee: 6272 (99.59%) correct; 26 (0.41%) errors
Gorilla: 6166 (97.89%) correct; 133 (2.11%) errors
Effect of SNP density on efficiency, accuracy, and probe intensity
In the chimpanzee experiment, correct high-confidence calls were made at a mean SNP density of 4.72% (i.e., 1.18 interspecific SNPs per 25 bp). Incorrect, high-confidence calls occurred at SNP densities of 6.44 to 9.14%. Correct, low-confidence calls occurred in regions with approximately 11% SNP density, above which low-confidence, incorrect calls occurred (Table 4). SNP densities in the gorilla followed the same general trends: correct high-confidence calls occurred at SNP densities < 8.64%. Incorrect, high-confidence calls occurred at SNP densities of 13.66 to 15.23%. Low-confidence, correct calls occurred in regions with 13.47 to 17.16% SNP density, above which low-confidence, incorrect calls occurred (Table 5).
Resequencing of Atlantic Cod mtDNA
Efficiency and accuracy of interspecies resequencing
Microarray resequencing of human mtDNA sequences that differ by << 1% from the tiled human reference approaches 100% efficiency and accuracy. Microarray resequencing of chimpanzee and gorilla DNA sequences, which differ by 8 and 10% from the tiled human sequence, recovers 85% and 88% of those sequences, respectively, with < 2% error. Considered without respect to the known reference sequences, as would be the case if these were new individuals from the same species sequenced for the first time, efficiency of high-confidence sequence recovery falls to 50% in chimpanzee and to 67% in gorilla. Within this subset, overall error rates remain < 2%; however, accuracy of SNP identification falls from > 98% in chimpanzee to < 80% in gorilla.
Thus, microarray resequencing of experimental DNA genomes that diverge on average less than ~10% from the reference is able to recover a large part of the target sequence correctly. However, many of these calls are made at low confidence. The error rate is relatively low, but errors are more common at interspecies SNP sites than elsewhere, and the error rate increases sharply with the small added sequence divergence from chimpanzee to gorilla. Errors occur more or less uniformly over a wide range of probe intensities and confidence values. An increase in the stringency of the confidence criterion beyond a certain point does not increase accuracy, and only excludes more of the data (Figure 2).
Influence of SNP density on efficiency and accuracy
In a microarray experiment, the presence of a SNP in an experimental sequence affects not only its binding to the oligo quartet tiling the corresponding position, but also its binding to the 24 additional quartets in the 12 bp on either side of the SNP position. Among these 100 oligos, only one will match the target perfectly, 27 will mismatch at one position (three because of the chip design, 24 because of the SNP), and 72 will mismatch at two positions. Thus, reduced probe binding strength is expected on either side of a SNP, even at invariant sites. We typically observed Ns within a few bp of isolated SNPs. In human mtDNA, SNPs are typically spaced at 100s of bp with respect to the tiled reference, and are frequently associated with runs of lower-confidence Ns. Regions in the ape genomes where SNPs are spaced > 25 bp apart are also associated with Ns, and are typically called correctly and at high confidence. Runs of Ns are associated with interspecific SNPs among higher primates resequenced on a human nuclear BRCA-specific microarray, where the SNP density is much lower than in mtDNA.
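The 1 / 27 / 72 breakdown can be verified by brute force. The sketch below tiles a hypothetical 49-base reference, introduces a single SNP at its centre, and counts mismatches between each of the 100 probes and the corresponding window of the experimental sequence. The sequences and the function name `mismatch_spectrum` are assumptions for illustration, and probes are compared in the same sense as the target rather than as complements.

```python
from collections import Counter

BASES = "ACGT"

def mismatch_spectrum(reference, target, snp_pos, probe_len=25):
    """Count probes by number of mismatches to the target, over the
    probe_len quartets whose windows include snp_pos."""
    half = probe_len // 2
    spectrum = Counter()
    for center in range(snp_pos - half, snp_pos + half + 1):
        ref_window = reference[center - half:center + half + 1]
        tgt_window = target[center - half:center + half + 1]
        for base in BASES:
            probe = ref_window[:half] + base + ref_window[half + 1:]
            mismatches = sum(p != t for p, t in zip(probe, tgt_window))
            spectrum[mismatches] += 1
    return spectrum

# Hypothetical 49-base reference; the experimental sequence carries one SNP (A -> G).
reference = "ACGT" * 12 + "A"
snp_pos = 24
target = reference[:snp_pos] + "G" + reference[snp_pos + 1:]

spectrum = mismatch_spectrum(reference, target, snp_pos)
print(dict(spectrum))
```

The single perfect match is the variant probe at the SNP position itself; every other probe in the 25 affected quartets carries at least one mismatch.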
Where two SNPs occur within the 25 bp region covered by a particular SNP-specific oligo quartet, probe binding is affected at intermediate invariant positions. The pattern is specific and predictable. Consider two SNPs at an interval of 25 b, where one oligo quartet tiles one invariant position exactly 12 bp from either SNP. All four oligos in this set will have mismatches at their terminal (1st and 25th) positions, and three of four have an internal mismatch at the central (13th) position. Binding and probe intensity will be severely reduced by these two or three mismatches, in comparison to the two adjacent positions, where only one or two mismatches occur. Where SNPs are spaced 13 < n ≤ 25 b apart, the interference will extend to [(2)(25 - n) + 1] oligo quartets tiling the intermediate positions ("Flynn's Rule"). In the ape data, we typically observe low probe intensity at all positions between two SNPs that occur within 25 b. Precise patterns for any given oligo will be influenced by other factors, such as [G+C] content and distribution.
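As a small worked example, the rule stated above, [(2)(25 - n) + 1] quartets for SNPs spaced 13 < n ≤ 25 b apart, can be written as a function. The function name `affected_quartets` is an assumption for illustration; the formula and its range of validity are taken from the text as given.

```python
def affected_quartets(n):
    """Number of oligo quartets affected between two SNPs spaced n bases apart,
    using the rule 2 * (25 - n) + 1 for 13 < n <= 25."""
    if not 13 < n <= 25:
        raise ValueError("the rule is stated only for 13 < n <= 25")
    return 2 * (25 - n) + 1

for spacing in (25, 20, 14):
    print(spacing, "b apart ->", affected_quartets(spacing), "quartets")
```

At the maximum spacing of 25 b the rule gives a single quartet, which corresponds to the one intermediate position equidistant from both SNPs described in the text.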
Multiple SNPs in the 25 b region tiled by the oligo quartet further destabilize binding, and extensive tracts of Ns and miscalls are common. The data in Tables 4 & 5 suggest some general guidelines, when SNP densities are expressed as expected numbers of SNP differences between experimental DNA and tiled oligo target, as in Table 1. High-confidence, accurate calls occur where the experimental sequence differs from the tiled array by 1 to 2 SNPs per 25 b oligo, as is the case for about 67% and 54% of chimpanzee and gorilla genomes, respectively. At differences of 3 to 4 SNPs/oligo, high-confidence incorrect calls are common, which result in positive misidentification of SNPs; 28% and 34% of the chimpanzee and gorilla genomes fall in this category. At these densities, there is still sufficient sequence similarity for some probe-target hybridization to occur, although not always with accurate results. Where there are 5 SNPs/oligo or more (that is, when the experimental DNA differs from the reference oligos by an average of 20%), the decreased homology prevents binding with sufficient fidelity to discriminate accurately among SNP-specific oligos and generate high-confidence calls. This matches the prediction from Figure 6. The remaining 6% and 12% of the two ape genomes are at least as divergent as this from human. In cod, where an average of > 9 SNPs/oligo are expected, more than 95% of the cod mtDNA genome binds weakly if at all to the human-specific microarray.
Multi-species resequencing: implications for the "ArkChip"
For the primate genomicist, the optimum result of these experiments would have been efficient and accurate interspecific probe-target annealing with performance identical to that obtained within species. However, the goal of the present experiments was not to recover the chimpanzee, gorilla, or cod sequences, but rather to ascertain the limits of specificity of the human microarray. For the non-primate genomicist, the desirable result would be a complete failure of heterologous DNA to anneal to the human microarray. In the case of fish mtDNA, this is very nearly achieved (Figure 4). This "failure" indicates that it should be possible to tile both mammal and fish mtDNA genomes on the same microarray, apply a mixed pool of both species' DNAs to the chip, obtain species-specific annealing, and generate efficient and accurate sequences of both, simultaneously.
This is the essential idea behind the ArkChip. Using the new generation of microarrays that accommodates > 120 Kbp of reference sequence, we have designed a multispecies tiling that includes the complete forward and reverse sequences of the mtDNA genomes (including Control Regions) of three mammal species in different orders, three ray-finned fish species in different subclasses, and one bird species. Minimum interspecific divergence for these comparisons is > 23%. Experiments show that species genomes in two- and four-taxon combinations, from different orders and classes, are successfully and accurately sequenced (A. T. Duggan and S. M. Carr, work in progress).
Although the gorilla was only ~2% more divergent from human than the chimpanzee, the corresponding 3- to 4-fold increase in SNP identification errors indicates that this degree of divergence is at or beyond the useful limit of interspecies microarray sequencing. The log-linear trend line suggests extinction of usable probe annealing at 15 to 20% divergence. It will be useful to define this empirically. For this purpose, our next closest primate relatives are orang-utans (Pongo: ~14% mtDNA sequence difference) and gibbons (Hylobates: ~17% difference), followed by Old World Monkeys (Cercopithecidae, inc. Papio: ~25% difference). At the other end of the scale, mtDNA from our ancient cousins, such as Homo neanderthalensis, might provide information as to how microarrays perform at less than 8% divergence. Alternatively, given the multispecies ArkChip, the three species of Atlantic wolffish (Anarhichas), caribou and reindeer (New and Old World Rangifer, respectively), and various cod species (Gadus) all provide pairs that are only a few percent divergent.
Sources of DNA
Primate DNA was obtained from the roots of ten hairs plucked from a live chimpanzee (Pan troglodytes) at the Jardin Zoologique du Quebec, and from frozen heart tissue from a Western Lowland Gorilla (Gorilla gorilla) in the collection of the Royal Ontario Museum. DNA extractions were done with the QIAGEN QIAamp DNA Mini Kit Tissue Protocol. DNA from an Atlantic Cod (Gadus morhua) was purified by similar means.
Sequence and positions of primers used to amplify and/or sequence the mtDNA genomes of chimpanzee and gorilla
We used a commercial MitoChip microarray (Affymetrix) to resequence 15,452 bp of the coding portion of the human mitochondrial DNA (mtDNA) genome, excluding the control region (CR) and including two rDNA, 22 tDNA, and 13 protein-coding genes. These features are tiled as both the heavy and light strands (designated "sense" and "antisense"), such that every base is assayed twice. To fill up the balance of the available 30 Kb feature array, the MitoChip includes duplicate tiling of this portion of the mtDNA genome, without the 12S and 16S rDNA genes. For these 12,805 positions, there are thus a total of four replicates.
Properties of PCR amplicons used for microarray resequencing of chimpanzee and gorilla: size, required mass and observed concentration, and volume added to pool
Experimental results from chimpanzee, gorilla, and cod were assembled with those from a human. Output from each array experiment consisted of eight sets of probe intensity values, corresponding to the A, C, G, and T oligonucleotide variants of the sense and antisense strands at each of 15,452 tiled positions. Elaborate scoring algorithms based on likelihood methods have been developed. We applied a simpler arithmetic algorithm as follows. Sense and antisense probe intensities were summed to give four base-specific intensity scores for each position, and the highest and second-highest scores for each position were identified, along with the sum of intensity scores across all four bases. The difference between the two highest intensities was divided by the sum, which yielded a value defined as the differential signal-to-noise ratio (dS/N). This value expresses the confidence placed on each call. The approach is similar to those used previously, except that it includes standardization for total probe intensity.
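The dS/N calculation described above can be sketched as follows. This is an illustrative reading of the arithmetic, not the authors' code: the function name `call_base`, the dictionary input format, the example intensities, and the handling of a zero total are all assumptions, and no confidence threshold is applied because none is stated here.

```python
BASES = "ACGT"

def call_base(sense, antisense):
    """Return (called_base, dS/N) for one tiled position.

    sense and antisense map each base to its probe intensity. Intensities
    are summed per base; dS/N is the gap between the two highest summed
    scores divided by the sum over all four bases.
    """
    combined = {b: sense[b] + antisense[b] for b in BASES}
    ranked = sorted(BASES, key=lambda b: combined[b], reverse=True)
    total = sum(combined.values())
    if total == 0:
        # Assumption: no signal at all is reported as an uncalled base.
        return "N", 0.0
    ds_n = (combined[ranked[0]] - combined[ranked[1]]) / total
    return ranked[0], ds_n

# Hypothetical intensities for a single position.
sense = {"A": 900, "C": 50, "G": 30, "T": 20}
antisense = {"A": 700, "C": 60, "G": 40, "T": 200}
base, score = call_base(sense, antisense)
print(base, round(score, 3))
```

Because the difference is divided by the total intensity, the score lies between 0 and 1 and is comparable between bright and dim positions.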
Comparison of the probe intensity values and dS/N scores for the partially duplicated region of the genome on the microarray shows them in all cases to be virtually identical to the main series [results not shown].
Distribution of SNP density between the ape dideoxy sequences and the tiled human sequence was calculated as a sliding window of 25 bp, starting at Position 13 of the tiled sequence. The numbers of interspecific SNPs versus intraspecific miscalls (positions where the microarray call differed from the dideoxy sequence) within each of the two primate sequences were compared in a sliding window of 25 bp extending 12b on either side of each position, starting at Position 13 of the tiled reference. The SNP versus mismatch densities were averaged over all calls in each of the eight classes of Table 2.
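The sliding-window SNP count can be sketched as below. This is a minimal illustration assuming two pre-aligned sequences of equal length without gaps; the function name `snp_counts` and the toy sequences are assumptions. Positions are 0-based here, so the first window centre (index 12) corresponds to Position 13 of the tiled sequence.

```python
def snp_counts(reference, experimental, window=25):
    """Number of differences in the window centred on each position.

    Returns {center_index: count} for every 0-based position that has a
    full window (window // 2 bases on either side).
    """
    if len(reference) != len(experimental):
        raise ValueError("sequences must be aligned and of equal length")
    half = window // 2
    diffs = [int(r != e) for r, e in zip(reference, experimental)]
    return {
        center: sum(diffs[center - half:center + half + 1])
        for center in range(half, len(diffs) - half)
    }

# Toy 40-base example with differences at 0-based positions 3, 20, and 30.
reference = "A" * 40
bases = list(reference)
for pos in (3, 20, 30):
    bases[pos] = "G"
experimental = "".join(bases)

density = snp_counts(reference, experimental)
print({c: f"{100 * n / 25:.0f}%" for c, n in density.items() if c in (12, 16, 18)})
```

Dividing each count by the window length gives the percentage SNP density used in the text; averaging those values over the positions in each call class would give the per-class densities.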
To compare intra- and interspecific differentiation among the mtDNA genomes, we performed a phylogenetic analysis with the program PAUP [36]. We aligned the 15,452 b of the three primate dideoxy sequences in this paper, together with the homologous portions of five additional sequences from GenBank listed in Figure 5. We performed a branch-and-bound search, with all positions weighted equally. The tree was rooted with Gorilla as the outgroup to Pan and Homo.
We thank Siobhan Coady and Lynette Peddle at Newfound Genomics for providing respectively access to and expertise on the resequencing technology, Angela Pope for advice on experimental conditions and data from her MSc thesis in advance of publication, staff at the Royal Ontario Museum, the Toronto Zoo, and the Jardin Zoologique du Quebec for providing the gorilla and chimpanzee samples, Kim Johnstone for assistance with the cod experiments, Ana Duggan for the results of preliminary analysis of the multispecies ArkChip data, and H. Dawn Marshall for advice on the experiments and comments on the MS. We thank Justyna Ciszewska-Carr and three anonymous reviewers for comments on an earlier draft. This MS is based on the BSc (hons) thesis of SMCF, who thanks her brother Matthew for his assistance, as well as her parents Patrick and Molly and her friends for their support. These experiments were supported by the Canadian Department of Fisheries and Oceans, as part of a Grants and Contribution Agreement to SMC.
- Wang DG, Fan JB, Siao CJ, Berno A, Young P, Sapolsky R, Ghandour G, Perkins N, Winchester E, Spencer J, Kruglyak L, Stein L, Hsie L, Topaloglou T, Hubbell E, Robinson E, Mittmann M, Morris MS, Shen N, Kilburn D, Rioux J, Nusbaum C, Rozen S, Hudson TJ, Lipshutz R, Chee M, Lander ES: Large-scale identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome. Science. 1998, 280: 1077-1082. 10.1126/science.280.5366.1077.
- Arbeitman MN, Furlong EE, Imam F, Johnson E, Null BH, Baker BS, Krasnow MA, Scott MP, Davis RW, White KP: Gene expression during the life cycle of Drosophila melanogaster. Science. 2002, 297: 2270-2275. 10.1126/science.1072152.
- Ross DT, Scherf U, Eisen MB, Perou CM, Rees C, Spellman P, Iyer V, Jeffrey SS, Van de Rijn M, Waltham M, Pergamenschikov A, Lee JC, Lashkari D, Shalon D, Myers TG, Weinstein JN, Botstein D, Brown PO: Systematic variation in gene expression patterns in human cancer cell lines. Nature Genet. 2000, 24: 227-235. 10.1038/73432.
- Weigelt B, Glas AM, Wessels LF, Witteveen AT, Peterse JL, van't Veer LJ: Gene expression profiles of primary breast tumors maintained in distant metastases. PNAS. 2003, 100: 15901-15905. 10.1073/pnas.2634067100.
- Tiffin N, Adie E, Turner F, Brunner HG, van Driel MA, Oti M, Lopez-Bigas N, Ouzounis C, Perez-Iratxeta C, Andrade-Navarro MA, Adeyemo A, Patti ME, Semple CA, Hide W: Computational disease gene identification: a concert of methods prioritizes type 2 diabetes and obesity candidate genes. Nuc Acids Res. 2006, 34: 3067-3081. 10.1093/nar/gkl381.
- Carr SM, Marshall HD, Duggan AT, Flynn SMC, Johnstone KA, Pope AM, Wilkerson CD: Phylogeographic genomics of mitochondrial DNA: patterns of intraspecific evolution and a multi-species, microarray-based DNA sequencing strategy for biodiversity studies. Comp Biochem Physiol D. 2007.
- Gibson G, Muse S: A primer of genome science. 2006, Sunderland, MA: Sinauer Associates, 2
- Dong S, Wang E, Hsie L, Cao Y, Chen X, Gingeras TR: Flexible use of high-density oligonucleotide arrays for single-nucleotide polymorphism discovery and validation. Genome Res. 2001, 11: 1418-1424. 10.1101/gr.171101.
- Warrington JA, Shah NA, Chen X, Janis M, Liu C, Kondapalli S, Reyes V, Savage MP, Zhang Z, Watts R, DeGuzman M, Berno A, Snyder J, Baid J: New development in high-throughput resequencing and variation detection using high density microarrays. Genome Res. 2002, 19: 402-409.
- Hacia JG: Resequencing and mutational analysis using oligonucleotide microarrays. Nature Genet. 1999, 21: 42-7. 10.1038/4469.
- Wilson AC, Cann RL, Carr SM, 11 co-authors: Mitochondrial DNA and two perspectives on evolutionary genetics. Biol J Linnean Soc. 1985, 26: 375-400. 10.1111/j.1095-8312.1985.tb02048.x.
- Ingman M, Kaessmann H, Pääbo S, Gyllensten U: Mitochondrial genome variation and the origin of modern humans. Nature. 2000, 408: 708-713. 10.1038/35047064.
- Carr SM, Marshall HD: Intraspecific phylogeographic genomics from multiple complete mtDNA genomes in Atlantic Cod (Gadus morhua): Origins of the “Codmother” and the “Out of Newfoundland” hypothesis of population expansion. Genetics. 2007, in review.
- Davis CS, Delisle I, Stirling I, Siniff DB, Strobeck C: A phylogeny of the extant Phocidae inferred from complete mitochondrial DNA coding regions. Mol Phylogenet Evol. 2004, 33: 363-377. 10.1016/j.ympev.2004.06.006.
- Coulson MW, Marshall HD, Pepin P, Carr SM: Mitochondrial genomics of gadine fishes: implications for taxonomy and biogeographic origins from whole-genome data sets. Genome. 2006, 49: 1515-1530. 10.1139/G06-083.
- Maitra A, Cohen Y, Gillespie SE, Mambo E, Fukushima N, Hoque MO, Shah N, Goggins M, Califano J, Sidransky D, Chakravarti A: The human MitoChip: a high-throughput sequencing microarray for mitochondrial mutation detection. Genome Res. 2004, 14: 812-819. 10.1101/gr.2228504.
- Wallace DC: A mitochondrial paradigm of metabolic and degenerative diseases, aging, and cancer: a dawn for evolutionary medicine. Ann Rev Genet. 2005, 39: 359-407. 10.1146/annurev.genet.39.110304.095751.
- Brown WM, George M, Wilson AC: Rapid evolution of animal mitochondrial DNA. PNAS. 1979, 76: 1967-1971. 10.1073/pnas.76.4.1967.
- Horai S, Hayasaka K, Kondo R, Tsugane K, Takahata N: Recent African origin of modern humans revealed by complete sequences of hominoid mitochondrial DNAs. PNAS. 1995, 92: 532-536. 10.1073/pnas.92.2.532.
- Enard W, Pääbo S: Comparative primate genomics. Ann Rev Genomics Human Genet. 2004, 5: 351-378. 10.1146/annurev.genom.5.061903.180040.
- Gagneux P, Wills C, Gerloff U, Tautz D, Morin PA, Boesch C, Fruth B, Hohmann G, Ryder OA, Woodruff DS: Mitochondrial sequences show diverse evolutionary histories of African hominoids. PNAS. 1999, 96: 5077-5082. 10.1073/pnas.96.9.5077.
- Garner KJ, Ryder OA: Mitochondrial DNA diversity in gorillas. Mol Phylogenet Evol. 1996, 6: 39-48. 10.1006/mpev.1996.0056.
- Wise CA, Srami M, Rubinsztein DC, Easteal S: Comparative nuclear and mitochondrial genome diversity in humans and chimpanzees. Mol Biol Evol. 1997, 14: 707-716.
- Chimpanzee Sequencing and Analysis Consortium: Initial sequence of the chimpanzee genome and comparison with the human genome. Nature. 2005, 437: 69-87. 10.1038/nature04072.
- Anderson S, Bankier AT, Barrell BG, de Bruijn MH, Coulson AR, Drouin J, Eperon IC, Nierlich DP, Roe BA, Sanger F, Schreier PH, Smith AJ, Staden R, Young IG: Sequence and organization of the human mitochondrial genome. Nature. 1981, 290: 457-465. 10.1038/290457a0.
- Pope AM: Population genomics of the founding population of the island of Newfoundland, based on complete mtDNA genome sequences. 2007, MSc thesis, Department of Biology. Memorial University of Newfoundland
- Simon C, Buckley T, Frati F, Stewart J, Beckenbach A: Incorporating molecular evolution into phylogenetic analysis, and a new compilation of conserved polymerase chain reaction primers for animal mitochondrial DNA. Ann Rev Ecol Syst. 2006, 37: 545-579. 10.1146/annurev.ecolsys.37.091305.110018.
- Hixson JE, Brown WM: A comparison of the small ribosomal RNA genes from the mitochondrial DNA of the great apes and humans: sequence, structure, evolution, and phylogenetic implications. Mol Biol Evol. 1986, 3: 1-18.
- Johnstone KA, Marshall HD, Carr SM: Biodiversity genomics for species at risk: patterns of DNA sequence variation within and among complete mitochondrial genomes of three species of wolffish (Anarhichas spp.). Can J Zool. 2007, 85: 151-158. 10.1139/Z06-191.
- Hacia JG, Makalowski W, Edgemon K, Erdos MR, Robbins CM, Fodor SP, Brody LC, Collins FS: Evolutionary sequence comparisons using high-density oligonucleotide arrays. Nature Genet. 1998, 18: 155-158. 10.1038/ng0298-155.
- Hacia JG, Fan JB, Ryder O, Jin L, Edgemon K, Ghandour G, Mayer RA, Sun B, Hsie L, Robbins CM, Brody LC, Wang D, Lander ES, Lipshutz R, Fodor SP, Collins FS: Determination of ancestral alleles for human single-nucleotide polymorphisms using high-density oligonucleotide arrays. Nature Genet. 1999, 22: 164-167. 10.1038/9674.
- Arnason U, Gullberg A, Xu X: A complete mitochondrial DNA molecule of the white-handed gibbon, Hylobates lar, and comparison among individual mitochondrial genes of all hominoid genera. Hereditas. 1996, 124: 185-189. 10.1111/j.1601-5223.1996.00185.x.
- Krings M, Capelli C, Tschentscher F, Geisert H, Meyer S, von Haeseler A, Grossschmidt K, Possnert G, Paunovic M, Pääbo S: A view of Neandertal genetic diversity. Nature Genet. 2000, 26: 144-146. 10.1038/79855.
- Wilkerson CR: Population genetics of Woodland Caribou (Rangifer tarandus tarandus) on the island of Newfoundland. 2007, MSc thesis, Department of Biology. Memorial University of Newfoundland
- Cutler DJ, Zwick ME, Carrasquillo MM, Yoh CT, Tobin KP, Kashuk C, Mathews DJ, Shah NA, Eichler EE, Warrington JA, Chakravarti A: High-throughput variation detection and genotyping using microarrays. Genome Res. 2001, 11: 1913-192.
- Swofford DL: PAUP*: Phylogenetic analysis using parsimony and other methods. V. 4.0 beta. Florida State University
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.