Mitogenomic phylogeny of the common long-tailed macaque (Macaca fascicularis fascicularis)
© Liedigk et al.; licensee BioMed Central. 2015
Received: 16 December 2014
Accepted: 6 March 2015
Published: 21 March 2015
Long-tailed macaques (Macaca fascicularis) are an important model species in biomedical research and reliable knowledge about their evolutionary history is essential for biomedical inferences. Ten subspecies have been recognized, of which most are restricted to small islands of Southeast Asia. In contrast, the common long-tailed macaque (M. f. fascicularis) is distributed over large parts of the Southeast Asian mainland and the Sundaland region. To shed more light on the phylogeny of M. f. fascicularis, we sequenced complete mitochondrial (mtDNA) genomes of 40 individuals from all over the taxon’s range, either by classical PCR-amplification and Sanger sequencing or by DNA-capture and high-throughput sequencing.
Both laboratory approaches yielded complete mtDNA genomes from M. f. fascicularis with high accuracy and/or coverage. According to our phylogenetic reconstructions, M. f. fascicularis initially diverged into two clades 1.70 million years ago (Ma), with one including haplotypes from mainland Southeast Asia, the Malay Peninsula and North Sumatra (Clade A) and the other, haplotypes from the islands of Bangka, Java, Borneo, Timor, and the Philippines (Clade B). The three geographical populations of Clade A appear as paraphyletic groups, while local populations of Clade B form monophyletic clades with the exception of a Philippine individual which is nested within the Borneo clade. Further, in Clade B the branching pattern among main clades/lineages remains largely unresolved, most likely due to their relatively rapid diversification 0.93-0.84 Ma.
Both laboratory methods have proven to be powerful to generate complete mtDNA genome data with similarly high accuracy, with the DNA-capture and high-throughput sequencing approach as the most promising and only practical option to obtain such data from highly degraded DNA, in time and with relatively low costs. The application of complete mtDNA genomes yields new insights into the evolutionary history of M. f. fascicularis by providing a more robust phylogeny and more reliable divergence age estimations than earlier studies.
KeywordsSoutheast Asia Sundaland Sanger sequencing High-throughput sequencing DNA-capture
Macaques (genus Macaca) represent one of the most successful extant primate radiations. They colonized a large geographic range, from continents to islands, making them unique among non-human primates . Fossils indicate that they arose in northern Africa around 7 million years ago (Ma) . During their expansion into Asia in the late Miocene, the genus diversified into various species groups and species that have been defined by morphological, behavioral and molecular characters, and by their geographic distribution [2-9].
Taxonomic and phylogenetic affiliations of the various macaque species have been matter of debate for several decades [1-7,10]. According to current classifications the genus comprises 22 species, which are divided into seven species groups [9,11]; among them three monotypic species groups, (1) the M. sylvanus group, (2) the M. arctoides group and (3) the M. fascicularis group, and four polytypic groups, (4) the Sulawesi macaques group with six species, (5) the M. mulatta group with three species, (6) the M. sinica group with five species and (7) the M. silenus group with five species. As with the other species groups, the species composition of the M. fascicularis group has changed over time. Delson  and Fooden [3,4] included four species (M. mulatta, M. cyclopis, M. fuscata, M. fascicularis), but Groves  moved M. mulatta, M. cyclopis and M. fuscata in their own species group, the M. mulatta group, and integrated M. arctoides in the M. fascicularis group. Zinner et al.  likewise recognized the members of the M. mulatta group as a distinct species group and additionally excluded M. arctoides proposing a monotypic M. fascicularis group.
The geographic origin of M. fascicularis and dispersal scenarios that led to its current distribution are still a matter of debate. Delson  suggested that macaques entered Sundaland, probably in the Pliocene, during periods of low sea level and ancestral M. fascicularis became isolated there when rising sea levels and geological activity fragmented Sundaland. During the Pleistocene, M. fascicularis extended its range again [2,14]. This largely corresponds to the observed higher level of nucleotide diversity found in long-tailed macaque populations from Sundaland compared to the populations from the Asian mainland and Malay Peninsula [18,23]. This scenario is also in agreement with the fact that the earliest fossils of M. fascicularis, or at least those of a close relative, were found on Java [2,14,33]. Currently, the species is also found on islands that were never connected to the Asian mainland or Sundaland, including islands beyond the Wallace line (e.g., Lombok, Sumbawa, Flores, Timor) and the Philippines. Accordingly, it was assumed that humans introduced M. fascicularis to the islands east of the Wallace line ca. 4,000 year ago , while the Philippines were most likely naturally colonized during two independent immigration events . The species’ survival in areas, where it has been introduced by humans (e.g., Hong Kong, Taiwan, Papua New Guinea, New Britain, and various Pacific islands), indicates its significant ecological plasticity. Long-tailed macaques are naturally adapted to riverine and coastal environments such as mangrove and gallery forests [34,35]. They primarily feed on fruits and seeds , but as indicated by one of its common names, crab-eating macaque, they also include crabs, shrimps, clams and fish in their diet . They are able to swim and even to dive . Hence, it is likely that long-tailed macaques were able to cross short distances between islands actively.
The objective of this study is to shed more light on the phylogeny of M. f. fascicularis, the most widespread subspecies of the long-tailed macaques, occurring on the Southeast Asian mainland and Sundaland islands, including parts of the Philippines and east as far as Timor. Therefore, we generated complete mitochondrial (mtDNA) genomes from 40 long-tailed macaque individuals either by traditional polymerase chain reaction (PCR) amplification followed by Sanger sequencing or by DNA-capture and high-throughput sequencing. We expect that the analysis of complete mtDNA genomes provides a better resolution of phylogenetic relationships among lineages than only short mtDNA fragments.
We generated 42 complete mtDNA genome sequences from 40 M. fascicularis individuals, either by classical PCR followed by Sanger sequencing (10 individuals) or by DNA-capture and high-throughput sequencing (32 individuals). For two museum samples (IDs: 20, 31) both methods were applied which yielded sequences with 100% identity, thus indicating similarly high accuracy of both methods. For mtDNA genomes that were captured and sequenced on the Ion PGM sequencing platform we obtained an average of 97,583 (12,599-230,683) trimmed reads with an average read length of 96 bp, resulting in an average 285-fold coverage. Sequences in the overlapping parts were identical and all protein-coding genes were correctly translated without any premature stop codons, indicating that no nuclear mitochondrial-like sequences (numts) are present in our dataset. All newly generated mtDNA genomes had a length of 16,561 to 16,567 bp, and consisted of 22 transfer RNA genes, 2 ribosomal RNA genes, 13 protein coding genes and the control region.
By applying different methods, classical PCR amplification followed by Sanger sequencing and DNA-capture with subsequent high-throughput sequencing, we successfully obtained complete mtDNA genome data from 40 M. fascicularis individuals. Both methods have proven to be useful to gain such data with similarly high accuracy, but the DNA-capture and high-throughput sequencing approach is less costly and time consuming [40-42]. Moreover, DNA extracted from museum material and fecal material is normally highly degraded. Fortunately, some of our museum and fecal samples contained DNA in sufficient quality so that the complete mtDNA genome could be amplified via 21 overlapping PCRs. Usually, due to the high degree of DNA degradation, generating complete mtDNA genomes would require a much larger number of PCR amplifications. In contrast, DNA-capture does not need a certain DNA fragment size, because any size of DNA fragment can be captured and subsequently sequenced. However, degraded DNA should be solely investigated in special laboratories and with various precautions to prevent contamination.
Our results concerning the phylogenetic relationships among macaque and non-macaque taxa and estimated divergence ages are largely in line with previous molecular studies [5,7,10,19,43-48]. For the phylogenetic relationships among M. fascicularis haplotypes, we obtained higher statistical support for most nodes in our tree, as compared to earlier mtDNA studies which used only fragments of the mtDNA genome [5,7,17-20,22]. Nevertheless, some nodes in our study are still missing significant statistical support, thus leaving some phylogenetic relationships, in particular those between populations from Timor, Java, Mauritius and Bangka/Borneo/Philippines, unresolved. Such results are common when clades or lineages diverged within a short time period [42,48-52].
In contrast to Tosi and Coke  who found Sumatran individuals to be part of the Sundaland clade (referring to our Clade B), the Sumatran individuals, which we analyzed, are nested within the Asian mainland clade (referring to our Clade A). A possible explanation for this contradiction is the different geographic origin of studied individuals, with the South Sumatran samples of Tosi and Coke  clustering with Sundaland sequences and our North Sumatran samples clustering with mainland sequences. Thus, both major M. f. fascicularis mtDNA clades are likely to be present on Sumatra similar to the presence of both Y chromosomal haplogroups on the island .
Since mtDNA is only inherited via the maternal line and macaques live mainly in female philopatric societies [53,54], mtDNA data can be utilized to reveal insights into genetic differences among regional populations and to trace their phylogeographic history . According to our phylogenetic reconstruction and estimated divergence ages, M. f. fascicularis initially split into two clades 1.70 (1.36-2.04) Ma, with representatives of both lineages being found today on Sumatra (according to our study and ). Possible explanations therefore are (1) Sumatra is the place of origin of M. f. fascicularis, (2) Sumatra is the place of origin of only Sundaland M. f. fascicularis, while long-tailed macaques from the mainland invaded the island later, or (3) long-tailed macaques on Sumatra became extinct and the island was later re-colonized from the mainland and other Sundaland islands. The hypothesis that Sumatra is the place of M. f. fascicularis origin is supported by the observed high mtDNA diversity found on the island compared to other regions where the subspecies occurs (e.g., ). However, not in support of this hypothesis is the paraphyly of haplotypes from the mainland and Malay Peninsula, and the respective branching pattern among them and the Sumatra haplotypes, which suggests that the northern Sumatra population came in from the mainland. To test whether Sumatra or any other island, e.g., Java [2,18] is the place of origin of Sundaland or all M. f. fascicularis populations, needs further investigations and, particularly, should include data of M. f. fascicularis from southern Sumatra.
As in previous studies [7,19,26], we found long-tailed macaques from the Philippines clustering within the Borneo clade. Since the Bornean individual, which is most closely related to the Philippine specimens, is from the furthest east of Borneo (Sabah, Tawau Hill Park), this branching pattern fosters the previously proposed hypothesis of a colonization of the Philippines via Borneo [1,26]. Within the last million years, the Philippines have never been connected to the Southeast Asian mainland or Borneo via a continuous land bridge . One possible exception is the island of Palawan, which has been considered to have had a land connection to Borneo during sea level low-stands in the late Pleistocene . The previously proposed Philippine colonization hypothesis [1,26] via Palawan and appending islets seems plausible (stepping-stone colonization). A recent study suggests that there may have been at least two dispersal events from Borneo into the Philippines, first one via Palawan resulting in M. f. philippinensis found in the north of the Philippine Archipelago (Figure 1), and a later one via the Sulu Archipelago that resulted in M. f. fascicularis in the south . Our Philippine sample most likely belongs to this southern taxon.
One noteworthy outcome from our study is the early divergence of a monophyletic Timor clade within the Sundaland clade (Figure 2). It appears that this clade diverged some 0.93 (1.12-0.74) Ma from the other Sundaland lineages. This finding is supported by an analysis of blood protein polymorphisms from samples across the Indonesian and Timor island arc, indicating that populations east of the Wallace Line (Lombok and Sumbawa) have greatly differentiated from those to the west . Our mtDNA-based estimate, however, significantly predates the earliest finds of macaques in Timor’s archaeological record, which appear at the same time, i.e. as the first evidence of pottery and domesticated pig in one site a few thousand years ago, indicating human translocations . Similarly, on Flores, an island further west, but still east of the Wallace Line, long-tailed macaques only appear in the archaeological record around 7,000 years ago . It is unclear what underlies the apparent major discrepancy between the present phylogenetic analysis and the zooarchaeological record, but an introduction by humans as proposed by Fooden  seems unlikely, although the possibility remains that the detected Timor haplotypes originated from somewhere else in Sundaland, a place that was not sampled in our study.
Both applied laboratory methods have proven to be powerful to generate complete mtDNA genome data with similarly high accuracy, with the DNA-capture and high-throughput sequencing approach as the most promising and only practical option to obtain such data from highly degraded DNA, fast and relatively cheap. Our study provides new insights into the evolutionary history of M. f. fascicularis, most prominent we obtained first evidence for the presence of haplotypes in North Sumatra that are related to Asian mainland haplotypes and the clearly distinct and phylogenetically old Timor clade. However, to fully resolve the phylogeny of long-tailed macaques, to identify their origin and the dispersal routes leading to their current distribution, to assess their full genetic diversity and to explore to which extent secondary gene flow occurred between local populations, it is fundamental to include further M. f. fascicularis populations from throughout their range into future studies. In these studies both, mitochondrial and a large number of nuclear loci, should be analyzed. Moreover, to fully understand the evolutionary history of the species, the other subspecies of M. fascicularis should be incorporated in such studies as well. Since long-tailed macaques are an important model species in biomedical research and considering intra-specific variation in genetics, physiology and behavior, more attention should be paid to the selection of study specimens.
Blood samples were taken during routine health checks by experienced veterinarians and not specifically for this study. All research complied with protocols approved by the Animal Welfare Body of the DPZ in Germany and the Department of Wildlife and National Parks in Malaysia, and adhered to the legal requirements of the countries, in which research was conducted. The study was carried out in compliance with the principles of the American Society of Primatologists for the ethical treatment of non-human primates (https://www.asp.org/society/resolutions/EthicalTreatmentOfNonHumanPrimates.cfm). No animals were sacrificed for this study.
We collected and sequenced mtDNA genomes from 40 long-tailed macaque individuals originating from 16 sites throughout the species’ range in Southeast Asia and Sundaland, and from the introduced population on Mauritius (Figure 1, Additional file 2: Table S2). Thirty-one of our samples (sample IDs: 4, 5, 11–31, 33–40) derived from museum specimens housed in the Bavarian State Collection of Zoology (ZSM) in Munich, Germany. Respective specimens were collected between 1904 and 1949. Dried muscle tissue attached to the skeleton was taken with sterilized scalpels and tweezers, and gloves and masks were worn during sample collection to avoid contamination. Museum samples were stored dry in tubes or plastic envelopes. Additionally, we included seven fresh fecal samples, stored in 90% ethanol, which were collected during field surveys (IDs: 6–10, 32, 41). We further obtained high-quality DNA extracted from blood samples from each one individual from Covance Inc. (Münster, Germany) and the German Primate Center (DPZ, Göttingen, Germany), which originated from the Philippines (ID: 42), and Mauritius (ID: 43), respectively. For all samples, we tried to obtain information about the exact geographic provenance, but this was not always possible. While for all fecal samples, GPS coordinates were recorded, information about the exact origin of the samples from the Philippines and Mauritius was not available. Likewise, we were not able to identify the exact provenance of five Bornean samples (IDs: 33–37, derived from “west coast Borneo”), while for all other museum samples the exact origin could be determined. Thus, 38 samples can be geographically clearly assigned to M. f. fascicularis (IDs: 4–41). The individual from Mauritius (ID: 43) most likely refers also to M. f. fascicularis because it is believed that this introduced population originated from Sumatra or at least from Sundaland [19,61], while the individual from the Philippines (ID: 42) could be either M. f. philippinensis or M. f. fascicularis (due to its haplotype it is most likely M. f. fascicularis). The individuals from Timor refer to the holotype (ID: 24) and paratypes (IDs: 25–31) of Pithecus fascicularis limitis Schwarz, 1913, which is recognized as synonym of M. f. fascicularis [6,12]. Beside the 40 samples mentioned above, we obtained a blood sample from an additional M. f. fascicularis individual (ID: 3) from Convance Inc. which originated from Vietnam. The mtDNA genome of this individual was already published , but its DNA was used to prepare baits for DNA-capture. For detailed sample information see Additional file 2: Table S2.
For the extraction of total genomic DNA we used two different methods. First, we applied a kit-based method using the First-DNA All Tissue kit (Gen-Ial). All fecal and five of the museum samples (IDs: 11, 14, 20, 31, 38) were extracted with this method following respective protocols provided by the company. To avoid and check for cross-sample contamination, all working steps were carried out in separate laboratories and under Captair Bio PCR cabinets (Erlab), gloves and masks were permanently worn, and negative extraction controls were routinely performed. Further, samples were treated one by one, and workbenches were decontaminated with UV light before and after each extraction. After extraction, DNA concentration was measured on a NanoDrop ND-1000 spectrophotometer and samples were stored at −20°C until further processing.
Secondly, 28 museum samples (IDs: 4, 5, 12, 13, 15–31, 33–37, 39, 40) were extracted in a special ancient DNA laboratory applying a protocol for nondestructive DNA extraction [62,63] with slight modifications . All working steps were carried out in Thermo Scientific Safe 2020 biological safety cabinets. For each step (sample preparation, DNA extraction) different cabinets were used, and before and after each sample, cabinets were cleaned with DNA decontamination solution and treated with UV light for at least 30 min. Concentration of extracted DNAs was measured on a Qubit 2.0 fluorometer and DNA samples were frozen at −20°C until further processing. For comparative reasons, two museum samples (IDs: 20, 31) were extracted with both methods.
DNA amplification and Sanger sequencing
We generated complete mtDNA genomes from the high-quality samples from the Philippines and Mauritius as well as from three of the fecal samples (IDs: 7, 32, 41) and five of the museum samples (IDs: 11, 14, 20, 31, 38) by traditional PCR amplification followed by Sanger sequencing. All working steps (PCR setup, gel electrophoresis, PCR product purification, sequencing) were conducted in separate laboratories and under Captair Bio PCR cabinets to prevent cross-sample contamination. Further, negative PCR controls (without template DNA) were routinely conducted. To minimize the risk of amplifying numts for the two high-quality DNA samples, we produced two overlapping long-range PCR products (8 kb and 10 kb) followed by 21 nested PCRs with product sizes of 1.0-1.2 kb and an overlap of 100–300 bp applying methods described elsewhere . Since DNA extracted from fecal and museum samples is usually degraded, the complete mtDNA genome from these samples was directly amplified via the 21 PCRs mentioned above and not first via two long-range PCRs. PCR conditions were the same as for the nested PCRs above, but sometimes the number of cycles was increased to 60. As template, we added 10–50 ng DNA to the reaction. PCR performance and product sizes were checked on 1% agarose gels, and after purification, PCR products were sequenced on an ABI 3130xL sequencer using the BigDye Terminator Cycle Sequencing kit (Applied Biosystems) and both amplification primers. Information on primers and PCR conditions is available upon request. Sequences were checked with 4Peaks 1.7.1 (www.mekentosj.com) and mtDNA genomes were assembled with SeaView 4.4.0 . Annotation was performed with DOGMA  and manually verified.
DNA-capture and high-throughput sequencing
Complete mtDNA genomes from 28 museum (IDs: 4, 5, 12, 13, 15–31, 33–37, 39, 40) and four fecal samples (IDs: 6, 8–10) were generated using a DNA-capture approach followed by high-throughput sequencing according to Maricic et al.  with slight modifications (see below) to adapt the workflow to the Ion PGM sequencing system (Ion Torrent). To prevent contamination, all working steps were carried out in dedicated ancient DNA and/or special high-throughput sequencing laboratories, and various negative controls were applied. After DNA extraction and concentration measurement, barcoded sequencing libraries were established using the Ion Plus Fragment Library kit and the Ion Xpress Barcode Adapters. Adapter ligation and the subsequent amplification of the samples were performed according to the protocol for Ion Xpress Plus gDNA Fragment Library Preparation. Afterwards, we pooled the adapter-ligated and amplified libraries in equal concentrations to a total of 2 μg. As bait we used mtDNA genomes of each one long-tailed macaque individual from Vietnam (ID: 3) and Mauritius (ID: 43). The respective complete mtDNA genomes were amplified via two overlapping PCR products (see above). Afterwards, we sheared the PCR products to an average of ca. 1,000 bp fragments with a Bioruptor Pico. We diluted 1.5 μg of PCR product to a volume of 150 μl, split the sample into three (50 μl each) and sonicated each six times with 10 seconds “ON” and 90 seconds “OFF”. One μl of the sheared PCR product was size-checked on the Agilent 2100 Bioanalyzer with the high sensitivity DNA kit. Fragments were subsequently end-repaired, biotinylated by ligating the Bio-T/B adapter , and immobilized on streptavidin-coated beads. Bait and the pooled single-stranded libraries were combined and four phosphorylated blocking oligos (BO1.P1.F: CCACTACGCCTCCGCTTTCCTCTCTATGGGCAGTCGGTGAT-phosphate, BO2.P1.R: ATCACCGACTGCCCATAGAGAGGAAAGCGGAGGCGTAGTGG-phosphate, BO3.A.F: CCATCTCATCCCTGCGTGTCTCCGACTCAG-phosphate, BO4.A.R: CTGAGTCGGAGACACGCAGGGATGAGATGG-phosphate) were added. After 48 h of hybridization at 65°C, library molecules that did not hybridize were washed out and the enriched library pool was eluted. Subsequently, the concentration of the enriched library pool was measured by qPCR (Ion Library Quantitation Kit) and sequenced on the Ion PGM sequencer using a 316v2 or 318v2 chip and the Ion PGM Sequencing 400 Kit protocol. The raw sequencing reads were quality-filtered, and adapters and barcodes were trimmed with the PGM Torrent Suite Software 4.2. The extracted reads were initially assembled with the Newbler program (GS Reference Mapper) of the 454 Sequencing System Software 2.5 from command line with standard parameters. The mtDNA genome of the Vietnamese M. f. fascicularis individual (ID: 3) was used as reference. Batch processing was done by custom Perl scripts. The resulting contigs, typically ranging from 1 to 4 sequences per mtDNA genome, were manually assembled into genomes with SeaView and annotated with DOGMA. All gaps between contigs could be closed by combining the results from multiple sequencing runs.
For phylogenetic reconstructions, we expanded our dataset with additional mtDNA genome sequences from macaque and non-macaque taxa derived from Genbank. The dataset comprised 60 mtDNA genomes including 43 M. fascicularis individuals (3 from Genbank including ID: 3), at least one representative of the other six macaque species groups (2 M. sylvanus, 1 M. arctoides, 3 M. mulatta, 2 M. thibetana, 1 M. tonkeana, 1 M. silenus) and various outgroup taxa (1 Theropithecus gelada, 1 Papio hamadryas, 1 Chlorocebus pygerythrus, 1 Colobus guereza, 1 Pongo abelii, 1 Pan troglodytes, 1 Homo sapiens). For detailed sample information and Genbank accession numbers see Additional file 2: Table S2.
Sequences were aligned with Muscle 3.7  as implemented in SeaView and manually corrected. Indels and poorly align positions were removed with Gblocks 0.91b  using standard settings. Identical sequences were subsequently excluded (IDs: 21 = 23, 27 = 28 = 31, 33 = 34), resulting in a final dataset of 56 unique mtDNA genome haplotypes. For ML and Bayesian tree reconstructions, we applied the programs RAxML 0.93  and MrBayes 3.1.2 [70,71], respectively. ML calculations in RAxML were run with the CAT-GTR model and 1,000 bootstrapping replications. For Bayesian tree reconstructions in MrBayes, we conducted four Markov Chain Monte Carlo (MCMC) runs with a default temperature of 0.2 and the TrN + I + G model as selected as best-fit model in jModeltest 2.1  under the Bayesian information criterion (BIC) and the Decision Theory Performance-based Selection (DT). All repetitions were run for 1 million generations with tree and parameter sampling setting in every 100 generations. The first 25% of samples were discarded as burn-in, resulting in 75,001 trees per run. The adequacy of the burn-in and convergence of all parameters was assessed via the uncorrected potential scale reduction factor (PSRF)  as calculated by MrBayes and by visual inspection of the trace of the parameters across generations using TRACER 1.5 . To check whether posterior clade probabilities were also converging, AWTY  was applied. Posterior probabilities for each split and a phylogram with mean branch lengths were calculated from the posterior density of trees.
Divergence ages from the dataset were estimated with BEAST 1.6.1  applying a Bayesian MCMC method with a relaxed molecular clock approach . A relaxed lognormal model of lineage variation and a Birth-Death Process prior for branching rates was assumed. The following five fossil-based calibration points were used with a normal distribution prior for respective nodes: (1) the Homo – Pan split 6.5 Ma with a 95% CI of 0.5 Ma [78-80], (2) the split between Pongo and the Homo + Pan clade at 14 Ma (95% CI: 1.0 Ma) , (3) the divergence of Theropithecus and Papio 5 Ma (95% CI: 1.5 Ma) [82,83], (4) the split between African (M. sylvanus) and Asian macaques at 5.5 Ma (95% CI: 1.0 Ma) [83,84] and (5) the divergence of hominids and cercopithecids at 27.5 Ma (95% CI: 3.5) [85-87]. Four replicates were run in BEAST for 25 million generations with tree and parameter sampling occurring every 100 generations. TRACER was used to assess the adequacy of a 10% burn-in and the convergence of all parameters via visual inspection of the trace of the parameter across generations. Sampling distributions were combined (25% burn-in) using the software LogCombiner 1.6.1. A consensus chronogram with node height distribution was generated and visualized with TreeAnnotator 1.6.1 and FigTree 1.4.0 .
Availability of supporting data
The alignments supporting the results of this article are available in the Data Dryad repository, https://doi.org/10.5061/dryad.c801k.
We are grateful to the Bavarian State Collection of Zoology, Covance Inc. and the German Primate Center for providing valuable long-tailed macaque samples, as well as to the Universiti Kebangsaan Malaysia, the Department of Wildlife and National Parks (PERHILITAN), the Sarawak Forest Department, the Sabah Wildlife Department and Sabak Parks for sharing fecal samples. This research was financially supported by the German Primate Center and by grants to Badrul Munir Md-Zain (FRGS/1/2012/STWN10/UKM/02/3, DLP-2013-006, ERGS/1/2013/STWN10/UKM/02/1). We further thank Sabine Hutschenreuther, Christiane Schwarz, Nico Westphal and Jens Gruber for support during sample collection, laboratory work or data analysis.
- Abegg C, Thierry B. Macaca evolution and dispersal in insular south-east Asia. Biol J Linn Soc. 2002;75:555–76.View ArticleGoogle Scholar
- Delson E. Fossil macaques, phyletic relationships and a scenario of deployment. In: Lindburg DG, editor. The Macaques: Studies in Ecology, Behavior, and Evolution. New York: Van Nostrand Reinhold; 1980. p. 10–30.Google Scholar
- Fooden J. Provisional classification and key to living species of macaques (Primates: Macaca). Folia Primatol. 1976;25:225–36.View ArticlePubMedGoogle Scholar
- Fooden J. Classification and distribution of living macaques (Macaca Lacepede, 1799). In: Lindburg DG, editor. The Macaques: Studies in Ecology, Behavior, and Evolution. New York: Van Nostrand Reinhold; 1980. p. 1–9.Google Scholar
- Tosi AJ, Morales JC, Melnick DJ. Comparison of Y chromosome and mtDNA phylogenies leads to unique inferences of macaque evolutionary history. Mol Phylogenet Evol. 2000;17:133–44.View ArticlePubMedGoogle Scholar
- Groves CP. Primate Taxonomy. Washington, DC: Smithsonian Institution Press; 2001.Google Scholar
- Tosi AJ, Morales JC, Melnick DJ. Paternal, maternal, and biparental molecular markers provide unique windows onto the evolutionary history of macaque monkeys. Evolution. 2003;57:1419–35.View ArticlePubMedGoogle Scholar
- Anandam MV, Bennett EL, Davenport TRB, Davies NJ, Detwiler KM, Engelhardt A, et al. Family Cercopithecidae (Old World Monkeys) – Species accounts for Cercopithecidae. In: Mittermeier RA, Rylands AB, Wilson DE, editors. Handbook of the Mammals of the World. Volume 3, Primates. Barcelona: Lynx Edicions; 2013. p. 628–753.Google Scholar
- Zinner D, Fickenscher GH, Roos C. Family Cercopithecidae (Old World Monkeys). In: Mittermeier RA, Rylands AB, Wilson DE, editors. Handbook of the Mammals of the World. Volume 3, Primates. Barcelona: Lynx Edicions; 2013. p. 550–627.Google Scholar
- Ziegler T, Abegg C, Meijaard E, Perwitasari-Farajallah D, Walter L, Hodges JK, et al. Molecular phylogeny and evolutionary history of Southeast Asian macaques forming the M. silenus group. Mol Phylogenet Evol. 2007;42:807–16.View ArticlePubMedGoogle Scholar
- Roos C, Boonratana R, Supriantna J, Fellowes JR, Groves CP, Nash SD, et al. An updated taxonomy and conservation status review of Asian primates. Asian Primates J. 2014;4:2–38.Google Scholar
- Fooden J. Systematic review of Southeast Asian longtail macaques, Macaca fascicularis (Raffles, ). Fieldiana Zool. 1995;81:1–206.Google Scholar
- Fooden J. Tail length variation in Macaca fascicularis and M. mulatta. Primates. 1997;38:221–31.View ArticleGoogle Scholar
- Fooden J. Comparative review of fascicularis-group species of macaques (Primates: Macaca). Fieldiana Zool. 2006;107:1–43.Google Scholar
- Roos C, Zinner D. Diversity and evolutionary history of macaques which special focus on rhesus and long-tailed macaques. In: Blümel J, Korte S, Schenck E, Weinbauer GF, editors. The Nonhuman Primate in Nonclinical Drug Development and Safety Assessment. New York: Elsevier; 2015. p. 3–16.View ArticleGoogle Scholar
- Harihara S, Saitou N, Hirai M, Aoto N, Terao K, Cho F, et al. Differentiation of mitochondrial DNA types in Macaca fascicularis. Primates. 1988;29:117–27.View ArticleGoogle Scholar
- Tosi AJ, Morales JC, Melnick DJ. Y-chromosome and mitochondrial markers in Macaca fascicularis indicate introgression with Indochinese M. mulatta and a biogeographic barrier in the Isthmus of Kra. Int J Primatol. 2002;23:161–78.View ArticleGoogle Scholar
- Smith DG, McDonough JW, George DA. Mitochondrial DNA variation within and among regional populations of long-tail macaques (Macaca fascicularis) in relation to other species of the fascicularis group of macaques. Am J Primatol. 2007;69:182–98.View ArticlePubMedGoogle Scholar
- Tosi AJ, Coke CS. Comparative phylogenetics offer new insights into the biogeographic history of Macaca fascicularis and the origin of the Mauritanian macaques. Mol Phylogenet Evol. 2007;42:498–504.View ArticlePubMedGoogle Scholar
- Blancher A, Bonhomme M, Crouau-Roy B, Terao K, Kitano T, Saitou N. Mitochondrial DNA sequence phylogeny of 4 populations of the widely distributed cynomolgus macaque (Macaca fascicularis fascicularis). J Hered. 2008;99:254–64.View ArticlePubMedGoogle Scholar
- Kanthaswamy S, Satkoski J, George D, Kou A, Erickson BJ, Smith DG. Interspecies hybridization and the stratification of nuclear genetic variation of rhesus (Macaca mulatta) and long-tailed macaques (Macaca fascicularis). Int J Primatol. 2008;29:1295–311.View ArticlePubMed CentralPubMedGoogle Scholar
- Shiina T, Tanaka K, Katsuyama Y, Otabe K, Sakamoto K, Kurata M, et al. Mitochondrial DNA diversity among three subpopulations of cynomolgus macaques (Macaca fascicularis) originating from the Indochinese region. Exp Anim. 2010;59:567–78.View ArticlePubMedGoogle Scholar
- Kanthaswamy S, Ng J, Satkoski Trask J, George DA, Kou AJ, Hoffman LN, et al. The genetic composition of populations of cynomolgus macaques (Macaca fascicularis) used in biomedical research. J Med Primatol. 2013;42:120–31.View ArticlePubMed CentralPubMedGoogle Scholar
- Abdul-Latiff MAB, Ruslin F, Fui VV, Abu MH, Rovie-Ryan JJ, Abdul-Patah P, et al. Phylogenetic relationships of Malaysia’s long-tailed macaques, Macaca fascicularis, based on cytochrome b sequences. Zookeys. 2014;407:121–39.View ArticlePubMedGoogle Scholar
- Abdul-Latiff MAB, Ruslin F, Faiq H, Hairul MS, Rovie-Ryan JJ, Abdul-Patah P, et al. Continental monophyly and molecular divergence of Peninsular Malaysia’s Macaca fascicularis fascicularis. Biomed Res Int. 2014;2014:897682.View ArticlePubMed CentralPubMedGoogle Scholar
- Smith DG, Ng J, George D, Trask JS, Houghton P, Singh B, et al. A genetic comparison of two alleged subspecies of Philippine cynomolgus macaques. Am J Phys Anthropol. 2014;155:136–48.View ArticlePubMedGoogle Scholar
- Bonhomme M, Cuartero S, Blancher A, Crouau-Roy B. Assessing natural introgression in 2 biomedical model species, the rhesus macaque (Macaca mulatta) and the long-tailed macaque (Macaca fascicularis). J Hered. 2009;100:158–69.View ArticlePubMedGoogle Scholar
- Stevison LS, Kohn MH. Divergence population genetic analysis of hybridization between rhesus and cynomolgus macaques. Mol Ecol. 2009;18:2457–75.View ArticlePubMedGoogle Scholar
- Satkoski Trask JA, Garnica WT, Smith DG, Houghton P, Lerche N, Kanthaswamy S. Single-nucleotide polymorphisms reveal patterns of allele sharing across the species boundary between rhesus (Macaca mulatta) and cynomolgus (M. fascicularis) macaques. Am J Primatol. 2013;75:135–44.View ArticlePubMed CentralPubMedGoogle Scholar
- Yan G, Zhang G, Fang X, Zhang Y, Li C, Ling F, et al. Genome sequencing and comparison of two nonhuman primate animal models, the cynomolgus and Chinese rhesus macaque. Nat Biotechnol. 2011;29:1019–23.View ArticlePubMedGoogle Scholar
- Hamada Y, Urasopon N, Hadi I, Malaivijitnond S. Body size and proportions and pelage color of free-ranging Macaca mulatta from a zone of hybridization in northeastern Thailand. Int J Primatol. 2005;27:497–513.View ArticleGoogle Scholar
- Haus T, Ferguson B, Rogers J, Doxiadis G, Certa U, Rose NJ, et al. Genome typing of nonhuman primate models: implications for biomedical research. Trends Genet. 2014;30:482–7.View ArticlePubMedGoogle Scholar
- Aimi M, Aziz F. Vertebrate fossils from the Sangiran Dome, Mojokerto, Trinil and Sambungmacam areas. In: Watanabe N, Kadar D, editors. Quaternary Geology of the Hominid Fossil Bearing Formations in Java, Report of the Indonesia-Japan Joint Research Project CTA-41, 1976–1979. Bandung: Geological Research & Development Centre; 1985. p. 155–98.Google Scholar
- Fittinghoff Jr NA, Lindburg DG. Riverine refuging in east Bornean Macaca fascicularis. In: Lindburg DG, editor. The Macaques: Studies in Ecology, Behavior, and Evolution. New York: Van Nostrand Rheinhold; 1980. p. 182–214.Google Scholar
- Wheatley BP. Feeding and ranging of East Bornean Macaca fascicularis. In: Lindburg DG, editor. The Macaques: Studies in Ecology, Behavior and Evolution. New York: Van Nostrand Rheinhold; 1980. p. 215–46.Google Scholar
- Yeager C. Feeding ecology of the long-tailed macaque (Macaca fascicularis) in Kalimantan Tengah, Indonesia. Int J Primatol. 1996;17:51–62.View ArticleGoogle Scholar
- Stewart AME, Gordon CH, Wich SA, Schroor P, Meijaard E. Fishing in Macaca fascicularis: a rarely observed innovative behavior. Int J Primatol. 2008;29:543–8.View ArticleGoogle Scholar
- Gumert MD, Malaivijitnond S. Marine prey processed with stone tools by Burmese long-tailed macaques (Macaca fascicularis aurea) in intertidal habitats. Am J Phys Anthropol. 2012;149:447–57.View ArticlePubMedGoogle Scholar
- Supplementary data from: Mitogenomic phylogeny of the common long-tailed macaque (Macaca fascicularis fascicularis). Dryad Digital Repository, [http://dx.doi.org/10.5061/dryad.c801k]
- Maricic T, Whitten M, Pääbo S. Multiplexed DNA sequence capture of mitochondrial genomes using PCR products. PLoS One. 2010;5:e14004.View ArticlePubMed CentralPubMedGoogle Scholar
- Gunnarsdóttir ED, Li M, Bauchet M, Finstermeier K, Stoneking M. High-throughput sequencing of complete human mtDNA genomes from the Philippines. Genome Res. 2011;21:1–11.View ArticlePubMed CentralPubMedGoogle Scholar
- Guschanski K, Krause J, Sawyer S, Valente LM, Bailey S, Finstermeier K, et al. Next-generation museomics disentangle one of the largest primate radiations. Syst Biol. 2013;62:539–54.View ArticlePubMed CentralPubMedGoogle Scholar
- Deinard A, Smith DG. Phylogenetic relationships among the macaques: evidence from the nuclear locus NRAMP1. J Hum Evol. 2001;41:45–59.View ArticlePubMedGoogle Scholar
- Perelman P, Johnson WE, Roos C, Seuánez HN, Horvath JE, Moreira MA, et al. A molecular phylogeny of living primates. PLoS Genet. 2011;7:e1001342.View ArticlePubMed CentralPubMedGoogle Scholar
- Springer MS, Meredith RW, Gatesy J, Emerling CA, Park J, Rabosky DL, et al. Macroevolutionary dynamics and historical biogeography of primate diversification inferred from a species supermatrix. PLoS One. 2012;7:e49521.View ArticlePubMed CentralPubMedGoogle Scholar
- Finstermeier K, Zinner D, Brameier M, Meyer M, Kreuz E, Hofreiter M, et al. A mitogenomic phylogeny of living primates. PLoS One. 2013;8:e69504.View ArticlePubMed CentralPubMedGoogle Scholar
- Pozzi L, Hodgson JA, Burrell AS, Sterner KN, Raaum RL, Disotell TR. Primate phylogenetic relationships and divergence dates inferred from complete mitochondrial genomes. Mol Phylogenet Evol. 2014;75:165–83.View ArticlePubMedGoogle Scholar
- Liedigk R, Roos C, Brameier M, Zinner D. Mitogenomics of the Old World monkey tribe Papionini. BMC Evol Biol. 2014;14:176.View ArticlePubMed CentralPubMedGoogle Scholar
- Roos C, Zinner D, Kubatko LS, Schwarz C, Yang M, Meyer D, et al. Nuclear versus mitochondrial DNA: evidence for hybridization in colobine monkeys. BMC Evol Biol. 2011;11:77.View ArticlePubMed CentralPubMedGoogle Scholar
- Liedigk R, Yang M, Jablonski NG, Momberg F, Geissmann T, Lwin N, et al. Evolutionary history of the odd-nosed monkeys and the phylogenetic position of the newly described Myanmar snub-nosed monkey Rhinopithecus strykeri. PLoS One. 2012;7:e37418.View ArticlePubMed CentralPubMedGoogle Scholar
- Zinner D, Wertheimer J, Liedigk R, Groeneveld LF, Roos C. Baboon phylogeny as inferred from complete mitochondrial genomes. Am J Phys Anthropol. 2013;150:133–40.View ArticlePubMed CentralPubMedGoogle Scholar
- Carbone L, Harris RA, Gnerre S, Veeramah KR, Lorente-Galdos B, Huddleston J, et al. Gibbon genome and the fast karyotype evolution of small apes. Nature. 2014;513:195–201.View ArticlePubMed CentralPubMedGoogle Scholar
- Pusey AE, Packer C. Dispersal and philopatry. In: Smuts BB, Cheney DL, Seyfarth RM, Wrangham RW, Struhsaker TT, editors. Primate Societies. Chicago: University of Chicago Press; 1987. p. 250–66.Google Scholar
- de Ruiter JR, Geffen E. Relatedness of matrilines, dispersing males and social groups in long-tailed macaques (Macaca fascicularis). Proc Biol Sci. 1998;265:79–87.View ArticlePubMed CentralPubMedGoogle Scholar
- Avise JC. Molecular Markers, Natural History, and Evolution. Sunderland: Sinauer Associates; 2004.Google Scholar
- Esselstyn JA, Widmann P, Heaney LR. The mammals of Palawan Island, Philippines. Proc Biol Soc Wash. 2004;117:271–302.Google Scholar
- Heaney LR. Biogeography of mammals in Southeast Asia: estimates of rates of colonization, extinction, and speciation. Biol J Linn Soc. 1986;28:127–65.View ArticleGoogle Scholar
- Kawamoto Y, Ischak TM, Supriatna J. Genetic variations within and between troops of the crab-eating macaque (Macaca fascicularis) on Sumatra, Java, Bali, Lombok and Sumbawa, Indonesia. Primates. 1984;25:131–59.View ArticleGoogle Scholar
- Glover I. Archaeology in eastern Timor, 1966–67. Terra Australis. 1986;11:1–241.Google Scholar
- van den Bergh GD, Meijer HJM, Due Awe R, Morwood MJ, Szabó K, van den Hoek Ostende LW, et al. The Liang Bua faunal remains: a 95 k.yr. sequence from Flores, East Indonesia. J Hum Evol. 2009;57:527–37.View ArticlePubMedGoogle Scholar
- Satkoski Trask J, George D, Houghton P, Kanthaswamy S, Smith DG. Population and landscape genetics of an introduced species (M-fascicularis) on the island of Mauritius. PLoS One. 2013;8:e53001.View ArticlePubMed CentralPubMedGoogle Scholar
- Rohland N, Siedel H, Hofreiter M. Nondestructive DNA extraction methods for mitochondrial DNA analyses of museum specimens. Biotechniques. 2004;36:814–21.PubMedGoogle Scholar
- Rohland N, Siedel H, Hofreiter M. A rapid column-based ancient DNA extraction method for increased sample throughput. Mol Ecol Resour. 2010;10:677–83.View ArticlePubMedGoogle Scholar
- Haus T, Akom E, Agwanda B, Hofreiter M, Roos C, Zinner D. Mitochondrial diversity and distribution of African green monkeys (Chlorocebus Gray, 1870). Am J Primatol. 2013;75:350–60.View ArticlePubMed CentralPubMedGoogle Scholar
- Gouy M, Guindon S, Gascuel O. SeaView version 4: a multiplatform graphical user interface for sequence alignment and phylogenetic tree building. Mol Biol Evol. 2010;27:221–4.View ArticlePubMedGoogle Scholar
- Wyman SK, Jansen RK, Boore JL. Automatic annotation of organellar genomes with DOGMA. Bioinformatics. 2004;20:3252–5.View ArticlePubMedGoogle Scholar
- Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–7.View ArticlePubMed CentralPubMedGoogle Scholar
- Castresana J. Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol. 2000;17:540–52.View ArticlePubMedGoogle Scholar
- Stamatakis A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006;22:2688–90.View ArticlePubMedGoogle Scholar
- Huelsenbeck JP, Ronquist F, Nielsen R, Bollback JP. Bayesian inference of phylogeny and its impact on evolutionary biology. Science. 2001;294:2310–4.View ArticlePubMedGoogle Scholar
- Ronquist F, Huelsenbeck JP. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003;19:1572–4.View ArticlePubMedGoogle Scholar
- Posada D. Selection of models of DNA evolution with jModelTest. Methods Mol Biol. 2009;537:93–112.View ArticlePubMedGoogle Scholar
- Gelman A, Rubin D. Inference from iterative simulation using multiple sequences. Statist Sci. 1992;7:457–72.View ArticleGoogle Scholar
- Rambaut A, Drummond AJ: Tracer: MCMC trace analysis tool, version 1.5. [http://tree.bio.ed.ac.uk/software/tracer/]
- Nylander JA, Wilgenbusch JC, Warren DL, Swofford DL. AWTY (are we there yet?): a system for graphical exploration of MCMC convergence in Bayesian phylogenetics. Bioinformatics. 2008;24:581–3.View ArticlePubMedGoogle Scholar
- Drummond AJ, Rambaut A. BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol Biol. 2007;7:214.View ArticlePubMed CentralPubMedGoogle Scholar
- Drummond AJ, Ho SY, Phillips MJ, Rambaut A. Relaxed phylogenetics and dating with confidence. PLoS Biol. 2006;4:e88.View ArticlePubMed CentralPubMedGoogle Scholar
- Vignaud P, Duringer P, Mackaye HT, Likius A, Blondel C, Boisserie JR, et al. Geology and palaeontology of the Upper Miocene Toros-Menalla hominid locality, Chad. Nature. 2002;418:152–5.View ArticlePubMedGoogle Scholar
- Brunet M, Guy F, Pilbeam D, Lieberman DE, Likius A, Mackaye HT, et al. New material of the earliest hominid from the Upper Miocene of Chad. Nature. 2005;434:752–5.View ArticlePubMedGoogle Scholar
- Lebatard AE, Bourlès DL, Duringer P, Jolivet M, Braucher R, Carcaillet J, et al. Cosmogenic nuclide dating of Sahelanthropus tchadensis and Australopithecus bahrelghazali: Mio-Pliocene hominids from Chad. Proc Natl Acad Sci U S A. 2008;105:3226–31.View ArticlePubMed CentralPubMedGoogle Scholar
- Kelley J. The hominoid radiation in Asia. In: Hartwig WC, editor. The Primate Fossil Record. Cambridge: Cambridge University Press; 2002. p. 369–84.Google Scholar
- Leakey MG. Evolution of Theropithecus in the Turkana Basin. In: Jablonski NG, editor. Theropithecus, the Rise and Fall of a Primate Genus. Cambridge: Cambridge University Press; 1993. p. 85–124.View ArticleGoogle Scholar
- Delson E. Cercopithecinae. In: Delson E, Tattersall I, Van Couvering JA, Brooks AS, editors. Encyclopedia of Human Evolution and Prehistory. New York: Garland Publishing Inc; 2000. p. 166–71.Google Scholar
- Alba DM, Delson E, Carnevale G, Colobero S, Delfino M, Giuntelli P, et al. First joint record of Mesopithecus and cf. Macaca in the Miocene of Europe. J Hum Evol. 2014;67:1–18.View ArticlePubMedGoogle Scholar
- Zalmout IS, Sanders WJ, MacLatchy LM, Gunnell GF, Al-Mufarreh YA, Ali MA, et al. New Oligocene primate from Saudi Arabia and the divergence of apes and Old World monkeys. Nature. 2010;466:360–5.View ArticlePubMedGoogle Scholar
- Pozzi L, Hodgson JA, Burrell AS, Disotell TR. The stem catarrhine Saadanius does not inform the timing of the origin of crown catarrhines. J Hum Evol. 2011;61:209–10.View ArticlePubMedGoogle Scholar
- Stevens NJ, Seiffert ER, O’Connor PM, Roberts EM, Schmitz MD, Krause C, et al. Palaeontological evidence for an Oligocene divergence between Old World monkeys and apes. Nature. 2013;497:611–4.View ArticlePubMedGoogle Scholar
- Rambaut A. FigTree: tree figure drawing tool, version 1.4.0. [http://tree.bio.ed.ac.uk/software/figtree/]
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.