- Research article
- Open Access
Genomic insights on the ethno-history of the Maya and the ‘Ladinos’ from Guatemala
BMC Genomics volume 16, Article number: 131 (2015)
Guatemala is a multiethnic and multilingual country located in Central America. The main population groups separate ‘Ladinos’ (mixed Native American-African-Spanish), and Native indigenous people of Maya descent. Among the present-day Guatemalan Maya, there are more than 20 different ethnic groups separated by different languages and cultures. Genetic variation of these communities still remains largely unexplored. The principal aim of this study is to explore the genetic variability of the Maya and ‘Ladinos’ from Guatemala by means of uniparental and ancestry informative markers (AIMs).
Analyses of uniparental genetic markers indicate that Maya have a dominant Native American ancestry (mitochondrial DNA [mtDNA]: 100%; Y-chromosome: 94%). ‘Ladino’, however, show a clear gender-bias as indicated by the large European ancestry observed in the Y-chromosome (75%) compared to the mtDNA (0%). Autosomal polymorphisms (AIMs) also mirror this marked gender-bias: (i) Native American ancestry: 92% for the Maya vs. 55% for the ‘Ladino’, and (ii) European ancestry: 8% for the Maya vs. 41% for the ‘Ladino’. In addition, the impact of the Trans-Atlantic slave trade on the present-day Guatemalan population is very low (and only occurs in the ‘Ladino’; mtDNA: 9%; AIMs: 4%), in part mirroring the fact that Guatemala has a predominant orientation to the Pacific Ocean instead of a Caribbean one. Sequencing of entire Guatemalan mitogenomes has led to improved Native American phylogeny via the addition of new haplogroups that are mainly observed in Mesoamerica and/or the North of South America.
The data reveal the existence of a fluid gene flow in the Mesoamerican area and a predominant unidirectional flow towards South America, most likely occurring during the Pre-Classic (1800 BC-200 AD) and the Classic (200–1000 AD) Eras of the Mesoamerican chronology, coinciding with development of the most distinctive and advanced Mesoamerican civilization, the Maya. Phylogenetic features of mtDNA data also suggest a demographic scenario that is compatible with moderate local endogamy and isolation in the Maya combined with episodes of gene exchange between ethnic groups, suggesting an ethno-genesis in the Guatemalan Maya that is recent and supported on a cultural rather than a biological basis.
The Republic of Guatemala is located in Central America, bordering the Pacific Ocean, between El Salvador and Mexico, and the Caribbean Sea between Honduras and Belize (Figure 1). Guatemala is a multiethnic, multicultural and multilingual country with an estimated population of about 14.7 million people in 2011 (according to the Instituto Nacional de Estadística of Guatemala; INE; http://www.ine.gob.gt).
The main populations are the ‘Ladinos’ (~60%), a term used in Central America (deriving from ‘latino’), and especially in Guatemala, to refer to a mix of Native American and Spanish (and eventually of Africans), and the Maya or ‘Indígena’ (~40%), that constitutes the second most important group in the country. The ‘Ladino’ population of Guatemala is officially recognized by the Ministerio de Educación (MINEDUC; http://www.mineduc.gob.gt/) as a heterogeneous population, which expresses itself in Spanish as a maternal language and possesses specific cultural traits of ‘Hispanic’ origin mixed with indigenous cultural elements. Already in 1690, the chronicler Francisco Antonio de Fuentes y Guzmán described the ‘Ladinos’ as ‘mestizos, mulatos and negros’. There is extensive historical documentation indicating trend in Guatemala to marriages between different ethnic groups . Although the demographic impact of Europeans in Guatemala is difficult to quantify, it is estimated that in the beginning of the XVII century, the indigenous population surviving in Guatemala (and other Central American countries such as El Salvador) constituted only 10% of the total population living in the region before the arrival of Europeans . The impact of the slave trade in Guatemala is also difficult to estimate. Some documentation indicate that in 1773, the population of Santiago de Los Caballeros de la Antigua Guatemala (‘the capital of Centro America’) had 30.000 people, and about 36% of them were ‘mulatos’ (admixed between Africans and Europeans or Natives), and in 1782, the ‘mulatos’ constituted 32% of a total of 13.000 inhabitants in the city of Nueva Guatemala de la Asunción  (the present Capital city of Guatemala). These figures could indirectly indicate the existence of an important amount of slaves in the regions. In contrast, the Trans-Atlantic Slave Trade Database (http://www.slavevoyages.org/) shows that only a few hundred slaves disembarked directly in Guatemala. However, the arrival of important amounts of slaves from other neighboring countries that were more connected to the slave trade (such as Honduras and Belize) cannot be disregarded.
Among the present-day Maya from Guatemala, there are more than 20 different ethnic groups including the K’iche’ (9.1%), Kaqchikel (8.4%), Mam (7.9%), Q’eqchi’ (6.3%), and minority groups such as Achi, Akatek, Chuj, Ixil, Jakaltek, Poqomam, Poqomchi’, Q’anjob’al, Tz’utujil, Uspantek, etc. (altogether 8.6%; according to the 2001 census). Ethnicity names usually refer to the indigenous language spoken by the group members. Although Spanish is the official language in Guatemala today, there are 23 officially recognized Native American languages. It is not uncommon that people from one region of Guatemala do not understand the language of a neighboring region. For most Mayan inhabitants, Spanish is a second language, and many Maya do not speak Spanish at all in some areas of the country. Today, the largest proportion of the Guatemalan Maya population lives in the highlands (where the majority of the studied samples of the present study have been taken), but there are also inhabitants in other rural areas, such as El Quiché department. Other minority Native American groups in Guatemala are the Garifuna and Xinka (0.1%).
The Maya constituted vast kingships during a long period over the Mesoamerican landscape (a term describing Mexico and Central America within which a number of pre-Columbian societies flourished before the Spanish colonization in the XVI century ), a reign that lasted for about three thousand years (kya) and was one of the most advanced civilizations within the New World. The first concrete traces of the Mayan civilization (dating back to the Pre-Classic period around 1,800 BCE) were found in the Mirador Basin of the northern department of Petén (Guatemala), though some settlements are thought to be over 6 kya old . The Mirador Basin is part of a larger region (known as the Guatemala’s Maya Biosphere Reserve) that overall is considered to be the cradle of ancient Maya civilization (>175 archaeological sites).
According to current knowledge, a single language existed among the earliest Maya. This Proto-Mayan is thought to have been spoken at least ~4 kya ago and may be the common ancestor of all modern Mayan languages today, as well as the Classic Maya languages documented in the hieroglyphic inscriptions . Reconstructive and descriptive linguistic studies of ancient Proto-Mayan target the Guatemalan highlands as the birthplace of this ancestral language . Because of the isolation of Maya posted by vast distances and the ecological diversity of their territories, regional conflicts, sporadic migrations, and ever-changing political systems, their language has had acquired many pronunciations, and over time, those dialects have spawned new languages .
During the Pre-Classic Period (2000 BCE–250 CE), a great linguistic diversity developed, comprising 16 language families. Unlike other scattered populations of Mesoamerica, the Maya were centered in one geographical area covering the entire Yucatan Peninsula and modern-day Guatemala; Belize and parts of East Mexico; and the western region of El Salvador and Honduras.
It was during the Classic period (AD 250–900) that the Maya civilization reached the peak of its power and influence and it was one of the most dominant indigenous societies of Mesoamerica. During this period, the Mayan civilization had become a complex and dynamic entity of independent city-states undergoing a series of population expansions and contractions [5,7,8]. These fluctuations may reflect episodes of migration at various times during the Classic period. By the Late Classic period (AD 600–900), much of the Maya region was organized into two competing “super-states,” headed by the hegemonic powers of Tikal and Calakmul [2,9]. By the terminal Classic period, massive declines in population size led to the abandonment of many Maya territories. The reason for this subsidence remains largely unknown, although theories invoke environmental over-exploitation with all of its consequences as well as constant warfare in a landscape divided among numerous competing city-states as the main reasons.
In the Post-Classic (AD 900–1,500) period, the fragmentation process led to a fusion of Maya settlements from the southern Yucatán highlands into the regionally dominating K’iche’ and later Kaqchikel states . Constant strains within the Maya region and with non-Mayan groups (Aztecs and Toltec of Mexico) led to the final collapse of the civilization prior to the arrival of the Spanish .
Nowadays, Maya descendants occupy the territories of Mexico, Guatemala, Belize, El Salvador, and Honduras. Maya people mostly follow their traditional way of life, including costumes, indigenous languages, and religious ceremonies. One of their most remarkable cultural traits is the faithful count of days according to the Maya calendar.
Historic evidence based on patterns of material culture (ceramics), as well as geographic variability in agricultural practices and socio-political structures suggest a degree of regional isolation which leads to an explanation of the Classic Maya population structure as a model of isolation by distance (IBD). This model describes the tendency of populations that are geographically closer to be more similar than populations that are further apart . Such a model would consider ancient Maya as relatively non-mobile population groups and inter-population gene flow restricted to neighboring sites. Over their millenary history and given the great distances between their communities, strengthened by separation through geographical barriers, warfare, and their political system of independent city-states, it may be expected that the Maya would have diverged into several distinct populations. Contradictory evidence (mostly inscription based) shows however long-distance trade, elite visits and marriage and intercity conflicts with captive-taking, as well as the mobility of general populations. On the other hand, Mayan art, architecture and rituals suggest a high degree of cohesiveness throughout their domain. Overall, there is enough evidence indicating that certain gene flow occurred across the entire Maya area during the Classic period.
While both a rich archaeological record and hieroglyphic dataset led to a better understanding of the Classic Maya population history compared to most other ancient Native American cultures , biological investigations of ancient Guatemala population history are mainly limited to osteology  and dental studies [12,13]. These studies arrived however to contradictory findings. Thus, dental morphology examinations found evidence for biological discontinuity at Seibal (Petén) between the Late and Terminal Classic periods . Using the same approach, it could be shown that skeletons of Jaina (Yucatán, Mexico) demonstrated a stronger affiliation to the Petén site than they did to nearby Chichen Itza in Mexico . In addition, distinct regional clustering could not be found by odontometric comparison of individuals from five Maya sites in the Yucatán Peninsula ; these studies suggested that extensive gene flow dominated Classic Maya population structure with genetic exchange not only between intrazonal neighbors, but at least partially between long distances. Analyses of odontometric variation, as carried out by Sherer et al. , suggested the existence of important gene flow during the Classic period; these authors found an overall F ST value of 0.018, indicating little among-group variability for the Classic Maya sites tested. Furthermore, chemical evidence by isotope analyses points in the same direction, suggesting that migration of both elites and general populations was greatest during the Early Classic period [17,18].
Analysis of uniparental DNA markers (especially mitochondrial DNA [mtDNA] variation) and autosomal DNA markers has been widely used to explore demographic patterns throughout the Americas [19-36]. However, genetic analyses on present-day populations from Guatemala are limited to only a few studies. The study by Ibarra-Rivera et al.  on autosomal STRs (typically used in forensic genetics) revealed that Maya showed fewer alleles and decreased levels of heterozygosity compared to Asians, Europeans, North And South Americans, and even non-Mayan Mesoamericans. This study also supplied evidence that the Guatemalan Mayan groups appeared less genetically variable than their Yucatan counterparts, which supports the thesis that the Guatemalan Maya have experienced less gene flow than the Maya from the Mexican Peninsula . Note that increased allelic diversity is expected in the Yucatan’s plateau because of the greater accessibility afforded by a lack of major geographical barriers [5,39]. The limited genetic diversity of the Maya was indicative of isolation, founder effects, bottlenecks, limited gene flow from neighboring non-Mayan peoples, and/or possible inbreeding. Overall, the results of this study suggested that though some genetic variability exists between Mayan groups, there is a higher degree of homogeneity between them than when compared with other Mesoamerican populations. This led to the presumption that distinct Mayan settlements did not evolve in isolation from one another. In fact, the data indicate that the cultural similarities shared by the Maya are reflected in their genetic profiles and are not merely a result of geographic and/or cultural interactions . Furthermore, data on HLA genes , and polymorphic Alu insertion (PAI) loci  in the actual Guatemalan population were also generated. When comparing HLA allelic frequencies in the Maya from Guatemala with other worldwide populations, the Maya were found to differ genetically from the Mixe and Oaxacan-Mexican Native Americans and to show a close relationship to the Arhuacs, Kogi, and Arsario tribes of the Caribbean. Based on the Alu insertion polymorphisms, two geographically adjacent Mayan populations from the Guatemalan highlands (K’iche’ and Kaqchikel) were found to be more similar to each other than to populations from Yucatán. Other studies reported autosomal data on the distribution of standard autosomal STRs in the actual ‘Ladino’ as in the Maya population of Guatemala [37,41] and Mexico .
Genetic data on uniparental markers of Guatemalans are also very limited. Nonetheless, forensic genetics played a key role in the investigation of the Maya homicide during Guatemala’s 30-year-long Civil War by the Fundación de Antropología Forense de Guatemala (FAFG; http://www.fafg.org/) . Almost 20 years ago, DNA extraction from bones and sequencing of the mtDNA HVS-I was carried out after the exhumation of a 10-year-old clandestine mass grave. The study involved samples collected in a Quiché Indian village located close to the provincial capital of Santa Cruz de Quiché , and lead to the determination of 16 different mtDNA haplotypes. Recently, a study on 17 Y-STR loci in a set of 115 ‘Mestizo’ and 110 Maya males allowed further insight into the actual level of genetic variability and population structure of the populations of the country . The authors clearly identified Guatemalans as predominantly Native Americans and detected a population sub-structure differentiating ‘Mestizos’ from Mayans to certain extent.
Finally, studies on Mayan ancient DNA were not successful due to the poor preservation of Maya skeletons .
The aim of the present study is to shed light on the demographic and the ethno-history of the Guatemalan indigenous populations and characterize the admixture proportions of Mayas and ‘Ladinos’ of Guatemala by means of uniparental (Y-chromosome and mtDNA) and Ancestry Informative Markers (AIMs).
Results and discussion
Mitochondrial DNA control region variation
Additional file 1 reports the full mtDNA control region plus mtSNP haplotypes obtained in the present study, and provides the haplogroup classification according to the level of phylogenetic resolution obtained.
Guatemala shows a main mtDNA Native American component (99%). All haplotypes, except one, can be classified into one of the main Native American mtDNA haplogroups: A2 (75%), B2 (14%), and C1 (10%), (Figure 2A). Within A2, the most common sub-haplogroups are A2 + C64T (35%), A2p (9%) and A2w1 (9%). Within B2, the most common sub-haplogroup is B2t, accounting for 5% of the total mtDNAs.
All Guatemalan Native American profiles were resolved into a maximum parsimony network (Figure 3). The 16 K’iche’ HVS-I haplotypes reported by Boles et al.  were also included in these networks (see also Additional file 1). The phylogeny of haplogroup A2, the most common haplogroup in the Maya and ‘Ladino’ (Figure 3A), is mainly star-like, but it also shows some derived branches containing haplotypes that appear overrepresented, and as is the case for branches A2 + @T16362C, A2 + T16092C, A2q, A2p3a, etc. (Figure 3A). The most common control region haplotype corresponds to the root of haplogroup A2. One interesting feature of the network of haplogroup A2 is the large proportion of haplotypes that are shared between the different ethnic groups. This is particularly notable for those better represented our sample, that is, Q’eqchi’, Poqomchi’ and K’iche’. In other words, there is no particular clade that is overrepresented in one of the Maya groups.
Although the sample size is lower than for A2, haplogroups B2 and C1 show similar phylogenetic patterns (Figure 3B and C, respectively).
As expected, the admixed population of ‘Ladino’ follows the same pattern as the Maya; their haplotypes are scattered through the different branches of the A2, B2, and C1 phylogeny.
A phylogeographic connection between Guatemala and North and South America is evident not only for the most common haplotypes but also when examining singular haplotypes. For instance, A2 haplotypes #GT06 and #LaTinta_20 (Additional file 1), characterized by a reversion at T16362C and T16140C on top of the basal haplotype, is uncommon in North America, but appears in Mexico  and in the North of South America, such as Bolivia , Peru , etc. A similar distribution has the haplotype root of A2 + C16266T. The lineage B2o, observed in four Guatemalans, is also found all over the American continent, from Native North Americans  to the Southern Cone .
However, a large number of other haplotypes have a clear predominant or even exclusive distribution in Mesoamerica. Some examples are: (i) A2u members appear mainly in Panama , El Salvador , and Mexico [31,50], (ii) root of A2 + A16299G is also very frequent in El Salvador , Costa Rica , Nicaragua , and Panama , and (iii) root of A2 + A16274G (haplogroup A2d1a) appearing in Panama [29,53], and Mexico .
Additional searches of all of the profiles included in Figure 3 were carried out in other databases. For instance, in haplotype queries performed in EMPOP, and excluding the so-called “admixed” individuals from the USA (composed in part by individuals coming from different American countries, e.g. Mexico), indicated that the great majority of haplotypes were found almost exclusively in Mesoamerica (~61% in Mexico and ~33% in Guatemala).
Some of these and other phylogeographic features became even clearer when entire genomes were examined (see next section).
In the Guatemalan samples, there was only one haplotype (#GT24) of recent Sub-Saharan ancestry belonging to L3b1a. This haplotype could have arrived in Guatemala at the times of the Trans-Atlantic slave trade [55,56], although this haplogroup is more common in East Africa than in West or West-Central Africa [57,58]. It is important to note that this donor does not show any notable African autosomal ancestry (AFR: 0.3%, EUR: 64.6%, AME: 35.1%; see below) and describes itself as ‘Ladino’. This suggests that the carrier of this African lineage is not a very recent arrival into Guatemala.
Phylogeography of Maya mitogenomes and Time of the Most Recent Common Ancestor (TMRCA)
Ten out of the twelve mitogenomes analyzed belong to haplogroup A2, while two belong to haplogroup B2 (Additional file 2). The mitogenomes obtained allowed improved resolution of the mtDNA phylogeny within these Native American haplogroups. In particular, we have defined eight new clades, namely A2w1a, A2w1b, A2w2, A2w3, A2w4, A2p3, A2ar, and B2t1 (together with other minor sub-clades).
Two Guatemalan B2 haplotypes allowed the topology of the haplogroup B2t branch to be re-defined with respect to the current Build 16 version of Phylotree (Figure 4). There are four complete genomes belonging to B2t available; all of them share the sequence motif A10792G-A15244G-C16259T-T16357C-C16467T. The two Maya mitogenomes were found in two Q’eqchi’; these haplotypes are identical and share the transition C16095T. The other two mitogenomes (one sampled in Mexico and the other of unknown origin) and one semi-complete genome (EF657505; coding region) share the transition G15884A and determine the sub-lineage B2t1 (Figure 4). A search of the characteristic HVS-I motif of B2t in the literature (excluding substitution C16467T because it is absent in most control region datasets) reveals that this is a minor haplogroup in America; however, it is present at moderate frequency (5%) in the Nahua, in ‘Mestizos’ from Mexico [31,50] and in an ‘Hispanic’ dataset from USA . Further search of B2t control region profiles in EMPOP reveals the relevant presence of B2t members in Mexico, most of them in the Zoque, which is an ethnic group inhabiting neighboring regions (Chiapas) to Guatemala. There are also a few B2t members classified as ‘admixed’ individuals from the USA (which could constitute recent arrivals from Mexico). B2t therefore has a dominant Mesoamerican distribution. The estimated coalescence age of B2t and B2t1 is 5.8 and 4.1 kya, respectively (Table 1).
Ten Maya mitogenomes could be classified within different sub-branches of haplogroup A2 (Figure 5). Five of these mitogenomes carry the very stable diagnostic coding region variant C10199T (one mutational hit in Phylotree and 0 in ), and therefore belong to haplogroup A2p. Figure 5 shows the updated topology for A2p, which is reconstructed on the basis of 13 mitogenomes and one coding region segment (EF657488). A2p, as a whole, can be dated in 8.8 kya (Table 1). Seven out of these 14 mtDNAs were sampled in Mexico and five in Guatemala (there is no geographic information for the other two; both belong to sub-clade A2p2). Two pairs of mitogenomes from Mexico allow two sub-branches of haplogroup A2p1 (A2p motif + G5585A-T6488C-A8537G) to be determined: A2p1a (A2p1 motif + T16092C) and A2p1b (A2p1 motif + C16400T). A2p1 is a recent clade with an estimated divergence age of 3.7 kya, while A2p1a and A2p1b are 1.3 and 2.0 kya old, respectively. The five Guatemalan samples belong, together with one Mexican haplotype (HQ012055), to haplogroup A2p3 (A2p motif + C16234T); however, all of the Mayan sequences share the substitution T16209C, thus determining a new sub-branch, A2p3a. A2p3a is also relatively new, with an estimated age of 1.5 kya. By searching the control region motif of A2p in the EMPOP dataset, A2p, as a whole, appears mainly in South America (Colombia and Venezuela). A few members of A2p3 can be found in control region databases in Mexico , or even sporadically in the North of South America (Venezuela ), but the Maya clade A2p3a seems to be basically restricted to the Guatemalan territory.
The sequence motif A7124G-T1101C defines haplogroup A2w, and its topology was determined by 32 mitogenomes, a large number of them analyzed within The 1000 Genome Project in a Colombian sample set (Figure 5). Thus, 27 A2w mitogenomes appeared in Colombia, two in Guatemala, one in Mexico, and one in a ‘Hispanic’ population from the USA (there is an additional mitogenome of unknown origin). In Phylotree, there is only one sub-lineage determined within this haplogroup, namely, A2w1, with the diagnostic motif 573.XC-C16187T. We describe here three additional branches: A2w2, A2w3, and A2w4. The two A2w Guatemalan profiles match entirely and were found in the Q’eqchi’; they share six transitions and belong to the sub-clade A2w1a1. The general topology of A2w is far to be star-like (as measured by the star-likeness index, Additional file 3), and its estimated age using the average distance to the root is paradoxically larger than the age of the entire A2w. This indirectly denotes that sampling of mitogenomes belonging to this sub-clade is probably sub-optimal (dominated by haplogroup members mainly from Colombia). Confirmatory evidence comes from the fact that a search of the control region motif of A2w1a reveals the presence of this haplogroup, mainly all across the Mesoamerican territory, e.g. Panama , Costa Rica , Nicaragua , El Salvador , the Garifunas (and Chocó from Caribbean Colombia) . Additional haplotype searches in EMPOP indicate further matches in Honduras and Guatemala (as well as some admixed individual in the USA). Therefore, A2w1 has a wide Mesoamerican distribution but is most likely very prevalent in many South American locations apart from Colombia. Unfortunately, most of the A2w1 sub-branches are not searchable through control region motifs (Figure 5). TMRCA of A2w, as estimated from maximum likelihood (ML), is 9.9 kya; A2w1 would be its oldest sub-clade (9.3 kya). A2w3 is much younger (3.5 kya), as there are some minor sub-clades such as A2w1b (1.5 kya) or A2w1a1a (1.6 kya). The clade containing the two Guatemalan sequences, A2w1a1, is 7.9 kya.
The two HVS-I variants T125C-T127C alone determine haplogroup A2ar. It is important to note that this seeming distinctive sequence motif occurs independently several times in the worldwide phylogeny (e.g. D4l2, L0d2d, M12a1; Phylotree). There are three mitogenomes in A2ar, curiously, two of them sampled in Peru, and one in a Q’eqchi’ individual. This suggests that this minor clade has a Mesoamerican and South American distribution. A2ar seems to be an old sub-clade (12.2 kya) within the phylogeny of A2 (Table 1).
There are two additional A2 Q’eqchi’ individuals sharing exactly the same variants (#4 and #5 in Figure 5). The control region motif of this branch is rare; the most closely related mtDNA in the Americas is a Colla individual from Argentina .
Demographic patterns of the Maya as inferred from mtDNA data
The phylogeny of mtDNA control region haplotypes suggests a complex demographic history in Guatemala as a result of the superposition of different demographic events. The control region network mirrors a main star-like topology, most likely indicating the existence of a recent demographic expansion in the region. This expansion could perfectly fit with the growth of the main Maya centers during the Classic period, about 1.8 kya ago. Superposed to this star-like phylogeny are some deep branches that seem to signal an underlying ancient, more stationary demographic history (which is more clearly revealed by analysis of the complete mitogenomes). The presence of some derived haplotypes from the root occurring at a relatively high frequency reflects the existence of founder events in the different ethnic groups or relative isolation. Furthermore, the presence of identical haplotypes in the analyzed mitogenomes adds further support to the existence of moderate isolation of Maya into relatively small consanguineous groups. However, gene flow between these isolated groups also occurred in Guatemala, as testified by the existence of many haplotypes shared between different Maya groups [19,20,33].
Analysis of mitogenomes reveals a few interesting features of the past Guatemalan demography (Figures 4 and 5). Some haplogroups, for example A2ar and A2w (and some of its sub-clades), date back to the Paleo-Indian period in the chronology of Mesoamericans. These clades appeared about 10–12 kya, and could have arisen in Mesoamerica or in the limits with North America soon after the initial colonization of the Americas; they could have moved in successive colonization waves as far as to the southern continental cone (as already reported for other clades [19,20] or based on autosomal markers ). When examining the combined picture provided by the mitogenomes and the control region data, this is supported by the high prevalence of these clades in Mesoamerica and in South America, but only sporadically in admixed individuals from North America.
Some mtDNA clades examined in the present study provide clear evidence for the existence of an important gene flow occurring between the territories of Mesoamerica and South America during the Pre-Classic Era about 4 kya, connecting Mexico, Central American populations and South America (testified by the presence of some of these lineages in Venezuela [A2p3] or Colombia [A2w] or in Peru [A2ar]). The data cannot disregard the possibility of migrations from South America to Central America and the Caribbean. There are previous evidences pointing to this possibility [21,30,65] but the magnitude of these migrations needs further investigation.
The phylogeographic characteristics of other mtDNA clades, however, point to demographic movements occurring to a more regional scale, almost exclusively within the Mesoamerican area. A number of these clades date back to the Pre-Classic and Classic Era (Figure 6); the development of the main Classic Maya Centers during the Classic period. This pattern can only be explained if a considerable gene flow across the different Maya territories is assumed.
Other population movements occurring during the Post-Classic Era (involving the Aztec, Mixtec, Totonac, Pipil, K’ich’e, Kaqchikel, among others) could also contribute to the dispersal of these lineages into this region. In particular, the role of the Nahua people [66,67], also referred in the literature to as Aztecs (Aztec civilization), could be particularly important as a source of more recent gene flow between Mexico and Guatemala. The Nahua received different denominations in different places. For instance, these groups were known as Pipiles in Guatemala, and their language was known to be a variant called Náhuatl Pipil. Various source of evidence (archaeological, linguistic, etc.) suggest that the Nahuas could have originated in the deserts of northern Mexico and southwestern USA and migrated into Central Mexico in several waves. Although the origin of the Nahua people is uncertain; it is well-known that the Nahua occupied the Mesoamerican territories ranging across modern-day Central Mexico to southwards in Central America in the XVI century, including Guatemala, El Salvador, Nicaragua, and even as far South as Panama . The Pipiles were extinguished with the arrival of the Spaniards in colonial times, and the Nahua were gradually assimilated into ‘Mestizo’ society in most places. The last of the southern Nahua populations are the Pipil of El Salvador . Some lineages found in Guatemala, such as haplogroup B1t1, are still found at high frequencies in present-day Nahua-speaking people from Mexico .
The large amount of shared variability observed between the different Maya ethnic groups (and with other Mesoamerican populations) analyzed in the present study and the lack of specific variability characterizing them is in agreement with a previous genetic study which obtained signs of genetic homogeneity among various Maya groupings by G-tests . In contrast, these authors found significant heterogeneity from pair-wise comparisons between the Maya and other regional non-Mayan populations . This would suggest that Mayan ethno-genesis is most likely very recent, perhaps occurring during the development of the Nahua civilization (1,100-500 ya). The large divergences observed in other cultural aspects of the Maya, such as linguistic ones, have probably developed very recently in the overall history of the Maya.
Y-chromosomal SNP variation
The complete genotype results for Y-chromosomal SNP variation are given in Additional file 4. Haplogroup Q is the major branch on the Y-chromosome tree (89%) in the male Maya population set (Figure 2B). Q1a3a1(×Q1a3a1a-c) represents the most common haplogroup (81%), and 8% of the Y-haplotypes fall within Q (×Q1a3a1). The remaining subjects belong to the European haplogroups R1 (9%) and J2 (2%). The R1 sub-clades detected in Guatemalans were R1b1a2*(×xR1b1a2a1a1, R1b1a2a1a2a1b1a, R1b1a2a1a2b, R1b1a2a1a2c1a1a1), represented by three samples (two Q’eqchi’ and one ‘Ladino’), and one R1a1 member observed in one single K’iche’ individual. The J2 carrier self-describes as ‘Ladino’ and also reported two generations of ‘Ladino’ ancestry interrupted by a Q’eqchi’ maternal grandmother. J2 is the most common haplogroup in Europe .
The individual described above (#GT24) bearing the mtDNA Sub-Saharan haplotype L3b1a carries a Y-chromosomal haplogroup R1*(×R1a, R1b1) of European ancestry.
Principal component analysis (PCA) and admixture analysis based on AIMs
PCA plot (Figure 7A) based on the 46 AIMs analyzed in the present study (Additional file 5) shows the relationship of the Guatemalan individuals with the three main CEPH panel continental groups, namely, Africans, Europeans and Native Americans, in the Euclidean space. The three reference continental populations show a clear differentiation (Figure 7A; left). PC1 (28%) separates Africans from non-Africans, while PC2 (17%) separates Europeans from the other two groups. Guatemalan Maya profiles all fall within the Native American cluster. Instead, ‘Ladino’ profiles form an scattered cluster located between Native Americans and Europeans; this pattern becomes clearer in a second PCA when eliminating the African reference samples (see PC1 [29%] in Figure 7A, right). The projection of the ‘Ladino’ profiles towards the European pole in the PCA mirrors a moderate European admixture in these individuals. On the other hand, there is no clear differentiation between different Maya ethnic groups.
Additional analysis carried out using Discriminant analysis of Principal Components (DAPC) underlines the outcomes of PCA and provides further assessment of between-population structures (Additional file 6).
The admixture bar-plot in Figure 7B indicates the ancestral membership for each individual in the three reference populations (African, European, and Native American) and the Guatemalan AIM profiles. Only the results for the optimal k = 3 are represented. These three components perfectly separate the profiles belonging to each of the main ancestral continental populations. The admixture bar-plot shows that most of the Guatemalan individuals have a dominant Native American ancestry (see also Figure 2C). However, a tiny portion of European co-ancestry at different scales can be observed across all Mayans. Therefore, admixture analysis agrees well with the results observed in PCA. Thus, for instance, those Guatemalan profiles with a higher European component correspond to those located close to the European cluster in the PCA (Figure 7A). Also consistent with previous analysis is the finding that no notable differences could be detected between the different ethnic Maya groups analyzed in this study. As expected, European co-ancestry is substantially higher in the ‘Ladino’ samples.
In contrast to the significant Native American and European ancestry of Guatemalans, the average African component is very low in Guatemala, and it appears almost exclusively in ‘Ladinos’ (3.6%). There was only one Maya who shows a moderate percentage of African co-ancestry (4.4%). This subject (#LaTinta_08, female) is of self-described Q’eqchi’ ancestry and carries a Native American mtDNA haplotype (B2t). This percentage of African ancestry in this Q’eqchi’ individual could simply mirror the variability of ancestry estimates using panels of AIMs containing a limited amount of SNPs , and not necessarily a real African genome ancestry.
Finally, for the AIM-InDel marker rs34122827, we found a third allelic state in one Mayan sample (#Marco_03). This allele corresponds to a T deletion occurring in the long allele background (allele 2D68Tdel). Interestingly, this variant was found to be specific to Europeans , whereas the carrier in our study is of K’iche’ ancestry.
The results of uniparental loci show that the Maya population samples are mainly composed of Native American haplogroups with a minor presence of sub-Saharan (only on the mtDNA) and/or European lineages (only on the Y chromosome). AIM-InDels also points to the predominant Native American nature of the Maya (Figure 2C). In addition, ancestry proportions were different between ‘Ladinos’ and Mayans for the Native American and the European components, which is in agreement with previous studies .
In ‘Ladinos’, the main ancestry proportions are the Native American component (mtDNA: 91%; AIMs: 55%), and the European component in the male-specific genome (Y-chromosome: 75%). These results mirror the important demographic impact of the European colonizers in Guatemala (with a large effective population size) and their role in the extinction of the Native American population from the region. In particular, the patterns observed in ‘Ladinos’ indicate that the male population from Guatemala suffered more dramatically the consequences of the European conquest as mirrored by the differential ancestry components of the mtDNA and the Y-chromosome in the ‘Ladinos’. The Native American component of present-day Guatemalans was much better preserved in both male and female Maya, probably thanks to their geographic isolation in very inaccessible areas of the country.
Although African slaves arrived in Guatemala in the period between the VI and XVII century to replace the indigenous population as a labor force , our data indicate that the African genetic legacy in Guatemala is very low, and this agrees well with the documentation indicating the few amount of slaves arriving directly to the country. This is in contrast to other American populations, e.g. in Colombia , Brazil  and the Caribbean , but is in agreement with the patterns observed in El Salvador , which has no coast in the Caribbean (Guatemala has also limited contact with the Caribbean sea and even today, the country has difficult access through this coast). As shown by the admixture analysis based on AIMs, African ancestry is higher for ‘Ladinos’ (3.6%) than for the Maya (virtually 0%). The results as a whole are also in good agreement with the census: in modern-day Guatemala, ‘Afro-Guatemalan’ individuals comprise only ~1% of the total population and are found solely in a few communities living at the Caribbean coast where no subjects were recruited for this study.
Overall, the data reveal the existence of a fluid gene flow in Mesoamerica and a predominant unidirectional flow towards South America. The main movements could have occurred during the Pre-Classic (1800 BC-200 AD) and the Classic (200–1000 AD) Eras of the Mesoamerican chronology. This period coincide with development of the Maya, which was the most distinctive and advanced Mesoamerican civilization. Phylogenetic features of control region mtDNA data and the mitogenomes analyzed also suggest a demographic scenario that is compatible with moderate local endogamy and isolation in the Maya combined with episodes of gene exchange between ethnic groups. This pattern of variability is in agreement with a recent ethno-genesis of the Maya, which seems more established in cultural rather than a biological basis.
There is one main limitation in the present study. Thus, most of the demographic inferences carried out in this study are devoted to the analysis of the mtDNA variation (which only records the demographic processes affecting exclusively the female population). This is mainly due to the fact that the level of resolution provided by the mtDNA in our study is high compared to the resolution obtained for the other markers analyzed. Y-chromosomal and autosomal markers were genotyped in order to provide a more complete picture of the genetic patterns of Guatemalans. For instance, these markers have revealed the existence of an important gender-bias in this country (as it occurs in other American countries [74,75]), which moreover differs in ‘Ladinos’ and Maya.
Lastly, the data generated in the present study represent one of the very few genetic studies carried out in Native Guatemalans, and the ethnic groups sampled are analyzed here for the first time with a particular combination of uniparental and AIMs. The results provide new insight into the admixture characteristics of the Guatemalan population, with a clear gender bias observed in the ‘Ladinos’ but virtually absent in the Maya. The data also show important insights into the demography and the ethno-history of the Guatemalans and the important role of Mesoamerica as a passageway between North and South America. Last but not least, the data are also of particular interest from a forensic genetics point of view, as the results of our study may also contribute to the on-going work of the Fundación de Antropología Forense de Guatemala (FAFG) in prosecuting crimes against humanity that took place during the 1960–1996 civil war .
Sampling and DNA extraction
‘Ladino’ individuals were mainly recruited in cities (Guatemala Capital City and Cobán in the department of Alta Verapaz), while indigenous people were sampled directly in their communities in and around the highlands of Guatemala (Verapaz), the geographic heart of the country (Figure 1). Guatemala does not have legal regulations on the usage of native inhabitants DNA pool (according to the Instituto Nacional de Ciencias Forenses during the sample period; INACIF; http://www.inacif.gob.gt/). However, we ensured that every subject fully understood the aim of our study, which conforms to the Spanish Law for Biomedical Research (Law 14/2007-3 of July) and which was approved by the ethical commission of the Universidade de Santiago de Compostela. A document of informed consent was translated by a native Maya translator to the members of the villages and in particular to the donors. In case volunteers were analphabetic, fingerprints were used as signatures. The individual ethnic origin of participants was recorded by a detailed genealogy questionnaire. If self-reported family relationships were recognized during recruitment, just samples from one family member were considered for the analysis, independently from the degree of relationship. Distant relationships cannot be disregarded.
Recruitment of samples was limited by two main factors: (i) Maya linguistic diversity and their reservation towards medical study participation, and (ii) logistical difficulties for DNA sampling in a partially rough terrain with very difficult access.
DNA extraction was carried out from saliva samples on buccal swabs by organic standard procedures.
Mitochondrial DNA sequencing and mtDNA SNP genotyping
A total of 110 samples (2 Achi, 2 Kaqchikel, 2 K’iche’, 11 ‘Ladino’, 18 Poqomchi’, and 75 Q’eqchi’) where analyzed for the mtDNA (see Additional file 1 for more details). All samples were amplified and double-strand sequenced for the entire mtDNA control region. Mutations are referenced with respect to the revised Cambridge Reference Sequence (rCRS) [76,77]. Haplogroup nomenclature follows Phylotree Build 16 (http://www.phylotree.org; ). The sequences were initially classified into haplogroups using HaploGrep  and manually checked according to recommendations . Potential sequence artifacts were checked as reported previously [81-83]. In order to increase the phylogenetic resolution of mtDNA HVS-I/II within the Native American phylogeny, we genotyped coding region mtDNA SNPs (mtSNPs) using a single multiplex SNaPshot reaction, as described previously [64,84]. Unexpected mtSNP phylogenetic patterns according to the known phylogeny were confirmed by repeating the SNP genotyping using single-plex minisequencing and automatic sequencing.
Based on the information provided by the control region profiles (Additional file 1), 12 Native American lineages (carried by 10 Q’eqchi’ and 2 Poqomchi’) were selected for entire mtDNA genome sequencing following previously described protocols [27,85]; Additional file 2. The criterion for selection was mainly based on the particularities of the mutational changes carried by these profiles when compared against the known variability in other Native American datasets and phylogeny. The complete genomes analyzed in the present study have been submitted to GenBank under the accession numbers KM051465-KM051476.
Y-chromosome SNP genotyping
A total of 58 males (1 Kaqchikel, 2 K’iche’, 4 ‘Ladino’, 8 Poqomchi’, and 43 Q’eqchi’) were genotyped for the Y-chromosome (Additional file 4) using a set of 26 SNP markers (see phylogeny in Additional file 7). Sixteen of these SNPs were analyzed in two reactions (Additional file 4) following the strategy of compound multiplexes described previously . We adopted the revised haplogroup tree by the Y Chromosome Consortium YCC (2008)  and nomenclature adjustments according to the Y-DNA Haplogroup Tree 2013 by the International Society of Genetic Genealogy  (Additional file 7). We also applied two additional multiplex reaction containing SNPs M242, M3, M19, M194 and M199 (Additional file 4), which identify Native American populations as described before , as well as Y-SNPs M167, M222, U106, U198 and U152 (Additional file 4) belonging to the R1b1a2 haplogroup, in order to determine the most frequent European haplogroups.
Genotyping of Ancestry Informative Markers (AIMs)
The same samples analyzed for mtDNA were also genotyped for 46 AIM-InDel markers , which allow the proportion of ancestry accounted for main continental groups to be estimated (Additional file 5). AIM-Indelplex PCR amplification and capillary electrophoresis were performed as described previously .
Phylogenetic analysis and estimation of coalescent times
We used HVS-I data to build phylogenetic networks with the aid of the program Network 18.104.22.168 [90,91] and by hand. Hypervariable sites in HVS-I segment such as A16182C, A16183C, and T16519C were not considered (as usual).
Maximum parsimony trees were built for the complete genomes obtained in the present study and those collected from the literature belonging to haplogroups represented by the Guatemalan mitogenomes, and following the known worldwide phylogeny (Phylotree). Estimation of the coalescent times of the most recent common ancestor (TMRCA) was computed using two different procedures.
TMRCA was initially calculated using a ML procedure (Table 1). For this purpose, the software PAML 3.13  was used assuming the HKY85 mutation model (ignoring indels, as usual) and using gamma-distributed rates (approximated by a discrete distribution with 32 categories) and three partitions: HVS-I (positions 16051–16400), HVS-II (positions 68–263), and the remainder.
TMRCA was also computed from the averaged distance (ρ) of the haplotypes of a clade to the respective root haplotype together with a heuristic estimate of the standard error (σ) calculated from an estimate of the genealogy (Additional file 3). These estimates were computed on the mitogenomes considering (i) the whole variation observed (excluding indels and hotspots) and (ii) using only synonymous mutations. The ‘star-likeness’ of the trees was measured using the star index ρ/n × σ 2; this index can take values between 1/n (single haplotype representing n mtDNAs) and 1 (perfect star phylogeny) [23,93].
Both methods show very similar divergence ages when applied to mitogenomes. However, the averaged distance to the root shows an anomalous behavior on A2w1 and its sub-clades, with ages that are about twice (averaged on all sub-clades) larger than estimates based on ML (compare to a 1.2 of averaged discrepancy for the rest of the sub-clades). Estimates based on synonymous mutations show also large discrepancies with the ML method. In addition, A2w1 shows very low values of star-likeness (Additional file 3), which could be indicative of an overrepresentation of the A2w1 mitogenomes sampled in South America (coupled with the underrepresentation of A2w1 members from other Mesoamerican locations where this clade is probably present) or simply due to a limited sample size in this phylogenetic branch. Overall, the existence of a non-star-likeness phylogenetic pattern in A2w1 is what makes the ML method more reliable and consistent for the estimation of TMCRA. Thus, ML estimates were used for discussion throughout the text.
Mutational distances were converted into years using the corrected molecular clock proposed by Soares et al. .
Admixture proportions from autosomal data were inferred by comparing genetic profiles from the present study with those publicly available from the Human Genome Diversity Cell Line Panel, HGCP-CEPH (Centre d’Etude du Polymorphisme Humain; ). These reference parental samples (N = 327) came from populations of three different continents: Africa (Central African Republic, Democratic Republic of Congo, Kenya, Namibia, Nigeria, Senegal, South Africa; N = 105), Europe (France, Italy, Orkney Islands, Russia, Russia Caucasus; N = 158), and America (Brazil, Colombia, Mexico; N = 64). Present-day East Asians were not taken into account as a reference population, assuming that these populations did not substantially contribute to the recent genetic heritage of the Guatemalan people, as is the case in other American locations [28,71,95].
Statistical analysis of AIMs included different tools aimed at disentangling the population structure of the Guatemalan study samples. Multivariate analyses were carried out using Principal Component Analysis (PCA). PCA condenses in a few principal components (usually two; PC1 and PC2) an initial set of data that can contain quantitative variables, into a group of fewer variables resulting in a linear combination of the originals.
To further estimate individual ancestry proportions we used ADMIXTURE . This software uses a ML estimation of individual ancestries from multilocus SNP data (AIMs).
Finally, phylogeographic searchers of mtDNA profiles were carried out on an in-house database containing >27,000 mitogenomes and >170,000 partial (mainly HVS-I) mtDNA sequences. Additional exploratory haplotype searchers were carried out on EMPOP (http://empop.org), Familytree (https://familysearch.org/), and the Sorenson (http://www.smgf.org/) databases. Note that frequencies obtained from these additional database searchers provide only approximate figures given that their web-interfaces were not conceived specifically for population genetic purposes (e.g. forensic casework in the case of EMPOP).
Lokken P, Lutz C. Génesis y evolución de la población afrodescendiente en Guatemala y El Salvador (1524–1824): Rina Cáceres Gómez; San José, Costa Rica. 2008.
Martin S, Grube N. Chronicle of the Maya Kings and Queens: Deciphering the Dynasties of the Ancient Maya. London, UK: Thames & Hudson; 2008.
Coe MD. The Maya: Ancient peoples and places. New Hartford, CT, U.S.A: Thames & Hudson Ltd; London, UK; 2011.
England NC, Maldonado RZ. Mayan Languages. Oxford, UK: Oxford Bibliographies; 2013.
Sharer RJ, Traxler LP. The Ancient Maya. Standford, USA: Stanford University Press; 2006.
Coe MD. Breaking the Maya code. 3rd ed. London: Thames & Hudson; London; England; 2012.
Santley RS. Demographic archaeology in the Maya lowlands. Albuquerque: University of New Mexico Press; Mexico; 1990.
Fry RE. Disjunctive growth in the Maya lowlands. Albuquerque: University of New Mexico Press; Mexico; 1990.
Grube N, Eggebrecht E, Seidel M. Maya: Gottkönige im Regenwald. Potsdam 2012: Könemann im Tandem: h.f.ullmann publishing GmbH; 2012.
Wright S. Isolation by Distance. Genetics. 1943;28(2):114–38.
Whittington SL, Reed DM. Bones of the Maya: studies of ancient skeletons. Washington, D.C.: Smithonian Institution Press; Washington, USA; 1997.
Jacobi KP. Last rites for the Tipu Maya: genetic structuring in a colonial cemetery. Alabama, USA: University of Alabama Press; 2000.
Scherer AK. Population structure of the Classic period Maya. Am J Phys Anthropol. 2007;132(3):367–80.
Austin DM. The Biological Affinity of the Ancient Populations of Altar de Sacrificios and Seibal. Estudios de Cultura Maya. 1978;11:57–73.
Pompa-Padilla JA. Antropología Dental: Aplicación en Poblaciones Prehispánicas. Mexico DF; Mexico. Texas, USA: Instituto Nacional de Antropología e Historia; 1990.
Cucina A, Blos VT. Dental morphometry and biological affinity in pre-contact and contact Maya populations from the peninsula of Yucatan. Mexicon. 2004;26:14–9.
White CD, Spence MW, Longstaffe FJ, Law KR. Testing the nature of Teotihuacan imperialism at Kaminaljuyu using phosphate oxygen-isotope ratios. J Anthropol Res. 2000;56:535–58.
White CD, Longstaffe FJ, Law KR. Revisiting the Teotihuacan connection at Altun Ha. Anc Mesoam. 2001;12:65–72.
Perego UA, Angerhofer N, Pala M, Olivieri A, Lancioni H, Kashani BH, et al. The initial peopling of the Americas: a growing number of founding mitochondrial genomes from Beringia. Genome Res. 2010;20(9):1174–9.
Perego UA, Achilli A, Angerhofer N, Accetturo M, Pala M, Olivieri A, et al. Distinctive Paleo-Indian migration routes from Beringia marked by two rare mtDNA haplogroups. Curr Biol. 2009;19(1):1–8.
Bodner M, Perego UA, Huber G, Fendt L, Röck AW, Zimmermann B, et al. Rapid coastal spread of First Americans: Novel insights from South America’s Southern Cone mitochondrial genomes. Genome Res. 2012;22(5):811–20.
Roewer L, Nothnagel M, Gusmão L, Gomes V, González M, Corach D, et al. Continent-wide decoupling of Y-Chromosomal genetic variation from language and geography in native South Americans. PLoS Genet. 2013;9(4):e1003460.
Achilli A, Perego UA, Bravi CM, Coble MD, Kong QP, Woodward SR, et al. The phylogeny of the four pan-American MtDNA haplogroups: implications for evolutionary and disease studies. PLoS One. 2008;3(3):e1764.
Tamm E, Kivisild T, Reidla M, Metspalu M, Smith DG, Mulligan CJ, et al. Beringian standstill and spread of Native American founders. PLoS One. 2007;2(9):e829.
Salas A, Acosta A, Álvarez-Iglesias V, Cerezo M, Phillips C, Lareu MV, et al. The mtDNA ancestry of admixed Colombian populations. Am J Hum Biol. 2008;20:584–91.
Salas A, Lovo-Gómez J, Álvarez-Iglesias V, Cerezo M, Lareu MV, Macaulay V, et al. Mitochondrial echoes of first settlement and genetic continuity in El Salvador. PLoS One. 2009;4(9):e6882.
Catelli ML, Álvarez-Iglesias V, Gómez-Carballa A, Mosquera-Miguel A, Romanini C, Borosky A, et al. The impact of modern migrations on present-day multi-ethnic Argentina as recorded on the mitochondrial DNA genome. BMC Genet. 2011;12:77.
Taboada-Echalar P, Álvarez-Iglesias V, Heinz T, Vidal-Bralo L, Gómez-Carballa A, Catelli L, et al. The genetic legacy of the pre-Colonial period in contemporary Bolivians. PLoS One. 2013;8(3):e58980.
Perego UA, Lancioni H, Tribaldos M, Angerhofer N, Ekins JE, Olivieri A, et al. Decrypting the mitochondrial gene pool of modern Panamanians. PLoS One. 2012;7(6):e38337.
Mendizabal I, Sandoval K, Berniell-Lee G, Calafell F, Salas A, Martínez-Fuentes A, et al. Genetic origin, admixture, and asymmetry in maternal and paternal human lineages in Cuba. BMC Evol Biol. 2008;8:213.
Sandoval K, Buentello-Malo L, Peñaloza-Espinosa R, Avelino H, Salas A, Calafell F, et al. Linguistic and maternal genetic diversity are not correlated in Native Mexicans. Hum Genet. 2009;126(4):521–31.
Wang S, Lewis CM, Jakobsson M, Ramachandran S, Ray N, Bedoya G, et al. Genetic variation and population structure in native Americans. PLoS Genet. 2007;3(11):e185.
Reich D, Patterson N, Campbell D, Tandon A, Mazieres S, Ray N, et al. Reconstructing Native American population history. Nature. 2012;488(7411):370–4.
Bryc K, Velez C, Karafet T, Moreno-Estrada A, Reynolds A, Auton A, et al. Colloquium paper: genome-wide patterns of population structure and admixture among Hispanic/Latino populations. Proc Natl Acad Sci U S A. 2010;107 Suppl 2:8954–61.
Rasmussen M, Anzick SL, Waters MR, Skoglund P, DeGiorgio M, Stafford Jr TW, et al. The genome of a Late Pleistocene human from a Clovis burial site in western Montana. Nature. 2014;506(7487):225–9.
Rasmussen M, Li Y, Lindgreen S, Pedersen JS, Albrechtsen A, Moltke I, et al. Ancient human genome sequence of an extinct Palaeo-Eskimo. Nature. 2010;463(7282):757–62.
Ibarra-Rivera L, Mirabal S, Regueiro MM, Herrera RJ. Delineating genetic relationships among the Maya. Am J Phys Anthropol. 2008;135(3):329–47.
Herrera RJ, Rojas DP, Terreros MC. Polymorphic Alu insertions among Mayan populations. J Hum Genet. 2007;52(2):129–42.
Coe MD, Kootz R. Mexico: from the Olmec to the Aztecs. New York: Thames & Hudson; 2002.
Gómez-Casado E, Martínez-Laso J, Moscoso J, Zamora J, Martín-Villa M, Pérez-Blas M, et al. Origin of Mayans according to HLA genes and the uniqueness of Amerindians. Tissue Antigens. 2003;61(6):425–36.
Martínez-Espin E, Martínez-González LJ, Fernández-Rosado F, Entrala C, Álvarez JC, Lorente JA, et al. Guatemala mestizo population data on 15 STR loci (Identifiler Kit). J Forensic Sci. 2006;51(5):1216–8.
Sánchez C, Barrot C, Ortega M, González-Martín A, Gorostiza A, Corbella J, et al. Genetic diversity of 15 STRs in Choles from northeast of Chiapas (Mexico). J Forensic Sci. 2005;50(6):1499–501.
García M, Martínez L, Stephenson M, Crews J, Peccerelli F. Analysis of complex kinship cases for human identification of civil war victims in Guatemala using M-FISys software. Forensic Sci Int Genet Suppl Ser. 2009;2(1):250–2.
Boles TC, Snow CC, Stover E. Forensic DNA testing on skeletal remains from mass graves: a pilot project in Guatemala. J Forensic Sci. 1995;40(3):349–55.
Martínez-González LJ, Saiz M, Álvarez-Cubero MJ, Gómez-Martin A, Álvarez JC, Martínez-Labarga C, et al. Distribution of Y chromosomal STRs loci in Mayan and Mestizo populations from Guatemala. Forensic Sci Int Genet. 2012;6(1):136–42.
Iglesias MJ, Ciudad A, Arroyo E, Adanez J, Álvarez S. Aplicaciones de la antropología molecular a la arqueología Maya: el caso de Tikal. Guatemala City: Ministerio de Cultura y Deportes, Instituto de Antropología e Historia, Asociación Tikal; 2001.
Corella A, Bert F, Perez-Perez A, Gene M, Turbon D. Mitochondrial DNA diversity of the Amerindian populations living in the Andean Piedmont of Bolivia: Chimane, Moseten, Aymara and Quechua. Ann Hum Biol. 2007;34(1):34–55.
Fehren-Schmitz L, Reindel M, Cagigao ET, Hummel S, Herrmann B. Pre-Columbian population dynamics in coastal southern Peru: A diachronic investigation of mtDNA patterns in the Palpa region by ancient DNA analysis. Am J Phys Anthropol. 2010;141(2):208–21.
Budowle B, Allard MW, Fisher CL, Isenberg AR, Monson KL, Stewart JE, et al. HVI and HVII mitochondrial DNA data in Apaches and Navajos. Int J Legal Med. 2002;116(4):212–5.
Guardado-Estrada M, Juárez-Torres E, Medina-Martínez I, Wegier A, Macias A, Gomez G, et al. A great diversity of Amerindian mitochondrial DNA ancestry is present in the Mexican mestizo population. J Hum Genet. 2009;54(12):695–705.
Morera B. Análisis del polimorfismo del ADNmt en la población general de Costa Rica: un asunto pendiente. Revista Latinoamericana de Derecho Médico y Medicina Legal. 2002;7(1):21–34.
Nuñez C, Baeta M, Sosa C, Casalod Y, Ge J, Budowle B, et al. Reconstructing the population history of Nicaragua by means of mtDNA, Y-chromosome STRs, and autosomal STR markers. Am J Phys Anthropol. 2010;143(4):591–600.
Kolman CJ, Bermingham E. Mitochondrial and nuclear DNA diversity in the Choco and Chibcha Amerinds of Panama. Genetics. 1997;147(3):1289–302.
Green LD, Derr JN, Knight A. mtDNA affinities of the peoples of North-Central Mexico. Am J Hum Genet. 2000;66(3):989–98.
Salas A, Carracedo Á, Richards M, Macaulay V. Charting the Ancestry of African Americans. Am J Hum Genet. 2005;77(4):676–80.
Salas A, Richards M, Lareu MV, Scozzari R, Coppa A, Torroni A, et al. The African diaspora: mitochondrial DNA and the Atlantic slave trade. Am J Hum Genet. 2004;74(3):454–65.
Salas A, Richards M, De la Fé T, Lareu MV, Sobrino B, Sánchez-Diz P, et al. The making of the African mtDNA landscape. Am J Hum Genet. 2002;71(5):1082–111.
Soares P, Alshamali F, Pereira JB, Fernandes V, Silva NM, Afonso C, et al. The Expansion of mtDNA Haplogroup L3 within and out of Africa. Mol Biol Evol. 2012;29(3):915–27.
Monson KL, Miller KWP, Wilson MR, DiZinno JA, Budowle B. The mtDNA Population Database: an integrated software and database resource for forensic comparison. Forensic Sci Commun. 2002;4:2.
Soares P, Ermini L, Thomson N, Mormina M, Rito T, Rohl A, et al. Correcting for purifying selection: an improved human mitochondrial molecular clock. Am J Hum Genet. 2009;84(6):740–59.
Gómez-Carballa A, Ignacio-Veiga A, Álvarez-Iglesias V, Pastoriza-Mourelle A, Ruiz Y, Pineda L, et al. A melting pot of multicontinental mtDNA lineages in admixed Venezuelans. Am J Phys Anthropol. 2012;147(1):78–87.
Kolman CJ, Bermingham E, Cooke R, Ward RH, Arias TD, Guionneau-Sinclair F. Reduced mtDNA diversity in the Ngöbé Amerinds of Panamá. Genetics. 1995;140(1):275–83.
Salas A, Richards M, Lareu MV, Sobrino B, Silva S, Matamoros M, et al. Shipwrecks and founder effects: divergent demographic histories reflected in Caribbean mtDNA. Am J Phys Anthropol. 2005;128(4):855–60.
Álvarez-Iglesias V, Jaime JC, Carracedo Á, Salas A. Coding region mitochondrial DNA SNPs: targeting East Asian and Native American haplogroups. Forensic Sci Int Genet. 2007;1:44–55.
Lalueza-Fox C, Calderon FL, Calafell F, Morera B, Bertranpetit J. MtDNA from extinct Tainos and the peopling of the Caribbean. Ann Hum Genet. 2001;65(Pt 2):137–51.
Canger U. Five studies inspired by Náhuatl verbs in -oa. Travaux du Cercle Linguistique de Copenhague vol XIX. Copenhagen, Denmark: C.A. Reitzels Boghandel; 1980.
Canger U. Nahuatl dialectology: A survey and some suggestions. Int J Am Linguist. 1988;54(1):28–72.
Fowler QJ. Ethnohistoric sources on the Pipil Nicarao: a critical analysis, vol. 32. Durham, NC: Duke University Press and the American Society for Ethnohistory; 1985.
Jobling MA, Tyler-Smith C. The human Y chromosome: an evolutionary marker comes of age. Nat Rev Genet. 2003;4(8):598–612.
Pardo-Seco J, Martinón-Torres F, Salas A. Evaluating the accuracy of AIM panels at quantifying genome ancestry. BMC Genomics. 2014;30(15(1)):543.
Pereira R, Phillips C, Pinto N, Santos C, dos Santos SE, Amorim A, et al. Straightforward inference of ancestry and admixture proportions through ancestry-informative insertion deletion multiplexing. PLoS One. 2012;7(1):e29684.
Rina C. Africanos y afromestizos en la historia colonial de Centroamérica. Costa Rica: Oficina Regional de la UNESCO para Centroamérica y Panamá; 2008.
Alves-Silva J, da Silva SM, Guimaraes PE, Ferreira AC, Bandelt HJ, Pena SD, et al. The ancestry of Brazilian mtDNA lineages. Am J Hum Genet. 2000;67(2):444–61.
Cárdenas JM, Heinz T, Pardo-Seco J, Álvarez-Iglesias V, Taboada-Echalar P, Sánchez-Diz P, et al. The multiethnic ancestry of Bolivians as revealed by the analysis of Y-chromosome markers. Forensic Sci Int Genet. 2014;14:210–8.
Salas A, Jaime JC, Álvarez-Iglesias V, Carracedo Á. Gender bias in the multiethnic genetic composition of central Argentina. J Hum Genet. 2008;53(7):662–74.
Andrews RM, Kubacka I, Chinnery PF, Lightowlers RN, Turnbull DM, Howell N. Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA. Nat Genet. 1999;23(2):147.
Salas A, Coble M, Desmyter S, Grzybowski T, Gusmâo L, Hohoff C, et al. A cautionary note on switching mitochondrial DNA reference sequences in forensic genetics. Forensic Sci Int Genet. 2012;6(6):e182–4.
van Oven M, Kayser M. Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation. Hum Mutat. 2009;30(2):E386–94.
Kloss-Brandstätter A, Pacher D, Schonherr S, Weissensteiner H, Binna R, Specht G, et al. HaploGrep: a fast and reliable algorithm for automatic classification of mitochondrial DNA haplogroups. Hum Mutat. 2011;32(1):25–32.
Bandelt H-J, van Oven M, Salas A. Haplogrouping mitochondrial DNA sequences in Legal Medicine/Forensic Genetics. Int J Legal Med. 2012;126(6):901–16.
Salas A, Carracedo Á, Macaulay V, Richards M, Bandelt H-J. A practical guide to mitochondrial DNA error prevention in clinical, forensic, and population genetics. Biochem Biophys Res Commun. 2005;335(3):891–9.
Salas A, Prieto L, Montesino M, Albarrán C, Arroyo E, Paredes-Herrera MR, et al. Mitochondrial DNA error prophylaxis: Assessing the causes of errors in the GEP’02-03 proficiency testing trial. Forensic Sci Int. 2005;148(2–3):191–8.
Yao Y-G, Salas A, Logan I, Bandelt H-J. mtDNA data mining in GenBank needs surveying. Am J Hum Genet. 2009;85(6):929–33. author reply 933.
Quintáns B, Álvarez-Iglesias V, Salas A, Phillips C, Lareu MV, Carracedo Á. Typing of mitochondrial DNA coding region SNPs of forensic and anthropological interest using SNaPshot minisequencing. Forensic Sci Int. 2004;140(2–3):251–7.
Brisighelli F, Capelli C, Alvarez-Iglesias V, Onofri V, Paoli G, Tofanelli S, et al. The Etruscan timeline: a recent Anatolian connection. Eur J Hum Genet. 2009;17(5):693–6.
Brión M, Sobrino B, Blanco-Verea A, Lareu MV, Carracedo Á. Hierarchical analysis of 30 Y-chromosome SNPs in European populations. Int J Legal Med. 2005;119(1):10–5.
Karafet TM, Mendez FL, Meilerman MB, Underhill PA, Zegura SL, Hammer MF. New binary polymorphisms reshape and increase resolution of the human Y chromosomal haplogroup tree. Genome Res. 2008;18(5):830–8.
Genealogy ISoG. Y-DNA Haplogroup Tree 2013 Version: [8.76], Date: [3 October 2013]. 2013. http://www.isogg.org/tree/ [Date of access: 24, 10, 2013].
Blanco-Verea A, Jaime JC, Brión M, Carracedo Á. Y-chromosome lineages in native South American population. Forensic Sci Int Genet. 2010;4(3):187–93.
Bandelt HJ, Forster P, Rohl A. Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol. 1999;16(1):37–48.
Bandelt HJ, Forster P, Sykes BC, Richards MB. Mitochondrial portraits of human populations using median networks. Genetics. 1995;141(2):743–53.
Yang Z. PAML: a program package for phylogenetic analysis by maximum likelihood. Comput Appl Biosci: CABIOS. 1997;13(5):555–6.
Slatkin M. Gene genealogies within mutant allelic classes. Genetics. 1996;143(1):579–87.
Cann HM, de Toma C, Cazes L, Legrand MF, Morel V, Piouffre L, et al. A human genome diversity cell line panel. Science. 2002;296(5566):261–2.
Heinz T, Álvarez-Iglesias V, Pardo-Seco J, Taboada-Echalar P, Gómez-Carballa A, Torres-Balanza A, et al. Ancestry analysis reveals a predominant Native American component with moderate European admixture in Bolivians. Forensic Sci Int Genet. 2013;7(5):537–42.
González JR, Armengol L, Sole X, Guino E, Mercader JM, Estivill X, et al. SNPassoc: an R package to perform whole genome association studies. Bioinformatics. 2007;23(5):644–5.
Alexander DH, Novembre J, Lange K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009;19(9):1655–64.
We greatly thank all the sample contributors in Guatemala. JS was supported by research grants from the German FAZIT-STIFTUNG and the German Academic Exchange Service (DAAD). VAI was supported by funding from the EUROFORGEN project and the Xunta de Galicia (EM 2012/045). The research leading to these results has received funding from the People Program (Marie Curie Actions) of the European Union’s Seventh Framework Program FP7/2007-2013/ under REA grant agreement n° 290344, and the grants from the “Ministerio de Ciencia e Innovación” (SAF2008-02971) from the Plan Galego IDT, Xunta de Galicia (EM 2012/045) given to AS.
The authors declare that they have no competing interests.
JS and AS conceived the study. JS, VAL, AMM, MGB, AGC, and AS analyzed the data. JS and AS drafted the article and all the authors have critically revised the manuscript and given final approval of the version to be published.
Jens Söchtig and Antonio Salas contributed equally to this work.
Mitochondrial DNA hypervariable region sequencing and mtDNA SNP typing data for the Guatemalan samples analyzed in the present study.
TMRCA computed from the averaged distance ( ρ ) of the haplotypes of a clade to the respective root haplotype.
Y–chromosome SNP data for the Guatemalan samples analyzed in the present study.
Autosomes data for the Guatemalan samples analyzed in the present study.
Discriminant Analysis of Principal Components (DAPC) of Guatemalan samples carried out on AIMs.
Y chromosomal phylogenetic tree. Polymorphism names are indicated above the lines (branches) and corresponding ‘rs’ numbers are shown below these lines. Bolded checkmarks (left) indicate haplogroups observed in the Guatemalan samples. The branch marked by M62 is now classified as private by the ISOGG consortium meaning that, according to the consortium “this SNP has not met the population distribution criteria for placement on the tree”.
About this article
Cite this article
Söchtig, J., Álvarez-Iglesias, V., Mosquera-Miguel, A. et al. Genomic insights on the ethno-history of the Maya and the ‘Ladinos’ from Guatemala. BMC Genomics 16, 131 (2015). https://doi.org/10.1186/s12864-015-1339-1
- Autosomal SNPs