ArticlePDF Available

High-resolution mitochondrial DNA analysis sheds light on human diversity, cultural interactions, and population mobility in Northwestern Amazonia

Abstract and Figures

Objectives Northwestern Amazonia (NWA) is a center of high linguistic and cultural diversity. Several language families and linguistic isolates occur in this region, as well as different subsistence patterns, with some groups being foragers and others agriculturalists. In addition, speakers of Eastern Tukanoan languages are known for practicing linguistic exogamy, a marriage system in which partners are taken from different language groups. In this study, we use high-resolution mitochondrial DNA sequencing to investigate the impact of this linguistic and cultural diversity on the genetic relationships and population structure of NWA groups. Methods We collected saliva samples from individuals representing 40 different NWA ethnolinguistic groups and sequenced 439 complete mitochondrial genomes to an average coverage of 1,030×. Results The mtDNA data revealed that NWA populations have high genetic diversity with extensive sharing of haplotypes among groups. Moreover, groups who practice linguistic exogamy have higher genetic diversity, while the foraging Nukak have lower genetic diversity. We also find that rivers play a more important role than either geography or language affiliation in structuring the genetic relationships of populations. Discussion Contrary to the view of NWA as a pristine area inhabited by small human populations living in isolation, our data support a view of high diversity and contact among different ethnolinguistic groups, with movement along rivers probably facilitating this contact. Additionally, we provide evidence for the impact of cultural practices, such as linguistic exogamy, on patterns of genetic variation. Overall, this study provides new data and insights into a remote and little-studied region of the world.
Content may be subject to copyright.
High-resolution mitochondrial DNA analysis sheds
light on human diversity, cultural interactions, and population
mobility in Northwestern Amazonia
Leonardo Arias
Chiara Barbieri
Guillermo Barreto
Mark Stoneking
Brigitte Pakendorf
Department of Evolutionary Genetics, Max
Planck Institute for Evolutionary
Anthropology, Leipzig D-04103, Germany
Department of Linguistic and Cultural
Evolution, Max Planck Institute for the
Science of Human History, Jena D-07745,
Laboratorio de Gen
etica Molecular
Humana, Universidad del Valle, Cali,
Dynamique du Langage, UMR5596, CNRS
& Universit
e de Lyon, Lyon Cedex 07
69363, France
Leonardo Arias, Department of
Evolutionary Genetics, Max-Planck-Institute
for Evolutionary Anthropology, Deutscher
Platz 6, Leipzig, D-04103, Germany.
Funding information
Departamento Administrativo de Ciencia,
Tecnología e Innovaci
Max Planck Society
Objectives: Northwestern Amazonia (NWA) is a center of high linguistic and cultural diversity.
Several language families and linguistic isolates occur in this region, as well as different subsistence
patterns, with some groups being foragers and others agriculturalists. In addition, speakers of East-
ern Tukanoan languages are known for practicing linguistic exogamy, a marriage system in which
partners are taken from different language groups. In this study, we use high-resolution mitochon-
drial DNA sequencing to investigate the impact of this linguistic and cultural diversity on the
genetic relationships and population structure of NWA groups.
Methods: We collected saliva samples from individuals representing 40 different NWA ethnolin-
guistic groups and sequenced 439 complete mitochondrial genomes to an average coverage of
Results: The mtDNA data revealed that NWA populations have high genetic diversity with exten-
sive sharing of haplotypes among groups. Moreover, groups who practice linguistic exogamy have
higher genetic diversity, while the foraging Nukak have lower genetic diversity. We also find that
rivers play a more important role than either geography or language affiliation in structuring the
genetic relationships of populations.
Discussion: Contrary to the view of NWA as a pristine area inhabited by small human populations
living in isolation, our data support a view of high diversity and contact among different ethnolin-
guistic groups, with movement along rivers probably facilitating this contact. Additionally, we
provide evidence for the impact of cultural practices, such as linguistic exogamy, on patterns of
genetic variation. Overall, this study provides new data and insights into a remote and little-
studied region of the world.
haplogroup, South America, language, exogamy, Amazonia
Northwestern Amazonia (NWA) contains tremendous biological, lin-
guistic, and cultural diversity, which likely reflects the heterogeneity of
the landscape, especially the complex and extensive network of rivers
found in this area. The region (Figure 1) extends from the Andean foot-
hills in the west to the area between the Orinoco River and the Rio
Negro in the east, and extends south until the confluence between the
Rio Negro and the Amazon River. The northern border is defined by
the Eastern Andean Cordillera and the Colombian-Venezuelan llanos,
and the southern by the full length of the Putumayo River (Eriksen,
In terms of linguistic diversity, NWA harbors ethnolinguistic groups
belonging to the main South American language families accepted by
Am J Phys Anthropol.2017;118. V
C2017 Wiley Periodicals, Inc.
Received: 20 February 2017
Revised: 17 September 2017
Accepted: 7 October 2017
DOI: 10.1002/ajpa.23345
most linguists (Campbell, 1997; Chacon, 2014; Dixon and Aikhenvald,
1999), namely Arawakan, Carib, Tupi, and Quechua. Additionally,
several local language families are also present, such as Tukanoan,
Guahiban, Huitotoan, Boran, Peba-Yaguan, Piaroa-Saliban, and Maku-
Puinave (see for a classification of the language families),
as well as various isolate languages like Tikuna, Cofan, and Kamentsa
(Landaburu, 2000). Furthermore, several indigenous groups live in vol-
untary isolation, and almost nothing is known about their linguistic
affiliation (Franco, 2002). The area has been proposed as the place of
origin of the Arawakan family, since it contains the highest linguistic
diversity within the family (Aikhenvald, 1999; Heckenberger, 2002;
Zucchi, 2002). In addition, all 20 languages of the Tukanoan family are
found in the area. These are classified into two branches: the Western
Tukanoan branch distributed along the Putumayo, Caquet
a, and Napo
Rivers and the Eastern Tukanoan branch along the Vaup
es, Rio Negro,
and Apaporis Rivers and their tributaries (Chacon, 2014). The Carib,
Tupi, and Quechua language families are probably recent immigrants to
NWA, since only one language per family is present in the area, while
the majority of languages within these families are found elsewhere. In
addition, the Tupi language Nheengat
u or Lingua Geral is found in the
region; however, this is a very recent introduction spread by mission-
aries during the 17th and 18th centuries and by traders during the
rubber boom in the 19th century, when it was used as a trade language
(Sorensen, 1967; Stenzel, 2005).
In terms of cultural diversity, while NWA has often been viewed as
a pristine area inhabited only by small, isolated, seminomadic tribes
with an economy based on hunting and gathering (Denevan, 1992;
Meggers, 1954), in fact there is considerable variation in subsistence
and marriage practices. Although some groups are traditional foragers,
others engage in agriculture, and instead of being isolated, archeologi-
cal and anthropological evidence now shows that NWA was indeed
part of a continent-wide network of exchange and trade. Complex soci-
eties organized in chiefdoms and multiethnic confederations arose in
the region, and multilingualism and extensive interactions among eth-
nolinguistic groups were the norm (Heckenberger, 2002; Hornborg,
2005; Santos-Granero, 2002; Vidal, 1997).
In particular, the groups speaking Eastern Tukanoan languages and
some of their Arawakan neighbors living in the basin of the Vaup
River and Rio Negro engage in an exceptional marital practice known
as linguistic exogamy (Aikhenvald, 1996; Chacon and Cay
on, 2013;
Sorensen, 1967; Stenzel, 2005). According to this cultural norm, indi-
viduals are required to marry someone from a different language group,
with each individuals linguistic affiliation being determined by the lan-
guage of the father. Linguistic exogamy thus creates a situation of
FIGURE 1 Geographic location of the sampling sites. Every triangle corresponds to a single community, which may contain more than one
ethnolinguistic group. 1. Curripaco and Bare, 2. Matapi, 3. Ach-Piapoco, 4. Yucuna, 5. Carijona, 6. Desano, Yuruti, Pisamira, and Karapana, 7.
Pira-Wanano, 8. Siriano, 9. Tanimuka, 10. Tukano, 11. Tuyuca and Tatuyo, 12. Coreguaje, 13. Siona, 14. Guayabero, 15. Sikuani, 16. Murui,
17.Uitoto, 18. Kamentsa, 19. Nukak, 20. Puinave, 21. Pasto, 22. Yagua, 23. Saliba, 24. Inga, 25. Tikuna, 26. Cocama
multilingualism and movement of people (particularly women, since it is
accompanied by patrilocality and patrilineality) among the groups par-
ticipating in the system (Sorensen, 1967).
Historical linguistics, cultural anthropology, and archeology are the
main disciplines that have traditionally addressed questions regarding
the origins, pre-history, and genetic relationships of NWA ethnolinguis-
tic groups (Campbell, 1997; Chacon, 2014; Heckenberger, 2008; Lath-
rap, 1970; Meggers, 1948; Nettle, 1999). However, due to the
incomplete archeological record, the limitations of linguistic methods
based on lexical cognates to establish deep time relationships (Dediu
and Levinson, 2012; Hock and Joseph, 2009), and the insufficiency of
documentation and description of a large number of the NWA soci-
eties, many of these questions remain to be fully answered. The oldest
archeological evidence of human occupation in NWA comes from a
single site on the Middle Caquet
a River, which has been dated
between 9250 and 8100 BP. It contains a great variety of stone arti-
facts, carbonized seeds and other botanical remains from different
palm species, as well as phytoliths of bottle gourd, leren, and pumpkin
(Aceituno, Loaiza, Delgado-Burbano, & Barrientos, 2013; Gnecco and
Mora, 1997), indicating that these early human groups relied on vege-
table resources that are still being exploited by contemporary societies
in NWA.
One hypothesis about the peopling of NWA was proposed by
u (1950), who suggested that the region was first inhabited
by hunter-gatherer populations (HGPs), perhaps the ancestors of the
Maku-Puinave groups, most of whom still practice a foraging lifestyle.
Proto-Arawakan groups then started expanding into the region from
their place of origin located between the Orinoco River and the Rio
Negro (Heckenberger, 2002; Lathrap, 1970). Finally, the Tukanoans are
assumed to have arrived in the area and displaced peoples speaking
Arawakan and Maku-Puinave languages from the Vaup
es (the Tukano-
ans probably came from the Napo-Putumayo, where Western Tukano-
ans still live). However, this scenario does not account for the presence
of groups belonging to the Carib, Guahiban, Huitotoan, and Boran lan-
guage families and the various language isolates in the region.
Genetic studies can provide insights into population history, and
indeed studies of mitochondrial DNA (mtDNA) genetic variation in
Native American populations have contributed greatly to our knowl-
edge about the peopling of the Americas. Early studies using restriction
fragment length polymorphisms (RFLP) and sequencing of the hyper-
variable region one (HVS-I) identified five founder lineages or hap-
logroups, designated as AD and X (Bailliet, Rothhammer, Carnese,
Bravi, & Bianchi, 1994; Barbieri, Heggarty, Castri, Luiselli, & Pettener,
2011; Gaya-Vidal et al., 2011; Keyeux, Rodas, Gelvez, & Carter, 2002;
Lewis et al., 2007; Schurr, 2004; Torroni et al., 1993). Whereas hap-
logroups AD are widely distributed in the Americas, haplogroup X is
restricted to North America (Bolnick and Smith, 2003; Malhi, Schultz, &
Smith, 2001). The analysis of HVS-I in several Native American popula-
tions showed that haplogroups A-D exhibit similar levels of diversity
(Bonatto and Salzano, 1997), supporting the hypothesis of a single ori-
gin of all Native American populations from a Northeast Asian source.
Additionally, HVS-I data have been used to determine the genetic rela-
tionships among indigenous populations in South America and to test
hypotheses concerning how genetic variation is structured at the
regional and continental levels (Barbieri et al., 2011; Gaya-Vidal et al.,
2011; Lewis et al., 2007; Marrero et al., 2007; Melton et al., 2007).
These studies revealed that Andean (or western) populations show
higher levels of diversity and low genetic distances in contrast to the
Eastern populations, who show the opposite pattern. However, in pre-
vious studies, NWA populations have been generally underrepresented,
and hence inferences about the genetic structure of the entire Amazo-
nian region are based on data from a small number of populations.
Recent developments in sequencing technology allow the determi-
nation of complete mtDNA genomes at the population level and thus
enable unbiased insights into the maternal history of human popula-
tions (Delfin et al., 2014; Gunnarsdottir, Li, Bauchet, Finstermeier, &
Stoneking, 2011; Kivisild, 2015). At present, no such studies are
reported for South American indigenous populations. Available studies
of complete mtDNA genomes from Native Americans have been
restricted to a limited number of individuals carrying particular hap-
logroups, usually selected based on their HVS-I sequences (Achilli et al.,
2013; Bodner et al., 2012; de Saint Pierre et al., 2012; Fagundes et al.,
2008; Lee and Merriwether, 2015; Perego et al., 2009, 2010), or to
archeological remains from different time periods (Fehren-Schmitz
et al., 2015; Llamas et al., 2016). These studies have primarily focused
on inferences about the peopling of the continent, the number of
migrations, the divergence times of haplogroups and changes in the
effective population size through time.
Nevertheless, several problems and biases are associated with this
sampling strategy. First, the overall diversity might be underestimated,
since individuals carrying the same HVS-I sequence can exhibit consid-
erable variation in the coding region (Gunnarsdottir et al., 2011). Sec-
ond, the reconstruction of demographic trends can be skewed, since
the estimation of effective population sizes through time using Bayes-
ian coalescent methods (i.e., Bayesian skyline plots [BSPs] in BEAST)
can generate spurious signals of population growth when based on
samples selected by haplogroup (Gunnarsdottir et al., 2011). Last, the
histories and origins of specific populations cannot be investigated,
since the coalescent age of a particular lineage does not correspond to
the age of the population, especially when the diversity within each lin-
eage is unknown (Schurr, 2004).
In this study, we use complete mtDNA sequencing in a large and
representative sample of populations covering the extant ethnolinguis-
tic diversity from NWA to reconstruct their maternal history, as well as
to determine their genetic diversity and to make inferences about the
origins of this diversity. Finally, we aim to investigate the impact of
pre-historic population dynamics and cultural interactions on the struc-
ture of the genetic variation observed among present-day NWA
Sample collection
Samples from unrelated individuals belonging to 40 ethnolinguistic
groups were collected during several expeditions carried out by one of
TABLE 1 Sampled ethnolinguistic groups with information on merged groups (see Population SamplesSection) given below the compound
Population Label in Figure 1 nCensus size
Language family Subsistence strategy
River/place of residence
Yucu-Matapi 39 Arawakan AG Mirití-Paran
Yucuna 4 31 550 Arawakan AG Mirití-Paran
Matapi 2 8 220 Arawakan AG Mirití-Paran
Curripaco 17 Arawakan AG Atabapo
Curripaco 1 16 7,827 Arawakan AG Atabapo
Bare 1 1 NA
Arawakan AG Atabapo
Ach-Piapoco 24 Arawakan AG Meta
Achagua 3 6 283 Arawakan AG Meta
Piapoco 3 18 4,926 Arawakan AG Meta
1 311 Arawakan AG Mirití-Paran
Carijona 5 8 307 Carib AG Upper-Vaup
Cofan 6 877 Cofan AG Guam
Barasano 4 2,008 Eastern Tukanoan AG Upper-Vaup
Desano 6 17 2,457 Eastern Tukanoan AG Upper-Vaup
Kubeo 5 6,647 Eastern Tukanoan AG Upper-Vaup
Other-ET 10 Eastern Tukanoan AG Upper-Vaup
Tuyuca 11 7 642 Eastern Tukanoan AG Upper-Vaup
Yuruti 6 1 687 Eastern Tukanoan AG Upper-Vaup
Pisamira 6 1 61 Eastern Tukanoan AG Upper-Vaup
Karapana 6 1 464 Eastern Tukanoan AG Upper-Vaup
Pira-Wanano 13 Eastern Tukanoan AG Upper-Vaup
Piratapuyo 7 8 697 Eastern Tukanoan AG Upper-Vaup
Wanano 7 5 1,395 Eastern Tukanoan AG Upper-Vaup
Siriano 8 10 749 Eastern Tukanoan AG Upper-Vaup
Tanimuka 9 10 1,247 Eastern Tukanoan AG Mirití-Paran
Tuka-Tatuyo 10 Eastern Tukanoan AG Upper-Vaup
Tukano 10 8 6,996 Eastern Tukanoan AG Upper-Vaup
Tatuyo 11 2 331 Eastern Tukanoan AG Upper-Vaup
Siona 13 17 734 Western Tukanoan AG Putumayo
Coreguaje 12 19 2,212 Western Tukanoan AG Caquet
Sikuani 15 16 23,006 Guahiban HGP Guaviare
Guayabero 14 35 1,118 Guahiban HGP Guaviare
Saliba 23 16 1,929 Piaroa-Saliban AG Meta
26 7,343 Huitotoan AG Putumayo
Murui 16 18 Huitotoan AG Putumayo
Uitoto 17 8 Huitotoan AG Putumayo
Puinave 20 19 6,604 Maku-Puinave HGP Inirida
Nukak 19 16 1,483 Maku-Puinave HGP Interfluvial
Pasto 21 14 69,789 Pasto AG Andean
Kamentsa 18 11 4,773 Kamentsa AG Andean
Inga 24 17 19,079 Quechuan AG Andean
Tikuna 25 18 7,102 Tikuna AG Amazonas
Cocama 26 17 792 Tupi AG Amazonas
Yagua 22 13 297 Peba-Yaguan AG Amazonas
the authors (L.A.) in five departments (administrative divisions) of
NWA, namely: Amazonas, Guainía, Guaviare, Meta and Putumayo
(Table 1, Figure 1). The samples consisted of either saliva (n 5400), col-
lected as 3 mL of saliva in 3 mL of lysis buffer (Quinque, Kittler, Kayser,
Stoneking, & Nasidze, 2006), or blood samples (n560) stabilized with
EDTA. Written informed consent was obtained from each participant,
and from the community leader and/or local/regional indigenous
organizations, after giving a full description of the aims of the study.
Local translators and fieldwork assistants helped to explain and trans-
late into the local languages when individuals or communities were not
proficient in Spanish. Additionally, each participant answered a short
questionnaire soliciting information regarding their birthplace, language,
ethnic affiliation and that of their parents and grandparents. The study
was approved by the ethics committee of the Universidad del Valle in
Cali, Colombia and the Ethics Commission of the University of Leipzig
Medical Faculty. All procedures were undertaken in accordance with
the Declaration of Helsinki on ethical principles and an export permit
was issued by the Colombian Ministry of Health and Social Protection.
DNA sequencing and sequence processing
The DNA was extracted from blood samples with the salting out
method (Miller, Dykes, & Polesky, 1988) and from the saliva samples
with the QIAamp DNA Midi kit (Qiagen), starting from 2.0 mL of the
saliva/buffer mixture. The concentration of DNA was quantified with a
NanoDrop 8000 spectrophotometer (Thermo Scientific). We prepared
genomic libraries with double indices and enriched for full mtDNA
genomes using a hybridization-capture method described previously in
Kircher, Sawyer, and Meyer (2012) and Maricic, Whitten, and Paabo
(2010). From the enriched libraries, paired-end sequences of 100 bp
length were generated on the Illumina Hiseq 2500 platform. Base-
calling was performed using freeIbis (Renaud, Kircher, Stenzel, & Kelso,
2013), and Illumina adapters were trimmed and completely overlapping
paired sequences were merged using leeHOM (Renaud, Stenzel, &
Kelso, 2014a). The sequencing data were de-multiplexed using deML
(Renaud, Stenzel, Maricic, Wiebe, & Kelso, 2014b) and the sequences
aligned against the human reference genome 19 using BWAsaln algo-
rithm (Li and Durbin, 2009). After duplicate removal using PicardTools
v2.1.1 (, we performed an
iterative alignment for each library individually to obtain mtDNA con-
sensus sequences. In the first step, we extracted all sequencing reads
of a library that aligned either to the mitochondrial genome or to a list
of nuclear copies of mtDNA (Li, Schroeder, Ko, & Stoneking, 2012). We
subsequently aligned these reads to the revised Cambridge Reference
Sequence (rCRS; Andrews et al. 1999) using BowTie2svery-sensitive
algorithm (Langmead and Salzberg, 2012) and called a consensus
sequence. In the second step, the reads were re-aligned to the librarys
respective consensus sequence generated in the first step, using the
same BowTie2 settings. After the second alignment step, we called a
final consensus sequence that was used throughout the rest of the
analysis. Final sequences in fasta format were aligned to the rCRS
(Andrews et al., 1999) with the multiple sequence alignment software
Mafft (Katoh and Standley, 2013), and manually inspected for align-
ment errors with Bioedit ver. 7.2.5 (Hall, 1999). The two poly-C regions
(np 303315 and 16,18316,194) were excluded from the subsequent
analyses. Although one position (16,189) diagnostic for haplogroup B2
is therefore not considered in the haplogroup calling analysis, the addi-
tional substitutions defining this haplogroup that occur elsewhere in
the mitochondrial genome enable unambiguous assignment of sequen-
ces to this lineage.
Population samples
We considered populations with a sample size of 10 individuals or
more, and merged populations with sample sizes smaller than 10 based
on linguistic criteria when our initial analyses did not show significant
genetic differences, as follows (Table 1). The Arawakan groups Achagua
(n56) and Piapoco (n518) were merged into a single population, since
their indigenous reservations are adjacent and individuals often inter-
marry (data available on request); one Bare (n51) individual was added
to the Curripaco (n516) sample among whom he was living when
sampled on the Atabapo River; the Yucuna (n531) and Matapi (n58)
were merged into a single population, since they both speak Yucuna,
live along the same river, and intermarry (data available on request);
and the Murui (n518) and Uitoto (n58) were merged, as these two
groups belong to the same language family, which is composed of
several dialects that are mutually intelligible (
resource/languoid/id/huit1251, accessed on May 31, 2017). Finally,
TABLE 1 (Continued)
Population Label in Figure 1 nCensus size
Language family Subsistence strategy
River/place of residence
Guambiano 1 23,462 Barbacoan AG Andean
Nasa 1 138,501 Nasa AG Andean
Mestizo 9 NA Mestizo
Total 439
Adapted from Arango and S
anchez (2004).
AG, agriculturalist; HGP, Hunter-gatherer populations, data from D-PLACE (Kirby et al., 2016) and HG (,
accessed on June 6, 2017).
NA: data not available.
Populations with label in italics were not considered in the population-based analyses.
Census data reports the population size including groups that speak five dialectal varieties.
Mestizo is an autonym used by people of mixed ancestry.
following the latest classification of the Tukanoan family (Chacon,
2014), the Eastern Tukanoan groups Piratapuyo (n58) and Wanano
(n55) were merged as Pira-Wanano; Tukano (n58) and Tatuyo
(n52) were merged as Tuka-Tatuyo; and Tuyuca (n57), Yuruti (n51),
Pisamira (n51), and Karapana (n51) were merged as Other-ET. The
only group with a sample size smaller than 10 that we retained as a
separate group in the analyses were the Carijona (n58), since this is
the only Carib-speaking group living in NWA. Moreover, they are at
risk of disappearing both physically and culturally, with <30 active
speakers of Carijona scattered in two communities, and they occupy an
important place in the ethno-history of the region (Franco, 2002). We
excluded Barasano (n54), Kubeo (n55), Cofan (n56), Cabiyari
(n51), Guambiano (n51), and Nasa (n51) individuals from all the
analyses except the haplotype networks, since this analysis represents
the evolutionary relationships among individual sequences. We further-
more excluded nine individuals with maternal ancestry tracing outside
of NWA (labeled Mestizoin Table 1) from all analyses. After merging
and filtering as described above, 412 sequences from 24 groups were
kept in the population-based analyses.
Data analysis
Based on information from D-PLACE (Kirby et al. 2016) and HG data-
base (, accessed on June 6,
2017), we divided the populations into agriculturalists (AG) and HGP.
In the latter category, we placed the Nukak, who currently still practice
a foraging way of life, as well as the Puinave, Sikuani, and Guayabero,
who have all adopted agriculture only very recently (Kondo, 2002;
Uribe Tob
on and Instituto Colombiano de Cultura, 1992).
The haplogroup affiliation of the individual sequences was deter-
mined with Haplogrep (Kloss-Brandstatter et al., 2011), based on Phy-
lotree build 16 (van Oven and Kayser, 2009). Haplogroup frequencies
by population were estimated by simple counting, and a correspon-
dence analysis (CA) based on the frequency of sub-haplogroups (e.g.,
A2a) was performed and visualized with the R-packages FactoMineR
(Le, Josse, & Husson, 2008) and factoextra (Kassambara and Mundt,
2016), respectively.
Population-based statistical analyses were performed with Arle-
quin v3.5 (Excoffier and Lischer, 2010). These include the analysis of
molecular variance (AMOVA), estimation of molecular diversity indices,
the estimation of pairwise genetic distances based on U
D test of selective neutrality. A multidimensional scaling analysis (MDS)
was performed on the matrix of pairwise U
values to visualize the
distances between populations. Additionally, we performed a Mantel
test to evaluate the correlations between genetic distances and geo-
graphic distances. The matrix of geographic distances was built using
the geographic coordinates of the location where the majority of sam-
ples for each ethnolinguistic group were collected and then calculating
the great circle distances between locations via the R packages ade4
and geosphere (Dray and Dufour, 2007; Hijmans, 2016). Furthermore,
a multiple regression analysis on distance matrices (MRM) (Goslee and
Urban, 2007) with the form: MRM(as.dist(gen.dist) as.dist(geo.dist)1
as.dist(rivers.dist)) was performed. This analysis takes into consideration
a matrix of geographic distances and a matrix of proximity along rivers
as predictor variables of the genetic distances (pairwise U
between populations (Pugach et al., 2016; Yunusbayev et al., 2012).
For the matrix of river distances, a value of zero was given to popula-
tions living along the same river or on rivers that are closely connected,
and a value of one was given to populations living on different rivers.
The sharing of haplotypes between populations was estimated
with in-house R scripts as the proportion of pairs of identical sequen-
ces shared between populations. Additionally, networks of haplotypes
were constructed with the software Network ver. and visual-
ized with Network Publisher ver. (http://www.fluxus-engineer- Finally, BSPs were constructed by population and by
haplogroup (i.e., A2, B2, C1, and D1) with BEAST ver. 1.8.2 (Drum-
mond, Suchard, Xie, & Rambaut, 2012). For this analysis, the best sub-
stitution model was estimated with jModeltest 2.1.7 (Darriba, Taboada,
Doallo, & Posada, 2012), and BEAST was used to estimate whether a
strict or a relaxed clock model best fits the data. This analysis was per-
formed on both the complete sequences and the sequences partitioned
TABLE 2 Frequency of haplogroups for the 24 NWA ethnolinguis-
tic groups included in the population analyses
Population nA2 B2 C1 D1
Yucu-Matapi 39 0.28 0.10 0.56 0.05
Curripaco 17 0.18 0.53 0.24 0.06
Ach-Piapoco 24 0.54 0.04 0.42 0.00
Carijona 8 0.13 0.25 0.63 0.00
Desano 17 0.29 0.12 0.41 0.18
Other-ET 10 0.50 0.00 0.30 0.20
Pira-Wanano 13 0.31 0.08 0.38 0.23
Siriano 10 0.40 0.10 0.40 0.10
Tanimuka 10 0.50 0.10 0.40 0.00
Tuka-Tatuyo 10 0.30 0.10 0.30 0.30
Siona 17 0.59 0.35 0.00 0.06
Coreguaje 19 0.11 0.16 0.74 0.00
Sikuani 16 0.25 0.00 0.75 0.00
Guayabero 35 0.43 0.23 0.34 0.00
Saliba 16 0.19 0.06 0.56 0.19
Mur-Uitoto 26 0.23 0.12 0.31 0.35
Puinave 19 0.11 0.42 0.47 0.00
Nukak 16 0.00 0.31 0.69 0.00
Pasto 14 0.36 0.14 0.21 0.29
Kamentsa 11 0.18 0.09 0.64 0.09
Inga 17 0.59 0.06 0.29 0.06
Tikuna 18 0.44 0.00 0.44 0.11
Cocama 17 0.29 0.35 0.29 0.06
Yagua 13 0.62 0.15 0.23 0.00
into coding (57716023) and non-coding (16024-576) regions, apply-
ing the corresponding substitution rates reported previously in Soares
et al. (2009).
We generated 439 complete mitochondrial sequences to an average
coverage per sample of 1,0303, which were deposited in GenBank
with accession numbers: KY645515-KY645943 and MF152733-
MF152739. All sequences belonged to one of the main Native Ameri-
can haplogroups, namely A2, B2, C1, and D1. Haplogroups A2 and C1
were the most frequent lineages in the NWA populations (excluding
the so-called Mestizos), with more than half of all sequences belong-
ing to A2 and C1 together. Table 2 provides a breakdown of the hap-
logroup frequencies for the ethnolinguistic groups included in the
population analyses.
The CA (Figure 2) shows the clustering of populations based on
the frequency of sub-haplogroups (Supporting Information Table S1).
We observed differences among populations without a clear clustering
by language family, with the exception of the Eastern Tukanoan
groups. Most of these are clustered on the left side of the plot,
although the Tanimuka do not cluster with the other Eastern Tukanoan
groups. Additionally, Guayabero and Sikuani (who speak languages
belonging to the Guahiban family) were located close to each other in
the lower left pane of the plot. In addition to language affiliation, a few
populations clustered due to geographic proximity, namely the
Kamentsa, Pasto and Inga, who all live close to one another in the
Andean foothills. In other cases, the relatively close proximity of popu-
lations in the plot could be attributed to their being settled along the
same river or on rivers that are part of the same basin (Supporting
Information Figure S1), such as the Curripaco and Puinave, who live on
the Inírida and Atabapo Rivers.
Molecular diversity indices
The genetic variation in these communities was assessed through dif-
ferent molecular diversity indices (Figure 3 and Supporting Information
Table S2). On average, the gene diversity in these groups was high
(0.9), but there were also differences amongst them. For example, East-
ern Tukanoan groups showed consistently high values of gene diver-
sity, with the exception of the Tanimuka, who had one of the lowest
values (0.73). The Western Tukanoan groups Coreguaje (0.92) and
Siona (0.82) showed lower values than Eastern Tukanoan groups.
Among Arawakans, the Ach-Piapoco had the lowest value (0.77). The
hunter-gatherer group Nukak showed the lowest gene diversity of all
groups (0.64), with only four haplotypes observed among the 16 indi-
viduals analyzed. Additionally, agriculturalist groups tended to have
higher gene diversities (average 50.92) than hunter-gatherer groups
(average50.80) (Mann-Whitney U test, p-value5.03).
FIGURE 2 CA based on the sub-haplogroup frequencies by population. Populations are color-coded by linguistic affiliation
The mean number of pairwise differences per population showed
less variation, with an average of 41.07 617.86 differences. The
smallest values were found in Sikuani (24.08 611.18) and Nukak
(31.07 614.33), and the largest values were observed in Cocama
(44.64 620.37), Carijona (42.71620.8), and Siriano (41.38619.66)
The D values of Tajimas test of neutrality (Tajima, 1989) ranged
from 20.735 to 2.318. Under neutrality, Tajimas D is expected to be
equal to zero, and significant departures are interpreted as a result of
selection or changes in population size. Although none of the D val-
ues were significant (all p-values >0.2; Supporting Information Table
S2), positive D values >1.2 were obtained for Guayabero, Nukak,
Sikuani, and Tanimuka, which may reflect recent reductions in the
size of these populations. This hypothesis was supported both by the
distribution of pairwise differences by population (Supporting Infor-
mation Figure S2), which showed increased frequencies for the cate-
gory of small differences (0 and 1 differences) and for the category
of large differences (50 or more), as well as by the Bayesian recon-
struction of population size changes through time (BSP plots, Sup-
porting Information Figure S3). Furthermore, the Tanimuka and
Nukak had the lowest gene diversity values of any population
Shared haplotypes
A total of 216 different haplotypes were observed among the 412
sequences included in this analysis. Of these, 146 were unique haplo-
types (i.e., found in single individuals) and 70 haplotypes were shared:
39 exclusively within populations, 18 exclusively between populations,
and 13 both within and between populations. The shared haplotypes
accounted for 64.6% of all the sequences analyzed. This amount of hap-
lotype sharing between populations is quite high when compared with
other population-based studies of complete mitochondrial genomes
(Table 3). In other studies, the majority of shared haplotypes were gen-
erally observed within populations, with the exception of two African
datasets from Burkina Faso and Zambia (Barbieri, Butthof, Bostoen, &
Pakendorf, 2013; Barbieri et al. 2012), which showed low levels of shar-
ing both within and between populations. The highest level of sharing
between populations was observed for Siberian populations spread
over a large geographic area (Duggan et al., 2013); the NWA popula-
tions analyzed in this study showed the second highest value of sharing
between populations.
Figure 4 shows the proportion of pairs of sequences shared
between and within NWA populations. Siriano, Other-ET, and Pasto
FIGURE 3 Molecular diversity indices by population. Dashed lines correspond to average values, except for TajimasDtestwhereit
corresponds to zero. Populations are color-coded by linguistic affiliation as in Figure 2
were the only groups without shared haplotypes within the popula-
tions, although Siriano and Other-ET did share with other populations.
The majority of between-group haplotype sharing involved Arawakan
and Eastern Tukanoan groups. The Arawakan groups shared mostly
with groups living in close proximity (Supporting Information Figure
S4), for example, Yucu-Matapi with Tanimuka; Curripaco with Puinave
and Nukak; and Ach-Piapoco with Saliba and with the Guahiban groups
Sikuani and Guayabero. Most Eastern Tukanoan groups, who practice
linguistic exogamy, shared haplotypes among each other (except for
Tanimuka, who shared only with Yucu-Matapi). In contrast, the
TABLE 3 Shared haplotypes in a worldwide sample of complete mitochondrial sequences sampled at the population level
Shared within
Shared between
populations Sources
NW Amazonia 412 216 0.676 0.241 0.144 Present study
Burkina Faso 335 332 0.991 0.006 0.003 Barbieri et al. (2012)
SW Zambia 169 146 0.897 0.048 0.055 Barbieri et al. (2013)
Botswana/Namibia 218 128 0.75 0.188 0.133 Barbieri et al. (2014)
Philippines 365 233 0.734 0.227 0.077 Delfin et al. (2014)
Sumatra 72 48 0.771 0.229 0.021 Gunnarsdottir et al. (2011)
Taiwan 549 299 0.669 0.308 0.084 Ko et al. (2014)
Oceania 1,331 650 0.689 0.277 0.106 Duggan et al. (2014)
Siberia 525 244 0.574 0.336 0.217 Duggan et al. (2013)
113 90 0.867 0.133 0 Mizuno et al. (2014)
Note. The proportions do not sum up to 1 since some haplotypes are shared both within and between populations.
The individuals from Mexico are all Native Americans from the Mazahua and Zapotec ethnic groups.
FIGURE 4 Matrix of shared haplotypes between populations. The color scale indicates the proportion of the total haplotypes that are
shared within (on the diagonal) or between (below the diagonal) populations
Western Tukanoan groups Siona and Coreguaje shared primarily within
their populations and did not share haplotypes with the Eastern
Tukanoan groups.
The groups from the Andean foothillsInga, Kamentsa, and Pasto
showed different patterns of shared haplotypes, despite the fact that
they live in close geographic proximity. The Pasto, a group that has lost
its native language and is largely incorporated into the admixed local
population, shared no haplotypes with any population. The Kamentsa
shared haplotypes only with the Inga, while the Inga also shared haplo-
types with three other groups located further inside the Amazonian
areaCarijona, Coreguaje, and Mur-Uitoto. Finally, of the three groups
living on the banks of the Amazon River close to the town of Leticia,
the Cocama shared with both the Yagua and Tikuna, whereas the latter
two groups did not share with one another.
Haplotype networks
The networks of haplotypes (Supporting Information Figure S5AD)
complement the patterns of sequence sharing, but in addition allow us
to discern clusters of related (not just identical) haplotypes. We
observed that some of these clusters included sequences from differ-
ent language families while others were restricted to specific language
families or to groups living in close geographic proximity (Supporting
Information Table S3). For instance, Arawakan and Eastern Tukanoan
groups exhibited several haplotypes within haplogroups A2, B2, and C1
(Supporting Information Figures S5AC and Supporting Information
Table S3) that were either shared or separated by only a few muta-
tional steps. Notably, several of these clusters also included individuals
speaking Maku-Puinave languages. Clusters of haplotypes restricted to
specific groups are represented by clusters I and II of haplogroup D,
which are exclusive to Eastern Tukanoan and Huitotoan populations,
respectively. Furthermore, the haplotypes of the Quechuan populations
and the Kamentsa, who live in close proximity in the Andean foothills,
were either shared between them or closely related. Finally, the haplo-
types of the Guayabero and Sikuani (Guahiban) were mostly differenti-
ated from those of other populations and generally shared by several
individuals within the family (Clusters II and III in Supporting
Information Figure S5A; Cluster I in Supporting Information Figure
S5C). The sequences belonging to cluster I in haplogroup C lack the
diagnostic mutation A13263G for haplogroup C, but contain other
diagnostic mutations that allow unambiguous assignment to hap-
logroup C. MtDNAs with this variant were previously identified in east-
ern Colombia by RFLP typing (Torres et al., 2006), where they occurred
at high frequency in Guahibo, Piapoco, and Saliba groups. Given their
high frequencies in the Guahiban groups, these haplotypes appear to
belong to an autochthonous lineage that has diffused into other groups
living in the Orinoco basin.
Genetic structure and genetic distances
The AMOVA (Table 4) allows us to test different hypotheses about
how genetic variation is structured in NWA. We defined groups apriori
based on their language affiliation, geographic proximity, and distribu-
tion along major rivers or their tributaries to evaluate how much of the
observed variation was explained by each grouping strategy. We
observed that, of the three strategies, grouping populations by their
distribution along rivers resulted in the largest among-group compo-
nent of the genetic variance. In contrast, both language and geography
were a poor predictor of the genetic structure, showing negative and
non-significant values for the component of variance due to differen-
ces among groups. Although grouping by rivers performed better than
grouping by geography or language, it still did not provide a very good
description of the genetic structure, since the percentage of variance
due to differences among populations within groups was still higher
than the among groups component, suggesting the existence of other
influences on substructure within populations.
The matrix of genetic distances between populations based on
pairwise U
values (Supporting Information Figure S6) was used to
construct an MDS plot (Figure 5). The populations did not form any
clear clustering: the majority of populations were grouped together in
the center of the plot (indicated by the inner circle in Figure 5) with an
average pairwise U
50.03, while around the main cluster a second
group of populations showed higher differentiation (external circle,
average U
50.07). Sikuani, Siona, and the hunter-gatherer Nukak
TABLE 4 Analysis of molecular variance
No. groups Among groups Within groups Within populations Global FST
One group 1 11.12** 88.88 0.1112
14 21.33 12.37** 88.96** 0.1104
6 1.04 10.24** 88.72** 0.1128
11 5.42** 5.99** 88.59** 0.1141
1. Arawak: Yucu-Matapi, Curripaco, Ach-Piapoco; 2. Carib: Carijona; 3. Eastern-Tukanoan: Desano, Pira-Wanano, Siriano, Tuka-Tatuyo, Other-ET,
Tanimuka; 4. Western-Tukanoan: Coreguaje, Siona; 5. Guahiban: Sikuani, Guayabero; 6.Huitoto: Mur-Uitoto; 7. Maku-Puinave: Puinave, Nukak; 8.
Kamentsa; 9. Pasto; 10. Piaroa-Saliba: Saliba; 11. Peba-Yaguan: Yagua; 12. Quechua: Inga; 13.Tikuna; 14. Tupi: Cocama.
1. Saliba, Ach-Piapoco; 2.
Sikuani, Guayabero, Nukak, Desano, Pira-Wanano, Siriano, Tuka-Tatuyo, Other-ET, Carijona; 3. Coreguaje, Siona, Mur-Uitoto, Inga, Kamentsa, Pasto; 4.
Curripaco, Puinave; 5. Yucu-Matapi, Tanimuka; 6. Cocama, Tikuna, Yagua.
1. Meta: Saliba, Ach-Piapoco; 2. Vaupes: Desano, Pira-Wanano, Siriano,
Tuka-Tatuyo, Other-ET, Carijona; 3. Guaviare: Guayabero, Sikuani; 4. Interfluve: Nukak; 5. Atabapo-Inirida: Curripaco, Puinave; 6. High-Putumayo: Inga,
Kamentsa, Pasto; 7. Middle-Putumayo: Siona; 8. Lower-Putumayo: Mur-Uitoto; 9. Middle-Caqueta: Coreguaje; 10. Mirití-Parana: Yucu-Matapi,
Tanimuka; 11. Amazon: Cocama, Tikuna, Yagua.
appeared as outliers with high genetic differentiation (average
50.22). This picture did not change after adding an additional
dimension to the MDS plot (Supporting Information Figure S7). Particu-
larly striking were the small genetic distances between the Eastern
Tukanoan groups, who clustered together in the center of the MDS
plot. Although the Tanimuka appeared more distant from the main
cluster of Eastern Tukanoan groups, their pairwise U
values were not
significantly different (Supporting Information Figure S6) and the aver-
age U
(0.02) indicated low genetic differentiation among all Eastern
Tukanoan groups. In contrast, the Coreguaje and the Siona, who speak
languages of the Western Tukanoan branch, showed larger genetic dis-
tances, both with the Eastern Tukanoan groups and with each other.
Populations from each of the other language families did not form clus-
ters with their linguistic relatives. For example, Arawakan groups occu-
pied different positions in the plot and their U
values were
significantly different.
The results of the Mantel test showed a lack of significant correla-
tion between geographic distances, estimated as great-circle distances,
and the matrix of pairwise U
values (r50.07, p-value50.28). How-
ever, since rivers emerged as an important factor explaining the struc-
ture of genetic variation in the AMOVA results (Table 4), we also
performed a MRM, where we added rivers as an additional predictor
variable. Adding rivers to the regression model resulted in an increase
in the amount of variation explained by the model (Table 5), with rivers
being a significant predictor (p-value 50.01). We then jack-knifed over
populations (Pugach et al. 2016; Ramachandran et al., 2005) and
identified three populations as outliersSikuani, Siona, and Nukak
groups that appeared as outliers in the MDS plot as well (Figure 5). We
performed the multiple regression analysis excluding the outliers, and
this resulted in an increase of 3.4% in the R square value, a better cor-
relation between genetic and geographic distances, and geography
becoming a significant predictor factor (p-value <0.05) (Table 5 and
Supporting Information Figure S8), although rivers were no longer a
significant predictor of genetic subdivision.
Bayesian demographic reconstruction
BSPs were generated for each haplogroup (A2, B2, C1, and D1) and
population. All four haplogroups showed an increase in effective popu-
lation size between 17,500 and 25,000 years before present. This sig-
nal was more evident for haplogroups A2 and C1, which had the
highest number of sequences (Supporting Information Figure S9). In
contrast, the BSP plots by population showed different outcomes. We
observed four main trajectories (Supporting Information Figure S3): (a)
a signal of population size increase shown by Yucu-Matapi, Curripaco,
Desano, Siriano, Inga, Pasto, Mur-Uitoto, Tikuna, and Cocama (exem-
plified by Yucu-Matapi in Supporting Information Figure S3A); (b)
population stability through time shown by Ach-Piapoco, Tanimuka,
Coreguaje, Siona, Kamentsa, Puinave, and Yagua (exemplified by Core-
guaje in Supporting Information Figure S3B); and (c) population con-
traction shown by Sikuani, Guayabero, and Nukak (exemplified by
Nukak in Supporting Information Figure S3C), which is particularly
FIGURE 5 MDS plot based on U
genetic distances. Stress value is given in percentage. The inner circle indicates populations with low
genetic differentiation and the outer circle indicates populations with moderate differentiation. mPhiST is the average pairwise U
within each circle
striking for Sikuani (Supporting Information Figure S3D). These differ-
ences in the effective population size through time suggest that these
populations have independent demographic histories.
We have investigated the genetic diversity of ethnolinguistic groups
from NWA at the level of complete mitochondrial genomes. This area
is underrepresented in previous studies, and our data help to fill a gap
in our knowledge about the genetic diversity of modern human popula-
tions. We have found that NWA harbors a considerable amount of
genetic diversity, with evidence for contact among different ethnolin-
guistic groups, contrary to the common picture of Amazonian popula-
tions as being small and isolated with low genetic diversity (Fuselli
et al., 2003; Wang et al., 2007). NWA populations show values of
nucleotide diversity as high as or higher than those observed in most
other non-African populations (Supporting Information Figure S10),
and they display the second-highest amount of sequence sharing in a
world-wide comparison (Table 3). The complete mitochondrial genome
is the maximum level of resolution one can achieve to differentiate
individuals and populations at the maternal level, so the presence of
identical sequences among populations living in distant geographic
areas likely indicates their common ancestry and/or recent contact.
Lack of genetic structure along linguistic lines
Although our dataset includes populations speaking languages belong-
ing to different language families, we found that linguistic affiliation is a
poor predictor of genetic structure, as shown by the AMOVA (Table 4)
and the CA plot based on sub-haplogroup frequencies (Figure 2 and
Supporting Information Table S1). These results indicate that language
does not constitute a barrier to gene flow, and that groups have been
interacting with neighboring groups for some time, especially along riv-
ers, which in our analyses performed better in explaining the patterns
of genetic diversity. Archeological and linguistic evidence demonstrates
that NWA has been an area of intense contact and movement of peo-
ples of different cultural traditions, as evidenced by the diffusion of
ceramic styles (Heckenberger, 2002; Lathrap, 1970; Zucchi, 2002) and
shared subsistence strategies, by the existence of language areas and
contact-induced linguistic change (Aikhenvald, 1999), and the general-
ized multilingualism among groups (Sorensen, 1967; Stenzel, 2005).
Likewise, ethnographic studies provide additional evidence of contact
among groups. For example, both Arawakans and Eastern Tukanoans
share a ceremonial complex for male initiation known as Yurupari, in
which sacred flutes and trumpets are only played by males, as well as
sharing myths concerning the hero K
uwai (Hugh-Jones, 1979; Jackson,
1983; Vidal, 2002). In addition, the Eastern Tukanoan groups from the
Pira-Parana and Apaporis Rivers (Barasano, Makuna, and Tanimuka)
reveal Arawakan influence, since they also practice dances with masks
during the season of high abundance of the palm tree fruit pupunha
(Bactris gasipaes) (Hugh-Jones, 1979).
The genetic distances among populations provide additional evi-
dence in this regard. Although the global U
value of 0.11 indicates
moderate differentiation (Hartl and Clark, 2007), this value is driven by
three populations, namely the Siona, Sikuani, and Nukak. These are
highly differentiated from the other populations, most likely reflecting
the effects of genetic drift due to bottlenecks, as indicated by the posi-
tive Tajimas D values (Figure 3) and the distribution of pairwise differ-
ences (Supporting Information Figure S2). When we exclude these
populations, we observe an average pairwise U
of 0.07, and popula-
tions appear close together in the MDS plot (Figure 5), indicating low
genetic differentiation among NWA populations.
In this general picture, the Eastern Tukanoan groups stand apart,
since they cluster together in the CA and MDS plots (Figures 2 and 5),
and their pairwise genetic distances are small and non-significant (Sup-
porting Information Figure S6). Linguists have proposed a time depth
for the Tukanoan family of 20002500 years, based on a comparison
of the diversity in Tukanoan languages with the diversity in Romance
and Germanic languages (Chacon, 2014). The time depth of the Eastern
Tukanoan branch (and thus the time to the most recent common
ancestor of the Eastern Tukanoan languages) would be even more
recent, and as such might indicate that the peoples speaking these lan-
guages share recent common genetic ancestry as well (at least on the
maternal side). However, the Eastern Tukanoan groups practice linguis-
tic exogamy, and the close genetic relationships among these popula-
tions might be the result of this marital system, in which women move
among different ethnolinguistic groups. The consequences of the lin-
guistic exogamy are also evident in the gene diversity values and the
TABLE 5 Multiple regression analysis on distance matrices
gen.dist all populations
Reg.coefficient p-value R
Simple regression geo.dist 3.28 310
0.444 0.009 0.444
Multiple regression geo.dist 9.64 310
0.983 0.036 0.126
rivers.dist 5.71 310
gen.dist without outliers
Simple regression 6.70 310
0.026 0.067 0.026
Multiple regression 5.76 310
0.112 0.070 0.040 1.38 310
patterns of shared haplotypes. Eastern Tukanoans show the highest
gene diversity values and share more haplotypes among themselves
than with other non-Eastern Tukanoan groups. In addition, their haplo-
types tend to be closely related, as seen in the phylogenetic networks
(Supporting Information Figure S5). Analyses of the Y-chromosome as
well as nuclear markers will help to disentangle the effects of linguistic
exogamy versus recent common ancestry on the patterns of genetic
variation among Eastern Tukanoan groups.
The Tanimuka stand apart from the other Eastern Tukanoan
groups in the analyses, and this may reflect their settlement further
south, along the Apaporis and Mirití-Paran
a Rivers. Moreover, they do
not participate in the linguistic exogamic system with other Eastern
Tukanoan groups, but interact mainly with the Arawakan groups
Yucuna and Matapi. These interactions are reflected in the patterns of
haplotype sharing (Figure 4) as well as in their language, which shows
evidence of Arawakan influence (Barnes, 1999; Chacon, 2014).
The role of rivers in structuring genetic variation
Besides language, geography is another important factor in structuring
the patterns of genetic variation in human populations (Ramachandran
et al., 2005; Sch
onberg, Theunert, Li, Stoneking, & Nasidze, 2011;
Wang et al., 2007). One of the most salient characteristics of the physi-
cal landscape of NWA is the high density of rivers that drain the area,
and their importance for human populations was earlier recognized by
explorers and ethnographers that traveled through the region (Koch-
unberg, 1995; Wallace, 1853). We found that the distribution of
populations along rivers is an additional important factor influencing
their genetic structure. AMOVA (Table 4) shows that clustering popula-
tions according to the rivers where they are distributed explains more
of the genetic variation that is due to differences among groups than
does grouping them by linguistic affiliation, that is, populations living in
the same river basin or along closely connected rivers are genetically
more similar than those living on different rivers. This pattern is also
observed in the distribution of sub-haplogroups among populations,
which drive their location in the CA plot (Supporting Information Figure
S1). For example, the Curripaco and Puinave, who live on the Inírida
and Atabapo Rivers, are located close together in the plot. The pres-
ence of Coreguaje, Yucu-Matapi, and Mur-Uitoto in the center of the
plot could reflect their presence in a region where the Putumayo and
a Rivers are separated by their shortest distance, therefore facil-
itating contact among people inhabiting the basins and tributaries of
these two rivers. Indeed, one Murui individual was sampled in a Core-
guaje community, and two Uitoto individuals were sampled in the
Mirití-Parana region, thus providing evidence for the movement of
people among these groups. The results of the MRM analysis provide
additional evidence in this regard: even though no correlation between
genetic distances and geographic distances was observed via the Man-
tel test, adding river distances as an additional predictor variable
resulted in an increase of around 3% of the R-square value (Table 5),
indicating that rivers contribute to explaining a slightly higher percent-
age of the variation observed in the genetic distances.
Rivers in Amazonia serve a double function in providing a means
of communication as well as subsistence, and the wide distribution of
certain cultural traits (e.g., the production of Saladoid-Barrancoid
ceramics and circular plaza village settlement patterns) has been associ-
ated with the expansion of Arawakan-speaking populations along the
extensive system of NWA waterways (Heckenberger, 2002; Hornborg,
2005; Lathrap, 1970; Lowie, 1948). They also mark a distinction in sub-
sistence strategies between the more numerous river peoplewho
build canoes, settle along rivers, and rely on horticulture and fishing,
and the forest peoplewho inhabit the interfluvial areas, settle away
from the major rivers, and base their subsistence on foraging (Epps and
Stenzel, 2013). Additionally, the rivers have profound meanings and are
embedded in the cosmogonies of several NWA indigenous groups. The
Eastern Tukanoan creation myths describe the journeys that the ances-
tors of the people made to settle this world on board an anaconda
canoe that traveled up the Vaupes River; from the anacondasbodyall
the Tukanoan siblings emerged (Chernela, 2010; Jackson, 1983). Ara-
wakan groups also describe a series of ever returning voyages from the
sacred center of the world and the place of emergence of the first
ancestors at the rapids of Hípana on the Aiary River, covering the major
arteries of the Rio Negro, Orinoco, and Amazon Rivers (Wright, 2002;
Zucchi, 2002). Therefore, our findings about the role of rivers in struc-
turing the genetic variation are in keeping with the ethnographically
demonstrated role that rivers play for NWA populations.
The lack of fit between genetic and simple geographic distances
may be the result of relatively recent movements and the displacement
of ethnolinguistic groups from their traditional territories. Population
dynamics and population sizes were drastically altered during the last
five centuries, starting with early colonial times (16th and 17th centu-
ries), when many groups were decimated by newly introduced epidem-
ics and moved away from the accessible margins of the major rivers to
avoid the slave raids of the Spanish, Portuguese, and Dutch colonizers
(Santos-Granero 2002). Similar perturbations happened during the time
of the Christian missions in the 18th century, when many groups were
forced to relocate to multiethnic mission settlements, and finally during
the rubber boom between the 19th and beginning of the 20th centu-
ries, when the groups who managed to escape the mercenaries exploit-
ing the rubber fields resettled in remote areas in the headwaters of
small rivers (Dixon and Aikhenvald, 1999; Hill and Santos-Granero,
2002; Stenzel, 2005). The inferred reduction in population size of the
Tanimuka, Sikuani, Guayabero, and Nukak, as indicated by their low
diversity values, the positive Tajimas D values (Figure 3), the distribu-
tion of pairwise differences (Supporting Information Figure S2), and the
reconstruction of effective population sizes (Supporting Information
Figure S3C,D), might be a result of these social upheavals.
The impact of subsistence strategies
on the genetic diversity
NWA contains groups with different subsistence strategies, with manioc
(Manihot esculenta) the main staple among horticulturalist groups, who
are best described as riverine horticultural societies, given their close
association with rivers. The Nukak, in contrast, are traditionally foragers,
who still rely on hunting and gathering, and move throughout the exten-
sive area between the Guaviare and Inírida Rivers. Furthermore, the
Guayabero, Sikuani, and Puinave are traditional foragers who have only
recently undergone the transition to agriculture, and are therefore con-
sidered as HGP together with Nukak in our analyses (Table 1). Our data
show that agricultural societies (AG) have higher levels of diversity on
average than forager groups (HGP) as indicated by the Mann-Whitney U
test (p-value 5.03), while the HGP groups have larger values of Tajimas
D statistic (Figure 3) and do not show signals of population expansion
(Supporting Information Figure S3). These findings agree with patterns
reported for other HGPs around the world (Aime et al., 2013; Excoffier
and Schneider, 1999; Oota et al., 2005) and contrast with the genetic sig-
nature of an agricultural way of life, namely higher effective population
size (Patin et al., 2014), higher levels of diversity, and significantly nega-
tive values of Tajimas D test (Aime et al., 2013).
However, subsistence strategies are flexible and diverse among
NWA populations. Horticulturalists complement their diet with occa-
sional hunting and/or gathering of several kinds of palm fruit, and
extensive exchanges between AG and HGP groups have been
reported (Jackson, 1983; Milton, 1984). In this system, HGP popula-
tions usually provide meat and several products from the forest, such
as the poison curare for the tips of darts and arrows, in exchange for
different cultivated products, such as manioc and other trade goods
(Epps and Stenzel, 2013; Jackson, 1983; Milton, 1984). Nonetheless,
this exchange seems to be exclusively restricted to goods and labor,
with little or no intermarriage documented between AG and HGP
groups (Aikhenvald, 1996). In contrast, we observed shared haplo-
types between AG and HGP groups, which likely reflects recent inter-
marriage or recent common ancestry. For example, the most frequent
haplotype in the Arawakan AG group Curripaco (Haplotype H_84 in
Supporting Information Figure S4) is observed at high frequency in the
HGP Nukak (and in the Eastern Tukanoan AG group Siriano). More-
over, the HGP Puinave share several haplotypes with the AG group
Curripaco (H_219, H_161, H_117 in Supporting Information Figure
S4), a likely result of intermarriage between these groups, since there
are communities on the Inírida River where one finds individuals from
both groups. Similarly, the Guahiban HGP groups Sikuani and Guaya-
bero exhibit a haplotype at high frequency (H_43 in Supporting Infor-
mation Figure S4) that is shared with the AG Ach-Piapoco as well as
further haplotypes related to haplotypes found in AG Arawakan
groups (Clusters I and II in Supporting Information Figure S5B,C). This
may reflect contact among them, since there are Piapoco communities
on the lower Guaviare River as well as Sikuani communities on the
Meta River, places where these groups overlap.
However, it is difficult to determine the direction of the gene flow
or to distinguish between contact and common ancestry as explana-
tions for shared mtDNA haplotypes. Nevertheless, it is plausible that,
where haplotypes are shared, the source population is the one in which
the haplotype is present at higher frequency. For instance, the shared
haplotype between the HGP Puinave and the AG Curripaco (H_219 in
Supporting Information Figure S4) has a likely origin in Puinave,
because of its higher frequency and the presence of related haplotypes
in Puinave (Cluster I Supporting Information Figure S5B). The source of
the shared haplotype among the HGP Nukak and the AG Curripaco
and Siriano (H_84 in Supporting Information Figure S4) is more difficult
to infer, since its frequency is similar in the Nukak and in the Curripaco;
furthermore, three other haplotypes present in the HGP Nukak and
Guayabero are only one mutation apart from it (Cluster II in Supporting
Information Figure S5B). Therefore, it is likely that this haplotype, too,
moved from the HGP populations into the AG Curripaco. A similar
explanation could be given for H_43 in Figure S4, which is part of the
Cluster I in Supporting Information Figure S5C, moving from the HGP
Guayabero and Sikuani into the AG Ach-Piapoco.
Thus, these observations seem to fit a scenario of asymmetric
gene flow in which women move from HGP to AG, a pattern that has
been reported for populations in Central and Southern Africa (Barbieri
et al., 2014; Destro-Bisol et al., 2004; Verdu et al., 2013). However,
this scenario will need to be further refined by analyses of Y-
chromosome and genome-wide data, which will allow us to determine
whether the gene flow among groups was sex-biased (i.e., involving the
movement of only females or only males among groups) and to make
inferences about the time and magnitude of these events.
In conclusion, this study provides new data from this remote and
little-studied part of the world, which allow insights into the impact of
cultural practices on the patterns of genetic variation and on the popu-
lation dynamics of NWA groups. Although our current data do not
allow us to distinguish whether the population movements took place
prior to or as a consequence of European contact, analyses of Y-
chromosome variation and genome-wide data will shed further light on
the genetic history of NWA. Furthermore, historical genetic studies will
benefit from more archeological work in NWA, since huge areas remain
completely unexplored.
We are grateful to all sample donors, communities, community lead-
ers, and regional indigenous organizations. L.A. especially gives
thanks to Consejo Regional Indígena del Guaviare, Asociaci
on de
Autoridades Tradicionales y Cabildos de los Pueblos Indígenas del
Municipio de Leguízamo y Alto Resguardo Predio Putumayo,
on de Cabildos Indígenas del Trapecio Amaz
onico, Organ-
on Zonal Indígena Del Putumayo, the staff of Parques Nacio-
nales Naturales in Puerto Leguízamo, the office of Indigenous Affairs
in Puerto Inírida, Rafael Rodriguez, William Yucuna, the late Gustavo
Arias, and all people who helped during the fieldtrips for their valua-
ble collaboration and warm welcome during our stay in their com-
munities. We also acknowledge Roland Schroeder for laboratory
technical assistance, and Enrico Macholdt, Alexander Huebner, Irina
Pugach, and Michael Dannemann for advice with the data analyses.
B.P. acknowledges the LABEX ASLAN (ANR-10-LABX-0081) of Uni-
e de Lyon for its financial support within the program Inves-
tissements dAvenir(ANR-11-IDEX-0007) of the French government
operated by the National Research Agency (ANR). L.A. was supported
by a graduate grant from COLCIENCIAS; research was supported by
funds from the Max Planck Society.
Leonardo Arias
Aceituno, F. J., Loaiza, N., Delgado-Burbano, M. E., & Barrientos, G.
(2013). The initial human settlement of Northwest South America
during the Pleistocene/Holocene transition: Synthesis and perspec-
tives. Quaternary International,301,2333.
Achilli, A., Perego, U. A., Lancioni, H., Olivieri, A., Gandini, F., Hooshiar
Kashani, B., ... Torroni, A. (2013). Reconciling migration models to
the Americas with the variation of North American native mitoge-
nomes. Proceedings of the National Academy of Science of the United
States of America,110, 1430814313.
Aikhenvald, A. Y. (1996). Areal diffusion in Northwest Amazonia: The
case of Tariana. Anthropological Linguistics,38,73116.
Aikhenvald, A. Y. (1999). The Arawak language family. In: Dixon RMW,
and Aikhenvald AY, editors. The amazonian languages (pp. 65102).
Cambridge; New York: Cambridge University Press.
Aime, C., Laval, G., Patin, E., Verdu, P., Segurel, L., Chaix, R., ... Auster-
litz, F. (2013). Human genetic data reveal contrasting demographic
patterns between sedentary and nomadic populations that predate
the emergence of farming. Molecular Biology & Evolution,30, 2629
Andrews, R. M., Kubacka, I., Chinnery, P. F., Lightowlers, R. N., Turnbull,
D. M., & Howell, N. (1999). Reanalysis and revision of the Cambridge
reference sequence for human mitochondrial DNA. Nature Genetics,
23, 147.
Arango, R., & S
anchez, E. (2004). Los pueblos indígenas de Colombia en el
umbral del nuevo milenio : poblaci
on, cultura y territorio : bases para el
fortalecimiento social y econ
omico de los pueblos indígenas. Bogot
Departamento Nacional de Planeaci
on. 526 p.
Bailliet, G., Rothhammer, F., Carnese, F. R., Bravi, C. M., & Bianchi, N. O.
(1994). Founder mitochondrial haplotypes in Amerindian populations.
American Journal of Human Genetics,55,2733.
Barbieri, C., Butthof, A., Bostoen, K., & Pakendorf, B. (2013). Genetic
perspectives on the origin of clicks in Bantu languages from south-
western Zambia. European Journal of Human Genetics,21, 430436.
Barbieri, C., Guldemann, T., Naumann, C., Gerlach, L., Berthold, F., Naka-
gawa, H., ... Pakendorf, B. (2014). Unraveling the complex maternal
history of Southern African Khoisan populations. American Journal of
Physical Anthropology,153, 435448.
Barbieri, C., Heggarty, P., Castri, L., Luiselli, D., & Pettener, D. (2011).
Mitochondrial DNA variability in the Titicaca basin: Matches and mis-
matches with linguistics and ethnohistory. American Journal of Human
Barbieri, C., Whitten, M., Beyer, K., Schreiber, H., Li, M., & Pakendorf, B.
(2012). Contrasting maternal and paternal histories in the linguistic
context of Burkina Faso. Molecular Biology & Evolution,29, 1213
Barnes, J. (1999). Tucano. In: Dixon RMW, and Aikhenvald AY, editors.
The amazonian languages (pp. 207225). New York: Cambridge Uni-
versity Press.
Bodner, M., Perego, U. A., Huber, G., Fendt, L., Rock, A. W., Zimmer-
mann, B., ... Parson, W. (2012). Rapid coastal spread of First Ameri-
cans: Novel insights from South Americas Southern Cone
mitochondrial genomes. Genome Research,22, 811820.
Bolnick, D. A., & Smith, D. G. (2003). Unexpected patterns of mitochon-
drial DNA variation among Native Americans from the southeastern
United States. American Journal of Physical Anthropology,122, 336
Bonatto, S. L., & Salzano, F. M. (1997). Diversity and age of the four
major mtDNA haplogroups, and their implications for the peopling
of the New World. American Journal of Human Genetics,61, 1413
Campbell, L. (1997). American Indian languages: The historical linguistics of
native America (pp. 528). New York: Oxford University Press.
Chacon, T. (2014). A revised proposal of proto-tukanoan consonants and
Tukanoan family classification. International Journal of American Lin-
guistics,80, 275322.
Chacon, T., & Cay
on, L. (2013). Consideraç~
oes sobre a exogamia linguís-
tica no Noroeste Amaz^
onico. Revista de Letras da Universidade
olica de Brasília,6,620.
Chernela, J. M. (2010). The Wanano Indians of the Brazilian Amazon: A
Sense of Space. Austin: University of Texas Press. 207 p.
Darriba, D., Taboada, G. L., Doallo, R., & Posada, D. (2012). jModelTest
2: more models, new heuristics and parallel computing. Nature Meth-
ods,9, 772.
de Saint Pierre, M., Gandini, F., Perego, U. A., Bodner, M., Gomez-Car-
balla, A., Corach, D., ... Olivieri, A. (2012). Arrival of Paleo-Indians to
the southern cone of South America: New clues from mitogenomes.
PLoS One,7, e51311.
Dediu, D., & Levinson, S. C. (2012). Abstract Profiles of Structural Stabil-
ity Point to Universal Tendencies, Family-Specific Factors, and
Ancient Connections between Languages. Plos One,7(9):e45198.
Delfin, F., Min-Shan Ko, A., Li, M., Gunnarsdottir, E. D., Tabbada, K. A.,
Salvador, J. M., ... Stoneking, M. (2014). Complete mtDNA genomes
of Filipino ethnolinguistic groups: a melting pot of recent and ancient
lineages in the Asia-Pacific region. European Journal of Human Genet-
ics,22, 228237.
Denevan, W. M. (1992). The Pristine Myth: The landscape of the Ameri-
cas in 1492. Annals of the Association of American Geographers,82,
Destro-Bisol, G., Donati, F., Coia, V., Boschi, I., Verginelli, F., Caglia, A.,
... Capelli, C. (2004). Variation of female and male lineages in sub-
Saharan populations: the importance of sociocultural factors. Molecu-
lar Biology & Evolution,21, 16731682.
Dixon, R. M. W., & Aikhenvald, A. Y. (1999). The Amazonian languages
(Vol. xxviii, pp. 446). Cambridge University Press.
Dray, S., & Dufour, A. B. (2007). The ade4 package: implementing the
duality diagram for ecologists. Journal of Statistical Software,22,
Drummond, A. J., Suchard, M. A., Xie, D., & Rambaut, A. (2012). Bayesian
phylogenetics with BEAUti and the BEAST 1.7. Molecular Biology &
Evolution,29, 19691973.
Duggan, A. T., Whitten, M., Wiebe, V., Crawford, M., Butthof, A., Spitsyn,
V., ... Pakendorf, B. (2013). Investigating the prehistory of Tungusic
peoples of Siberia and the Amur-Ussuri region with complete mtDNA
genome sequences and Y-chromosomal markers. PLoS One,8,
Epps, P., & Stenzel, K. (2013). Upper Rio Negro: Cultural and linguistic
interaction in northwestern Amazonia (pp. 1350). Rio de Janeiro:
Museu do
Indio FUNAI, Museu Nacional.
Eriksen, L. (2011). Nature and culture in prehistoric Amazonia [PhD].
Lund: Lund University.
Excoffier, L., & Lischer, H. E. (2010). Arlequin suite ver 3.5: A new series
of programs to perform population genetics analyses under Linux and
Windows. Molecular Ecology Resources,10, 564567.
Excoffier, L., & Schneider, S. (1999). Why hunter-gatherer populations do
not show signs of pleistocene demographic expansions. Proceedings
of the National Academy Sciences of the United States of America,96,
Fagundes, N. J., Kanitz, R., Eckert, R., Valls, A. C., Bogo, M. R., Salzano, F.
M., ... Bonatto, S. L. (2008). Mitochondrial population genomics sup-
ports a single pre-Clovis origin with a coastal route for the peopling
of the Americas. American Journal of Human Genetics,82, 583592.
Fehren-Schmitz, L., Llamas, B., Lindauer, S., Tomasto-Cagigao, E., Kuz-
minsky, S., Rohland, N., ... Haak, W. (2015). A re-appraisal of the
early Andean human remains from Lauricocha in Peru. PLoS One,10,
Franco, R. (2002). Los carijonas de Chiribiquete (pp. 318). Bogot
on Puerto Rastrojo.
Fuselli, S., Tarazona-Santos, E., Dupanloup, I., Soto, A., Luiselli, D., & Pet-
tener, D. (2003). Mitochondrial DNA diversity in South America and
the genetic history of Andean highlanders. Molecular Biology & Evolu-
tion,20, 16821691.
Gaya-Vidal, M., Moral, P., Saenz-Ruales, N., Gerbault, P., Tonasso, L., Vil-
lena, M., ... Dugoujon, J. M. (2011). mtDNA and Y-chromosome
diversity in Aymaras and Quechuas from Bolivia: different stories and
special genetic traits of the Andean Altiplano populations. American
Journal of Physical Anthropology,145, 215230.
Gnecco, C., & Mora, S. (1997). Late Pleistocene early Holocene tropical
forest occupations at San Isidro and Pena Roja, Colombia. Antiquity,
71, 683690.
Goslee, S. C., & Urban, D. L. (2007). The ecodist package for
dissimilarity-based analysis of ecological data. Journal of Statistical
Gunnarsdottir, E. D., Li, M., Bauchet, M., Finstermeier, K., & Stoneking,
M. (2011). High-throughput sequencing of complete human mtDNA
genomes from the Philippines. Genome Research,21,111.
Hall, T. A. (1999). BioEdit: A user-friendly biological sequence alignment
editor and analysis program for Windows 95/98/NT. Nucleic Acids
Symposium Series,41,4.
Hartl, D. L., & Clark, A. G. (2007). Principles of population genetics (Vol.
xv, pp. 652). Sunderland, MA: Sinauer Associates, Inc.
Heckenberger, M. J. (2002). Rethinking the Arawakan Diaspora: Hierar-
chy, Regionality, and the Amazonian Formative. In: Hill JD, and
Santos-Granero F, editors. Comparative Arawakan Histories: Rethinking
Language Family and Culture Area in Amazonia. Urbana: University of
Illinois Press. p 99122.
Heckenberger, M. J. (2008). Amazonian Mosaics: Identity, Interaction,
and Integration in the Tropical Forest. In H. Silverman & W. H. Isbell
(Eds.), The handbook of South American archaeology (pp. 941961).
New York, NY: Springer New York.
Hijmans, R. J. (2016). geosphere: Spherical Trigonometry. R package version
Hill, J. D., & Santos-Granero, F. (2002). Comparative Arawakan histories:
Rethinking language family and culture area in Amazonia (pp. 340).
Urbana: University of Illinois Press.
Hock, H. H., & Joseph, B. D. (2009). Language History, Language Change,
and Language Relationship: An Introduction to Historical and Compara-
tive Linguistics. Berlin: Mouton De Gruyter. 588 p.
Hornborg, A. (2005). Ethnogenesis, regional integration, and ecology in
prehistoric Amazonia - Toward a system perspective. Current Anthro-
pology,46, 589620.
Hugh-Jones, S. (1979). The palm and the Pleiades: initiation and cosmology
in northwest Amazonia (Vol. xvi, pp. 332). Cambridge; New York:
Cambridge University Press.
Jackson, J. E. (1983). The fish people: Linguistic exogamy and Tukanoan
identity in northwest Amazonia (Vol. xix, pp. 287). Cambridge:
Cambridge University Press.
Kassambara, A., & Mundt, F. (2016). factoextra: Extract and Visualize the
Results of Multivariate Data Analyses. R package version 1.0.3 ed:
Katoh, K., & Standley, D. M. (2013). MAFFT multiple sequence alignment
software version 7: Improvements in performance and usability.
Molecular Biology & Evolution,30, 772780.
Keyeux, G., Rodas, C., Gelvez, N., & Carter, D. (2002). Possible migration
routes into South America deduced from mitochondrial DNA studies
in Colombian Amerindian populations. Human Biology,74, 211233.
Kirby, K. R., Gray, R. D., Greenhill, S. J., Jordan, F. M., Gomes-Ng, S.,
Bibiko, H. J., Blasi, D. E., Botero, C. A., Bowern, C., Ember, C. R. et al.
(2016). D-PLACE: A Global Database of Cultural, Linguistic and Envi-
ronmental Diversity. Plos One,11(7):e0158391.
Kircher, M., Sawyer, S., & Meyer, M. (2012). Double indexing overcomes
inaccuracies in multiplex sequencing on the Illumina platform. Nucleic
Acids Research,40, e3.
Kivisild, T. (2015). Maternal ancestry and population history from whole
mitochondrial genomes. Investigating Genetics,6,3.
Kloss-Brandstatter, A., Pacher, D., Schonherr, S., Weissensteiner, H.,
Binna, R., Specht, G., & Kronenberg, F. (2011). HaploGrep: a fast and
reliable algorithm for automatic classification of mitochondrial DNA
haplogroups. Human Mutation,32,2532.
Ko, A. M., Chen, C. Y., Fu, Q., Delfin, F., Li, M., Chiu, H. L., Stoneking,
M., & Ko, Y. C. (2014). Early Austronesians: into and out of Taiwan.
Am J Hum Genet,94(3):426436.
unberg, T. (1995). Dos a~
nos entre los indios: viajes por el noroeste
no, 1903-1905. Bogot
a: Editorial Universidad Nacional. 364 p.
Kondo, R. (2002). En pos de los guahibos: Prehist
oricos, hist
oricos y
actuales: Con pistas ling
uísticas (pp. 181). Bogot
a, Colombia: Editorial
Alberto Lleras Camargo.
Landaburu, J. (2000). Clasificaci
on de las lenguas indígenas de Colombia.
In: P. M. S. Gonz
alez, M. M. L. Rodríguez, & Instituto Caro y Cuervo
(Eds.), Lenguas indígenas de Colombia, una visi
on descriptive (pp. 848).
e de Bogot
a: Instituto Caro y Cuervo.
Langmead, B., & Salzberg, S. L. (2012). Fast gapped-read alignment with
Bowtie 2. Nature Methods,9, 357359.
Lathrap, D. W. (1970). The upper Amazon (pp. 256). London: Thames &
Le, S., Josse, J., & Husson, F. (2008). FactoMineR: An R package for mul-
tivariate analysis. Journal of Statistical Software,25,118.
Lee, E. J., & Merriwether, D. A. (2015). Identification of whole mitochon-
drial genomes from Venezuela and implications on regional phyloge-
nies in South America. Human Biology,87,2938.
Lewis, C. M., Jr., Lizarraga, B., Tito, R. Y., Lopez, P. W., Iannacone, G. C.,
Medina, A., ... Stone, A. C. (2007). Mitochondrial DNA and the peo-
pling of South America. Human Biology,79, 159178.
Li, H., & Durbin, R. (2009). Fast and accurate short read alignment with
Burrows-Wheeler transform. Bioinformatics,25, 17541760.
Li, M., Schroeder, R., Ko, A., & Stoneking, M. (2012). Fidelity of capture-
enrichment for mtDNA genome sequencing: influence of NUMTs.
Nucleic acids research,40(18):e137.
Llamas, B., Fehren-Schmitz, L., Valverde, G., Soubrier, J., Mallick, S., Roh-
land, N., ... Haak, W. (2016). Ancient mitochondrial DNA provides
high-resolution time scale of the peopling of the Americas. Science
Advances,2, e1501385.
Lowie, R. H. (1948). The tropical forests: An introduction. In: J. H.
Steward (Ed.), Handbook of South American Indians (pp. 156). Wash-
ington: U.S. G.P.O.
Malhi, R. S., Schultz, B. A., & Smith, D. G. (2001). Distribution of mito-
chondrial DNA lineages among Native American tribes of Northeast-
ern North America. Human Biology,73,1755.
Maricic, T., Whitten, M., & Paabo, S. (2010). Multiplexed DNA sequence
capture of mitochondrial genomes using PCR products. PLoS One,5,
Marrero, A. R., Silva-Junior, W. A., Bravi, C. M., Hutz, M. H., Petzl-Erler,
M. L., Ruiz-Linares, A., ... Bortolini, M. C. (2007). Demographic and
evolutionary trajectories of the Guarani and Kaingang natives of Bra-
zil. American Journal of Physical Anthropology,132, 301310.
Meggers, B. J. (1954). Environmental limitation on the development of
culture. American Anthropologist,56, 801824.
Meggers, J. (1948). The archeology of the Amazon Basin. In J. H. Stew-
ard (Ed.), Handbook of South American Indians (pp. 149166). Wash-
ington: U.S. G.P.O.
Melton, P. E., Briceno, I., Gomez, A., Devor, E. J., Bernal, J. E., & Craw-
ford, M. H. (2007). Biological relationship between Central and South
American Chibchan speaking populations: evidence from mtDNA.
American Journal of Physical Anthropology,133, 753770.
Miller, S. A., Dykes, D. D., & Polesky, H. F. (1988). A simple salting out
procedure for extracting DNA from human nucleated cells. Nucleic
Acids Research,16, 1215.
Milton, K. (1984). Protein and Carbohydrate Resources of the Maku Indi-
ans of Northwestern Amazonia. American Anthropologist,86,727.
Nettle, D. (1999). Linguistic diversity of the Americas can be reconciled
with a recent colonization. Proceedings of the National Academy of Sci-
ence of the United States of America,96, 33253329.
u, C. (1950). Reconhecimento dos rios Iç
ana, Ayarí e Uaup
Journal de la Soci
e des Am
ericanistes,39, 125182.
Oota, H., Pakendorf, B., Weiss, G., von Haeseler, A., Pookajorn, S., Set-
theetham-Ishida, W., ... Stoneking, M. (2005). Recent origin and cul-
tural reversion of a hunter-gatherer group. PLoS Biology,3, e71.
Patin, E., Siddle, K. J., Laval, G., Quach, H., Harmant, C., Becker, N., ...
Quintana-Murci, L. (2014). The impact of agricultural emergence on
the genetic history of African rainforest hunter-gatherers and agricul-
turalists. Nature Communication,5, 3163.
Perego, U. A., Achilli, A., Angerhofer, N., Accetturo, M., Pala, M., Olivieri,
A., ... Kong, Q. P. et al, (2009). Distinctive Paleo-Indian migration
routes from Beringia marked by two rare mtDNA haplogroups. Cur-
rent Biology,19,18.
Perego, U. A., Angerhofer, N., Pala, M., Olivieri, A., Lancioni, H., Hooshiar
Kashani, B., ... Torroni, A. (2010). The initial peopling of the Ameri-
cas: A growing number of founding mitochondrial genomes from
Beringia. Genome Research,20, 11741179.
Pugach, I., Matveev, R., Spitsyn, V., Makarov, S., Novgorodov, I., Osakov-
sky, V., ... Pakendorf, B. (2016). The complex admixture history and
recent southern origins of Siberian populations. Molecular Biology &
Evolution,33, 17771795.
Quinque, D., Kittler, R., Kayser, M., Stoneking, M., & Nasidze, I. (2006).
Evaluation of saliva as a source of human DNA for population and
association studies. Analytical Biochemistry,353, 272277.
Ramachandran, S., Deshpande, O., Roseman, C. C., Rosenberg, N. A.,
Feldman, M. W., & Cavalli-Sforza, L. L. (2005). Support from the rela-
tionship of genetic and geographic distance in human populations for
a serial founder effect originating in Africa. Proceedings of the
National Academy of Science of the United States of America,102,
Renaud, G., Kircher, M., Stenzel, U., & Kelso, J. (2013). freeIbis: an effi-
cient basecaller with calibrated quality scores for Illumina sequencers.
Bioinformatics,29, 12081209.
Renaud, G., Stenzel, U., & Kelso, J. (2014a). LeeHom: Adaptor trimming
and merging for Illumina sequencing reads. Nucleic Acids Research,
42, e141.
Renaud, G., Stenzel, U., Maricic, T., Wiebe, V., & Kelso, J. (2014b). deML:
Robust demultiplexing of Illumina sequences using a likelihood-based
approach. Bioinformatics,31(5):770772.
os-Granero, F. (2002). The Arawakan Matrix: Ethos, Language, and
History in Native South America. In: Hill JD, and Santos-Granero F,
editors. Comparative Arawakan histories: Rethinking language family
and culture area in Amazonia (pp. 2550). Urbana: University of Illi-
nois Press.
onberg, A., Theunert, C., Li, M., Stoneking, M., & Nasidze, I. (2011).
High-throughput sequencing of complete human mtDNA genomes
from the Caucasus and West Asia: high diversity and demographic
inferences. European Journal of Human Genetics,19, 988994.
Schurr, T. G. (2004). The peopling of the New World: Perspectives from
molecular anthropology. Annual Review of Anthropology,33, 551583.
Soares, P., Ermini, L., Thomson, N., Mormina, M., Rito, T., Rohl, A., ...
Richards, M. B. (2009). Correcting for purifying selection: an
improved human mitochondrial molecular clock. American Journal of
Human Genetics,84, 740759.
Sorensen, A. P. (1967). Multilingualism in the Northwest Amazon. Ameri-
can Anthropologist,69, 670684.
Stenzel, K. (2005). Multilingualism in the northwest Amazon, revisited.
Memorias del Congreso de Idiomas Indígenas de Latinoam
erica-II (pp. 1
28). Austin: University of Texas.
Tajima, F. (1989). Statistical method for testing the neutral mutation
hypothesis by DNA polymorphism. Genetics,123, 585595.
Torres, M. M., Bravi, C. M., Bortolini, M. C., Duque, C., Callegari-Jacques,
S., Ortiz, D., ... Ruiz-Linares, A. (2006). A revertant of the major
founder Native American haplogroup C common in populations from
northern South America. American Journal of Human Biology,18,59
Torroni, A., Schurr, T. G., Cabell, M. F., Brown, M. D., Neel, J. V., Larsen,
M., ... Wallace, D. C. (1993). Asian affinities and continental radia-
tion of the four founding Native American mtDNAs. American Journal
of Human Genetics,53, 563590.
Uribe Tob
on, C. A., & Instituto Colombiano de Cultura, H. (1992). Geo-
grafía humana de Colombia. Santa Fe de Bogot
a: Instituto Colombiano
de Cultura Hisp
van Oven, M., & Kayser, M. (2009). Updated comprehensive phyloge-
netic tree of global human mitochondrial DNA variation. Human
Mutation,30, E386E394.
Verdu, P., Becker, N. S., Froment, A., Georges, M., Grugni, V., Quintana-
Murci, L., ... Austerlitz, F. (2013). Sociocultural behavior, sex-biased
admixture, and effective population sizes in Central African Pygmies
and non-Pygmies. Molecular Biology & Evolution,30, 918937.
Vidal, S. M. (1997). Liderazgo y confederaciones multi
etnicas amerindias
en la amazonia luso-hispana del. Siglo XVIII Revista Antropol
Vidal, S. M. (2002). Secret religious cults and political leadership: Multieth-
nic confederacies from Northwestern Amazonia (pp. 248268). Compar-
ative Arawakan Histories: University of Illinois Press.
Wallace, A. R. (1853). A narrative of travels on the Amazon and Rio Negro:
With an account of the native tribes, and observations on the climate,
geology, and natural history of the Amazon valley (Vol. viii, pp. 541).
London: Reeve and Co.
Wang, S., Lewis, C. M., Jakobsson, M., Ramachandran, S., Ray, N., Bed-
oya, G., ... Gallo, C. et al, (2007). Genetic variation and population
structure in native Americans. PLoS Genetics,3, e185.
Wright, R. M. (2002). Prophetic traditions among the Baniwa and other
Arawakan peoples of the northwest Amazon (pp. 269294). Compara-
tive Arawakan Histories: University of Illinois Press.
Yunusbayev, B., Metspalu, M., Jarve, M., Kutuev, I., Rootsi, S., Metspalu, E., ...
Villems, R. (2012). The Caucasus as an asymmetric semipermeable barrier
to ancient human migrations. Molecular Biology & Evolution,29,359365.
Zucchi, A. (2002). A new model of the Northern Arawakan expansion. In
J. D. Hill & F. Santos-Granero (Eds.), Comparative Arawakan histories:
Rethinking language family and culture area in Amazonia (pp. 199
222). Urbana: University of Illinois Press.
Additional Supporting Information may be found online in the sup-
porting information tab for this article.
How to cite this article: Arias L, Barbieri C, Barreto G, Stoneking
M, Pakendorf B. High-resolution mitochondrial DNA analysis
sheds light on human diversity, cultural interactions, and popula-
tion mobility in Northwestern Amazonia. Am J Phys Anthropol.
... The calculation of the molecular diversity indices shown in Tables S1 and S2 [7,9,30,40,41], the estimation of the Fst distance matrices used for MDS analysis (φst for mtDNA haplotypes and Rst for Y-STRs), the shared haplotypes for both the mtDNA and Y chromosome data, and the AMOVA measures were performed with Arlequin v3.5 [42]. To visualize the distances between populations, a principal component analysis (PCA) was performed with mtDNA haplogroups frequency data by smart pcabelonging to the Eigensoft package [43] on a dataset of 53 populations from all over northern South America [6,7,9,25,40,[44][45][46][47][48][49][50][51][52][53][54][55], shown in Table S3. The frequencies of the mitochondrial sub-lineages are shown in the table and pie chart in Table S4. ...
... For all of the haplotype-level analyses, two comparative datasets of both the mtDNA and the Y-STRs haplotypes have been used. The mtDNA dataset includes the 313 full mtDNA genomes listed in Table S5 from the northwestern area of South America (Peru, Ecuador, Colombia, and Bolivia), which were selected on the basis of the geographical proximity of these populations to our area of interest and on the availability of data regarding the linguistic affiliation [27,28,40,[54][55][56][57][58]; moreover, the Y-STRs dataset includes 316 haplotypes for 14 loci listed in Table S6 over the same geographical area and belonging to the same language families where this kind of information was available [7,9,30,40,55]. We attempted to build two datasets that were as comparable as possible. ...
... We also tested the genetic variation at the level of complete mtDNA sequences by means of the molecular diversity indices (Table S1). The haplotype diversity (HD) of mitochondrial sequences is very high (0,99) and is comparable to some other agriculturalist northwestern populations whose entire mtDNA genomes are available [54]. Indeed, the Tsachilas are fishermen and slash-andburn agriculturalists who settled in the forest in single-family houses [65]. ...
Full-text available
Together with Cayapas, the Tsachilas constitute the oldest population in the country of Ecuador and, according to some historians, they are the last descendants of the ancient Yumbos. Several anthropological issues underlie the interest towards this peculiar population: the uncertainty of their origin, their belonging to the Barbacoan linguistic family, which is still at the center of an intense linguistic debate, and the relations of their Yumbo ancestors with the Inca invaders who occupied their ancient territory. Our contribution to the knowledge of their complex past was the reconstruction of their genetic maternal and paternal inheritance through the sequencing of 70 entire mitochondrial genomes and the characterization of the non-recombinant region of the Y chromosome in 26 males. For both markers, we built comprehensive datasets of various populations from the surrounding geographical area, northwestern South America, NW, with a known linguistic affiliation, and we could then compare our sample against the overall variability to infer relationships with other Barbacoan people and with other NW natives. We found contrasting patterns of genetic diversity for the two markers, but generally, our results indicated a possible common origin between the Tsachilas, the Chachi, and other Ecuadorian and Colombian Barbacoans and are suggestive of an interesting ancient linkage to the Inca invaders in Yumbo country.
... (Golden Software, LLC), using the Kriging method for interpolating frequency values. To investigate associations between haplotypes and linguistic affiliation, and thus scrutinize the native component in the country, we searched for shared haplotypes among Paraguay and Native South American populations [26,[62][63][64][65][66][67][68][69] (see details in Supplementary Table S4). ...
... One C1 haplotype was previously described only in two samples from a Peruvian native population speaking Jivaro [69]. Few D haplotypes were found in native communities from northwestern Amazonia [68]. ...
... Therefore, the presence in Paraguay of lineages from different sources in addition to Tupi-Guarani cannot be discarded. Nonetheless, an association between genetics and linguistics or ethnicity can be misleading due to genetic flow that has been reported among native groups with different linguistic and/or ethnic backgrounds [68,95]. ...
Full-text available
Immigrants from diverse origins have arrived in Paraguay and produced important demographic changes in a territory initially inhabited by indigenous Guarani. Few studies have been performed to estimate the proportion of Native ancestry that is still preserved in Paraguay and the role of females and males in admixture processes. Therefore, 548 individuals from eastern Paraguay were genotyped for three marker sets: mtDNA, Y-SNPs and autosomal AIM-InDels. A genetic homogeneity was found between departments for each set of markers, supported by the demographic data collected, which showed that only 43% of the individuals have the same birthplace as their parents. The results show a sex-biased intermarriage, with higher maternal than paternal Native American ancestry. Within the native mtDNA lineages in Paraguay (87.2% of the total), most haplogroups have a broad distribution across the subcontinent, and only few are concentrated around the Paraná River basin. The frequency distribution of the European paternal lineages in Paraguay (92.2% of the total) showed a major contribution from the Iberian region. In addition to the remaining legacy of the colonial period, the joint analysis of the different types of markers included in this study revealed the impact of post-war migrations on the current genetic background of Paraguay.
... Q-BY15730, observed in modern individuals from North-West Amazonia, was dated at 13.2 kya, slightly before the first human traces in the area, and its phylogeography draws a displacement from Ecuador/Colombia into the Amazon region following river streams. The importance of rivers in the settlement and human activities in this part of South America was previously discussed by Arias et al. [89], and the distribution of Q-BY15730 points to a Pacific Coastal origin. It should be noted that Q-BY104773, which is of similar age and occupies roughly the same area as Q-BY15730, does not show such a marked cline in Colombia. ...
... It should be noted that Q-BY104773, which is of similar age and occupies roughly the same area as Q-BY15730, does not show such a marked cline in Colombia. A possible explanation for the peculiar distribution of Q-BY15730 could be that it is mainly found in Tukanoan groups, where the custom of patrilocality is particularly strong, thus strengthening the geographic separation of male lineages [36,89]. ...
Full-text available
Uniparental genetic systems are unique sex indicators and complement the study of autosomal diversity by providing landmarks of human migrations that repeatedly shaped the structure of extant populations. Our knowledge of the variation of the male-specific region of the Y chromosome in Native Americans is still rather scarce and scattered, but by merging sequence information from modern and ancient individuals, we here provide a comprehensive and updated phylogeny of the distinctive Native American branches of haplogroups C and Q. Our analyses confirm C-MPB373, C-P39, Q-Z780, Q-M848, and Q-Y4276 as the main founding haplogroups and identify traces of unsuccessful (pre-Q-F1096) or extinct (C-L1373*, Q-YP4010*) Y-chromosome lineages, indicating that haplogroup diversity of the founder populations that first entered the Americas was greater than that observed in the Indigenous component of modern populations. In addition, through a diachronic and phylogeographic dissection of newly identified Q-M848 branches, we provide the first Y-chromosome insights into the early peopling of the South American hinterland (Q-BY104773 and Q-BY15730) and on overlying inland migrations (Q-BY139813).
... Amazonian indigenous peoples inhabit areas of immense biological, linguistic, and cultural diversity (Arias et al. 2018), which means that their reserves represent areas of primary importance for conservation. Indigenous communities play an important role in mitigating the detrimental effects of deforestation throughout their development activities and conservation projects (Alcorn 1993, Zimmerman et al. 2001. ...
Full-text available
The Amazon basin is undergoing processes of global change that threaten one of the planet's greatest sources of biological and cultural richness. Among different conservation strategies, indigenous reserves play a key role by reducing the impact of factors such as the deforestation caused by the expansion of the agricultural frontier. Despite the demonstrated importance of these indigenous territories, there is a lack of studies evaluating their contribution to the regional diversity in Colombia, especially for invertebrate groups. By surveying indigenous territories for the first time in the Amazonas department , we investigated the army ant fauna inhabiting different land use types: forests; "chagras" in different stages of regeneration; and pastures. In a sampling plot that represents just 0.0014 % of the department's area, four genera and eight army ants species were recorded, representing 47 % of known species in the Amazonas department and 18 % in Colombia. Our sampling revealed three new records for the department and Neivamyrmex diana as a new record for the country. Mature forests harbored six species, while the chagras system retained up to five species. Notably, when placing our results in a national context, we revealed several country regions and departments (e.g., Córdoba, Sucre, Vaupés) that presented limited or no information on army ant diversity. Our results provide justification for the continuous evaluation, monitoring and conservation of indigenous reserves, based not only on their demonstrated cultural and ancestral importance but to their potential to retain high biodiversity.
... Amazonian indigenous peoples inhabit areas of immense biological, linguistic, and cultural diversity (Arias et al. 2018), which means that their reserves represent areas of primary importance for conservation. Indigenous communities play an important role in mitigating the detrimental effects of deforestation throughout their development activities and conservation projects (Alcorn 1993, Zimmerman et al. 2001. ...
Full-text available
The Amazon basin is undergoing processes of global change that threaten one of the planet’s greatest sources of biological and cultural richness. Among different conservation strategies, indigenous reserves play a key role by reducing the impact of factors such as the deforestation caused by the expansion of the agricultural frontier. Despite the demonstrated importance of these indigenous territories, there is a lack of studies evaluating their contribution to the regional diversity in Colombia, especially for invertebrate groups. By surveying indigenous territories for the first time in the Amazonas department, we investigated the army ant fauna inhabiting different land use types: forests;“chagras” in different stages of regeneration; and pastures. In a sampling plot that represents just 0.0014% of the department’s area, four genera and eight army ants species were recorded, representing 47% of known species in the Amazonas department and 18% in Colombia. Our sampling revealed three new records for the department and Neivamyrmex diana as a new record for the country. Mature forests harbored six species, while the chagras system retained up to five species. Notably, when placing our results in a national context, we revealed several country regions and departments (eg, Córdoba, Sucre, Vaupés) that presented limited or no information on army ant diversity. Our results provide justification for the continuous evaluation, monitoring and conservation of indigenous reserves, based not only on their demonstrated cultural and ancestral importance but to their potential to retain high biodiversity.
... In agreement, a slightly higher number of mitochondrial haplotypes and haplotype diversity indices were found in urban than in Native populations (those corresponding to 14 Hgs against 3). In comparison to other populations from Central and Southern Brazil, and from the Amazon region, however, mitochondrial haplotype diversity was high in the Amerindian sample studied (Ward et al. 1996;Bonatto and Salzano 1997;Schmitt et al. 2004;Dornelles et al. 2005;Marrero et al. 2007;Arias et al. 2018;Tavares et al. 2019). Despite the small sample size, this pattern was similar to that observed for urban and Amerindian populations from the South of Brazil, reinforcing the role of admixed populations as reservoirs of Amerindian haplotypes (Tavares et al. 2019). ...
Full-text available
The Brazilian population is a product of asymmetric admixture among European men and Amerindian and African women. However, Brazilian subcontinental ancestry is scarcely documented, especially regarding its African roots. Here, we aimed to unveil the uniparental continental and subcontinental contributions from distinct Brazilian regions, including South (n = 43), Southeast (n = 71), the poorly genetically characterized Central-Western region (n = 323), and a subset of unique Brazilian Amerindians (n = 24), in the context of their genome-wide ancestral contributions. The overwhelming majority of European Y haplogroups (85%) contrast sharply with the predominant African and Amerindian mtDNA haplogroups (73.2%) in admixed populations, whereas in Amerindians, non-Native haplogroups could only be detected through the paternal line. Our in-depth investigation of uniparental markers showed signals of an Andean and Central-Brazilian Amerindian maternal contribution to Southeastern and Central-Western Brazil (83.1 ± 2.1% and 56.9 ± 0.2%, respectively), the last having the highest paternal Amerindian ancestry yet described for an admixed Brazilian region (9.7%) and contrasting with higher Southern-Brazilian Amerindian contribution to Southern Brazil (59.6 ± 1%). Unlike the higher African Bantu contribution previously reported for the South and Southeast, a relevant Western African non-Bantu contribution was detected in those regions (85.7 ± 5% and 71.8 ± 10.8% respectively). In contrast, a higher Bantu contribution was described for the first time in the Central-West (64.8 ± 1.3% maternal and 86.9 ± 9.6% paternal). We observed sex-biased signatures consistent with the historically recorded Brazilian colonization and added new insights in the subcontinental maternal ancestry of Brazilians from regions never studied at this level.
... Principal Component Analysis (PCA), on both mtDNA and Y-chromosome haplogroup frequencies, was computed using prcomp from the stats R package [42] with the center and scale arguments set as true. Correspondence Analysis (CA) was performed on the same haplogroup frequencies using the CA function from the FactoMineR R package [46], as in [47]. All plots were generated using the R packages ggplot2 [48] and factoextra ( ...
Full-text available
The Isthmus of Panama was a crossroads between North and South America during the continent’s first peopling (and subsequent movements) also playing a pivotal role during European colonization and the African slave trade. Previous analyses of uniparental systems revealed significant sex biases in the genetic history of Panamanians, as testified by the high proportions of Indigenous and sub-Saharan mitochondrial DNAs (mtDNAs) and by the prevalence of Western European/northern African Y chromosomes. Those studies were conducted on the general population without considering any self-reported ethnic affiliations. Here, we compared the mtDNA and Y-chromosome lineages of a new sample collection from 431 individuals (301 males and 130 females) belonging to either the general population, mixed groups, or one of five Indigenous groups currently living in Panama. We found different proportions of paternal and maternal lineages in the Indigenous groups testifying to pre-contact demographic events and genetic inputs (some dated to Pleistocene times) that created genetic structure. Then, while the local mitochondrial gene pool was marginally involved in post-contact admixtures, the Indigenous Y chromosomes were differentially replaced, mostly by lineages of western Eurasian origin. Finally, our new estimates of the sub-Saharan contribution, on a more accurately defined general population, reduce an apparent divergence between genetic and historical data.
... Finally, there is a shared cultural influence in panpipe production focused in northeast Amazonia, separated from a more central/southern core: a geographical pattern illustrating the cultural diversity in such a large area as Amazonia (Fausto, 2020;Pearce et al., 2020). Amazonia has been traditionally considered home to relatively homogeneous, albeit isolated groups, but recent archaeological and genetic work demonstrates the multilayered diversity of the region, highlighting the presence of ancient structure and complex societies (Heckenberger and Neves, 2009;Arias et al., 2018;Posth et al., 2018;Barbieri et al., 2019). The cultural areas recognized here (Murdock, 1951) are subject to debate, as outlined in Weiss (1980); a broader scheme includes the three major cultural and eco-geographical areas (grasslands, tropical forest and desert-mountain). ...
Full-text available
Musical instruments provide material evidence to study the diversity and technical innovation of music in space and time. We employed a cultural evolutionary perspective to analyse organological data and their relation to language groups and population history in South America, a unique and complex geographic area for human evolution. The ethnological and archaeological native musical instrument record, documented in three newly assembled continental databases, reveals exceptionally high diversity of wind instruments. We explored similarities in the collection of instruments for each population, considering geographic patterns and focusing on groupings associated with language families. A network analysis of panpipe organological features illustrates four regional/cultural clusters: two in the Tropical Forest and two in the Andes. Twenty-five percent of the instruments in the standard organological classification are present in the archaeological, but not in the ethnographic record, suggesting extinction events. Most recent extinctions can be traced back to European contact, causing a reduction in indigenous cultural diversity.
... CARACTERIZAÇÃO DA AMAZÔNIA BRASILEIRA A bacia amazônica representa cerca de metade das florestas tropicais que existem no planeta e sustenta a maior biodiversidade existente (Hansen et al. 2013). Além disso, a Amazônia também abriga uma miríade de sociedades humanas, com uma diversidade linguística e cultural impressionante que evoluiu em uma paisagem complexa imersa em uma extensa rede de rios (Arias et al. 2018). Ao mesmo tempo, a floresta amazônica também representa uma grande oportunidade para os governos de nove países da América do Sul que compartilham suas fronteiras em termos de desenvolvimento industrial, devido aos valiosos recursos naturais. ...
Full-text available
O atual cenário global de perda de biodiversidade, destruição das florestas tropicais e mudanças no clima tem levado as sociedades a pensar com urgência em novos caminhos de desenvolvimento que contemplem a proteção dos sistemas naturais e as demandas sociais. A Amazônia aparece com destaque nessa discussão por abrigar as maiores diversidades biológicas e culturais do mundo, responsáveis por serviços ecossistêmicos locais, regionais e globais. Neste contexto, o desenvolvimento de novos modelos de desenvolvimento local na região Amazônica é um grande desafio não apenas para o Brasil, mas para todo o mundo. Neste relatório, fizemos uma análise de conjuntura focada no Rio Juruá, um dos principais afluentes do rio Amazonas que representa uma região que vem se destacando pelas iniciativas de conservação e desenvolvimento local que combinam proteção da biodiversidade e bem-estar das populações locais. Neste material, buscamos contextualizar e discutir as principais linhas de pensamento e práticas que permeiam as iniciativas de desenvolvimento sustentável na Amazônia brasileira. Os métodos utilizados incluem entrevistas com lideranças locais, tomadores de decisão, revisão de literatura e pesquisas de campo realizadas durante os últimos 14 anos na região do Médio Juruá. Primeiramente, mostramos brevemente as raízes do problema, focando sobretudo nos modelos de governança implementados contemporaneamente nessa região riquíssima em diversidade biológica e cultural. Posteriormente, discutimos a necessidade de se pensar de forma integrada a biodiversidade e as demandas sociais e apresentamos os modelos modernos de desenvolvimento local que tem por objetivo conciliar a exploração e proteção da biodiversidade e o bem estar social das populações locais. Por fim, apresentamos a região do Médio Rio Juruá como um modelo de desenvolvimento local que podem inspirar abordagens transformativas de ocupação territorial e desenvolvimento local na Amazônia.
Objectives The aim of this study was to create a comprehensive summary of available mtDNA and Y-chromosome data for Native Americans from North, Central, and South America, including both modern and ancient DNA. To illustrate the usefulness of this dataset we present a broad picture of the genetic variation for both markers across the Americas. Methods We searched PubMed, ResearchGate, Google Scholar for studies about mtDNA or Y-chromosome variation in Native American populations, including geographic, linguistic, ecological (ecoregion), archeological and chronological information. We used AMOVA to estimate the genetic structure associated with language and ecoregion grouping and Mantel tests to evaluate the correlation between genetic and geographic distances. Results Genetic data were obtained from 321 primary sources, including 22,569 individuals from 298 contemporary populations, and 3628 individuals from 202 archeological populations. MtDNA lineages of probable non-Amerindian origin were rare, in contrast with Y-chromosome lineages. Mantel tests showed a statistically significant correlation for the whole continent considering mtDNA but not the Y-chromosome. Genetic structure between groups was always stronger for mtDNA than for the Y-chromosome. Conclusions This study summarizes decades of research conducted in Native American populations for both mtDNA and the Y-chromosome. Continental or sub-continental patterns of variation reveal that most of the genetic variation occurs within populations rather than among linguistic or ecoregional groups, and that isolation by distance is barely detectable in most population sets. The genetic structure among groups was always larger for mtDNA than for the Y-chromosome, suggesting between-sex differences in gene flow.
Full-text available
From the foods we eat and the houses we construct, to our religious practices and political organization, to who we can marry and the types of games we teach our children, the diversity of cultural practices in the world is astounding. Yet, our ability to visualize and understand this diversity is limited by the ways it has been documented and shared: on a culture-by-culture basis, in locally-told stories or difficult-to-access repositories. In this paper we introduce D-PLACE, the Database of Places, Language, Culture, and Environment. This expandable and open-access database (accessible at ) brings together a dispersed corpus of information on the geography, language, culture, and environment of over 1400 human societies. We aim to enable researchers to investigate the extent to which patterns in cultural diversity are shaped by different forces, including shared history, demographics, migration/diffusion, cultural innovations, and environmental and ecological conditions. We detail how D-PLACE helps to overcome four common barriers to understanding these forces: i) location of relevant cultural data, (ii) linking data from distinct sources using diverse ethnonyms, (iii) variable time and place foci for data, and (iv) spatial and historical dependencies among cultural groups that present challenges for analysis. D-PLACE facilitates the visualisation of relationships among cultural groups and between people and their environments, with results downloadable as tables, on a map, or on a linguistic tree. We also describe how D-PLACE can be used for exploratory, predictive, and evolutionary analyses of cultural diversity by a range of users, from members of the worldwide public interested in contrasting their own cultural practices with those of other societies, to researchers using large-scale computational phylogenetic analyses to study cultural evolution. In summary, we hope that D-PLACE will enable new lines of investigation into the major drivers of cultural change and global patterns of cultural diversity.
Full-text available
The exact timing, route, and process of the initial peopling of the Americas remains uncertain despite much research. Archaeological evidence indicates the presence of humans as far as southern Chile by 14.6 thousand years ago (ka), shortly after the Pleistocene ice sheets blocking access from eastern Beringia began to retreat. Genetic estimates of the timing and route of entry have been constrained by the lack of suitable calibration points and low genetic diversity of Native Americans. We sequenced 92 whole mitochondrial genomes from pre-Columbian South American skeletons dating from 8.6 to 0.5 ka, allowing a detailed, temporally calibrated reconstruction of the peopling of the Americas in a Bayesian coalescent analysis. The data suggest that a small population entered the Americas via a coastal route around 16.0 ka, following previous isolation in eastern Beringia for ~2.4 to 9 thousand years after separation from eastern Siberian populations. Following a rapid movement throughout the Americas, limited gene flow in South America resulted in a marked phylogeographic structure of populations, which persisted through time. All of the ancient mitochondrial lineages detected in this study were absent from modern data sets, suggesting a high extinction rate. To investigate this further, we applied a novel principal components multiple logistic regression test to Bayesian serial coalescent simulations. The analysis supported a scenario in which European colonization caused a substantial loss of pre-Columbian lineages.
Full-text available
Although Siberia was inhabited by modern humans at an early stage, there is still debate over whether it remained habitable during the extreme cold of the Last Glacial Maximum or whether it was subsequently repopulated by peoples with recent shared ancestry. Previous studies of the genetic history of Siberian populations were hampered by the extensive admixture that appears to have taken place among these populations, since commonly used methods assume a tree-like population history and at most single admixture events. Here we analyze geogenetic maps and use other approaches to distinguish the effects of shared ancestry from prehistoric migrations and contact, and develop a new method based on the covariance of ancestry components, to investigate the potentially complex admixture history. We furthermore adapt a previously devised method of admixture dating for use with multiple events of gene flow, and apply these methods to whole-genome genotype data from over 500 individuals belonging to 20 different Siberian ethnolinguistic groups. The results of these analyses indicate that there have been multiple layers of admixture detectable in most of the Siberian populations, with considerable differences in the admixture histories of individual populations. Furthermore, most of the populations of Siberia included here, even those settled far to the north, appear to have a southern origin, with the northward expansions of different populations possibly being driven partly by the advent of pastoralism, especially reindeer domestication. These newly developed methods to analyse multiple admixture events should aid in the investigation of similarly complex population histories elsewhere.
Full-text available
This paper presents a reconstruction of Proto-Tukanoan consonants and the classification of the family based on shared phonological innovations. The proposed reconstruction contrasts with previous comparative studies of the family by proposing a series of laryngealized stops (instead of voiced stops) and a different set of sounds for the alveolar and palatal points of articulation. Methodologically, it considers lexicostatistical methods to play a secondary role in the classification of languages in the Tukanoan family. It offers a detailed discussion of contact vs. inherited linguistic traits in the Tukanoan family and in the Vaupes region, well known for its multilingualism and system of linguistic exogamy, and concludes with an interpretation of the evolution of the Tukanoan family in historical terms.
Full-text available
Recent studies have expanded and refined the founding haplogroups of the Americas using whole mitochondrial (mtDNA) genome analysis. In addition to pan-American lineages, specific variants have been identified in a number of studies that show higher frequencies in restricted geographical areas. To further characterize Native American maternal lineages and specifically examine local patterns within South America, we analyzed 12 maternally unrelated Yekuana whole mtDNA genomes from one village (Sharamaña) that include the four major Native American haplogroups A2, B2, C1, and D1. Based on our results, we propose a reconfiguration of one subhaplogroup A2 (A2aa) that is specific to South America and identify other singleton branches across the four haplogroups. Furthermore, we show nucleotide diversity values that increase from north to south for haplogroups C1 and D1. The results from our work add to the growing mitogenomic data that highlight local phylogenies and support the rapid genetic diffferentiation of South American populations, which has been correlated with the linguistic diversity in the region by previous studies.