Article
To read the full-text of this research, you can request a copy directly from the authors.

Abstract

Background: Available mitochondrial (mtDNA) data demonstrate genetic differentiation among South Slavs inhabiting the Balkan Peninsula. However, their resolution is insufficient to elucidate the female-specific aspects of the genetic history of South Slavs, including the genetic impact of various migrations which were rather common within the Balkans, a region having a turbulent demographic history. Aim: Therefore, we thoroughly studied complete mitogenomes of Serbians, a population linking westward and eastward South Slavs. Subjects and methods: Forty-six predominantly Serbian super-haplogroup U complete mitogenomes were analysed phylogenetically against ∼4000 available complete mtDNAs of modern and ancient Western Eurasians. Results: Serbians share a number of U mtDNA lineages with Southern, Eastern-Central and North-Western Europeans. Putative Balkan-specific lineages (e.g. U1a1c2, U4c1b1, U5b3j, K1a4l and K1a13a1) and lineages shared among Serbians (South Slavs) and West and East Slavs were detected (e.g. U2e1b1, U2e2a1d, U4a2a, U4a2c, U4a2g1, U4d2b and U5b1a1). Conclusion: The exceptional diversity of maternal lineages found in Serbians may be associated with the genetic impact of both autochthonous pre-Slavic Balkan populations whose mtDNA gene pool was affected by migrations of various populations over time (e.g. Bronze Age pastoralists) and Slavic and Germanic newcomers in the early Middle Ages.

No full-text available

Request Full-text Paper PDF

To read the full-text of this research,
you can request a copy directly from the authors.

... We analyzed complete mitochondrial genomes of 226 maternally unrelated individuals from the general population of Serbia, of which 170 were sequenced de novo and 56 were taken from our previous work (GenBank accession numbers KT697997-KT698032, KM096761-KM096763, and KM096765-KM096781) [18,33]. For comparative purposes, we used 3145 complete mitogenomes originating from ten Eurasian populations. ...
... Along with the assignment of haplotypes to the most recent common ancestor (MRCA) by SAM2, which is a conservative estimate suitable in forensics, we also performed haplotype assignment through reconstruction of most-parsimonious trees using mtPhyl v4.015 software (http://eltsov.org). Since this version of program does not use the updated mtDNA phylogeny available in PhyloTree build 17, the trees were modified manually according to the latest PhyloTree build 17 and the following literature [18,23,25,33,36,[51][52][53][54][55]. New subclades were defined when they comprised at least two different mitogenomes having at least one shared mutation which is not characterized as a hotspot [32]. ...
... In addition, further refinement of haplogroups found outside of the widely accepted PhyloTree build 17 were estimated with mtPhyl by updating the phylogeny according to new findings from the present study as well as those found in available literature [18,23,25,33,36,[51][52][53][54][55] (Fig. S1). As a result, 22 haplotypes were assigned to 10 new subclades, namely H1cm, H5a10, H7j, H13d, H15b3, H15b3a, H30a1, H110, V6a, and W3b2, not defined previously in PhyloTree build 17 and available literature ( Fig. S1 and Table S1). ...
Article
Full-text available
Mitochondrial genome (mtDNA) is a valuable resource in resolving various human forensic casework. The usage of variability of complete mtDNA genomes increases their discriminatory power to the maximum and enables ultimate resolution of distinct maternal lineages. However, their wider employment in forensic casework is nowadays limited by the lack of appropriate reference database. In order to fill in the gap in the reference data, which, considering Slavic-speaking populations, currently comprises only mitogenomes of East and West Slavs, we present mitogenome data for 226 Serbians, representatives of South Slavs from the Balkan Peninsula. We found 143 (sub)haplogroups among which West Eurasian ones were dominant. The percentage of unique haplotypes was 85%, and the random match probability was as low as 0.53%. We support previous findings on both high levels of genetic diversity in the Serbian population and patterns of genetic differentiation among this and ten studied European populations. However, our high-resolution data supported more pronounced genetic differentiation among Serbians and two Slavic populations (Russians and Poles) as well as expansion of the Serbian population after the Last Glacial Maximum and during the Migration period (fourth to ninth century A.D.), as inferred from the Bayesian skyline analysis. Phylogenetic analysis of haplotypes found in Serbians contributed towards the improvement of the worldwide mtDNA phylogeny, which is essential for the interpretation of the mtDNA casework.
... Our analyses further support the hypothesis of gene flow between Neanderthal and pan-European ancestral populations, with the level of introgression into the Serbian genome being within the range observed in other European populations. Previous genetic studies involving Slavic populations employed mitochondrial, Ychromosome and SNV-panel data to investigate the relationship between geographic, genetic and linguistic distances [69,70]. Consistent with this work, our analyses expand the scope beyond Slavic populations and further contribute to the understanding of human genetic variation and its geographic distribution. ...
... In contrast to studies using genotyping arrays [2,3,69,70], the availability of wholegenome sequences presents the opportunity for a high-resolution individualized analysis. To this end, we found that the sequenced genome contains a significant number of previously unobserved variants, which emphasizes the importance of continued sequencing of a large number of individuals, especially from previously uncharacterized ethnic groups. ...
Article
Full-text available
Recent genetic studies and whole-genome sequencing projects have greatly improved our understanding of human variation and clinically actionable genetic information. Smaller ethnic populations, however, remain underrepresented in both individual and large-scale sequencing efforts and hence present an opportunity to discover new variants of biomedical and demographic significance. This report describes the sequencing and analysis of a genome obtained from an individual of Serbian origin, introducing tens of thousands of previously unknown variants to the currently available pool. Ancestry analysis places this individual in close proximity to Central and Eastern European populations; i.e., closest to Croatian, Bulgarian and Hungarian individuals and, in terms of other Europeans, furthest from Ashkenazi Jewish, Spanish, Sicilian and Baltic individuals. Our analysis confirmed gene flow between Neanderthal and ancestral pan-European populations, with similar contributions to the Serbian genome as those observed in other European groups. Finally, to assess the burden of potentially disease-causing/clinically relevant variation in the sequenced genome, we utilized manually curated genotype-phenotype association databases and variant-effect predictors. We identified several variants that have previously been associated with severe early-onset disease that is not evident in the proband, as well as putatively impactful variants that could yet prove to be clinically relevant to the proband over the next decades. The presence of numerous private and low-frequency variants, along with the observed and predicted disease-causing mutations in this genome, exemplify some of the global challenges of genome interpretation, especially in the context of under-studied ethnic groups.
... Our analyses further support the hypothesis of gene flow between Neanderthal and pan-European ancestral populations, with the level of introgression into the Serbian genome being within the range observed in other European populations. Previous genetic studies involving Slavic populations employed mitochondrial, Y-chromosome and SNV-panel data to investigate the relationship between geographic, genetic and linguistic distances [20,10]. Consistent with this work, our analyses expand the scope beyond Slavic populations and further contribute to the understanding of human genetic variation and its geographic distribution. ...
... In contrast to studies using genotyping arrays [33,23,20,10], the availability of wholegenome sequences presents the opportunity for a high-resolution individualized analysis. To this end, we found that the sequenced genome contains a significant number of previously unobserved variants, which emphasizes the importance of continued sequencing of a large number of individuals, especially from smaller ethnic groups. ...
Preprint
Full-text available
Recent genetic studies and whole-genome sequencing projects have greatly improved our understanding of human variation and clinically actionable genetic information. Smaller ethnic populations, however, remain underrepresented in both individual and large-scale sequencing efforts and hence present an opportunity to discover new variants of biomedical and demographic significance. This report describes the sequencing and analysis of a genome obtained from an individual of Serbian origin, introducing tens of thousands of previously unknown variants to the currently available pool. Ancestry analysis places this individual in close proximity of the Central and Eastern European populations; i.e., closest to Croatian, Bulgarian and Hungarian individuals and, in terms of other Europeans, furthest from Ashkenazi Jewish, Spanish, Sicilian, and Baltic individuals. Our analysis confirmed gene flow between Neanderthal and ancestral pan-European populations, with similar contributions to the Serbian genome as those observed in other European groups. Finally, to assess the burden of potentially disease-causing/clinically relevant variation in the sequenced genome, we utilized manually curated genotype-phenotype association databases and variant-effect predictors. We identified several variants that have previously been associated with severe early-onset disease that is not evident in the proband, as well as variants that could yet prove to be clinically relevant to the proband over the next decades. The presence of numerous private and low-frequency variants along with the observed and predicted disease-causing mutations in this genome exemplify some of the global challenges of genome interpretation, especially in the context of understudied ethnic groups.
... As a consequence, the genetic relationships of groups of similar sequences become difficult to disentangle and categorize for broader, evolutionary inferences. Furthermore, since Phylotree was last updated back in February 2016, several haplogroups have been recently defined but not integrated into the current nomenclatural system [11][12][13][14][15]. Thus, a method that could quickly categorize new sequences at high resolution would be extremely useful for phylogenetic studies. ...
Article
Full-text available
Background We combined an unsupervised learning methodology for analyzing mitogenome sequences with maximum likelihood (ML) phylogenetics to make detailed inferences about the evolution and diversification of mitochondrial DNA (mtDNA) haplogroup U5, which appears at high frequencies in northern Europe. Methods Haplogroup U5 mitogenome sequences were gathered from GenBank. The hierarchal Bayesian Analysis of Population Structure (hierBAPS) method was used to generate groups of sequences that were then projected onto a rooted maximum likelihood (ML) phylogenetic tree to visualize the pattern of clustering. The haplogroup statuses of the individual sequences were assessed using Haplogrep2. Results A total of 23 hierBAPS groups were identified, all of which corresponded to subclades defined in Phylotree, v.17. The hierBAPS groups projected onto the ML phylogeny accurately clustered all haplotypes belonging to a specific haplogroup in accordance with Haplogrep2. By incorporating the geographic source of each sequence and subclade age estimates into this framework, inferences about the diversification of U5 mtDNAs were made. Haplogroup U5 has been present in northern Europe since the Mesolithic, and spread in both eastern and western directions, undergoing significant diversification within Scandinavia. A review of historical and archeological evidence attests to some of the population interactions contributing to this pattern. Conclusions The hierBAPS algorithm accurately grouped mitogenome sequences into subclades in a phylogenetically robust manner. This analysis provided new insights into the phylogeographic structure of haplogroup U5 diversity in northern Europe, revealing a detailed perspective on the diversity of subclades in this region and their distribution in Scandinavian populations.
... However, thus far, there have been no reports on the contemporary Serbian variome. Most of the research on Balkan populations was conducted on uniparentally inherited markers such as mitochondrial DNA (mtDNA) and the Y chromosome, focusing on population descent and haplogroup diversity [23][24][25] . One recent exception is a report that describes the sequencing and analysis of a genome from a contemporary individual of Serbian origin and which introduces tens of thousands of previously unknown variants 26 . ...
Article
Full-text available
The complete understanding of the genomic contribution to complex traits, diseases, and response to treatments, as well as genomic medicine application to the well-being of all humans will be achieved through the global variome that encompasses fine-scale genetic diversity. Despite significant efforts in recent years, uneven representation still characterizes genomic resources and among the underrepresented European populations are the Western Balkans including the Serbian population. Our research addresses this gap and presents the first ever targeted sequencing dataset of variants in clinically relevant genes. By measuring population differentiation and applying the Principal Component and Admixture analysis we demonstrated that the Serbian population differs little from other European populations, yet we identified several novel and more frequent variants that appear as its unique genetic determinants. We explored thoroughly the functional impact of frequent variants and its correlation with the health burden of the population of Serbia based on a sample of 144 individuals. Our variants catalogue improves the understanding of genetics of modern Serbia, contributes to research on ancestry, and aids in improvements of well-being and health equity. In addition, this resource may also be applicable in neighboring regions and valuable in worldwide functional analyses of genetic variants in individuals of European descent.
... Most of the research on Balkan populations was conducted on uniparentally inherited markers such as mitochondrial DNA (mtDNA) and the Y chromosome, focusing on population descent and haplogroup diversity [22][23][24]. One recent exception is a report that describes the sequencing and analysis of a genome from a contemporary individual of Serbian origin and which introduces tens of thousands of previously unknown variants [25]. ...
Preprint
Full-text available
The complete understanding of the genomic contribution to complex traits, diseases, and response to treatments, as well as genomic medicine application to the well-being of all humans will be achieved through the global variome that encompasses fine-scale genetic diversity. Despite significant efforts in recent years, uneven representation still characterizes genomic resources and among the underrepresented European populations are the Western Balkans including the Serbian population. Our research addresses this gap and presents the first ever dataset of variants in clinically relevant genes in the population sample of contemporary Serbia. A few variants significantly more frequent in the analyzed sample population compared to the European population as a whole are distinguished as its unique genetic determinants. We explored thoroughly their potential functional impact and its correlation with the health burden of the population of Serbia. Our variant's catalogue improves the understanding of genetics of modern Serbia, contributes to application of precision medicine and health equity. In addition, this resource may also be applicable in neighboring regions and in worldwide functional analyses of genetic variants in individuals of European descent.
... Europe80 . The U1a sub-haplogroup is dated at ~ 13-15 Kya and is present in Southwest and South Asia, the Caucasus, and Europe. ...
Article
Full-text available
Chuetas are a group of descendants of Majorcan Crypto-Jews (Balearic Islands, Spain) who were socially stigmatized and segregated by their Majorcan neighbours until recently; generating a community that, although after the seventeenth century no longer contained Judaic religious elements, maintained strong group cohesion, Jewishness consciousness, and endogamy. Collective memory fixed 15 surnames as a most important defining element of Chueta families. Previous studies demonstrated Chuetas were a differentiated population, with a considerable proportion of their original genetic make-up. Genetic data of Y-chromosome polymorphism and mtDNA control region showed, in Chuetas’ paternal lineages, high prevalence of haplogroups J2-M172 (33%) and J1-M267 (18%). In maternal lineages, the Chuetas hallmark is the presence of a new sub-branching of the rare haplogroup R0a2m as their modal haplogroup (21%). Genetic diversity in both Y-chromosome and mtDNA indicates the Chueta community has managed to avoid the expected heterogeneity decrease in their gene pool after centuries of isolation and inbreeding. Moreover, the composition of their uniparentally transmitted lineages demonstrates a remarkable signature of Middle Eastern ancestry—despite some degree of host admixture—confirming Chuetas have retained over the centuries a considerable degree of ancestral genetic signature along with the cultural memory of their Jewish origin.
... Haplogroup HV0, a sister clade of H, accounted for 5.5 % (including V-subclades) of the total population, while its mother clade HV encompassed 6 % of the investigated individuals. The second most frequent haplogroup in Europe and in Croatia was haplogroup U (22 %) with haplogroup U5 being the most dominant U. Its frequency in the Croatian dataset of 10 % is similar to the values obtained in other populations of SEE [18][19][20][21][22][23]. Among its subtypes, U2e was the second most common U lineage (4 %) and subhaplogroup U2e1b1 encompassed 50 % of all U2 lineages. ...
... The U4d2 haplotype is currently distributed in some geographic areas of Eurasia, being most frequent in populations of Eastern Europe (Volga Tatars and Slavic speaking groups) and in aboriginal Siberians (Yakuts, Tuvinians, Khakassians) (Supplementary Table S4). It has been suggested that U4d2 branch might be of Slavic origin due to its high incidence in Slavic-speaking populations and it might have expanded into Central Europe and Siberia from Eastern Europe 28 . This haplotype includes three ancient maternally inherited genomes, all from Hungarian Conquerors 9 . ...
Article
Full-text available
The historical province of Dobruja, located in southeastern Romania, has experienced intense human population movement, invasions, and conflictual episodes during the Middle Ages, being an important intersection point between Asia and Europe. The most informative source of maternal population histories is the complete mitochondrial genome of archaeological specimens, but currently, there is insufficient ancient DNA data available for the medieval period in this geographical region to complement the archaeological findings. In this study, we reconstructed, by using Next Generation Sequencing, the entire mitochondrial genomes (mitogenomes) of six medieval individuals neglectfully buried in a multiple burial from Capidava necropolis (Dobruja), some presenting signs of a violent death. Six distinct maternal lineages (H11a1, U4d2, J1c15, U6a1a1, T2b, and N1a3a) with different phylogenetic background were identified, pointing out the heterogeneous genetic aspect of the analyzed medieval group. Using population genetic analysis based on high-resolution mitochondrial data, we inferred the genetic affinities of the available medieval dataset from Capidava to other ancient Eurasian populations. The genetic data were integrated with the archaeological and anthropological information in order to sketch a small, local piece of the mosaic that is the image of medieval European population history.
Article
Full-text available
To investigate the uniparental genetic structure of the Punjabi population from mtDNA aspect and to set up an appropriate mtDNA forensic database, we studied maternally unrelated Punjabi (N = 100) subjects from two caste groups (i.e. Arain and Gujar) belonging to territory of Punjab. The complete control region was elucidated by Sanger sequencing and the subsequent 58 different haplotypes were designated into appropriate haplogroups according to the most recently updated mtDNA phylogeny. We found a homogenous dispersal of Eurasian haplogroup uniformity among the Punjab Province and exhibited a strong connotation with the European populations. Punjabi castes are primarily a composite of substantial South Asian, East Asian and West Eurasian lineages. Moreover, for the first time we have defined the newly sub-haplogroup M52b1 characterized by 16223 T, 16275 G and 16438 A in Gujar caste. The vast array of mtDNA variants displayed in this study suggested that the haplogroup composition radiates signals of extensive genetic conglomeration, population admixture and demographic expansion that was equipped with diverse origin, whereas matrilineal gene pool was phylogeographically homogenous across the Punjab. This context was further fully acquainted with the facts supported by PCA scatterplot that Punjabi population clustered with South Asian populations. Finally, the high power of discrimination (0.8819) and low random match probability (0.0085%) proposed a worthy contribution of mtDNA control region dataset as a forensic database that considered a gold standard of today to get deeper insight into the genetic ancestry of contemporary matrilineal phylogeny.
Article
Full-text available
Founder analysis is a method for analysis of nonrecombining DNA sequence data, with the aim of identification and dating of migrations into new territory. The method picks out founder sequence types in potential source populations and dates lineage clusters deriving from them in the settlement zone of interest. Here, using mtDNA, we apply the approach to the colonization of Europe, to estimate the proportion of modern lineages whose ancestors arrived during each major phase of settlement. To estimate the Palaeolithic and Neolithic contributions to European mtDNA diversity more accurately than was previously achievable, we have now extended the Near Eastern, European, and northern-Caucasus databases to 1,234, 2, 804, and 208 samples, respectively. Both back-migration into the source population and recurrent mutation in the source and derived populations represent major obstacles to this approach. We have developed phylogenetic criteria to take account of both these factors, and we suggest a way to account for multiple dispersals of common sequence types. We conclude that (i) there has been substantial back-migration into the Near East, (ii) the majority of extant mtDNA lineages entered Europe in several waves during the Upper Palaeolithic, (iii) there was a founder effect or bottleneck associated with the Last Glacial Maximum, 20,000 years ago, from which derives the largest fraction of surviving lineages, and (iv) the immigrant Neolithic component is likely to comprise less than one-quarter of the mtDNA pool of modern Europeans.
Article
Full-text available
Significance One of the most enduring and widely debated questions in prehistoric archaeology concerns the origins of Europe’s earliest farmers: Were they the descendants of local hunter-gatherers, or did they migrate from southwestern Asia, where farming began? We recover genome-wide DNA sequences from early farmers on both the European and Asian sides of the Aegean to reveal an unbroken chain of ancestry leading from central and southwestern Europe back to Greece and northwestern Anatolia. Our study provides the coup de grâce to the notion that farming spread into and across Europe via the dissemination of ideas but without, or with only a limited, migration of people.
Article
Full-text available
We extend the scope of European palaeogenomics by sequencing the genomes of Late Upper Palaeolithic (13,300 years old, 1.4-fold coverage) and Mesolithic (9,700 years old, 15.4-fold) males from western Georgia in the Caucasus and a Late Upper Palaeolithic (13,700 years old, 9.5-fold) male from Switzerland. While we detect Late Palaeolithic–Mesolithic genomic continuity in both regions, we find that Caucasus hunter-gatherers (CHG) belong to a distinct ancient clade that split from western hunter-gatherers B45 kya, shortly after the expansion of anatomically modern humans into Europe and from the ancestors of Neolithic farmers B25 kya, around the Last Glacial Maximum. CHG genomes significantly contributed to the Yamnaya steppe herders who migrated into Europe B3,000 BC, supporting a formative Caucasus influence on this important Early Bronze age culture. CHG left their imprint on modern populations from the Caucasus and also central and south Asia possibly marking the arrival of Indo-Aryan languages.
Article
Full-text available
The Slavic branch of the Balto-Slavic sub-family of Indo-European languages underwent rapid divergence as a result of the spatial expansion of its speakers from Central-East Europe, in early medieval times. This expansion-mainly to East Europe and the northern Balkans-resulted in the incorporation of genetic components from numerous autochthonous populations into the Slavic gene pools. Here, we characterize genetic variation in all extant ethnic groups speaking Balto-Slavic languages by analyzing mitochondrial DNA (n = 6,876), Y-chromosomes (n = 6,079) and genome-wide SNP profiles (n = 296), within the context of other European populations. We also reassess the phylogeny of Slavic languages within the Balto-Slavic branch of Indo-European. We find that genetic distances among Balto-Slavic populations, based on autosomal and Y-chromosomal loci, show a high correlation (0.9) both with each other and with geography, but a slightly lower correlation (0.7) with mitochondrial DNA and linguistic affiliation. The data suggest that genetic diversity of the present-day Slavs was predominantly shaped in situ, and we detect two different substrata: 'central-east European' for West and East Slavs, and 'south-east European' for South Slavs. A pattern of distribution of segments identical by descent between groups of East-West and South Slavs suggests shared ancestry or a modest gene flow between those two groups, which might derive from the historic spread of Slavic people.
Article
Full-text available
The Bronze Age of Eurasia (around 3000-1000 BC) was a period of major cultural changes. However, there is debate about whether these changes resulted from the circulation of ideas or from human migrations, potentially also facilitating the spread of languages and certain phenotypic traits. We investigated this by using new, improved methods to sequence low-coverage genomes from 101 ancient humans from across Eurasia. We show that the Bronze Age was a highly dynamic period involving large-scale population migrations and replacements, responsible for shaping major parts of present-day demographic structure in both Europe and Asia. Our findings are consistent with the hypothesized spread of Indo-European languages during the Early Bronze Age. We also demonstrate that light skin pigmentation in Europeans was already present at high frequency in the Bronze Age, but not lactose tolerance, indicating a more recent onset of positive selection on lactose tolerance than previously thought.
Article
Full-text available
MtDNA has been a widely used tool in human evolutionary and population genetic studies over the past three decades. Its maternal inheritance and lack of recombination have offered the opportunity to explore genealogical relationships among individuals and to study the frequency differences of matrilineal clades among human populations at continental and regional scales. The whole mtDNA genome sequencing delivers molecular resolution that is sufficient to distinguish patterns that have arisen over thousands of years. However, mutation rate is highly variable among the functional and non-coding domains of mtDNA which makes it challenging to obtain accurate split dates of the mitochondrial clades. Due to the shallow coalescent time of mitochondrial TMRCA at approximately 100 to 200 thousand years (ky), mtDNA data have only limited power to inform us about the more distant past and the early stages of human evolutionary history. The variation shared by mitochondrial genomes of individuals drawn from different continents outside Africa has been used to illuminate the details of the colonization process of the Old World, whereas regional patterns of variation have been at the focus of studies addressing questions of a more recent time scale. In the era of whole nuclear genome sequencing, mitochondrial genomes are continuing to be informative as a unique tool for the assessment of female-specific aspects of the demographic history of human populations.
Article
Full-text available
We generated genome-wide data from 69 Europeans who lived between 8,000-3,000 years ago by enriching ancient DNA libraries for a target set of almost four hundred thousand polymorphisms. Enrichment of these positions decreases the sequencing required for genome-wide ancient DNA analysis by a median of around 250-fold, allowing us to study an order of magnitude more individuals than previous studies and to obtain new insights about the past. We show that the populations of western and far eastern Europe followed opposite trajectories between 8,000-5,000 years ago. At the beginning of the Neolithic period in Europe, ~8,000-7,000 years ago, closely related groups of early farmers appeared in Germany, Hungary, and Spain, different from indigenous hunter-gatherers, whereas Russia was inhabited by a distinctive population of hunter-gatherers with high affinity to a ~24,000 year old Siberian6 . By ~6,000-5,000 years ago, a resurgence of hunter-gatherer ancestry had occurred throughout much of Europe, but in Russia, the Yamnaya steppe herders of this time were descended not only from the preceding eastern European hunter-gatherers, but from a population of Near Eastern ancestry. Western and Eastern Europe came into contact ~4,500 years ago, as the Late Neolithic Corded Ware people from Germany traced ~3/4 of their ancestry to the Yamnaya, documenting a massive migration into the heartland of Europe from its eastern periphery. This steppe ancestry persisted in all sampled central Europeans until at least ~3,000 years ago, and is ubiquitous in present-day Europeans. These results provide support for the theory of a steppe origin of at least some of the Indo-European languages of Europe.
Article
Full-text available
While numerous ancient human DNA datasets from across Europe have been published till date, modern-day Poland in particular, remains uninvestigated. Besides application in the reconstruction of continent-wide human history, data from this region would also contribute towards our understanding of the history of the Slavs, whose origin is hypothesized to be in East or Central Europe. Here, we present the first population-scale ancient human DNA study from the region of modern-day Poland by establishing mitochondrial DNA profiles for 23 samples dated to 200 BC - 500 AD (Roman Iron Age) and for 20 samples dated to 1000-1400 AD (Medieval Age). Our results show that mitochondrial DNA sequences from both periods belong to haplogroups that are characteristic of contemporary West Eurasia. Haplotype sharing analysis indicates that majority of the ancient haplotypes are widespread in some modern Europeans, including Poles. Notably, the Roman Iron Age samples share more rare haplotypes with Central and Northeast Europeans, whereas the Medieval Age samples share more rare haplotypes with East-Central and South-East Europeans, primarily Slavic populations. Our data demonstrates genetic continuity of certain matrilineages (H5a1 and N1a1a2) in the area of present-day Poland from at least the Roman Iron Age until present. As such, the maternal gene pool of present-day Poles, Czechs and Slovaks, categorized as Western Slavs, is likely to have descended from inhabitants of East-Central Europe during the Roman Iron Age.
Article
Full-text available
Background Although the genetic heritage of aboriginal Siberians is mostly of eastern Asian ancestry, a substantial western Eurasian component is observed in the majority of northern Asian populations. Traces of at least two migrations into southern Siberia, one from eastern Europe and the other from western Asia/the Caucasus have been detected previously in mitochondrial gene pools of modern Siberians.ResultsWe report here 166 new complete mitochondrial DNA (mtDNA) sequences that allow us to expand and re-analyze the available data sets of western Eurasian lineages found in northern Asian populations, define the phylogenetic status of Siberian-specific subclades and search for links between mtDNA haplotypes/subclades and events of human migrations. From a survey of 158 western Eurasian mtDNA genomes found in Siberia we estimate that nearly 40% of them most likely have western Asian and another 29% European ancestry. It is striking that 65 of northern Asian mitogenomes, i.e. ~41%, fall into 19 branches and subclades which can be considered as Siberian-specific being found so far only in Siberian populations. From the coalescence analysis it is evident that the sequence divergence of Siberian-specific subclades was relatively small, corresponding to only 0.6-9.5 kya (using the complete mtDNA rate) and 1¿6 kya (coding region rate).Conclusions The phylogeographic analysis implies that the western Eurasian founders, giving rise to Siberian specific subclades, may trace their ancestry only to the early and mid-Holocene, though some of genetic lineages may trace their ancestry back to the end of Last Glacial Maximum (LGM). We have not found the modern northern Asians to have western Eurasian genetic components of sufficient antiquity to indicate traces of pre-LGM expansions.
Article
Full-text available
Contemporary inhabitants of the Balkan Peninsula belong to several ethnic groups of diverse cultural background. In this study, three ethnic groups from Bosnia and Herzegovina - Bosniacs, Bosnian Croats and Bosnian Serbs - as well as the populations of Serbians, Croatians, Macedonians from the former Yugoslav Republic of Macedonia, Montenegrins and Kosovars have been characterized for the genetic variation of 660 000 genome-wide autosomal single nucleotide polymorphisms and for haploid markers. New autosomal data of the 70 individuals together with previously published data of 20 individuals from the populations of the Western Balkan region in a context of 695 samples of global range have been analysed. Comparison of the variation data of autosomal and haploid lineages of the studied Western Balkan populations reveals a concordance of the data in both sets and the genetic uniformity of the studied populations, especially of Western South-Slavic speakers. The genetic variation of Western Balkan populations reveals the continuity between the Middle East and Europe via the Balkan region and supports the scenario that one of the major routes of ancient gene flows and admixture went through the Balkan Peninsula.
Article
Full-text available
High mtDNA variation in Southeastern Europe (SEE) is a reflection of the turbulent and complex demographic history of this area, influenced by gene flow from various parts of Eurasia and a long history of intermixing. Our results of 1035 samples (488 from Croatia, 239 from Bosnia and 130 from Herzegovina, reported earlier, and 97 Slovenians and 81 individuals from Žumberak, reported here for the first time) show that the SEE maternal genetic diversity fits within a broader European maternal genetic landscape. The study also shows that the population of Žumberak, located in the continental part of Croatia, developed some unique mtDNA haplotypes and elevated haplogroup frequencies due to distinctive demographic history and can be considered a moderate genetic isolate. We also report seven samples from the Bosnian population and one Herzegovinian sample designated as X2* individuals that could not be assigned to any of its sublineages (X2a'o) according to the existing X2 phylogeny. In an attempt to clarify the phylogeny of our X2 samples, their mitochondrial DNA has been completely sequenced. We suppose that these lineages are signs of local microdifferentiation processes that occurred in the recent demographic past in this area and could possibly be marked as SEE-specific X2 sublineages.
Article
Full-text available
Modern genetic data combined with appropriate statistical methods have the potential to contribute substantially to our understanding of human history. We have developed an approach that exploits the genomic structure of admixed populations to date and characterize historical mixture events at fine scales. We used this to produce an atlas of worldwide human admixture history, constructed by using genetic data alone and encompassing over 100 events occurring over the past 4000 years. We identified events whose dates and participants suggest they describe genetic impacts of the Mongol empire, Arab slave trade, Bantu expansion, first millennium CE migrations in Eastern Europe, and European colonialism, as well as unrecorded events, revealing admixture to be an almost universal force shaping human populations.
Article
Full-text available
Due to its pivotal geographical location and proximity to transcontinental migratory routes, Iran has played a key role in subsequent migrations, both prehistoric and historic, between Africa, Asia and Europe. To shed light on the genetic structure of the Iranian population as well as on the expansion patterns and population movements which affected this region, the complete mitochondrial genomes of 352 Iranians were obtained. All Iranian populations studied here exhibit similarly high diversity values comparable to the other groups from the Caucasus, Anatolia and Europe. The results of AMOVA and MDS analyses did not associate any regional and/or linguistic group of populations in the Anatolia/Caucasus and Iran region pointing to close genetic positions of Persians and Qashqais to each other and to Armenians, and Azeris from Iran to Georgians. By reconstructing the complete mtDNA phylogeny of haplogroups R2, N3, U1, U3, U5a1g, U7, H13, HV2, HV12, M5a and C5c we have found a previously unexplored genetic connection between the studied Iranian populations and the Arabian Peninsula, India, Near East and Europe, likely the result of both ancient and recent gene flow. Our results for Persians and Qashqais point to a continuous increase of the population sizes from ∼24 kya to the present, although the phase between 14-24 kya is thought to be hyperarid according to the Gulf Oasis model. Since this would have affected hunter-gatherer ranges and mobility patterns and forced them to increasingly rely on coastal resources, this transition can explain the human expansion across the Persian Gulf region.
Article
Full-text available
Farming or Fishing Evidence has been mounting that most modern European populations originated from the immigration of farmers who displaced the hunter-gatherers of the Mesolithic. Bollongino et al. (p. 479 , published online 10 October) present analyses of palaeogenetic and isotopic data from Neolithic human skeletons from the Blätterhöhle burial site in Germany. The analyses identify a Neolithic freshwater fish–eating hunter-gatherer group, living contemporaneously and in close proximity to a Neolithic farming group. While there is some evidence that hunter-gatherer women may have admixed into the farming population, it appears likely that marriage or cultural boundaries between the groups persisted for over two millennia. Thus, the transition from the Mesolithic involved a more complex pattern of coexistence among humans of different genetic origins and cultures in the Neolithic, rather than a more abrupt transition.
Article
Full-text available
The processes that shaped modern European mitochondrial DNA (mtDNA) variation remain unclear. The initial peopling by Palaeolithic hunter-gatherers ~42,000 years ago and the immigration of Neolithic farmers into Europe ~8000 years ago appear to have played important roles but do not explain present-day mtDNA diversity. We generated mtDNA profiles of 364 individuals from prehistoric cultures in Central Europe to perform a chronological study, spanning the Early Neolithic to the Early Bronze Age (5500 to 1550 calibrated years before the common era). We used this transect through time to identify four marked shifts in genetic composition during the Neolithic period, revealing a key role for Late Neolithic cultures in shaping modern Central European genetic diversity.
Article
Full-text available
The origins of Ashkenazi Jews remain highly controversial. Like Judaism, mitochondrial DNA is passed along the maternal line. Its variation in the Ashkenazim is highly distinctive, with four major and numerous minor founders. However, due to their rarity in the general population, these founders have been difficult to trace to a source. Here we show that all four major founders, ~40% of Ashkenazi mtDNA variation, have ancestry in prehistoric Europe, rather than the Near East or Caucasus. Furthermore, most of the remaining minor founders share a similar deep European ancestry. Thus the great majority of Ashkenazi maternal lineages were not brought from the Levant, as commonly supposed, nor recruited in the Caucasus, as sometimes suggested, but assimilated within Europe. These results point to a significant role for the conversion of women in the formation of Ashkenazi communities, and provide the foundation for a detailed reconstruction of Ashkenazi genealogical history.
Article
Full-text available
Ethnic Belarusians make up more than 80% of the nine and half million people inhabiting the Republic of Belarus. Belarusians together with Ukrainians and Russians represent the East Slavic linguistic group, largest both in numbers and territory, inhabiting East Europe alongside Baltic-, Finno-Permic- and Turkic-speaking people. Till date, only a limited number of low resolution genetic studies have been performed on this population. Therefore, with the phylogeographic analysis of 565 Y-chromosomes and 267 mitochondrial DNAs from six well covered geographic sub-regions of Belarus we strove to complement the existing genetic profile of eastern Europeans. Our results reveal that around 80% of the paternal Belarusian gene pool is composed of R1a, I2a and N1c Y-chromosome haplogroups - a profile which is very similar to the two other eastern European populations - Ukrainians and Russians. The maternal Belarusian gene pool encompasses a full range of West Eurasian haplogroups and agrees well with the genetic structure of central-east European populations. Our data attest that latitudinal gradients characterize the variation of the uniparentally transmitted gene pools of modern Belarusians. In particular, the Y-chromosome reflects movements of people in central-east Europe, starting probably as early as the beginning of the Holocene. Furthermore, the matrilineal legacy of Belarusians retains two rare mitochondrial DNA haplogroups, N1a3 and N3, whose phylogeographies were explored in detail after de novo sequencing of 20 and 13 complete mitogenomes, respectively, from all over Eurasia. Our phylogeographic analyses reveal that two mitochondrial DNA lineages, N3 and N1a3, both of Middle Eastern origin, might mark distinct events of matrilineal gene flow to Europe: during the mid-Holocene period and around the Pleistocene-Holocene transition, respectively.
Article
Full-text available
The recent genealogical history of human populations is a complex mosaic formed by individual migration, large-scale population movements, and other demographic events. Population genomics datasets can provide a window into this recent history, as rare traces of recent shared genetic ancestry are detectable due to long segments of shared genomic material. We make use of genomic data for 2,257 Europeans (in the Population Reference Sample [POPRES] dataset) to conduct one of the first surveys of recent genealogical ancestry over the past 3,000 years at a continental scale. We detected 1.9 million shared long genomic segments, and used the lengths of these to infer the distribution of shared ancestors across time and geography. We find that a pair of modern Europeans living in neighboring populations share around 2-12 genetic common ancestors from the last 1,500 years, and upwards of 100 genetic ancestors from the previous 1,000 years. These numbers drop off exponentially with geographic distance, but since these genetic ancestors are a tiny fraction of common genealogical ancestors, individuals from opposite ends of Europe are still expected to share millions of common genealogical ancestors over the last 1,000 years. There is also substantial regional variation in the number of shared genetic ancestors. For example, there are especially high numbers of common ancestors shared between many eastern populations that date roughly to the migration period (which includes the Slavic and Hunnic expansions into that region). Some of the lowest levels of common ancestry are seen in the Italian and Iberian peninsulas, which may indicate different effects of historical population expansions in these areas and/or more stably structured populations. Population genomic datasets have considerable power to uncover recent demographic history, and will allow a much fuller picture of the close genealogical kinship of individuals across the world.
Article
Full-text available
To shed more light on the processes leading to crystallization of a Slavic identity, we investigated variability of complete mitochondrial genomes belonging to haplogroups H5 and H6 (63 mtDNA genomes) from the populations of Eastern and Western Slavs, including new samples of Poles, Ukrainians and Czechs presented here. Molecular dating implies formation of H5 approximately 11.5-16 thousand years ago (kya) in the areas of southern Europe. Within ancient haplogroup H6, dated at around 15-28 kya, there is a subhaplogroup H6c, which probably survived the last glaciation in Europe and has undergone expansion only 3-4 kya, together with the ancestors of some European groups, including the Slavs, because H6c has been detected in Czechs, Poles and Slovaks. Detailed analysis of complete mtDNAs allowed us to identify a number of lineages that seem specific for Central and Eastern Europe (H5a1f, H5a2, H5a1r, H5a1s, H5b4, H5e1a, H5u1, some subbranches of H5a1a and H6a1a9). Some of them could possibly be traced back to at least ∼4 kya, which indicates that some of the ancestors of today's Slavs (Poles, Czechs, Slovaks, Ukrainians and Russians) inhabited areas of Central and Eastern Europe much earlier than it was estimated on the basis of archaeological and historical data. We also sequenced entire mitochondrial genomes of several non-European lineages (A, C, D, G, L) found in contemporary populations of Poland and Ukraine. The analysis of these haplogroups confirms the presence of Siberian (C5c1, A8a1) and Ashkenazi-specific (L2a1l2a) mtDNA lineages in Slavic populations. Moreover, we were able to pinpoint some lineages which could possibly reflect the relatively recent contacts of Slavs with nomadic Altaic peoples (C4a1a, G2a, D5a2a1a1).
Article
Full-text available
An ongoing source of controversy in mitochondrial DNA (mtDNA) research is based on the detection of numerous errors in mtDNA profiles that led to erroneous conclusions and false disease associations. Most of these controversies could be avoided if the samples' haplogroup status would be taken into consideration. Knowing the mtDNA haplogroup affiliation is a critical prerequisite for studying mechanisms of human evolution and discovering genes involved in complex diseases, and validating phylogenetic consistency using haplogroup classification is an important step in quality control. However, despite the availability of Phylotree, a regularly updated classification tree of global mtDNA variation, the process of haplogroup classification is still time-consuming and error-prone, as researchers have to manually compare the polymorphisms found in a population sample to those summarized in Phylotree, polymorphism by polymorphism, sample by sample. We present HaploGrep, a fast, reliable and straight-forward algorithm implemented in a Web application to determine the haplogroup affiliation of thousands of mtDNA profiles genotyped for the entire mtDNA or any part of it. HaploGrep uses the latest version of Phylotree and offers an all-in-one solution for quality assessment of mtDNA profiles in clinical genetics, population genetics and forensics. HaploGrep can be accessed freely at http://haplogrep.uibk.ac.at. Hum Mutat 31:–8, 2010. © 2010 Wiley-Liss, Inc.
Article
Full-text available
Mutational events along the human mtDNA phylogeny are traditionally identified relative to the revised Cambridge Reference Sequence, a contemporary European sequence published in 1981. This historical choice is a continuous source of inconsistencies, misinterpretations, and errors in medical, forensic, and population genetic studies. Here, after having refined the human mtDNA phylogeny to an unprecedented level by adding information from 8,216 modern mitogenomes, we propose switching the reference to a Reconstructed Sapiens Reference Sequence, which was identified by considering all available mitogenomes from Homo neanderthalensis. This "Copernican" reassessment of the human mtDNA tree from its deepest root should resolve previous problems and will have a substantial practical and educational influence on the scientific and public perception of human evolution by clarifying the core principles of common ancestry for extant descendants.
Article
Full-text available
In Europe, the Neolithic transition (8,000-4,000 B.C.) from hunting and gathering to agricultural communities was one of the most important demographic events since the initial peopling of Europe by anatomically modern humans in the Upper Paleolithic (40,000 B.C.). However, the nature and speed of this transition is a matter of continuing scientific debate in archaeology, anthropology, and human population genetics. To date, inferences about the genetic make up of past populations have mostly been drawn from studies of modern-day Eurasian populations, but increasingly ancient DNA studies offer a direct view of the genetic past. We genetically characterized a population of the earliest farming culture in Central Europe, the Linear Pottery Culture (LBK; 5,500-4,900 calibrated B.C.) and used comprehensive phylogeographic and population genetic analyses to locate its origins within the broader Eurasian region, and to trace potential dispersal routes into Europe. We cloned and sequenced the mitochondrial hypervariable segment I and designed two powerful SNP multiplex PCR systems to generate new mitochondrial and Y-chromosomal data from 21 individuals from a complete LBK graveyard at Derenburg Meerenstieg II in Germany. These results considerably extend the available genetic dataset for the LBK (n = 42) and permit the first detailed genetic analysis of the earliest Neolithic culture in Central Europe (5,500-4,900 calibrated B.C.). We characterized the Neolithic mitochondrial DNA sequence diversity and geographical affinities of the early farmers using a large database of extant Western Eurasian populations (n = 23,394) and a wide range of population genetic analyses including shared haplotype analyses, principal component analyses, multidimensional scaling, geographic mapping of genetic distances, and Bayesian Serial Simcoal analyses. The results reveal that the LBK population shared an affinity with the modern-day Near East and Anatolia, supporting a major genetic input from this area during the advent of farming in Europe. However, the LBK population also showed unique genetic features including a clearly distinct distribution of mitochondrial haplogroup frequencies, confirming that major demographic events continued to take place in Europe after the early Neolithic.
Article
Full-text available
It is generally accepted that the most ancient European mitochondrial haplogroup, U5, has evolved essentially in Europe. To resolve the phylogeny of this haplogroup, we completely sequenced 113 mitochondrial genomes (79 U5a and 34 U5b) of central and eastern Europeans (Czechs, Slovaks, Poles, Russians and Belorussians), and reconstructed a detailed phylogenetic tree, that incorporates previously published data. Molecular dating suggests that the coalescence time estimate for the U5 is approximately 25-30 thousand years (ky), and approximately 16-20 and approximately 20-24 ky for its subhaplogroups U5a and U5b, respectively. Phylogeographic analysis reveals that expansions of U5 subclusters started earlier in central and southern Europe, than in eastern Europe. In addition, during the Last Glacial Maximum central Europe (probably, the Carpathian Basin) apparently represented the area of intermingling between human flows from refugial zones in the Balkans, the Mediterranean coastline and the Pyrenees. Age estimations amounting for many U5 subclusters in eastern Europeans to approximately 15 ky ago and less are consistent with the view that during the Ice Age eastern Europe was an inhospitable place for modern humans.
Article
Full-text available
Analysis of Y chromosome Y-STRs has proven to be a useful tool in the field of population genetics, especially in the case of closely related populations. We collected DNA samples from 169 males of Czech origin, 80 males of Slovakian origin, and 142 males dwelling Northern Poland. We performed Y-STR analysis of 12 loci in the samples collected (PowerPlex Y system from Promega) and compared the Y chromosome haplotype frequencies between the populations investigated. Also, we used Y-STR data available from the literature for comparison purposes. We observed significant differences between Y chromosome pools of Czechs and Slovaks compared to other Slavic and European populations. At the same time we were able to point to a specific group of Y-STR haplotypes belonging to an R1a haplogroup that seems to be shared by Slavic populations dwelling in Central Europe. The observed Y chromosome diversity may be explained by taking into consideration archeological and historical data regarding early Slav migrations.
Article
Full-text available
After the domestication of animals and crops in the Near East some 11,000 years ago, farming had reached much of central Europe by 7500 years before the present. The extent to which these early European farmers were immigrants or descendants of resident hunter-gatherers who had adopted farming has been widely debated. We compared new mitochondrial DNA (mtDNA) sequences from late European hunter-gatherer skeletons with those from early farmers and from modern Europeans. We find large genetic differences between all three groups that cannot be explained by population continuity alone. Most (82%) of the ancient hunter-gatherers share mtDNA types that are relatively rare in central Europeans today. Together, these analyses provide persuasive evidence that the first farmers were not the descendants of local hunter-gatherers but immigrated into central Europe at the onset of the Neolithic.
Article
Full-text available
Background It is customary, in population genetics studies, to consider Basques as the direct descendants of the Paleolithic Europeans. However, until now there has been no irrefutable genetic proof to support this supposition. Even studies based on mitochondrial DNA (mtDNA), an ideal molecule for constructing datable maternal genealogies, have failed to achieve this. It could be that incoming gene flow has replaced the Basque ancient lineages but it could also be that these lineages have not been detected due to a lack of resolution of the Basque mtDNA genealogies. To assess this possibility we analyzed here the mtDNA of a large sample of autochthonous Basques using mtDNA genomic sequencing for those lineages that could not be unequivocally classified by diagnostic RFLP analysis and control region (HVSI and HVSII) sequencing. Results We show that Basques have the most ancestral phylogeny in Europe for the rare mitochondrial subhaplogroup U8a. Divergence times situate the Basque origin of this lineage in the Upper Palaeolithic. Most probably, their primitive founders came from West Asia. The lack of U8a lineages in Africa points to an European and not a North African route of entrance. Phylogeographic analysis suggest that U8a had two expansion periods in Europe, the first, from a south-western area including the Iberian peninsula and Mediterranean France before 30,000 years ago, and the second, from Central Europe around 15,000–10,000 years ago. Conclusion It has been demonstrated, for the first time, that Basques show the oldest lineages in Europe for subhaplogroup U8a. Coalescence times for these lineages suggest their presence in the Basque country since the Upper Paleolithic. The European U8 phylogeography is congruent with the supposition that Basques could have participated in demographic re-expansions to repopulate central Europe in the last interglacial periods.
Article
Full-text available
There are extensive data indicating that some glacial refuge zones of southern Europe (Franco-Cantabria, Balkans, and Ukraine) were major genetic sources for the human recolonization of the continent at the beginning of the Holocene. Intriguingly, there is no genetic evidence that the refuge area located in the Italian Peninsula contributed to this process. Here we show, through phylogeographic analyses of mitochondrial DNA (mtDNA) variation performed at the highest level of molecular resolution (52 entire mitochondrial genomes), that the most likely homeland for U5b3-a haplogroup present at a very low frequency across Europe-was the Italian Peninsula. In contrast to mtDNA haplogroups that expanded from other refugia, the Holocene expansion of haplogroup U5b3 toward the North was restricted by the Alps and occurred only along the Mediterranean coasts, mainly toward nearby Provence (southern France). From there, approximately 7,000-9,000 years ago, a subclade of this haplogroup moved to Sardinia, possibly as a result of the obsidian trade that linked the two regions, leaving a distinctive signature in the modern people of the island. This scenario strikingly matches the age, distribution, and postulated geographic source of a Sardinian Y chromosome haplogroup (I2a2-M26), a paradigmatic case in the European context of a founder event marking both female and male lineages.
Article
Full-text available
For the past few years, scientific controversy has surrounded the large number of errors in forensic and literature mitochondrial DNA (mtDNA) data. However, recent research has shown that using mtDNA phylogeny and referring to known mtDNA haplotypes can be useful for checking the quality of sequence data. We developed a Web-based bioinformatics resource "mtDNAmanager" that offers a convenient interface supporting the management and quality analysis of mtDNA sequence data. The mtDNAmanager performs computations on mtDNA control-region sequences to estimate the most-probable mtDNA haplogroups and retrieves similar sequences from a selected database. By the phased designation of the most-probable haplogroups (both expected and estimated haplogroups), mtDNAmanager enables users to systematically detect errors whilst allowing for confirmation of the presence of clear key diagnostic mutations and accompanying mutations. The query tools of mtDNAmanager also facilitate database screening with two options of "match" and "include the queried nucleotide polymorphism". In addition, mtDNAmanager provides Web interfaces for users to manage and analyse their own data in batch mode. The mtDNAmanager will provide systematic routines for mtDNA sequence data management and analysis via easily accessible Web interfaces, and thus should be very useful for population, medical and forensic studies that employ mtDNA analysis. mtDNAmanager can be accessed at http://mtmanager.yonsei.ac.kr.
Article
Full-text available
A genetic perspective of human history in Europe was derived from 22 binary markers of the nonrecombining Y chromosome (NRY). Ten lineages account for >95% of the 1007 European Y chromosomes studied. Geographic distribution and age estimates of alleles are compatible with two Paleolithic and one Neolithic migratory episode that have contributed to the modern European gene pool. A significant correlation between the NRY haplotype data and principal components based on 95 protein markers was observed, indicating the effectiveness of NRY binary polymorphisms in the characterization of human population composition and history.
Article
Full-text available
The extent and nature of southeastern Europe (SEE) paternal genetic contribution to the European genetic landscape were explored based on a high-resolution Y chromosome analysis involving 681 males from seven populations in the region. Paternal lineages present in SEE were compared with previously published data from 81 western Eurasian populations and 5,017 Y chromosome samples. The finding that five major haplogroups (E3b1, I1b* (xM26), J2, R1a, and R1b) comprise more than 70% of SEE total genetic variation is consistent with the typical European Y chromosome gene pool. However, distribution of major Y chromosomal lineages and estimated expansion signals clarify the specific role of this region in structuring of European, and particularly Slavic, paternal genetic heritage. Contemporary Slavic paternal gene pool, mostly characterized by the predominance of R1a and I1b* (xM26) and scarcity of E3b1 lineages, is a result of two major prehistoric gene flows with opposite directions: the post-Last Glacial Maximum R1a expansion from east to west, the Younger Dryas-Holocene I1b* (xM26) diffusion out of SEE in addition to subsequent R1a and I1b* (xM26) putative gene flows between eastern Europe and SEE, and a rather weak extent of E3b1 diffusion toward regions nowadays occupied by Slavic-speaking populations.
Article
Understanding the genetic structure of human populations is of fundamental interest to medical, forensic and anthropological sciences. Advances in high-throughput genotyping technology have markedly improved our understanding of global patterns of human genetic variation and suggest the potential to use large samples to uncover variation among closely spaced populations. Here we characterize genetic variation in a sample of 3,000 European individuals genotyped at over half a million variable DNA sites in the human genome. Despite low average levels of genetic differentiation among Europeans, we find a close correspondence between genetic and geographic distances; indeed, a geographical map of Europe arises naturally as an efficient two-dimensional summary of genetic variation in Europeans. The results emphasize that when mapping the genetic basis of a disease phenotype, spurious associations can arise if genetic structure is not properly accounted for. In addition, the results are relevant to the prospects of genetic ancestry testing; an individual’s DNA can be used to infer their geographic origin with surprising accuracy—often to within a few hundred kilometres.
Article
How modern humans dispersed into Eurasia and Australasia, including the number of separate expansions and their timings, is highly debated [1, 2]. Two categories of models are proposed for the dispersal of non-Africans: (1) single dispersal, i.e., a single major diffusion of modern humans across Eurasia and Australasia [3-5]; and (2) multiple dispersal, i.e., additional earlier population expansions that may have contributed to the genetic diversity of some present-day humans outside of Africa [6-9]. Many variants of these models focus largely on Asia and Australasia, neglecting human dispersal into Europe, thus explaining only a subset of the entire colonization process outside of Africa [3-5, 8, 9]. The genetic diversity of the first modern humans who spread into Europe during the Late Pleistocene and the impact of subsequent climatic events on their demography are largely unknown. Here we analyze 55 complete human mitochondrial genomes (mtDNAs) of hunter-gatherers spanning ∼35,000 years of European prehistory. We unexpectedly find mtDNA lineage M in individuals prior to the Last Glacial Maximum (LGM). This lineage is absent in contemporary Europeans, although it is found at high frequency in modern Asians, Australasians, and Native Americans. Dating the most recent common ancestor of each of the modern non-African mtDNA clades reveals their single, late, and rapid dispersal less than 55,000 years ago. Demographic modeling not only indicates an LGM genetic bottleneck, but also provides surprising evidence of a major population turnover in Europe around 14,500 years ago during the Late Glacial, a period of climatic instability at the end of the Pleistocene.
Article
The analysis of corded ware and accompanying artifacts reveals the nature of its appearance across the Central and Southern Balkan Eneolithic during three cultural-chronological horizons. The first horizon corresponds to the Early Eneolithic, namely the Bubanj-Salcuta-Krivodol cultural complex (BSK), while the second corresponds to the Cotofeni culture. The third horizon, showing chronological continuity with the second, and set within the Late Eneolithic/Early Bronze Age, has a site distribution that encompasses the territory of nearly the entire Balkan Peninsula, where corded ware is found together with other steppe elements which are present in large numbers, such are burials under mounds and the appearance of the domestic horse.
Article
Although south-Slavic populations have been studied to date from various aspects, the population of Serbia, occupying the central part of the Balkan Peninsula, is still genetically understudied at least at the level of mitochondrial DNA (mtDNA) variation. We analyzed polymorphisms of the first and the second mtDNA hypervariable segments (HVS-I and HVS-II) and informative coding-region markers in 139 Serbians to shed more light on their mtDNA variability, and used available data on other Slavic and neighboring non-Slavic populations to assess their interrelations in a broader European context. The contemporary Serbian mtDNA profile is consistent with the general European maternal landscape having a substantial proportion of shared haplotypes with eastern, central, and southern European populations. Serbian population was characterized as an important link between easternmost and westernmost south-Slavic populations due to the observed lack of genetic differentiation with all other south-Slavic populations and its geographical positioning within the Balkan Peninsula. An increased heterogeneity of south Slavs, most likely mirroring turbulent demographic events within the Balkan Peninsula over time (i.e., frequent admixture and differential introgression of various gene pools), and a marked geographical stratification of Slavs to south-, east-, and west-Slavic groups, were also found. A phylogeographic analyses of 20 completely sequenced Serbian mitochondrial genomes revealed not only the presence of mtDNA lineages predominantly found within the Slavic gene pool (U4a2a*, U4a2a1, U4a2c, U4a2g, HV10), supporting a common Slavic origin, but also lineages that may have originated within the southern Europe (H5*, H5e1, H5a1v) and the Balkan Peninsula in particular (H6a2b and L2a1k). Am J Phys Anthropol 156:449–465, 2015. © 2014 Wiley Periodicals, Inc.
Article
The Encyclopedia of Indo-European Culture provides the fullest and most inclusive coverage yet compiled of the major Indo-European language stocks and their origins, and the conceptual range of the reconstructed Proto-Indo-European language. The encyclopedia also offers entries on selected archaeological cultures having some relationship to the origin and dispersal of Indo-European groups, and on some of the major issues of Indo-European cultural studies. With over 700 entries, written by seventeen leading specialists, the Encyclopedia of Indo-European Culture is an essential reference work for all scholars and students in this field. In addition, its detailed indexing and clear layout and organization will ensure that readers find it easy to use. Outstanding Academic Book
Article
Background: Recent analyses of de novo DNA mutations in modern humans have suggested a nuclear substitution rate that is approximately half that of previous estimates based on fossil calibration. This result has led to suggestions that major events in human evolution occurred far earlier than previously thought. Results: Here, we use mitochondrial genome sequences from ten securely dated ancient modern humans spanning 40,000 years as calibration points for the mitochondrial clock, thus yielding a direct estimate of the mitochondrial substitution rate. Our clock yields mitochondrial divergence times that are in agreement with earlier estimates based on calibration points derived from either fossils or archaeological material. In particular, our results imply a separation of non-Africans from the most closely related sub-Saharan African mitochondrial DNAs (haplogroup L3) that occurred less than 62-95 kya. Conclusions: Though single loci like mitochondrial DNA (mtDNA) can only provide biased estimates of population divergence times, they can provide valid upper bounds. Our results exclude most of the older dates for African and non-African population divergences recently suggested by de novo mutation rate estimates in the nuclear genome.
Article
The distribution of identical and similar (phylogenetically related) types of hypervariable segment 1 (HVS1) of the mitochondrial DNA (mtDNA) was studied in human populations belonging to three Slavonic groups and nine ethnogeographic groups of Eurasia (total sample size 2772 people). The results testified to a common origin of West, South, and East Slavs and revealed a central place of West Slavs among all Slavonic ethnic groups. Mixing was shown to play a substantial role in the formation of specific features of all three Slavonic gene pools. The mitochondrial gene pools of the Slavonic ethnic groups proved to preserve features suggesting a common ancestor for these and South European populations (especially those of the Balkan Peninsula).
Article
There is currently no calibration available for the whole human mtDNA genome, incorporating both coding and control regions. Furthermore, as several authors have pointed out recently, linear molecular clocks that incorporate selectable characters are in any case problematic. We here confirm a modest effect of purifying selection on the mtDNA coding region and propose an improved molecular clock for dating human mtDNA, based on a worldwide phylogeny of > 2000 complete mtDNA genomes and calibrating against recent evidence for the divergence time of humans and chimpanzees. We focus on a time-dependent mutation rate based on the entire mtDNA genome and supported by a neutral clock based on synonymous mutations alone. We show that the corrected rate is further corroborated by archaeological dating for the settlement of the Canary Islands and Remote Oceania and also, given certain phylogeographic assumptions, by the timing of the first modern human settlement of Europe and resettlement after the Last Glacial Maximum. The corrected rate yields an age of modern human expansion in the Americas at approximately 15 kya that-unlike the uncorrected clock-matches the archaeological evidence, but continues to indicate an out-of-Africa dispersal at around 55-70 kya, 5-20 ky before any clear archaeological record, suggesting the need for archaeological research efforts focusing on this time window. We also present improved rates for the mtDNA control region, and the first comprehensive estimates of positional mutation rates for human mtDNA, which are essential for defining mutation models in phylogenetic analyses.
Article
It is widely accepted that the ancestors of Native Americans arrived in the New World via Beringia approximately 10 to 30 thousand years ago (kya). However, the arrival time(s), number of expansion events, and migration routes into the Western Hemisphere remain controversial because linguistic, archaeological, and genetic evidence have not yet provided coherent answers. Notably, most of the genetic evidence has been acquired from the analysis of the common pan-American mitochondrial DNA (mtDNA) haplogroups. In this study, we have instead identified and analyzed mtDNAs belonging to two rare Native American haplogroups named D4h3 and X2a. Phylogeographic analyses at the highest level of molecular resolution (69 entire mitochondrial genomes) reveal that two almost concomitant paths of migration from Beringia led to the Paleo-Indian dispersal approximately 15-17 kya. Haplogroup D4h3 spread into the Americas along the Pacific coast, whereas X2a entered through the ice-free corridor between the Laurentide and Cordilleran ice sheets. The examination of an additional 276 entire mtDNA sequences provides similar entry times for all common Native American haplogroups, thus indicating at least a dual origin for Paleo- Indians. A dual origin for the first Americans is a striking novelty from the genetic point of view, and it makes plausible a scenario positing that within a rather short period of time, there may have been several entries into the Americas from a dynamically changing Beringian source. Moreover, this implies that most probably more than one language family was carried along with the Paleo-Indians.
Article
Human mitochondrial DNA is widely used as tool in many fields including evolutionary anthropology and population history, medical genetics, genetic genealogy, and forensic science. Many applications require detailed knowledge about the phylogenetic relationship of mtDNA variants. Although the phylogenetic resolution of global human mtDNA diversity has greatly improved as a result of increasing sequencing efforts of complete mtDNA genomes, an updated overall mtDNA tree is currently not available. In order to facilitate a better use of known mtDNA variation, we have constructed an updated comprehensive phylogeny of global human mtDNA variation, based on both coding- and control region mutations. This complete mtDNA tree includes previously published as well as newly identified haplogroups, is easily navigable, will be continuously and regularly updated in the future, and is online available at http://www.phylotree.org. © 2008 Wiley-Liss, Inc.
Article
The Human Genome Project, from one perspective, began in 1981 with the publication1 of the complete sequence of human mitochondrial DNA (mtDNA). The Cambridge reference sequence (CRS), as it is now designated, continues to be indispensable for studies of human evolution, population genetics and mitochondrial diseases. It has been recognized for some time, however, that the CRS differs at several sites from the mtDNA sequences obtained by other investigators2, 3. These discrepancies may reflect either true errors in the original sequencing analysis or rare polymorphisms in the CRS mtDNA. A further complication is that the original mtDNA sequence was principally derived from a single individual of European descent, although it also contained some sequences from both HeLa and bovine mtDNA (1). To resolve these uncertainties, we have completely resequenced the original placental mtDNA sample.
Article
The Eskimo-Aleut language phylum is distributed from coastal Siberia across Alaska and Canada to Greenland and is well distinguished from the neighboring Na Dene languages. Genetically, however, the distinction between Na Dene and Eskimo-Aleut speakers is less clear. In order to improve the genetic characterization of Eskimos in general and Greenlanders in particular, we have sequenced hypervariable segment I (HVS-I) of the mitochondrial DNA (mtDNA) control region and typed relevant RFLP sites in the mtDNA of 82 Eskimos from Greenland. A comparison of our data with published sequences demonstrates major mtDNA types shared between Na Dene and Eskimo, indicating a common Beringian history within the Holocene. We further confirm the presence of an Eskimo-specific mtDNA subgroup characterized by nucleotide position 16265G within mtDNA group A2. This subgroup is found in all Eskimo groups analyzed so far and is estimated to have originated <3,000 years ago. A founder analysis of all Eskimo and Chukchi A2 types indicates that the Siberian and Greenland ancestral mtDNA pools separated around the time when the Neo-Eskimo culture emerged. The Greenland mtDNA types are a subset of the Alaskan mtDNA variation: they lack the groups D2 and D3 found in Siberia and Alaska and are exclusively A2 but at the same time lack the A2 root type. The data are in agreement with the view that the present Greenland Eskimos essentially descend from Alaskan Neo-Eskimos. European mtDNA types are absent in our Eskimo sample.
Article
Forty-seven mtDNAs collected in the Dominican Republic and belonging to the African-specific haplogroup L2 were studied by high-resolution RFLP and control-region sequence analyses. Four sets of diagnostic markers that subdivide L2 into four clades (L2a-L2d) were identified, and a survey of published African data sets appears to indicate that these clades encompass all L2 mtDNAs and harbor very different geographic/ethnic distributions. One mtDNA from each of the four clades was completely sequenced by means of a new sequencing protocol that minimizes time and expense. The phylogeny of the L2 complete sequences showed that the two mtDNAs from L2b and L2d seem disproportionately derived, compared with those from L2a and L2c. This result is not consistent with a simple model of neutral evolution with a uniform molecular clock. The pattern of nonsynonymous versus synonymous substitutions hints at a role for selection in the evolution of human mtDNA. Regardless of whether selection is shaping the evolution of modern human mtDNAs, the population screening of L2 mtDNAs for the mutations identified by our complete sequence study should allow the identification of marker motifs of younger age with more restricted geographic distributions, thus providing new clues about African prehistory and the origin and relationships of African ethnic groups.
Article
To investigate which aspects of contemporary human Y-chromosome variation in Europe are characteristic of primary colonization, late-glacial expansions from refuge areas, Neolithic dispersals, or more recent events of gene flow, we have analyzed, in detail, haplogroup I (Hg I), the only major clade of the Y phylogeny that is widespread over Europe but virtually absent elsewhere. The analysis of 1,104 Hg I Y chromosomes, which were identified in the survey of 7,574 males from 60 population samples, revealed several subclades with distinct geographic distributions. Subclade I1a accounts for most of Hg I in Scandinavia, with a rapidly decreasing frequency toward both the East European Plain and the Atlantic fringe, but microsatellite diversity reveals that France could be the source region of the early spread of both I1a and the less common I1c. Also, I1b*, which extends from the eastern Adriatic to eastern Europe and declines noticeably toward the southern Balkans and abruptly toward the periphery of northern Italy, probably diffused after the Last Glacial Maximum from a homeland in eastern Europe or the Balkans. In contrast, I1b2 most likely arose in southern France/Iberia. Similarly to the other subclades, it underwent a postglacial expansion and marked the human colonization of Sardinia approximately 9,000 years ago.
Article
A set of 18 Y-chromosomal microsatellite loci was analysed in 568 males from Poland, Slovakia and three regions of Belarus. The results were compared to data available for 2,937 Y chromosome samples from 20 other Slavic populations. Lack of relationship between linguistic, geographic and historical relations between Slavic populations and Y-short tandem repeat (STR) haplotype distribution was observed. Two genetically distant groups of Slavic populations were revealed: one encompassing all Western-Slavic, Eastern-Slavic, and two Southern-Slavic populations, and one encompassing all remaining Southern Slavs. An analysis of molecular variance (AMOVA) based on Y-chromosomal STRs showed that the variation observed between the two population groups was 4.3%, and was higher than the level of genetic variance among populations within the groups (1.2%). Homogeneity of northern Slavic paternal lineages in Europe was shown to stretch from the Alps to the upper Volga and involve ethnicities speaking completely different branches of Slavic languages. The central position of the population of Ukraine in the network of insignificant AMOVA comparisons, and the lack of traces of significant contribution of ancient tribes inhabiting present-day Poland to the gene pool of Eastern and Southern Slavs, support hypothesis placing the earliest known homeland of Slavs in the middle Dnieper basin.
Article
To resolve the phylogeny of certain mitochondrial DNA (mtDNA) haplogroups in eastern Europe and estimate their evolutionary age, a total of 73 samples representing mitochondrial haplogroups U4, HV*, and R1 were selected for complete mitochondrial genome sequencing from a collection of about 2,000 control region sequences sampled in eastern (Russians, Belorussians, and Ukrainians) and western (Poles, Czechs, and Slovaks) Slavs. On the basis of whole-genome resolution, we fully characterized a number of haplogroups (HV3, HV4, U4a1, U4a2, U4a3, U4b, U4c, U4d, and R1a) that were previously described only partially. Our findings demonstrate that haplogroups HV3, HV4, and U4a1 could be traced back to the pre-Neolithic times ( approximately 12,000-19,000 years before present [YBP]) in eastern Europe. In addition, an ancient connection between the Caucasus/Europe and India has been revealed by analysis of haplogroup R1 diversity, with a split between the Indian and Caucasus/European R1a lineages occurring about 16,500 years ago. Meanwhile, some mtDNA subgroups detected in Slavs (such as U4a2a, U4a2*, HV3a, and R1a1) are definitely younger being dated between 6,400 and 8,200 YBP. However, robust age estimations appear to be problematic due to the high ratios of nonsynonymous to synonymous substitutions found in young mtDNA subclusters.