ArticlePublisher preview available

Salmonella enterica genomes from victims of a major sixteenth-century epidemic in Mexico

Authors:
To read the full-text of this research, you can request a copy directly from the authors.

Abstract and Figures

Indigenous populations of the Americas experienced high mortality rates during the early contact period as a result of infectious diseases, many of which were introduced by Europeans. Most of the pathogenic agents that caused these outbreaks remain unknown. Through the introduction of a new metagenomic analysis tool called MALT, applied here to search for traces of ancient pathogen DNA, we were able to identify Salmonella enterica in individuals buried in an early contact era epidemic cemetery at Teposcolula-Yucundaa, Oaxaca in southern Mexico. This cemetery is linked, based on historical and archaeological evidence, to the 1545-1550 CE epidemic that affected large parts of Mexico. Locally, this epidemic was known as 'cocoliztli', the pathogenic cause of which has been debated for more than a century. Here, we present genome-wide data from ten individuals for Salmonella enterica subsp. enterica serovar Paratyphi C, a bacterial cause of enteric fever. We propose that S. Paratyphi C be considered a strong candidate for the epidemic population decline during the 1545 cocoliztli outbreak at Teposcolula-Yucundaa.
This content is subject to copyright. Terms and conditions apply.
Articles
https://doi.org/10.1038/s41559-017-0446-6
© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. © 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved.
1Max Planck Institute for the Science of Human History, Jena, Germany. 2Institute for Archaeological Sciences, University of Tübingen, Tübingen, Germany.
3Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA. 4Institute of Evolutionary Medicine, University of Zurich, Zurich,
Switzerland. 5National Institute of Anthropology and History (INAH), Mexico, Teposcolula-Yucundaa Archaeological Project, Mexico City, Mexico. 6Center
for Bioinformatics Tübingen (ZBIT), University of Tübingen, Tübingen, Germany. Present address: 7Smithsonian Conservation Biology Institute, Center for
Conservation Genomics, Washington DC, USA. Åshild J. Vågene and Alexander Herbig contributed equally to this work. *e-mail: herbig@shh.mpg.de;
tuross@fas.harvard.edu; bos@shh.mpg.de; krause@shh.mpg.de
Infectious diseases introduced to the New World following
European contact led to successive outbreaks in many regions
of the Americas that continued well into the nineteenth century.
These often caused high mortality and, therefore, contributed a
central, and often underappreciated, influence on the demographic
collapse of many indigenous populations14. Population declines
linked to regionally specific epidemics are estimated to have
reached as high as 95%3, and their genetic impact based on recent
population-based studies of ancient and modern human exome and
mitochondrial data attests to their scale5,6. One hypothesis posits
that the increased susceptibility of New World populations to Old
World diseases facilitated European conquest, whereby rapidly dis-
seminating diseases severely weakened indigenous populations2, in
some cases even before European presence in the region2,7. Well-
characterized infections, such as smallpox, measles, mumps and
influenza, are known causes of later contact era outbreaks; however,
the diseases that are responsible for many early contact period New
World epidemics remain unknown and have been the subject of sci-
entific debate for more than a century24,7,8.
Morphological changes in skeletal remains9 and ethnohistorical
accounts10 are often explored as sources for understanding popula-
tion health in the past, although both provide only limited resolution
and have generated speculative and, at times, conflicting hypotheses
about the diseases introduced to New World populations2,3,7,11,12.
Most infectious diseases do not leave characteristic markers on
the skeleton due to their short periods of infectivity, the death of
the victim in the acute phase before skeletal changes formed, or a
lack of osteological involvement9. Although historical descriptions
of infectious disease symptoms can be detailed, they are subject
to cultural biases, are affected by translational inaccuracies, lack
a foundation in germ theory and describe historical forms of a con-
dition that may differ from modern manifestations8,11. In addition,
differential diagnosis based on symptoms alone can be unreliable
even in modern contexts, as many infectious diseases have similar
clinical presentations.
Genome-wide studies of ancient pathogens have proven instru-
mental in both identifying and characterizing past human infec-
tious diseases. These studies have largely been restricted to skeletal
collections where individuals display physical changes consistent
with particular infections1315, a historical context that links a spe-
cific pathogen to a known epidemiological event16 or an organ-
ism that was identified via targeted molecular screening without
prior indication of its presence17. Recent attempts to circumvent
these limitations have concentrated on broad-spectrum molecular
approaches focused on pathogen detection via fluorescence-hybrid-
ization-based microarray technology18, identification via DNA
enrichment of certain microbial regions19 or computational screen-
ing of non-enriched sequence data against human microbiome data
sets20. These approaches offer improvements, but remain biased in
the bacterial taxa used for species-level assignments. As archaeo-
logical material is expected to harbour an abundance of bacteria
that stem from the depositional context, omission of environmental
taxa in species assignments can lead to false-positive identifications.
Additional techniques for authenticating ancient DNA have been
developed21,22, including the identification of characteristic damage
patterns caused by the deamination of cytosines23, methods that
evaluate evenness of coverage of aligned reads across a reference
genome, or length distributions that consider the degree of frag-
mentation, where ancient molecules are expected to be shorter than
those from modern contaminants24.
Salmonella enterica genomes from victims of a
major sixteenth-century epidemic in Mexico
Åshild J. Vågene1,2, Alexander Herbig 1,2*, Michael G. Campana 3,4,7, Nelly M. Robles García5,
Christina Warinner1, Susanna Sabin1, Maria A. Spyrou1,2, Aida Andrades Valtueña1, Daniel Huson6,
Noreen Tuross3*, Kirsten I. Bos1,2* and Johannes Krause 1,2*
Indigenous populations of the Americas experienced high mortality rates during the early contact period as a result of infec-
tious diseases, many of which were introduced by Europeans. Most of the pathogenic agents that caused these outbreaks
remain unknown. Through the introduction of a new metagenomic analysis tool called MALT, applied here to search for traces
of ancient pathogen DNA, we were able to identify Salmonella enterica in individuals buried in an early contact era epidemic
cemetery at Teposcolula-Yucundaa, Oaxaca in southern Mexico. This cemetery is linked, based on historical and archaeological
evidence, to the 1545–1550 
ce
epidemic that affected large parts of Mexico. Locally, this epidemic was known as cocoliztli, the
pathogenic cause of which has been debated for more than a century. Here, we present genome-wide data from ten individuals
for Salmonella enterica subsp. enterica serovar Paratyphi C, a bacterial cause of enteric fever. We propose that S. Paratyphi C be
considered a strong candidate for the epidemic population decline during the 1545 cocoliztli outbreak at Teposcolula-Yucundaa.
NATURE ECOLOGY & EVOLUTION | VOL 2 | MARCH 2018 | 520–528 | www.nature.com/natecolevol
520
Content courtesy of Springer Nature, terms of use apply. Rights reserved
... The sequencing reads that did not align to the human and wolf reference genomes were aligned to all available mitochondrial genomes (downloaded from NCBI on 14 May 2020) using MALT/v.0.4.1 (Vågene et al., 2018) and a Last Common Ancestor (LCA) analysis was performed in MEGAN/v6.19.4 (Huson et al., 2016). All species with a substantial number of matching reads (≥ 500) were considered to be "candidate species" and were subjected to further analysis to evaluate whether they were likely authentic. ...
Article
Distinguishing early domesticates from their wild progenitors presents a significant obstacle for understanding human-mediated effects in the past. The origin of dogs is particularly controversial because potential early dog remains often lack corroborating evidence that can provide secure links between proposed dog remains and human activity. The Tumat Puppies, two permafrost-preserved Late Pleistocene canids, have been hypothesized to have been littermates and early domesticates due to a physical association with putatively butchered mammoth bones. Through a combination of osteometry, stable isotope analysis, plant macrofossil analysis, and genomic and metagenomic analyses, this study exploits the unique properties of the naturally mummified Tumat Puppies to examine their familial relationship and to determine whether dietary information links them to human activities. The multifaceted analysis reveals that the 14,965-14,046 cal yr BP Tumat Puppies were littermates who inhabited a dry and relatively mild environment with heterogeneous vegetation and consumed a diverse diet, including woolly rhinoceros in their final days. However, because there is no evidence of mammoth consumption, these data do not establish a link between the canids and ancient humans.
... The sequencing reads were screened for the presence of pathogens following an inhouse pipeline [122,123] using MALT [124] v0.4.1 with a semi-global alignment mode and a minimum percent identity of 90% to align the samples against a database of 27,730 bacterial and 10,543 viral complete genomes [125,126]. ...
Article
Full-text available
Background The northern European Neolithic is characterized by two major demographic events: immigration of early farmers from Anatolia at 7500 years before present, and their admixture with local western hunter-gatherers forming late farmers, from around 6200 years before present. The influence of this admixture event on variation in the immune-relevant human leukocyte antigen (HLA) region is understudied. Results We analyzed genome-wide data of 125 individuals from seven archeological early farmer and late farmer sites located in present-day Germany. The late farmer group studied here is associated with the Wartberg culture, from around 5500–4800 years before present. We note that late farmers resulted from sex-biased admixture from male western hunter-gatherers. In addition, we observe Y-chromosome haplogroup I as the dominant lineage in late farmers, with site-specific sub-lineages. We analyze true HLA genotypes from 135 Neolithic individuals, the majority of which were produced in this study. We observe significant shifts in HLA allele frequencies from early farmers to late farmers, likely due to admixture with western hunter-gatherers. Especially for the haplotype DQB1*04:01-DRB1*08:01, there is evidence for a western hunter-gatherer origin. The HLA diversity increased from early farmers to late farmers. However, it is considerably lower than in modern populations. Conclusions Both early farmers and late farmers exhibit a relatively narrow HLA allele spectrum compared to today. This coincides with sparse traces of pathogen DNA, potentially indicating a lower pathogen pressure at the time.
... It is also possible to identify the victims of epidemic disease outbreaks based on mortuary contexts, e.g., atypical burial patterns, or associated documentary evidence (DeWitte, 2016). For example, in addition to plague burials detailed below (see list of sites in Kacki, 2019), archaeological evidence indicates that a cemetery with inconsistent burial positions and mass burials from the Mixtec site of Teposcolula Yucundaa, Mexico is associated with a historically documented disease (cocoliztli) outbreak in 1544-1550 (Warinner et al., 2012), possibly caused by Salmonella enterica (Vågene et al., 2018). It is also possible to use medical records associated with documented skeletal collections to identify people who suffered from specific diseases, such as the 1918 influenza pandemic (Wissler, 2019). ...
... Powders were processed using established techniques with both double-stranded 28 and single-stranded libraries 29 constructed for sequencing and analysis. Datasets of bulk DNA content were generated to a depth of around 5 million reads and were screened for the presence of T. pallidum DNA via analysis within the MALT/HOPS pipeline [30][31][32] (Supplementary Table 3). This process revealed four candidates for genome enrichment: an adult tibial fragment (GAP009) from the Chonos Archipelago of southern Chile, a tibia and humerus (pooled as MXV001) from an infant in Mexico City, a femur and occipital bone (pooled as RAZ007) from a child in Mexico City, and an upper molar, pelvic bone (iliac crest) and rib (pooled as DFU001) from a young adult in Deán Funes, Córdoba Province, in central Argentina (Fig. 1 Table 4). ...
Article
Full-text available
Human treponemal infections are caused by a family of closely related Treponema pallidum that give rise to the diseases yaws, bejel, pinta and, most notably, syphilis¹. Debates on a common origin for these pathogens and the history of syphilis itself have weighed evidence for the ‘Columbian hypothesis’², which argues for an American origin, against that for the ‘pre-Columbian hypothesis’³, which argues for the presence of the disease in Eurasia in the Medieval period and possibly earlier. Although molecular data has provided a genetic basis for distinction of the typed subspecies⁴, deep evolution of the complex has remained unresolved owing to limitations in the conclusions that can be drawn from the sparse palaeogenomic data that are currently available. Here we explore this evolutionary history through analyses of five pre- and peri-contact ancient treponemal genomes from the Americas that represent ancient relatives of the T. pallidum subsp. pallidum (syphilis), T. pallidum subsp. pertenue (yaws) and T. pallidum subsp. endemicum (bejel) lineages. Our data indicate unexplored diversity and an emergence of T. pallidum that post-dates human occupation in the Americas. Together, these results support an American origin for all T. pallidum characterized at the genomic level, both modern and ancient.
... For museums in many countries, the communities to whom objects originally belonged no longer exist as a group that can be consulted (Schuenemann et al. 2017;Vågene et al. 2018). This makes the museological process of decontextualization from the original setting and recontextualization into the museum irreversible. ...
Article
Full-text available
Fossils and First Nations artifacts are both physical remains that demonstrate the deep history of the Earth and its inhabitants. Modern museums have become the places where both of these kinds of natural and cultural heritage are often stored. Yet, many museums carry baggage of institutional distrust, rooted in damaging colonial practices that rely on extractive approaches to research and collection. These practices have encouraged the separation of paleontological and cultural collections and knowledge, and have often excluded First Nations voices from their exhibitions and research practices, Here, we explore the role of museums and their relationship with paleontological objects and sites in Australia and their relationship with First Nations peoples and heritage. We analyze the results of the 'Found a Fossil' survey, and describe three case studies; paleontological and cultural cross-over with dinosaur tracks in the West Kimberley region of Western Australia, the digital Museum of Stone Tools initiative, and the Rola[Stone] documentary film project, both based on Anaiwan Country in New South Wales, Australia. Acknowledging the innate duality of objects and places promotes elevated complexity in the way they are understood , studied, managed, legislated and curated. It is essential that First Nations stories, places, and objects be centered with fossils and geological materials to simultaneously tell the story of life on Earth, and in Australia, the story of the oldest living cultures in the world.
Article
Full-text available
Telemedicine has revolutionized medical care by improving the accessibility and efficiency of health services. However, its implementation presents ethical challenges related to information security, professional responsibility and patient trust. This study analyzes these challenges through a literature review of scientific articles indexed in recognized databases. The methodology used is exploratory in nature, with a qualitative approach based on the critical interpretation of the existing literature. The results show the vulnerability of medical data to cyber-attacks, the lack of training in the management of digital platforms by healthcare personnel and the absence of standardized regulations to guarantee the privacy of information. In addition, a significant reluctance on the part of patients to use telemedicine services is identified due to perceived risks to the confidentiality of their data. The discussion highlights the need to strengthen security protocols, train professionals in the ethical use of technology and develop homogeneous regulatory frameworks. In conclusion, telemedicine requires a comprehensive approach that combines technological advances with clear policies to ensure its effectiveness and acceptance.
Preprint
Full-text available
Motivation DNA damage patterns, such as increased frequencies of C>T and G>A substitutions at fragment ends, are widely used in ancient DNA studies to assess authenticity and detect contamination. In metagenomic studies, fragments can be mapped against multiple references or de novo assembled contigs to identify those likely to be ancient. Generating and comparing damage profiles to identify samples or contigs that are likely to stem from modern contaminants, however, can be both tedious and time-consuming. Although tools exist for estimating damage in single reference genomes and metagenomic datasets, none efficiently cluster damage patterns. Results To address this methodological gap, we developed AdDeam, a tool that combines rapid damage estimation with clustering for streamlined analyses and easy identification of potential contaminants or outliers. Our tool takes aligned aDNA fragments from various samples or contigs as input, computes damage patterns, clusters them, and outputs representative damage profiles per cluster, a probability of each sample pertaining to a cluster, as well as a PCA of the damage patterns for each sample for fast visualization. We evaluated AdDeam on both simulated and empirical datasets. AdDeam effectively distinguishes different damage levels, such as UDG-treated samples, sample-specific damages from specimens of different time periods, and can also distinguish between contigs containing modern or ancient fragments, providing a clear framework for aDNA authentication and facilitating large-scale analyses. The software is publicly available at https://github.com/LouisPwr/AdDeam.
Article
Whole-genome scans for natural selection signatures across Mexican indigenous populations remain underrepresented in the literature. Here, we conducted the first comparative analysis of genetic adaptation in Mexican indigenous populations using whole-genome sequencing data from 76 individuals representing 27 different ethnic groups in Mexico. We divided the cohort into northern, central, and southern populations and identified signals of natural selection within and across populations. We find evidence of adaptation to pathogenic environments in all our populations, including significant signatures in the Duffy blood group gene in central Mexican indigenous populations. Despite each region exhibiting unique local adaptation profiles, selection signatures on ARHGAP15, VGLL4, LINGO2, SYNDIG1, and TFAP2B were common to all populations. Our results also suggest that selection signatures falling within enhancers or promoters are usually connected to noncoding features, with notable exceptions like ARHGAP15 and GTDC1. This paper provides new evidence on the selection landscape of Mexican indigenous populations and lays the foundation for additional work on Mexican phenotypic characterization.
Preprint
Full-text available
Most human pathogens are of zoonotic origin. Many emerged during prehistory, coinciding with domestication providing more opportunities for spillover from original host species. However, we lack direct evidence linking past animal reservoirs and human infections. Here we present a Yersinia pestis genome recovered from a 3rd millennium BCE domesticated sheep from the Eurasian Steppe belonging to the Late Neolithic Bronze Age (LNBA) lineage, until now exclusively identified in ancient humans across Eurasia. We show that this ancient lineage underwent ancestral gene decay paralleling extant lineages, but evolved under distinct selective pressures contributing to its lack of geographic differentiation. We collect evidence supporting a scenario where the LNBA lineage, unable to efficiently transmit via fleas, spread from an unidentified reservoir to humans via sheep and likely other domesticates. Collectively, our results connect prehistoric livestock with infectious disease in humans and showcase the power of moving paleomicrobiology into the zooarcheological record.
Article
Infections have imposed strong selection pressures throughout human evolution, making the study of natural selection's effects on immunity genes highly complementary to disease-focused research. This review discusses how ancient DNA studies, which have revolutionized evolutionary genetics, increase our understanding of the evolution of human immunity. These studies have shown that interbreeding between modern humans and Neanderthals or Denisovans has influenced present-day immune responses, particularly to viruses. Additionally, ancient genomics enables the tracking of how human immunity has evolved across cultural transitions, highlighting strong selection since the Bronze Age in Europe (<4,500 years) and potential genetic adaptations to epidemics raging during the Middle Ages and the European colonization of the Americas. Furthermore, ancient genomic studies suggest that the genetic risk for noninfectious immune disorders has gradually increased over millennia because alleles associated with increased risk for autoimmunity and inflammation once conferred resistance to infections. The challenge now is to extend these findings to diverse, non-European populations and to provide a more global understanding of the evolution of human immunity.
Preprint
Full-text available
Salmonella enterica serovar Paratyphi C is the causative agent of enteric (paratyphoid) fever. While today a potentially lethal infection of humans that occurs in Africa and Asia, early 20 th century observations in Eastern Europe suggest it may once have had a wider-ranging impact on human societies. We recovered a draft Paratyphi C genome from the 800-year-old skeleton of a young woman in Trondheim, Norway, who likely died of enteric fever. Analysis of this genome against a new, significantly expanded database of related modern genomes demonstrated that Paratyphi C is descended from the ancestors of swine pathogens, serovars Choleraesuis and Typhisuis, together forming the Para C Lineage. Our results indicate that Paratyphi C has been a pathogen of humans for at least 1,000 years, and may have evolved after zoonotic transfer from swine during the Neolithic period. One Sentence Summary The combination of an 800-year-old Salmonella enterica Paratyphi C genome with genomes from extant bacteria reshapes our understanding of this pathogen’s origins and evolution.
Article
Full-text available
ELife digest Why and how have some bacteria evolved to cause illness in humans? One way to study bacterial evolution is to search for ancient samples of bacteria and use DNA sequencing technology to investigate how modern bacteria have changed from their ancestors. Understanding the evolution process may help researchers to understand how some bacteria become resistant to the antibiotics designed to kill them. Complications that occur during pregnancy, including bacterial infections, have long been a major cause of death for women. Now, Devault, Mortimer et al. have been able to sequence the DNA of bacteria found in tissue collected from a woman buried 800 years ago in a cemetery in Troy. Some of the woman’s tissues had been well preserved because they had calcified (probably as the result of infection), which preserved their structure in a mineralized layer. Two mineralized “nodules” in the body appear to be the remains of abscesses. Some of the human DNA in the nodules came from a male, suggesting that the woman was pregnant with a boy and that the abscesses formed in placental tissue. Sequencing the DNA of the bacteria in the abscess allowed Devault, Mortimer et al. to diagnose the woman’s infection, which was caused by two types of bacteria. One species, called Gardnerella vaginalis, is found in modern pregnancy-related infections. The DNA of the ancient samples was similar to that of modern bacteria. The other bacteria species was an ancient form of Staphylococcus saprophyticus, a type of bacteria that causes urinary tract infections. However, the DNA of the ancient S. saprophyticus bacteria is quite different to that of the bacteria found in modern humans. Instead, their DNA sequence appears more similar to forms of the bacteria that infect currently livestock. As humans lived closely with their livestock at the time the woman lived, her infection may be due to a type of bacteria that passed easily between humans and animals. Overall, the results suggest that the disease-causing properties of bacteria can arise from a wide range of sources. In addition, Devault, Mortimer et al. have demonstrated that certain types of tissue found in archeological remains are a potential gold mine of information about the evolution of bacteria and other microbes found in the human body. DOI: http://dx.doi.org/10.7554/eLife.20983.002
Article
Full-text available
Microbial archaeology is flourishing in the era of high-throughput sequencing, revealing the agents behind devastating historical plagues, identifying the cryptic movements of pathogens in prehistory, and reconstructing the ancestral microbiota of humans. Here, we introduce the fundamental concepts and theoretical framework of the discipline, then discuss applied methodologies for pathogen identification and microbiome characterization from archaeological samples. We give special attention to the process of identifying, validating, and authenticating ancient microbes using high-throughput DNAsequencing data. Finally, we outline standards and precautions to guide future research in the field.
Article
Full-text available
A major factor for the population decline of Native Americans after European contact has been attributed to infectious disease susceptibility. To investigate whether a pre-existing genetic component contributed to this phenomenon, here we analyse 50 exomes of a continuous population from the Northwest Coast of North America, dating from before and after European contact. We model the population collapse after European contact, inferring a 57% reduction in effective population size. We also identify signatures of positive selection on immune-related genes in the ancient but not the modern group, with the strongest signal deriving from the human leucocyte antigen (HLA) gene HLA-DQA1. The modern individuals show a marked frequency decrease in the same alleles, likely due to the environmental change associated with European colonization, whereby negative selection may have acted on the same gene after contact. The evident shift in selection pressures correlates to the regional European-borne epidemics of the 1800s.
Article
Full-text available
Unlabelled: For 100 years, it has been obvious that Salmonella enterica strains sharing the serotype with the formula 1,4,[5],12:b:1,2-now known as Paratyphi B-can cause diseases ranging from serious systemic infections to self-limiting gastroenteritis. Despite considerable predicted diversity between strains carrying the common Paratyphi B serotype, there remain few methods that subdivide the group into groups that are congruent with their disease phenotypes. Paratyphi B therefore represents one of the canonical examples in Salmonella where serotyping combined with classical microbiological tests fails to provide clinically informative information. Here, we use genomics to provide the first high-resolution view of this serotype, placing it into a wider genomic context of the Salmonella enterica species. These analyses reveal why it has been impossible to subdivide this serotype based upon phenotypic and limited molecular approaches. By examining the genomic data in detail, we are able to identify common features that correlate with strains of clinical importance. The results presented here provide new diagnostic targets, as well as posing important new questions about the basis for the invasive disease phenotype observed in a subset of strains. Importance: Salmonella enterica strains carrying the serotype Paratyphi B have long been known to possess Jekyll and Hyde characteristics; some cause gastroenteritis, while others cause serious invasive disease. Understanding what makes up the population of strains carrying this serotype, as well as the source of their invasive disease, is a 100-year-old puzzle that we address here using genomics. Our analysis provides the first high-resolution view of this serotype, placing strains carrying serotype Paratyphi B into the wider genomic context of the Salmonella enterica species. This work reveals a history of disease dating back to the middle ages, caused by a group of distinct lineages with various abilities to cause invasive disease. By quantifying the key genomic differences between the invasive and noninvasive populations, we are able to identify key virulence-related targets that can form the basis of simple, rapid, point-of-care tests.
Article
Full-text available
There is increasing interest in employing shotgun sequencing, rather than amplicon sequencing, to analyze microbiome samples. Typical projects may involve hundreds of samples and billions of sequencing reads. The comparison of such samples against a protein reference database generates billions of alignments and the analysis of such data is computationally challenging. To address this, we have substantially rewritten and extended our widely-used microbiome analysis tool MEGAN so as to facilitate the interactive analysis of the taxonomic and functional content of very large microbiome datasets. Other new features include a functional classifier called InterPro2GO, gene-centric read assembly, principal coordinate analysis of taxonomy and function, and support for metadata. The new program is called MEGAN Community Edition (CE) and is open source. By integrating MEGAN CE with our high-throughput DNA-to-protein alignment tool DIAMOND and by providing a new program MeganServer that allows access to metagenome analysis files hosted on a server, we provide a straightforward, yet powerful and complete pipeline for the analysis of metagenome shotgun sequences. We illustrate how to perform a full-scale computational analysis of a metagenomic sequencing project, involving 12 samples and 800 million reads, in less than three days on a single server. All source code is available here: https://github.com/danielhuson/megan-ce.
Article
While a comparatively young area of research, investigations relying on ancient DNA data have been highly valuable in revealing snapshots of genetic variation in both the recent and the not-so-recent past. Born out of a tradition of single-locus PCR-based approaches that often target individual species, stringent criteria for both data acquisition and analysis were introduced early to establish high standards of data quality. Today, the immense volume of data made available through next-generation sequencing has significantly increased the analytical resolution offered by processing ancient tissues and permits parallel analyses of host and microbial communities. The adoption of this new approach to data acquisition, however, requires an accompanying update on methods of DNA authentication, especially given that ancient molecules are expected to exist in low proportions in archaeological material, where an environmental signal is likely to dominate. In this review, we provide a summary of recent data authentication approaches that have been successfully used to distinguish between endogenous and nonendogenous DNA sequences in metagenomic data sets. While our discussion mostly centers on the detection of ancient human and ancient bacterial pathogen DNA, their applicability is far wider.