Thomas Martin Bataillon

Thomas Martin Bataillon
Aarhus University | AU · Center for Bioinformatik BIRC

PhD Agro Paris Tech 2000

About

171
Publications
29,957
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
6,996
Citations
Additional affiliations
September 2001 - December 2005
French National Institute for Agriculture, Food, and Environment (INRAE)
Position
  • Senior Researcher
August 2012 - July 2013
French National Centre for Scientific Research
Position
  • Sabbatical
January 2006 - March 2016
Aarhus University
Position
  • Professor (Associate)

Publications

Publications (171)
Article
Full-text available
Whether species can respond evolutionarily to current climate change is crucial for the persistence of many species. Yet, very few studies have examined genetic responses to climate change in manipulated experiments carried out in natural field conditions. We examined the evolutionary response to climate change in a common annelid worm using a cont...
Article
Full-text available
There have been a variety of approaches taken to try to characterize and identify the genetic basis of adaptation in nature, spanning theoretical models, experimental evolution studies, and direct tests of natural populations. Theoretical models can provide formalized and detailed hypotheses regarding evolutionary processes and patterns, from which...
Article
Full-text available
We study genome-wide nucleotide diversity in three subspecies of extant chimpanzees using exome capture. After strict filtering, SNVs and indels were called and genotyped for >50% of exons at a mean coverage of 35x per individual. Central chimpanzees (P. t. troglodytes) are the most polymorphic (nucleotide diversity, θw= 0.0023 per site) followed b...
Article
Full-text available
The fitness landscape - the mapping between genotypes and fitness - determines properties of the process of adaptation. Several small genetic fitness landscapes have recently been built by selecting a handful of beneficial mutations and measuring fitness of all combinations of these mutations. Here we generate several testable predictions for the p...
Article
Full-text available
The fitness landscape defines the relationship between genotypes and fitness in a given environment, and underlies fundamental quantities such as the distribution of selection coefficient, or the magnitude and type of epistasis. A better understanding of variation of landscape structure across species and environments is thus necessary to understan...
Preprint
Full-text available
Calling rare somatic variants from NGS data is more challenging than calling inherited variants, especially if the somatic variant is only present in a small fraction of the cells in the sequenced biopsy. In this case, having a good estimate of the error rate of a specific base in a particular read becomes essential. In paired-end sequencing, where...
Article
Full-text available
Plant root exudates are involved in nutrient acquisition, microbial partnerships, and inter‐organism signaling. Yet, little is known about the genetic and environmental drivers of root exudate variation at large geographical scales, which may help understand the evolutionary trajectories of plants in heterogeneous environments. We quantified natura...
Preprint
Full-text available
Homologous recombination rearranges genetic information during meiosis to generate new combinations of variants. Recombination also causes new mutations, affects the GC content of the genome and reduces selective interference. Here, we use HiFi long-read sequencing to directly detect crossover and gene conversion events from switches between the tw...
Preprint
Full-text available
Gene conversions are broadly defined as the transfer of genetic material from a 'donor' to an 'acceptor' sequence and can happen both in meiosis and mitosis. They are a subset of non-crossover events and like crossover events, gene conversion can generate new combinations of alleles, erode linkage disequilibrium, and even counteract the mutation lo...
Article
Full-text available
Estimating the distribution of fitness effects (DFE) of new mutations is of fundamental importance in evolutionary biology, ecology, and conservation. However, existing methods for DFE estimation suffer from limitations, such as slow computation speed and limited scalability. To address these issues, we introduce fastDFE, a Python-based software pa...
Preprint
Full-text available
Estimating the distribution of fitness effects (DFE) of new mutations is of fundamental importance in evolutionary biology, ecology, and conservation. However, existing methods for DFE estimation suffer from limitations, such as slow computation speed and limited scalability. To address these issues, we introduce fastDFE, a Python-based software pa...
Article
Full-text available
Noncoding DNA is central to our understanding of human gene regulation and complex diseases1,2, and measuring the evolutionary sequence constraint can establish the functional relevance of putative regulatory elements in the human genome3–9. Identifying the genomic elements that have become constrained specifically in primates has been hampered by...
Article
Personalized genome sequencing has revealed millions of genetic differences between individuals, but our understanding of their clinical relevance remains largely incomplete. To systematically decipher the effects of human genetic variants, we obtained whole-genome sequencing data for 809 individuals from 233 primate species and identified 4.3 mill...
Article
The rich diversity of morphology and behavior displayed across primate species provides an informative context in which to study the impact of genomic diversity on fundamental biological processes. Analysis of that diversity provides insight into long-standing questions in evolutionary and conservation biology and is urgent given severe threats the...
Preprint
Full-text available
Personalized genome sequencing has revealed millions of genetic differences between individuals, but our understanding of their clinical relevance remains largely incomplete. To systematically decipher the effects of human genetic variants, we obtained whole genome sequencing data for 809 individuals from 233 primate species, and identified 4.3 mil...
Preprint
The rich diversity of morphology and behavior displayed across primate species provides an informative context in which to study the impact of genomic diversity on fundamental biological processes. Analysis of that diversity provides insight into long-standing questions in evolutionary and conservation biology, and is urgent given severe threats th...
Article
Full-text available
The diversity of resistance challenges the ability of pathogens to spread and to exploit host populations. Yet, how this host diversity evolves over time remains unclear because it depends on the interplay between intraspecific competition among host genotypes and coevolution with pathogens. Here we study experimentally the effect of coevolving pha...
Article
Full-text available
The “Fitness landscape” metaphor is central to our ability to conceptualize how mutations generate new phenotypes and, in turn, variation in fitness. This metaphor has been instrumental in shaping collective mental pictures of how evolution proceeds, where the limits to innovation lie, and how adaptation emerges as a consequence of natural selectio...
Preprint
Full-text available
The two-stage sampling design provides good local estimates of both the number of plant species and the relative abundances. However, it is a problem to calculate Hill diversity indices at the local scale, because some of the species found in the large plot are not present in the small plot and such species should then incorrectly be weighted with...
Article
Full-text available
A central question in evolution is how several adaptive phenotypes are maintained within a species. Theory predicts that the genetic determination of a trait, and in particular the amounts of redundancy in the mapping of genotypes to phenotypes, mediates evolutionary outcomes of phenotypic selection. In Mediterranean wild thyme, numerous discrete c...
Article
Full-text available
Sexual dimorphisms are widespread in animals and plants, for morphological as well as physiological traits. Understanding the genetic basis of sexual dimorphism and its evolution is crucial for understanding biological differences between the sexes. Genetic variants with sex‐antagonistic effects on fitness are expected to segregate in populations a...
Article
Full-text available
Loss of habitat, eutrophication and reduced grazing intensity are known drivers of landscape-level changes in plant species composition; however, consequences of the massive decline in insect abundance are still to be understood. Pollinator decline can reduce seed set in plants relying on insects for successful reproduction. This may result in a re...
Article
Full-text available
Many quantitative traits are subject to polygenic selection, where several genomic regions undergo small, simultaneous changes in allele frequency that collectively alter a phenotype. The widespread availability of genome data, along with novel statistical techniques, has made it easier to detect these changes. We apply one such method, the “Single...
Article
Full-text available
The distribution of fitness effects (DFE) of new mutations plays a central role in molecular evolution. It is therefore crucial to be able to estimate it accurately from genomic data and to understand the factors that shape it. After a rapid overview of available methods to characterize the fitness effects of mutations, we review what is known on t...
Preprint
Full-text available
The diversity of resistance fuels host adaptation to infectious diseases and challenges the ability of pathogens to exploit host populations. Yet, how this host diversity evolves over time remains unclear because it depends on the interplay between intraspecific competition and coevolution with pathogens. Here we study the effect of a coevolving ph...
Article
Full-text available
Homologous recombination is expected to increase natural selection efficacy by decoupling the fate of beneficial and deleterious mutations and by readily creating new combinations of beneficial alleles. Here, we investigate how the proportion of amino acid substitutions fixed by adaptive evolution (α) depends on the recombination rate in bacteria....
Article
Full-text available
The Distribution of Fitness Effects (DFE) of new mutations is a key parameter of molecular evolution. The DFE can in principle be estimated by comparing the Site Frequency Spectra (SFS) of putatively neutral and functional polymorphisms. Unfortunately the DFE is intrinsically hard to estimate, especially for beneficial mutations since these tend to...
Preprint
Full-text available
Homologous recombination is expected to increase natural selection efficacy by decoupling the fate of beneficial and deleterious mutations and by readily creating new combinations of beneficial alleles. Here, we investigate how the proportion of amino acid substitutions fixed by adaptive evolution ( α ) depends on the recombination rate in bacteria...
Preprint
Full-text available
A major question in evolution is how to maintain many adaptive phenotypes within a species. In Mediterranean wild thyme, a staggering number of discrete chemical phenotypes (chemotypes) coexist in close geographic proximity. Plant chemotypes are defined by the dominant monoterpene produced in their essential oil. We study the genetics of six distin...
Article
Full-text available
About 15,000 angiosperm species (∼6%) have separate sexes, a phenomenon known as dioecy. Why dioecious taxa are so rare is still an open question. Early work reported lower species richness in dioecious compared to non-dioecious sister clades, raising the hypothesis that dioecy may be an evolutionary dead-end. This hypothesis has been recently chal...
Preprint
Full-text available
Sex dimorphisms are widespread in animals and plants, for morphological as well as physiological traits. Understanding the genetic basis of sex dimorphism and its evolution is crucial for understanding biological differences between the sexes. Genetic variants with sex-antagonistic effects on fitness are expected to segregate in populations at the...
Preprint
Full-text available
Many quantitative traits are subject to polygenic selection, where several genomic regions undergo small, simultaneous changes in allele frequency that collectively alter a phenotype. The widespread availability of genome data, along with novel statistical techniques, has made it easier to detect these changes. We apply one such method, the ‘Single...
Chapter
Full-text available
The possible evolutionary trajectories a population can follow is determined by the fitness effects of new mutations. Their relative frequencies are best specified through a distribution of fitness effects (DFE) that spans deleterious, neutral, and beneficial mutations. As such, the DFE is key to several aspects of the evolution of a population, an...
Article
Full-text available
A major research goal in evolutionary genetics is to uncover loci experiencing positive selection. One approach involves finding 'selective sweeps' patterns, which can either be 'hard sweeps' formed by de novo mutation, or 'soft sweeps' arising from recurrent mutation or existing standing variation. Existing theory generally assumes outcrossing pop...
Article
Full-text available
The importance of adaptive mutations in molecular evolution is extensively debated. Recent developments in population genomics allow inferring rates of adaptive mutations by fitting a distribution of fitness effects to the observed patterns of polymorphism and divergence at sites under selection and sites assumed to evolve neutrally. Here, we summa...
Article
Castellano et al. provide the first comparison of the full distribution of fitness effects (including deleterious, neutral but also beneficial mutations) in the great apes. The authors investigate which aspects of the full DFE are likely... The distribution of fitness effects (DFE) is central to many questions in evolutionary biology. However, litt...
Article
Distribution of fitness effects (DFE) of mutations can be inferred from site frequency spectrum (SFS) data. There is mounting interest to determine whether distinct genomic regions and/or species share a common DFE, or whether evidence exists for differences among them. polyDFEv2.0 fits multiple SFS datasets at once and provides likelihood ratio te...
Preprint
Full-text available
The distribution of fitness effects (DFE) is central to many questions in evolutionary biology. However, little is known about the differences in DFEs between closely related species. We use more than 9,000 coding genes orthologous one­to­one across great apes, gibbons, and macaques to assess the stability of the DFE across great apes. We use the u...
Article
Full-text available
Most studies on consequences of environmental change focus on evolutionary and phenotypic plastic responses, but parental effects represent an additional mechanism by which organisms respond to their local environment. Parental effects can be adaptive if they enhance offsprings ability to cope with environments experienced by their parents, but can...
Article
Full-text available
Parallel evolution, defined as identical changes arising in independent populations, is often attributed to similar selective pressures favoring the fixation of identical genetic changes. However, some level of parallel evolution is also expected if mutation rates are heterogeneous across regions of the genome. Theory suggests that mutation and sel...
Preprint
Full-text available
About 15,000 angiosperm species (∼6%) have separate sexes, a phenomenon known as dioecy. Early work reported a lower species richness in dioecious compared to non-dioecious sister clades, which was taken to suggest that dioecy might be an evolutionary dead end. More recently, phylogenetic analyses using different methodologies have challenged this...
Article
Fitness landscapes map the relationship between genotypes and fitness. However, most fitness landscape studies ignore the genetic architecture imposed by the codon table and thereby neglect the potential role of synonymous mutations. To quantify the fitness effects of synonymous mutations and their potential impact on adaptation on a fitness landsc...
Preprint
Full-text available
Distributions of fitness effects (DFE) of mutations can be inferred from site frequency spectrum (SFS) data. There is mounting interest to determine whether distinct genomic regions and/or species share a common DFE, or whether evidence exists for differences among them. polyDFEv2.0 fits multiple SFS datasets at once and provides likelihood ratio t...
Preprint
Full-text available
Fitness landscapes map the relationship between genotypes and fitness. However, most fitness landscape studies ignore the genetic architecture imposed by the codon table and thereby neglect the potential role of synonymous mutations. To quantify the fitness effects of synonymous mutations and their potential impact on adaptation on a fitness landsc...
Article
Full-text available
Estimating the proportion of adaptive substitutions (α) is of primary importance to uncover the determinants of adaptation in comparative genomic studies. Several methods have been proposed to estimate α from patterns polymorphism and divergence in coding sequences. However, estimators of α can be biased when the underlying assumptions are not met....
Preprint
Full-text available
A major research goal in evolutionary genetics is to uncover loci experiencing positive selection. One approach involves finding ‘selective sweeps’ patterns, which can either be ‘hard sweeps’ formed by de novo mutation, or ‘soft sweeps’ arising from recurrent mutation or existing standing variation. Existing theory generally assumes outcrossing pop...
Preprint
Full-text available
This preprint has been reviewed and recommended by Peer Community In Evolutionary Biology ( http://dx.doi.org/10.24072/pci.evolbiol.100045 ). Parallel evolution, defined as identical changes arising in independent populations, is often attributed to similar selective pressures favoring the fixation of identical genetic changes. However, some level...
Article
Full-text available
Gynodioecy is a sexual dimorphism where females coexist with hermaphrodite individuals. In most cases, this dimorphism involves the interaction of cytoplasmic male sterility (CMS) genes and nuclear restorer genes. Two scenarios can account for how these interactions maintain gynodioecy. Either CMS genes recurrently enter populations at low frequenc...
Article
Full-text available
The distribution of fitness effects (DFE) encompasses the fraction of deleterious, neutral and beneficial mutations. It conditions the evolutionary trajectory of populations, as well as the rate of adaptive molecular evolution (α). Inferring DFE and α from patterns of polymorphism, as given through the site frequency spectrum (SFS) and divergence d...
Article
Full-text available
Genome-wide surveys of nucleotide polymorphisms, obtained from next-generation sequencing, have uncovered numerous examples of adaptation in self-fertilizing organisms, especially regarding changes to climate, geography, and reproductive systems. Yet existing models for inferring attributes of adaptive mutations often assume idealized outcrossing p...
Article
Full-text available
Significance Our study shows that deep subseafloor sediments are populated by descendants of rare members of surface sediment microbial communities that become predominant during burial over thousands of years. We provide estimates of mutation rates and strength of purifying selection in a set of taxonomically diverse microbial populations in marin...
Article
Full-text available
Parallel evolution is the repeated evolution of the same phenotype or genotype in evolutionarily independent populations. Here, we use evolve-and-resequence experiments with bacteria and yeast to dissect the drivers of parallel evolution at the gene level. A meta-analysis shows that parallel evolution is often rare, but there is a positive relation...
Article
Full-text available
The Wright-Fisher model provides an elegant mathematical framework for understanding allele frequency data. In particular, the model can be used to infer the demographic history of species and identify loci under selection. A crucial quantity for inference under the Wright-Fisher model is the distribution of allele frequencies (DAF). Despite the ap...
Article
Full-text available
Effective population size (Ne) is a central parameter in population and conservation genetics. It measures the magnitude of genetic drift, rates of accumulation of inbreeding in a population, and it conditions the efficacy of selection. It is often assumed that a single Ne can account for the evolution of genomes. However, recent work provides indi...
Preprint
Full-text available
The distribution of fitness effects (DFE) encompasses deleterious, neutral and beneficial mutations. It conditions the evolutionary trajectory of populations, as well as the rate of adaptive molecular evolution ( α ). Inference of DFE and α from patterns of polymorphism (SFS) and divergence data has been a longstanding goal of evolutionary genetics...
Data
Table S1. Overview of number of SNPs called by two different methods: pools consisting of 10 individuals versus separate individuals. Table S2. ranking and annotation of the 10 highest ranking annotated contigs. Figure S1. Comparison of degree‐days accumulated in experimental plots in warming (T) versus ambient (A) plots. Figure S2. Individual v...
Article
Full-text available
Effective population size (Ne) is defined as the size of an idealized population undergoing the same rate of genetic drift as the population under study. It is a central concept in population genetics and used extensively in the design and monitoring of conservation and breeding programs. It is most often assumed that genetic drift effects are homo...
Article
Full-text available
The large amount and high quality of genomic data available today enables, in principle, accurate inference of evolutionary histories of observed populations. The Wright-Fisher model is one of the most widely used models for this purpose. It describes the stochastic behavior in time of allele frequencies and the influence of evolutionary pressures,...
Preprint
Full-text available
The large amount and high quality of genomic data available today enables, in principle, accurate inference of evolutionary history of observed populations. The Wright-Fisher model is one of the most widely used models for this purpose. It describes the stochastic behavior in time of allele frequencies and the influence of evolutionary pressures, s...
Article
Full-text available
The characterization of functional elements in genomes relies on the identification of the footprints of natural selection. In this quest, taking into account neutral evolutionary processes such as mutation and genetic drift is crucial because these forces can generate patterns that may obscure or mimic signatures of selection. In mammals, and prob...
Preprint
Full-text available
The characterization of functional elements in genomes relies on the identification of the footprints of natural selection. In this quest, taking into account neutral evolutionary processes such as mutation and genetic drift is crucial because these forces can generate patterns that may obscure or mimic signatures of selection. In mammals, and prob...
Article
Full-text available
The rates and properties of new mutations affecting fitness have implications for a number of outstanding questions in evolutionary biology. Obtaining estimates of mutation rates and effects has historically been challenging, and little theory has been available for predicting the distribution of fitness effects (DFE); however, there have been rece...
Article
Full-text available
Unraveling the factors that determine the rate of adaptation is a major question in evolutionary biology. One key parameter is the effect of a new mutation on fitness, which invariably depends on the environment and genetic background. The fate of a mutation also depends on population size, which determines the amount of drift it will experience. H...
Article
Studies of biological responses in the terrestrial environment to rapid changes in climate have mostly been concerned with aboveground biota, whereas less is known of belowground organisms. The present study focuses on mites and springtails of heathland ecosystems and how the microarthropod community has responded to simulated climate change in a l...
Data
Change in fitness after 120 (red symbols) and 240 generations of adaptation (black symbols).