Using Time-Structured Data to Estimate Evolutionary Rates of Double-Stranded DNA Viruses

Department of Biology, The Pennsylvania State University, USA.
Molecular Biology and Evolution (Impact Factor: 9.11). 04/2010; 27(9):2038-51. DOI: 10.1093/molbev/msq088
Source: PubMed


Double-stranded (ds) DNA viruses are often described as evolving through long-term codivergent associations with their hosts, a pattern that is expected to be associated with low rates of nucleotide substitution. However, the hypothesis of codivergence between dsDNA viruses and their hosts has rarely been rigorously tested, even though the vast majority of nucleotide substitution rate estimates for dsDNA viruses are based upon this assumption. It is therefore important to estimate the evolutionary rates of dsDNA viruses independent of the assumption of host-virus codivergence. Here, we explore the use of temporally structured sequence data within a Bayesian framework to estimate the evolutionary rates for seven human dsDNA viruses, including variola virus (VARV) (the causative agent of smallpox) and herpes simplex virus-1. Our analyses reveal that although the VARV genome is likely to evolve at a rate of approximately 1 × 10^-5 substitutions/site/year and hence approaching that of many RNA viruses, the evolutionary rates of many other dsDNA viruses remain problematic to estimate. Synthetic data sets were constructed to inform our interpretation of the substitution rates estimated for these dsDNA viruses and the analysis of these demonstrated that given a sequence data set of appropriate length and sampling depth, it is possible to use time-structured analyses to estimate the substitution rates of many dsDNA viruses independently from the assumption of host-virus codivergence. Finally, the discovery that some dsDNA viruses may evolve at rates approaching those of RNA viruses has important implications for our understanding of the long-term evolutionary history and emergence potential of this major group of viruses.
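
To illustrate what "temporally structured" data contribute, a much simpler technique than the Bayesian framework used in the study is a root-to-tip regression: the genetic distance of each sequence from the tree root is regressed on its sampling date, and the slope gives a crude rate in substitutions/site/year. The sketch below is only an illustration under stated assumptions. The function name `root_to_tip_regression` and the toy dates and distances are hypothetical, not data or code from the paper.

```python
from statistics import mean


def root_to_tip_regression(dates, distances):
    """Least-squares fit of root-to-tip distance against sampling date.

    dates: sampling years; distances: substitutions/site from the root.
    Returns (slope, intercept, r), where the slope is a crude rate
    estimate in substitutions/site/year and r is the correlation
    coefficient, a rough indicator of clock-like signal.
    """
    if len(dates) != len(distances) or len(dates) < 3:
        raise ValueError("need at least three paired observations")
    mx, my = mean(dates), mean(distances)
    sxx = sum((x - mx) ** 2 for x in dates)
    syy = sum((y - my) ** 2 for y in distances)
    sxy = sum((x - mx) * (y - my) for x, y in zip(dates, distances))
    if sxx == 0 or syy == 0:
        raise ValueError("dates and distances must both vary")
    slope = sxy / sxx
    intercept = my - slope * mx
    r = sxy / (sxx * syy) ** 0.5
    return slope, intercept, r


# Hypothetical toy data: five sequences sampled a decade apart.
dates = [1950.0, 1960.0, 1970.0, 1980.0, 1990.0]
distances = [0.0100, 0.0101, 0.0102, 0.0103, 0.0104]
slope, intercept, r = root_to_tip_regression(dates, distances)
print(f"rate ~ {slope:.2e} substitutions/site/year, r = {r:.3f}")
```

A weak or negative correlation in such a plot suggests that the sampling window is too short, or the sequences too slowly evolving, for the dates alone to calibrate the rate.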



Available from: Andrew Kitchen
  • Source
    • "The phylogenetic analysis clarified the relationship between the previously described Type I and Type III [...] Fig. 3. Investigating the presence of a clock-like signal in the data. The presence of a molecular clock in the dataset was assessed by plotting the root to tip distance of isolates in the phylogeny against isolation date [53] for both the dataset as a whole (a + b) and only Type II isolates (c + d). Both had evidence of a very weak positive signal, as indicated by a low linear regression correlation coefficient (a + c)."
    ABSTRACT: Background: Mycobacterium avium subspecies paratuberculosis (Map) is an infectious enteric pathogen that causes Johne's disease in livestock. Determining genetic diversity is prerequisite to understanding the epidemiology and biology of Map. We performed the first whole genome sequencing (WGS) of 141 global Map isolates that encompass the main molecular strain types currently reported. We investigated the phylogeny of the Map strains, the diversity of the genome and the limitations of commonly used genotyping methods. Results: Single nucleotide polymorphism (SNP) and phylogenetic analyses confirmed two major lineages concordant with the former Type S and Type C designations. The Type I and Type III strain groups are subtypes of Type S, and Type B strains are a subtype of Type C and not restricted to Bison species. Conclusions: This study clarifies the phylogenetic relationships between the previously described Map strain groups, and highlights the limitations of current genotyping techniques. Map isolates exhibit restricted genetic diversity and a substitution rate consistent with a monomorphic pathogen. WGS provides the ultimate level of resolution for differentiation between strains. However, WGS alone will not be sufficient for tracing and tracking Map infections, yet importantly it can provide a phylogenetic context for affirming epidemiological connections.
    Full-text · Article · Dec 2016 · BMC Genomics
  • Source
    • "If the rate estimate from the original data set is excluded from the 95% credibility intervals of the rate estimates from at least 95% of the date-randomized replicates, the data set is deemed to contain adequate temporal structure to allow a meaningful estimate of the rate. Many, but not all, published ancient DNA and virus data sets satisfy this condition (Firth et al. 2010; Ho et al. 2011b; Duchêne et al. 2014). Modifications of the test, including the use of a more conservative criterion, were proposed recently (Duchêne et al. 2015a)."
    ABSTRACT: We are writing in response to a recent critique by Emerson & Hickerson (2015), who challenge the evidence of a time-dependent bias in molecular rate estimates. This bias takes the form of a negative relationship between inferred evolutionary rates and the ages of the calibrations on which these estimates are based. Here, we present a summary of the evidence obtained from a broad range of taxa that supports a time-dependent bias in rate estimates, with a consideration of the potential causes of these observed trends. We also describe recent progress in improving the reliability of evolutionary rate estimation and respond to the concerns raised by Emerson & Hickerson (2015) about the validity of rates estimated from time-structured sequence data. In doing so, we hope to dispel some misconceptions and to highlight several research directions that will improve our understanding of time-dependent biases in rate estimates.
    Preview · Article · Dec 2015
  • Source
    • "If more recently sampled sequences have undergone more molecular evolution, then the true sampling dates should yield a tMRCA that differs substantially from the equivalent estimates with the sampling dates randomly permuted over sequences (Ramsden, Holmes & Charleston 2009; e.g. Duffy & Holmes 2009; Firth et al. 2010; Fraile et al. 2011; Pagán & Holguín 2013; Duchêne, Holmes & Ho 2014b; Duchêne et al. 2015a). Finally, a distinct approach uses model selection and compares the fit of models with the sampling dates included or excluded, thereby failing to take special account for any evolution that might have taken place during the sampling period (Rambaut 2000; Drummond, Pybus & Rambaut 2003b; Drummond et al. 2003a; Baele et al. 2012)."
    ABSTRACT: 1. ‘Dated-tip’ methods of molecular dating use DNA sequences sampled at different times, to estimate the age of their most recent common ancestor. Several tests of ‘temporal signal’ are available to determine whether data sets are suitable for such analysis. However, it remains unclear whether these tests are reliable. 2. We investigate the performance of several tests of temporal signal, including some recently suggested modifications. We use simulated data (where the true evolutionary history is known), and whole genomes of methicillin-resistant Staphylococcus aureus (to show how particular problems arise with real-world data sets). 3. We show that all of the standard tests of temporal signal are seriously misleading for data where temporal and genetic structures are confounded (i.e. where closely related sequences are more likely to have been sampled at similar times). This is not an artefact of genetic structure or tree shape per se, and can arise even when sequences have measurably evolved during the sampling period. More positively, we show that a ‘clustered permutation’ approach introduced by Duchêne et al. (Molecular Biology and Evolution, 32, 2015, 1895) can successfully correct for this artefact in all cases and introduce techniques for implementing this method with real data sets. 4. The confounding of temporal and genetic structures may be difficult to avoid in practice, particularly for outbreaks of infectious disease, or when using ancient DNA. Therefore, we recommend the use of ‘clustered permutation’ for all analyses. The failure of the standard tests may explain why different methods of dating pathogen origins have reached such wildly different conclusions.
    Full-text · Article · Sep 2015 · Methods in Ecology and Evolution
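
The date-randomization criterion quoted in the excerpts above can be sketched in a few lines. This is a minimal illustration, assuming the rate estimate and the 95% credibility intervals of the date-randomized replicates have already been obtained from a separate dating analysis. The function names `shuffle_dates` and `has_temporal_structure` and all numbers are hypothetical. The plain shuffle shown here is the standard permutation, not the 'clustered permutation' variant.

```python
import random


def shuffle_dates(dates, rng):
    """Return a copy of the sampling dates randomly reassigned to sequences."""
    shuffled = list(dates)
    rng.shuffle(shuffled)
    return shuffled


def has_temporal_structure(observed_rate, replicate_intervals, min_fraction=0.95):
    """Apply the decision rule of the date-randomization test.

    replicate_intervals: (lower, upper) 95% credibility intervals of the
    rate, one per date-randomized replicate. The data set passes when the
    observed rate falls outside at least min_fraction of those intervals.
    """
    if not replicate_intervals:
        raise ValueError("need at least one randomized replicate")
    excluded = sum(
        1 for low, high in replicate_intervals
        if not (low <= observed_rate <= high)
    )
    return excluded / len(replicate_intervals) >= min_fraction


# Hypothetical example: 20 replicates, none overlapping the observed rate.
observed = 1e-5
clear_signal = [(1e-8, 1e-6)] * 20
# Same replicates, but two intervals now contain the observed rate.
weak_signal = [(1e-8, 1e-6)] * 18 + [(5e-6, 2e-5)] * 2

print(has_temporal_structure(observed, clear_signal))  # 20/20 excluded
print(has_temporal_structure(observed, weak_signal))   # 18/20 excluded

permuted = shuffle_dates([1950, 1960, 1970, 1980], random.Random(0))
```

In practice each replicate requires re-running the full rate analysis on a data set produced by something like `shuffle_dates`, so the decision rule itself is the cheap part of the test.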