Using Time-Structured Data to Estimate Evolutionary Rates of Double-Stranded DNA Viruses

Department of Biology, The Pennsylvania State University, USA.
Molecular Biology and Evolution (Impact Factor: 9.11). 04/2010; 27(9):2038-51. DOI: 10.1093/molbev/msq088
Source: PubMed


Double-stranded (ds) DNA viruses are often described as evolving through long-term codivergent associations with their hosts, a pattern that is expected to be associated with low rates of nucleotide substitution. However, the hypothesis of codivergence between dsDNA viruses and their hosts has rarely been rigorously tested, even though the vast majority of nucleotide substitution rate estimates for dsDNA viruses are based upon this assumption. It is therefore important to estimate the evolutionary rates of dsDNA viruses independent of the assumption of host-virus codivergence. Here, we explore the use of temporally structured sequence data within a Bayesian framework to estimate the evolutionary rates for seven human dsDNA viruses, including variola virus (VARV) (the causative agent of smallpox) and herpes simplex virus-1. Our analyses reveal that although the VARV genome is likely to evolve at a rate of approximately 1 x 10(-5) substitutions/site/year and hence approaching that of many RNA viruses, the evolutionary rates of many other dsDNA viruses remain problematic to estimate. Synthetic data sets were constructed to inform our interpretation of the substitution rates estimated for these dsDNA viruses and the analysis of these demonstrated that given a sequence data set of appropriate length and sampling depth, it is possible to use time-structured analyses to estimate the substitution rates of many dsDNA viruses independently from the assumption of host-virus codivergence. Finally, the discovery that some dsDNA viruses may evolve at rates approaching those of RNA viruses has important implications for our understanding of the long-term evolutionary history and emergence potential of this major group of viruses.

Download full-text


Available from: Andrew Kitchen,
  • Source
    • "If more recently sampled sequences have undergone more molecular evolution, then the true sampling dates should yield a t MRCA that differs substantially from the equivalent estimates with the sampling dates randomly permuted over sequences (Ramsden, Holmes & Charleston 2009; e.g. Duffy & Holmes 2009; Firth et al. 2010; Fraile et al. 2011; Pag an & Holgu ın 2013; Duch^ ene, Holmes & Ho 2014b; Duch^ ene et al. 2015a). Finally, a distinct approach uses model selection and compares the fit of models with the sampling dates included or excluded, thereby failing to take special account for any evolution that might have taken place during the sampling period (Rambaut 2000; Drummond, Pybus & Rambaut 2003b; Drummond et al. 2003a; Baele et al. 2012). "
    [Show abstract] [Hide abstract]
    ABSTRACT: 1. ‘Dated-tip’ methods of molecular dating use DNA sequences sampled at different times, to estimate the age of their most recent common ancestor. Several tests of ‘temporal signal’ are available to determine whether data sets are suitable for such analysis. However, it remains unclear whether these tests are reliable. 2. We investigate the performance of several tests of temporal signal, including some recently suggested modifi- cations. We use simulated data (where the true evolutionary history is known), and whole genomes of methicillin-resistant Staphylococcus aureus (to show how particular problems arise with real-world data sets). 3. We show that all of the standard tests of temporal signal are seriously misleading for data where temporal and genetic structures are confounded (i.e. where closely related sequences are more likely to have been sampled at similar times). This is not an artefact of genetic structure or tree shape per se, and can arise even when sequences have measurably evolved during the sampling period. More positively, we show that a ‘clustered permutation’ approach introduced by Duchêne et al. (Molecular Biology and Evolution, 32, 2015, 1895) can successfully correct for this artefact in all cases and introduce techniques for implementing this method with real data sets. 4. The confounding of temporal and genetic structures may be difficult to avoid in practice, particularly for outbreaks of infectious disease, or when using ancient DNA. Therefore, we recommend the use of ‘clustered permutation’ for all analyses. The failure of the standard tests may explain why different methods of dating pathogen origins have reached such wildly different conclusions.
    Methods in Ecology and Evolution 09/2015; DOI:10.1111/2041-210X.12466 · 6.55 Impact Factor
  • Source
    • "), even if taxonomic classifications could not be based solely on the time of divergence. In addition, when Firth et al. tried to account for the specificity of double stranded DNA genomes in evolution calculations, their most robust virus model (with the different clock and demographic models tested) was Variola virus, suggesting that heterochronous phylogenetic modeling may be used for poxviruses evolution calculations (Firth et al., 2010 "
    [Show abstract] [Hide abstract]
    ABSTRACT: Avipoxviruses are divided into three clades: canarypox-like viruses, fowlpox-like viruses, and psittacinepox-like viruses. Several molecular clock and demographic models available in the BEAST package were compared on three avipoxvirus genes (P4b, cnpv186 and DNA polymerase genes), which enabled to determine that avipoxviruses evolved at a rate of 2-8×10(-5)substitution/site/year, in the range of poxviruses previously reported evolution rates. In addition, the date of mean time of divergence of avipoxviruses from a common ancestor was extrapolated to be about 10,000-30,000years ago, at the same period as modern poxvirus species. Our findings will facilitate epidemiological investigations on avipoxviruses' spread, origin and circulation. Copyright © 2015. Published by Elsevier B.V.
    Infection, genetics and evolution: journal of molecular epidemiology and evolutionary genetics in infectious diseases 07/2015; 35. DOI:10.1016/j.meegid.2015.07.031 · 3.02 Impact Factor
  • Source
    • "by chance alone, we repeated the root-to-tip analysis 1000 times with the tip dates of the isolates randomly permutated each time (Firth et al. 2010). In all cases, the data with random permutations gave lower R 2 values than the real data, so that we could reject our null hypothesis at the 0.001 level and accept the alternative hypothesis that the real data contains significant temporal signal. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Strangles, the most frequently diagnosed infectious disease of horses worldwide, is caused by Streptococcus equi. Despite its prevalence, the global diversity and mechanisms underlying the evolution of S. equi as a host-restricted pathogen remain poorly understood. Here we define the global population structure of this important pathogen and reveal a population replacement in the late 19th or early 20th century. Our data reveal a dynamic genome that continues to mutate and decay, but also to amplify and acquire genes despite the organism having lost its natural competence and become host-restricted. The lifestyle of S. equi within the horse is defined by short-term acute disease, strangles, followed by long-term carriage. Population analysis reveals evidence of convergent evolution in isolates from post-acute disease samples, as a result of niche adaptation to persistent carriage within a host. Mutations that lead to metabolic streamlining and the loss of virulence determinants are more frequently found in carriage isolates, suggesting that the pathogenic potential of S. equi reduces as a consequence of long term residency within the horse post-acute disease. An example of this is the deletion of the equibactin siderophore locus that is associated with iron acquisition, which occurs exclusively in carrier isolates, and renders S. equi significantly less able to cause acute disease in the natural host. We identify several loci that may similarly be required for the full virulence of S. equi, directing future research towards the development of new vaccines against this host-restricted pathogen. Published by Cold Spring Harbor Laboratory Press.
    Genome Research 07/2015; 25(9). DOI:10.1101/gr.189803.115 · 14.63 Impact Factor
Show more