Sequence analysis of Potato leafroll virus isolates reveals genetic stability, major evolutionary events and differential selection pressure between overlapping reading frame products.

INRA - UMR BiO3P, Domaine de la Motte, BP 35327, 35653 Le Rheu Cedex, France.
Journal of General Virology (Impact Factor: 3.53). 08/2002; 83(Pt 7):1799-807.
Source: PubMed

ABSTRACT In order to investigate the genetic diversity of Potato leafroll virus (PLRV), seven new complete genomic sequences of isolates collected worldwide were compared with the five sequences available in GenBank. Then, a restricted polymorphic region of the genome was chosen to further analyse new sequences. The sequences of PLRV open reading frames (ORFs) 3 and 4 were also compared with those of two other poleroviruses and the non-synonymous to synonymous substitution ratio distribution was analysed in overlapping and non-overlapping regions of the genome using maximum-likelihood models. Results confirmed that PLRV sequences from around the world are very closely related and showed that the region encoding protein P0 allowed the detection of three groups of isolates. When compared to other poleroviruses, PLRV was the most conserved in both ORFs 3 and 4. However, the results suggest that important events, such as deletion, mutation at a stop codon and intraspecific homologous recombination events, have occurred during the evolution of PLRV. Finally, it was shown that the translation products of ORFs 0 and 3 are significantly more conserved than those of the overlapping ORFs 1 and 4, respectively. All together, the results allow the proposal of new hypotheses to explain the apparent genetic stability of PLRV and its evolution.

  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The phosphoprotein (P) gene of most Paramyxovirinae encodes several proteins in overlapping frames: P and V, which share a common N-terminus (PNT), and C, which overlaps PNT. Overlapping genes are of particular interest because they encode proteins originated de novo, some of which have unknown structural folds, challenging the notion that nature utilizes only a limited, well-mapped area of fold space. The C proteins cluster in three groups, comprising measles, Nipah, and Sendai virus. We predicted that all C proteins have a similar organization: a variable, disordered N-terminus and a conserved, α-helical C-terminus. We confirmed this predicted organization by biophysically characterizing recombinant C proteins from Tupaia paramyxovirus (measles group) and human parainfluenza virus 1 (Sendai group). We also found that the C of the measles and Nipah groups have statistically significant sequence similarity, indicating a common origin. Although the C of the Sendai group lack sequence similarity with them, we speculate that they also have a common origin, given their similar genomic location and structural organization. Since C is dispensable for viral replication, unlike PNT, we hypothesize that C may have originated de novo by overprinting PNT in the ancestor of Paramyxovirinae. Intriguingly, in measles virus and Nipah virus, PNT encodes STAT1-binding sites that overlap different regions of the C-terminus of C, indicating they have probably originated independently. This arrangement, in which the same genetic region encodes simultaneously a crucial functional motif (a STAT1-binding site) and a highly constrained region (the C-terminus of C), seems paradoxical, since it should severely reduce the ability of the virus to adapt. The fact that it originated twice suggests that it must be balanced by an evolutionary advantage, perhaps from reducing the size of the genetic region vulnerable to mutations.
    PLoS ONE 02/2014; 9(2):e90003. · 3.53 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Overlapping genes are two protein-coding sequences sharing a significant part of the same DNA locus in different reading frames. Although in recent times an increasing number of examples have been found in bacteria the underlying mechanisms of their evolution are unknown. In this work we explore how selective pressure in a protein-coding sequence influences its overlapping genes in alternative reading frames. We model evolution using a time-continuous Markov process and derive the corresponding model for the remaining frames to quantify selection pressure and genetic noise. Our findings lead to the presumption that, once information is embedded in the reverse reading frame -2 (relative to the mother gene in +1) purifying selection in the protein-coding reading frame automatically protects the sequences in both frames. We also found that this coincides with the fact that the genetic noise measured using the conditional entropy is minimal in frame -2 under selection in the coding frame.
    PLoS ONE 10/2014; 9(10):e108768. · 3.53 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Five Potato leafroll virus (PLRV) isolates were collected from five states representing different potato growing parts of India. The ssRNA genome sequences of these isolates were determined. The genome comprised of 5,883 nucleotides and deduced genome organization resembled other PLRV isolates. About 97.6–98.7 % similarities was observed within the Indian isolates and were more close to European, Canadian, African, American and Czech isolates (95.8–98.6 %) than to an Australian isolate (92.9–93.4 %). These isolates were 43.7–53.1 % similar to other poleroviruses and 29.1–29.3 % to Barley yellow dwarf virus, a luteovirus. Out of five isolates, the isolate PBI-6 was recombinant one as detected by RDP3 software. Multiple sequence alignment of nucleotide and amino acid sequences of different ORFs indicated that the ORF 3 and ORF 4, corresponding to coat protein and movement proteins are more conserved than other ORFs. Amino acid changes specific to Indian isolates were observed and it was more in ORF 2 than in ORF 0, ORF 3 and ORF 4. This is the first report of complete genome sequence of PLRV isolates from India, which reveals low level genetic diversity.
    Indian Journal of Virology 09/2013; · 0.36 Impact Factor