Article

Periodicity of DNA in exons.

Institute of Genetics and Bioinformatics, University of New England, Armidale, NSW, Australia.
BMC Molecular Biology (impact factor: 2.86). 09/2004; 5:12. DOI:10.1186/1471-2199-5-12 pp.12
Source: PubMed

ABSTRACT The periodic pattern of DNA in exons is a known phenomenon. It was suggested that one of the initial causes of periodicity could be the universal (RNY)npattern (R = A or G, Y = C or U, N = any base) of ancient RNA. Two major questions were addressed in this paper. Firstly, the cause of DNA periodicity, which was investigated by comparisons between real and simulated coding sequences. Secondly, quantification of DNA periodicity was made using an evolutionary algorithm, which was not previously used for such purposes.
We have shown that simulated coding sequences, which were composed using codon usage frequencies only, demonstrate DNA periodicity very similar to the observed in real exons. It was also found that DNA periodicity disappears in the simulated sequences, when the frequencies of codons become equal. Frequencies of the nucleotides (and the dinucleotide AG) at each location along phase 0 exons were calculated for C. elegans, D. melanogaster and H. sapiens. Two models were used to fit these data, with the key objective of describing periodicity. Both of the models showed that the best-fit curves closely matched the actual data points. The first dynamic period determination model consistently generated a value, which was very close to the period equal to 3 nucleotides. The second fixed period model, as expected, kept the period exactly equal to 3 and did not detract from its goodness of fit.
Conclusion can be drawn that DNA periodicity in exons is determined by codon usage frequencies. It is essential to differentiate between DNA periodicity itself, and the length of the period equal to 3. Periodicity itself is a result of certain combinations of codons with different frequencies typical for a species. The length of period equal to 3, instead, is caused by the triplet nature of genetic code. The models and evolutionary algorithm used for characterising DNA periodicity are proven to be an effective tool for describing the periodicity pattern in a species, when a number of exons in the same phase are analysed.

0 0
 · 
0 Bookmarks
 · 
39 Views
  • Source
    Article: Method to determine the reading frame of a protein from the purine/pyrimidine genome sequence and its possible evolutionary justification.
    [show abstract] [hide abstract]
    ABSTRACT: The periodic variations obtained by correlating the relative positions of purines and pyrimidines (and of the four bases thymine, cytosine, adenine, and guanine) in a wide variety of genomes of wholly or partly known sequence suggest that there may be enough of an earlier comma-free coding system (i.e., only readable in one frame) still present to permit determination of the reading frame and approximate extent of the present protein coding stretches. The characteristics of these variations support the hypothesis that these primitive messages were formed of coding triplets having the form RNY (R = purine; Y = pyrimidine; and N = purine or pyrimidine). The base sequences and reading frames that have a minimal deviation from such a message are still good predictors of actual coding regions and reading frames in spite of the many mutations that have occurred since such a genetic code was last in use. In fact, the right frame for almost all the proteins in a number of viruses and various prokaryotes and eukaryotes is deduced purely from purine/pyrimidine information and not by using the normal start and stop signals.
    Proceedings of the National Academy of Sciences 04/1981; 78(3):1596-600. · 9.68 Impact Factor
  • Article: Periodic correlations in DNA sequences and evidence suggesting their evolutionary origin in a comma-less genetic code.
    [show abstract] [hide abstract]
    ABSTRACT: Strong rhythms with a period of three bases have been seen while correlating the relative positions of purines and pyrimidines and of the four individual bases in the complete DNA sequence of the viruses phi X174, G4 and fd. Generally weaker variations of the same type have been found in the DNA virus SV40, the plasmid pBR322, the RNA virus MS2, and elsewhere in procaryotes and eucaryotes (e.g. in a ribosomal protein gene cluster of E. coli and the sea urchin histone genes). From the interrelation of four-base with purine-pyrimidine rhythms it seems that the purine-pyrimidine relationships have a basic significance. An explanation is proposed in terms of the former use of a comma-less genetic code (i.e. readable only in one frame) of the general form RNY (R = purine, Y = pyrimidine and N = purine or pyrimidine). In spite of subsequent mutation, there appears to be still enough of the primitive messages remaining to produce these periodic variations with their characteristic properties in phase and amplitude. Particularly good evidence for this hypothesis is provided by the fact that the phases for the stronger rhythms are the same in all the genomes tested and can be successfully predicted by a simple consideration of the original RNY pattern. With regard to amplitude it can be similarly foreseen which variations will be more clearly marked than others. The observed behaviour of the amplitude as the separation between correlated bases increases is also explained by the insertions, deletions and point mutations which have occurred. Additionally it is possible to account for some notable features of the non-random use of codons for the same amino acid by this theory.
    Journal of Molecular Evolution 02/1981; 17(2):94-102. · 2.27 Impact Factor
  • Source
    Article: Periodicity in DNA primary structure is defined by secondary structure of the coded protein.
    [show abstract] [hide abstract]
    ABSTRACT: A 10.5-base periodicity found earlier is inherent in both eu- and prokaryotic coding nucleotide sequences. In the case of noncoding eukaryotic sequences no periodicity is found, so the 10.5-base oscillation seemingly does not correlate with the nucleosomal organization of DNA. It is shown that the DNA fragments, coding the alpha-helical protein segments, manifest the pronounced 10.5-base periodicity, while those regions of DNA which code the beta-structure have a 6-base oscillation. The repeating pattern of nucleotide sequences can be used for comparison of the DNA segments with low degree of homology.
    Nucleic Acids Research 05/1981; 9(8):1963-71. · 8.03 Impact Factor

Full-text

View
1 Download
Available from

Keywords

3 nucleotides
 
actual data points
 
characterising DNA periodicity
 
codon usage frequencies
 
different frequencies typical
 
DNA periodicity
 
evolutionary algorithm
 
first dynamic period determination model
 
frequencies
 
genetic code
 
H. sapiens
 
initial causes
 
period equal
 
period model
 
periodicity pattern
 
phase 0 exons
 
real exons
 
simulated coding sequences
 
simulated sequences
 
triplet nature
 

Stephen T Eskesen