A periodic pattern of mRNA secondary structure created by the genetic code

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
Nucleic Acids Research (Impact Factor: 9.11). 02/2006; 34(8):2428-37. DOI: 10.1093/nar/gkl287
Source: PubMed


Single-stranded mRNA molecules form secondary structures through complementary self-interactions. Several hypotheses have been proposed on the relationship between the nucleotide sequence, encoded amino acid sequence and mRNA secondary structure. We performed the first transcriptome-wide in silico analysis of the human and mouse mRNA foldings and found a pronounced periodic pattern of nucleotide involvement in mRNA secondary structure. We show that this pattern is created by the structure of the genetic code, and the dinucleotide relative abundances are important for the maintenance of mRNA secondary structure. Although synonymous codon usage contributes to this pattern, it is intrinsic to the structure of the genetic code and manifests itself even in the absence of synonymous codon usage bias at the 4-fold degenerate sites. While all codon sites are important for the maintenance of mRNA secondary structure, degeneracy of the code allows regulation of stability and periodicity of mRNA secondary structure. We demonstrate that the third degenerate codon sites contribute most strongly to mRNA stability. These results convincingly support the hypothesis that redundancies in the genetic code allow transcripts to satisfy requirements for both protein structure and RNA structure. Our data show that selection may be operating on synonymous codons to maintain a more stable and ordered mRNA secondary structure, which is likely to be important for transcript stability and translation. We also demonstrate that functional domains of the mRNA [5'-untranslated region (5'-UTR), CDS and 3'-UTR] preferentially fold onto themselves, while the start codon and stop codon regions are characterized by relaxed secondary structures, which may facilitate initiation and termination of translation.

Download full-text


Available from: Nikolay A Spiridonov,
40 Reads
  • Source
    • "To obtain mRNA secondary structure from the RB1 gene sequence we have used RNAfold (Capon et al., 2004; Shabalina et al., 2006; Altschul et al., 1990; Hofacker et al., 1994; McCaskill, 1990; Zuker and Stiegler, 1981; Lorenz et al., 2011; Turner and Mathews, 2009; Bompfunewerer et al., 2008; Hofacker and Stadler, 2006; Mathews et al., 2004; Jia and Luo, 2006; Jia et al., 2004) version 2.1.7 online tool ( from Vienna RNA package 2.0 (Hofacker et al., 1994). "
    [Show abstract] [Hide abstract]
    ABSTRACT: Clinically significant 18 Single Nucleotide Polymorphisms (SNPs) from exon regions of Retinoblastoma gene (RB1) were analyzed to find out the structural variations in mRNAs. Online bioinformatic tools i.e., Vienna RNA, RNAfold were used for secondary structure analysis of mRNAs. Predicted minimum Free Energy Change (MFE) was calculated for mRNAs structures. It has been observed that the average of predicted MFE value from 13 nonsense mutations was higher (0.76 kcal/mol) in comparison to 5 missense mutations. Presumably, 13 nonsense mutations are responsible for Nonsense Mediated mRNA Decay (NMD), therefore, excluded from haplotype analysis. From the statistical analysis all the thermodynamic data obtained from four SNP haplotypes are significant (p≤0.05), followed by three-SNP haplotype data except Ensemble diversity (p≤0.10). Interestingly, MEF of Centroid Secondary Structure is highly significant (p≤0.01) in all the cases (Two-SNP haplotypes, Three-SNP haplotypes and Four-SNP haplotypes).
    American Journal of Biochemistry and Biotechnology 10/2015; DOI:10.3844/ajbbsp.2015
  • Source
    • "It is indicated that, in addition to translational selection, other factors that correlate with GC3, e.g. transcriptional selection [40,41], mRNA stability [11,12], biased gene conversion [8,9], may also have combined with translational selection to contribute to the positive correlation between CUB and gene expression level. As translational regulation rather than transcriptional regulation or mRNA stability is more pronounced in influencing protein level in mammals [42], future investigations might as well involve protein expression data to verify such strong translational selection in human HK genes and take account of translation initiation [13] and elongation as well as codon order [43]. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Background Translational selection is a ubiquitous and significant mechanism to regulate protein expression in prokaryotes and unicellular eukaryotes. Recent evidence has shown that translational selection is weakly operative in highly expressed genes in human and other vertebrates. However, it remains unclear whether translational selection acts differentially on human genes depending on their expression patterns. Results Here we report that human housekeeping (HK) genes that are strictly defined as genes that are expressed ubiquitously and consistently in most or all tissues, are under stronger translational selection. Conclusions These observations clearly show that translational selection is also closely associated with expression pattern. Our results suggest that human HK genes are more efficiently and/or accurately translated into proteins, which will inevitably open up a new understanding of HK genes and the regulation of gene expression. Reviewers This article was reviewed by Yuan Yuan, Baylor College of Medicine; Han Liang, University of Texas MD Anderson Cancer Center (nominated by Dr Laura Landweber) Eugene Koonin, NCBI, NLM, NIH, United States of America Sandor Pongor, International Centre for Genetic Engineering and biotechnology (ICGEB), Italy.
    Biology Direct 07/2014; 9(1):17. DOI:10.1186/1745-6150-9-17 · 4.66 Impact Factor
  • Source
    • "Codon bias can also be influenced by selection for mRNA stability. In humans and mice, optimal codons for translation are mostly GC-ending [44,45]; these codons are thought to decrease both mRNA degradation rates in vitro[46] and the Gibbs free energy of mRNA secondary structure [47,48]. Lastly, selective constraint for splicing control also seems to cause low synonymous substitution rates in splicing associated regions, such as purine-rich exonic splicing enhancers (ESEs) [49] and exon-intron junctions [50,51]. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Background Synonymous codon usage can affect many cellular processes, particularly those associated with translation such as polypeptide elongation and folding, mRNA degradation/stability, and splicing. Highly expressed genes are thought to experience stronger selection pressures on synonymous codons. This should result in codon usage bias even in species with relatively low effective population sizes, like mammals, where synonymous site selection is thought to be weak. Here we use phylogenetic codon-based likelihood models to explore patterns of codon usage bias in a dataset of 18 mammalian rhodopsin sequences, the protein mediating the first step in vision in the eye, and one of the most highly expressed genes in vertebrates. We use these patterns to infer selection pressures on key translational mechanisms including polypeptide elongation, protein folding, mRNA stability, and splicing. Results Overall, patterns of selection in mammalian rhodopsin appear to be correlated with post-transcriptional and translational processes. We found significant evidence for selection at synonymous sites using phylogenetic mutation-selection likelihood models, with C-ending codons found to have the highest relative fitness, and to be significantly more abundant at conserved sites. In general, these codons corresponded with the most abundant tRNAs in mammals. We found significant differences in codon usage bias between rhodopsin loops versus helices, though there was no significant difference in mean synonymous substitution rate between these motifs. We also found a significantly higher proportion of GC-ending codons at paired sites in rhodopsin mRNA secondary structure, and significantly lower synonymous mutation rates in putative exonic splicing enhancer (ESE) regions than in non-ESE regions. Conclusions By focusing on a single highly expressed gene we both distinguish synonymous codon selection from mutational effects and analytically explore underlying functional mechanisms. Our results suggest that codon bias in mammalian rhodopsin arises from selection to optimally balance high overall translational speed, accuracy, and proper protein folding, especially in structurally complicated regions. Selection at synonymous sites may also be contributing to mRNA stability and splicing efficiency at exonic-splicing-enhancer (ESE) regions. Our results highlight the importance of investigating highly expressed genes in a broader phylogenetic context in order to better understand the evolution of synonymous substitutions.
    BMC Evolutionary Biology 05/2014; 14(1):96. DOI:10.1186/1471-2148-14-96 · 3.37 Impact Factor
Show more