Conference Paper

A Seed-Based Method for Predicting Common Secondary Structures in Unaligned RNA Sequences.

DOI: 10.1007/978-3-540-73729-2_38 Conference: Modeling Decisions for Artificial Intelligence, 4th International Conference, MDAI 2007, Kitakyushu, Japan, August 16-18, 2007, Proceedings
Source: DBLP


The prediction of RNA secondary structure can be facilitated by incorporating with comparative analysis of homologous sequences.
However, most of existing comparative approaches are vulnerable to alignment errors. Here we use unaligned sequences to devise
a seed-based method for predicting RNA secondary structures. The central idea of our method can be described by three major
steps: 1) to detect all possible stems in each sequence using the so-called position matrix, which indicates the paired or
unpaired information for each position in the sequence; 2) to select the seeds for RNA folding by finding and assessing the
conserved stems across all sequences; 3) to predict RNA secondary structures on the basis of the seeds. We tested our method
on data sets composed of RNA sequences with known secondary structures. Our method has average accuracy (measured as sensitivity)
69.93% for singe sequence tests, 72.97% for two-sequence tests, and 79.27% for three-sequence tests. The results show that
our method can predict RNA secondary structure with a higher accuracy than Mfold.

1 Read