Article

PSI-BLAST-ISS: an intermediate sequence search tool for estimation of the position-specific alignment reliability.

Institute of Biotechnology, Graiciuno 8, LT-02241 Vilnius, Lithuania.
BMC Bioinformatics (impact factor: 2.75). 02/2005; 6:185. DOI:10.1186/1471-2105-6-185 pp.185
Source: PubMed

ABSTRACT Protein sequence alignments have become indispensable for virtually any evolutionary, structural or functional study involving proteins. Modern sequence search and comparison methods combined with rapidly increasing sequence data often can reliably match even distantly related proteins that share little sequence similarity. However, even highly significant matches generally may have incorrectly aligned regions. Therefore when exact residue correspondence is used to transfer biological information from one aligned sequence to another, it is critical to know which alignment regions are reliable and which may contain alignment errors.
PSI-BLAST-ISS is a standalone Unix-based tool designed to delineate reliable regions of sequence alignments as well as to suggest potential variants in unreliable regions. The region-specific reliability is assessed by producing multiple sequence alignments in different sequence contexts followed by the analysis of the consistency of alignment variants. The PSI-BLAST-ISS output enables the user to simultaneously analyze alignment reliability between query and multiple homologous sequences. In addition, PSI-BLAST-ISS can be used to detect distantly related homologous proteins. The software is freely available at: http://www.ibt.lt/bioinformatics/iss.
PSI-BLAST-ISS is an effective reliability assessment tool that can be useful in applications such as comparative modelling or analysis of individual sequence regions. It favorably compares with the existing similar software both in the performance and functional features.

0 0
 · 
0 Bookmarks
 · 
35 Views
  • Article: Combining sensitive database searches with multiple intermediates to detect distant homologues.
    [show abstract] [hide abstract]
    ABSTRACT: Using data from the CATH structure classification, we have assessed the blastp, fasta, smith-waterman and gapped-blast algorithms, developed a portable normalization scheme and identified safe thresholds for database searching. Of the four methods assessed, fasta, smith-waterman and gapped-blast perform similarly, whereas the sensitivity of blastp was much lower. Introduction of an intermediate sequence search substantially improved the results. When tested on a set of relationships that could not be identified by blastp, intermediate sequences were able to find double the number of relationships identified by the smith-waterman algorithm alone. However, we found that the benefit of using intermediates varied considerably between each family and depended not only on the number of available sequences, but also their diversity. In an attempt to increase sensitivity further, a multiple intermediate sequence search (MISS) procedure was developed. When assessed on 1906 cases from a wide range of homologous families that could not be detected by the previous approaches, MISS was able to identify 241 additional relationships. MISS uses the full extent of sequence diversity to detect additional relationships, but does not consider any structure-specific information. For this reason, it is more generally applicable than fold recognition and threading methods, which require a library of known structures.
    Protein engineering 03/1999; 12(2):95-100.
  • Source
    Article: SEALS: a system for easy analysis of lots of sequences.
    [show abstract] [hide abstract]
    ABSTRACT: We present a system of programs designed to facilitate sequence analysis projects involving large amounts of data. SEALS (System for Easy Analysis of Lots of Sequences) is a logically organized set of flexible, easily modifiable research tools, designed to run on open systems. Functionality is divided into approximately 50 commands which follow consistent syntax and semantics; wrappers are also provided for commonly used sequence analysis software to effect similar syntax for these programs. SEALS includes software for retrieving sequence information, scripting database search tools such as BLAST and MoST, viewing and analyzing search outputs, searching in and processing nucleotide and protein sequences using regular expressions, and constructing rational predictions of protein features. The system is designed to provide modular elements which can be combined, modified, and integrated with other methods in order to quickly design and execute computer experiments for sequence analysis projects at the scale of whole genomes.
    Proceedings / ... International Conference on Intelligent Systems for Molecular Biology; ISMB. International Conference on Intelligent Systems for Molecular Biology 02/1997; 5:333-9.
  • Article: Ten Years and Counting…
    [show abstract] [hide abstract]
    ABSTRACT: The articles in this issue cover mechanics, biology, and laboratory applications, and three cover computation per se. Moreover, they're evenly divided between the two categories of CiSE readers. CiSE thus seems to be fulfilling its mission of establishing a liaison between the physical sciences and computational sciences.
    Computing in Science and Engineering 08/2008; · 1.42 Impact Factor

Full-text

View
0 Downloads
Available from

Keywords

aligned sequence
 
alignment regions
 
alignment reliability
 
alignment variants
 
comparative modelling
 
comparison methods
 
delineate reliable regions
 
different sequence contexts
 
effective reliability assessment tool
 
individual sequence regions
 
Modern sequence search
 
multiple homologous sequences
 
multiple sequence alignments
 
potential variants
 
Protein sequence alignments
 
PSI-BLAST-ISS output enables
 
region-specific reliability
 
sequence data
 
sequence similarity
 
unreliable regions