Article

Construction of reliable protein-protein interaction networks with a new interaction generality measure.

Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center (GSC), Tsurumi-ku, Yokohama 230-0045, Japan.
Bioinformatics (impact factor: 5.47). 05/2003; 19(6):756-63. pp.756-63
Source: PubMed

ABSTRACT MOTIVATION: Recent screening techniques have made large amounts of protein-protein interaction data available, from which biologically important information such as the function of uncharacterized proteins, the existence of novel protein complexes, and novel signal-transduction pathways can be discovered. However, experimental data on protein interactions contain many false positives, making these discoveries difficult. Therefore computational methods of assessing the reliability of each candidate protein-protein interaction are urgently needed. RESULTS: We developed a new 'interaction generality' measure (IG2) to assess the reliability of protein-protein interactions using only the topological properties of their interaction-network structure. Using yeast protein-protein interaction data, we showed that reliable protein-protein interactions had significantly lower IG2 values than less-reliable interactions, suggesting that IG2 values can be used to evaluate and filter interaction data to enable the construction of reliable protein-protein interaction networks.

0 0
 · 
0 Bookmarks
 · 
45 Views
  • Source
    Article: Assessing and predicting protein interactions by combining manifold embedding with multiple information integration.
    [show abstract] [hide abstract]
    ABSTRACT: Protein-protein interactions (PPIs) play crucial roles in virtually every aspect of cellular function within an organism. Over the last decade, the development of novel high-throughput techniques has resulted in enormous amounts of data and provided valuable resources for studying protein interactions. However, these high-throughput protein interaction data are often associated with high false positive and false negative rates. It is therefore highly desirable to develop scalable methods to identify these errors from the computational perspective. We have developed a robust computational technique for assessing the reliability of interactions and predicting new interactions by combining manifold embedding with multiple information integration. Validation of the proposed method was performed with extensive experiments on densely-connected and sparse PPI networks of yeast respectively. Results demonstrate that the interactions ranked top by our method have high functional homogeneity and localization coherence. Our proposed method achieves better performances than the existing methods no matter assessing or predicting protein interactions. Furthermore, our method is general enough to work over a variety of PPI networks irrespectively of densely-connected or sparse PPI network. Therefore, the proposed algorithm is a much more promising method to detect both false positive and false negative interactions in PPI networks.
    BMC Bioinformatics 01/2012; 13 Suppl 7:S3. · 2.75 Impact Factor
  • Source
    Article: HitPredict: a database of quality assessed protein-protein interactions in nine species.
    [show abstract] [hide abstract]
    ABSTRACT: Despite the availability of a large number of protein-protein interactions (PPIs) in several species, researchers are often limited to using very small subsets in a few organisms due to the high prevalence of spurious interactions. In spite of the importance of quality assessment of experimentally determined PPIs, a surprisingly small number of databases provide interactions with scores and confidence levels. We introduce HitPredict (http://hintdb.hgc.jp/htp/), a database with quality assessed PPIs in nine species. HitPredict assigns a confidence level to interactions based on a reliability score that is computed using evidence from sequence, structure and functional annotations of the interacting proteins. HitPredict was first released in 2005 and is updated annually. The current release contains 36,930 proteins with 176,983 non-redundant, physical interactions, of which 116,198 (66%) are predicted to be of high confidence.
    Nucleic Acids Research 10/2010; 39(Database issue):D744-9. · 8.03 Impact Factor
  • Source
    Article: Predicting direct protein interactions from affinity purification mass spectrometry data.
    [show abstract] [hide abstract]
    ABSTRACT: Affinity purification followed by mass spectrometry identification (AP-MS) is an increasingly popular approach to observe protein-protein interactions (PPI) in vivo. One drawback of AP-MS, however, is that it is prone to detecting indirect interactions mixed with direct physical interactions. Therefore, the ability to distinguish direct interactions from indirect ones is of much interest. We first propose a simple probabilistic model for the interactions captured by AP-MS experiments, under which the problem of separating direct interactions from indirect ones is formulated. Then, given idealized quantitative AP-MS data, we study the problem of identifying the most likely set of direct interactions that produced the observed data. We address this challenging graph theoretical problem by first characterizing signatures that can identify weakly connected nodes as well as dense regions of the network. The rest of the direct PPI network is then inferred using a genetic algorithm.Our algorithm shows good performance on both simulated and biological networks with very high sensitivity and specificity. Then the algorithm is used to predict direct interactions from a set of AP-MS PPI data from yeast, and its performance is measured against a high-quality interaction dataset. As the sensitivity of AP-MS pipeline improves, the fraction of indirect interactions detected will also increase, thereby making the ability to distinguish them even more desirable. Despite the simplicity of our model for indirect interactions, our method provides a good performance on the test networks.
    Algorithms for Molecular Biology 10/2010; 5:34. · 1.35 Impact Factor

Full-text

View
0 Downloads
Available from

Keywords

candidate protein-protein interaction
 
computational methods
 
false positives
 
filter interaction data
 
IG2 values
 
interaction-network structure
 
less-reliable interactions
 
new 'interaction generality' measure
 
novel signal-transduction pathways
 
protein interactions
 
protein-protein interaction data available
 
protein-protein interactions
 
Recent screening techniques
 
reliable protein-protein interaction networks
 
reliable protein-protein interactions
 
yeast protein-protein interaction data