Article

Mining Alzheimer disease relevant proteins from integrated protein interactome data.

Indiana University School of Informatics, Purdue University School of Science, Dept. of Computer and Information Science Indianapolis, IN 46202, USA.
Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing 02/2006; DOI: 10.1142/9789812701626_0034
Source: PubMed

ABSTRACT Huge unrealized post-genome opportunities remain in the understanding of detailed molecular mechanisms for Alzheimer Disease (AD). In this work, we developed a computational method to rank-order AD-related proteins, based on an initial list of AD-related genes and public human protein interaction data. In this method, we first collected an initial seed list of 65 AD-related genes from the OMIM database and mapped them to 70 AD seed proteins. We then expanded the seed proteins to an enriched AD set of 765 proteins using protein interactions from the Online Predicated Human Interaction Database (OPHID). We showed that the expanded AD-related proteins form a highly connected and statistically significant protein interaction sub-network. We further analyzed the sub-network to develop an algorithm, which can be used to automatically score and rank-order each protein for its biological relevance to AD pathways(s). Our results show that functionally relevant AD proteins were consistently ranked at the top: among the top 20 of 765 expanded AD proteins, 19 proteins are confirmed to belong to the original 70 AD seed protein set. Our method represents a novel use of protein interaction network data for Alzheimer disease studies and may be generalized for other disease areas in the future.

0 Bookmarks
 · 
88 Views
  • [Show abstract] [Hide abstract]
    ABSTRACT: The human network of Protein-Protein Interactions (PPIs) (interactome) provides information on biological systems that can be used to aid prediction of protein function and disease association. As some classes of protein may be the focus of much study, data sets may contain bias, which may affect the results of network analyses. Implicated cancer proteins and proteins including significant known mediators of cardiovascular disease (CVD) display a tendency to play a central role in a previously constructed interactome. However, removing possible bias in the interactome by only considering interactions obtained from non-targeted approaches affects the significance of the findings.
    International Journal of Data Mining and Bioinformatics 01/2014; 9(4):339 - 357. DOI:10.1504/IJDMB.2014.062150 · 0.66 Impact Factor
  • Source
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: We developed a new computational technique called Step-Level Differential Response (SLDR) to identify genetic regulatory relationships. Our technique takes advantages of functional genomics data for the same species under different perturbation conditions, therefore complementary to current popular computational techniques. It can particularly identify "rare" activation/inhibition relationship events that can be difficult to find in experimental results. In SLDR, we model each candidate target gene as being controlled by N binary-state regulators that lead to ≤2N observable states ("step-levels") for the target. We applied SLDR to the study of the GEO microarray data set GSE25644, which consists of 158 different mutant S. cerevisiae gene expressional profiles. For each target gene t, we first clustered ordered samples into various clusters, each approximating an observable step-level of t to screen out the "de-centric" target. Then, we ordered each gene x as a candidate regulator and aligned t to x for the purpose of examining the step-level correlations between low expression set of x (Ro) and high expression set of x (Rh) from the regulator x to t, by finding max f(t, x): |Ro-Rh| over all candidate × in the genome for each t. We therefore obtained activation and inhibitions events from different combinations of Ro and Rh. Furthermore, we developed criteria for filtering out less-confident regulators, estimated the number of regulators for each target t, and evaluated identified top-ranking regulator-target relationship. Our results can be cross-validated with the Yeast Fitness database. SLDR is also computationally efficient with o(N2) complexity. In summary, we believe SLDR can be applied to the mining of functional genomics big data for future network biology and network medicine applications.
    BMC Bioinformatics 10/2014; 15(S11):S1. DOI:10.1186/1471-2105-15-S11-S1 · 2.67 Impact Factor

Full-text (2 Sources)

Download
49 Downloads
Available from
May 22, 2014

Similar Publications