Constructing a robust protein-protein interaction network by integrating multiple public databases

Department of Information Science, University of Arkansas at Little Rock, 2801 S, University Ave, Little Rock, AR 72204-1099, USA.
BMC Bioinformatics (Impact Factor: 2.67). 10/2011; 12 Suppl 10(Suppl 10):S7. DOI: 10.1186/1471-2105-12-S10-S7
Source: PubMed

ABSTRACT Protein-protein interactions (PPIs) are a critical component for many underlying biological processes. A PPI network can provide insight into the mechanisms of these processes, as well as the relationships among different proteins and toxicants that are potentially involved in the processes. There are many PPI databases publicly available, each with a specific focus. The challenge is how to effectively combine their contents to generate a robust and biologically relevant PPI network.
In this study, seven public PPI databases, BioGRID, DIP, HPRD, IntAct, MINT, REACTOME, and SPIKE, were used to explore a powerful approach to combine multiple PPI databases for an integrated PPI network. We developed a novel method called k-votes to create seven different integrated networks by using values of k ranging from 1-7. Functional modules were mined by using SCAN, a Structural Clustering Algorithm for Networks. Overall module qualities were evaluated for each integrated network using the following statistical and biological measures: (1) modularity, (2) similarity-based modularity, (3) clustering score, and (4) enrichment.
Each integrated human PPI network was constructed based on the number of votes (k) for a particular interaction from the committee of the original seven PPI databases. The performance of functional modules obtained by SCAN from each integrated network was evaluated. The optimal value for k was determined by the functional module analysis. Our results demonstrate that the k-votes method outperforms the traditional union approach in terms of both statistical significance and biological meaning. The best network is achieved at k = 2, which is composed of interactions that are confirmed in at least two PPI databases. In contrast, the traditional union approach yields an integrated network that consists of all interactions of seven PPI databases, which might be subject to high false positives.
We determined that the k-votes method for constructing a robust PPI network by integrating multiple public databases outperforms previously reported approaches and that a value of k=2 provides the best results. The developed strategies for combining databases show promise in the advancement of network construction and modeling.

  • [Show abstract] [Hide abstract]
    ABSTRACT: It is known that a disease is rarely a consequence of an abnormality of a single gene, but reflects the interactions of various processes in a complex network. Annotated molecular networks offer new opportunities to understand diseases within a systems biology framework and provide an excellent substrate for network-based identification of biomarkers. The network biomarkers and dynamic network biomarkers (DNBs) represent new types of biomarkers with protein–protein or gene–gene interactions that can be monitored and evaluated at different stages and time-points during development of disease. Clinical bioinformatics as a new way to combine clinical measurements and signs with human tissue-generated bioinformatics is crucial to translate biomarkers into clinical application, validate the disease specificity, and understand the role of biomarkers in clinical settings. In this article, the recent advances and developments on network biomarkers and DNBs are comprehensively reviewed. How network biomarkers help a better understanding of molecular mechanism of diseases, the advantages and constraints of network biomarkers for clinical application, clinical bioinformatics as a bridge to the development of diseases-specific, stage-specific, severity-specific and therapy predictive biomarkers, and the potentials of network biomarkers are also discussed.
    Journal of Cellular and Molecular Medicine 11/2014; 19(2). DOI:10.1111/jcmm.12447 · 3.70 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: This study investigated the relationship between particle size and toxicity of silica particles (SP) with diameters of 30, 70, and 300 nm, which is essential to the safe design and application of SP. Data obtained from histopathological examinations suggested that SP of these sizes can all induce acute inflammation in the liver. In vivo imaging showed that intravenously administrated SP are mainly present in the liver, spleen and intestinal tract. Interestingly, in gene expression analysis, the cellular response pathways activated in the liver are predominantly conserved independently of particle dose when the same size SP are administered or are conserved independently of particle size, surface area and particle number when nano- or submicro-sized SP are administered at their toxic doses. Meanwhile, integrated analysis of transcriptomics, previous metabonomics and conventional toxicological results support the view that SP can result in inflammatory and oxidative stress, generate mitochondrial dysfunction, and eventually cause hepatocyte necrosis by neutrophil-mediated liver injury.
    Nanotechnology 12/2012; 24(1):015106. DOI:10.1088/0957-4484/24/1/015106 · 3.67 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Chinese medicine has been widely used in clinical practice, but its mode of action often remains obscure. This has seriously hindered further development and better clinical applications of Chinese medicine. Among the most critical questions to be addressed, the identification of active ingredients is an important one requiring more research. Existing methods are only concerned the potential pharmacological effects of the individual purified chemical ingredients without consideration of the contents of these ingredients, which is critical to the comprehensive effect of Chinese medicine. A novel approach was proposed here to integrate network pharmacology analysis and ingredient content in Chinese medicine to identify active ingredients. The therapeutic action of Xuesaitong (XST) injection on myocardial infarction was analyzed as an example in this study. Firstly, we built a cardiovascular disease (CVD) related protein-protein interaction (PPI) network. Secondly, the potential targets of the ingredients of XST were identified by integrating microarray data, text mining and pharmacophore model-based prediction. The target-ingredient relationships were then mapped to the network. Topological attributes related to the targets of these ingredients, together with the ingredients' contents, were combined to calculate a composition-weighted index for integrative evaluation of ingredient efficacy. Our results indicated that major active ingredients in XST were notoginsenoside R1, ginsenoside Rg1, Rb1, Rd and Re, which was further validated on myocardial infarction rat models. In conclusion, this study presented a novel approach to identify active ingredients in Chinese medicine.
    Molecular BioSystems 04/2014; 10(7). DOI:10.1039/c3mb70581a · 3.18 Impact Factor