Three-dimensional reconstruction of protein networks provides insight into human genetic disease.

Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, USA.
Nature Biotechnology (Impact Factor: 39.08). 01/2012; 30(2):159-64. DOI: 10.1038/nbt.2106
Source: PubMed

ABSTRACT To better understand the molecular mechanisms and genetic basis of human disease, we systematically examine relationships between 3,949 genes, 62,663 mutations and 3,453 associated disorders by generating a three-dimensional, structurally resolved human interactome. This network consists of 4,222 high-quality binary protein-protein interactions with their atomic-resolution interfaces. We find that in-frame mutations (missense point mutations and in-frame insertions and deletions) are enriched on the interaction interfaces of proteins associated with the corresponding disorders, and that the disease specificity for different mutations of the same gene can be explained by their location within an interface. We also predict 292 candidate genes for 694 unknown disease-to-gene associations with proposed molecular mechanism hypotheses. This work indicates that knowledge of how in-frame disease mutations alter specific interactions is critical to understanding pathogenesis. Structurally resolved interaction networks should be valuable tools for interpreting the wealth of data being generated by large-scale structural genomics and disease association studies.
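The enrichment claim in the abstract can be illustrated with a minimal sketch. This is not the authors' method: the function names, the binomial null model (each mutation lands on an interface residue with probability equal to the interface's share of the protein length), and the counts in the usage lines are all hypothetical and chosen only for illustration.

```python
from math import comb


def enrichment_ratio(k_interface, n_mutations, interface_len, protein_len):
    """Observed fraction of mutations on the interface divided by the
    fraction expected if mutations were spread uniformly over residues."""
    observed = k_interface / n_mutations
    expected = interface_len / protein_len
    return observed / expected


def binomial_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p): the chance of seeing at least k
    interface mutations under the uniform null model (an assumption)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


# Hypothetical toy counts, not data from the paper:
# 12 of 30 in-frame mutations fall on a 50-residue interface of a 400-residue protein.
ratio = enrichment_ratio(12, 30, 50, 400)
p_value = binomial_tail(12, 30, 50 / 400)
print(f"enrichment ratio = {ratio:.2f}, one-sided p = {p_value:.3g}")
```

A ratio above 1 with a small tail probability would suggest that mutations cluster on the interface more than chance predicts. A real analysis would also need to handle multiple interfaces per protein and correct for multiple testing.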

  • ABSTRACT: Background: Nuclear distribution E homolog 1 (NDE1), located within chromosome 16p13.11, plays an essential role in microtubule organization, mitosis, and neuronal migration and has been suggested by several studies of rare copy number variants to be a promising schizophrenia (SCZ) candidate gene. Recently, increasing attention has been paid to rare single-nucleotide variants (SNVs) discovered by deep sequencing of candidate genes, because such SNVs may have large effect sizes and their functional analysis may clarify etiopathology. Methods and Results: We conducted mutation screening of NDE1 coding exons using 433 SCZ and 145 pervasive developmental disorder samples in order to identify rare single-nucleotide variants with a minor allele frequency ≤5%. We then performed genetic association analysis using a large number of unrelated individuals (3554 SCZ, 1041 bipolar disorder [BD], and 4746 controls). Among the discovered novel rare variants, we detected significant associations between SCZ and S214F (P = .039), and between BD and R234C (P = .032). Furthermore, functional assays showed that S214F affected axonal outgrowth and the interaction between NDE1 and YWHAE (14-3-3 epsilon; a neurodevelopmental regulator). Conclusions: This study strengthens the evidence for association between rare variants within NDE1 and SCZ, and may shed light on the molecular mechanisms underlying this severe psychiatric disorder.
  • ABSTRACT: Here we present a method for extracting candidate cancer pathways from tumor 'omics data while explicitly accounting for diverse consequences of mutations for protein interactions. Disease-causing mutations are frequently observed at either core or interface residues mediating protein interactions. Mutations at core residues frequently destabilize protein structure while mutations at interface residues can specifically affect the binding energies of protein-protein interactions. As a result, mutations in a protein may result in distinct interaction profiles and thus have different phenotypic consequences. We describe a protein structure-guided pipeline for extracting interacting protein sets specific to a particular mutation. Of 59 cancer genes with 3D co-complexed structures in the Protein Data Bank, 43 showed evidence of mutations with different functional consequences. Literature survey reciprocated functional predictions specific to distinct mutations on APC, ATRX, BRCA1, CBL and HRAS. Our analysis suggests that accounting for mutation-specific perturbations to cancer pathways will be essential for personalized cancer therapy.
    Pacific Symposium on Biocomputing 01/2015; 20:84-95.
  • ABSTRACT: Interactions between proteins largely govern cellular processes, and this has led to numerous efforts culminating in an enormous amount of information on proteins, their interactions and the functions determined by those interactions. The main concern of the present study is to present an interface analysis of cardiovascular-disorder (CVD) related proteins to shed light on the details of interactions and to emphasize the importance of using structures in network studies. This study combines the network-centred approach with three-dimensional studies to comprehend the fundamentals of biology. Interface properties were used as descriptors to classify CVD-associated and non-CVD-associated proteins. A machine learning algorithm was used to generate a classifier based on the training set, which was then used to predict potential CVD-related proteins from a set of polymorphic proteins that are not known to be involved in any disease. Among several classification algorithms applied to generate models, the best performance was achieved using Random Forest, with an accuracy of 69.5%. The tool named CARDIO-PRED, based on the prediction model is present at The predicted CVD-related proteins may not be the causal factor of a particular disease but may be involved in pathways and reactions as yet unknown, thus permitting a more rational analysis of disease mechanism. Study of their interactions with other proteins can significantly improve our understanding of the molecular mechanisms of disease.
    Systems and Synthetic Biology 06/2015; 9(1-2). DOI:10.1007/s11693-015-9164-z

