The Human Phenotype Ontology: A Tool for Annotating and Analyzing Human Hereditary Disease

Institute for Medical Genetics, Charité-Universitätsmedizin Berlin, 13353 Berlin, Germany.
The American Journal of Human Genetics (Impact Factor: 10.99). 11/2008; 83(5):610-5. DOI: 10.1016/j.ajhg.2008.09.017
Source: PubMed

ABSTRACT There are many thousands of hereditary diseases in humans, each of which has a specific combination of phenotypic features, but computational analysis of phenotypic data has been hampered by lack of adequate computational data structures. Therefore, we have developed a Human Phenotype Ontology (HPO) with over 8000 terms representing individual phenotypic anomalies and have annotated all clinical entries in Online Mendelian Inheritance in Man with the terms of the HPO. We show that the HPO is able to capture phenotypic similarities between diseases in a useful and highly significant fashion.

Download full-text


Available from: Sebastian Köhler, Sep 05, 2015
    • "Recently, several studies have been published that have investigated the expected yield of clinical exome sequencing [Yang et al., 2013; Soden et al., 2014; Yang et al., 2014; Zemojtel et al., 2014]. The phenotypic matching algorithms often made use of the Human Phenotype Ontology [Robinson et al., 2008; Köhler et al., 2009; Köhler et al., 2014]. Several groups have developed algorithms to use HPO analysis as a part of gene prioritization in exome sequencing [Sifrim et al., 2013; Javed et al., 2014; Singleton et al., 2014; Robinson et al., 2014b]. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Genomic matchmaking databases allow participants to submit genomic and phenotypic data with the goal of identifying previously uncharacterized disease-associated genes by "matching" to other comparable cases. Current estimates suggest that there are at least 3000 Mendelian disease-associated genes that have not yet been characterized as such, but the true number may be substantially higher. Therefore, Genomic matchmaking databases are addressing a pressing medical need, and it is important to ask how they should be designed and how much data they should strive to contain in order to identify a certain number of these genes. In this work, we argue that genomic matchmaking has similarities to the so called "birthday paradox", which refers to the observation that within a group of just 23 persons, two people will have the same birthday with probability greater than 50%. We develop a series of simulations to provide a rough estimate of the number of cases required and to explore the influence of parameters such as genetic heterogeneity, mode of inheritance, background variation, precision of phenotypic descriptions, disease prevalence, and the accuracy of bioinformatic pathogenicity prediction programs on the performance of genomic matchmaking. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
    Human Mutation 08/2015; DOI:10.1002/humu.22848 · 5.05 Impact Factor
  • Source
    • "Subsequently a computational method termed Phenotypic Interpretation of eXomes (PhenIX), that evaluates variants based on population frequency and predicted pathogenicity and then ranks the genes according to variant score and a clinical relevance score, was used to prioritize candidate genes [Zemojtel et al., 2014]. To calculate the clinical relevance score we entered the following human phenotype ontology [Robinson et al., 2008] terms: 2e3 toe cutaneous syndactyly (HP0005709); proximal/ middle symphalangism of 3rd finger (HP:0009482); proximal/ middle symphalangism of 2nd finger (HP:0009579), distal/middle symphalangism of 2nd finger (HP:0009563), short proximal phalanx of 2nd finger (HP:0009597), Aplasia/Hypoplasia of the middle phalanx of the 2nd finger (HP:0009568); Aplasia/Hypoplasia of the middle phalanx of the 3rd finger (HP0009437); Aplasia/Hypoplasia of the middle phalanx of the 4th finger (HP:0009299); Aplasia/ "
    [Show abstract] [Hide abstract]
    ABSTRACT: Pfeiffer syndrome (MIM: #101600) is a rare autosomal dominant disorder classically characterized by limb and craniofacial anomalies. It is caused by heterozygous mutations in the fibroblast growth factor receptors types 1 and 2 (FGFR1 and FGFR2). We applied a next generation sequencing (NGS) panel approach comprising all 2877 genes currently known to be causative for one or more Mendelian diseases combined with the phenotype based computational tool PhenIX (Phenotypic Interpretation of eXomes). We report on a patient presenting with multiple anomalies of hands and feet including brachydactyly and symphalangism. No clinical diagnosis could be established based on the clinical findings and testing of several genes associated with brachydactyly and symphalangism failed to identify mutations. Via next generation sequencing (NGS) panel approach we then identified a novel de novo missense FGFR2 mutation affecting an amino acid reported to be mutated in Pfeiffer syndrome. Since our patient shows typical radiological findings of Pfeiffer syndrome in hands and feet but at the same time lacks several characteristic features such as clinical signs of craniosynostosis and prominent eyes we suggest introducing the term "FGFR2 associated phenotypes" for similar cases. Our results highlight the emerging role of combined NGS and phenotype based bioinformatics strategies to establish clinical diagnoses. Copyright © 2015. Published by Elsevier Masson SAS.
    European journal of medical genetics 06/2015; 58(8). DOI:10.1016/j.ejmg.2015.05.007 · 1.49 Impact Factor
  • Source
    • "In total, 42 unrelated European patients and two affected sisters (12-0880 and 12-0878 in Table 1) with Usher syndrome were included in this study. The clinical diagnosis was based on simultaneous occurrence of bilateral sensorineural hearing loss (HL) (HP:0000407) (Robinson et al. 2008) and bilateral RP (HP:0007703) (Robinson et al. 2008). The Charit e University Medicine ethics board approved this study and informed consent for genetic testing was obtained from all probands and DNA was extracted from 200 lL of EDTA blood with a Biorobot M48 (Qiagen, Venlo, Netherlands) according to the manufacturer's protocol. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Usher syndrome is an autosomal recessive disorder characterized both by deafness and blindness. For the three clinical subtypes of Usher syndrome causal mutations in altogether 12 genes and a modifier gene have been identified. Due to the genetic heterogeneity of Usher syndrome, the molecular analysis is predestined for a comprehensive and parallelized analysis of all known genes by next-generation sequencing (NGS) approaches. We describe here the targeted enrichment and deep sequencing for exons of Usher genes and compare the costs and workload of this approach compared to Sanger sequencing. We also present a bioinformatics analysis pipeline that allows us to detect single-nucleotide variants, short insertions and deletions, as well as copy number variations of one or more exons on the same sequence data. Additionally, we present a flexible in silico gene panel for the analysis of sequence variants, in which newly identified genes can easily be included. We applied this approach to a cohort of 44 Usher patients and detected biallelic pathogenic mutations in 35 individuals and monoallelic mutations in eight individuals of our cohort. Thirty-nine of the sequence variants, including two heterozygous deletions comprising several exons of USH2A, have not been reported so far. Our NGS-based approach allowed us to assess single-nucleotide variants, small indels, and whole exon deletions in a single test. The described diagnostic approach is fast and cost-effective with a high molecular diagnostic yield.
    09/2014; 2(5). DOI:10.1002/mgg3.92
Show more