dbNSFP: A Lightweight Database of Human Nonsynonymous SNPs and Their Functional Predictions

Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, Texas 77030, USA.
Human Mutation (Impact Factor: 5.14). 08/2011; 32(8):894-9. DOI: 10.1002/humu.21517
Source: PubMed


With the advance of sequencing technologies, whole exome sequencing has increasingly been used to identify mutations that cause human diseases, especially rare Mendelian diseases. Among the analysis steps, functional prediction (of being deleterious) plays an important role in filtering or prioritizing nonsynonymous SNP (NS) for further analysis. Unfortunately, different prediction algorithms use different information and each has its own strength and weakness. It has been suggested that investigators should use predictions from multiple algorithms instead of relying on a single one. However, querying predictions from different databases/Web-servers for different algorithms is both tedious and time consuming, especially when dealing with a huge number of NSs identified by exome sequencing. To facilitate the process, we developed dbNSFP (database for nonsynonymous SNPs' functional predictions). It compiles prediction scores from four new and popular algorithms (SIFT, Polyphen2, LRT, and MutationTaster), along with a conservation score (PhyloP) and other related information, for every potential NS in the human genome (a total of 75,931,005). It is the first integrated database of functional predictions from multiple algorithms for the comprehensive collection of human NSs. dbNSFP is freely available for download at

Download full-text


Available from: Xiaoming Liu
  • Source
    • "The variant is predicted to have a deleterious effect on protein function by bioinformatic analysis using SIFT and PolyPhen-2 (Table 2). Other prediction programs however give variable results [Liu et al., 2011] (Supp. Table S2). "
    [Show abstract] [Hide abstract]
    ABSTRACT: Corneal dystrophies are a clinically and genetically heterogeneous group of inherited disorders that bilaterally affect corneal transparency. They are defined according to the corneal layer affected and by their genetic cause. In this study we identified a dominantly inherited Epithelial Recurrent Erosion Dystrophy (ERED)-like disease that is common in northern Sweden. Whole-exome sequencing resulted in the identification of a novel mutation, c.2816C>T, p.T939I, in the COL17A1 gene which encodes collagen type XVII alpha 1. The variant segregated with disease in a genealogically expanded pedigree dating back 200 years. We also investigated a unique COL17A1 synonymous variant, c.3156C>T, identified in a previously reported unrelated dominant ERED-like family linked to a locus on chromosome 10q23-q24 encompassing COL17A1. We show that this variant introduces a cryptic donor site resulting in aberrant pre-mRNA splicing and is highly likely to be pathogenic. Bi-allelic COL17A1 mutations have previously been associated with a recessive skin disorder, junctional epidermolysis bullosa, with recurrent corneal erosions being reported in some cases. Our findings implicate presumed gain of function COL17A1 mutations causing dominantly inherited ERED and improve understanding of the underlying pathology. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
    Full-text · Article · Feb 2015 · Human Mutation
  • Source
    • "Fly geneticists also discovered the founding member of the transient receptor potential (TRP) channels (Montell et al. 1985; Montell and Rubin 1989). These channels have been shown to play critical roles in vision, pain, heat, and cold perception and the trp founding member promoted the discovery of the vertebrate homologs that are associated with numerous Mendelian diseases (Dai et al. 2010; Nilius and Owsianik 2011). "
    [Show abstract] [Hide abstract]
    ABSTRACT: Many scientists complain that the current funding situation is dire. Indeed, there has been an overall decline in support in funding for research from the National Institutes of Health and the National Science Foundation. Within the Drosophila field, some of us question how long this funding crunch will last as it demotivates principal investigators and perhaps more importantly affects the longterm career choice of many young scientists. Yet numerous very interesting biological processes and avenues remain to be investigated in Drosophila, and probing questions can be answered fast and efficiently in flies to reveal new biological phenomena. Moreover, Drosophila is an excellent model organism for studies that have translational impact for genetic disease and for other medical implications such as vector-borne illnesses. We would like to promote a better collaboration between Drosophila geneticists/biologists and human geneticists/bioinformaticians/clinicians, as it would benefit both fields and significantly impact the research on human diseases. Copyright © 2015, The Genetics Society of America.
    Full-text · Article · Jan 2015 · Genetics
  • Source
    • "Mapping and coverage statistics were generated from the mapping output files using the SeqCap analysis toolkit provided by Roche 454 as well as GATK. Identified variants were checked against the dbNSFP v1.3[47]as well as dbSNP v135 and HGMDH Professional 2011.4 database (released December 9, 2011). SNVs and indels were filtered depending on their allele frequency focusing on rare variants with a minor allele frequency (MAF) of 3% or less. "

    Full-text · Article · Nov 2014 · PLoS ONE
Show more