KAVIAR: an accessible system for testing SNV novelty

Institute for Systems Biology, Seattle, WA 98109, USA.
Bioinformatics (Impact Factor: 4.98). 09/2011; 27(22):3216-7. DOI: 10.1093/bioinformatics/btr540
Source: PubMed


With the rapidly expanding availability of data from personal genomes, exomes and transcriptomes, medical researchers will frequently need to test whether observed genomic variants are novel or known. This task requires downloading and handling large and diverse datasets from a variety of sources, and processing them with bioinformatics tools and pipelines. Alternatively, researchers can upload data to online tools, which may conflict with privacy requirements. We present here Kaviar, a tool that greatly simplifies the assessment of novel variants. Kaviar includes: (i) an integrated and growing database of genomic variation from diverse sources, including over 55 million variants from personal genomes, family genomes, transcriptomes, SNV databases and population surveys; and (ii) software for querying the database efficiently.
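The core check described above, whether an observed variant is novel or already known, can be illustrated with a minimal sketch. This is not Kaviar's actual software or data format: the in-memory index, the function names `build_index` and `classify`, and the example coordinates are all hypothetical, chosen only to show the idea of looking up a variant by genomic position in a local collection of known variants, so that no data needs to be uploaded to an online tool.

```python
from typing import Dict, Iterable, Set, Tuple

# Hypothetical stand-in for a known-variant database:
# (chromosome, 1-based position) -> set of alternate alleles seen there.
KnownVariants = Dict[Tuple[str, int], Set[str]]


def build_index(records: Iterable[Tuple[str, int, str]]) -> KnownVariants:
    """Build a position-keyed index from (chrom, pos, alt) records."""
    index: KnownVariants = {}
    for chrom, pos, alt in records:
        index.setdefault((chrom, pos), set()).add(alt.upper())
    return index


def classify(index: KnownVariants, chrom: str, pos: int, alt: str) -> str:
    """Report whether an observed SNV is known, novel, or a new allele
    at a position where some other variant has already been reported."""
    alleles = index.get((chrom, pos))
    if alleles is None:
        return "novel"
    if alt.upper() in alleles:
        return "known"
    return "known_position_new_allele"


# Made-up records for illustration only; these are not real variants.
known = build_index([
    ("chr1", 1000, "A"),
    ("chr1", 1000, "G"),
    ("chr2", 5000, "T"),
])

for query in [("chr1", 1000, "a"), ("chr1", 1000, "C"), ("chr3", 42, "G")]:
    print(query, classify(known, *query))
```

Keying the index by position rather than by full variant lets the lookup distinguish a completely novel site from a new allele at a previously reported site, which is one plausible design for this kind of query; a production system handling tens of millions of variants would need an on-disk or otherwise compact index rather than a Python dictionary.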



Available from: Juan Caballero
  • Source
    • "No rare, non-synonymous variants were evident in ARHGAP31, RBPJ, or EOGT. We observed two rare, putatively detrimental variants in DOCK6: a frameshift mutation at c.3190_3191del2 (p.L1064Vfs*60) with an allele frequency of 0.02% (computed using Kaviar) [Glusman et al., 2011], and a stop-gain, c.4480G>T (p.E1494*), which had not been previously reported."
    ABSTRACT: Adams-Oliver syndrome (AOS) is a rare malformation syndrome characterized by the presence of two anomalies: aplasia cutis congenita of the scalp and transverse terminal limb defects. Many affected individuals also have additional malformations, including a variety of intracranial anomalies such as periventricular calcification in keeping with cerebrovascular microbleeds, impaired neuronal migration, epilepsy, and microcephaly. Cardiac malformations can be present, as can vascular dysfunction in the forms of cutis marmorata telangiectasia congenita, pulmonary vein stenoses, and abnormal hepatic microvasculature. Elucidated genetic causes include four genes in different pathways, leading to a model of AOS as a multi-pathway disorder. We identified an infant with mild aplasia cutis congenita and terminal transverse limb defects, developmental delay and a severe, diffuse angiopathy with incomplete microvascularization. Whole-genome sequencing documented two rare truncating variants in DOCK6, a gene associated with a type of autosomal recessive AOS that recurrently features periventricular calcification and impaired neurodevelopment. We highlight an unexpectedly high frequency of likely deleterious mutations in this gene in the general population, relative to the rarity of the disease, and discuss possible explanations for this discrepancy. © 2014 Wiley Periodicals, Inc.
    Full-text · Article · Oct 2014 · American Journal of Medical Genetics Part A
  • Source
    ABSTRACT: Marked prolongation of the QT interval and polymorphic ventricular tachycardia following medication (drug-induced long QT syndrome, diLQTS) is a severe adverse drug reaction (ADR) that phenocopies congenital long QT syndrome (cLQTS) and is one of the leading causes for drug withdrawal and relabeling. We evaluated the frequency of rare non-synonymous variants in genes contributing to the maintenance of heart rhythm in cases of diLQTS using targeted capture coupled to next-generation sequencing. Eleven of 31 diLQTS subjects (36%) carried a novel missense mutation in genes with known congenital arrhythmia associations or with a known cLQTS mutation. In the 26 Caucasian subjects, 23% carried a highly conserved rare variant predicted to be deleterious to protein function in these genes compared with only 2-4% in public databases (P<0.003). We conclude that the rare variation in genes responsible for congenital arrhythmia syndromes is frequent in diLQTS. Our findings demonstrate that diLQTS is a pharmacogenomic syndrome predisposed by rare genetic variants. The Pharmacogenomics Journal advance online publication, 15 May 2012; doi:10.1038/tpj.2012.14.
    Full-text · Article · May 2012 · The Pharmacogenomics Journal
  • Source
    ABSTRACT: Various processes such as annotation and filtering of variants or comparison of variants in different genomes are required in whole-genome or exome analysis pipelines. However, processing different databases and searching among millions of genomic loci is not trivial. gSearch compares sequence variants in the Genome Variation Format (GVF) or Variant Call Format (VCF) with a pre-compiled annotation or with variants in other genomes. Its search algorithms are subsequently optimized and implemented in a multi-threaded manner. The proposed method is not a stand-alone annotation tool with its own reference databases. Rather, it is a search utility that readily accepts public or user-prepared reference files in various formats including GVF, Generic Feature Format version 3 (GFF3), Gene Transfer Format (GTF), VCF and Browser Extensible Data (BED) format. Compared to existing tools such as ANNOVAR, gSearch runs more than 10 times faster. For example, it is capable of annotating 52.8 million variants with allele frequencies in 6 min. gSearch is available at It can be used as an independent search tool or can easily be integrated to existing pipelines through various programming environments such as Perl, Ruby and Python.
    Full-text · Article · Jun 2012 · Bioinformatics