Article

CNV Workshop: an integrated platform for high-throughput copy number variation discovery and clinical diagnostics

Center for Biomedical Informatics, The Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA.
BMC Bioinformatics (Impact Factor: 2.67). 02/2010; 11(1):74. DOI: 10.1186/1471-2105-11-74
Source: PubMed

ABSTRACT Recent studies have shown that copy number variations (CNVs) are frequent in higher eukaryotes and associated with a substantial portion of inherited and acquired risk for various human diseases. The increasing availability of high-resolution genome surveillance platforms provides opportunity for rapidly assessing research and clinical samples for CNV content, as well as for determining the potential pathogenicity of identified variants. However, few informatics tools for accurate and efficient CNV detection and assessment currently exist.
We developed a suite of software tools and resources (CNV Workshop) for automated, genome-wide CNV detection from a variety of SNP array platforms. CNV Workshop includes three major components: detection, annotation, and presentation of structural variants from genome array data. CNV detection utilizes a robust and genotype-specific extension of the Circular Binary Segmentation algorithm, and the use of additional detection algorithms is supported. Predicted CNVs are captured in a MySQL database that supports cohort-based projects and incorporates a secure user authentication layer and user/admin roles. To assist with determination of pathogenicity, detected CNVs are also annotated automatically for gene content, known disease loci, and gene-based literature references. Results are easily queried, sorted, filtered, and visualized via a web-based presentation layer that includes a GBrowse-based graphical representation of CNV content and relevant public data, integration with the UCSC Genome Browser, and tabular displays of genomic attributes for each CNV.
To our knowledge, CNV Workshop represents the first cohesive and convenient platform for detection, annotation, and assessment of the biological and clinical significance of structural variants. CNV Workshop has been successfully utilized for assessment of genomic variation in healthy individuals and disease cohorts and is an ideal platform for coordinating multiple associated projects.
Available on the web at: http://sourceforge.net/projects/cnv.

Full-text

Available from: Peter S White, Jun 14, 2015
0 Followers
 · 
370 Views
  • [Show abstract] [Hide abstract]
    ABSTRACT: The 22q11.2 deletion syndrome (22q11DS; velocardiofacial/DiGeorge syndrome; VCFS/DGS) is the most common microdeletion syndrome and the phenotypic presentation is highly variable. Approximately 65% of individuals with 22q11DS have a congenital heart defect (CHD), mostly of the conotruncal type, and/or an aortic arch defect. The etiology of this phenotypic variability is not currently known. We hypothesized that copy-number variants (CNVs) outside the 22q11.2 deleted region might increase the risk of being born with a CHD in this sensitized population. Genotyping with Affymetrix SNP Array 6.0 was performed on two groups of subjects with 22q11DS separated by time of ascertainment and processing. CNV analysis was completed on a total of 949 subjects (cohort 1, n = 562; cohort 2, n = 387), 603 with CHDs (cohort 1, n = 363; cohort 2, n = 240) and 346 with normal cardiac anatomy (cohort 1, n = 199; cohort 2, n = 147). Our analysis revealed that a duplication of SLC2A3 was the most frequent CNV identified in the first cohort. It was present in 18 subjects with CHDs and 1 subject without (p = 3.12 × 10(-3), two-tailed Fisher's exact test). In the second cohort, the SLC2A3 duplication was also significantly enriched in subjects with CHDs (p = 3.30 × 10(-2), two-tailed Fisher's exact test). The SLC2A3 duplication was the most frequent CNV detected and the only significant finding in our combined analysis (p = 2.68 × 10(-4), two-tailed Fisher's exact test), indicating that the SLC2A3 duplication might serve as a genetic modifier of CHDs and/or aortic arch anomalies in individuals with 22q11DS. Copyright © 2015 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
    The American Journal of Human Genetics 04/2015; 96(5). DOI:10.1016/j.ajhg.2015.03.007 · 10.99 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Background: We sought to characterize the landscape of structural variation associated with the subset of congenital cardiac defects characterized by left-sided obstruction. Methods: Cases with left-sided cardiac defects (LSCD) and pediatric controls were uniformly genotyped and assessed for copy number variant (CNV) calls. Significance testing was performed to ascertain differences in overall CNV incidence, and for CNV enrichment of specific genes and gene functions in LSCD cases relative to controls. Results: A total of 257 cases of European descent and 962 ethnically matched, disease-free pediatric controls were included. Although there was no difference in CNV rate between cases and controls, a significant enrichment in rare LSCD CNVs was detected overall (p = 7.30 × 10−3, case/control ratio = 1.26) and when restricted either to deletions (p = 7.58 × 10−3, case/control ratio = 1.20) or duplications (3.02 × 10−3, case/control ratio = 1.43). Neither gene-based, functional nor knowledge-based analyses identified genes, loci or pathways that were significantly enriched in cases as compared to controls when appropriate corrections for multiple tests were applied. However, several genes of interest were identified by virtue of their association with cardiac development, known human conditions, or reported disruption by CNVs in other patient cohorts. Conclusion: This study examines the largest cohort to date with LSCD for structural variation. These data suggest that CNVs play a role in disease risk and identify numerous genes disrupted by CNVs of potential disease relevance. These findings further highlight the genetic heterogeneity and complexity of these disorders. Birth Defects Research (Part A), 2014. © 2014 Wiley Periodicals, Inc.
    Birth Defects Research Part A Clinical and Molecular Teratology 12/2014; 100(12). DOI:10.1002/bdra.23279 · 2.21 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Copy number variation (CNV) is a major genetic polymorphism contributing to genetic diversity and human evolution. Clinical application of CNVs for diagnostic purposes largely depends on sufficient population CNV data for accurate interpretation. CNVs from general population in currently available databases help classify CNVs of uncertain clinical significance, and benign CNVs. Earlier studies of CNV distribution in several populations worldwide showed that a significant fraction of CNVs are population specific. In this study, we characterized and analyzed CNVs in 3,017 unrelated Thai individuals genotyped with the Illumina Human610, Illumina HumanOmniexpress, or Illumina HapMap550v3 platform. We employed hidden Markov model and circular binary segmentation methods to identify CNVs, extracted 23,458 CNVs consistently identified by both algorithms, and cataloged these high confident CNVs into our publicly available Thai CNV database. Analysis of CNVs in the Thai population identified a median of eight autosomal CNVs per individual. Most CNVs (96.73%) did not overlap with any known chromosomal imbalance syndromes documented in the DECIPHER database. When compared with CNVs in the 11 HapMap3 populations, CNVs found in the Thai population shared several characteristics with CNVs characterized in HapMap3. Common CNVs in Thais had similar frequencies to those in the HapMap3 populations, and all high frequency CNVs (>20%) found in Thai individuals could also be identified in HapMap3. The majorities of CNVs discovered in the Thai population, however, were of low frequency, or uniquely identified in Thais. When performing hierarchical clustering using CNV frequencies, the CNV data were clustered into Africans, Europeans, and Asians, in line with the clustering performed with single nucleotide polymorphism (SNP) data. As CNV data are specific to origin of population, our population-specific reference database will serve as a valuable addition to the existing resources for the investigation of clinical significance of CNVs in Thais and related ethnicities.
    PLoS ONE 08/2014; 9(8):e104355. DOI:10.1371/journal.pone.0104355 · 3.53 Impact Factor