CNV Workshop: An integrated platform for high-throughput copy number variation discovery and clinical diagnostics

Center for Biomedical Informatics, The Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA.
BMC Bioinformatics (Impact Factor: 2.58). 02/2010; 11(1):74. DOI: 10.1186/1471-2105-11-74
Source: PubMed


Recent studies have shown that copy number variations (CNVs) are frequent in higher eukaryotes and associated with a substantial portion of inherited and acquired risk for various human diseases. The increasing availability of high-resolution genome surveillance platforms provides opportunity for rapidly assessing research and clinical samples for CNV content, as well as for determining the potential pathogenicity of identified variants. However, few informatics tools for accurate and efficient CNV detection and assessment currently exist.
We developed a suite of software tools and resources (CNV Workshop) for automated, genome-wide CNV detection from a variety of SNP array platforms. CNV Workshop includes three major components: detection, annotation, and presentation of structural variants from genome array data. CNV detection utilizes a robust and genotype-specific extension of the Circular Binary Segmentation algorithm, and the use of additional detection algorithms is supported. Predicted CNVs are captured in a MySQL database that supports cohort-based projects and incorporates a secure user authentication layer and user/admin roles. To assist with determination of pathogenicity, detected CNVs are also annotated automatically for gene content, known disease loci, and gene-based literature references. Results are easily queried, sorted, filtered, and visualized via a web-based presentation layer that includes a GBrowse-based graphical representation of CNV content and relevant public data, integration with the UCSC Genome Browser, and tabular displays of genomic attributes for each CNV.
To our knowledge, CNV Workshop represents the first cohesive and convenient platform for detection, annotation, and assessment of the biological and clinical significance of structural variants. CNV Workshop has been successfully utilized for assessment of genomic variation in healthy individuals and disease cohorts and is an ideal platform for coordinating multiple associated projects.
Available on the web at:

Download full-text


Available from: Peter S White
  • Source
    • "The presence of copy number variations (CNVs) (duplications, deletions) might alter gene dosage in KS. CNVs have a clinical impact on the risk for a variety of human diseases (Gai et al., 2010), including male infertility (Krausz et al., 2012), but no study has been performed on the X chromosome of KS, including their possible involvement in the clinical variability of KS. "
    [Show abstract] [Hide abstract]
    ABSTRACT: The Klinefelter syndrome (KS) is the most frequent sex chromosomal disorder in males, characterized by at least one supernumerary X chromosome (most frequent karyotype 47,XXY). This syndrome presents with a broad range of phenotypes. The common characteristics include small testes and infertility, but KS subjects are at increased risk of hypogonadism, cognitive dysfunction, obesity, diabetes, metabolic syndrome, osteoporosis, and autoimmune disorders, which are present in variable proportion. Although part of the clinical variability might be linked to a different degree of testicular function observed in KS patients, genetic mechanisms of the supernumerary X chromosome might contribute. Gene-dosage effects and parental origin of the supernumerary X chromosome have been suggested to this regard. No study has been performed analyzing the genetic constitution of the X chromosome in terms of copy number variations (CNVs) and their possible involvement in phenotype of KS. To this aim, we performed a SNP arrays analysis on 94 KS and 85 controls. We found that KS subjects have more frequently than controls X-linked CNVs (39/94, [41.5%] with respect to 12/42, [28.6%] of females, and 8/43, [18.6%] of males, p < 0.01). The number of X-linked CNVs in KS patients was 4.58 ± 1.92 CNVs/subject, significantly higher with respect to that found in control females (1.50 ± 1.29 CNVs/subject) and males (1.14 ± 0.37 CNVs/subject). Importantly, 94.4% X-linked CNVs in KS subjects were duplications, higher with respect to control males (50.0%, p < 0.001) and females (83.3%, p = 0.1). Half of the X-linked CNVs fell within regions encompassing genes and most of them (90%) included genes escaping X-inactivation in the regions of X-Y homology, particularly in the pseudoautosomal region 1 (PAR1) and Xq21.31. This study described for the first time the genetic properties of the X chromosome in KS and suggests that X-linked CNVs (especially duplications) might contribute to the clinical phenotype.
    Full-text · Article · Jan 2016 · Andrology
  • Source
    • "tched , disease - free children was used as a healthy control group . All case and control samples were genotyped using the Illumina HumanHap550K BeadChip and a single consistent protocol . Genotype data were uni - formly analyzed for CNVs using Illumina ' s GenomeStudio software in combination with CNV Workshop and PennCNV ( Wang et al . , 2007 ; Gai et al . , 2010 ) ."
    [Show abstract] [Hide abstract]
    ABSTRACT: Background: We sought to characterize the landscape of structural variation associated with the subset of congenital cardiac defects characterized by left-sided obstruction. Methods: Cases with left-sided cardiac defects (LSCD) and pediatric controls were uniformly genotyped and assessed for copy number variant (CNV) calls. Significance testing was performed to ascertain differences in overall CNV incidence, and for CNV enrichment of specific genes and gene functions in LSCD cases relative to controls. Results: A total of 257 cases of European descent and 962 ethnically matched, disease-free pediatric controls were included. Although there was no difference in CNV rate between cases and controls, a significant enrichment in rare LSCD CNVs was detected overall (p=7.30 × 10(-3) , case/control ratio=1.26) and when restricted either to deletions (p=7.58 × 10(-3) , case/control ratio=1.20) or duplications (3.02 × 10(-3) , case/control ratio=1.43). Neither gene-based, functional nor knowledge-based analyses identified genes, loci or pathways that were significantly enriched in cases as compared to controls when appropriate corrections for multiple tests were applied. However, several genes of interest were identified by virtue of their association with cardiac development, known human conditions, or reported disruption by CNVs in other patient cohorts. Conclusion: This study examines the largest cohort to date with LSCD for structural variation. These data suggest that CNVs play a role in disease risk and identify numerous genes disrupted by CNVs of potential disease relevance. These findings further highlight the genetic heterogeneity and complexity of these disorders.
    Full-text · Article · Dec 2014 · Birth Defects Research Part A Clinical and Molecular Teratology
  • Source
    • "In this study, we combined the genomics data generated from multiple genome-wide association studies (GWAS) consisting of 3,017 unrelated Thai subjects with no undiagnosed genetic disorders. We carried out CNV discovery from these dataset using the two commonly used CNV calling algorithms, PennCNV [13] and CNV Workshop [14], to identify the most accurate set of CNVs, and put together the first large reference CNV database for Thais. Furthermore, we performed population Copy Number Variation Region (CNVR) frequency comparison between Thais and 11 HapMap3 populations, and identified unique CNVRs in Thais as well as CNVs overlapping with genes associated with Thai population. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Copy number variation (CNV) is a major genetic polymorphism contributing to genetic diversity and human evolution. Clinical application of CNVs for diagnostic purposes largely depends on sufficient population CNV data for accurate interpretation. CNVs from general population in currently available databases help classify CNVs of uncertain clinical significance, and benign CNVs. Earlier studies of CNV distribution in several populations worldwide showed that a significant fraction of CNVs are population specific. In this study, we characterized and analyzed CNVs in 3,017 unrelated Thai individuals genotyped with the Illumina Human610, Illumina HumanOmniexpress, or Illumina HapMap550v3 platform. We employed hidden Markov model and circular binary segmentation methods to identify CNVs, extracted 23,458 CNVs consistently identified by both algorithms, and cataloged these high confident CNVs into our publicly available Thai CNV database. Analysis of CNVs in the Thai population identified a median of eight autosomal CNVs per individual. Most CNVs (96.73%) did not overlap with any known chromosomal imbalance syndromes documented in the DECIPHER database. When compared with CNVs in the 11 HapMap3 populations, CNVs found in the Thai population shared several characteristics with CNVs characterized in HapMap3. Common CNVs in Thais had similar frequencies to those in the HapMap3 populations, and all high frequency CNVs (>20%) found in Thai individuals could also be identified in HapMap3. The majorities of CNVs discovered in the Thai population, however, were of low frequency, or uniquely identified in Thais. When performing hierarchical clustering using CNV frequencies, the CNV data were clustered into Africans, Europeans, and Asians, in line with the clustering performed with single nucleotide polymorphism (SNP) data. As CNV data are specific to origin of population, our population-specific reference database will serve as a valuable addition to the existing resources for the investigation of clinical significance of CNVs in Thais and related ethnicities.
    Full-text · Article · Aug 2014 · PLoS ONE
Show more