Article

Partitioning of copy-number genotypes in pedigrees

Montreal Heart Institute Research Center, Montréal, Canada.
BMC Bioinformatics (Impact Factor: 2.67). 05/2010; 11:226. DOI: 10.1186/1471-2105-11-226
Source: PubMed

ABSTRACT Copy number variations (CNVs) and polymorphisms (CNPs) have only recently gained the genetic community's attention. Conservative estimates have shown that CNVs and CNPs might affect more than 10% of the genome and that they may be at least as important as single nucleotide polymorphisms in assessing human variability. Widely used tools for CNP analysis have been implemented in Birdsuite and PLINK for the purpose of conducting genetic association studies based on the unpartitioned total number of CNP copies provided by the intensities from Affymetrix's Genome-Wide Human SNP Array. Here, we are interested in partitioning copy number variations and polymorphisms in extended pedigrees for the purpose of linkage analysis on familial data.
We have developed CNGen, a new software tool for partitioning copy number polymorphisms using the integrated genotypes from Birdsuite with the Affymetrix platform. Applied to familial trios or extended pedigrees, the algorithm can produce partitioned copy number genotypes with distinct parental alleles. We have validated the algorithm using simulations on a complex pedigree structure, with frequencies calculated from a real dataset of 300 genotyped samples from 42 pedigrees segregating a congenital heart defect phenotype.
CNGen is the first published software for the partitioning of copy number genotypes in pedigrees, making possible the use of CNPs and CNVs for linkage analysis. It was implemented with the Python interpreter version 2.5.2 and was successfully tested on current Linux, Windows and Mac OS workstations.
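To make the idea of partitioning concrete, the following is a minimal sketch for a single trio. It is not CNGen's algorithm or code; it only assumes that each individual's total copy number is the sum of two allelic copy counts and that the child receives one allele from each parent. The function names `partitions` and `trio_partitions` are hypothetical, chosen for illustration.

```python
def partitions(total):
    """All unordered allele pairs (a, b) with a <= b and a + b == total."""
    return [(a, total - a) for a in range(total // 2 + 1)]


def trio_partitions(father_total, mother_total, child_total):
    """Enumerate partitioned genotypes consistent with Mendelian transmission.

    Illustrative assumption: no de novo events and no genotyping error.
    Each result is (father, mother, child), where the child genotype is
    ordered as (paternal allele, maternal allele).
    """
    results = set()
    for father in partitions(father_total):
        for mother in partitions(mother_total):
            # The child inherits one allele from each parent.
            for paternal in set(father):
                for maternal in set(mother):
                    if paternal + maternal == child_total:
                        results.add((father, mother, (paternal, maternal)))
    return sorted(results)


# A child with 4 copies whose parents each carry 2 copies in total
# can only be explained if both parents are 0/2 and transmit the 2-copy allele.
print(trio_partitions(2, 2, 4))
```

When the totals leave more than one consistent assignment, as with `trio_partitions(2, 3, 3)`, a trio alone does not resolve the partition. Additional relatives in an extended pedigree, or allele frequencies, would be needed to narrow it down.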

Available from: Marie-Pierre Dubé, Dec 12, 2014
  • ABSTRACT: Objectives: Ventricular septal defect (VSD) is the most common congenital heart disease (CHD). Genome-wide linkage analysis revealed a potential CHD susceptibility locus in the homeodomain leucine zipper-encoding (HOMEZ) gene in a South Indian population. The present study aimed to identify potential pathogenic mutations in HOMEZ and to provide insights into the etiology of isolated VSD in the Chinese population. Methods: Case-control mutational analysis was performed in 400 patients with isolated VSD and 400 healthy controls. The protein-coding exons of HOMEZ and their flanking sequences were amplified by polymerase chain reaction and sequenced on an ABI3730 Automated Sequencer. CLC workbench software was used to compare the conservation of the HOMEZ protein across multiple species. The ExPASy-ProtScale online tool was used to predict the alignment of the hydrophobic features. Results: Two novel heterozygous missense mutations (c.116C>T; c.630T>A) were identified in HOMEZ gene exon-2. The two mutations lead to an alanine to valine substitution at position 39 and a serine to arginine substitution at position 210, both of which are highly conserved among many species. The hydropathicity of the valine and arginine residues at positions 39 and 210 was significantly different from that of the wild type. Conclusions: We have identified two novel heterozygous missense mutations in HOMEZ gene exon-2 in isolated VSD patients in the Chinese population and have found that these two mutations resulted in alteration of the hydropathicity of the HOMEZ protein. Therefore, the two missense mutations of the HOMEZ gene are directly linked with the etiology of isolated VSD in the Chinese population.
    Genetic Testing and Molecular Biomarkers 04/2013; DOI:10.1089/gtmb.2012.0435 · 1.15 Impact Factor
  • ABSTRACT: DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk for a variety of human diseases, highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within CNV regions. We have developed a novel computational method, called PiCNV, which resolves the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm can i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge, this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV unambiguously phased normal and CNV-carrying haplotypes and followed their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified the emergence of several de novo deletions and duplications in the offspring.
    PLoS ONE 04/2015; 10(4):e0122713. DOI:10.1371/journal.pone.0122713 · 3.53 Impact Factor