Article

KC-SMARTR: An R package for detection of statistically significant aberrations in multi-experiment aCGH data

BMC Research Notes 01/2010; DOI:http://www.doaj.org/doaj?func=openurl&genre=article&issn=17560500&date=2010&volume=3&issue=1&spage=298
Source: DOAJ

ABSTRACT Abstract Background Most approaches used to find recurrent or differential DNA Copy Number Alterations (CNA) in array Comparative Genomic Hybridization (aCGH) data from groups of tumour samples depend on the discretization of the aCGH data to gain, loss or no-change states. This causes loss of valuable biological information in tumour samples, which are frequently heterogeneous. We have previously developed an algorithm, KC-SMART, that bases its estimate of the magnitude of the CNA at a given genomic location on kernel convolution (Klijn et al., 2008). This accounts for the intensity of the probe signal, its local genomic environment and the signal distribution across multiple samples. Results Here we extend the approach to allow comparative analyses of two groups of samples and introduce the R implementation of these two approaches. The comparative module allows for a supervised analysis to be performed, to enable the identification of regions that are differentially aberrated between two user-defined classes. We analyzed data from a series of B- and T-cell lymphomas and were able to retrieve all positive control regions (VDJ regions) in addition to a number of new regions. A t-test employing segmented data, that we implemented, was also able to locate all the positive control regions and a number of new regions but these regions were highly fragmented. Conclusions KC-SMARTR offers recurrent CNA and class specific CNA detection, at different genomic scales, in a single package without the need for additional segmentation. It is memory efficient and runs on a wide range of machines. Most importantly, it does not rely on data discretization and therefore maximally exploits the biological information in the aCGH data. The program is freely available from the Bioconductor website http://www.bioconductor.org/ under the terms of the GNU General Public License.

0 0
 · 
0 Bookmarks
 · 
46 Views
  • Article: The hallmarks of cancer.
    Cell 02/2000; 100(1):57-70. · 32.40 Impact Factor
  • Source
    Article: High resolution array-CGH analysis of single cells.
    [show abstract] [hide abstract]
    ABSTRACT: Heterogeneity in the genome copy number of tissues is of particular importance in solid tumor biology. Furthermore, many clinical applications such as pre-implantation and non-invasive prenatal diagnosis would benefit from the ability to characterize individual single cells. As the amount of DNA from single cells is so small, several PCR protocols have been developed in an attempt to achieve unbiased amplification. Many of these approaches are suitable for subsequent cytogenetic analyses using conventional methodologies such as comparative genomic hybridization (CGH) to metaphase spreads. However, attempts to harness array-CGH for single-cell analysis to provide improved resolution have been disappointing. Here we describe a strategy that combines single-cell amplification using GenomePlex library technology (GenomePlex) Single Cell Whole Genome Amplification Kit, Sigma-Aldrich, UK) and detailed analysis of genomic copy number changes by high-resolution array-CGH. We show that single copy changes as small as 8.3 Mb in single cells are detected reliably with single cells derived from various tumor cell lines as well as patients presenting with trisomy 21 and Prader-Willi syndrome. Our results demonstrate the potential of this technology for studies of tumor biology and for clinical diagnostics.
    Nucleic Acids Research 02/2007; 35(3):e15. · 8.03 Impact Factor
  • Source
    Article: Identification of cancer genes using a statistical framework for multiexperiment analysis of nondiscretized array CGH data.
    [show abstract] [hide abstract]
    ABSTRACT: Tumor formation is in part driven by DNA copy number alterations (CNAs), which can be measured using microarray-based Comparative Genomic Hybridization (aCGH). Multiexperiment analysis of aCGH data from tumors allows discovery of recurrent CNAs that are potentially causal to cancer development. Until now, multiexperiment aCGH data analysis has been dependent on discretization of measurement data to a gain, loss or no-change state. Valuable biological information is lost when a heterogeneous system such as a solid tumor is reduced to these states. We have developed a new approach which inputs nondiscretized aCGH data to identify regions that are significantly aberrant across an entire tumor set. Our method is based on kernel regression and accounts for the strength of a probe's signal, its local genomic environment and the signal distribution across multiple tumors. In an analysis of 89 human breast tumors, our method showed enrichment for known cancer genes in the detected regions and identified aberrations that are strongly associated with breast cancer subtypes and clinical parameters. Furthermore, we identified 18 recurrent aberrant regions in a new dataset of 19 p53-deficient mouse mammary tumors. These regions, combined with gene expression microarray data, point to known cancer genes and novel candidate cancer genes.
    Nucleic Acids Research 03/2008; 36(2):e13. · 8.03 Impact Factor

Full-text

View
4 Downloads
Available from
18 Sep 2012

Keywords

aCGH data
 
array Comparative Genomic Hybridization
 
class specific CNA detection
 
comparative analyses
 
data discretization
 
different genomic scales
 
differentially aberrated
 
given genomic location
 
GNU General Public License
 
local genomic environment
 
positive control regions
 
probe signal
 
segmented data
 
single package
 
supervised analysis
 
T-cell lymphomas
 
two approaches
 
user-defined classes
 
valuable biological information
 
wide range