CREST maps somatic structural variation in cancer genomes with base-pair resolution

Department of Information Sciences, St. Jude Children's Research Hospital, Memphis, Tennessee, USA.
Nature Methods (Impact Factor: 25.95). 06/2011; 8(8):652-4. DOI: 10.1038/nmeth.1628
Source: PubMed

ABSTRACT We developed 'clipping reveals structure' (CREST), an algorithm that uses next-generation sequencing reads with partial alignments to a reference genome to directly map structural variations at the nucleotide level of resolution. Application of CREST to whole-genome sequencing data from five pediatric T-lineage acute lymphoblastic leukemias (T-ALLs) and a human melanoma cell line, COLO-829, identified 160 somatic structural variations. Experimental validation exceeded 80%, demonstrating that CREST had a high predictive accuracy.

Download full-text


Available from: Linda Holmfeldt, Jun 27, 2015
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Motivation Methods for detecting somatic genome rearrangements in tumours using next generation sequencing are vital in cancer genomics. Available algorithms use one or more sources of evidence, such as read depth, paired end reads or split reads to predict structural variants. However, the problem remains challenging due to the significant computational burden, and high false positive or false negative rates.Results In this paper we present Socrates, a highly efficient and effective method for detecting genomic rearrangements in tumours that utilises only split-read data. Socrates has single nucleotide resolution, identifies micro-homologies and untemplated sequence at breakpoints, has very high sensitivity and high specificity, and takes advantage of parallelism for efficient use of resources. We demonstrate using simulated and real data that Socrates performs well compared to a number of existing SV detection tools.Availability Socrates is released as open source and available from
    Bioinformatics 01/2014; DOI:10.1093/bioinformatics/btt767 · 4.62 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The opportunistic fungal pathogen Cryptococcus neoformans is a leading cause of mortality amongst the HIV/AIDS population, and is known for frequently causing life-threatening relapse. To investigate the potential contribution of in-host microevolution to persistence and relapse we have analyzed two serial isolates obtained from an AIDS patient who suffered an initial and relapse episode of cryptococcal meningoencephalitis. Despite being identical by multilocus sequence typing, the isolates differ phenotypically, exhibiting changes in key virulence factors, nutrient acquisition, metabolic profiles and ability to disseminate in an animal model. Whole genome sequencing uncovered a clonal relationship, with only a few unique differences. Of these, two key changes are expected to explain the phenotypic differences observed in the relapse isolate: loss of a predicted AT-rich interaction domain protein, and changes in copy number of the left and right arms of chromosome 12. Gene deletion of the predicted transcriptional regulator produced changes in melanin, capsule, carbon source utilization and dissemination in the host, consistent with the phenotype of the relapse isolate. In addition, the deletion mutant displayed altered virulence in the murine model. The observed differences suggest the relapse isolate evolved subsequent to penetration of the central nervous system and may have gained dominance following the administration of antifungal therapy. These data reveal the first molecular insights into how the Cryptococcus neoformans genome changes during infection of humans and the manner in which microevolution progresses in this deadly fungal pathogen.
    G3-Genes Genomes Genetics 03/2013; 3(4). DOI:10.1534/g3.113.005660 · 2.51 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Insertional mutagenesis from virus infection is an important pathogenic risk for the development of cancer. Despite the advent of high-throughput sequencing, discovery of viral integration sites and expressed viral fusion events are still limited. Here, we present ViralFusionSeq (VFS), which combines soft-clipping information, read-pair analysis, and targeted de novo assembly to discover and annotate viral-human fusions. VFS was used in an RNA-Seq experiment, simulated DNA-Seq experiment and re-analysis of published DNA-Seq datasets. Our experiments demonstrated that VFS is both sensitive and highly accurate. AVAILABILITY: VFS is distributed under GPL version 3 at CONTACT: SUPPLEMENTARY INFORMATION: Supplementary information is available at Bioinformatics Online.
    Bioinformatics 01/2013; 29(5):649-651. DOI:10.1093/bioinformatics/btt011 · 4.62 Impact Factor