Reanalysis of RNA-Sequencing Data Reveals Several Additional Fusion Genes with Multiple Isoforms

The Institute of Cancer Research, London, United Kingdom
PLoS ONE (Impact Factor: 3.53). 10/2012; 7(10):e48745. DOI: 10.1371/journal.pone.0048745
Source: PubMed

ABSTRACT RNA-sequencing and tailored bioinformatic methodologies have paved the way for identification of expressed fusion genes from the chaotic genomes of solid tumors. We have recently successfully exploited RNA-sequencing for the discovery of 24 novel fusion genes in breast cancer. Here, we demonstrate the importance of continuous optimization of the bioinformatic methodology for this purpose, and report the discovery and experimental validation of 13 additional fusion genes from the same samples. Integration of copy number profiling with the RNA-sequencing results revealed that the majority of the gene fusions were promoter-donating events that occurred at copy number transition points or involved high-level DNA-amplifications. Sequencing of genomic fusion break points confirmed that DNA-level rearrangements underlie selected fusion transcripts. Furthermore, a significant portion (>60%) of the fusion genes were alternatively spliced. This illustrates the importance of reanalyzing sequencing data as gene definitions change and bioinformatic methods improve, and highlights the previously unforeseen isoform diversity among fusion transcripts.

  • [Show abstract] [Hide abstract]
    ABSTRACT: Cancer-specific fusion genes are often caused by cytogenetically visible chromosomal rearrangements such as translocations, inversions, deletions or insertions, they can be the targets of molecular therapy, they play a key role in the accurate diagnosis and classification of neoplasms, and they are of prognostic impact. The identification of novel fusion genes in various neoplasms therefore not only has obvious research importance, but is also potentially of major clinical significance. The "traditional" methodology to detect them began with cytogenetic analysis to find the chromosomal rearrangement, followed by utilization of fluorescence in situ hybridization techniques to find the probe which spans the chromosomal breakpoint, and finally molecular cloning to localize the breakpoint more precisely and identify the genes fused by the chromosomal rearrangement. Although laborious, the above-mentioned sequential approach is robust and reliable and a number of fusion genes have been cloned by such means. Next generation sequencing (NGS), mainly RNA sequencing (RNA-Seq), has opened up new possibilities to detect fusion genes even when cytogenetic aberrations are cryptic or information about them is unknown. However, NGS suffers from the shortcoming of identifying as "fusion genes" also many technical, biological and, perhaps in particular, clinical "false positives," thus making the assessment of which fusions are important and which are noise extremely difficult. The best way to overcome this risk of information overflow is, whenever reliable cytogenetic information is at hand, to compare karyotyping and sequencing data and concentrate exclusively on those suggested fusion genes that are found in chromosomal breakpoints.
    The International Journal of Biochemistry & Cell Biology 05/2014; 53. DOI:10.1016/j.biocel.2014.05.018 · 4.24 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Studies of gene rearrangements and the consequent oncogenic fusion proteins have laid the foundation for targeted cancer therapy. To identify oncogenic fusions associated with glioma progression, we catalogued fusion transcripts by RNA-seq of 272 WHO grade II, III, and IV gliomas. Fusion transcripts were more frequently found in high grade gliomas, in the classical subtype of gliomas, and in gliomas treated with radiation / temozolomide. 67 in-frame fusion transcripts were identified, including three recurrent fusion transcripts: FGFR3-TACC3, RNF213-SLC26A11, and PTPRZ1-MET (ZM). Interestingly, the ZM fusion was found only in grade III astrocytomas (1/13; 7.7%) or secondary GBMs (sGBMs, 3/20; 15.0%). In an independent cohort of sGBMs, the ZM fusion was found in 3 of 20 (15%) specimens. The fusion was additionally detected in 3 of 19 (16%) glioblastoma cell lines, confirming the recurrent nature of this transcript. Genomic analysis revealed that the fusion arose from translocation events involving introns 3 or 8 of PTPRZ and intron 1 of MET. ZM fusion transcripts were found in GBMs irrespective of isocitrate dehydrogenase 1 (IDH1) mutation status. sGBMs harboring ZM fusion showed higher expression of genes required for PIK3CA signaling and lowered expression of genes that suppressed RB1 or TP53 function. Expression of the ZM fusion was mutually exclusive with EGFR over-expression in sGBMs. Exogenous expression of the ZM fusion in the U87MG glioblastoma line enhanced cell-migration and invasion. Clinically, patients afflicted with ZM fusion harboring glioblastomas survived poorly relative to those afflicted with non-ZM harboring sGBMs (p<0.001). Our study profiles the shifting RNA landscape of gliomas during progression and reveled ZM as a novel, recurrent fusion transcript in sGBMs.
    Genome Research 08/2014; 24(11). DOI:10.1101/gr.165126.113 · 13.85 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Multicystic mesothelioma is a rare disease of unknown etiology and pathogenesis. Nothing has been known about the cytogenetic and molecular genetic features of these tumors. Here we present the first cytogenetically analyzed multicystic mesothelioma with the karyotype 46,XX,t(7;17)(p13;q23),t(8;11)(q23;p13). RNA-sequencing showed that the t(7;17)(p13;q23) generated a chimeric TNS3-MAP3K3 gene, which codes for a chimeric protein kinase, as well as the reciprocal MAP3K3-TNS3 in which the region of TNS3 coding for the SH2_Tensin_like region and the tensin phosphotyrosine-binding domain is under the control of the MAP3K3 promoter. The other translocation, t(8;11)(q23;p13), generated a chimeric ZFPM2-ELF5 gene which codes for a chimeric transcription factor in which the first 40 amino acids of ELF5 are replaced by the first 100 amino acid of ZFPM2. RT-PCR together with Sanger sequencing verified the presence of the above-mentioned fusion transcripts. The finding of acquired clonal chromosome abnormalities in cells cultured from the lesion and the presence of the TNS3-MAP3K3 chimeric protein kinase and the ZFPM2-ELF5 chimeric transcription factor confirm the neoplastic nature of multicystic mesothelioma. Copyright © 2014. Published by Elsevier Ireland Ltd.
    Cancer Letters 12/2014; 357(2). DOI:10.1016/j.canlet.2014.12.002 · 5.02 Impact Factor