Bainbridge, M.N. et al. Analysis of the prostate cancer cell line LNCaP transcriptome using a sequencing-by-synthesis approach. BMC Genomics 7, 246

British Columbia Cancer Agency (BCCA) Genome Sciences Centre, Vancouver, British Columbia, Canada. .
BMC Genomics (Impact Factor: 3.99). 02/2006; 7(1):246. DOI: 10.1186/1471-2164-7-246
Source: PubMed


High throughput sequencing-by-synthesis is an emerging technology that allows the rapid production of millions of bases of data. Although the sequence reads are short, they can readily be used for re-sequencing. By re-sequencing the mRNA products of a cell, one may rapidly discover polymorphisms and splice variants particular to that cell.
We present the utility of massively parallel sequencing by synthesis for profiling the transcriptome of a human prostate cancer cell-line, LNCaP, that has been treated with the synthetic androgen, R1881. Through the generation of approximately 20 megabases (MB) of EST data, we detect transcription from over 10,000 gene loci, 25 previously undescribed alternative splicing events involving known exons, and over 1,500 high quality single nucleotide discrepancies with the reference human sequence. Further, we map nearly 10,000 ESTs to positions on the genome where no transcription is currently predicted to occur. We also characterize various obstacles with using sequencing by synthesis for transcriptome analysis and propose solutions to these problems.
The use of high-throughput sequencing-by-synthesis methods for transcript profiling allows the specific and sensitive detection of many of a cell's transcripts, and also allows the discovery of high quality base discrepancies, and alternative splice variants. Thus, this technology may provide an effective means of understanding various disease states, discovering novel targets for disease treatment, and discovery of novel transcripts.

Download full-text


Available from: René L Warren,
  • Source
    • "The advent of RNA-Seq [1] and dramatic decrease in next-generation sequencing costs have led to numerous RNA-Seq studies in recent years. This revolutionary technique has enabled digital transcriptome profiling at unprecedented resolution that avoids many of the limitations inherent to the analog nature of microarray technology [2,3]. "
    [Show abstract] [Hide abstract]
    ABSTRACT: RNA-Sequencing (RNA-Seq) experiments have been optimized for library preparation, mapping, and gene expression estimation. These methods, however, have revealed weaknesses in the next stages of analysis of differential expression, with results sensitive to systematic sample stratification or, in more extreme cases, to outliers. Further, a method to assess normalization and adjustment measures imposed on the data is lacking. To address these issues, we utilize previously published eQTLs as a novel gold standard at the center of a framework that integrates DNA genotypes and RNA-Seq data to optimize analysis and aid in the understanding of genetic variation and gene expression. After detecting sample contamination and sequencing outliers in RNA-Seq data, a set of previously published brain eQTLs was used to determine if sample outlier removal was appropriate. Improved replication of known eQTLs supported removal of these samples in downstream analyses. eQTL replication was further employed to assess normalization methods, covariate inclusion, and gene annotation. This method was validated in an independent RNA-Seq blood data set from the GTEx project and a tissue-appropriate set of eQTLs. eQTL replication in both data sets highlights the necessity of accounting for unknown covariates in RNA-Seq data analysis. As each RNA-Seq experiment is unique with its own experiment-specific limitations, we offer an easily-implementable method that uses the replication of known eQTLs to guide each step in one's data analysis pipeline. In the two data sets presented herein, we highlight not only the necessity of careful outlier detection but also the need to account for unknown covariates in RNA-Seq experiments.
    BMC Genomics 12/2013; 14(1):892. DOI:10.1186/1471-2164-14-892 · 3.99 Impact Factor
  • Source
    • "WGS and WES sequencing have been augmented by cDNA sequencing (referred to as RNA-seq) to explore alterations to the transcriptome. RNA-seq provides not only gene expression levels, but also aberrant splicing, chimeric gene fusion transcripts characteristic of cancer cells and expressed somatic mutations (Bainbridge et al. 2006; Dong et al. 2009; Maher et al. 2009; Shah et al. 2009; Berger et al. 2010; Tuch et al. 2010; Wang et al. 2012). Analysis of chromatin modification is in its infancy as applied to the cancer cell, but the recent reporting of the ENCODE Project Consortium's genome-wide results (The ENCODE Project Consortium 2012) may provide the tools and technologies to enable new approaches. "
    [Show abstract] [Hide abstract]
    ABSTRACT: The realization that cancer progression required the participation of cellular genes provided one of several key rationales, in 1986, for embarking on the human genome project. Only with a reference genome sequence could the full spectrum of somatic changes leading to cancer be understood. Since its completion in 2003, the human reference genome sequence has fulfilled its promise as a foundational tool to illuminate the pathogenesis of cancer. Herein, we review the key historical milestones in cancer genomics since the completion of the genome, and some of the novel discoveries that are shaping our current understanding of cancer.
    Genome Research 07/2013; 23(7):1054-62. DOI:10.1101/gr.157602.113 · 14.63 Impact Factor
  • Source
    • "ng attached to the beads and then read ; as reads start close to the bead and stop after a few hundred bases , this could account for the observed greater coverage of the extremities of the segments . That deeper coverage could be noticed at both cDNA ends for most genes , and analogous conclusions have been drawn independently by others ( e . g . Bainbridge et al . , 2006 ) . There was also a tendency towards even deeper coverage of the 3 0 than the 5 0 end of the cDNA , especially in the case of the NP genes which showed relatively shallow coverage of their 5 0 cDNA ends . The mechanism accounting for this difference between the two cDNA ends is uncertain , and might be related to sequence composition ."
    [Show abstract] [Hide abstract]
    ABSTRACT: We studied the sub-population level evolution of a duck influenza A virus isolate during passage in swine tracheal cells. The complete genomes of the A/mallard/Netherlands/10-Nmkt/1999 strain and its swine cell-passaged descendent were analysed by 454 pyrosequencing with coverage depth ranging from several hundred to several thousand reads at any point. This allowed characterization of defined minority sub-populations of gene segments 2, 3, 4, 5, 7, and 8 present in the original isolate. These minority sub-populations ranged between 9.5% (for segment 2) and 46% (for segment 4) of their respective gene segments in the parental stock. They were likely contributed by one or more viruses circulating within the same area, at the same period and in the same or a sympatric host species. The minority sub-populations of segments 3, 4, and 5 became extinct upon viral passage in swine cells, whereas the minority sub-populations of segments 2, 7 and 8 completely replaced their majority counterparts. The swine cell-passaged virus was therefore a three-segment reassortant and also harboured point mutations in segments 3 and 4. The adapted virus was more homogenous than the parental stock, with only 17 minority single nucleotide polymorphisms present above 5% frequency across the whole genome. Though limited here to one sample, this deep sequencing approach highlights the evolutionary versatility of influenza viruses whereby they exploit their genetic diversity, predilection for mixed infection and reassortment to adapt to a new host environmental niche.
    Infection, genetics and evolution: journal of molecular epidemiology and evolutionary genetics in infectious diseases 05/2013; 18. DOI:10.1016/j.meegid.2013.04.034 · 3.02 Impact Factor
Show more