The First Insight into the Tissue Specific Taxus Transcriptome via Illumina Second Generation Sequencing

Biotechnology Institute, Dalian Jiaotong University, Dalian, China.
PLoS ONE (Impact Factor: 3.23). 06/2011; 6(6):e21220. DOI: 10.1371/journal.pone.0021220
Source: PubMed


Illumina second generation sequencing is now an efficient route for generating enormous sequence collections that represent expressed genes and quantitate expression level. Taxus is a world-wide endangered gymnosperm genus and forms an important anti-cancer medicinal resource, but the large and complex genomes of Taxus have hindered the development of genomic resources. The research of its tissue-specific transcriptome is absent. There is also no study concerning the association between the plant transcriptome and metabolome with respect to the plant tissue type.
We performed the de novo assembly of Taxus mairei transcriptome using Illumina paired-end sequencing technology. In a single run, we produced 13,737,528 sequencing reads corresponding to 2.03 Gb total nucleotides. These reads were assembled into 36,493 unique sequences. Based on similarity search with known proteins, 23,515 Unigenes were identified to have the Blast hit with a cut-off E-value above 10⁻⁵. Furthermore, we investigated the transcriptome difference of three Taxus tissues using a tag-based digital gene expression system. We obtained a sequencing depth of over 3.15 million tags per sample and identified a large number of genes associated with tissue specific functions and taxane biosynthetic pathway. The expression of the taxane biosynthetic genes is significantly higher in the root than in the leaf and the stem, while high activity of taxane-producing pathway in the root was also revealed via metabolomic analyses. Moreover, many antisense transcripts and novel transcripts were found; clusters with similar differential expression patterns, enriched GO terms and enriched metabolic pathways with regard to the differentially expressed genes were revealed for the first time.
Our data provides the most comprehensive sequence resource available for Taxus study and will help define mechanisms of tissue specific functions and secondary metabolism in non-model plant organisms.

Download full-text


Available from: Guangbo Ge,
  • Source
    • "Species Explant Profiling method Number of transcripts Reference T. chinensis MeJA elicited cells Illumina deep sequencing 46,581 unigenes Li et al. (2012a, 2012b) T. chinensis MeJA elicited cells Illumina deep sequencing 1,256,425 sRNAs Qiu et al. (2009) T. chinensis MeJA elicited cells Random Sanger sequencing of cDNA library 3563 unigenes Jennewein et al. (2004) T. cuspidata MeJA elicited cells Sanger sequencing of subtractive hybridization library 331 unigenes Lenka et al. (2012) T. cuspidata (Cambial) cells 454 deep sequencing 26,906 unigenes Lee et al. (2010) T. cuspidata Needles 454 deep sequencing 20,557 unigenes Wu et al. (2011) T. mairei Roots, leaves, stems Illumina deep sequencing 36,493 unigenes Hao et al. (2011a, 2011b "
    [Show abstract] [Hide abstract]
    ABSTRACT: Taxol is a complex diterpene alkaloid scarcely produced in nature and with a high anticancer activity. Biotechnological systems for taxol production based on cell cultures of Taxus spp. have been developed, but the growing commercial demand for taxol and its precursors requires the optimization of these procedures. In order to increase the biotechnological production of taxol and related taxanes in Taxus spp. cell cultures, it is necessary not only to take an empirical approach that strives to optimize in-put factors (cell line selection, culture conditions, elicitation, up-scaling, etc.) and out-put factors (growth, production, yields, etc.), but also to carry out molecular biological studies. The latter can provide valuable insight into how the enhancement of taxane biosynthesis and accumulation affects metabolic profiles and gene expression in Taxus spp. cell cultures.
    Biotechnology advances 11/2014; 32(6). DOI:10.1016/j.biotechadv.2014.03.002 · 9.02 Impact Factor
  • Source
    • "e l s e v i e r . c o m / l o c a t e / g e n e (Hao et al., 2011; Li et al., 2011; Sandmann et al., 2011; Iseli et al., 1999). In the current study, we compared transcriptome sequences of above-and underground tissues of R. algida to obtain a profile of molecular mechanisms involved in synthesis of bioactive compounds . "
    [Show abstract] [Hide abstract]
    ABSTRACT: Transcriptome sequencing is a powerful tool for the assessment of gene expression and the identification and characterization of molecular markers in non-model organisms. Rhodiola algida L. (Crassulaceae), endemic to the Qinghai-Tibetan Plateau, has long been used in traditional Chinese medicine to prevent altitude sickness and eliminate fatigue. Illumina-based high-throughput transcriptome sequencing of aboveground and underground tissues of R. algida respectively yielded 5.40 million and 5.18 million clean reads. A total of 82,664 unigenes averaging 577bp in length were generated from the aboveground clean reads, with 86,237 unigenes of 502-bp mean length obtained from the underground tissues. Of 55,028 unigenes compared with sequences in UniProt databases, 20,413 unigenes had significant similarities with existing sequences in NR, NT, Swiss-Prot, GO, KEGG, and COG databases. Single nucleotide polymorphism (SNP) analysis identified 237,294 SNPs from 154,636 contigs of aboveground tissues and 197,540 SNPs from 144,963 underground-derived contigs. The information uncovered in this study should serve as a valuable resource for the characterization of important traits related to secondary metabolite formation and for the identification of associated molecular mechanisms.
    Gene 10/2014; 553(2):90-97. DOI:10.1016/j.gene.2014.09.063 · 2.14 Impact Factor
  • Source
    • "Accurate assembly of short reads generated in the NGS platforms are essential in sequence annotation [13] and transcriptome analysis of non-model organisms [10], [14]. Over 100 studies have been conducted on gene expression pattern, single nucleotide polymorphism (SNP) identification, transcriptome analysis involving various aquaculture species, using the NGS platforms [15] as well as on the complete set of all transcripts from certain types of cells or tissues [16], [17]. Recent transcriptome studies on soft-shelled turtle, basal jawed vertebrates and neo-tropical catfish [18], [19], [20]; provide evidence for suitability of this platform for surveying the complex vertebrate transcripts. "
    [Show abstract] [Hide abstract]
    ABSTRACT: From an immunologist perspective, sharks are an important group of jawed cartilaginous fishes and survey of the public database revealed a great gap in availability of large-scale sequence data for the group of Chondrichthyans the elasmobranchs. In an attempt to bridge this deficit we generated the transcriptome from the spleen and kidney tissues (a total of 1,606,172 transcripts) of the shark, Chiloscyllium griseum using the Illumina HiSeq2000 platform. With a cut off of > = 300 bp and an expression value of >1RPKM we used 43,385 transcripts for BLASTX analysis which revealed 17,548 transcripts matching to the NCBI nr database with an E-value of < = 10-5 and similarity score of 40%. The longest transcript was 16,974 bases with matched to HECT domain containing E3 ubiqutin protein ligase. MEGAN4 annotation pipeline revealed immune and signalling pathways including cell adhesion molecules, cytokine-cytokine receptor interaction, T-cell receptor signalling pathway and chemokine signaling pathway to be highly expressed in spleen, while different metabolism pathways such as amino acid metabolism, carbohydrate metabolism, lipid metabolism and xenobiotic biodegradation were highly expressed in kidney. Few of the candidate genes were selected to analyze their expression levels in various tissues by real-time PCR and also localization of a receptor by in-situ PCR to validate the prediction. We also predicted the domains structures of some of the identified pattern recognition receptors, their phylogenetic relationship with lower and higher vertebrates and the complete downstream signaling mediators of classical dsRNA signaling pathway. The generated transcriptome will be a valuable resource to further genetic and genomic research in elasmobranchs.
    PLoS ONE 06/2014; 9(6):e100018. DOI:10.1371/journal.pone.0100018 · 3.23 Impact Factor
Show more