De novo assembly and validation of planaria transcriptome by massive parallel sequencing and shotgun proteomics.

Max-Delbrück-Center for Molecular Medicine, Berlin Institute for Medical Systems Biology, Robert Rössle Strasse 10, Berlin, Germany.
Genome Research (Impact Factor: 13.85). 05/2011; 21(7):1193-200. DOI: 10.1101/gr.113779.110
Source: PubMed

ABSTRACT Freshwater planaria are a very attractive model system for stem cell biology, tissue homeostasis, and regeneration. The genome of the planarian Schmidtea mediterranea has recently been sequenced and is estimated to contain >20,000 protein-encoding genes. However, the characterization of its transcriptome is far from complete. Furthermore, not a single proteome of the entire phylum has been assayed on a genome-wide level. We devised an efficient sequencing strategy that allowed us to de novo assemble a major fraction of the S. mediterranea transcriptome. We then used independent assays and massive shotgun proteomics to validate the authenticity of transcripts. In total, our de novo assembly yielded 18,619 candidate transcripts with a mean length of 1118 nt after filtering. A total of 17,564 candidate transcripts could be mapped to 15,284 distinct loci on the current genome reference sequence. RACE confirmed complete or almost complete 5' and 3' ends for 22/24 transcripts. The frequencies of frame shifts, fusion, and fission events in the assembled transcripts were computationally estimated to be 4.2%-13%, 0%-3.7%, and 2.6%, respectively. Our shotgun proteomics produced 16,135 distinct peptides that validated 4200 transcripts (FDR ≤1%). The catalog of transcripts assembled in this study, together with the identified peptides, dramatically expands and refines planarian gene annotation, demonstrated by validation of several previously unknown transcripts with stem cell-dependent expression patterns. In addition, our robust transcriptome characterization pipeline could be applied to other organisms without genome assembly. All of our data, including homology annotation, are freely available at SmedGD, the S. mediterranea genome database.

Available from: Pinar Önal, Jun 28, 2015
  • Source
    ABSTRACT: Planarian flatworms regenerate every organ after amputation. Adult pluripotent stem cells drive this ability, but how injury activates and directs stem cells into the appropriate lineages is unclear. Here we describe a single-organ regeneration assay in which ejection of the planarian pharynx is selectively induced by brief exposure of animals to sodium azide. To identify genes required for pharynx regeneration, we performed an RNAi screen of 356 genes upregulated after amputation, using successful feeding as a proxy for regeneration. We found that knockdown of 20 genes caused a wide range of regeneration phenotypes and that RNAi of the forkhead transcription factor FoxA, which is expressed in a subpopulation of stem cells, specifically inhibited regrowth of the pharynx. Selective amputation of the pharynx therefore permits the identification of genes required for organ-specific regeneration and suggests an ancient function for FoxA-dependent transcriptional programs in driving regeneration. DOI: http://dx.doi.org/10.7554/eLife.02238.001.
    eLife Sciences 04/2014; 3:e02238. DOI:10.7554/eLife.02238 · 8.52 Impact Factor
  • Source
    ABSTRACT: Transcriptome analysis of polar bears (Ursus maritimus) yielded sequences with highest similarity to the human endogenous retrovirus group HERV-K(HML-2). Further analysis of the polar bear draft genome identified an endogenous betaretrovirus group comprising 26 proviral copies and 231 solo LTRs. Molecular dating indicates the group originated before the divergence of bears from a common ancestor but is not present in all carnivores. Closely related sequences were identified in the giant panda (Ailuropoda melanoleuca) and characterized from its genome. We have designated the polar bear and giant panda sequences U. maritimus endogenous retrovirus (UmaERV) and A. melanoleuca endogenous retrovirus (AmeERV), respectively. Phylogenetic analysis demonstrated that the bear virus group is nested within the HERV-K supergroup among bovine and bat endogenous retroviruses suggesting a complex evolutionary history within the HERV-K group. All individual remnants of proviral sequences contain numerous frameshifts and stop codons and thus, the virus is likely non-infectious.
    Virology 05/2013; 443(1). DOI:10.1016/j.virol.2013.05.008 · 3.28 Impact Factor
  • Source
    ABSTRACT: During platyhelminth infection, a cocktail of proteins is released by the parasite to aid invasion, initiate feeding, facilitate adaptation and mediate modulation of the host immune response. Included amongst these proteins is the Venom Allergen-Like (VAL) family, part of the larger sperm coating protein/Tpx-1/Ag5/PR-1/Sc7 (SCP/TAPS) superfamily. To explore the significance of this protein family during Platyhelminthes development and host interactions, we systematically summarize all published proteomic, genomic and immunological investigations of the VAL protein family to date. By conducting new genomic and transcriptomic interrogations to identify over 200 VAL proteins (228) from species in all 4 traditional taxonomic classes (Trematoda, Cestoda, Monogenea and Turbellaria), we further expand our knowledge related to platyhelminth VAL diversity across the phylum. Subsequent phylogenetic and tertiary structural analyses reveal several class-specific VAL features, which likely indicate a range of roles mediated by this protein family. Our comprehensive analysis of platyhelminth VALs represents a unifying synopsis for understanding diversity within this protein family and a firm context in which to initiate future functional characterization of these enigmatic members.
    Parasitology 05/2012; 139(10):1231-45. DOI:10.1017/S0031182012000704 · 2.35 Impact Factor