Thesis

mRNA localization and transcriptome dynamics in early zebrafish development

Authors:
To read the full-text of this research, you can request a copy directly from the author.

Abstract

Die Lokalisierung von mRNA ist ein wichtiger regulativer Mechanismus in polarisierten Zellen und in frühen Embryonalstadien. Dort sind räumliche Muster maternaler mRNA für die korrekte Entwicklung der Körperachsen und die Spezifizierung der Keimzellen verantwortlich. Systematische Analysen dieser Prozesse wurden jedoch bisher limitiert durch einen Mangel an räumlicher und zeitlicher Auflösung von Einzelzell- Sequenzierungsdaten. Wir analysierten die Dynamik des räumlichen und zeitlichen Transkriptoms während frühen Embryonalstadien von Zebrafischen. Wir verbesserten Empfindlichkeit und Auflösung von tomo-seq und erfassten damit systematisch räumlich aufgelöste Transkriptome entlang der animal-vegetalen-Achse Embryonen im Einzell-Stadium und fanden 97 vegetal lokalisierte Gene. Außerdem etablierten wir eine Hochdurchsatz kompatible Variante der RNA-Markierungsmethode scSLAM-seq. Wir wendeten diese in Embryonen während der Gastrulation. Von den vegetal lokalisierten Genen waren 22 angereichert in Keimzellen, was eine funktionelle Rolle bei der Spezifizierung von Keimzellen nahelegt. Mit tomo-seq untersuchten wir die evolutionäre Konservierung der RNA-Lokalisierung zwischen Zebrafischen und gereiften Oozyten zweier Xenopus-Arten. Wir verglichen die lokalisierten Gene, suchten nach konservierten 3'UTR-Motiven, und fanden zum Teil überlappende Motive, was auf eine mögliche mechanistische Konservierung der Lokalisierungsmechanismen hinweist. Wir untersuchten auch RNA-Editierung von Adenin zu Inosin während der Embryonalentwicklung und in den Organen erwachsener Fische. In im Gehirn exprimierten Transkripten fanden wir 117 Editierstellen, die hauptsächlich für Ionentransporter kodieren und zum Teil zum Menschen konserviert sind. Die höchsten Editierraten konnten wir in Eierstöcken, Hoden und frühen Embryonen nachweisen, was auf eine mögliche Rolle bei der Regulierung der RNA-Stabilität hindeutet.

No full-text available

Request Full-text Paper PDF

To read the full-text of this research,
you can request a copy directly from the author.

ResearchGate has not been able to resolve any citations for this publication.
Article
Full-text available
A-to-I RNA editing is a common post transcriptional mechanism, mediated by the Adenosine deaminase that acts on RNA (ADAR) enzymes, that increases transcript and protein diversity. The study of RNA editing is limited by the absence of editing maps for most model organisms, hindering the understanding of its impact on various physiological conditions. Here, we mapped the vertebrate developmental landscape of A-to-I RNA editing, and generated the first comprehensive atlas of editing sites in zebrafish. Tens of thousands unique editing events and 149 coding sites were identified with high-accuracy. Some of these edited sites are conserved between zebrafish and humans. Sequence analysis of RNA over seven developmental stages revealed high levels of editing activity in early stages of embryogenesis, when embryos rely on maternal mRNAs and proteins. In contrast to the other organisms studied so far, the highest levels of editing were detected in the zebrafish ovary and testes. This resource can serve as the basis for understanding of the role of editing during zebrafish development and maturity.
Article
Full-text available
Several techniques are currently being developed for spatially resolved omics profiling, but each new method requires the setup of specific detection strategies or specialized instrumentation. Here we describe an imaging-free framework to localize high-throughput readouts within a tissue by cutting the sample into thin strips in a way that allows subsequent image reconstruction. We implemented this framework to transform a low-input RNA sequencing protocol into an imaging-free spatial transcriptomics technique (called STRP-seq) and validated it by profiling the spatial transcriptome of the mouse brain. We applied the technique to the brain of the Australian bearded dragon, Pogona vitticeps. Our results reveal the molecular anatomy of the telencephalon of this lizard, providing evidence for a marked regionalization of the reptilian pallium and subpallium. We expect that STRP-seq can be used to derive spatially resolved data from a range of other omics techniques. A refined spatial sampling technique transforms standard RNA sequencing into a spatial transcriptomics method.
Preprint
Full-text available
Myocardial infarction is a leading cause of death worldwide, as the adult human heart does not have the ability to regenerate efficiently after insults. In contrast, the adult zebrafish heart has a high capacity for regeneration, and understanding the mechanisms of regenerative processes in fish allows identification of novel therapeutic strategies. While several pro-regenerative factors have been described, the cell types orchestrating heart regeneration remain largely elusive. To overcome this conceptual limitation, we dissected cell type diversity in the regenerating zebrafish heart based on single cell transcriptomics and spatiotemporal analysis. We discovered a dramatic induction of several pro-regenerative cell types with fibroblast characteristics. To understand the cascade of events leading to heart regeneration, we determined the origin of these cell types by high-throughput lineage tracing. We found that pro-regenerative fibroblasts are derived from two separate sources, the epicardium and the endocardium. Mechanistically, we identified Wnt signaling as a key regulator of the endocardial regenerative response. In summary, our results uncover specialized fibroblast cell types as major drivers of heart regeneration, thereby opening up new possibilities to interfere with the regenerative capacity of the vertebrate heart.
Preprint
Full-text available
Early stages of embryogenesis depend heavily on subcellular localization and transport of maternally deposited mRNA. However, systematic analysis of these processes is currently hindered by a lack of spatio-temporal information in single-cell RNA sequencing. Here, we combined spatially-resolved transcriptomics and single-cell RNA labeling to study the spatio-temporal dynamics of the transcriptome during the first few hours of zebrafish development. We measured spatial localization of mRNA molecules with sub-single-cell resolution at the one-cell stage, which allowed us to identify a class of mRNAs that are specifically localized at an extraembryonic position, the vegetal pole. Furthermore, we established a method for high-throughput single-cell RNA labeling in early zebrafish embryos, which enabled us to follow the fate of individual maternal transcripts until gastrulation. This approach revealed that many localized transcripts are specifically transported to the primordial germ cells. Finally, we acquired spatial transcriptomes of two xenopus species, and we compared evolutionary conservation of localized genes as well as enriched sequence motifs. In summary, we established sub-single-cell spatial transcriptomics and single-cell RNA labeling to reveal principles of mRNA localization in early vertebrate development.
Article
Full-text available
Single-cell RNA sequencing offers snapshots of whole transcriptomes but obscures the temporal RNA dynamics. Here we present single-cell metabolically labeled new RNA tagging sequencing (scNT-seq), a method for massively parallel analysis of newly transcribed and pre-existing mRNAs from the same cell. This droplet microfluidics-based method enables high-throughput chemical conversion on barcoded beads, efficiently marking newly transcribed mRNAs with T-to-C substitutions. Using scNT-seq, we jointly profiled new and old transcriptomes in ~55,000 single cells. These data revealed time-resolved transcription factor activities and cell-state trajectories at the single-cell level in response to neuronal activation. We further determined rates of RNA biogenesis and decay to uncover RNA regulatory strategies during stepwise conversion between pluripotent and rare totipotent two-cell embryo (2C)-like stem cell states. Finally, integrating scNT-seq with genetic perturbation identifies DNA methylcytosine dioxygenase as an epigenetic barrier into the 2C-like cell state. Time-resolved single-cell transcriptomic analysis thus opens new lines of inquiry regarding cell-type-specific RNA regulatory mechanisms. ScNT-seq leverages metabolic labeling of nascent RNA and droplet-based sequencing for parallel analysis of newly transcribed and pre-existing mRNAs, which enables time-resolved analysis of dynamic biological processes in single cells.
Article
Full-text available
RNA velocity has opened up new ways of studying cellular differentiation in single-cell RNA-sequencing data. It describes the rate of gene expression change for an individual gene at a given time point based on the ratio of its spliced and unspliced messenger RNA (mRNA). However, errors in velocity estimates arise if the central assumptions of a common splicing rate and the observation of the full splicing dynamics with steady-state mRNA levels are violated. Here we present scVelo, a method that overcomes these limitations by solving the full transcriptional dynamics of splicing kinetics using a likelihood-based dynamical model. This generalizes RNA velocity to systems with transient cell states, which are common in development and in response to perturbations. We apply scVelo to disentangling subpopulation kinetics in neurogenesis and pancreatic endocrinogenesis. We infer gene-specific rates of transcription, splicing and degradation, recover each cell’s position in the underlying differentiation processes and detect putative driver genes. scVelo will facilitate the study of lineage decisions and gene regulation. scVelo reconstructs transient cell states and differentiation pathways from single-cell RNA-sequencing data.
Article
Full-text available
Revealing fine-scale cellular heterogeneity among spatial context and the functional and structural foundations of tissue architecture is fundamental within biological research and pharmacology. Unlike traditional approaches involving single molecules or bulk omics, cutting-edge, spatially resolved transcriptomics techniques offer near-single-cell or even subcellular resolution within tissues. Massive information across higher dimensions along with position-coordinating labels can better map the whole 3D transcriptional landscape of tissues. In this review, we focus on developments and strategies in spatially resolved transcriptomics, compare the cell and gene throughput and spatial resolution in detail for existing methods, and highlight the enormous potential in biomedical research.
Article
Full-text available
Recent advances in spatially resolved transcriptomics have greatly expanded the knowledge of complex multicellular biological systems. The field has quickly expanded in recent years, and several new technologies have been developed that all aim to combine gene expression data with spatial information. The vast array of methodologies displays fundamental differences in their approach to obtain this information, and thus, demonstrate method‐specific advantages and shortcomings. While the field is moving forward at a rapid pace, there are still multiple challenges presented to be addressed, including sensitivity, labor extensiveness, tissue‐type dependence, and limited capacity to obtain detailed single‐cell information. No single method can currently address all these key parameters. In this review, available spatial transcriptomics methods are described and their applications as well as their strengths and weaknesses are discussed. Future developments are explored and where the field is heading to is deliberated upon.
Article
Full-text available
The Alliance of Genome Resources (Alliance) is a consortium of the major model organism databases and the Gene Ontology that is guided by the vision of facilitating exploration of related genes in human and well-studied model organisms by providing a highly integrated and comprehensive platform that enables researchers to leverage the extensive body of genetic and genomic studies in these organisms. Initiated in 2016, the Alliance is building a central portal (www.alliancegenome.org) for access to data for the primary model organisms along with gene ontology data and human data. All data types represented in the Alliance portal (e.g. genomic data and phenotype descriptions) have common data models and workflows for curation. All data are open and freely available via a variety of mechanisms. Long-term plans for the Alliance project include a focus on coverage of additional model organisms including those without dedicated curation communities, and the inclusion of new data types with a particular focus on providing data and tools for the non-model-organism researcher that support enhanced discovery about human health and disease. Here we review current progress and present immediate plans for this new bioinformatics resource.
Article
Full-text available
Background: Adenosine-to-inosine (A-to-I) RNA editing is a process that contributes to the diversification of proteins that has been shown to be essential for neurotransmission and other neuronal functions. However, the spatiotemporal and diversification properties of RNA editing in the brain are largely unknown. Here, we applied in situ sequencing to distinguish between edited and unedited transcripts in distinct regions of the mouse brain at four developmental stages, and investigate the diversity of the RNA landscape. Results: We analyzed RNA editing at codon-altering sites using in situ sequencing at single-cell resolution, in combination with the detection of individual ADAR enzymes and specific cell type marker transcripts. This approach revealed cell-type-specific regulation of RNA editing of a set of transcripts, and developmental and regional variation in editing levels for many of the targeted sites. We found increasing editing diversity throughout development, which arises through regional- and cell type-specific regulation of ADAR enzymes and target transcripts. Conclusions: Our single-cell in situ sequencing method has proved useful to study the complex landscape of RNA editing and our results indicate that this complexity arises due to distinct mechanisms of regulating individual RNA editing sites, acting both regionally and in specific cell types.
Preprint
Full-text available
In colorectal cancer, oncogenic mutations transform a hierarchically organized and homeostatic epithelium into invasive cancer tissue lacking visible organization. We sought to identify differences in cellular composition between normal colon and colorectal cancer, and to define signals controlling cancer cell development. We used single cell RNA and protein profiling to analyze tumors and matched normal tissues of twelve colorectal cancer patients. RNA metabolic labelling followed by single cell RNA sequencing in patient-derived normal colon and colorectal cancer organoids was employed to define colorectal cancer cell developmental trajectories. We find that colorectal cancer tissues exhibited consistent changes in cellular composition in the epithelial, immune and stromal compartments across patients compared to normal colon. Tumor epithelial cells displayed patient-specific gene expression often correlating with somatic copy number alterations, but mainly organized into patient-overarching clusters. These clusters were defined by cell type-specific transcriptional programs related to stem, transient-amplifying and immature goblet cells, and showed differential expression for signatures of oncogenic traits such as replication stress. Patient-derived colorectal cancer organoids exhibited developmental trajectories forming along a gradient of mitogen-activated protein kinase activity that were Wnt-independent. Likewise, colorectal cancer cell types of patient samples were organized by mitogen-activated protein kinase activity. Our single-cell analyses provide a subtyping system for colorectal cancer cells based on transcriptional readout of morphogenetic signals and oncogenic traits. We provide evidence that mitogen-activated protein kinase signaling, a key pathway for targeted therapy, is a main driver of colorectal cancer cell plasticity.
Article
Full-text available
Multiplexed RNA sequencing in individual cells is transforming basic and clinical life sciences1–4. Often, however, tissues must first be dissociated, and crucial information about spatial relationships and communication between cells is thus lost. Existing approaches to reconstruct tissues assign spatial positions to each cell, independently of other cells, by using spatial patterns of expression of marker genes5,6—which often do not exist. Here we reconstruct spatial positions with little or no prior knowledge, by searching for spatial arrangements of sequenced cells in which nearby cells have transcriptional profiles that are often (but not always) more similar than cells that are farther apart. We formulate this task as a generalized optimal-transport problem for probabilistic embedding and derive an efficient iterative algorithm to solve it. We reconstruct the spatial expression of genes in mammalian liver and intestinal epithelium, fly and zebrafish embryos, sections from the mammalian cerebellum and whole kidney, and use the reconstructed tissues to identify genes that are spatially informative. Thus, we identify an organization principle for the spatial expression of genes in animal tissues, which can be exploited to infer meaningful probabilities of spatial position for individual cells. Our framework (‘novoSpaRc’) can incorporate prior spatial information and is compatible with any single-cell technology. Additional principles that underlie the cartography of gene expression can be tested using our approach. A new computational framework, novoSpaRc, leverages single-cell data to reconstruct spatial context for cells and spatial expression across tissues and organisms, on the basis of an organization principle for gene expression.
Article
Full-text available
Adenosine-to-inosine (A-to-I) RNA editing by the adenosine deaminase that acts on RNA (ADAR) enzymes is a common RNA modification, preventing false activation of the innate immune system by endogenous double-stranded RNAs. Methods for quantification of ADAR activity are sought after, due to an increasing interest in the role of ADARs in cancer and autoimmune disorders, as well as attempts to harness the ADAR enzymes for RNA engineering. Here, we present the Alu editing index (AEI), a robust and simple-to-use computational tool devised for this purpose. We describe its properties and demonstrate its superiority to current quantification methods of ADAR activity. The AEI is used to map global editing across a large dataset of healthy human samples and identify putative regulators of ADAR, as well as previously unknown factors affecting the observed Alu editing levels. These should be taken into account in future comparative studies of ADAR activity. The AEI tool is available at https://github.com/a2iEditing/RNAEditingIndexer.
Article
Full-text available
The expression profiles and spatial distributions of RNAs regulate many cellular functions. Image-based transcriptomic approaches provide powerful means to measure both expression and spatial information of RNAs in individual cells within their native environment. Among these approaches, multiplexed error-robust fluorescence in situ hybridization (MERFISH) has achieved spatially resolved RNA quantification at transcriptome scale by massively multiplexing single-molecule FISH measurements. Here, we increased the gene throughput of MERFISH and demonstrated simultaneous measurements of RNA transcripts from ∼10,000 genes in individual cells with ∼80% detection efficiency and ∼4% misidentification rate. We combined MERFISH with cellular structure imaging to determine subcellular compartmentalization of RNAs. We validated this approach by showing enrichment of secretome transcripts at the endoplasmic reticulum, and further revealed enrichment of long noncoding RNAs, RNAs with retained introns, and a subgroup of protein-coding mRNAs in the cell nucleus. Leveraging spatially resolved RNA profiling, we developed an approach to determine RNA velocity in situ using the balance of nuclear versus cytoplasmic RNA counts. We applied this approach to infer pseudotime ordering of cells and identified cells at different cell-cycle states, revealing ∼1,600 genes with putative cell cycle-dependent expression and a gradual transcription profile change as cells progress through cell-cycle stages. Our analysis further revealed cell cycle-dependent and cell cycle-independent spatial heterogeneity of transcriptionally distinct cells. We envision that the ability to perform spatially resolved, genome-wide RNA profiling with high detection efficiency and accuracy by MERFISH could help address a wide array of questions ranging from the regulation of gene expression in cells to the development of cell fate and organization in tissues.
Preprint
Full-text available
Subcellular RNA localization regulates spatially polarized cellular processes, but unbiased investigation of its control in vivo remains challenging. Current hybridization-based methods cannot differentiate small regulatory variants, while in situ sequencing is limited by short reads. We solved these problems using a bidirectional sequencing chemistry to efficiently image transcript-specific barcode in situ, which are then extracted and assembled into longer reads using NGS. In the Drosophila retina, genes regulating eye development and cytoskeletal organization were enriched compared to methods using extracted RNA. We therefore named our method In Situ Transcriptome Accessibility sequencing (INSTA-seq). Sequencing reads terminated near 3' UTR cis-motifs (e.g. Zip48C, stau), revealing RNA-protein interactions. Additionally, Act5C polyadenylation isoforms retaining zipcode motifs were selectively localized to the optical stalk, consistent with their biology. Our platform provides a powerful way to visualize any RNA variants or protein interactions in situ to study their regulation in animal development.
Article
Full-text available
Sequencing of newly synthesised RNA can monitor transcriptional dynamics with great sensitivity and high temporal resolution, but is currently restricted to populations of cells. Here, we develop new transcriptome alkylation-dependent single-cell RNA sequencing (NASC-seq), to monitor newly synthesised and pre-existing RNA simultaneously in single cells. We validate the method on pre-labelled RNA, and by demonstrating that more newly synthesised RNA was detected for genes with known high mRNA turnover. Monitoring RNA synthesis during Jurkat T-cell activation with NASC-seq reveals both rapidly up- and down-regulated genes, and that induced genes are almost exclusively detected as newly transcribed. Moreover, the newly synthesised and pre-existing transcriptomes after T-cell activation are distinct, confirming that NASC-seq simultaneously measures gene expression corresponding to two time points in single cells. Altogether, NASC-seq enables precise temporal monitoring of RNA synthesis at single-cell resolution during homoeostasis, perturbation responses and cellular differentiation.
Article
Full-text available
Single-cell RNA sequencing (scRNA-seq) has highlighted the important role of intercellular heterogeneity in phenotype variability in both health and disease¹. However, current scRNA-seq approaches provide only a snapshot of gene expression and convey little information on the true temporal dynamics and stochastic nature of transcription. A further key limitation of scRNA-seq analysis is that the RNA profile of each individual cell can be analysed only once. Here we introduce single-cell, thiol-(SH)-linked alkylation of RNA for metabolic labelling sequencing (scSLAM-seq), which integrates metabolic RNA labelling², biochemical nucleoside conversion³ and scRNA-seq to record transcriptional activity directly by differentiating between new and old RNA for thousands of genes per single cell. We use scSLAM-seq to study the onset of infection with lytic cytomegalovirus in single mouse fibroblasts. The cell-cycle state and dose of infection deduced from old RNA enable dose–response analysis based on new RNA. scSLAM-seq thereby both visualizes and explains differences in transcriptional activity at the single-cell level. Furthermore, it depicts ‘on–off’ switches and transcriptional burst kinetics in host gene expression with extensive gene-specific differences that correlate with promoter-intrinsic features (TBP–TATA-box interactions and DNA methylation). Thus, gene-specific, and not cell-specific, features explain the heterogeneity in transcriptomes between individual cells and the transcriptional response to perturbations.
Article
Full-text available
Imaging the transcriptome in situ with high accuracy has been a major challenge in single-cell biology, which is particularly hindered by the limits of optical resolution and the density of transcripts in single cells1–5. Here we demonstrate an evolution of sequential fluorescence in situ hybridization (seqFISH+). We show that seqFISH+ can image mRNAs for 10,000 genes in single cells—with high accuracy and sub-diffraction-limit resolution—in the cortex, subventricular zone and olfactory bulb of mouse brain, using a standard confocal microscope. The transcriptome-level profiling of seqFISH+ allows unbiased identification of cell classes and their spatial organization in tissues. In addition, seqFISH+ reveals subcellular mRNA localization patterns in cells and ligand–receptor pairs across neighbouring cells. This technology demonstrates the ability to generate spatial cell atlases and to perform discovery-driven studies of biological processes in situ. seqFISH+, an evolution of sequential fluorescence in situ hybridization with super-resolution imaging capabilities, is used to image mRNAs of 10,000 genes in cultured cells and mouse brain slices, demonstrating the ability to generate spatial atlases and to perform discovery-driven studies in situ.
Preprint
Full-text available
The spatial organization of cells in tissue has a profound influence on their function, yet a high-throughput, genome-wide readout of gene expression with cellular resolution is lacking. Here, we introduce Slide-seq, a highly scalable method that enables facile generation of large volumes of unbiased spatial transcriptomes with 10 µm spatial resolution, comparable to the size of individual cells. In Slide-seq, RNA is transferred from freshly frozen tissue sections onto a surface covered in DNA-barcoded beads with known positions, allowing the spatial locations of the RNA to be inferred by sequencing. To demonstrate Slide-seq’s utility, we localized cell types identified by large-scale scRNA-seq datasets within the cerebellum and hippocampus. We next systematically characterized spatial gene expression patterns in the Purkinje layer of mouse cerebellum, identifying new axes of variation across Purkinje cell compartments. Finally, we used Slide-seq to define the temporal evolution of cell-type-specific responses in a mouse model of traumatic brain injury. Slide-seq will accelerate biological discovery by enabling routine, high-resolution spatial mapping of gene expression. One Sentence Summary Slide-seq measures genome-wide expression in complex tissues at 10-micron resolution.
Article
Full-text available
Heart diseases are the leading cause of death for the vast majority of people around the world, which is often due to the limited capability of human cardiac regeneration. In contrast, zebrafish have the capacity to fully regenerate their hearts after cardiac injury. Understanding and activating these mechanisms would improve health in patients suffering from long-term consequences of ischemia. Therefore, we monitored the dynamic transcriptome response of both mRNA and microRNA in zebrafish at 1–160 days post cryoinjury (dpi). Using a control model of sham-operated and healthy fish, we extracted the regeneration specific response and further delineated the spatio-temporal organization of regeneration processes such as cell cycle and heart function. In addition, we identified novel (miR-148/152, miR-218b and miR-19) and previously known microRNAs among the top regulators of heart regeneration by using theoretically predicted target sites and correlation of expression profiles from both mRNA and microRNA. In a cross-species effort, we validated our findings in the dynamic process of rat myoblasts differentiating into cardiomyocytes-like cells (H9c2 cell line). Concluding, we elucidated different phases of transcriptomic responses during zebrafish heart regeneration. Furthermore, microRNAs showed to be important regulators in cardiomyocyte proliferation over time.
Preprint
Full-text available
Sequencing of newly synthesized RNA can monitor transcriptional dynamics with great sensitivity and high temporal resolution, but is currently restricted to populations of cells. Here, we developed newly synthesized alkylation-dependent single-cell RNA sequencing (NASC-seq), to monitor both newly synthesized and pre-existing RNA in single cells. We validated the method on pre-alkylated exogenous spike-in RNA, and by demonstrating that more newly synthesized RNA was detected for genes with known high mRNA turnover. Importantly, NASC-seq reveals rapidly up- and down-regulated genes during the T-cell activation, and RNA sequenced for induced genes were essentially only newly synthesized. Moreover, the newly synthesized and pre-existing transcriptomes after T-cell activation were distinct confirming that we indeed could simultaneously measure gene expression corresponding to two time points in single cells. Altogether, NASC-seq is a powerful tool to investigate transcriptional dynamics and it will enable the precise monitoring of RNA synthesis at flexible time periods during homeostasis, perturbation responses and cellular differentiation.
Article
Full-text available
Global efforts to create a molecular census of the brain using single-cell transcriptomics are producing a large catalog of molecularly defined cell types. However, spatial information is lacking and new methods are needed to map a large number of cell type–specific markers simultaneously on large tissue areas. Here, we describe a cyclic single-molecule fluorescence in situ hybridization methodology and define the cellular organization of the somatosensory cortex. © 2018, The Author(s), under exclusive licence to Springer Nature America, Inc.
Article
Full-text available
RNA abundance is a powerful indicator of the state of individual cells. Single-cell RNA sequencing can reveal RNA abundance with high quantitative accuracy, sensitivity and throughput¹. However, this approach captures only a static snapshot at a point in time, posing a challenge for the analysis of time-resolved phenomena such as embryogenesis or tissue regeneration. Here we show that RNA velocity—the time derivative of the gene expression state—can be directly estimated by distinguishing between unspliced and spliced mRNAs in common single-cell RNA sequencing protocols. RNA velocity is a high-dimensional vector that predicts the future state of individual cells on a timescale of hours. We validate its accuracy in the neural crest lineage, demonstrate its use on multiple published datasets and technical platforms, reveal the branching lineage tree of the developing mouse hippocampus, and examine the kinetics of transcription in human embryonic brain. We expect RNA velocity to greatly aid the analysis of developmental lineages and cellular dynamics, particularly in humans.
Article
Full-text available
Phase separation represents an important form of subcellular compartmentalization. However, relatively little is known about how the formation or disassembly of such compartments is regulated. In zebrafish, the Balbiani body (Bb) and the germ plasm (Gp) are intimately linked phase-separated structures essential for germ cell specification and home to many germ cell-specific mRNAs and proteins. Throughout development, these structures occur as a single large aggregate (Bb), which disperses throughout oogenesis and upon fertilization accumulates again into relatively large assemblies (Gp). Formation of the Bb requires Bucky ball (Buc), a protein with prion-like properties. We found that the multi-tudor domain-containing protein Tdrd6a interacts with Buc, affecting its mobility and aggregation properties. Importantly, lack of this regulatory interaction leads to significant defects in germ cell development. Our work presents insights into how prion-like protein aggregations can be regulated and highlights the biological relevance of such regulatory events.
Article
Full-text available
Asymmetric cell division is a ubiquitous feature during the development of higher organisms. Asymmetry is achieved by differential localization or activities of biological molecules such as proteins, and coding and non-coding RNAs. Here, we present subcellular transcriptomic and proteomic analyses along the animal-vegetal axis of Xenopus laevis eggs. More than 98% of the maternal mRNAs could be categorized into four localization profile groups: animal, vegetal, extremely vegetal, and a newly described group of mRNAs that we call extremely animal, which are mRNAs enriched in the animal cortex region. 3'UTRs of localized mRNAs were analyzed for localization motifs. Several putative motifs were discovered for vegetal and extremely vegetal mRNAs, while no distinct conserved motifs for the extremely animal mRNAs were identified, suggesting different localization mechanisms. Asymmetric profiles were also found for proteins, with correlation to those of corresponding mRNAs. Based on unexpected observation of the profiles of the homoeologous genes exd2 we propose a possible mechanism of genetic evolution.
Article
Full-text available
Mapping the vertebrate developmental landscape As embryos develop, numerous cell types with distinct functions and morphologies arise from pluripotent cells. Three research groups have used single-cell RNA sequencing to analyze the transcriptional changes accompanying development of vertebrate embryos (see the Perspective by Harland). Wagner et al. sequenced the transcriptomes of more than 90,000 cells throughout zebrafish development to reveal how cells differentiate during axis patterning, germ layer formation, and early organogenesis. Farrell et al. profiled the transcriptomes of tens of thousands of embryonic cells and applied a computational approach to construct a branching tree describing the transcriptional trajectories that lead to 25 distinct zebrafish cell types. The branching tree revealed how cells change their gene expression as they become more and more specialized. Briggs et al. examined whole frog embryos, spanning zygotic genome activation through early organogenesis, to map cell states and differentiation across all cell lineages over time. These data and approaches pave the way for the comprehensive reconstruction of transcriptional trajectories during development. Science , this issue p. 981 , p. eaar3131 , p. eaar5780 ; see also p. 967
Article
Full-text available
Opacification of the ocular lens, termed cataract, is a common cause of blindness. To become transparent, lens fiber cells undergo degradation of their organelles, including their nuclei, presenting a fundamental question: does signaling/transcription sufficiently explain differentiation of cells progressing toward compromised transcriptional potential? We report that a conserved RNA-binding protein Celf1 post-transcriptionally controls key genes to regulate lens fiber cell differentiation. Celf1-targeted knockout mice and celf1-knockdown zebrafish and Xenopus morphants have severe eye defects/cataract. Celf1 spatiotemporally down-regulates the cyclin-dependent kinase (Cdk) inhibitor p27Kip1 by interacting with its 5’ UTR and mediating translation inhibition. Celf1 deficiency causes ectopic up-regulation of p21Cip1. Further, Celf1 directly binds to the mRNA of the nuclease Dnase2b to maintain its high levels. Together these events are necessary for Cdk1-mediated lamin A/C phosphorylation to initiate nuclear envelope breakdown and DNA degradation in fiber cells. Moreover, Celf1 controls alternative splicing of the membrane-organization factor beta-spectrin and regulates F-actin-crosslinking factor Actn2 mRNA levels, thereby controlling fiber cell morphology. Thus, we illustrate new Celf1-regulated molecular mechanisms in lens development, suggesting that post-transcriptional regulatory RNA-binding proteins have evolved conserved functions to control vertebrate oculogenesis.
Article
Full-text available
Technological advances have made it possible to measure spatially resolved gene expression at high throughput. However, methods to analyze these data are not established. Here we describe SpatialDE, a statistical test to identify genes with spatial patterns of expression variation from multiplexed imaging or spatial RNA-sequencing data. SpatialDE also implements 'automatic expression histology', a spatial gene-clustering approach that enables expression-based tissue histology. © 2018 Nature America, Inc., part of of Springer Nature. All rights reserved.
Article
Full-text available
Disease epidemiology during ageing shows a transition from cancer to degenerative chronic disorders as dominant contributors to mortality in the old. Nevertheless, it has remained unclear to what extent molecular signatures of ageing reflect this phenomenon. Here we report on the identification of a conserved transcriptomic signature of ageing based on gene expression data from four vertebrate species across four tissues. We find that ageing-associated transcriptomic changes follow trajectories similar to the transcriptional alterations observed in degenerative ageing diseases but are in opposite direction to the transcriptomic alterations observed in cancer. We confirm the existence of a similar antagonism on the genomic level, where a majority of shared risk alleles which increase the risk of cancer decrease the risk of chronic degenerative disorders and vice versa. These results reveal a fundamental trade-off between cancer and degenerative ageing diseases that sheds light on the pronounced shift in their epidemiology during ageing.
Article
Full-text available
RNA sequencing (RNA-seq) offers a snapshot of cellular RNA populations, but not temporal information about the sequenced RNA. Here we report TimeLapse-seq, which uses oxidative-nucleophilic-aromatic substitution to convert 4-thiouridine into cytidine analogs, yielding apparent U-to-C mutations that mark new transcripts upon sequencing. TimeLapse-seq is a single-molecule approach that is adaptable to many applications and reveals RNA dynamics and induced differential expression concealed in traditional RNA-seq.
Article
Full-text available
Gene expression profiling by high-throughput sequencing reveals qualitative and quantitative changes in RNA species at steady state but obscures the intracellular dynamics of RNA transcription, processing and decay. We developed thiol(SH)-linked alkylation for the metabolic sequencing of RNA (SLAM seq), an orthogonal-chemistry-based RNA sequencing technology that detects 4-thiouridine (s4U) incorporation in RNA species at single-nucleotide resolution. In combination with well-established metabolic RNA labeling protocols and coupled to standard, low-input, high-throughput RNA sequencing methods, SLAM seq enabled rapid access to RNA-polymerase-II-dependent gene expression dynamics in the context of total RNA. We validated the method in mouse embryonic stem cells by showing that the RNA-polymerase-II-dependent transcriptional output scaled with Oct4/Sox2/Nanog-defined enhancer activity, and we provide quantitative and mechanistic evidence for transcript-specific RNA turnover mediated by post-transcriptional gene regulatory pathways initiated by microRNAs and N6-methyladenosine. SLAM seq facilitates the dissection of fundamental mechanisms that control gene expression in an accessible, cost-effective and scalable manner.
Article
Full-text available
Protein subcellular localization is fundamental to the establishment of the body axis, cell migration, synaptic plasticity, and a vast range of other biological processes. Protein localization occurs through three mechanisms: protein transport, mRNA localization, and local translation. However, the relative contribution of each process to neuronal polarity remains unknown. Using neurons differentiated from mouse embryonic stem cells, we analyze protein and RNA expression and translation rates in isolated cell bodies and neurites genome-wide. We quantify 7323 proteins and the entire transcriptome, and identify hundreds of neurite-localized proteins and locally translated mRNAs. Our results demonstrate that mRNA localization is the primary mechanism for protein localization in neurites that may account for half of the neurite-localized proteome. Moreover, we identify multiple neurite-targeted non-coding RNAs and RNA-binding proteins with potential regulatory roles. These results provide further insight into the mechanisms underlying the establishment of neuronal polarity.
Article
Full-text available
The concentration of depleted uranium (DU) in the environment is expected to increase due to anthropogenic activities, posing potential risks on ecosystems. The effects of chronic exposure to DU at concentration close to the environmental standards (0.3-30 µg DU/L) are scarcely characterised. Genomic alterations caused by low doses of pollutants can potentially propagate over generations, but how these effects may affect the health of the progeny remain uncertain for the vast majority of toxicants. The present dataset describes the transcriptomic effects of a chronic exposure to 20 µg DU/L during 10 days on adult zebrafish (Danio rerio) organs, the brain, the testis and the ovaries. The potential multigenerational effects of DU were assessed on the progeny of the adult exposed fish at the two-cells stage and after four days of development. We describe in this article the summary statistics of the differential gene expression analysis and focus on key molecular pathways affected by an exposure to a low concentration of DU. The data presented in this study supports the observation made in Armant et al. (2017) [1] (https://doi.org/10.1016/j.dib.2016.05.007) that DU can induce a molecular stress in both adult zebrafish and their progeny. The raw dataset has been deposited at the Gene Expression Omnibus (GEO) repository under the accession number GEO:GSE96603.
Article
Full-text available
Conventional gene expression studies analyze multiple cells simultaneously or single cells, for which the exact in vivo or in situ position is unknown. Although cellular heterogeneity can be discerned when analyzing single cells, any spatially defined attributes that underpin the heterogeneous nature of the cells cannot be identified. Here, we describe how to use Geo-seq, a method that combines laser capture microdissection (LCM) and single-cell RNA-seq technology. The combination of these two methods enables the elucidation of cellular heterogeneity and spatial variance simultaneously. The Geo-seq protocol allows the profiling of transcriptome information from only a small number cells and retains their native spatial information. This protocol has wide potential applications to address biological and pathological questions of cellular properties such as prospective cell fates, biological function and the gene regulatory network. Geo-seq has been applied to investigate the spatial transcriptome of mouse early embryo, mouse brain, and pathological liver and sperm tissues. The entire protocol from tissue collection and microdissection to sequencing requires ~5 d, Data analysis takes another 1 or 2 weeks, depending on the amount of data and the speed of the processor.
Article
Full-text available
During oogenesis, hundreds of maternal RNAs are selectively localized to the animal or vegetal pole, including determinants of somatic and germline fates. Although microarray analysis has identified localized determinants, it is not comprehensive and is limited to known transcripts. Here, we utilized high-throughput RNA-sequencing analysis to comprehensively interrogate animal and vegetal pole RNAs in the fully grown Xenopus laevis oocyte. We identified 411 (198 annotated) and 27 (15 annotated) enriched mRNAs at the vegetal and animal pole, respectively. Ninety were novel mRNAs over 4-fold enriched at the vegetal pole and six were over 10-fold enriched at the animal pole. Unlike mRNAs, microRNAs were not asymmetrically distributed. Whole-mount in situ hybridization confirmed that all 17 selected mRNAs were localized. Biological function and network analysis of vegetally enriched transcripts identified protein-modifying enzymes, receptors, ligands, RNA-binding proteins, transcription factors and co-factors with five defining hubs linking 47 genes in a network. Initial functional studies of maternal vegetally localized mRNAs show that sox7 plays a novel and important role in primordial germ cell (PGC) development and that ephrinB1 ( efnb1 ) is required for proper PGC migration. We propose potential pathways operating at the vegetal pole that highlight where future investigations might be most fruitful.
Article
Single-cell transcriptomics has transformed our ability to characterize cell states, but deep biological understanding requires more than a taxonomic listing of clusters. As new methods arise to measure distinct cellular modalities, a key analytical challenge is to integrate these datasets to better understand cellular identity and function. Here, we develop a strategy to "anchor" diverse datasets together, enabling us to integrate single-cell measurements not only across scRNA-seq technologies, but also across different modalities. After demonstrating improvement over existing methods for integrating scRNA-seq data, we anchor scRNA-seq experiments with scATAC-seq to explore chromatin differences in closely related interneuron subsets and project protein expression measurements onto a bone marrow atlas to characterize lymphocyte populations. Lastly, we harmonize in situ gene expression and scRNA-seq datasets, allowing transcriptome-wide imputation of spatial gene expression patterns. Our work presents a strategy for the assembly of harmonized references and transfer of information across datasets.
Article
Cell–cell communication mediated by ligand–receptor complexes is critical to coordinating diverse biological processes, such as development, differentiation and inflammation. To investigate how the context-dependent crosstalk of different cell types enables physiological processes to proceed, we developed CellPhoneDB, a novel repository of ligands, receptors and their interactions. In contrast to other repositories, our database takes into account the subunit architecture of both ligands and receptors, representing heteromeric complexes accurately. We integrated our resource with a statistical framework that predicts enriched cellular interactions between two cell types from single-cell transcriptomics data. Here, we outline the structure and content of our repository, provide procedures for inferring cell–cell communication networks from single-cell RNA sequencing data and present a practical step-by-step guide to help implement the protocol. CellPhoneDB v.2.0 is an updated version of our resource that incorporates additional functionalities to enable users to introduce new interacting molecules and reduces the time and resources needed to interrogate large datasets. CellPhoneDB v.2.0 is publicly available, both as code and as a user-friendly web interface; it can be used by both experts and researchers with little experience in computational genomics. In our protocol, we demonstrate how to evaluate meaningful biological interactions with CellPhoneDB v.2.0 using published datasets. This protocol typically takes ~2 h to complete, from installation to statistical analysis and visualization, for a dataset of ~10 GB, 10,000 cells and 19 cell types, and using five threads. CellPhoneDB combines an interactive database and a statistical framework for the exploration of ligand–receptor interactions inferred from single-cell transcriptomics measurements.
Article
The hematopoietic stem and progenitor cell (HSPC) niche is a supportive microenvironment comprised of distinct cell types, including specialized vascular endothelial cells (ECs) that directly interact with HSPCs and promote stem cell function. Utilizing spatial transcriptomics, in combination with tissue-specific RNA-seq, we identified 29 genes selectively enriched in ECs of the zebrafish fetal hematopoietic niche. Using upstream regulatory sequences for two of these genes, mrc1a and selectin E (sele), we generated GFP reporter lines that allowed us to selectively isolate niche ECs for ATAC-seq. This analysis identified 6,848 regions of chromatin that were accessible in niche ECs but not ECs from other tissues. Several of these regions were associated with the 29 genes. To evaluate whether these regions might be enhancers we coupled them to GFP and injected them into embryos. 12/15 sequences drove GFP expression in niche ECs. Upon closer examination of the mrc1a and sele genes, we identified enhancer sequences as short as 125 bp and 158 bp, respectively, which drove niche EC-specific expression. A genome-wide motif enrichment analysis of the 6,848 uniquely open chromatin regions revealed that Ets, SoxF and Nuclear Hormone Receptor (RXRA/NR2F2, specifically) sites were most enriched. In contrast, 4,522 pan-EC elements were enriched for Ets sites but not SoxF or NHR motifs. Using mutant variants of the 125 bp and 158 bp enhancer sequences, we demonstrated that Ets, SoxF and RXRA/NR2F2 sites were independently required for specific transgene expression. Gel shift experiments demonstrated that NR2F2 could bind the 125 bp and 158 bp zebrafish enhancers and this binding was disrupted upon mutation of the NR2F2 binding sites. Knockdown of the endogenous zebrafish nr2f2 gene resulted in a loss of expression of the 125 bp mrc1a enhancer-GFP construct and a significantly reduced number of HSPCs in the fetal niche. We next injected pools of human transcription factors, including at least one member from each of the three families, under the control of a ubiquitous promoter. Strikingly, we found that a combination of ETV2 or ETS1 with SOX7 and NR2F2 generated ectopic patches of mrc1a+ niche ECs that recruited runx1+ HSPCs outside of the endogenous niche. Using high-resolution live cell imaging we could observe HSPCs initially arriving at the ectopic sites, lodging for several hours and then eventually dividing and migrating away from the site through circulation. HSPCs localized to the ectopic regions were found in both intravascular and extravascular spaces, and were often enwrapped by ECs and in contact with cxcl12a+ stromal cells, similar to what is observed in the endogenous niche. Ectopic regions of niche EC gene expression were similarly observed when alternative regulatory elements were used for transcription factor overexpression, including a pan-EC enhancer (nrp1b), a muscle promoter (mylz2) and a heat shock promoter (hsp70). These results suggest the three-factor combinations are sufficient to reprogram niche EC identify in vivo. Lastly, we evaluated by RNA-seq the expression of our niche EC signature in the zebrafish kidney marrow (the site of adult hematopoiesis) and in ECs from multiple organs of the mouse, including the heart, kidney, liver, lung and bone marrow, at multiple stages of development (E11-13, E14-15, E16-17, P2-P4 and adult). Strikingly, 23/29 genes were highly expressed in ECs of the zebrafish kidney and 21/29 genes were enriched in the ECs of a mammalian hematopoietic organ - the fetal liver and/or adult bone marrow - relative to their expression in ECs from non-hematopoietic organs at the same stage. Notably, for a subset of the genes the expression patterns mirrored the temporal dynamics of HSPC ontogeny in the mouse, showing robust expression in fetal liver ECs and then later in adult bone marrow ECs with a concomitant reduction in liver ECs. An analysis of transcription factor expression within these EC populations revealed that Ets1, the SoxF factor Sox18, and Nr2f2 were the most highly expressed members of the Ets, Sox and NHR families. Collectively our work has uncovered a conserved gene expression signature and transcriptional regulatory program unique to the vascular niche of hematopoietic organs. These findings have important implications for designing a synthetic vascular niche for blood stem cells or for modulating the niche in a therapeutic context. Disclosures Zon: Fate Therapeutics: Equity Ownership; Scholar Rock: Equity Ownership; CAMP4: Equity Ownership.
Chapter
The fate of future generations depends on a high-quality germ line. For a female to successfully produce offspring, her oocytes must be successfully specified and their contents meticulously organized. Germ cells are specified by two general mechanisms: inductive and inherited. In the inductive mechanism, the primordial germ cells (PGCs) are induced by signals from the surrounding cells. In the inherited mechanism, PGCs are specified by passing localized germ plasm material from the oocyte to the future germ cells. The Balbiani body, a conserved oocyte aggregate, facilitates the organization of the oocyte into a polarized cell with discrete cytoplasmic domains, including localizing the germ plasm. In the mouse, the Balbiani body is implicated in oocyte survival, while in frogs and zebrafish the Balbiani body carries specific mRNAs to the vegetal pole. These asymmetric mRNAs form the foundation of the functionally polarized oocyte and play important roles in axial patterning and germ plasm formation of the embryo.
Chapter
Embryonic development is heavily dependent on temporally and spatially restricted gene expression. Spatially resolved measurements of gene expression are therefore crucial for identifying novel regulators and the understanding of their function. However, in situ methods do not resolve global gene expression, and sequencing-based methods usually do not provide spatial information. Here, we describe tomo-seq, a method that combines classical histological sectioning of embryos or tissues with a highly sensitive RNA-sequencing technique. Application of tomo-seq to zebrafish embryos allows reconstructing the spatial gene expression of thousands of genes.
Article
To advance our understanding of the genetic programs that drive cell and tissue specialization, it is necessary to obtain a comprehensive overview of gene expression patterns. Here, we have used spatial transcriptomics to generate high-resolution, anteroposterior gene expression maps of C. elegans males and hermaphrodites. To explore these maps, we have developed computational methods for discovering region- and tissue-specific genes. We have found extensive sex-specific gene expression differences in the germline and sperm and discovered genes that are specifically expressed in the male reproductive tract. These include a group of uncharacterized genes that encode small secreted proteins that are required for male fertility. We conclude that spatial gene expression maps provide a powerful resource for identifying tissue-specific gene functions in C. elegans. Importantly, we found that expression maps from different animals can be precisely aligned, enabling transcriptome-wide comparisons of gene expression patterns.
Article
Transcriptome mapping in the 3D brain RNA sequencing samples the entire transcriptome but lacks anatomical information. In situ hybridization, on the other hand, can only profile a small number of transcripts. In situ sequencing technologies address these shortcomings but face a challenge in dense, complex tissue environments. Wang et al. combined an efficient sequencing approach with hydrogel-tissue chemistry to develop a multidisciplinary technology for three-dimensional (3D) intact-tissue RNA sequencing (see the Perspective by Knöpfel). More than 1000 genes were simultaneously mapped in sections of mouse brain at single-cell resolution to define cell types and circuit states and to reveal cell organization principles. Science , this issue p. eaat5691 ; see also p. 328
Article
Mapping the vertebrate developmental landscape As embryos develop, numerous cell types with distinct functions and morphologies arise from pluripotent cells. Three research groups have used single-cell RNA sequencing to analyze the transcriptional changes accompanying development of vertebrate embryos (see the Perspective by Harland). Wagner et al. sequenced the transcriptomes of more than 90,000 cells throughout zebrafish development to reveal how cells differentiate during axis patterning, germ layer formation, and early organogenesis. Farrell et al. profiled the transcriptomes of tens of thousands of embryonic cells and applied a computational approach to construct a branching tree describing the transcriptional trajectories that lead to 25 distinct zebrafish cell types. The branching tree revealed how cells change their gene expression as they become more and more specialized. Briggs et al. examined whole frog embryos, spanning zygotic genome activation through early organogenesis, to map cell states and differentiation across all cell lineages over time. These data and approaches pave the way for the comprehensive reconstruction of transcriptional trajectories during development. Science , this issue p. 981 , p. eaar3131 , p. eaar5780 ; see also p. 967
Article
The stability of mRNAs is regulated by signals within their sequences, but a systematic and predictive understanding of the underlying sequence rules remains elusive. Here we introduce UTR-seq, a combination of massively parallel reporter assays and regression models, to survey the dynamics of tens of thousands of 3' UTR sequences during early zebrafish embryogenesis. UTR-seq revealed two temporal degradation programs: a maternally encoded early-onset program and a late-onset program that accelerated degradation after zygotic genome activation. Three signals regulated early-onset rates: stabilizing poly-U and UUAG sequences and destabilizing GC-rich signals. Three signals explained late-onset degradation: miR-430 seeds, AU-rich sequences, and Pumilio recognition sites. Sequence-based regression models translated 3' UTRs into their unique decay patterns and predicted the in vivo effect of sequence signals on mRNA stability. Their application led to the successful design of artificial 3' UTRs that conferred specific mRNA dynamics. UTR-seq provides a general strategy to uncover the rules of RNA cis regulation.
Article
The localization and organization of mitochondria- and ribonucleoprotein granule-rich germ plasm is essential for many aspects of germ cell development. In Xenopus, germ plasm is maternally inherited and is required for the specification of primordial germ cells (PGCs). Germ plasm is aggregated into larger patches during egg activation and cleavage and is ultimately translocated perinuclearly during gastrulation. Although microtubule dynamics and a kinesin (Kif4a) have been implicated in Xenopus germ plasm localization, little is known about how germ plasm distribution is regulated. Here, we identify a role for maternal Xenopus Syntabulin in the aggregation of germ plasm following fertilization. We show that depletion of sybu mRNA using antisense oligonucleotides injected into oocytes results in defects in the aggregation and perinuclear transport of germ plasm and subsequently in reduced PGC numbers. Using live imaging analysis, we also characterize a novel role for Sybu in the collection of germ plasm in vegetal cleavage furrows by surface contraction waves. Additionally, we show that a localized kinesin-like protein, Kif3b, is also required for germ plasm aggregation and that Sybu functionally interacts with Kif3b and Kif4a in germ plasm aggregation. Overall, these data suggest multiple coordinate roles for kinesins and adaptor proteins in controlling the localization and distribution of a cytoplasmic determinant in early development.
Article
A mechanistic dissection of early oocyte differentiation in vertebrates is key to advancing our knowledge of germline development, reproductive biology, the regulation of meiosis, and all of their associated disorders. Recent advances in the field include breakthroughs in the identification of germline stem cells in Medaka, in the cellular architecture of the germline cyst in mice, in a mechanistic dissection of chromosomal pairing and bouquet formation in meiosis in mice, in tracing oocyte symmetry breaking to the chromosomal bouquet of meiosis in zebrafish, and in the biology of the Balbiani body, a universal oocyte granule. Many of the major events in early oogenesis are universally conserved, and some are co-opted for species-specific needs. The chromosomal events of meiosis are of tremendous consequence to gamete formation and have been extensively studied. New light is now being shed on other aspects of early oocyte differentiation, which were traditionally considered outside the scope of meiosis, and their coordination with meiotic events. The emerging theme is of meiosis as a common groundwork for coordinating multifaceted processes of oocyte differentiation. In an accompanying manuscript we describe methods that allowed for investigations in the zebrafish ovary to contribute to these breakthroughs. Here, we review these advances mostly from the zebrafish and mouse. We discuss oogenesis concepts across established model organisms, and construct an inclusive paradigm for early oocyte differentiation in vertebrates.
Article
RNA editing, a post-transcriptional process, allows the diversification of proteomes beyond the genomic blueprint; however it is infrequently used among animals for this purpose. Recent reports suggesting increased levels of RNA editing in squids thus raise the question of the nature and effects of these events. We here show that RNA editing is particularly common in behaviorally sophisticated coleoid cephalopods, with tens of thousands of evolutionarily conserved sites. Editing is enriched in the nervous system, affecting molecules pertinent for excitability and neuronal morphology. The genomic sequence flanking editing sites is highly conserved, suggesting that the process confers a selective advantage. Due to the large number of sites, the surrounding conservation greatly reduces the number of mutations and genomic polymorphisms in protein-coding regions. This trade-off between genome evolution and transcriptome plasticity highlights the importance of RNA recoding as a strategy for diversifying proteins, particularly those associated with neural function.
Article
The tissues in our bodies are complex systems composed of diverse cell types that often interact in highly structured repeating anatomical units. External gradients of morphogens, directional blood flow, as well as the secretion and absorption of materials by cells generate distinct microenvironments at different tissue coordinates. Such spatial heterogeneity enables optimized function through division of labor among cells. Unraveling the design principles that govern this spatial division of labor requires techniques to quantify the entire transcriptomes of cells while accounting for their spatial coordinates. In this review we describe how recent advances in spatial transcriptomics open the way for tissue-level systems biology.
Article
Single-cell RNA sequencing (scRNA-seq) has become an established and powerful method to investigate transcriptomic cell-to-cell variation, thereby revealing new cell types and providing insights into developmental processes and transcriptional stochasticity. A key question is how the variety of available protocols compare in terms of their ability to detect and accurately quantify gene expression. Here, we assessed the protocol sensitivity and accuracy of many published data sets, on the basis of spike-in standards and uniform data processing. For our workflow, we developed a flexible tool for counting the number of unique molecular identifiers (https://github.com/vals/umis/). We compared 15 protocols computationally and 4 protocols experimentally for batch-matched cell populations, in addition to investigating the effects of spike-in molecular degradation. Our analysis provides an integrated framework for comparing scRNA-seq protocols.