
Oliver StegleGerman Cancer Research Center | DKFZ
Oliver Stegle
Ph.D., University of Cambridge
About
468
Publications
66,069
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
23,383
Citations
Introduction
Additional affiliations
January 2019 - present
January 2018 - present
November 2012 - December 2018
Education
October 2005 - May 2009
October 2004 - July 2009
Publications
Publications (468)
To (re)define tissue architecture of the lung and airways at the cellular and molecular level, we profiled five proximal-to-distal locations of healthy human lungs in depth using multi-omic single cell/nuclei and Visium Spatial Transcriptomics. Using computational data integration and analysis, we extend beyond the suspension cell paradigm of lung...
Spatial transcriptomic technologies promise to resolve cellular wiring diagrams of tissues in health and disease, but comprehensive mapping of cell types in situ remains a challenge. Here we present сell2location, a Bayesian model that can resolve fine-grained cell types in spatial transcriptomic data and create comprehensive cellular maps of diver...
Common variable immunodeficiency (CVID), the most prevalent symptomatic primary immunodeficiency, displays impaired terminal B-cell differentiation and defective antibody responses. Incomplete genetic penetrance and ample phenotypic expressivity in CVID suggest the participation of additional pathogenic mechanisms. Monozygotic (MZ) twins discordant...
The resolution of causal genetic variants informs understanding of disease biology. We used regulatory quantitative trait loci (QTLs) from the BLUEPRINT, GTEx and eQTLGen projects to fine-map putative causal variants for 12 immune-mediated diseases. We identify 340 unique loci that colocalize with high posterior probability (≥98%) with regulatory Q...
Cancer genomes harbor a broad spectrum of structural variants (SV) driving tumorigenesis, a relevant subset of which are likely to escape discovery in short reads. We employed Oxford Nanopore Technologies (ONT) sequencing in a paired diagnostic and post-therapy medulloblastoma to unravel the haplotype-resolved somatic genetic and epigenetic landsca...
Developmental progression and cellular diversity are largely driven by transcription factors (TFs); yet, characterizing their loss-of-function phenotypes remains challenging and often disconnected from their underlying molecular mechanisms. Here, we combine single-cell regulatory genomics with loss-of-function mutants to jointly assess both cellula...
Factor analysis is a widely used method for dimensionality reduction in genome biology, with applications from personalized health to single-cell biology. Existing factor analysis models assume independence of the observed samples, an assumption that fails in spatio-temporal profiling studies. Here we present MEFISTO, a flexible and versatile toolb...
Advances in multi-omics have led to an explosion of multimodal datasets to address questions from basic biology to translation. While these data provide novel opportunities for discovery, they also pose management and analysis challenges, thus motivating the development of tailored computational solutions. Here, we present a data standard and an an...
While it is established that the functional impact of genetic variation can vary across cell types and states, capturing this diversity remains challenging. Current studies using bulk sequencing either ignore this heterogeneity or use sorted cell populations, reducing discovery and explanatory power. Here, we develop scDALI, a versatile computation...
Virtually all patients with multiple myeloma become unresponsive to treatment over time. Relapsed/refractory multiple myeloma (RRMM) is accompanied by the clonal evolution of myeloma cells with heterogeneous genomic aberrations and profound changes of the bone marrow microenvironment (BME). However, the molecular mechanisms that drive drug resistan...
Multiple distinct cell types of the human lung and airways have been defined by single cell RNA sequencing (scRNAseq). Here we present a multi-omics spatial lung atlas to define additional heterogeneity and novel cell types which we map back into the macro- and micro-anatomical tissue context to define functional tissue microenvironments. First, we...
Background
Relapsed/refractory multiple myeloma (RRMM) is characterized by a high inter- and intratumor heterogeneity and a complex interplay of myeloma cells with the bone marrow microenvironment (BME). Accordingly, there is an urgent need to dissect subclone structure, transcriptional heterogeneity and cellular interactions to unravel the molecul...
Introduction
Despite new treatment approaches, in most patients with multiple myeloma (MM) resistant subclones expand, finally resulting in the development of relapsed/refractory disease (RRMM). Recent single-cell transcriptomic (scRNA-seq) studies have improved our understanding of tumor heterogeneity in MM, but our knowledge about the regulatory...
Single cell RNA sequencing (scRNA-seq) enables characterizing the cellular heterogeneity in human tissues. Technological advances have enabled the first population-scale scRNA-seq studies in hundreds of individuals, allowing to assay genetic effects with single-cell resolution. However, existing strategies to perform genetic analyses using scRNA-se...
Mitochondrial DNA (mtDNA) variants influence the risk of late-onset human diseases, but the reasons for this are poorly understood. Undertaking a hypothesis-free analysis of 5,689 blood-derived biomarkers with mtDNA variants in 16,220 healthy donors, here we show that variants defining mtDNA haplogroups Uk and H4 modulate the level of circulating N...
Common genetic variants modulate the cellular response to viruses and are implicated in a range of immune pathologies, including infectious and autoimmune diseases. The transcriptional antiviral response is known to vary between infected cells from a single individual, yet how genetic variants across individuals modulate the antiviral response (and...
Trait-associated genetic variants affect complex phenotypes primarily via regulatory mechanisms on the transcriptome. To investigate the genetics of gene expression, we performed cis- and trans-expression quantitative trait locus (eQTL) analyses using blood-derived expression from 31,684 individuals through the eQTLGen Consortium. We detected cis-e...
Chromothripsis is a form of genome instability, whereby a presumably single catastrophic event generates extensive genomic rearrangements of one or few chromosome(s). However, little is known about the heterogeneity of chromothripsis across different clones from the same tumor, as well as changes in response to treatment. We analyzed single-cell ge...
Advances in multi-omics technologies have led to an explosion of multimodal datasets to address questions ranging from basic biology to translation. While these rich data provide major opportunities for discovery, they also come with data management and analysis challenges, thus motivating the development of tailored computational solutions to deal...
The development of single-cell multimodal assays provides a powerful tool for investigating multiple dimensions of cellular heterogeneity, enabling new insights into development, tissue homeostasis and disease. A key challenge in the analysis of single-cell multimodal data is to devise appropriate strategies for tying together data across different...
The SARS-CoV-2 virus is the causative agent of the global COVID-19 infectious disease outbreak, which can lead to acute respiratory distress syndrome (ARDS). However, it is still unclear how the virus interferes with immune cell and metabolic functions in the human body. In this study, we investigated the immune response in acute or convalescent CO...
A Correction to this paper has been published: https://doi.org/10.1038/s41586-021-03287-8.
X chromosome inactivation (XCI) is a dosage compensation mechanism in female mammals whereby transcription from one X chromosome is repressed. Analysis of human induced pluripotent stem cells (iPSCs) derived from female donors identified that low levels of XIST RNA correlated strongly with erosion of XCI. Proteomic analysis, RNA sequencing (RNA-seq...
Studying the function of common genetic variants in primary human tissues and during development is challenging. To address this, we use an efficient multiplexing strategy to differentiate 215 human induced pluripotent stem cell (iPSC) lines toward a midbrain neural fate, including dopaminergic neurons, and use single-cell RNA sequencing (scRNA-seq...
Induced pluripotent stem cells (iPSCs) are an established cellular system to study the impact of genetic variants in derived cell types and developmental contexts. However, in their pluripotent state, the disease impact of genetic variants is less well known. Here, we integrate data from 1,367 human iPSC lines to comprehensively map common and rare...
Long-read and strand-specific sequencing technologies together facilitate the de novo assembly of high-quality haplotype-resolved human genomes without parent–child trio data. We present 64 assembled haplotypes from 32 diverse human genomes. These highly contiguous haplotype assemblies (average contig N50: 26 Mbp) integrate all forms of genetic var...
Long-read and strand-specific sequencing technologies together facilitate the de novo assembly of high-quality haplotype-resolved human genomes without parent–child trio data. We present 64 assembled haplotypes from 32 diverse human genomes. These highly contiguous haplotype assemblies (average contig N50: 26 Mbp) integrate all forms of genetic var...
Here we describe the LifeTime Initiative, which aims to track, understand and target human cells during the onset and progression of complex diseases, and to analyse their response to therapy at single-cell resolution. This mission will be implemented through the development, integration and application of single-cell multi-omics and imaging, artif...
Factor analysis is among the most-widely used methods for dimensionality reduction in genome biology, with applications from personalized health to single-cell studies. Existing implementations of factor analysis assume independence of the observed samples, an assumption that fails in emerging spatio-temporal profiling studies. Here, we present MEF...
LifeTime aims to track, understand and target human cells during the onset and progression of complex diseases and their response to therapy at single-cell resolution. This mission will be implemented through the development and integration of single-cell multi-omics and imaging, artificial intelligence and patient-derived experimental disease mode...
Blood cells play essential roles in human health, underpinning physiological processes such as immunity, oxygen transport, and clotting, which when perturbed cause a significant global health burden. Here we integrate data from UK Biobank and a large-scale international collaborative effort, including data for 563,085 European ancestry participants...
Human disease phenotypes are ultimately driven primarily by alterations in protein expression and/or function. To date, relatively little is known about the variability of the human proteome in populations and how this relates to variability in mRNA expression and to disease loci. Here, we present the first comprehensive proteomic analysis of human...
Human disease phenotypes are driven primarily by alterations in protein expression and/or function. To date, relatively little is known about the variability of the human proteome in populations and how this relates to variability in mRNA expression and to disease loci. Here, we present the first comprehensive proteomic analysis of human induced pl...
Human disease phenotypes are driven primarily by alterations in protein expression and/or function. To date, relatively little is known about the variability of the human proteome in populations and how this relates to variability in mRNA expression and to disease loci. Here, we present the first comprehensive proteomic analysis of human induced pl...
Human disease phenotypes are driven primarily by alterations in protein expression and/or function. To date, relatively little is known about the variability of the human proteome in populations and how this relates to variability in mRNA expression and to disease loci. Here, we present the first comprehensive proteomic analysis of human induced pl...
Variability among pluripotent stem cell (PSC) lines is a prevailing issue that hampers not only experimental reproducibility but also large-scale applications and personalized cell-based therapy. This variability could result from epigenetic and genetic factors that influence stem cell behavior. Naive culture conditions minimize epigenetic fluctuat...
Zygotic genome activation (ZGA) is an essential transcriptional event in embryonic development that coincides with extensive epigenetic reprogramming. Complex manipulation techniques and maternal stores of proteins preclude large-scale functional screens for ZGA regulators within early embryos. Here, we combined pooled CRISPR activation (CRISPRa) w...
Structural variants (SVs) and short tandem repeats (STRs) comprise a broad group of diverse DNA variants which vastly differ in their sizes and distributions across the genome. Here, we identify genomic features of SV classes and STRs that are associated with gene expression and complex traits, including their locations relative to eGenes, likeliho...
Structural variants (SVs) and short tandem repeats (STRs) are important sources of genetic diversity but are not routinely analyzed in genetic studies because they are difficult to accurately identify and genotype. Because SVs and STRs range in size and type, it is necessary to apply multiple algorithms that incorporate different types of evidence...
Common genetic variants can have profound effects on cellular function, but studying these effects in primary human tissue samples and during development is challenging. Human induced pluripotent stem cell (iPSC) technology holds great promise for assessing these effects across different differentiation contexts. Here, we use an efficient pooling s...
Technological advances have enabled the profiling of multiple molecular layers at single-cell resolution, assaying cells from multiple samples or conditions. Consequently, there is a growing need for computational strategies to analyze data from complex experimental designs that include multiple data modalities and multiple groups of samples. We pr...
Bulk and single-cell DNA sequencing has enabled reconstructing clonal substructures of somatic tissues from frequency and cooccurrence patterns of somatic variants. However, approaches to characterize phenotypic variations between clones are not established. Here we present cardelino (https://github.com/single-cell-genetics/cardelino), a computatio...
Bulk and single-cell DNA sequencing has enabled reconstructing clonal substructures of somatic tissues from frequency and cooccurrence patterns of somatic variants. However, approaches to characterize phenotypic variations between clones are not established. Here we present cardelino (https://github.com/single-cell-genetics/cardelino), a computatio...
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
Integrated analysis using proteomics, RNAseq and polysome profiles revealed a major impact on the proteome caused by erosion of X chromosome inactivation (XCI) in human iPSCs. Low XIST RNA levels were detected in ~40% of iPSC lines from healthy female donors and strongly correlated with erosion of XCI, as evaluated by biallelic expression. Low XIST...
In recent years, functional genomics approaches combining genetic information with bulk RNA-sequencing data have identified the downstream expression effects of disease-associated genetic risk factors through so-called expression quantitative trait locus (eQTL) analysis. Single-cell RNA-sequencing creates enormous opportunities for mapping eQTLs ac...
Recent developments in stem cell biology have enabled the study of cell fate decisions in early human development that are impossible to study in vivo. However, understanding how development varies across individuals and, in particular, the influence of common genetic variants during this process has not been characterised. Here, we exploit human i...
The recent boom in microfluidics and combinatorial indexing strategies, combined with low sequencing costs, has empowered single-cell sequencing technology. Thousands-or even millions-of cells analyzed in a single experiment amount to a data revolution in single-cell biology and pose unique data science problems. Here, we outline eleven challenges...
Blood cells play essential roles in human health, underpinning physiological processes such as immunity, oxygen transport, and clotting, which when perturbed cause a significant health burden. Here we integrate data from UK Biobank and a large-scale international collaborative effort, including 563,946 European ancestry participants, and discover 5...
Transcript alterations often result from somatic changes in cancer genomes¹. Various forms of RNA alterations have been described in cancer, including overexpression², altered splicing³ and gene fusions⁴; however, it is difficult to attribute these to underlying genomic changes owing to heterogeneity among patients and tumour types, and the relativ...
Thousands of genetic associations with phenotypes of blood cells are known, but few are with phenotypes relevant to cell function. We performed GWAS of 63 flow-cytometry phenotypes, including measures of cell granularity, nucleic acid content, and reactivity, in 39,656 participants in the INTERVAL study, identifying 2,172 variant-trait associations...
Genotype-environment interaction (G×E) studies typically focus on variants with previously known marginal associations. While such two-step filtering greatly reduces the multiple testing burden, it can miss loci with pronounced G×E effects, which tend to have weaker marginal associations. To test for G×E effects on a genome-wide scale whilst levera...
The identification of causal genetic variants for common diseases improves understanding of disease biology. Here we use data from the BLUEPRINT project to identify regulatory quantitative trait loci (QTL) for three primary human immune cell types and use these to fine-map putative causal variants for twelve immune-mediated diseases. We identify 34...