Oliver Stegle

Oliver Stegle
German Cancer Research Center | DKFZ

Ph.D., University of Cambridge

About

468
Publications
66,069
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
23,383
Citations
Additional affiliations
January 2019 - present
German Cancer Research Center
Position
  • Division head
January 2018 - present
European Molecular Biology Laboratory
Position
  • Group Leader
November 2012 - December 2018
EMBL-EBI
Position
  • Group Leader
Education
October 2005 - May 2009
University of Cambridge
Field of study
  • Physics
October 2004 - July 2009
University of Cambridge
Field of study
  • Theoretical Physics

Publications

Publications (468)
Preprint
To (re)define tissue architecture of the lung and airways at the cellular and molecular level, we profiled five proximal-to-distal locations of healthy human lungs in depth using multi-omic single cell/nuclei and Visium Spatial Transcriptomics. Using computational data integration and analysis, we extend beyond the suspension cell paradigm of lung...
Article
Full-text available
Spatial transcriptomic technologies promise to resolve cellular wiring diagrams of tissues in health and disease, but comprehensive mapping of cell types in situ remains a challenge. Here we present сell2location, a Bayesian model that can resolve fine-grained cell types in spatial transcriptomic data and create comprehensive cellular maps of diver...
Article
Full-text available
Common variable immunodeficiency (CVID), the most prevalent symptomatic primary immunodeficiency, displays impaired terminal B-cell differentiation and defective antibody responses. Incomplete genetic penetrance and ample phenotypic expressivity in CVID suggest the participation of additional pathogenic mechanisms. Monozygotic (MZ) twins discordant...
Article
Full-text available
The resolution of causal genetic variants informs understanding of disease biology. We used regulatory quantitative trait loci (QTLs) from the BLUEPRINT, GTEx and eQTLGen projects to fine-map putative causal variants for 12 immune-mediated diseases. We identify 340 unique loci that colocalize with high posterior probability (≥98%) with regulatory Q...
Preprint
Cancer genomes harbor a broad spectrum of structural variants (SV) driving tumorigenesis, a relevant subset of which are likely to escape discovery in short reads. We employed Oxford Nanopore Technologies (ONT) sequencing in a paired diagnostic and post-therapy medulloblastoma to unravel the haplotype-resolved somatic genetic and epigenetic landsca...
Article
Developmental progression and cellular diversity are largely driven by transcription factors (TFs); yet, characterizing their loss-of-function phenotypes remains challenging and often disconnected from their underlying molecular mechanisms. Here, we combine single-cell regulatory genomics with loss-of-function mutants to jointly assess both cellula...
Article
Full-text available
Factor analysis is a widely used method for dimensionality reduction in genome biology, with applications from personalized health to single-cell biology. Existing factor analysis models assume independence of the observed samples, an assumption that fails in spatio-temporal profiling studies. Here we present MEFISTO, a flexible and versatile toolb...
Article
Full-text available
Advances in multi-omics have led to an explosion of multimodal datasets to address questions from basic biology to translation. While these data provide novel opportunities for discovery, they also pose management and analysis challenges, thus motivating the development of tailored computational solutions. Here, we present a data standard and an an...
Article
Full-text available
While it is established that the functional impact of genetic variation can vary across cell types and states, capturing this diversity remains challenging. Current studies using bulk sequencing either ignore this heterogeneity or use sorted cell populations, reducing discovery and explanatory power. Here, we develop scDALI, a versatile computation...
Article
Full-text available
Virtually all patients with multiple myeloma become unresponsive to treatment over time. Relapsed/refractory multiple myeloma (RRMM) is accompanied by the clonal evolution of myeloma cells with heterogeneous genomic aberrations and profound changes of the bone marrow microenvironment (BME). However, the molecular mechanisms that drive drug resistan...
Preprint
Full-text available
Multiple distinct cell types of the human lung and airways have been defined by single cell RNA sequencing (scRNAseq). Here we present a multi-omics spatial lung atlas to define additional heterogeneity and novel cell types which we map back into the macro- and micro-anatomical tissue context to define functional tissue microenvironments. First, we...
Article
Background Relapsed/refractory multiple myeloma (RRMM) is characterized by a high inter- and intratumor heterogeneity and a complex interplay of myeloma cells with the bone marrow microenvironment (BME). Accordingly, there is an urgent need to dissect subclone structure, transcriptional heterogeneity and cellular interactions to unravel the molecul...
Article
Introduction Despite new treatment approaches, in most patients with multiple myeloma (MM) resistant subclones expand, finally resulting in the development of relapsed/refractory disease (RRMM). Recent single-cell transcriptomic (scRNA-seq) studies have improved our understanding of tumor heterogeneity in MM, but our knowledge about the regulatory...
Preprint
Full-text available
Single cell RNA sequencing (scRNA-seq) enables characterizing the cellular heterogeneity in human tissues. Technological advances have enabled the first population-scale scRNA-seq studies in hundreds of individuals, allowing to assay genetic effects with single-cell resolution. However, existing strategies to perform genetic analyses using scRNA-se...
Article
Full-text available
Mitochondrial DNA (mtDNA) variants influence the risk of late-onset human diseases, but the reasons for this are poorly understood. Undertaking a hypothesis-free analysis of 5,689 blood-derived biomarkers with mtDNA variants in 16,220 healthy donors, here we show that variants defining mtDNA haplogroups Uk and H4 modulate the level of circulating N...
Preprint
Full-text available
Common genetic variants modulate the cellular response to viruses and are implicated in a range of immune pathologies, including infectious and autoimmune diseases. The transcriptional antiviral response is known to vary between infected cells from a single individual, yet how genetic variants across individuals modulate the antiviral response (and...
Article
Full-text available
Trait-associated genetic variants affect complex phenotypes primarily via regulatory mechanisms on the transcriptome. To investigate the genetics of gene expression, we performed cis- and trans-expression quantitative trait locus (eQTL) analyses using blood-derived expression from 31,684 individuals through the eQTLGen Consortium. We detected cis-e...
Preprint
Full-text available
Chromothripsis is a form of genome instability, whereby a presumably single catastrophic event generates extensive genomic rearrangements of one or few chromosome(s). However, little is known about the heterogeneity of chromothripsis across different clones from the same tumor, as well as changes in response to treatment. We analyzed single-cell ge...
Preprint
Full-text available
Advances in multi-omics technologies have led to an explosion of multimodal datasets to address questions ranging from basic biology to translation. While these rich data provide major opportunities for discovery, they also come with data management and analysis challenges, thus motivating the development of tailored computational solutions to deal...
Article
Full-text available
The development of single-cell multimodal assays provides a powerful tool for investigating multiple dimensions of cellular heterogeneity, enabling new insights into development, tissue homeostasis and disease. A key challenge in the analysis of single-cell multimodal data is to devise appropriate strategies for tying together data across different...
Article
Full-text available
The SARS-CoV-2 virus is the causative agent of the global COVID-19 infectious disease outbreak, which can lead to acute respiratory distress syndrome (ARDS). However, it is still unclear how the virus interferes with immune cell and metabolic functions in the human body. In this study, we investigated the immune response in acute or convalescent CO...
Article
Full-text available
A Correction to this paper has been published: https://doi.org/10.1038/s41586-021-03287-8.
Article
Full-text available
X chromosome inactivation (XCI) is a dosage compensation mechanism in female mammals whereby transcription from one X chromosome is repressed. Analysis of human induced pluripotent stem cells (iPSCs) derived from female donors identified that low levels of XIST RNA correlated strongly with erosion of XCI. Proteomic analysis, RNA sequencing (RNA-seq...
Article
Full-text available
Studying the function of common genetic variants in primary human tissues and during development is challenging. To address this, we use an efficient multiplexing strategy to differentiate 215 human induced pluripotent stem cell (iPSC) lines toward a midbrain neural fate, including dopaminergic neurons, and use single-cell RNA sequencing (scRNA-seq...
Article
Full-text available
Induced pluripotent stem cells (iPSCs) are an established cellular system to study the impact of genetic variants in derived cell types and developmental contexts. However, in their pluripotent state, the disease impact of genetic variants is less well known. Here, we integrate data from 1,367 human iPSC lines to comprehensively map common and rare...
Article
Full-text available
Long-read and strand-specific sequencing technologies together facilitate the de novo assembly of high-quality haplotype-resolved human genomes without parent–child trio data. We present 64 assembled haplotypes from 32 diverse human genomes. These highly contiguous haplotype assemblies (average contig N50: 26 Mbp) integrate all forms of genetic var...
Preprint
Full-text available
Long-read and strand-specific sequencing technologies together facilitate the de novo assembly of high-quality haplotype-resolved human genomes without parent–child trio data. We present 64 assembled haplotypes from 32 diverse human genomes. These highly contiguous haplotype assemblies (average contig N50: 26 Mbp) integrate all forms of genetic var...
Article
Here we describe the LifeTime Initiative, which aims to track, understand and target human cells during the onset and progression of complex diseases, and to analyse their response to therapy at single-cell resolution. This mission will be implemented through the development, integration and application of single-cell multi-omics and imaging, artif...
Preprint
Full-text available
Factor analysis is among the most-widely used methods for dimensionality reduction in genome biology, with applications from personalized health to single-cell studies. Existing implementations of factor analysis assume independence of the observed samples, an assumption that fails in emerging spatio-temporal profiling studies. Here, we present MEF...
Article
Full-text available
LifeTime aims to track, understand and target human cells during the onset and progression of complex diseases and their response to therapy at single-cell resolution. This mission will be implemented through the development and integration of single-cell multi-omics and imaging, artificial intelligence and patient-derived experimental disease mode...
Article
Full-text available
Blood cells play essential roles in human health, underpinning physiological processes such as immunity, oxygen transport, and clotting, which when perturbed cause a significant global health burden. Here we integrate data from UK Biobank and a large-scale international collaborative effort, including data for 563,085 European ancestry participants...
Article
Full-text available
Human disease phenotypes are ultimately driven primarily by alterations in protein expression and/or function. To date, relatively little is known about the variability of the human proteome in populations and how this relates to variability in mRNA expression and to disease loci. Here, we present the first comprehensive proteomic analysis of human...
Article
Full-text available
Human disease phenotypes are driven primarily by alterations in protein expression and/or function. To date, relatively little is known about the variability of the human proteome in populations and how this relates to variability in mRNA expression and to disease loci. Here, we present the first comprehensive proteomic analysis of human induced pl...
Article
Full-text available
Human disease phenotypes are driven primarily by alterations in protein expression and/or function. To date, relatively little is known about the variability of the human proteome in populations and how this relates to variability in mRNA expression and to disease loci. Here, we present the first comprehensive proteomic analysis of human induced pl...
Article
Human disease phenotypes are driven primarily by alterations in protein expression and/or function. To date, relatively little is known about the variability of the human proteome in populations and how this relates to variability in mRNA expression and to disease loci. Here, we present the first comprehensive proteomic analysis of human induced pl...
Article
Full-text available
Variability among pluripotent stem cell (PSC) lines is a prevailing issue that hampers not only experimental reproducibility but also large-scale applications and personalized cell-based therapy. This variability could result from epigenetic and genetic factors that influence stem cell behavior. Naive culture conditions minimize epigenetic fluctuat...
Article
Full-text available
Zygotic genome activation (ZGA) is an essential transcriptional event in embryonic development that coincides with extensive epigenetic reprogramming. Complex manipulation techniques and maternal stores of proteins preclude large-scale functional screens for ZGA regulators within early embryos. Here, we combined pooled CRISPR activation (CRISPRa) w...
Article
Full-text available
Structural variants (SVs) and short tandem repeats (STRs) comprise a broad group of diverse DNA variants which vastly differ in their sizes and distributions across the genome. Here, we identify genomic features of SV classes and STRs that are associated with gene expression and complex traits, including their locations relative to eGenes, likeliho...
Article
Full-text available
Structural variants (SVs) and short tandem repeats (STRs) are important sources of genetic diversity but are not routinely analyzed in genetic studies because they are difficult to accurately identify and genotype. Because SVs and STRs range in size and type, it is necessary to apply multiple algorithms that incorporate different types of evidence...
Preprint
Full-text available
Common genetic variants can have profound effects on cellular function, but studying these effects in primary human tissue samples and during development is challenging. Human induced pluripotent stem cell (iPSC) technology holds great promise for assessing these effects across different differentiation contexts. Here, we use an efficient pooling s...
Article
Full-text available
Technological advances have enabled the profiling of multiple molecular layers at single-cell resolution, assaying cells from multiple samples or conditions. Consequently, there is a growing need for computational strategies to analyze data from complex experimental designs that include multiple data modalities and multiple groups of samples. We pr...
Article
Full-text available
Bulk and single-cell DNA sequencing has enabled reconstructing clonal substructures of somatic tissues from frequency and cooccurrence patterns of somatic variants. However, approaches to characterize phenotypic variations between clones are not established. Here we present cardelino (https://github.com/single-cell-genetics/cardelino), a computatio...
Article
Full-text available
Bulk and single-cell DNA sequencing has enabled reconstructing clonal substructures of somatic tissues from frequency and cooccurrence patterns of somatic variants. However, approaches to characterize phenotypic variations between clones are not established. Here we present cardelino (https://github.com/single-cell-genetics/cardelino), a computatio...
Article
Full-text available
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
Preprint
Full-text available
Integrated analysis using proteomics, RNAseq and polysome profiles revealed a major impact on the proteome caused by erosion of X chromosome inactivation (XCI) in human iPSCs. Low XIST RNA levels were detected in ~40% of iPSC lines from healthy female donors and strongly correlated with erosion of XCI, as evaluated by biallelic expression. Low XIST...
Article
Full-text available
In recent years, functional genomics approaches combining genetic information with bulk RNA-sequencing data have identified the downstream expression effects of disease-associated genetic risk factors through so-called expression quantitative trait locus (eQTL) analysis. Single-cell RNA-sequencing creates enormous opportunities for mapping eQTLs ac...
Article
Full-text available
Recent developments in stem cell biology have enabled the study of cell fate decisions in early human development that are impossible to study in vivo. However, understanding how development varies across individuals and, in particular, the influence of common genetic variants during this process has not been characterised. Here, we exploit human i...
Article
Full-text available
The recent boom in microfluidics and combinatorial indexing strategies, combined with low sequencing costs, has empowered single-cell sequencing technology. Thousands-or even millions-of cells analyzed in a single experiment amount to a data revolution in single-cell biology and pose unique data science problems. Here, we outline eleven challenges...
Preprint
Full-text available
Blood cells play essential roles in human health, underpinning physiological processes such as immunity, oxygen transport, and clotting, which when perturbed cause a significant health burden. Here we integrate data from UK Biobank and a large-scale international collaborative effort, including 563,946 European ancestry participants, and discover 5...
Article
Full-text available
Transcript alterations often result from somatic changes in cancer genomes¹. Various forms of RNA alterations have been described in cancer, including overexpression², altered splicing³ and gene fusions⁴; however, it is difficult to attribute these to underlying genomic changes owing to heterogeneity among patients and tumour types, and the relativ...
Preprint
Full-text available
Thousands of genetic associations with phenotypes of blood cells are known, but few are with phenotypes relevant to cell function. We performed GWAS of 63 flow-cytometry phenotypes, including measures of cell granularity, nucleic acid content, and reactivity, in 39,656 participants in the INTERVAL study, identifying 2,172 variant-trait associations...
Preprint
Full-text available
Genotype-environment interaction (G×E) studies typically focus on variants with previously known marginal associations. While such two-step filtering greatly reduces the multiple testing burden, it can miss loci with pronounced G×E effects, which tend to have weaker marginal associations. To test for G×E effects on a genome-wide scale whilst levera...
Preprint
Full-text available
The identification of causal genetic variants for common diseases improves understanding of disease biology. Here we use data from the BLUEPRINT project to identify regulatory quantitative trait loci (QTL) for three primary human immune cell types and use these to fine-map putative causal variants for twelve immune-mediated diseases. We identify 34...