Gerald Quon’s research while affiliated with Molecular and Cellular Biology Program and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (98)


ERK activity and target genes are dose-responsive to Epidermal Growth Factors
a Schematic of the experimental method. Live cells were imaged in 96-well plates for 19 hours and immediately fixed following the end of the time-lapse measurements. Plates were subsequently stained for antibody-based measurements. b Condition average response measurements for live-cell ERK biosensor (EKAR signal, shown in arbitrary units) with increasing concentrations of EGF. Data are presented as the mean of each condition (nreplicates = 3 for all conditions). c Condition average response measurements depicted as a heatmap. Each row represents the average EKAR FRET measurement for a condition, indicated by the color scale (yellow, high ERK; blue, low ERK). EGF concentration is indicated by colored triangles as in Fig. 1b. MEKi = MEK inhibitor PD0325901 (100 nM) (see Supplemental Table 1 for neplicates for each condition). d Plots of single-cell EKAR signals (in arbitrary units) for five representative cells in each indicated condition. Bold lines indicate the mean of all cells in one well of the condition. Red lines indicate the time points where treatments were added. e MCF10A cells immuno-stained with cyclic immunofluorescence. Each row depicts the same group of cells. Scale bar = 100 μm. f Nuclear quantification of cyclic immunofluorescence measurements from listed EGF condition. Plots indicate the median (bar), 25th/75th percentiles (box), and range of the data (whiskers). The dashed line indicates the median of vehicle control (Imaging Media). Variance-corrected t-tests (two-sided) were conducted by comparing each EGF-treated condition to control. nreplicates = 3. P values, relative to the 0 EGF condition, are shown on each distribution. Throughout the study, nreplicates is used to denote independent experimental replicates performed on different days.
ERK target gene expression moderately correlates with features of ERK dynamics
a Single-cell heatmap for EKAR FRET measurements and corresponding ETG intensity, each row represents one cell (ncells = 97,960, nreplicates = 3). ETG expression colored by intensity of immunofluorescence (IF) measurements (log10). b Schematic of ERK dynamic features used for analysis. Frequency was calculated by estimating the mean normalized frequency of the power spectrum of the EKAR FRET measurement time series for each cell. c Pearson correlation (r) between each ERK feature and each cyclic IF measurement, where single- cell values were used. d Pearson correlation (r) between single-cell ETG measurements and the EKAR FRET measurement at each timepoint from the live-cell experiment. e Spatial heatmap of EKAR (left) and ETG (right) measurements from a single well (control condition). Heatmap is organized by proximity of cells to each other so that neighboring cells in the well are plotted closer to each other in the heatmap. ETG colormap indicates the relative log intensity of data within each column; outliers in pERK column skew colormap towards red. (black = NA). Magenta box indicates cells pictured in f. White arrows indicate additional cells that recently activated ERK, which resulted in higher Egr-1 expression (right). f Images corresponding to the cells plotted within the magenta box in e, representing an example of an association that was observed consistently across all 3 experimental replicates. All panels shown are registered images of the same cells, with the scale bar indicating 50 μm shown in the Hoechst-stained image.
Convolutional neural network identifies non-linear signal transmission
a For each ETG, three types of prediction models were separately trained. Top: Simplified schematic of convolutional neural network architecture containing two convolutional layers and three fully connected layers. Bottom left: Multiple variable regression where ERK activity at each time point is considered as a predictor variable (TS linear). Bottom right: Multiple variable linear regression where nine features of ERK activity are considered as predictor variables (Featurized linear). b Top: Bar plot indicating R² for three models used to predict ETG levels. Bottom: Bar plot indicating mean square error for three models used to predict ETG levels. Error bars represent standard error calculated using values from each fold of the 5-fold cross-validation partitions. c Scatter plot of the predicted and observed values of the CNN trained on all 190 timepoints (19 hr). The data represent standardized (z-scored) values. d Feature attribution heatmap showing the importance of each timepoint in the CNN model trained on 150 timepoints (15 h). Colormap represents relative values within each row. c and d represent the validation set of the first 5-fold partition.
ERK target gene expression predicts history of ERK activation
a Single-cell regression showing the coefficient of determination (R2) of linear regression models that use ETGs to predict each ERK feature. 10-fold cross-validation was conducted to retrieve the best test-set model. This model was then fit on the full dataset. “All” indicates multiple regression models using all ETGs as predictors. b Scatter plots of single-cell regression models showing line of best fit. Color indicates relative density of the data. c Scatter plot showing each cell’s predicted (x-axis) vs true (y-axis) value in the multiple linear regression (MLR) models. d Results of adding predictors to MLR models. Color of each point indicates which predictor was added at each step. e Average values were calculated for all cells with the same condition. These values were then used to fit regression models that predict each ERK feature using ETGs. f Scatter plots showing line of best fit and confidence intervals for condition average regression models. Each dot indicates the average of one condition.
Cancer cell types display deficiencies in processing ERK dynamics
a Condition average responses for live-cell ERK biosensor (EKAR) under four conditions. Data are presented as the mean of each condition (nwell replicates = 3). b Single-cell response plots to indicated condition. The bold line indicates the average of all cells in one well of the condition. c Pearson correlation (r) between single-cell protein measurements and the EKAR FRET measurement at each timepoint from the live-cell experiment. d Single-cell regression showing the coefficient of determination (R²) of linear regression models that use protein levels to predict each ERK feature. 10-fold cross-validation was conducted to retrieve the best test-set model. This model was then fit on the full dataset. “All” indicates multiple regression models using all proteins as predictors. e Percentage of cells classified as pRb-positive or pRb-negative; intensity thresholds identified individually for each cell line. Independent t-tests (two-sided) were conducted by comparing each condition to the Imaging Medium control. nreplicates = 3. p-vals (left to right): 0.000527, 0.000002, 0.033124, 0.020208, 0.001571. Error bars: standard deviation; a.u., arbitrary units.

+2

Deciphering the history of ERK activity from fixed-cell immunofluorescence measurements
  • Article
  • Full-text available

May 2025

·

2 Reads

Abhineet Ram

·

Michael Pargett

·

Yongin Choi

·

[...]

·

The RAS/ERK pathway plays a central role in diagnosis and therapy for many cancers. ERK activity is highly dynamic within individual cells and drives cell proliferation, metabolism, and other processes through effector proteins including c-Myc, c-Fos, Fra-1, and Egr-1. These proteins are sensitive to the dynamics of ERK activity, but it is not clear to what extent the pattern of ERK activity in an individual cell determines effector protein expression, or how much information about ERK dynamics is embedded in the pattern of effector expression. Here, we evaluate these relationships using live-cell biosensor measurements of ERK activity, multiplexed with immunofluorescence staining for downstream target proteins of the pathway. Combining these datasets with linear regression, machine learning, and differential equation models, we develop an interpretive framework for immunofluorescence data, wherein Fra-1 and pRb levels imply long-term activation of ERK signaling, while Egr-1 and c-Myc indicate more recent activation. Analysis of multiple cancer cell lines reveals a distorted relationship between ERK activity and cell state in malignant cells. We show that this framework can infer various classes of ERK dynamics from effector protein stains within a heterogeneous population, providing a basis for annotating ERK dynamics within fixed cells.

Download

sciLaMA: A Single-Cell Representation Learning Framework to Leverage Prior Knowledge from Large Language Models

February 2025

·

11 Reads

A bstract Single-cell RNA sequencing (scRNA-seq) enables high-resolution exploration of cellular diversity and gene regulation, yet analyzing such data remains challenging due to technical and methodological limitations. Existing task-specific deep generative models like Variational Auto-Encoder (VAE) and its variants struggle to incorporate external biological knowledge, while transformer-based foundational large Language Models (LLMs or large LaMs) face limitations in computational cost and applicability to tabular gene expression data. Here, we introduce sciLaMA (single-cell interpretable Language Model Adapter), a novel representation learning framework that bridges these gaps by integrating static gene embeddings from multimodal LaMs with scRNA-seq tabular data through a paired-VAE architecture. Our approach generates context-aware representations for both cells and genes and outperforms state-of-the-art methods in key single-cell downstream tasks, including batch effect correction, cell clustering, and cell-state-specific gene marker and module identification, while maintaining computational efficiency. sciLaMA offers a computationally efficient, unified framework for comprehensive single-cell data analysis and biologically interpretable gene module discovery.


scPair: Boosting single cell multimodal analysis by leveraging implicit feature selection and single cell atlases

November 2024

·

31 Reads

·

2 Citations

Multimodal single-cell assays profile multiple sets of features in the same cells and are widely used for identifying and mapping cell states between chromatin and mRNA and linking regulatory elements to target genes. However, the high dimensionality of input features and shallow sequencing depth compared to unimodal assays pose challenges in data analysis. Here we present scPair, a multimodal single-cell data framework that overcomes these challenges by employing an implicit feature selection approach. scPair uses dual encoder-decoder structures trained on paired data to align cell states across modalities and predict features from one modality to another. We demonstrate that scPair outperforms existing methods in accuracy and execution time, and facilitates downstream tasks such as trajectory inference. We further show scPair can augment smaller multimodal datasets with larger unimodal atlases to increase statistical power to identify groups of transcription factors active during different stages of neural differentiation.


A scheme showing the overall mouse study design
a Cortical samples were collected from postnatal mice at four different timepoints corresponding to three different disease stages (n = 28). Four different Mecp2e1 genotypes were considered that include both sexes. b UMAP of the unsupervised clustering of cell types (n = 93,798 cells post QC) identified. Cell type labels were transferred from³⁹ Yao et al. 2021. c Top gene markers for each cell type are shown on y-axis. The color refers to the average expression of genes in a cell type and the percent expressed describes the percentage of cells within a cell type that express each gene marker. d Design of computational experiments comparing mutant to WT cells from mice of both sexes. Experiments 3 to 5 are comparing subtypes of cells in females due to X chromosome inactivation to examine potential non-cell-autonomous effects of Mecp2e1 mutation. Figure 1a, d is made using Biorender.
Sexually-dimorphic dynamic patterns of DEGs and KEGG pathway terms across time and cell type
a Heatmap of top 5 differentially expressed genes (DEGs) based on the lowest adjusted p-value ≤ 0.05 comparing male Mecp2e1-/y and Mecp2e1+/y cortical cells across timepoints (experiment 1). b Heatmap of top 5 DEGs comparing female Mecp2e1-/+ and Mecp2e1+/+ cortical cells across timepoints (experiment 2). *indicates adjusted p-value ≤ 0.05 (corrected via Benjamini and Hochberg method). c, d Number of DEGs over time at adjusted p-value ≤ 0.05 for experiments 1 and 2, respectively. e, f Dot plots showing the KEGG pathway terms for DEGs (adjusted p-value ≤ 0.1) from each cell type, selected for terms that are persistent over time for experiments 1 and 2, respectively.
HdWGCNA identifies co-expression networks for each cell type in the mouse cortex that correlated with Mecp2e1 genotype, disease phenotypes, and sex
a Top 10 hub genes identified for each of the 9 modules generated by hdWGCNA on entire snRNA-seq dataset, identified by color. The x-axis are all the genes in each module and the y-axis is the corresponding kME value. b Dot plot of the average gene expression of the top 10 hub genes in each module generated for each cortical cell type. c A heat map of correlations between experimental phenotypes and variables (body weight, disease score, genotype, time point, sex) and averaged gene expression (eigenmode value) for each cell type (cell types not shown are in Supplementary Fig. 7a). *, **, *** indicates FDR-corrected p-value ≤ 0.05, 0.01, and 0.001, respectively. The color bar shows the Pearson correlation coefficient.
Dynamic non-cell-autonomous effects on differentially expressed genes and KEGG pathways over disease progression
As shown in Experiment #3 (Fig. 1d), we compared WT cells from Mecp2e1-/+ female with WT cells from Mecp2e1+/+ glutamatergic and GABAergic neurons longitudinally. a UMAP plot of cell types identified in the mosaic females. b UMAP plot of the female cortices showing the clustering of the broad cell type categories. c UMAP plot of mosaic female cells parsed by Mecp2 allele d Volcano plots showing differentially expressed genes (DEGs) of the mouse cortical neurons contrasting WT cells from WT Mecp2e1+/+ females and WT cells from Mecp2e1-/+ mosaic females. e, f Venn diagrams of overlapping glutamatergic and GABAergic DEGs respectfully over time. g, i Venn diagrams of significant KEGG terms of glutamatergic and GABAergic neurons over time. h. Top 10 KEGG terms of glutamatergic neurons over time. j. Top 10 KEGG terms of GABAergic neurons over time.
Human RTT cortical neurons share transcriptional dysregulation specifically with Mecp2e1-/+ mosaic female mice
a A schematic of postmortem human RTT cortices and age/sex matched control cortices. Figure 1a was made using Biorender. b UMAP of the unsupervised clustering of cell types identified in the human cortices (n = 39336 cells post QC). Cell type labels were transferred from Bakken Trygve et al. 2021²⁰. c Top gene markers for each cell type in the human cortex. d Heatmap of top differentially expressed genes (DEGs) for human female cortices. *indicates adjusted p-value ≤ 0.05. e Bar graph showing overlapping of the top significant upregulated and downregulated genes by logFC in female mouse and female human. f Upset plot showing overlap of the significant DEGs from both GABAergic and glutamatergic neurons in female human, female mouse, and male mouse.
Sex-specific single cell-level transcriptomic signatures of Rett syndrome disease progression

October 2024

·

71 Reads

·

3 Citations

Communications Biology

Dominant X-linked diseases are uncommon due to female X chromosome inactivation (XCI). While random XCI usually protects females against X-linked mutations, Rett syndrome (RTT) is a female neurodevelopmental disorder caused by heterozygous MECP2 mutation. After 6-18 months of typical neurodevelopment, RTT girls undergo a poorly understood regression. We performed longitudinal snRNA-seq on cerebral cortex in a construct-relevant Mecp2e1 mutant mouse model of RTT, revealing transcriptional effects of cell type, mosaicism, and sex on progressive disease phenotypes. Across cell types, we observed sex differences in the number of differentially expressed genes (DEGs) with 6x more DEGs in mutant females than males. Unlike males, female DEGs emerged prior to symptoms, were enriched for homeostatic gene pathways in distinct cell types over time and correlated with disease phenotypes and human RTT cortical cell transcriptomes. Non-cell-autonomous effects were prominent and dynamic across disease progression of Mecp2e1 mutant females, indicating that wild-type-expressing cells normalize transcriptional homeostasis. These results advance our understanding of RTT progression and treatment.


Gene expansions contributing to human brain evolution

September 2024

·

140 Reads

Genomic drivers of human-specific neurological traits remain largely undiscovered. Duplicated genes expanded uniquely in the human lineage likely contributed to brain evolution, including the increased complexity of synaptic connections between neurons and the dramatic expansion of the neocortex. Discovering duplicate genes is challenging because the similarity of paralogs makes them prone to sequence-assembly errors. To mitigate this issue, we analyzed a complete telomere-to-telomere human genome sequence (T2T-CHM13) and identified 213 duplicated gene families likely containing human-specific paralogs (>98% identity). Positing that genes important in universal human brain features should exist with at least one copy in all modern humans and exhibit expression in the brain, we narrowed in on 362 paralogs with at least one copy across thousands of ancestrally diverse genomes and present in human brain transcriptomes. Of these, 38 paralogs co-express in gene modules enriched for autism-associated genes and potentially contribute to human language and cognition. We narrowed in on 13 duplicate gene families with human-specific paralogs that are fixed among modern humans and show convincing brain expression patterns. Using long-read DNA sequencing revealed hidden variation across 200 modern humans of diverse ancestries, uncovering signatures of selection not previously identified, including possible balancing selection of CD8B . To understand the roles of duplicated genes in brain development, we generated zebrafish CRISPR “knockout” models of nine orthologs and transiently introduced mRNA-encoding paralogs, effectively “humanizing” the larvae. Morphometric, behavioral, and single-cell RNA-seq screening highlighted, for the first time, a possible role for GPR89B in dosage-mediated brain expansion and FRMPD2B function in altered synaptic signaling, both hallmark features of the human brain. Our holistic approach provides important insights into human brain evolution as well as a resource to the community for studying additional gene expansion drivers of human brain evolution. Abstract (short) Duplicated genes expanded in the human lineage likely contributed to brain evolution, yet challenges exist in their discovery due to sequence-assembly errors. We used a complete telomere-to-telomere genome sequence to identify 213 human-specific gene families. From these, 362 paralogs were found in all modern human genomes tested and brain transcriptomes, making them top candidates contributing to human-universal brain features. Choosing a subset of paralogs, we used long-read DNA sequencing of hundreds of modern humans to reveal previously hidden signatures of selection. To understand their roles in brain development, we generated zebrafish CRISPR “knockout” models of nine orthologs and introduced mRNA-encoding paralogs, effectively “humanizing” larvae. Our findings implicate two new genes in possibly contributing to hallmark features of the human brain: GPR89B in dosage-mediated brain expansion and FRMPD2B in altered synapse signaling. Our holistic approach provides new insights and a comprehensive resource for studying gene expansion drivers of human brain evolution.


Sex-specific single cell-level transcriptomic signatures of Rett syndrome disease progression

May 2024

·

34 Reads

·

1 Citation

Dominant X-linked diseases are uncommon due to female X chromosome inactivation (XCI). While random XCI usually protects females against X-linked mutations, Rett syndrome (RTT) is a female neurodevelopmental disorder caused by heterozygous MECP2 mutation. After 6-18 months of typical neurodevelopment, RTT girls undergo poorly understood regression. We performed longitudinal snRNA-seq on cerebral cortex in a construct-relevant Mecp2e1 mutant mouse model of RTT, revealing transcriptional effects of cell type, mosaicism, and sex on progressive disease phenotypes. Across cell types, we observed sex differences in the number of differentially expressed genes (DEGs) with 6x more DEGs in mutant females than males. Unlike males, female DEGs emerged prior to symptoms, were enriched for homeostatic gene pathways in distinct cell types over time, and correlated with disease phenotypes and human RTT cortical cell transcriptomes. Non-cell-autonomous effects were prominent and dynamic across disease progression of Mecp2e1 mutant females, indicating wild-type-expressing cells normalizing transcriptional homeostasis. These results improve understanding of RTT progression and treatment.


Microbiome-based classification models for fresh produce safety and quality evaluation

March 2024

·

55 Reads

·

2 Citations

Microbiology Spectrum

Small sample sizes and loss of sequencing reads during the microbiome data preprocessing can limit the statistical power of differentiating fresh produce phenotypes and prevent the detection of important bacterial species associated with produce contamination or quality reduction. Here, we explored a machine learning-based k -mer hash analysis strategy to identify DNA signatures predictive of produce safety (PS) and produce quality (PQ) and compared it against the amplicon sequence variant (ASV) strategy that uses a typical denoising step and ASV-based taxonomy strategy. Random forest-based classifiers for PS and PQ using 7-mer hash data sets had significantly higher classification accuracy than those using the ASV data sets. We also demonstrated that the proposed combination of integrating multiple data sets and leveraging a 7-mer hash strategy leads to better classification performance for PS and PQ compared to the ASV method but presents lower PS classification accuracy compared to the feature-selected ASV-based taxonomy strategy. Due to the current limitation of generating taxonomy using the 7-mer hash strategy, the ASV-based taxonomy strategy with remarkably less computing time and memory usage is more efficient for PS and PQ classification and applicable for important taxa identification. Results generated from this study lay the foundation for future studies that wish and need to incorporate and/or compare different microbiome sequencing data sets for the application of machine learning in the area of microbial safety and quality of food. IMPORTANCE Identification of generalizable indicators for produce safety (PS) and produce quality (PQ) improves the detection of produce contamination and quality decline. However, effective sequencing read loss during microbiome data preprocessing and the limited sample size of individual studies restrain statistical power to identify important features contributing to differentiating PS and PQ phenotypes. We applied machine learning-based models using individual and integrated k -mer hash and amplicon sequence variant (ASV) data sets for PS and PQ classification and evaluated their classification performance and found that random forest (RF)-based models using integrated 7-mer hash data sets achieved significantly higher PS and PQ classification accuracy. Due to the limitation of taxonomic analysis for the 7-mer hash, we also developed RF-based models using feature-selected ASV-based taxonomic data sets, which performed better PS classification than those using the integrated 7-mer hash data set. The RF feature selection method identified 480 PS indicators and 263 PQ indicators with a positive contribution to the PS and PQ classification.


Deciphering the History of ERK Activity from Fixed-Cell Immunofluorescence Measurements

February 2024

·

35 Reads

The Ras/ERK pathway drives cell proliferation and other oncogenic behaviors, and quantifying its activity in situ is of high interest in cancer diagnosis and therapy. Pathway activation is often assayed by measuring phosphorylated ERK. However, this form of measurement overlooks dynamic aspects of signaling that can only be observed over time. In this study, we combine a live, single-cell ERK biosensor approach with multiplexed immunofluorescence staining of downstream target proteins to ask how well immunostaining captures the dynamic history of ERK activity. Combining linear regression, machine learning, and differential equation models, we develop an interpretive framework for immunostains, in which Fra-1 and pRb levels imply long term activation of ERK signaling, while Egr-1 and c-Myc indicate recent activation. We show that this framework can distinguish different classes of ERK dynamics within a heterogeneous population, providing a tool for annotating ERK dynamics within fixed tissues.


Projecting RNA measurements onto single cell atlases to extract cell type-specific expression profiles using scProjection

August 2023

·

84 Reads

·

7 Citations

Multi-modal single cell RNA assays capture RNA content as well as other data modalities, such as spatial cell position or the electrophysiological properties of cells. Compared to dedicated scRNA-seq assays however, they may unintentionally capture RNA from multiple adjacent cells, exhibit lower RNA sequencing depth compared to scRNA-seq, or lack genome-wide RNA measurements. We present scProjection, a method for mapping individual multi-modal RNA measurements to deeply sequenced scRNA-seq atlases to extract cell type-specific, single cell gene expression profiles. We demonstrate several use cases of scProjection, including identifying spatial motifs from spatial transcriptome assays, distinguishing RNA contributions from neighboring cells in both spatial and multi-modal single cell assays, and imputing expression measurements of un-measured genes from gene markers. scProjection therefore combines the advantages of both multi-modal and scRNA-seq assays to yield precise multi-modal measurements of single cells.


Modeling collective cell behavior in cancer: Perspectives from an interdisciplinary conversation

April 2023

·

112 Reads

·

8 Citations

Cell Systems

Collective cell behavior contributes to all stages of cancer progression. Understanding how collective behavior emerges through cell-cell interactions and decision-making will advance our understanding of cancer biology and provide new therapeutic approaches. Here, we summarize an interdisciplinary discussion on multicellular behavior in cancer, draw lessons from other scientific disciplines, and identify future directions.


Citations (42)


... Among these, MultiVI (21) applies unsupervised variational autoencoders for cross-modality alignment, while scPair (22) implements a supervised learning approach that links chromatin accessibility and gene expression end to end. These frameworks are powerful for learning biologically meaningful low-dimensional representations of the data across modalities. ...

Reference:

Single-cell multiomics reveals disrupted glial gene regulatory programs in Alzheimer's disease via interpretable machine learning
scPair: Boosting single cell multimodal analysis by leveraging implicit feature selection and single cell atlases

... Despite female samples being underrepresented four-fold in ASD biospecimens, sex stratification resulted in greater power to detect informative DNA methylation patterns in females than the usual approach of combining both sexes in the analyses and adjusting for sex as a confounding variable. For example, in the X-linked dominant Rett syndrome (MIM#312750), caused by heterozygous mutations of MECP2 in females, sex differences in disease models go beyond what is expected based on gene dosage alone [24,25]. Even for syndromic forms of IDD caused by autosomal gene mutations in humans and mouse models, disease severity and phenotypes can show profound sex differences [26,27]. ...

Sex-specific single cell-level transcriptomic signatures of Rett syndrome disease progression

Communications Biology

... Indeed, new data are now available, which allows us to study MECP2 function in a cell type-specific perspective [62]. In this context, highly informative data are the transcriptome and epigenome analyzed at the single-cell level, which allows us to obtain information about the effect of MECP2 mutations at cellular-and allele-specific levels [94,95]. This is particularly important considering that in females, the expression of MECP2 mutation is a mosaic due to random X-chromosome inactivation [96,97]. ...

Sex-specific single cell-level transcriptomic signatures of Rett syndrome disease progression

... Johansen et al. [2023],Lopez et al. [2019],,Biancalani et al. [2021]. The experimental setup employed a leave-one-gene-out strategy, where the expression of a single gene was masked across all cells, and the models were tasked with predicting its expression pattern based on the remaining genes. ...

Projecting RNA measurements onto single cell atlases to extract cell type-specific expression profiles using scProjection

... Continuum models that use partial differential equations 33 (PDEs) have been used to simulate tumor growth and invasion at the tissue level. These 34 models treat the tumor as a continuous medium and account for variables such as cell den- 35 sity, nutrient concentration, and ECM composition [36][37][38][39][40] . PDE-based models are beneficial 36 for studying the effects of biochemical gradients, such as oxygen or nutrient gradients, on 37 tumor growth and invasion 41 . ...

Modeling collective cell behavior in cancer: Perspectives from an interdisciplinary conversation
  • Citing Article
  • April 2023

Cell Systems

... Still, there are some approaches that incorporate interpretability already as part of the model design. For example, the siVAE approach simultaneously infers a cell and gene embedding space via two encoder-decoder frameworks and and uses an additional regularization term in the loss function, where embeddings of genes indicate their contribution to distinct dimensions of the cell embedding space 36 . However, the dimensions in the cell embedding may still be entangled, and the contribution of variables to the dimensions of the cell embedding is not constrained to be sparse. ...

siVAE: interpretable deep generative models for single-cell transcriptomes

Genome Biology

... Single cell assays have been developed to capture diverse aspects of genome regulation, including gene expression 1,2 , chromatin accessibility 3 , and methylation profiling 4,5 , among others 6,7 . These single modality assays that capture a single data type have been widely deployed on a variety of tissues and species to catalog cell types and states [8][9][10][11][12][13][14][15][16] , identify genomic features that activate at specific steps along cellular trajectories [17][18][19][20][21][22][23] , and infer regulatory networks cataloging interactions of genes, open chromatin regions or methylation sites [24][25][26] . A common step of single cell data analysis is cell state inference: the inference of a low dimensional representation of a single cell data modality, that is subsequently used for 2D data visualization 27,28 , clustering to identify discrete cell types and states, and trajectory inference tasks 29,30 . ...

Cell adhesion molecules play subclass-specific roles in electrophysiological response and Schizophrenia risk

... Current eQTL studies have used coarser groupings of cells than in this study and found eQTLs specific to excitatory and GABAergic neurons (27) that we can further resolved into subclasslevel eQTLs to hone in more granular cell types targeted by variants associated with neurological disease. Furthermore, computational deconvolution (14,15) of eQTL analyses performed on bulk tissue (26,62,73) to resolve cell type signals can be refined using (14,15,26) the subclassspecific eQTLs reported in this study. ...

Projecting clumped transcriptomes onto single cell atlases to achieve single cell resolution

... As CTCF is important in setting up 3D chromatin interactions and TADs, we also quantified the overlap between differentially accessible regions and TADs in A549 cells (GSE92819) and found significant overlap (Fig. 7a). In addition, evidence from mouse model and HBECs support a protective role of TET1 against allergic asthma, which could be due to its role in promoting AhR signalling and limiting the expression of pro-inflammatory cytokines [9,67]. Therefore, we searched for the presence of asthma-associated variations in histone acetylation [51] and DNA methylation found in airway epithelial cells [52][53][54][55] among TET1-loss induced differentially accessible regions and DMRs. ...

Single-Cell RNA-Seq Analysis Reveals Lung Epithelial Cell Type-Specific Responses to HDM and Regulation by Tet1

... Studies that use single-cell sequencing generally have different research questions, and are focused on clustering and integration using AE architectures (e.g. [67][68][69][70]). Recent popular publications on these tasks use foundation models, which are promising as they can perform several tasks such as cell-type annotation and batch correction. ...

Interpretable deep generative models for genomics
  • Citing Preprint
  • September 2021