Nicholas Panchy’s research while affiliated with The University of Tennessee Medical Center at Knoxville and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (42)


Distributions of SCLC cells across the epithelial and mesenchymal spectrum. E and M scores of (A) 15,138 mouse Myc-overexpressing SCLC GEMM tumor cells in an ex vivo model; (B) 120 SCLC cell lines; and (C) 13,945 cells from eight SCLC cell lines. Scores were computed with ssGSEA and 425 previously identified EMT genes. Circles and black bars indicate means and standard deviations. Inset in A shows normalized expression of Cdh1 in four classes of cells.
Divergence of mesenchymal gene expressions in SCLC subtypes. (A,B) Normalized expression levels of Vim (M gene) and Cdh1 (E gene) in 15,138 mouse Myc-driven tumor cells (A) and in 13,945 cells from eight SCLC cell lines (B). Imputed data were plotted in B for visual aid. Circles and black bars show means and standard deviations of SCLC subtypes. (C,D) Line plots: nnPCA was performed. Plots show standard deviations of SCLC subtype means for mouse and human SCLC cells from the top 5 PCs. Scatter plots: nnPCA-based E and M scores. (E,F) Normalized expression levels of Zeb1 (M gene) and Cdh1 (E gene) in 15,138 mouse Myc-driven tumor cells (E) and in 13,945 cells from 8 SCLC cell lines (F). Imputed data were plotted in F for visual aid. (G) Mean differences between N cells (A/N cells for mouse) and Y cells in individual M gene’s expression in mouse and human SCLC cells. (H) A heatmap showing the diversity of M gene expression across N (A/N) and Y subtypes.
Detection of epithelial- and mesenchymal-like SCLC subtypes in human tumor cells. (A) Scatter plot shows nnPCA-based E and M scores for 54,523 SCLC cells [35]. Color code represents each of the 19 patients. Circles and black bars show means and standard deviations of cell scores for each patient. (B) Same data as in A with SCLC subtype labels obtained from Chan et al. [35]. Circles and black bars show means and standard deviations of scores across the previously determined SCLC subtypes. (C,D) Scatter plots show A2 and A subtype scores in the previously determined SCLC-A subtype cells from the data in B. (E,F) Transcriptome data of 11,056 SCLC cells from tumor RU1108 of the Chan et al. study [35] were projected onto indicated score axes. Linear regression lines and confidence intervals were obtained for all cells and ASCL1⁺ cells. sr is Spearman correlation coefficient. (G) Normalized expression of ZEB1 and VIM in the same dataset as in A. Color code represents each of the 19 patients as in A. (H) The same data as in G but color labels are defined by SCLC subtype shown in B.
Correlations between A2 and epithelial transcriptional programs in individual human tumor cells. (A) A total of 5268 single-cell transcriptomes from SCLC tumor SC53 [9] were projected onto the E and M score axes using nnPCA. Colors indicate SCLC subtype scores (see Section 2). (B–D) Linear regression lines and confidence intervals were obtained for all cells and ASCL1⁺ cells from data in D. (E) Datasets of three tumors (top labels) were used to compute the Spearman correlation coefficients between SCLC subtype scores (left labels) and E scores from nnPCA or CDH1 expression levels. All cells and ASCL1⁺ cells were analyzed separately. (F) Correlations between A2 scores and E scores (nnPCA) in untreated and cisplatin-treated cells of SC53 tumor. (G) Datasets of SC53 and SC68 tumors were used to compute the Spearman correlation coefficients between SCLC subtype scores (left labels) and E scores from nnPCA or CDH1 expression levels. All cells and ASCL1⁺ cells were analyzed separately. Untreated and cisplatin-treated cells were analyzed separately.
Involvement of Epithelial–Mesenchymal Transition Genes in Small Cell Lung Cancer Phenotypic Plasticity
  • Article
  • Full-text available

February 2023

·

68 Reads

·

11 Citations

Sarah M. Groves

·

Nicholas Panchy

·

Darren R. Tyson

·

[...]

·

Tian Hong

Simple Summary Small cell lung cancer (SCLC) is an aggressive cancer that is difficult to treat. There are at least five subtypes of SCLC cells, defined by gene expression signatures. The transitions between these subtypes and cooperation between them contribute to the progression of SCLC. Particularly, transitions between neuroendocrine (NE) states, including A, A2, and N subtypes, and the non-NE states, including P and Y subtypes, are hallmarks of SCLC plasticity. In this study, the relationship between SCLC subtypes and epithelial to mesenchymal transition (EMT) was analyzed. EMT is a well-known form of cellular plasticity that contributes to cancer invasiveness and resistance. The results showed that the SCLC-A2 subtype is epithelial while SCLC-A and SCLC-N are mesenchymal but distinct from the non-NE mesenchymal states. This study provides a basis for understanding the gene regulatory mechanisms of SCLC tumor plasticity and its applicability to other cancer types. Abstract Small cell lung cancer (SCLC) is an aggressive cancer recalcitrant to treatment, arising predominantly from epithelial pulmonary neuroendocrine (NE) cells. Intratumor heterogeneity plays critical roles in SCLC disease progression, metastasis, and treatment resistance. At least five transcriptional SCLC NE and non-NE cell subtypes were recently defined by gene expression signatures. Transition from NE to non-NE cell states and cooperation between subtypes within a tumor likely contribute to SCLC progression by mechanisms of adaptation to perturbations. Therefore, gene regulatory programs distinguishing SCLC subtypes or promoting transitions are of great interest. Here, we systematically analyze the relationship between SCLC NE/non-NE transition and epithelial to mesenchymal transition (EMT)—a well-studied cellular process contributing to cancer invasiveness and resistance—using multiple transcriptome datasets from SCLC mouse tumor models, human cancer cell lines, and tumor samples. The NE SCLC-A2 subtype maps to the epithelial state. In contrast, SCLC-A and SCLC-N (NE) map to a partial mesenchymal state (M1) that is distinct from the non-NE, partial mesenchymal state (M2). The correspondence between SCLC subtypes and the EMT program paves the way for further work to understand gene regulatory mechanisms of SCLC tumor plasticity with applicability to other cancer types.

Download

Figure 2. Combination of metrics is better than any individual metric. (A) We tested each of our models' metrics and compared them to CCsc MS, CCsc MM, and CCsc MMS and a set of randomly selected genes, equal to the number of genes used by our method on the TCGA-COAD dataset. (B) We made the same comparison on the TCGA-READ dataset. We show that integrating the lists of ranked genes from each metric provides better performance than any one individually. (C) KaplanMeier estimate based on our model's top set of predictors for the TCGA-COAD dataset. (D) KaplanMeier estimate based on our model's top set of predictors for the TCGA-READ dataset. The shaded regions represent 95% confidence intervals of the survival estimates. The p-value threshold for significance is < 0.05.
Figure 3. CCsc MMS Outperforms Other Methods. (A) We tested CCsc against several well-established tools on the TCGA-COAD dataset. We compared the mean concordance index performance of CCsc with its ideal weighting to DESeq2, edgeR, scDD, and DEsingle. We gave each method its optimal number of genes and ran each with its recommended settings, according to their respective best practices. (B) Same comparison but on the TCGA-READ dataset. We show that CCsc outperforms single-cell methods (scDD and DEsingle) and that bulk RNA-seq methods can be optimized for single-cell RNA-seq data (DESeq2, edgeR).
Figure 4. Top coefficients identified by our model. (A) Top risk decreasing (green, left) and top risk increasing (red, right) genes identified by our Cox model in TCGA-COAD. (B) Top risk decreasing genes (green, left) and top risk increasing (red, right) genes identified by our Cox model in TCGA-READ. For risk increasing genes the hazard ratio threshold is > 2. For risk decreasing genes the hazard ratio threshold is < 0.5.
Statistically Significantly Enriched READ Pathways: Most enriched pathways influenced by the genes with the largest hazard ratios are associated with increased risk in our TCGA-READ Cox model. p-value threshold < 0.05 FDR. Hazard ratio threshold > 2.
Using Single-Cell RNA Sequencing and MicroRNA Targeting Data to Improve Colorectal Cancer Survival Prediction

January 2023

·

48 Reads

·

9 Citations

Colorectal cancer has proven to be difficult to treat as it is the second leading cause of cancer death for both men and women worldwide. Recent work has shown the importance of microRNA (miRNA) in the progression and metastasis of colorectal cancer. Here, we develop a metric based on miRNA-gene target interactions, previously validated to be associated with colorectal cancer. We use this metric with a regularized Cox model to produce a small set of top-performing genes related to colon cancer. We show that using the miRNA metric and a Cox model led to a meaningful improvement in colon cancer survival prediction and correct patient risk stratification. We show that our approach outperforms existing methods and that the top genes identified by our process are implicated in NOTCH3 signaling and general metabolism pathways, which are essential to colon cancer progression.


Common language effective size of separation of dosage groups based on EMT scores
Common language effect size of separation of time and dosage groups based on EMT scores of integrated data
Comparative single-cell transcriptomes of dose and time dependent epithelial–mesenchymal spectrums

September 2022

·

42 Reads

·

13 Citations

NAR Genomics and Bioinformatics

Epithelial–mesenchymal transition (EMT) is a cellular process involved in development and disease progression. Intermediate EMT states were observed in tumors and fibrotic tissues, but previous in vitro studies focused on time-dependent responses with single doses of signals; it was unclear whether single-cell transcriptomes support stable intermediates observed in diseases. Here, we performed single-cell RNA-sequencing with human mammary epithelial cells treated with multiple doses of TGF-β. We found that dose-dependent EMT harbors multiple intermediate states at nearly steady state. Comparisons of dose- and time-dependent EMT transcriptomes revealed that the dose-dependent data enable higher sensitivity to detect genes associated with EMT. We identified cell clusters unique to time-dependent EMT, reflecting cells en route to stable states. Combining dose- and time-dependent cell clusters gave rise to accurate prognosis for cancer patients. Our transcriptomic data and analyses uncover a stable EMT continuum at the single-cell resolution, and complementary information of two types of single-cell experiments.


Figure 1. Distributions of SCLC cells across the epithelial and mesenchymal spectrum. A. E and M scores of 15138 mouse Myc-overexpressing SCLC GEMM tumor cells in an ex vivo model. Scores were computed with ssGSEA and 425 previously identified EMT genes. Circles and black bars show means and standard deviations of SCLC subtypes. Inset shows normalized expression of Cdh1 in four types of specialist cells. B. E and M scores of 120 SCLC cell lines computed with the same method as in A. C. E and M scores of 13945 cells from 8 SCLC cell lines computed with the same method as in A.
Figure 2. Divergence of mesenchymal gene expressions in SCLC subtypes. A, B. Normalized expression levels of Vim (M gene) and Cdh1 (E gene) in 15138 mouse Myc-driven tumor cells (A) and in 13945 cells from 8 SCLC cell lines (B). Imputed data were plotted in B for visual aid. Circles and black bars show means and standard deviations of SCLC subtypes. C, D. Line plots: nnPCA was performed. Plots show standard deviations of SCLC subtype means for mouse and human SCLC cells from the top 5 PCs. Scatter plots: nnPCA-based E and M scores. E, F. Normalized expression levels of Zeb1 (M gene) and Cdh1 (E gene) in 15138 mouse Mycdriven tumor cells (E) and in 13945 cells from 8 SCLC cell lines (F). Imputed data were plotted in F for visual aid. G. Mean differences between N cells (A/N cells for mouse) and Y cells in individual M gene's expression in mouse and human SCLC cells. H. A heatmap showing the diversity of M gene expression across N (A/N) and Y subtypes.
Figure 3. Predictions of epithelial-like A2 SCLC subtypes in human tumor cells. A. Scatter plot shows nnPCA-based E and M scores for 54,523 SCLC cells (Chan et al., 2021). Color code represents each of the 19 patients. Circles and black bars show means and standard deviations
Analysis of transcriptome datasets reveals involvement of epithelial-mesenchymal transition genes in small cell lung cancer phenotypic plasticity

September 2022

·

96 Reads

Small cell lung cancer (SCLC) is an aggressive cancer recalcitrant to treatment. SCLC arises in lung epithelium, and the phenotypic heterogeneity of SCLC cells plays critical roles in disease progression and drug resistance. Particularly, the transition from neuroendocrine (NE) cells to non-neuroendocrine (non-NE) cells, as well as the cooperations between these cell types, are observed in SCLC progression. At least five subtypes of SCLC tumors have recently been defined by signature gene expression and these subtypes have been extended to individual SCLC tumor cells. However, the molecular and cellular programs that distinguish the cell types or allow transitions among them are unclear. Here, we systematically analyzed the relationship between SCLC and epithelial to mesenchymal transition (EMT), a cellular process well known to contribute to invasiveness and drug resistance of cancer cells, using multiple transcriptome datasets from a mouse tumor model, human cancer cell lines, and human tumor samples. Our analysis of these datasets consistently showed that an extreme epithelial state corresponds to the SCLC-A2 subtype. Furthermore, two NE subtypes have significant expression of a subset of mesenchymal genes, with surprisingly limited overlap with the mesenchymal genes highly expressed in non-NE cells. Our work reveals a previously unknown connection between an EMT program and a specific SCLC subtype and an underappreciated divergence of EMT trajectories contributing to the SCLC phenotypic heterogeneity.


Figure 1. Analysis overview and progression of dose-dependent EMT at single-cell level. (A) A schematic of our analysis in this study. The analyses involve the existing time-course data (top) contain MCF10A cells at different time points following a common TGF-β treatment (about 200 pM; Deshmukh et al. [13]) as well as the dose-dependent data (bottom) containing MCF10A cells treated with different dosage levels of TGF-β after a fixed time period representing near-steady-state. Gene expression of cells from both experiments were measured using single-cell RNA-sequencing and were subsequently used individually and integrated for downstream analyses. (B) Projection of dose treatment single-cell expression data using UMAP. The color of individual points indicates the dose of TGF-β treatment from 0 pM (red) to 800 pM (pink). (C-D) Contour plots of gene set scores of E (x-axis) and M (y-axis) genes using nnPCA (C) and Zscore (D). Color indicates the dose of TGF-β as in (B). Circles indicate the mean E-and M-score of samples from each dose point and the associated error bars show the standard deviation (E-G) Overlay of the scaled expression of EMT marker genes CDH1 (an epithelial marker, E), VIM (a mesenchymal marker, F), and FN1 (a highly expressed mesenchymal gene, G). The color of individual points indicates the Z-score of expression of each gene from low (blue) to high (red).
Figure 2. Continuity of integrated single cell dose and time data in low-dimensional projections. (A) Projection of integrated dose and time data using UMAP. Each panel uses the same underlying UMAP for samples, but the coloring of points has different meaning. Left panel: color indicates the origin of the sample from the Days 0, 4 and 8 of the time experiment (red), the Days 0, 1, 2 and 3 of the time experiment (green), or the dose experiment (blue). Middle panel: color indicates the treatment dose of samples from the dose experiment; time samples are masked. Right panel: color indicates the time of treatment for samples from the time experiment, dose samples are masked. (B-C) Contour plots of gene set scores of E (x-axis) and M (y-axis) genes using nnPCA for dose (B) and time (C) samples from integrate data. Color indicates the dose of TGF-β treatment from 0 pM (red) to 800 pM (pink) for dose data and time of treatment from 0 days (red) to 8 days (pink) for time data. Circles indicate the mean E-and M-score of samples from each dose point and the associated error bars show the standard deviation. (D) Boxplots show the distribution of E (left) and M (right) scores across different dose treatments from integrated data. Color indicates the dose of TGF-β as in (B). (E) Boxplots show the distribution of E (left) and M (right) scores across different time treatments from integrated data. Color indicates the time of treatment as in (C).
Figure 4. Enrichment of dose and time samples in E-and M-score space. (A) A contour map showing the distributions of time (blue) and dose (red) samples in E-and M-score space. (B) Bubble chart showing the enrichment of time samples in different segments of E-and M-score space. Each point represents a segment of E-and M-score space defined by a particular quartile of E-score (x-axis) and M-score (y-axis). The size of the point corresponds to the total number samples in the segment and the color of the point represent the log-odds of time sample enrichment from low (blue) to high (red) with a log-odds of 0 (white)
Figure S3. Positioning of samples from different experiments in unintegrated UMAP Space. Projection of unintegrated dose and time data using UMAP. Color indicates the origin of the sample from the Days
Figure S4. Overlap of unintegrated E-and M-scores across time and dose data. Scatter plot of E-scores (x-axis) and M-scores (y-axis) of samples from the Days 0,4,8 of time experiment (red), Days 0,1,2,3 of time experiment (green) and dose experiment (blue) prior to full integration. Note that time samples have been integrated relative to one another as in Deshmukh et al.[13], but have not been integrated with the samples from the dose experiment.
Comparative single-cell transcriptomes of dose and time dependent epithelial-mesenchymal spectrums

May 2022

·

62 Reads

·

1 Citation

Epithelial-mesenchymal transition (EMT) is a key cellular process involved in development and disease progression. Single-cell transcriptomes can characterize intermediate EMT states observed in tumors and fibrotic tissues, but previous in vitro models focused on time-dependent responses after stimulation with single dose of EMT signals. It was therefore unclear whether single-cell transcriptomes support stable intermediate EMT phenotypes crucial for disease progression. We performed single-cell RNA-sequencing with human mammary epithelial cells treated with various concentrations of TGF-β. We found that the dose-dependent EMT harbors multiple intermediate states at the single-cell level after two weeks of treatment, suggesting a stable continuum. After correcting batch effects from experiments, we performed comparative analyses of the dose- and time-dependent EMT. We found that the dose-dependent EMT shows a stronger anti-correlation between epithelial and mesenchymal transcriptional programs and a better resolution of transition stages compared to the time-dependent process. These properties enable higher sensitivity to detect genes whose expressions are associated with core EMT regulatory networks. Nonetheless, we found cell clusters unique to the time-dependent EMT, which correspond to en route cell populations that do not appear at steady states. Furthermore, combining dose- and time-dependent cell clusters gave rise to more accurate prognosis for cancer patients compared to individual EMT spectrum. Our new data and analyses reveal a stable EMT continuum at the single-cell resolution and the transcriptomic level. The dose-dependent experimental model can complement the widely used time-course experiments to reflect physiologically or pathologically relevant EMT phenotypes in a comprehensive manner.


High‐throughput measurement of plant fitness traits with an object detection method using Faster R‐CNN

March 2022

·

93 Reads

·

13 Citations

Revealing the contributions of genes to plant phenotype is frequently challenging because loss‐of‐function effects may be subtle or masked by varying degrees of genetic redundancy. Such effects can potentially be detected by measuring plant fitness, which reflects the cumulative effects of genetic changes over the lifetime of a plant. However, fitness is challenging to measure accurately, particularly in species with high fecundity and relatively small propagule sizes such as Arabidopsis thaliana. An image segmentation‐based method using the software ImageJ and an object detection‐based method using the Faster Region‐based Convolutional Neural Network (R‐CNN) algorithm were used for measuring two Arabidopsis fitness traits: seed and fruit counts. The segmentation‐based method was error‐prone (correlation between true and predicted seed counts, r² = 0.849) because seeds touching each other were undercounted. By contrast, the object detection‐based algorithm yielded near perfect seed counts (r² = 0.9996) and highly accurate fruit counts (r² = 0.980). Comparing seed counts for wild‐type and 12 mutant lines revealed fitness effects for three genes; fruit counts revealed the same effects for two genes. Our study provides analysis pipelines and models to facilitate the investigation of Arabidopsis fitness traits and demonstrates the importance of examining fitness traits when studying gene functions.


High throughput measurement of Arabidopsis thaliana fitness traits using deep learning

September 2021

·

74 Reads

Revealing the contributions of genes to plant phenotype is frequently challenging because the effects of loss of gene function may be subtle or be masked by genetic redundancy. Such effects can potentially be detected by measuring plant fitness, which reflects the cumulative effects of genetic changes over the lifetime of a plant. However, fitness is challenging to measure accurately, particularly in species with high fecundity and relatively small propagule sizes such as Arabidopsis thaliana . An image segmentation-based (ImageJ) and a Faster Region Based Convolutional Neural Network (R-CNN) approach were used for measuring two Arabidopsis fitness traits: seed and fruit counts. Although straightforward to use, ImageJ was error-prone (correlation between true and predicted seed counts, r ² =0.849) because seeds touching each other were undercounted. In contrast, Faster R-CNN yielded near perfect seed counts (r ² =0.9996) and highly accurate fruit counts (r ² =0.980). By examining seed counts, we were able to reveal fitness effects for genes that were previously reported to have no or condition-specific loss-of-function phenotypes. Our study provides models to facilitate the investigation of Arabidopsis fitness traits and demonstrates the importance of examining fitness traits in the study of gene functions.


Interpretable, Scalable, and Transferrable Functional Projection of Large-Scale Transcriptome Data Using Constrained Matrix Decomposition

August 2021

·

68 Reads

·

6 Citations

Large-scale transcriptome data, such as single-cell RNA-sequencing data, have provided unprecedented resources for studying biological processes at the systems level. Numerous dimensionality reduction methods have been developed to visualize and analyze these transcriptome data. In addition, several existing methods allow inference of functional variations among samples using gene sets with known biological functions. However, it remains challenging to analyze transcriptomes with reduced dimensions that are interpretable in terms of dimensions’ directionalities, transferrable to new data, and directly expose the contribution or association of individual genes. In this study, we used gene set non-negative principal component analysis (gsPCA) and non-negative matrix factorization (gsNMF) to analyze large-scale transcriptome datasets. We found that these methods provide low-dimensional information about the progression of biological processes in a quantitative manner, and their performances are comparable to existing functional variation analysis methods in terms of distinguishing multiple cell states and samples from multiple conditions. Remarkably, upon training with a subset of data, these methods allow predictions of locations in the functional space using data from experimental conditions that are not exposed to the models. Specifically, our models predicted the extent of progression and reversion for cells in the epithelial-mesenchymal transition (EMT) continuum. These methods revealed conserved EMT program among multiple types of single cells and tumor samples. Finally, we demonstrate this approach is broadly applicable to data and gene sets beyond EMT and provide several recommendations on the choice between the two linear methods and the optimal algorithmic parameters. Our methods show that simple constrained matrix decomposition can produce to low-dimensional information in functionally interpretable and transferrable space, and can be widely useful for analyzing large-scale transcriptome data.


Interpretable, scalable, and transferrable functional projection of large-scale transcriptome data using constrained matrix decomposition

April 2021

·

39 Reads

Large-scale transcriptome data, such as single-cell RNA-sequencing data, have provided unprecedented resources for studying biological processes at the systems level. Numerous dimensionality reduction methods have been developed to visualize and analyze these transcriptome data. In addition, several existing methods allow inference of functional variations among samples using gene sets with known biological functions. However, it remains challenging to analyze transcriptomes with reduced dimensions that are both interpretable in terms of dimensions, directionalities and transferrable to new data. In this study, we used gene set non-negative principal component analysis (gsPCA) and non-negative matrix factorization (gsNMF) to analyze large-scale transcriptome datasets. We found that these methods provide low-dimensional information about the progression of biological processes in a quantitative manner, and their performances are comparable to existing functional variation analysis methods in terms of distinguishing multiple cell states and samples from multiple conditions. Remarkably, upon training with a subset of data, these methods allow predictions of locations in the functional space using data from experimental conditions that are not exposed to the models. Specifically, our models predicted the extent of progression and reversion for cells in the epithelial-mesenchymal transition (EMT) continuum. These methods revealed conserved EMT program among multiple types of single cells and tumor samples. Finally, we provide several recommendations on the choice between the two linear methods and the optimal algorithmic parameters. Our methods show that simple constrained matrix decomposition can produce to low-dimensional information in functionally interpretable and transferrable space, and can be widely useful for analyzing large-scale transcriptome data.


FIGURE 1 Network structure of the eS6-P regulatory circuit and comparisons between simulations and experiments. (A) Shown is an influence diagram of a model of the light-and clock-driven feedforward regulatory system that regulates eS6-P (ClockþLight Model). This model consists of two cycling systems: the light-dark cycle (an oscillatory input) and the circadian clock (an autonomous oscillator). Each cycle regulates cellular processes independently. The regulation by the light-dark cycle is mediated by the TOR-S6K pathway, and the clock-driven regulation is mediated by transcription factors belonging to the LHY/CCA1 (C1), EC (C2), PRR9/7 (C3), and PRR5/1 (C4) modules, which contain a repressilator circuit and additional interactions. The lightdark cycle also regulates the circadian clock via the LHY/CCA1 and PRR9/7 modules (entrainment), thus creating a feedforward circuit. The direction of regulation is indicated by the shape of arrowhead at the end of each line (triangle arrowhead: activation, flat arrowhead: repression). Note that the ambiguous (diamond) regulation of the clock by light reflects that light regulates LHY/CCA1 and PRR9/7 in opposite directions (we assume that light represses CCA1 stability (49)), which is important to restrict the LHY/CCA1 peak to dawn. The direction of regulation for pink arrows was inferred during optimization, whereas the direction of regulation for black arrows (i.e., the circadian clock and TOR pathway) were established based on prior knowledge. (B) Shown are model predictions (black) and experimental observations (red) of eS6-P under wild-type in long days (16:8, LD). Error bars indicate SD of pooled measurements at various circadian times. Circadian time is relative to dawn, and regions shaded with gray indicate period of darkness. Blue dots show the raw data points collected before pooling over a period of 84 h (18). (C) Shown are model predictions (black) and experimental observations (red) of eS6-P under long days (16:8, LD) but with a deficient clock (CCA1 overexpression). Error bars indicate SD of pooled measurements at various circadian times. Time is measured, and the graph is shaded as in (B). (D) Shown are model predictions (black) and experimental observations (red) of eS6-P under LL. Error bars (legend continued on next page)
FIGURE 3 Response of alternative models of eS6-P circuit to variations of the night-to-day transition time. (A) Shown are influence diagrams of the three alternative models of eS6-P circuit. In each diagram, the regulatory factors are indicated by the lettered black circles, and regulatory interactions are denoted by colored lines (blue ¼ activation, red ¼ repression). Right panels show simulation trajectories with these models under the LD condition. Gray regions show the period of night. (B) Shown are trajectories of eS6-P in response to variations of the night-to-day transition time from Dt ND ¼ À4 (purple) to Dt ND ¼ 4 (yellow) in 0.5-h increments. All trajectories start at the dusk of the previous day (day 0, unperturbed) and end at the dusk of the current day (day 1, perturbed). Yellow line with purple margin shows overlapped trajectories in the absence of light. Thick solid curves show trajectories in the early day (first 4 h after dawn), and thin solid curves show trajectories in the remaining hours of the day. The cyan dots indicate the peak value of eS6-P during the early day in each alternative model, which unlike the ClockþLight model (see Fig. 2 B), is unaffected by the change in night-to-day transition. Short line segments at the bottom show the time of dawn for each trajectory. (C) Left: peak metric (ratio of the early day maximum of eS6-P to the eS6-P levels at the dusk) with respect
FIGURE 7 Genes in A. thaliana with eS6-P-like expression patterns. (A) Shown is the distribution of the peak phase shift between LD and LL conditions for 92 genes for which phases of expression profiles are similar to those of eS6-P (blue). Equivalent phase shifts for all cyclic genes in the list generated by Dalchau et al. (8) are in gray. (B-D) Shown are the mRNA expression patterns of 92 genes with phases of expression profiles similar to those of eS6-P for wildtype under LD (B), wildtype under LL (C), and CCA1 overexpression under LD (D) conditions. Time course expression profiles of individual genes are shown as colored curves, whereas the solid black line shows the average expression of all 92 genes. The dotted black line is the average expression pattern of all cyclic genes in the list generated by Dalchau et al. (8). Expression values in all panels were normalized from 0 to 1, such that 0 corresponds to the minimum, and 1 corresponds to the maximum for each individual gene. (E) Shown are the distributions of the difference in average mRNA expression level between day and night of 92 eS6-P-like genes (blue) and all cyclic genes in the list generated by Dalchau et al. (8) (gray) under CCA1 overexpression condition. Positive values indicate greater average expression during the day, and negative values indicate greater average expression during the night. To see this figure in color, go online.
Early Detection of Daylengths with a Feedforward Circuit Coregulated by Circadian and Diurnal Cycles

November 2020

·

58 Reads

·

8 Citations

Biophysical Journal

Light-entrained circadian clocks confer rhythmic dynamics of cellular and molecular activities to animals and plants. These intrinsic clocks allow stable anticipations to light-dark (diel) cycles. Many genes in the model plant Arabidopsis thaliana are regulated by diel cycles via pathways independent of the clock, suggesting that the integration of circadian and light signals is important for the fitness of plants. Previous studies of light-clock signal integrations have focused on moderate phase adjustment of the two signals. However, dynamical features of integrations across a broad range of phases remain elusive. Phosphorylation of ribosomal protein of the small subunit 6 (eS6), a ubiquitous post-translational modification across kingdoms, is influenced by the circadian clock and the light-dark (diel) cycle in an opposite manner. To understand this striking phenomenon and its underlying information processing capabilities, we built a mathematical model for the eS6 phosphorylation (eS6-P) control circuit. We found that the dynamics of eS6-P can be explained by a feedforward circuit with inputs from both circadian and diel cycles. Furthermore, the early day response of this circuit with dual rhythmic inputs is sensitive to the changes in daylength, including both transient and gradual changes observed in realistic light intervals across a year, because of weather and seasons. By analyzing published gene expression data, we found that the dynamics produced by the eS6-P control circuit can be observed in the expression profiles of a large number of genes. Our work provides mechanistic insights into the complex dynamics of a ribosomal protein, and it proposes a previously underappreciated function of the circadian clock, which not only prepares organisms for normal diel cycles but also helps to detect both transient and seasonal changes with a predictive power.


Citations (22)


... Interestingly, an inverse relationship between NED and EMT has also been described in some tumors, i.e., small cell lung carcinoma (SCLC), Merkel cell carcinoma, and gastroenteropancreatic (GEP)-NET [18]. In SCLC, an association was revealed between the loss of NED and EMT induction [19] as inferred from the observation that the low NED subtype had undergone EMT and had activated-amongst others-the TGF-β pathway. However, differential effects of TGF-β on both programs were observed in SCLC in that TGF-β seems to be required for promoting EMT but not NED. ...

Reference:

Characterization of Epithelial–Mesenchymal and Neuroendocrine Differentiation States in Pancreatic and Small Cell Ovarian Tumor Cells and Their Modulation by TGF-β1 and BMP-7
Involvement of Epithelial–Mesenchymal Transition Genes in Small Cell Lung Cancer Phenotypic Plasticity

... Figure 3 provides an overview of multiple survival prediction studies that encompass a range of cancer subtypes, either within a pancancer context or within the context of predicting survival for different subtypes. A total of 14 studies have taken into account multiple cancer subtypes whereas the majority of the studies have only covered only a single type of cancer subtype such as colorectal cancer 109 , lymphoma 85 , colon adenocarcinoma 39 , gastric cancer 42 and so on. points 31,81 . ...

Using Single-Cell RNA Sequencing and MicroRNA Targeting Data to Improve Colorectal Cancer Survival Prediction

... Notably, NRG1 has been found to promote partial EMT in cultured patient HER2-positive breast cancer 85 . While NRG1 has been mostly described to drive EMT in epithelial cells, NRG1 stimulation on mesenchymal cells that already underwent EMT has been shown to instead induce epithelial gene expression in esophageal adenocarcinoma 86 87 . Similar to previous analyses, scRNA-seq data was clustered, and canonical markers were used to identify epithelial, intermediate, and mesenchymal states (Fig. 6a). ...

Comparative single-cell transcriptomes of dose and time dependent epithelial–mesenchymal spectrums

NAR Genomics and Bioinformatics

... The motivation to build our models derived from inconsistencies between existing EMT models that predict a paucity of EMT intermediate states, and experimental single-cell transcriptomic data that have been interpreted to support a wealth of states in a phenotypic continuum (10,11,39). Nonetheless, it is plausible that some of the intermediate states should be favored, perhaps in relation with microenvironmental factors such as nutrient availability and cytokines. ...

Comparative single-cell transcriptomes of dose and time dependent epithelial-mesenchymal spectrums

... While satellite and drone images can easily cover large areas, the resulting image resolution is most suitable for field-scale metrics such as Normalized Difference Vegetation Index (NDVI), plant count, or plant height ( [3], [26], [29]). For plant-scale and organ-scale metrics, such as fruit and pest counts, fruit quality, and weed classification, it is more typical to use images taken from ground-based platforms that are close to the crop ( [36], [17], [34], [24]) or taken in a laboratory setting ( [25], [12], [11], [42]). Automated ground-based imaging of crops is often accomplished through cameras mounted on mobile robots or gantries, which in addition to providing high-resolution images, can also accommodate other sensors placed close to crops and allow for the direct manipulation of plants ( [28], [30], [40]). ...

High‐throughput measurement of plant fitness traits with an object detection method using Faster R‐CNN

... We performed nnPCA using the gene sets mentioned above. nnPCA provided higher resolutions for large number of samples in the EMT spectrum compared to GSVA according to previous studies (31,59). nnPCA determines the approximately orthogonal axes with non-negative coefficients (loadings) for features (genes). ...

Interpretable, Scalable, and Transferrable Functional Projection of Large-Scale Transcriptome Data Using Constrained Matrix Decomposition

... In contrast, under continuous light, that is, in the absence of a light-on signal to set the phase of the circadian clock, eS6-P is repressed by the circadian clock during the subjective day and induced during the subjective night (Choudhary et al., 2015;Enganti et al., 2018). Mathematical modeling of this seemingly contradictory signaling network has suggested that eS6-P may respond in a sensitive manner to subtle shifts in photoperiod or diel illumination (Panchy et al., 2020). eS6 phosphorylation in plants is considered a canonical readout of the TOR-S6 kinase (TOR-S6K) pathway because this is the only established pathway known to regulate eS6 phosphorylation in plants (Chen et al., 2018;Dobrenel et al., 2016;Mahfouz et al., 2006). ...

Early Detection of Daylengths with a Feedforward Circuit Coregulated by Circadian and Diurnal Cycles

Biophysical Journal

... Another regulatory code is believed to control mRNA stability, localization and splicing [9,10], and there are analogous techniques for high-throughput interrogation of RNA-binding proteins that control these processes [11,12]. However, even when integrated, multi-omics data appear to be less useful than anticipated in predicting transcriptional regulatory networks [1,13,14] and inferring strongly predictive regulatory codes from these data remains a challenge [13]. ...

Improved recovery of cell-cycle gene expression in Saccharomyces cerevisiae from regulatory interactions in multiple omics data

BMC Genomics

... Adding to the complexity, researchers have discovered that EMT is heterogeneous within cell populations in biological models. Tumour biopsies have been found to contain cells expressing a spectrum of epithelial and mesenchymal markers [8,[15][16][17]. This heterogeneity extends to in vitro cultures as well, where RNA-sequencing analysis has revealed diverse gene expression changes within a single cell line in response to identical EMT-inducing stimuli [18]. ...

Integrative Transcriptomic Analysis Reveals a Multiphasic Epithelial–Mesenchymal Spectrum in Cancer and Non-tumorigenic Cells

... including stemness and differentiation, drug-sensitive and drug-resistant states, and transitions between epithelial and mesenchymal cell-states 11 . Increasing efforts are being made to characterize the intrinsic cellular factors that drive phenotypic plasticity [12][13][14] . However, despite extensive molecular characterization, the dynamics of phenotypic plasticity at the single-cell and population levels remain largely unclear. ...

Combinatorial perturbation analysis reveals divergent regulations of mesenchymal genes during epithelial-to-mesenchymal transition

npj Systems Biology and Applications