Susan E Celniker’s research while affiliated with Lawrence Berkeley National Laboratory and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (295)


An AI-Powered tissue-agnostic cellular morphometrics biomarker for risk assessment in patients with pan-gastrointestinal precancerous lesions and cancers
  • Preprint

November 2024

·

23 Reads

·

Chengfei Jiang

·

April W Mao

·

[...]

·

PURPOSE Tissue-agnostic biomarkers that capture the commonality in cancer biology, may provide a new avenue for treatment development and optimization across cancer types. Here, we aimed to evaluate and validate the clinical value of a tissue-agnostic cellular morphometrics biomarker (CMB) signature, which was discovered by artificial intelligence (AI) from H&E-stained whole-slide images (WSI) of diagnostic slides of colon cancers, in pan-gastrointestinal (pan-GI) pre-cancer lesions and cancers. METHODS We discovered CMBs from WSI using our well-established CMB-ML pipeline and established a CMB risk score (CMBRS) using multivariate regression models. Based on CMBRS, we assigned individual patients from The Cancer Genome Atlas Colon Adenocarcinoma Cohort (TCGA-COAD) (n=430) to CMB risk groups (CMBRG). We then extensively evaluated tissue-agnostic clinical value of CMB signature, CMBRS and CMBRG in multi-cohorts with different types of GI cancer (n=2,219) and risk assessment of precancerous lesions (n=1,016). We unraveled each CMB-related biological function using bulk RNA-sequencing, single-cell RNA-sequencing (scRNA-seq) and opal multiplex immunohistochemistry (IHC) techniques. RESULTS From the TCGA-COAD cohort, we developed a 13-CMB signature and constructed CMBRS/CMBRG that predict prognosis of colon cancer patients. Importantly, this 13-CMB signature proved prognostic and predictive values for TCGA patients with rectal, gastric and esophageal cancer independent of traditional clinical factors. These findings were independently validated using multiple cohorts from Drum Tower Hospital. Moreover, 13-CMB signature exhibited the power for risk stratification of colon adenoma and early esophageal neoplastic lesion patients for predicting cancer progression. In addition, we demonstrated and validated independent prognostic impacts of gene signatures and CMB signatures and a significant increase in predictive power by integration of CMB signature, gene signature and clinical factors. Correlations between CMBs and gene expression levels revealed the association of each CMB with biological functions including cell proliferation, epithelial-to-mesenchymal transition and immune microenvironment. The association of CMBs with the immune microenvironment was prospectively validated by scRNA-seq and was further confirmed by Opal multiplex IHC staining in colon cancer. CONCLUSION This study demonstrates the clinical value of tissue-agnostic AI-empowered CMB signature from WSI with defined biological functions, which can be used in clinical settings to assess risk, diagnose disease, and guide clinical interventions. Tissue-agnostic CMBs potentially provide a new avenue for a rapid, robust and cost-effective cross-cancer prediction that is essential for developing common treatment strategy for multiple cancers.


Binding profiles for 961 Drosophila and C. elegans transcription factors reveal tissue-specific regulatory relationships

October 2024

·

28 Reads

·

2 Citations

Genome Research

A catalog of transcription factor (TF) binding sites in the genome is critical for deciphering regulatory relationships. Here we present the culmination of the efforts of the Model Organism ENCyclopedia Of DNA Elements (modENCODE) and the model organism Encyclopedia of Regulatory Networks (modERN) consortia to systematically assay TF binding events in vivo in two major model organisms, Drosophila melanogaster (fly) and Caenorhabditis elegans (worm). These datasets comprise 605 TFs identifying 3.6M sites in the fly and 356 TFs identifying 0.9 M sites in the worm and represent the majority of the regulatory space in each genome. We demonstrate that TFs associate with chromatin in clusters termed "metapeaks", that larger metapeaks have characteristics of high occupancy target (HOT) regions, and that the importance of consensus sequence motifs bound by TFs depends on metapeak size and complexity. Combining ChIP-seq data with single cell RNA-seq data in a machine learning model identifies TFs with a prominent role in promoting target gene expression in specific cell types, even differentiating between parent-daughter cells during embryogenesis. These data are a rich resource for the community that should fuel and guide future investigations into TF function. To facilitate data accessibility and utility, all strains expressing GFP-tagged TFs are available at the stock centers for each organism. The chromatin immunoprecipitation sequencing data are available through the ENCODE Data Coordinating Center, GEO, and through a direct interface that provides rapid access to processed data sets and summary analyses, as well as widgets to probe the cell type-specific TF-target relationships.


Long-Term, Non-Invasive Detection of Low-Dose Ionizing Radiation Exposure

August 2024

·

25 Reads

ECS Meeting Abstracts

Ionizing radiation induces complex changes in cells and tissues. The conventional approach to biological dosimetry has been to integrate physical and clinical measurements to optimize dose assessment. Molecular biodosimetry is an effective strategy to monitor radiation exposure and hematologic, cytogenetic, protein and transcript-based approaches have been developed to increase dose estimation accuracy. However, these approaches are invasive, time-consuming, have limited effectiveness over time, and importantly do not accurately inform on low dose radiation exposures. Therefore, novel, non-invasive, biomarkers are required that can overcome these limitations. We developed a pipeline that employs Fourier transform infrared (FTIR) spectroscopy in the mid-infrared spectrum to identify a signature of low dose ionizing radiation exposure in mouse ear pinnae over time. Two cohorts of C57BL/6J and one cohort of BALB/c mice were followed for ninety days after total body X-ray exposures of 10, 50, 100 or 200 cGy. The stratum corneum layer of the ear pinnae were repeatedly measured with attenuated total reflectance - focal plane array (ATR-FPA)-FTIR at 5, 14, 21, 49 and 90 days after radiation exposure. We found statistically significant discriminative power for all doses and all tested time-points out to 90 days after exposure. Classification accuracy was maximized when testing 14 days after exposure (specificity > 0.9 with a sensitivity threshold of 0.9) and dropped by roughly 30% sensitivity at 90 days. Since only hundreds of samples were required to learn highly discriminative signatures, developing human-relevant diagnostic capabilities is likely feasible and this non-invasive procedure points toward future non-invasive biodosimetry applications at population scales.



Identification of genetic variations and candidate genes associated with colorectal tumor susceptibility in CC mice.
Subtyping human CRCs based on 334 CTS candidate genes.
Differences in gut microbiome between high and low CTS group.
The core genera for predicting CTS and their related function analysis.
Gut microbiome partially mediates the effects of genetics on CTS.

+3

Genetic and microbial determinants of azoxymethane-induced colorectal tumor susceptibility in Collaborative Cross mice and their implication in human cancer
  • Article
  • Full-text available

April 2024

·

60 Reads

The insights into interactions between host genetics and gut microbiome (GM) in colorectal tumor susceptibility (CTS) remains lacking. We used Collaborative Cross mouse population model to identify genetic and microbial determinants of Azoxymethane-induced CTS. We identified 4417 CTS-associated single nucleotide polymorphisms (SNPs) containing 334 genes that were transcriptionally altered in human colorectal cancers (CRCs) and consistently clustered independent human CRC cohorts into two subgroups with different prognosis. We discovered a set of genera in early-life associated with CTS and defined a 16-genus signature that accurately predicted CTS, the majority of which were correlated with human CRCs. We identified 547 SNPs associated with abundances of these genera. Mediation analysis revealed GM as mediators partially exerting the effect of SNP UNC3869242 within Duox2 on CTS. Intestine cell-specific depletion of Duox2 altered GM composition and contribution of Duox2 depletion to CTS was significantly influenced by GM. Our findings provide potential novel targets for personalized CRC prevention and treatment.

Download

Experimental overview. (A) Diagram of immobilized mouse ear with germanium crystal above it. (B) Diagram of crystal during measurement. (C) Sketch of an animal under the microscope. (D) Phase contrast image of outermost layer of stratum corneum prior to measurement. (E) Phase contrast image of deformation of the stratum corneum caused by germanium crystal after measurement. (F) Two strains of mice C57BL/6J (n = 69) and BALB/cJ (n = 38) totaling 107 mice were generated for the study. All mice were treated with total body irradiation (TBI) of either 0, 0.1, 0.5, 1 or 2 Gy at 9–12 weeks of age. At 5-, 14-, 21-, 49- and 90-days post TBI, mice were anesthetized, and measurements of ear pinnae were collected with our FTIR-ATR-FPA system.
Hyperspectral image processing pipeline. Example of the procedure for quality control and normalization on hyperspectral images (range clipping, rubber-band baseline correction, peak normalization).
Spectral features and second derivative spectra. (A) Major spectral features of the murine ear skin annotated with specific biomolecules. (B) The mean second derivative spectra of stratum corneum from the control 0 Gy (green) and 0.1 to 2 Gy exposed (blue) (26,296 pixels from 274 samples) , and their difference spectrum (red). Mean spectra from the irradiated and control mouse ears in the 3050–900 cm–1 region were transformed to second derivative spectra (3-point Savitsky-Golay smoothing, polynomial order 5). Labeled peaks are assigned in Table 1. The difference spectrum was multiplied by − 1 for the ease of visualization purpose.
Hyperspectral image-based classification of radiation exposure. (A) The receiver operating characteristic (ROC) curves of predictions from center region 1D-CNN, stratified by different radiation levels. (B) The receiver operating characteristic (ROC) curves of predictions from center region 1D-CNN, stratified by different days after radiation exposure. (C) Ear ATR-FPA FTIR per-day per-radiation dose specificity with sensitivity threshold at 0.9. While specificity drops after day 21, the results are still far above random guessing (0.33 corresponds to a balanced accuracy of 0.66, p-value < 0.001). (D) Bar plot of specificity (at 90% sensitivity) across days after exposure. The height of each bar is the mean across doses. Error bars correspond to one SD across doses.
Long-term, non-invasive FTIR detection of low-dose ionizing radiation exposure

March 2024

·

117 Reads

·

3 Citations

Non-invasive methods of detecting radiation exposure show promise to improve upon current approaches to biological dosimetry in ease, speed, and accuracy. Here we developed a pipeline that employs Fourier transform infrared (FTIR) spectroscopy in the mid-infrared spectrum to identify a signature of low dose ionizing radiation exposure in mouse ear pinnae over time. Mice exposed to 0.1 to 2 Gy total body irradiation were repeatedly measured by FTIR at the stratum corneum of the ear pinnae. We found significant discriminative power for all doses and time-points out to 90 days after exposure. Classification accuracy was maximized when testing 14 days after exposure (specificity > 0.9 with a sensitivity threshold of 0.9) and dropped by roughly 30% sensitivity at 90 days. Infrared frequencies point towards biological changes in DNA conformation, lipid oxidation and accumulation and shifts in protein secondary structure. Since only hundreds of samples were used to learn the highly discriminative signature, developing human-relevant diagnostic capabilities is likely feasible and this non-invasive procedure points toward rapid, non-invasive, and reagent-free biodosimetry applications at population scales.




Binding profiles for 954 Drosophila and C. elegans transcription factors reveal tissue specific regulatory relationships

January 2024

·

445 Reads

·

3 Citations

A catalog of transcription factor (TF) binding sites in the genome is critical for deciphering regulatory relationships. Here we present the culmination of the modERN (model organism Encyclopedia of Regulatory Networks) consortium that systematically assayed TF binding events in vivo in two major model organisms, Drosophila melanogaster (fly) and Caenorhabditis elegans (worm). We describe key features of these datasets, comprising 604 TFs identifying 3.6M sites in the fly and 350 TFs identifying 0.9 M sites in the worm. Applying a machine learning model to these data identifies sets of TFs with a prominent role in promoting 10 target gene expression in specific cell types. TF binding data are available through the ENCODE Data Coordinating Center and at https://epic.gs.washington.edu/modERNresource, which provides access to processed and summary data, as well as widgets to probe cell type-specific TF-target relationships. These data are a rich resource that should fuel investigations into TF function during development.


Organogenetic transcriptomes of the Drosophila embryo at single cell resolution

January 2024

·

73 Reads

·

2 Citations

Development

To gain insight into the transcription programs activated during the formation of Drosophila larval structures, we carried out single cell RNA sequencing during two periods of Drosophila embryogenesis: stages 10 – 12, when most organs are first specified and initiate morphological and physiological specialization, and stages 13 – 16, when organs achieve their final mature architectures and begin to function. Our data confirm previous findings with regards to functional specialization of some organs – the salivary gland and trachea – and clarify the embryonic functions of another – the plasmatocytes. We also identify two early developmental trajectories in germ cells and uncover a potential role for proteolysis during germline stem cell specialization. We identify the likely cell type of origin for key components of the Drosophila matrisome and several commonly used Drosophila embryonic cell culture lines. Finally, we compare our findings to other recent related studies and to other modalities for identifying tissue-specific gene expression patterns. These data provide a useful community resource for identifying many new players in tissue-specific morphogenesis and functional specialization of developing organs.


Citations (60)


... The protein changes indicate that the skin had not yet returned to its normal state and had remained in the natural remodeling phase. Lastly, the 1140-1000 cm −1 region revealed nucleic acid and carbohydrate changes, indicating differences in the metabolic activity [28][29][30]. ...

Reference:

Wound Healing Potential of Herbal Hydrogel Formulations of Cedrus brevifolia Extracts in Mice
Long-term, non-invasive FTIR detection of low-dose ionizing radiation exposure

... Reducing the large number of variables into a smaller dimensional space will be useful for identifying k-mers that can best discriminate between natural and engineered sequences and hence, better predict the engineering status of unknown DNA sequences (Meng et al., 2016). In the same way, combining different methods for identifying engineered DNA sequences will be useful for classifying sequences that are missed by any single approach (Crook et al., 2022;Adler et al., 2024;Berezin et al., 2024). ...

Ensemble Detection of DNA Engineering Signatures
  • Citing Article
  • March 2024

ACS Synthetic Biology

... Previously, TBP-1 was demonstrated to physically interact with USTC and is required for USTC foci to form, although it did not form foci itself [11]. To investigate the relationship between TBP-1 and USTC further, we independently examined TBP-1 localization in the germ line using super-resolution microscopy of an existing TBP-1::GFP strain [18]. In contrast to the previous observation, we found that TBP-1 did indeed form foci in germ nuclei, typically one or two strong foci, with additional faint foci per nucleus ( Fig 1A). ...

Binding profiles for 954 Drosophila and C. elegans transcription factors reveal tissue specific regulatory relationships

... Notch showed an epithelial and socket cell signal consistent with our previous study (34). In addition to their shared epithelial gene expression profile, the two clusters differ by the expression of markers specific that are hallmarks of tracheal tip cell growth and tracheal branching, including the tracheal progenitor selector gene trachealess, the FGF ligand/receptor pair branchless/breathless, the proteoglycan dally involved in tracheal FGF signaling, and the cytoskeletal factor Zasp52 (63)(64)(65)(66). It is also noteworthy that some Trch_epi markers such as datilografo, APP-like and slow border cells were not expected in a trachea-related tissue based on current knowledge, indicating possible evolutionary divergence with Drosophila processes. ...

Organogenetic transcriptomes of the Drosophila embryo at single cell resolution

Development

... Recently, a screening for smORF mutant phenotypes was carried out in Drosophila melanogaster, focusing on deeply conserved smORFs from Drosophila to humans [34]. Surprisingly, given that deep conservation is assumed to indicate function, only a small number of smORFs were observed to produce a phenotype when mutated, pertaining to either fly morphology or viability. ...

Molecular and functional characterization of the Drosophila melanogaster conserved smORFome
  • Citing Article
  • November 2023

Cell Reports

... The images were obtained with confocal microscopes (Leica SP8X and Zeiss LSM 880 Airyscan) and processed using the ImageJ software. 38 Immunoblotting Western blots were performed following previously described methods 39 with some modifications. The fly adult heads were homogenized on ice in radio-immunoprecipitation assay buffer (Thermo Fisher, # 89900) supplemented with ethylenediaminetetraacetic acid and protease inhibitor cocktail (GenDEPOT, # P3100-001) at 50 μl per 5 heads. ...

A comprehensive Drosophila resource to identify key functional interactions between SARS-CoV-2 factors and host proteins

Cell Reports

... Multiple TFs are often capable of binding to the same motifs with varying levels of affinity and many sites remain unbound, so we next examined available chromatin binding data for over 450 TFs to identify TFs actively bound at newly accessible regions near synaptic genes 33,34 . We compared binding peaks at newly accessible regions within 3kb of synaptic genes to those near three random control sets of neuronally expressed genes. ...

A modERN Resource: Identification of Drosophila Transcription Factor candidate target genes using RNAi

Genetics

... S. graminicola CBS10092 was grown in yeast extract peptone sucrose light (YEPSL) medium (7) for 48 h at 30°C in an orbital incubator (120 rpm). The harvested cells were homogenized using liquid nitrogen, and DNA was extracted by implementing the cetyltrimethylammonium bromide (CTAB) chloroform-isoamyl alcohol protocol (8). Genome sequencing was performed using the PacBio Sequel platform to generate 50 coverage with a total of 293,670 reads, which had an average read length of 13,690 bp. ...

Long-read, whole genome shotgun sequence data for five model organisms

... Approaches to investigate microprotein functions rely on gain-and loss-of-function and phenotypic analyses, specifically examining the physiology and/or morphology of the tissue or organ in which the microprotein is expressed. The field greatly benefits from CRISPR/Cas9 genetic tools, which enable, for example, the generation of precise mutations at endogenous loci to disrupt smORF translation (codon start or smORF sequence deletion) or to tag the microprotein [80,92,93]. Finally, the molecular target of the microprotein can be identified by proteomic approaches. ...

Two neuronal peptides encoded from a single transcript regulate mitochondrial complex III in Drosophila

eLife