
Xiaotao ShenStanford University | SU · Department of Genetics
Xiaotao Shen
PhD
Looking for an assistant professor position.
About
57
Publications
8,328
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
978
Citations
Citations since 2017
Introduction
I am now a postdoctoral research fellow at Stanford University. I am broadly interested in Metabolomics, Multi-omics, Biostatistics, Systems Biology, and Bioinformatics, and their application in healthcare.
Additional affiliations
September 2013 - February 2019
Publications
Publications (57)
Current healthcare practices are reactive and use limited physiological and clinical information, often collected months or years apart. Moreover, the discovery and profiling of blood biomarkers in clinical and research settings are constrained by geographical barriers, the cost and inconvenience of in-clinic venepuncture, low sampling frequency an...
Liquid chromatography–mass spectrometry (LC–MS)-based untargeted metabolomics provides systematic profiling of metabolic. Yet, its applications in precision medicine (disease diagnosis) have been limited by several challenges, including metabolite identification, information loss and low reproducibility. Here, we present the deep-learning-based Pse...
One of the major challenges in LC-MS data is converting many metabolic feature entries to biological function information, such as metabolite annotation and pathway enrichment, which are based on the compound and pathway databases. Multiple online databases have been developed. However, no tool has been developed for operating all these databases f...
Reproducibility, traceability, and transparency have been long-standing issues for metabolomics data analysis. Multiple tools have been developed, but limitations still exist. Here, we present the tidyMass project ( https://www.tidymass.org/ ), a comprehensive R-based computational framework that can achieve the traceable, shareable, and reproducib...
Pregnancy is a critical time that has long-term impacts on both maternal and fetal health. During pregnancy, the maternal metabolome undergoes dramatic systemic changes, although correlating longitudinal changes in maternal urine remain largely unexplored. We applied an LCMS-based untargeted metabolomics profiling approach to analyze 346 longitudin...
Conventional environmental health studies have primarily focused on limited environmental stressors at the population level, which lacks the power to dissect the complexity and heterogeneity of individualized environmental exposures. Here, as a pilot case study, we integrated deep-profiled longitudinal personal exposome and internal multi-omics to...
One of the major challenges in LC-MS data (metabolome, lipidome, and exposome) is converting many metabolic feature entries to biological function information, such as metabolite annotation and pathway enrichment, which are based on the compound and pathway databases. Multiple online databases have been developed, containing lots of information abo...
Determinants of severe COVID-19 in healthy adults are poorly understood, which limits opportunity for early intervention. We present a multiomic analysis using machine learning to characterize the genomic basis of COVID-19 severity. We use single-cell multiome profiling of human lungs to link genetic signals to cell-type-specific functions. We disc...
Liquid chromatography-mass spectrometry (LC-MS) based untargeted metabolomics provides systematic profiling of metabolic. Yet its applications in precision medicine (disease diagnosis) have been limited by several challenges, including metabolite identification, information loss, and low reproducibility. Here, we present the deepPseudoMSI project (...
Reproducibility and transparency have been longstanding but significant problems for the metabolomics field. Here, we present the tidyMass project ( https://www.tidymass.org/ ), a comprehensive computational framework that can achieve the shareable and reproducible workflow needs of data processing and analysis for LC-MS-based untargeted metabolomi...
Reproducibility and transparency have been longstanding but significant problems for the metabolomics field. Here, we present the tidyMass project (https://www.tidymass.org/), a comprehensive computational framework that can achieve the shareable and reproducible workflow needs of data processing and analysis for LC-MS-based untargeted metabolomics...
Accurate and efficient compound annotation is a long-standing challenge for LC−MS-based data (e.g., untargeted metabolomics and exposomics). Substantial efforts have been devoted to overcoming this obstacle, whereas current tools are limited by the sources of spectral information used (in-house and public databases) and are not automated and stream...
The determinants of severe COVID-19 in non-elderly adults are poorly understood, which limits opportunities for early intervention and treatment. Here we present novel machine learning frameworks for identifying common and rare disease-associated genetic variation, which outperform conventional approaches. By integrating single-cell multiomics prof...
NGLY1 (N-glycanase 1) deficiency is a rare congenital recessive disorder caused by a mutation in the NGLY1 gene, which encodes the cytosol enzyme N-glycanase 1. The NGLY1 protein catalyzes the first step in protein deglycosylation, a process prerequisite for the cytosolic degradation of misfolded glycoproteins. By performing and combining metabolom...
Accurate and efficient compound annotation is a long-standing challenge for LC−MS-based data (e.g. untargeted metabolomics and exposomics). Substantial efforts have been devoted to overcoming this obstacle, whereas current tools are limited by the sources of spectral information used (in-house and public databases) and are not automated and streaml...
Conventional environmental health studies primarily focus on limited environmental stressors at the population level, which lacks the power of dissecting the complexity and heterogeneity of individualized environmental exposures. Here we integrated deep-profiled longitudinal personal exposome and internal multi-omics to systematically investigate h...
Background
Long-term smoking exposure will increase the risk of esophageal squamous cell carcinoma (ESCC), whereas the mechanism is still unclear. We conducted a cross-sectional study to explore whether serum metabolites mediate the occurrence of ESCC caused by cigarette smoking.
Methods
Serum metabolic profiles and lifestyle information of 464 pa...
BACKGROUND: Previous metabolomics studies have found differences in metabolic characteristics between the healthy and ESCC patients. However, few of these studies concerned the whole process of the progression of ESCC. This study aims to explore serum metabolites associated with the progression of ESCC. METHODS: Serum samples from 653 participants...
Metabolism during pregnancy is a dynamic and precisely programmed process, the failure of which can bring devastating consequences to the mother and fetus. To define a high-resolution temporal profile of metabolites during healthy pregnancy, we analyzed the untargeted metabolome of 784 weekly blood samples from 30 pregnant women. Broad changes and...
Large-scale metabolite annotation is a challenge in liquid chromatogram-mass spectrometry (LC-MS)-based untargeted metabolomics. Here, we develop a metabolic reaction network (MRN)-based recursive algorithm (MetDNA) that expands metabolite annotations without the need for a comprehensive standard spectral library. MetDNA is based on the rationale t...
Mass spectrometry-based metabolomics aims to profile the metabolic changes in biological systems and identify differential metabolites related to physiological phenotypes and aberrant activities. However, many confounding factors during data acquisition complicate metabolomics data, which is characterized by high dimensionality, uncertain degrees o...
Summary
Mass spectrometry-based metabolomics aims to profile the metabolic changes in biological systems and identify differential metabolites related to physiological phenotypes and aberrant activities. However, many confounding factors during data acquisition complicate metabolomics data, which is characterized by high dimensionality, uncertain d...
Ion mobility - mass spectrometry (IM-MS) has showed great application potential for lipidomics. However, IM-MS based lipidomics is significantly restricted by the available software for lipid structural identification. Here, we developed a software tool, namely, LipidIMMS Analyzer, to support the accurate identification of lipids in IM-MS. For the...
Metabolite identification is a long-standing challenge in untargeted metabolomics and a major hurdle
for functional metabolomics studies. Here, we developed a metabolic reaction network-based recursive
algorithm and webserver called MetDNA for the large-scale and unambiguous identification of
metabolites (available at http://metdna.zhulab.cn). We s...
Purpose
The present study aimed to identify a panel of potential metabolite biomarkers to predict tumor response to neoadjuvant chemo-radiation therapy (NCRT) in locally advanced rectal cancer (LARC).
Experimental design
Liquid chromatography-–mass spectrometry (LC-–MS)-based untargeted metabolomics was used to profile human serum samples (n = 106)...
The 14th International Conference of the Metabolomics Society
The use of collision cross-section (CCS) values derived from ion mobility-mass spectrometry (IM-MS) has been proven to facilitate lipid identifications. Its utility is restricted by the limited available CCS values. Recently, the machine-learning algorithm based prediction (e.g., MetCCS) is reported to generate CCS values in a large-scale. However,...
The rapid development of metabolomics has significantly advanced health and disease related research. However, metabolite identification remains a major analytical challenge for untargeted metabolomics. While the use of collision cross-section (CCS) values obtained in ion mobility - mass spectrometry (IM-MS) effectively increases identification con...
Introduction
Previous metabolomics studies have revealed perturbed metabolic signatures in esophageal squamous cell carcinoma (ESCC) patients, however, most of these studies included mainly late-staged ESCC patients due to the difficulties of collecting the early-staged samples from asymptotic ESCC subjects.
Objectives
This study aims to explore th...
Introduction Untargeted metabolomics studies for biomarker
discovery often have hundreds to thousands of
human samples. Data acquisition of large-scale samples
has to be divided into several batches and may span from
months to as long as several years. The signal drift of
metabolites during data acquisition (intra- and inter-batch)
is unavoidable a...
Introduction
Untargeted metabolomics studies for biomarker discovery often have hundreds to thousands of human samples. Data acquisition of large-scale samples has to be divided into several batches and may span from months to as long as several years. The signal drift of metabolites during data acquisition (intra- and inter-batch) is unavoidable a...
Projects
Project (1)
The research on the screening model for Esophageal Squamous Cell Carcinoma in high-risk population based on metabolomic biomarkers