Xiaotao Shen

Xiaotao Shen
Stanford University | SU · Department of Genetics

PhD
Looking for an assistant professor position.

About

57
Publications
8,328
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
978
Citations
Citations since 2017
51 Research Items
977 Citations
2017201820192020202120222023050100150200250300
2017201820192020202120222023050100150200250300
2017201820192020202120222023050100150200250300
2017201820192020202120222023050100150200250300
Introduction
I am now a postdoctoral research fellow at Stanford University. I am broadly interested in Metabolomics, Multi-omics, Biostatistics, Systems Biology, and Bioinformatics, and their application in healthcare.
Additional affiliations
September 2013 - February 2019
Chinese Academy of Sciences
Position
  • PhD

Publications

Publications (57)
Article
Full-text available
Current healthcare practices are reactive and use limited physiological and clinical information, often collected months or years apart. Moreover, the discovery and profiling of blood biomarkers in clinical and research settings are constrained by geographical barriers, the cost and inconvenience of in-clinic venepuncture, low sampling frequency an...
Article
Full-text available
Liquid chromatography–mass spectrometry (LC–MS)-based untargeted metabolomics provides systematic profiling of metabolic. Yet, its applications in precision medicine (disease diagnosis) have been limited by several challenges, including metabolite identification, information loss and low reproducibility. Here, we present the deep-learning-based Pse...
Article
Full-text available
One of the major challenges in LC-MS data is converting many metabolic feature entries to biological function information, such as metabolite annotation and pathway enrichment, which are based on the compound and pathway databases. Multiple online databases have been developed. However, no tool has been developed for operating all these databases f...
Article
Full-text available
Reproducibility, traceability, and transparency have been long-standing issues for metabolomics data analysis. Multiple tools have been developed, but limitations still exist. Here, we present the tidyMass project ( https://www.tidymass.org/ ), a comprehensive R-based computational framework that can achieve the traceable, shareable, and reproducib...
Preprint
Pregnancy is a critical time that has long-term impacts on both maternal and fetal health. During pregnancy, the maternal metabolome undergoes dramatic systemic changes, although correlating longitudinal changes in maternal urine remain largely unexplored. We applied an LCMS-based untargeted metabolomics profiling approach to analyze 346 longitudin...
Article
Full-text available
Conventional environmental health studies have primarily focused on limited environmental stressors at the population level, which lacks the power to dissect the complexity and heterogeneity of individualized environmental exposures. Here, as a pilot case study, we integrated deep-profiled longitudinal personal exposome and internal multi-omics to...
Preprint
One of the major challenges in LC-MS data (metabolome, lipidome, and exposome) is converting many metabolic feature entries to biological function information, such as metabolite annotation and pathway enrichment, which are based on the compound and pathway databases. Multiple online databases have been developed, containing lots of information abo...
Article
Full-text available
Determinants of severe COVID-19 in healthy adults are poorly understood, which limits opportunity for early intervention. We present a multiomic analysis using machine learning to characterize the genomic basis of COVID-19 severity. We use single-cell multiome profiling of human lungs to link genetic signals to cell-type-specific functions. We disc...
Preprint
Full-text available
Liquid chromatography-mass spectrometry (LC-MS) based untargeted metabolomics provides systematic profiling of metabolic. Yet its applications in precision medicine (disease diagnosis) have been limited by several challenges, including metabolite identification, information loss, and low reproducibility. Here, we present the deepPseudoMSI project (...
Preprint
Full-text available
Reproducibility and transparency have been longstanding but significant problems for the metabolomics field. Here, we present the tidyMass project ( https://www.tidymass.org/ ), a comprehensive computational framework that can achieve the shareable and reproducible workflow needs of data processing and analysis for LC-MS-based untargeted metabolomi...
Preprint
Full-text available
Reproducibility and transparency have been longstanding but significant problems for the metabolomics field. Here, we present the tidyMass project (https://www.tidymass.org/), a comprehensive computational framework that can achieve the shareable and reproducible workflow needs of data processing and analysis for LC-MS-based untargeted metabolomics...
Article
Full-text available
Accurate and efficient compound annotation is a long-standing challenge for LC−MS-based data (e.g., untargeted metabolomics and exposomics). Substantial efforts have been devoted to overcoming this obstacle, whereas current tools are limited by the sources of spectral information used (in-house and public databases) and are not automated and stream...
Preprint
The determinants of severe COVID-19 in non-elderly adults are poorly understood, which limits opportunities for early intervention and treatment. Here we present novel machine learning frameworks for identifying common and rare disease-associated genetic variation, which outperform conventional approaches. By integrating single-cell multiomics prof...
Preprint
Full-text available
NGLY1 (N-glycanase 1) deficiency is a rare congenital recessive disorder caused by a mutation in the NGLY1 gene, which encodes the cytosol enzyme N-glycanase 1. The NGLY1 protein catalyzes the first step in protein deglycosylation, a process prerequisite for the cytosolic degradation of misfolded glycoproteins. By performing and combining metabolom...
Preprint
Accurate and efficient compound annotation is a long-standing challenge for LC−MS-based data (e.g. untargeted metabolomics and exposomics). Substantial efforts have been devoted to overcoming this obstacle, whereas current tools are limited by the sources of spectral information used (in-house and public databases) and are not automated and streaml...
Preprint
Conventional environmental health studies primarily focus on limited environmental stressors at the population level, which lacks the power of dissecting the complexity and heterogeneity of individualized environmental exposures. Here we integrated deep-profiled longitudinal personal exposome and internal multi-omics to systematically investigate h...
Article
Full-text available
Background Long-term smoking exposure will increase the risk of esophageal squamous cell carcinoma (ESCC), whereas the mechanism is still unclear. We conducted a cross-sectional study to explore whether serum metabolites mediate the occurrence of ESCC caused by cigarette smoking. Methods Serum metabolic profiles and lifestyle information of 464 pa...
Article
Full-text available
BACKGROUND: Previous metabolomics studies have found differences in metabolic characteristics between the healthy and ESCC patients. However, few of these studies concerned the whole process of the progression of ESCC. This study aims to explore serum metabolites associated with the progression of ESCC. METHODS: Serum samples from 653 participants...
Article
Full-text available
Metabolism during pregnancy is a dynamic and precisely programmed process, the failure of which can bring devastating consequences to the mother and fetus. To define a high-resolution temporal profile of metabolites during healthy pregnancy, we analyzed the untargeted metabolome of 784 weekly blood samples from 30 pregnant women. Broad changes and...
Article
Full-text available
Large-scale metabolite annotation is a challenge in liquid chromatogram-mass spectrometry (LC-MS)-based untargeted metabolomics. Here, we develop a metabolic reaction network (MRN)-based recursive algorithm (MetDNA) that expands metabolite annotations without the need for a comprehensive standard spectral library. MetDNA is based on the rationale t...
Article
Full-text available
Mass spectrometry-based metabolomics aims to profile the metabolic changes in biological systems and identify differential metabolites related to physiological phenotypes and aberrant activities. However, many confounding factors during data acquisition complicate metabolomics data, which is characterized by high dimensionality, uncertain degrees o...
Article
Full-text available
Summary Mass spectrometry-based metabolomics aims to profile the metabolic changes in biological systems and identify differential metabolites related to physiological phenotypes and aberrant activities. However, many confounding factors during data acquisition complicate metabolomics data, which is characterized by high dimensionality, uncertain d...
Article
Ion mobility - mass spectrometry (IM-MS) has showed great application potential for lipidomics. However, IM-MS based lipidomics is significantly restricted by the available software for lipid structural identification. Here, we developed a software tool, namely, LipidIMMS Analyzer, to support the accurate identification of lipids in IM-MS. For the...
Preprint
Metabolite identification is a long-standing challenge in untargeted metabolomics and a major hurdle for functional metabolomics studies. Here, we developed a metabolic reaction network-based recursive algorithm and webserver called MetDNA for the large-scale and unambiguous identification of metabolites (available at http://metdna.zhulab.cn). We s...
Article
Full-text available
Purpose The present study aimed to identify a panel of potential metabolite biomarkers to predict tumor response to neoadjuvant chemo-radiation therapy (NCRT) in locally advanced rectal cancer (LARC). Experimental design Liquid chromatography-–mass spectrometry (LC-–MS)-based untargeted metabolomics was used to profile human serum samples (n = 106)...
Presentation
The 14th International Conference of the Metabolomics Society
Article
Full-text available
The use of collision cross-section (CCS) values derived from ion mobility-mass spectrometry (IM-MS) has been proven to facilitate lipid identifications. Its utility is restricted by the limited available CCS values. Recently, the machine-learning algorithm based prediction (e.g., MetCCS) is reported to generate CCS values in a large-scale. However,...
Article
Full-text available
The rapid development of metabolomics has significantly advanced health and disease related research. However, metabolite identification remains a major analytical challenge for untargeted metabolomics. While the use of collision cross-section (CCS) values obtained in ion mobility - mass spectrometry (IM-MS) effectively increases identification con...
Article
Full-text available
Introduction Previous metabolomics studies have revealed perturbed metabolic signatures in esophageal squamous cell carcinoma (ESCC) patients, however, most of these studies included mainly late-staged ESCC patients due to the difficulties of collecting the early-staged samples from asymptotic ESCC subjects. Objectives This study aims to explore th...
Presentation
Introduction Untargeted metabolomics studies for biomarker discovery often have hundreds to thousands of human samples. Data acquisition of large-scale samples has to be divided into several batches and may span from months to as long as several years. The signal drift of metabolites during data acquisition (intra- and inter-batch) is unavoidable a...
Article
Full-text available
Introduction Untargeted metabolomics studies for biomarker discovery often have hundreds to thousands of human samples. Data acquisition of large-scale samples has to be divided into several batches and may span from months to as long as several years. The signal drift of metabolites during data acquisition (intra- and inter-batch) is unavoidable a...

Network

Cited By

Projects

Project (1)
Project
The research on the screening model for Esophageal Squamous Cell Carcinoma in high-risk population based on metabolomic biomarkers