
Feng Xie- Doctor of Philosophy
- Assistant Professor at University of Minnesota
Feng Xie
- Doctor of Philosophy
- Assistant Professor at University of Minnesota
Health Informatics and Data Science, Making AI/ML trustworthy for healthcare
About
55
Publications
7,023
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
583
Citations
Introduction
Dr. Feng Xie is an Assistant Professor in Computational Health Sciences. With a multidisciplinary background encompassing medical informatics, data science, biology, health services research, and biostatistics, his research focuses on developing trustworthy ML/AI solutions. He applies these advanced methodologies across various healthcare domains, including children's health, critical illness, emergency medicine, and beyond. Please refer to my website for more info: https://fengx13.github.io/
Current institution
Additional affiliations
September 2022 - July 2024
August 2017 - August 2022
Education
August 2017 - January 2022
August 2013 - July 2017
Publications
Publications (55)
While medication intake is common among pregnant women, medication safety remains underexplored, leading to unclear guidance for patients and healthcare professionals. PregMedNet addresses this gap by providing a multifaceted maternal medication safety framework based on systematic analysis of 1.19 million mother-baby dyads from U.S. claims databas...
Translational biology posits a strong bi-directional link between clinical phenotypes and a patient’s biological profile. By leveraging this bi-directional link, we can efficiently deconvolute pre-existing clinical information into biological profiles. However, traditional computational tools are limited in their ability to resolve this link becaus...
Machine learning (ML) has achieved substantial success in performing healthcare tasks in which the configuration of every part of the ML pipeline relies heavily on technical knowledge. To help professionals with borderline expertise to better use ML techniques, Automated ML (AutoML) has emerged as a prospective solution. However, most models genera...
Background
Multiple lines of evidence support peripheral organs in the initiation or progression of Lewy body disease (LBD), a spectrum of neurodegenerative diagnoses that include Parkinson’s Disease (PD) without or with dementia (PDD) and dementia with Lewy bodies (DLB). However, the potential contribution of the peripheral immune response to LBD...
Survival analysis is essential for studying time-to-event outcomes and providing a dynamic understanding of the probability of an event occurring over time. Various survival analysis techniques, from traditional statistical models to state-of-the-art machine learning algorithms, support healthcare intervention and policy decisions. However, there r...
Background
Anesthesiology plays a crucial role in perioperative care, critical care, and pain management, impacting patient experiences and clinical outcomes. However, our understanding of the anesthesiology research landscape is limited. Accordingly, we initiated a data-driven analysis through topic modeling to uncover research trends, enabling in...
Emergency departments (ED) are complex, triage is a main task in the ED to prioritize patient with limited medical resources who need them most. Machine learning (ML) based ED triage tool, Score for Emergency Risk Prediction (SERP), was previously developed using an interpretable ML framework with single center. We aimed to develop SERP with 3 Kore...
Jae Yong Yu Sejin Heo Feng Xie- [...]
원철 차
Studies implicated peripheral organs involvement in the development of Lewy body disease (LBD), a spectrum of neurodegenerative diagnoses that include Parkinson’s Disease (PD) without or with dementia (PDD) and dementia with Lewy bodies (DLB). This study characterized peripheral immune responses unique to LBD at single-cell resolution. Peripheral m...
Male infertility (MI) accounts for at least 30% of infertility etiology, yet the full breadth of potential MI risk factors and adverse health outcomes has not been explored. Here, we use electronic medical records (EMRs) from the University of California (UC) and Stanford to implement a data-driven case-control study to identify MI-associated comor...
Objective:
We propose FedScore, a privacy-preserving federated learning framework for scoring system generation across multiple sites to facilitate cross-institutional collaborations.
Materials and methods:
The FedScore framework includes five modules: federated variable ranking, federated variable transformation, federated score derivation, fed...
Objectives:
Federated learning (FL) has gained popularity in clinical research in recent years to facilitate privacy-preserving collaboration. Structured data, one of the most prevalent forms of clinical data, has experienced significant growth in volume concurrently, notably with the widespread adoption of electronic health records in clinical pr...
p>Feature attribution methods have been widely employed in the eXplainable Artificial Intelligence ( XAI ) community and they have been designed with various underlying concepts. As a consequence, these methods sometimes assign different feature importance on the same data. To resolve the possible discrepancies, we introduce \textit{endorsed attrib...
p>Feature attribution methods have been widely employed in the eXplainable Artificial Intelligence ( XAI ) community and they have been designed with various underlying concepts. As a consequence, these methods sometimes assign different feature importance on the same data. To resolve the possible discrepancies, we introduce \textit{endorsed attrib...
The AutoScore framework can automatically generate data-driven clinical scores in various clinical applications. Here, we present a protocol for developing clinical scoring systems for binary, survival, and ordinal outcomes using the open-source AutoScore package. We describe steps for package installation, detailed data processing and checking, an...
Objective:
The proper handling of missing values is critical to delivering reliable estimates and decisions, especially in high-stakes fields such as clinical research. In response to the increasing diversity and complexity of data, many researchers have developed deep learning (DL)-based imputation techniques. We conducted a systematic review to...
Clinical scores are highly interpretable and widely used in clinical risk stratification. AutoScore was previously developed as a clinical score generator, integrating the interpretability of clinical scores and the discriminability of machine learning (ML). Although a basic framework has been established, AutoScore leaves room for enhancement: var...
Federated learning (FL) has gained popularity in clinical research in recent years to facilitate privacy-preserving collaboration. Structured data, one of the most prevalent forms of clinical data, has experienced significant growth in volume concurrently, notably with the widespread adoption of electronic health records in clinical practice. This...
We propose FedScore, a privacy-preserving federated learning framework for scoring system generation across multiple sites to facilitate cross-institutional collaborations. The FedScore framework includes five modules: federated variable ranking, federated variable transformation, federated score derivation, federated model selection and federated...
Jae Yong Yu Sejin Heo Feng Xie- [...]
원철 차
Background:
Field triage is critical in injury patients as the appropriate transport of patients to trauma centers is directly associated with clinical outcomes. Several prehospital triage scores have been developed in Western and European cohorts; however, their validity and applicability in Asia remains unclear. Therefore, we aimed to develop an...
Recurrent pregnancy loss (RPL), defined as 2 or more pregnancy losses, affects 5-6% of ever-pregnant individuals. Approximately half of these cases have no identifiable explanation. To generate hypotheses about RPL etiologies, we implemented a case-control study comparing the history of over 1,600 diagnoses between RPL and live-birth patients, leve...
Recurrent pregnancy loss (RPL), defined as 2 or more pregnancy losses, affects 5-6% of
ever-pregnant individuals. Approximately half of these cases have no identifiable explanation.
To generate hypotheses about RPL etiologies, we implemented a case-control study comparing
the history of over 1,600 diagnoses between RPL and live-birth patients, leve...
Triage in an emergency department (ED) can help identify the urgency of patients’ treatment and allocate the appropriate resources. Interpretable machine learning methods could be a helpful tool for facilitating the triage process. However, existing related research used only conventional logistic regression methods. This study aims to develop and...
Background
Risk prediction models are useful tools in clinical decision-making which help with risk stratification and resource allocations and may lead to a better health care for patients. AutoScore is a machine learning–based automatic clinical score generator for binary outcomes. This study aims to expand the AutoScore framework to provide a to...
The demand for emergency department (ED) services is increasing across the globe, particularly during the current COVID-19 pandemic. Clinical triage and risk assessment have become increasingly challenging due to the shortage of medical resources and the strain on hospital infrastructure caused by the pandemic. As a result of the widespread use of...
Jae Yong Yu Feng Xie Nan Liu- [...]
원철 차
Emergency departments (EDs) are experiencing complex demands. An ED triage tool, the Score for Emergency Risk Prediction (SERP), was previously developed using an interpretable machine learning framework. It achieved a good performance in the Singapore population. We aimed to externally validate the SERP in a Korean cohort for all ED patients and c...
Objective: The proper handling of missing values is critical to delivering reliable estimates and decisions, especially in high-stakes fields such as clinical research. The increasing diversity and complexity of data have led many researchers to develop deep learning (DL)-based imputation techniques. We conducted a systematic review to evaluate the...
Aim
Accurate and timely prognostication of patients with out-of-hospital cardiac arrest (OHCA) who attain return of spontaneous circulation (ROSC) is crucial in clinical decision-making, resource allocation, and communication with family. A clinical decision tool, Survival After ROSC in Cardiac Arrest (SARICA), was recently developed, showing excel...
Jae Yong Yu Feng Xie Nan Liu- [...]
원철 차
Emergency departments (EDs) are experiencing complex demands. An ED triage tool, the Score for Emergency Risk Prediction (SERP), was previously developed using an interpretable machine learning framework. It achieved a good performance in the Singapore population. We aimed to externally validate the SERP in a Korean cohort for all ED patients and c...
Risk scores are widely used for clinical decision making and commonly generated from logistic regression models. Machine-learning-based methods may work well for identifying important predictors to create parsimonious scores, but such ‘black box’ variable selection limits interpretability, and variable importance evaluated from a single model can b...
Acute kidney injury (AKI) in hospitalised patients is a common syndrome associated with poorer patient outcomes. Clinical risk scores can be used for the early identification of patients at risk of AKI. We conducted a retrospective study using electronic health records of Singapore General Hospital emergency department patients who were admitted fr...
Background
Medical decision-making impacts both individual and public health. Clinical scores are commonly used among various decision-making models to determine the degree of disease deterioration at the bedside. AutoScore was proposed as a useful clinical score generator based on machine learning and a generalized linear model. However, its curre...
Background
Emergency readmission poses an additional burden on both patients and healthcare systems. Risk stratification is the first step of transitional care interventions targeted at reducing readmission. To accurately predict the short- and intermediate-term risks of readmission and provide information for further temporal risk stratification,...
Background: Risk prediction models are useful tools in clinical decision-making which help with risk stratification and resource allocations and may lead to a better health care for patients. AutoScore is a machine learning-based automatic clinical score generator for binary outcomes. This study aims to expand the AutoScore framework to provide a t...
Risk scores are widely used for clinical decision making and commonly generated from logistic regression models. Machine-learning-based methods may work well for identifying important predictors, but such 'black box' variable selection limits interpretability, and variable importance evaluated from a single model can be biased. We propose a robust...
Acute kidney injury (AKI) in hospitalised patients is a common syndrome associated with poorer patient outcomes. Clinical risk scores can be used for the early identification of patients at risk of AKI. We conducted a retrospective study using electronic health records of Singapore General Hospital emergency department patients who were admitted fr...
Objective
Temporal electronic health records (EHRs) contain a wealth of information for secondary uses, such as clinical events prediction or chronic disease management. However, challenges exist for temporal data representation. We therefore sought to identify these challenges and evaluate novel methodologies for addressing them through a systemat...
There is a continuously growing demand for emergency department (ED) services across the world, especially under the COVID-19 pandemic. Risk triaging plays a crucial role in prioritizing limited medical resources for patients who need them most. Recently the pervasive use of Electronic Health Records (EHR) has generated a large volume of stored dat...
Background
Scoring systems are highly interpretable and widely used to evaluate time-to-event outcomes in healthcare research. However, existing time-to-event scores are predominantly created ad-hoc using a few manually selected variables based on clinician's knowledge, suggesting an unmet need for a robust and efficient generic score-generating me...
BACKGROUND
There is a growing demand globally for emergency department (ED) services. An increase in ED visits has resulted in overcrowding and longer wait times. The triage process plays a crucial role in assessing and stratifying patients' risks and ensuring that the critically ill promptly receive appropriate priority and emergency treatment. A...
Background:
There is a growing demand globally for emergency department (ED) services. An increase in ED visits has resulted in overcrowding and longer waiting times. The triage process plays a crucial role in assessing and stratifying patients' risks and ensuring that the critically ill promptly receive appropriate priority and emergency treatmen...
Importance
Triage in the emergency department (ED) is a complex clinical judgment based on the tacit understanding of the patient’s likelihood of survival, availability of medical resources, and local practices. Although a scoring tool could be valuable in risk stratification, currently available scores have demonstrated limitations.
Objectives
To...
Objective: Temporal electronic health records (EHRs) can be a wealth of information for secondary uses, such as clinical events prediction or chronic disease management. However, challenges exist for temporal data representation. We therefore sought to identify these challenges and evaluate novel methodologies for addressing them through a systemat...
Background: Medical decision-making impacts both individual and public health. Clinical scores are commonly used among a wide variety of decision-making models for determining the degree of disease deterioration at the bedside. AutoScore was proposed as a useful clinical score generator based on machine learning and a generalized linear model. Its...
Background: Acute kidney injury (AKI) in hospitalised patients is a common syndrome associated with poorer patient outcomes. Clinical risk scores can be used for the early identification of patients at risk of AKI.
Methods: We conducted a retrospective study using electronic health records of Singapore General Hospital emergency department patients...
Description A novel interpretable machine learning-based framework to automate the development of a clinical scoring model for predefined outcomes. Our novel framework consists of six modules: variable ranking with machine learning, variable transformation , score derivation, model selection, domain knowledge-based score fine-tuning, and performanc...
Scoring systems are highly interpretable and widely used to evaluate time-to-event outcomes in healthcare research. However, existing time-to-event scores are predominantly created ad-hoc using a few manually selected variables based on clinician's knowledge, suggesting an unmet need for a robust and efficient generic score-generating method. AutoS...
Importance: Triage in the emergency department (ED) for admission and appropriate level of hospital care is a complex clinical judgement based on tacit understanding of the patient's likely acute course, availability of medical resources, and local practices. While a scoring tool could be valuable in triage, currently available tools have demonstra...
Background
Risk scores can be useful in clinical risk stratification and accurate allocations of medical resources, helping health providers improve patient care. Point-based scores are more understandable and explainable than other complex models and are now widely used in clinical decision making. However, the development of the risk scoring mode...
Background:
Chest pain is one of the most common complaints among patients presenting to the emergency department (ED). Causes of chest pain can be benign or life threatening, making accurate risk stratification a critical issue in the ED. In addition to the use of established clinical scores, prior studies have attempted to create predictive mode...
Objectives
To identify risk factors for inpatient mortality after patients’ emergency admission and to create a novel model predicting inpatient mortality risk.
Design
This was a retrospective observational study using data extracted from electronic health records (EHRs). The data were randomly split into a derivation set and a validation set. The...
Background: Chest pain is one of the most common complaints among patients presenting to the emergency department (ED). Causes of chest pain can be benign or life threatening, making its accurate risk stratification a critical issue in the ED. In addition to the use of established clinical scores, some studies attempted to create predictive models...