Tom Pollard’s research while affiliated with Massachusetts Institute of Technology and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (67)


882: PERFORMANCE OF THE GLOBAL OPEN SOURCE SEVERITY OF ILLNESS SCORE AT A QUATERNARY CHILDREN’S HOSPITAL
  • Article

January 2025

·

2 Reads

Critical Care Medicine

Sarah Nutman

·

Jesse Klug

·

Chelsea Bitler

·

[...]

·

Christopher Horvat


Deep learning-enabled electrocardiogram assessment of valvular heart disease

January 2025

·

3 Reads

European Heart Journal Cardiovascular Imaging

Background Valvular heart disease is a highly prevalent condition that contributes significantly to cardiovascular morbidity and mortality. Echocardiography is the gold-standard for valvular disease evaluation; however, diagnosis may be delayed due to limited resources or expertise. Deep learning analysis of electrocardiography (ECG) has demonstrated the ability to identify subtle structural and functional abnormalities within the cardiovascular system. Purpose The aim of this study was to evaluate the accuracy of a deep learning algorithm to identify valvular heart disease from a standard 12-lead ECG. Methods From 2008 to 2019, a total of 27,689 patients who were admitted to the Medical Center with an echocardiogram and ECG performed within 14 days of each other were included in the study. We trained a convolutional neural network to identify the presence of moderate or severe aortic stenosis (AS), aortic regurgitation (AR), mitral regurgitation (MR), and/or tricuspid regurgitation (TR) using a 12-lead ECG. Model performance was assessed using area under the receiver-operating characteristic (AUROC). Patients were randomly divided into development (n = 22,151; 80%) and independent validation sets (n = 5,538; 20[AN1] %). Data used for the study was collected from two open access databases, MIMIC-IV-ECG and MIMIC-IV-Note. Results Mean age of the cohort was 65.9 ± 16.6 years old and 51% were female. On the independent validation dataset, AS was present in 126 (2.3%) of patients, AR in 51 (1%), MR in 337 (6.1%), and TR in 450 (8.1 %) patients. During validation, the model accurately identified AS, AR, MR, and TR with AUROCs of 0.80 (95% CI 0.77-0.83), 0.74 (95% CI 0.69-0.80), 0.86 (95% CI 0.84-0.88), and 0.85 (95% CI 0.83-0.87), respectively (Table 1). Test characteristics were dependent on underlying prevalence and selected sensitivity levels. For a composite of any of AS, AR, MR, or TR, the model achieved a positive predictive value of 28% with 79.4% sensitivity. Conclusion A deep learning-enabled ECG demonstrates robust performance to identify patients living with valvular heart disease. This model may serve as a simple and promising tool for improving the early detection of valvular heart disease, showcasing the potential for an early development valvular disease screening program.


Tackling algorithmic bias and promoting transparency in health datasets: the STANDING Together consensus recommendations

December 2024

·

37 Reads

·

9 Citations

The Lancet Digital Health

Without careful dissection of the ways in which biases can be encoded into artificial intelligence (AI) health technologies, there is a risk of perpetuating existing health inequalities at scale. One major source of bias is the data that underpins such technologies. The STANDING Together recommendations aim to encourage transparency regarding limitations of health datasets and proactive evaluation of their effect across population groups. Draft recommendation items were informed by a systematic review and stakeholder survey. The recommendations were developed using a Delphi approach, supplemented by a public consultation and international interview study. Overall, more than 350 representatives from 58 countries provided input into this initiative. 194 Delphi participants from 25 countries voted and provided comments on 32 candidate items across three electronic survey rounds and one in-person consensus meeting. The 29 STANDING Together consensus recommendations are presented here in two parts. Recommendations for Documentation of Health Datasets provide guidance for dataset curators to enable transparency around data composition and limitations. Recommendations for Use of Health Datasets aim to enable identification and mitigation of algorithmic biases that might exacerbate health inequalities. These recommendations are intended to prompt proactive inquiry rather than acting as a checklist. We hope to raise awareness that no dataset is free of limitations, so transparent communication of data limitations should be perceived as valuable, and absence of this information as a limitation. We hope that adoption of the STANDING Together recommendations by stakeholders across the AI health technology lifecycle will enable everyone in society to benefit from technologies which are safe and effective.



Raising awareness of potential biases in medical machine learning: Experience from a Datathon
  • Preprint
  • File available

October 2024

·

32 Reads

·

1 Citation

Objective: To challenge clinicians and informaticians to learn about potential sources of bias in medical machine learning models through investigation of data and predictions from an open-source severity of illness score. Methods: Over a two-day period (total elapsed time approximately 28 hours), we conducted a datathon that challenged interdisciplinary teams to investigate potential sources of bias in the Global Open Source Severity of Illness Score. Teams were invited to develop hypotheses, to use tools of their choosing to identify potential sources of bias, and to provide a final report. Results: Five teams participated, three of which included both informaticians and clinicians. Most (4/5) used Python for analyses, the remaining team used R. Common analysis themes included relationship of the GOSSIS-1 prediction score with demographics and care related variables; relationships between demographics and outcomes; calibration and factors related to the context of care; and the impact of missingness. Representativeness of the population, differences in calibration and model performance among groups, and differences in performance across hospital settings were identified as possible sources of bias. Discussion: Datathons are a promising approach for challenging developers and users to explore questions relating to unrecognized biases in medical machine learning algorithms.

Download

Flow chart of this study.
Receiver operating characteristics (ROC) curves for mortality prediction of the developed prediction model. GBM indicates the ROC curves of our model. ASA, American Society of Anesthesiologists physical status classification; LR, Logistic regression; GBM: Gradient Boost Machine.
INSPIRE, a publicly available research dataset for perioperative medicine

June 2024

·

95 Reads

·

9 Citations

Scientific Data

We present the INSPIRE dataset, a publicly available research dataset in perioperative medicine, which includes approximately 130,000 surgical operations at an academic institution in South Korea over a ten-year period between 2011 and 2020. This comprehensive dataset includes patient characteristics such as age, sex, American Society of Anesthesiologists physical status classification, diagnosis, surgical procedure code, department, and type of anaesthesia. The dataset also includes vital signs in the operating theatre, general wards, and intensive care units (ICUs), laboratory results from six months before admission to six months after discharge, and medication during hospitalisation. Complications include total hospital and ICU length of stay and in-hospital death. We hope this dataset will inspire collaborative research and development in perioperative medicine and serve as a reproducible external validation dataset to improve surgical outcomes.


Figure 1. Study flow chart.
Figure 2. Subgroup analysis of the deep learning model by patient age, sex, and race/ethnicity using diagnostic odds ratio (OR) with 95% confidence intervals (CIs). The vertical dashed lines represent the OR of the model across all patients in the test set.
Model performance of the deep learning model on the test set (RWMA=regional wall motion abnormality, LVEF=left ventricular ejection fraction, RV=right ventricular, AUC=area under the receiver-operator curve, PPV=positive predictive value, NPV=negative predictive value, 95% CIs computed using bootstrapping with 1,000 samples)
Deep neural networks detect regional wall motion abnormalities and preclinical cardiovascular disease from 12-lead ECGs

June 2024

·

41 Reads

Background Identifying regional wall motion abnormalities (RWMAs) is critical for diagnosing and risk stratifying patients with cardiovascular disease, particularly ischemic heart disease. We hypothesized that a deep neural network could accurately identify patients with regional wall motion abnormalities from a readily available standard 12-lead electrocardiogram (ECG), reducing the need for echocardiography. Methods This observational, retrospective study included patients who were treated at Beth Israel Deaconess Medical Center and had an ECG and echocardiogram performed within 14 days of each other between 2008 and 2019. We trained a convolutional neural network to detect the presence of RWMAs, qualitative global right ventricular (RV) hypokinesis, and varying degrees of left ventricular dysfunction (left ventricular ejection fraction [LVEF] ≤50%, LVEF ≤40%, and LVEF ≤35%) identified by echocardiography, using ECG data alone. Patients were randomly split into development (80%) and test sets (20%). Model performance was assessed using area under the receiver operating characteristic curve (AUC). Cox proportional hazard models adjusted for age and sex were performed to estimate the risk of future acute coronary events. Results The development set consisted of 19,837 patients (mean age 66.7±16.4, 46.7% female) and the test set comprised of 4,953 patients (mean age 67.5±15.8 years; 46.5% female). On the test dataset, the model accurately identified the presence of RWMA, RV hypokinesis, LVEF ≤50%, LVEF ≤40%, and LVEF ≤35% with AUCs of 0.87 (95% CI 0.858-0.882), 0.888 (95% CI 0.878-0.899), 0.923 (95% CI 0.914-0.933), 0.93 (95% CI 0.921-0.939), and 0.876 (95% CI 0.858-0.896), respectively. Among patients with normal biventricular function at the time of the index ECG, those classified as having RMWA by the model were 3 times the risk (age- and sex-adjusted hazard ratio, 2.8; 95% CI 1.9-3.9) for future acute coronary events compared to those classified as negative. Conclusions We demonstrate that a deep neural network can help identify regional wall motion abnormalities and reduced LV function from a 12-lead ECG and could potentially be used as a screening tool for triaging patients who need either initial or repeat echocardiographic imaging.




Citations (49)


... At this point, international research funders and academics have widely recognized the FAIR principles and taken steps to advance its implementation 2,3 . Chue Hong et al. 4 present a community-developed adaptation of the original FAIR principles specifically for research software (i.e., FAIR4RS) 5 . This development was based on the fact that while the FAIR principles were initially developed for data, "research software is now being understood as a type of digital object to which FAIR should be applied" 5 . ...

Reference:

Awareness of FAIR and FAIR4RS among international research software funders
FAIR Principles for Research Software (FAIR4RS Principles) RDA Recommendation

... Ensuring equitable access to AI technologies is a core aspect of social sustainability. Currently, the adoption of AI solutions often favors resource-rich settings, leaving underfunded healthcare systems and low-resource regions with limited access to these advancements [60,61]. This imbalance could exacerbate global health inequities, as regions without sufficient infrastructure are unable to benefit from AI-driven diagnostic and operational improvements. ...

Tackling algorithmic bias and promoting transparency in health datasets: the STANDING Together consensus recommendations
  • Citing Article
  • December 2024

The Lancet Digital Health

... This review aimed to investigate the datasets available for development of AI health technologies targeting HF, particularly those generated from investigations used early in the diagnostic pathway. 16,17 We assess the documentation and composition of HF datasets, in particular focusing on demographic attributes to characterize "who" is represented, "how" they are represented, and any groups left behind. ...

Tackling Algorithmic Bias and Promoting Transparency in Health Datasets: The STANDING Together Consensus Recommendations
  • Citing Article
  • December 2024

NEJM AI

... This retrospective study was performed using a multicenter database of patients who underwent surgery for intestinal obstruction: the Third Affiliated Hospital of Sun Yat-sen University, Shenzhen People's Hospital, Foshan First People's Hospital, and the INSPIRE dataset. INSPIRE is a publicly available research dataset for perioperative medicine, which includes approximately 130,000 patients (50% of all surgical patients) who underwent anesthesia for surgery at an academic institution in South Korea between 2011 and 2020 [13,14]. To our knowledge, it is a new dataset that contains data for collaborative research and development in perioperative medicine. ...

INSPIRE, a publicly available research dataset for perioperative medicine

Scientific Data

... The AI system examined a wide array of data including patient demographics, medical history, lab results, vital signs, and social determinants of health to identify which patients faced the greatest risk of developing complications that would necessitate hospital readmission [25]. The healthcare system identified patients at high risk which enabled healthcare providers to arrange proactive follow-up appointments home care services and personalized discharge plans. ...

Evaluating the Impact of Social Determinants on Health Prediction in the Intensive Care Unit
  • Citing Conference Paper
  • August 2023

... Future work should also consider whether algorithmic bias (Baker and Hawn, 2022;Mansfield et al., 2022;Ray, 2023;Xiao et al., 2023) impacts the performance of LLM-based redaction, for instance, if an LLM performs more poorly for PII from less well-represented groups of learners. This is not an issue exclusive to LLMs. ...

In the Name of Fairness: Assessing the Bias in Clinical Record De-identification
  • Citing Conference Paper
  • June 2023

... These findings have importance not only in clinical practice but also in research. Techniques to limit bias encoded by the association between SDOH and eLLST are likely necessary to ensure models designed to predict mortality or emergence from disorders of consciousness are not biased against socially vulnerable patients [64]. Enhancing equity in artificial intelligence prediction models, for example, requires consistent effort to collect granular and diverse factors and integrate them with electronic health records. ...

Evaluating the Impact of Social Determinants on Health Prediction

... Many large-scale studies have compared Sepsis-2 and Sepsis-3 in terms of screening and prognostic accuracy for mortality in intensive care units (ICUs) and non-ICU patients, including emergency departments (EDs) [5][6][7][8]. Despite some inconsistencies, the overall predictive value is not satisfactory. ...

Evaluation of evolving sepsis screening criteria in discriminating suspected sepsis and mortality among adult patients admitted to the intensive care unit
  • Citing Article
  • May 2023

International Journal of Nursing Studies

... The original data for HF patients came from the Medical Information Mart for Intensive Care IV (MIMIC-IV) and the eICU Collaborative Research Database (eICU-CRD). MIMIC-IV is a freely accessible contemporary electronic health record database containing data on more than 40,000 intensive care unit patients from 2008 to 2019 [12]. Another intensive care unit database, the eICU Cooperative Research Database (eICU-CRD) [13], from which we extract the same features as MIMIC-IV. ...

Author Correction: MIMIC-IV, a freely accessible electronic health record dataset

Scientific Data

... There are many high-quality medical databases that excel in the dimensions of volume, velocity, veracity, variety, and value. Examples include the Medical Information Mart for Intensive Care (MIMIC; MIMIC-III, MIMIC-IV, MIMIC-IV-Note, etc) at the Beth Israel Deaconess Medical Center in Boston [1][2][3], the Electronic Intensive Care Unit Collaborative Research Database (eICU-CRD) from 208 hospitals in the United States, and the Amsterdam University Medical Center Database (Amsterdam UMCdb) [4][5][6]. By integrating demographic, monitoring, laboratory, imaging, pharmacy, and waveform data, these databases provide a wealth of valuable information that has facilitated clinical research [7] and expanded the evidence base for clinical practice [8]. ...

Author Correction: MIMIC-IV, a freely accessible electronic health record dataset

Scientific Data