Michael E Matheny

Michael E Matheny
  • MD, MS, MPH
  • Professor at Vanderbilt University

About

335
Publications
49,172
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
11,270
Citations
Introduction
Michael E Matheny currently works at the Department of Biomedical Informatics, Vanderbilt University. Michael does research in Artificial Intelligence, Databases and Data Mining.
Current institution
Vanderbilt University
Current position
  • Professor
Additional affiliations
August 2007 - December 2012
Vanderbilt University
Position
  • Professor (Assistant)
July 2004 - June 2007
Partners HealthCare
Position
  • General Internal Medicine Physician
January 2004 - June 2007
Brigham and Women's Hospital
Position
  • Biomedical Informatics Fellow
Education
August 1991 - May 1997
University of Kentucky
Field of study
  • Chemical Engineering

Publications

Publications (335)
Article
OBJECTIVE To assess the association between glucagon-like peptide 1 receptor agonist (GLP-1RA) use and risk of incident thyroid tumors. RESEARCH DESIGN AND METHODS The retrospective, active-comparator new-user cohort study used international administrative claims and electronic health record databases. Participants included patients with type 2 di...
Article
Background In the IMPROVE AKI (A Cluster‐Randomized Trial of Team‐Based Coaching Interventions to Improve Acute Kidney Injury) trial, a combination of team‐based coaching and data‐driven surveillance dashboards reduced the odds of AKI following cardiac catheterization by 46%. The objective of this study was to determine if improvements in AKI outco...
Preprint
Full-text available
Background: The widespread adoption of electronic health records (EHRs) has resulted in the generation of large volumes of clinical notes. Learning algorithms and large language models (LLMs) train from these resources but are susceptible to noise irrelevant or non informative data from them. This sensitivity can lead to significant challenges, inc...
Article
Importance More than 10% of US patients undergoing endovascular procedures experience contrast-associated acute kidney injuries (AKIs), resulting in increased costs and health deficits. Prevention protocols reduce AKIs, but uptake and adherence vary greatly, and the cost-effectiveness of these interventions is unknown. Objective To analyze the cos...
Article
Background Contemporary research in peripheral artery disease (PAD) remains limited due to lack of a national registry and low accuracy of diagnosis codes to identify patients with PAD. Methods Leveraging a novel natural language processing system that identifies PAD with high accuracy using ankle‐brachial index and toe‐brachial index values, we c...
Article
Objectives While performance drift of clinical prediction models is well-documented, the potential for algorithmic biases to emerge post-deployment has had limited characterization. A better understanding of how temporal model performance may shift across subpopulations is required to incorporate fairness drift into model maintenance strategies. M...
Article
BACKGROUND The American Academy of Pediatrics recommends up to 7 days of observation for neonatal opioid withdrawal syndrome (NOWS) in infants with chronic opioid exposure. However, many of these infants will not develop NOWS, and infants with seemingly less exposure to opioids may develop severe NOWS that requires in-hospital pharmacotherapy. We a...
Article
Full-text available
Objectives To evaluate the feasibility for use of electronic health record (EHR) data in conducting adverse event surveillance among women who received mid-urethral slings (MUS) to treat stress urinary incontinence (SUI) in five health systems. Design Retrospective observational study using EHR data from 2010 through 2021. Women with a history of...
Article
Full-text available
To explore threats and opportunities and to chart a path for safely navigating the rapid changes that generative artificial intelligence (AI) will bring to clinical research, the Duke Clinical Research Institute convened a multidisciplinary think tank in January 2024. Leading experts from academia, industry, nonprofits, and government agencies high...
Preprint
Full-text available
Large language models (LLMs) hold promise for transforming healthcare, from streamlining administrative and clinical workflows to enriching patient engagement and advancing clinical decision-making. However, their successful integration requires rigorous development, adaptation, and evaluation strategies tailored to clinical needs. In this Review,...
Article
Background Fluoroquinolones (FQs) are commonly used to treat urinary tract infections (UTIs), but some studies have suggested they may increase the risk of aortic aneurysm or dissection (AA/AD). However, no large-scale international study has thoroughly assessed this risk. Methods A retrospective cohort study was conducted using a large, distribut...
Preprint
Full-text available
Objective To evaluate the validity of death ascertainment from publicly available internet media sources by benchmarking against state and Federal vital statics data for patients in two large healthcare systems from the US. Methods We extracted names and dates of birth and death from publicly available data including obituaries and memorial website...
Article
The field of artificial intelligence (AI) has entered a new cycle of intense opportunity, fueled by advances in deep learning, including generative AI. Applications of recent advances affect many aspects of everyday life, yet nowhere is it more important to use this technology safely, effectively, and equitably than in health and health care. Here,...
Article
Full-text available
Accurate detection and prevalence estimation of behavioral health conditions, such as opioid use disorder (OUD), are crucial for identifying at-risk individuals, determining treatment needs, monitoring prevention and intervention efforts, and recruiting treatment-naive participants for clinical trials. The availability of extensive health data, com...
Article
Full-text available
Electronic health records (EHRs) have emerged as resources for both the identification of adverse drug events (ADEs) and general population health surveillance, however questions remain around how best to utilize EHR data for drug safety signal identification. While the majority of signal identification research has utilized spontaneous reports and...
Preprint
Full-text available
Background: Mortality is a critical variable in healthcare research, but inconsistencies in the availability of death date and cause of death (CoD) information limit the ability to monitor medical product safety and effectiveness. Objective: To develop scalable approaches using natural language processing (NLP) and large language models (LLM) for t...
Article
Background: Hospital Score is a well-known and validated tool for predicting readmission risk among diverse patient populations. Integrating social risk factors using natural language processing with the Hospital Score may improve its ability to predict 30-day readmissions following an acute myocardial infarction. Methods: A retrospective cohort in...
Article
Objectives Traditional methods for medical device post-market surveillance often fail to accurately account for operator learning effects, leading to biased assessments of device safety. These methods struggle with non-linearity, complex learning curves, and time-varying covariates, such as physician experience. To address these limitations, we sou...
Article
Full-text available
Purpose The US Food and Drug Administration's Sentinel Innovation Center aimed to establish a query‐ready, quality‐checked distributed data network containing electronic health records (EHRs) linked with insurance claims data for at least 10 million individuals to expand the utility of real‐world data for regulatory decision‐making. Methods In thi...
Article
Full-text available
Rationale and Objective Acute kidney injury (AKI) is a common complication among hospitalized adults, but AKI prediction and prevention among adults has proved challenging. We used machine learning to update the nephrotoxic injury negated by just-in time action (NINJA), a pediatric program that predicts nephrotoxic AKI, to improve accuracy among ad...
Article
Full-text available
Objectives To assess the feasibility of assessing long-term outcomes of peripheral vascular intervention (PVI) by linking data from a clinical registry to electronic health records (EHR) data from a clinical research network. Design Observational cohort study. Setting Vascular Quality Initiative registry linked to INSIGHT Clinical Research Networ...
Article
Background Sodium-glucose cotransporter 2 inhibitors (SGLT2is) and glucagon-like peptide-1 receptor agonists (GLP-1 RAs) reduce the risk of major adverse cardiovascular events (MACE) in patients with type 2 diabetes mellitus (T2DM). However, their effectiveness relative to each other and other second-line antihyperglycemic agents is unknown, withou...
Article
Background Little is known about the association of discontinuation of sodium-glucose co-transporter-2 (SGLT2) inhibitors and glucagon-like peptide-1 receptor agonists (GLP-1 RA) with outcomes in patients with chronic kidney disease (CKD). Methods We identified adults with CKD stages 3-4 from 2005-2022 in the Veterans Affairs healthcare system. In...
Preprint
Full-text available
Background Contemporary research in peripheral artery disease (PAD) remains limited due to lack of a national registry and low accuracy of diagnosis codes to identify PAD patients in electronic health records. Methods & Results Leveraging a novel natural language processing (NLP) system that identifies PAD with high accuracy using ankle brachial i...
Article
Importance The Sentinel System is a key component of the US Food and Drug Administration (FDA) postmarketing safety surveillance commitment and uses clinical health care data to conduct analyses to inform drug labeling and safety communications, FDA advisory committee meetings, and other regulatory decisions. However, observational data are frequen...
Article
Full-text available
Background Posttraumatic stress disorder (PTSD) and traumatic brain injury (TBI) are associated with self-reported problems with cognition as well as risk for Alzheimer’s disease and related dementias (ADRD). Overlapping symptom profiles observed in cognitive disorders, psychiatric disorders, and environmental exposures (e.g., head injury) can comp...
Article
Full-text available
Background Despite efforts to enhance the quality of medication prescribing in outpatient settings, potentially inappropriate prescribing remains common, particularly in unscheduled settings where patients can present with infectious and pain-related complaints. Two of the most commonly prescribed medication classes in outpatient settings with freq...
Article
Full-text available
Introduction Acute kidney injury (AKI) is associated with increased risk of heart failure (HF). Determining the type of HF experienced by AKI survivors (heart failure with preserved or reduced ejection fraction, HFpEF or HFrEF) could suggest potential mechanisms underlying the association and opportunities for improving post-AKI care. Methods In t...
Article
Full-text available
The Phenome-Wide Association Study (PheWAS) is increasingly used to broadly screen for potential treatment effects, e.g., IL6R variant as a proxy for IL6R antagonists. This approach offers an opportunity to address the limited power in clinical trials to study differential treatment effects across patient subgroups. However, limited methods exist t...
Article
Full-text available
Reducing the prevalence of acute kidney injury (AKI) is an important patient safety objective set forth by the National Quality Forum. Despite international guidelines to prevent AKI, there continues to be an inconsistent uptake of these interventions by cardiac teams across practice settings. The IMPROVE-AKI study was designed to test the effectiv...
Article
Full-text available
Background As the enthusiasm for integrating artificial intelligence (AI) into clinical care grows, so has our understanding of the challenges associated with deploying impactful and sustainable clinical AI models. Complex dataset shifts resulting from evolving clinical environments strain the longevity of AI models as predictive accuracy and assoc...
Preprint
Full-text available
Background SGLT2 inhibitors (SGLT2is) and GLP-1 receptor agonists (GLP1-RAs) reduce major adverse cardiovascular events (MACE) in patients with type 2 diabetes mellitus (T2DM). However, their effectiveness relative to each other and other second-line antihyperglycemic agents is unknown, without any major ongoing head-to-head trials. Methods Across...
Article
Full-text available
Objective Assigning outcome labels to large observational data sets in a timely and accurate manner, particularly when outcomes are rare or not directly ascertainable, remains a significant challenge within biomedical informatics. We examined whether noisy labels generated from subject matter experts’ heuristics using heterogenous data types within...
Preprint
Full-text available
Objective: Assigning outcome labels to large observational data sets in a timely and accurate manner, particularly when outcomes are rare or not directly ascertainable, remains a significant challenge within biomedical informatics. We examined whether noisy labels generated from subject matter experts' heuristics using heterogenous data types withi...
Article
Full-text available
Standardized operational definitions are an important tool to improve reproducibility of research using secondary real-world healthcare data. This approach was leveraged for studies evaluating the effectiveness of AZD7442 as COVID-19 pre-exposure prophylaxis across multiple healthcare systems. Value sets were defined, grouped, and mapped. Results o...
Article
Full-text available
Introduction: Angiotensin converting enzyme inhibitors (ACEI) and angiotensin receptor blockers (ARB) improve outcomes but are underutilized in patients with chronic kidney disease (CKD). Little is known about reasons for discontinuation and lack of reinitiating these medications. We aimed to explore clinicians’ and patients’ experiences and percep...
Preprint
Post marketing safety surveillance depends in part on the ability to detect concerning clinical events at scale. Spontaneous reporting might be an effective component of safety surveillance, but it requires awareness and understanding among healthcare professionals to achieve its potential. Reliance on readily available structured data such as diag...
Article
Chronic age-related imbalance is a common cause of falls and subsequent death in the elderly and can arise from dysfunction of the vestibular system, an elegant neuroanatomical group of pathways that mediates human perception of acceleration, gravity, and angular head motion. Studies indicate that 27–46% of the risk of age-related chronic imbalance...
Article
Full-text available
Background Measuring the effectiveness of preventing and treating viruses like SARS-CoV-2 poses a challenge in understanding the variants’ susceptibility and resistance to being neutralized. Ideally, each breakthrough case would be sequenced, but in real-world settings, we rely on surveillance samples and estimated date cutoffs to determine effecti...
Article
Objective We examined the influence of 4 different risk information formats on inpatient nurses’ preferences and decisions with an acute clinical deterioration decision-support system. Materials and methods We conducted a comparative usability evaluation in which participants provided responses to multiple user interface options in a simulated set...
Article
Full-text available
Significance Statement African Americans are at increased risk of CKD in part due to high-risk (HR) variants in the apolipoprotein L1 ( APOL1 ) gene, termed G1/G2. A different APOL1 variant, p.N264K , reduced the risk of CKD and ESKD among carriers of APOL1 HR variants to levels comparable with individuals with APOL1 low-risk variants in an analysi...
Article
Full-text available
Objective To assess the uptake of second line antihyperglycaemic drugs among patients with type 2 diabetes mellitus who are receiving metformin. Design Federated pharmacoepidemiological evaluation in LEGEND-T2DM. Setting 10 US and seven non-US electronic health record and administrative claims databases in the Observational Health Data Sciences a...
Article
Introduction: The pitfalls of label leakage, contamination of model input features with outcome information, are well established. Unfortunately, avoiding label leakage in clinical prediction models requires more nuance than the common advice of applying "no time machine rule." Framework: We provide a framework for contemplating whether and when...
Article
Study design: A 5-year longitudinal, retrospective, cohort study. Objectives: Develop a prediction model based on electronic health record (EHR) data to identify veterans with spinal cord injury/diseases (SCI/D) at highest risk for new pressure injuries (PIs). Setting: Structured (coded) and text EHR data, for veterans with SCI/D treated in a...
Article
Background: Rapid progression of chronic kidney disease (CKD) is associated with poor clinical outcomes. Despite extensive study of the genetics of cross-sectional estimated glomerular filtration rate (eGFR), only a few loci associated with eGFR decline over time have been identified. Methods: We performed a meta-analysis of genome-wide associat...
Article
Background: Angiotensin converting enzyme inhibitors (ACEi) and angiotensin receptor blockers (ARB) are frequently discontinued in patients with chronic kidney disease (CKD). Documented adverse drug reactions (ADRs) in medical records may provide insight into the reasons for treatment discontinuation. Methods: In this retrospective cohort of U.S...
Article
Introduction: Lung cancer screening with low-dose computed tomography (LDCT) is widely underutilized. Organizational factors, such as readiness for change and belief in the value of change (change valence), may contribute to underutilization. The aim of this study was to evaluate the association between healthcare organizations' preparedness and l...
Article
Full-text available
Background: Validating new algorithms, such as methods to disentangle intrinsic treatment risk from risk associated with experiential learning of novel treatments, often requires knowing the ground truth for data characteristics under investigation. Since the ground truth is inaccessible in real world data, simulation studies using synthetic datas...
Article
Purpose: Understanding the real-world safety of paclitaxel (PTX) coated devices for treating lower extremity peripheral artery disease remains a high clinical priority of the Food and Drug Administration. Materials and methods: Data from FAIR Health, the largest commercial claims data warehouse in the US, was used for this study. The study consi...
Article
Rationale & objective: Sodium-glucose co-transporter 2 inhibitors (SGLT2i) are recommended for type 2 diabetes mellitus (T2DM) in patients with chronic kidney disease (CKD) or atherosclerotic cardiovascular disease (ASCVD). We evaluated factors associated with SGLT2i prescription, disparities by race and sex, and facility-level variation in prescr...
Article
Full-text available
Background: Super-utilizers consume the greatest share of resource intensive healthcare (RIHC) and reducing their utilization remains a crucial challenge to healthcare systems in the United States (U.S.). The objective of this study was to predict RIHC among U.S. counties, using routinely collected data from the U.S. government, including informat...
Article
Background: Up to 14% of patients in the United States undergoing cardiac catheterization each year experience AKI. Consistent use of risk minimization preventive strategies may improve outcomes. We hypothesized that team-based coaching in a Virtual Learning Collaborative (Collaborative) would reduce postprocedural AKI compared with Technical Assi...
Article
Background : Characterization studies of COVID-19 patients with chronic obstructive pulmonary disease (COPD) are limited in size and scope. The aim of the study is to provide a large-scale characterization of COVID-19 patients with COPD. Methods : We included thirteen databases contributing data from January-June 2020 from North America (US), Europ...
Preprint
Full-text available
Objectives To assess the uptake of second-line antihyperglycemic agents among patients with type-2 diabetes mellitus (T2DM) receiving metformin. Design Serial cross-sectional study (2011-2021). Setting Ten US and seven non-US electronic health record and administrative claims databases in the Observational Health Data Sciences and Informatics netwo...
Article
Background Postoperative infections constitute more than half of all postoperative complications. Surveillance of these complications is primarily done through manual chart review, which is time consuming, expensive, and typically only covers 10% to 15% of all operations. Automated surveillance would permit the timely evaluation of and reporting of...
Article
Background: Current methods to identify cardiac implantable electronic devices (CIED) lead failure include post-approval studies, which may be limited in scope, participant numbers, and attrition; studies relying on administrative codes, which lack specificity; and voluntary adverse event reporting, which cannot determine incidence or attribution...
Article
Full-text available
Objectives Generating and using real-world evidence (RWE) is a pragmatic solution for evaluating health technologies. RWE is recognized by regulators, health technology assessors, clinicians, and manufacturers as a valid source of information to support their decision-making. Well-designed registries can provide RWE and become more powerful when li...
Article
Full-text available
Purpose: Alpha-1 blockers, often used to treat benign prostatic hyperplasia (BPH), have been hypothesized to prevent COVID-19 complications by minimising cytokine storm release. The proposed treatment based on this hypothesis currently lacks support from reliable real-world evidence, however. We leverage an international network of large-scale heal...
Article
Full-text available
As the implementation of artificial intelligence (AI)-enabled tools is realized across diverse clinical environments, there is a growing understanding of the need for ongoing monitoring and updating of prediction models. Dataset shift—temporal changes in clinical practice, patient populations, and information systems—is now well-documented as a sou...
Article
Background The total impact of the current COVID-19 pandemic on cancer screenings and diagnostic procedures by race and ethnicity has not yet been fully characterized. Methods In this study, we compared the ethnic and racial differences in cancer screening for breast, colon, and prostate cancer and compared them to population-level SARS-CoV2 infec...
Article
BACKGROUND The utility of quality dashboards to inform decision-making and improve clinical outcomes is tightly linked to the accuracy of the information they provide and, in turn, accuracy of underlying prediction models. Despite recognition of the need to update prediction models to maintain accuracy over time, there is limited guidance on updati...
Article
Full-text available
Background and Aims Gastrointestinal (GI) symptoms are well-recognized manifestations of coronavirus disease 2019 (COVID-19). Our primary objective was to evaluate the association between GI symptoms and COVID-19 severity. Methods In this nationwide cohort of US veterans, we evaluated GI symptoms (nausea/vomiting/diarrhea) reported 30 days prior t...
Article
Full-text available
Importance: Acute kidney injury (AKI) is a heterogeneous syndrome prevalent among hospitalized patients. Personalized risk estimation and risk factor identification may allow effective intervention and improved outcomes. Objective: To develop and validate personalized AKI risk estimation models using electronic health records (EHRs), examine whe...
Article
Full-text available
Objectives To assess the feasibility of using electronic health record (EHR) derived clinical data within an active surveillance setting to evaluate the safety of a novel intervertebral body implant (IVBI) stabilization device. Design Retrospective, longitudinal observational cohort study comparing clinical outcomes for patients seen through 1 yea...
Article
Full-text available
Background P2Y12 inhibitor medications are critical following percutaneous coronary intervention (PCI); however, adherence remains suboptimal. Our objective was to assess the effectiveness of a multifaceted intervention to improve P2Y12 inhibitor adherence following PCI. Methods and Results This was a modified stepped wedge trial of 52 eligible ho...
Article
Full-text available
Given the growing number of prediction algorithms developed to predict COVID-19 mortality, we evaluated the transportability of a mortality prediction algorithm using a multi-national network of healthcare systems. We predicted COVID-19 mortality using baseline commonly measured laboratory values and standard demographic and clinical covariates acr...
Article
Background: Current important regulatory resources and methods to identify cardiac implantable electronic devices (CIED) lead failure include post-approval studies, which may be limited in scope, participant numbers, and by attrition; studies relying on administrative codes, which lack specificity in multi-chamber systems; and voluntary adverse eve...
Article
Full-text available
PurposeStatin-associated side effects (SASEs) can limit statin adherence and present a potential barrier to optimal statin utilization. How standardized reporting of SASEs varies across medical facilities has not been well characterized.Methods We assessed facility-level variation in SASE reporting among patients with atherosclerotic cardiovascular...
Article
Full-text available
Background Veterans Health Administration (VHA) issued policy for lung cancer screening; resources at eight Veterans Affairs medical centers (VAMCs) in a demonstration project (DP) from 2013 to 2015. Research Question Do policies that provide resources increase lung cancer screening rates? Study Design and Methods Data from 8 DP VAMCs (DP group)...
Article
Full-text available
OBJECTIVES/GOALS: Using the covariate-rich Veteran Health Administration data, estimate the association between Proton Pump Inhibitor (PPI) use and severe COVID-19, rigorously adjusting for confounding using propensity score (PS)-weighting. METHODS/STUDY POPULATION: We assembled a national retrospective cohort of United States veterans who tested p...
Article
Full-text available
Background Social risk factors influence rehospitalization rates yet are challenging to incorporate into prediction models. Integration of social risk factors using natural language processing (NLP) and machine learning could improve risk prediction of 30‐day readmission following an acute myocardial infarction. Methods and Results Patients were e...
Article
Full-text available
Background : Characterization studies of COVID-19 patients with chronic obstructive pulmonary disease (COPD) are limited in size and scope. The aim of the study is to provide a large-scale characterization of COVID-19 patients with COPD. Methods : We included thirteen databases contributing data from January-June 2020 from North America (US), Europ...
Article
Full-text available
Purpose Routinely collected real world data (RWD) have great utility in aiding the novel coronavirus disease (COVID-19) pandemic response. Here we present the international Observational Health Data Sciences and Informatics (OHDSI) Characterizing Health Associated Risks and Your Baseline Disease In SARS-COV-2 (CHARYBDIS) framework for standardisati...
Article
Many clinical natural language processing methods rely on non-contextual word embedding (NCWE) or contextual word embedding (CWE) models. Yet, few, if any, intrinsic evaluation benchmarks exist comparing embedding representations against clinician judgment. We developed intrinsic evaluation tasks for embedding models using a corpus of radiology rep...
Article
Background Despite its high prevalence and clinical impact, research on peripheral artery disease (PAD) remains limited due to poor accuracy of billing codes. Ankle-brachial index (ABI) and toe-brachial index can be used to identify PAD patients with high accuracy within electronic health records. Methods We developed a novel natural language proc...
Article
Artificial intelligence (AI) is poised to significantly impact healthcare systems, including clinical diagnosis, healthcare administration and delivery, and public health infrastructures. In the context of the Quintuple Aim of healthcare (patient outcomes, cost reduction, population impact, provider wellness, and equity/inclusion), this chapter dis...

Network

Cited By