Access to this full-text is provided by Springer Nature.
Content available from Scientific Reports
This content is subject to copyright. Terms and conditions apply.
1
Vol.:(0123456789)
Scientic Reports | (2023) 13:19654 | https://doi.org/10.1038/s41598-023-46735-3
www.nature.com/scientificreports
Machine learning vs. traditional
regression analysis for uid
overload prediction in the ICU
Andrea Sikora
1, Tianyi Zhang
2, David J. Murphy
3, Susan E. Smith
1, Brian Murray
4,
Rishikesan Kamaleswaran
5,6, Xianyan Chen
2, Mitchell S. Buckley
7, Sandra Rowe
8 &
John W. Devlin
9,10*
Fluid overload, while common in the ICU and associated with serious sequelae, is hard to predict and
may be inuenced by ICU medication use. Machine learning (ML) approaches may oer advantages
over traditional regression techniques to predict it. We compared the ability of traditional regression
techniques and dierent ML-based modeling approaches to identify clinically meaningful uid
overload predictors. This was a retrospective, observational cohort study of adult patients admitted
to an ICU ≥ 72 h between 10/1/2015 and 10/31/2020 with available uid balance data. Models to
predict uid overload (a positive uid balance ≥ 10% of the admission body weight) in the 48–72 h
after ICU admission were created. Potential patient and medication uid overload predictor variables
(n = 28) were collected at either baseline or 24 h after ICU admission. The optimal traditional logistic
regression model was created using backward selection. Supervised, classication-based ML models
were trained and optimized, including a meta-modeling approach. Area under the receiver operating
characteristic (AUROC), positive predictive value (PPV), and negative predictive value (NPV) were
compared between the traditional and ML uid prediction models. A total of 49 of the 391 (12.5%)
patients developed uid overload. Among the ML models, the XGBoost model had the highest
performance (AUROC 0.78, PPV 0.27, NPV 0.94) for uid overload prediction. The XGBoost model
performed similarly to the nal traditional logistic regression model (AUROC 0.70; PPV 0.20, NPV
0.94). Feature importance analysis revealed severity of illness scores and medication-related data
were the most important predictors of uid overload. In the context of our study, ML and traditional
models appear to perform similarly to predict uid overload in the ICU. Baseline severity of illness and
ICU medication regimen complexity are important predictors of uid overload.
Fluid overload, a frequent and unintended consequence of the resuscitation process in critically ill adults may
result in increased rates of acute kidney injury and invasive mechanical ventilation initiation, prolonged intensive
care unit (ICU) stay, and mortality1,2. Timely de-resuscitation to remove excess uid is associated with improved
outcomes3–6. While the predictors of volume responsiveness are well-established7,8, particularly in patients with
sepsis9,10, the predictors for ICU uid overload remain unclear. Development of rigorous uid overload prediction
algorithms could shorten the time to the implementation of uid overload mitigation strategies [e.g., concentra-
tion of intravenous (IV) uid products, discontinuation of maintenance uids, administration of diuretics] and
improve outcomes.
Non-diuretic ICU medication use may aect uid overload risk; preliminary data suggests the medication
regimen complexity-ICU (MRC-ICU) score is associated with both uid overload and uid balance11. is score
has also been shown to predict mortality and length of stay and also the medication interventions needed to
OPEN
1Department of Clinical and Administrative Pharmacy, University of Georgia College of Pharmacy, 1120 15th
Street, HM-118, Augusta, GA 30912, USA. 2Department of Statistics, University of Georgia Franklin College of
Arts and Sciences, Athens, GA, USA. 3Division of Pulmonary, Allergy, Critical Care and Sleep Medicine, Emory
University, Atlanta, GA, USA. 4Department of Pharmacy, University of North Carolina Medical Center, Chapel
Hill, NC, USA. 5Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, GA,
USA. 6Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA, USA. 7LaJolla
Pharmaceuticals, Waltham, USA. 8Department of Pharmacy, Oregon Health and Science University, Portland, OR,
USA. 9Northeastern University School of Pharmacy, Boston, MA, USA. 10Division of Pulmonary and Critical Care
Medicine, Brigham and Women’s Hospital, Boston, MA, USA. *email: j.devlin@northeastern.edu
Content courtesy of Springer Nature, terms of use apply. Rights reserved
2
Vol:.(1234567890)
Scientic Reports | (2023) 13:19654 | https://doi.org/10.1038/s41598-023-46735-3
www.nature.com/scientificreports/
optimize a patient’s pharmacotherapy regimen12–19. erefore, quantifying patient-specic, medication-related
data is likely an important consideration in the prediction of uid overload in critically adults2,20,21.
Event prediction in the ICU remains a perennial area of research given the many challenges that exist for clini-
cians to accurately predict clinical outcomes in the highly complex and dynamic critical care environment22,23.
Articial intelligence and machine learning techniques have been proposed as a method to improve ICU clinical
outcome prediction given their unique ability to handle multi-dimensional problems and identify novel patterns
within the vast troves of continuously-generated patient data21,24–26. However, to some ICU clinicians, the use
of articial intelligence/machine learning approaches to predict clinical events may have a ‘black-box eect,’
which can ultimately preclude implementation. e rigorous evaluation of whether articial intelligence-based
approaches predict clinical events better than traditional regression models (or clinical expertise alone) remains
a key question in critical care practice27–31.
In this study, we sought to compare the ability of machine learning approaches to traditional regression
models to predict uid overload and the individual predictors for its occurrence in critically ill adults. We
hypothesized that advanced machine learning techniques perform better than traditional regression models to
predict uid overload and that the predictors for uid overload identied through machine learning approaches
may be dierent.
Methods
We conducted a retrospective, observational study of adults admitted ICUs at the University of North Carolina
Health System (UNCHS), an integrated health system, who had uid overload data available. e protocol for
this study was approved with waivers of informed consent and HIPAA authorization granted by UNHCS Insti-
tutional Review Board (approval number: Project00001541; approval date: October 2021). Procedures followed
in the study were in accordance with the ethical standards of the of the UNHCS Institutional Review Board
and the Helsinki Declaration of 1975, as most recently amended32. e reporting of this study adheres to the
STrengthening and reporting of OBservational data in Epidemiology statement33.
Population
A random sample of 1000 adults (≥ 18years) admitted to an ICU at UNCHS between October 2015 and October
2020 was generated. Patients on their index ICU admission with uid balance data available for the rst 72h
were included (Supplemental Digital Content (SDC) Fig.S1). Patients were excluded if the admission was not
their index ICU admission.
Data collection and outcomes
De-identied UNCHS electronic health record (EHR) data (Epic Systems, Verona, WI) housed in the Carolina
Data Warehouse (CDW) was extracted by a trained CDW data analyst. e primary outcome was the presence
of uid overload at the 48–72h (i.e., day 3) aer ICU admission. Fluid overload was dened as a positive uid
balance in milliliters (mL) greater than or equal to 10% of the patient’s admission body weight in kilograms
(kg)2,34. For example, a patient with a body weight of 100kg at ICU admission having a positive uid balance at
72h of 12,000mL (or 12kg) would be considered to have uid overload. A secondary outcome was the amount
of uid overload as a function of body weight. For example, the aforementioned patient would have a uid
overload amount of 12%.
Following a literature review, and through investigator consensus, potential predictor variables for uid
overload were dened2,35–38. A total of 28potential predictors were identied: 1) ICU baseline: age ≥ 65years, sex,
admission to a medical (vs. surgical) ICU, primary ICU admission diagnosis (i.e., cardiac, chronic kidney disease,
heart failure, hepatic, pulmonary, sepsis, trauma), and select co-morbidities (i.e., chronic kidney disease, heart
failure); 2) 24h aer ICU admission: APACHE II and SOFA score (using worst values in the 24h period), use
of supportive care devices (i.e., renal replacement therapy, invasive mechanical ventilation), serum laboratory
values (i.e., albumin < 3mg/dL, bicarbonate < 22mEq/L or > 29mEq/L, chloride ≥ 110mEq/L, creatinine ≥ 1.5mg/
dL, lactate ≥ 2mmol/L, potassium ≥ 5.5mEq/L, sodium ≥ 148mEq/L or < 134mEq/L), uid ba lance (mL), and
presence of acute kidney injury (as dened by need for renal replacement therapy or serum creatinine greater
than or equal two times baseline); 3) Medication data at 24h: MRC-ICU score, vasopressor use in the rst 24h,
use of continuous medication infusions, and the number of continuous medication infusions.
Data analysis
Data missingness
Due to the hypothesis-generating nature of our study and the lack of published data on ICU uid overload
prediction, no attempt was made to estimate a study sample size. e 991 patients were split into training and
testing datasets using a 80:20 ratio. We assumed data was missing at random (MAR) (i.e., related to observed, not
unobserved values) and therefore chose Multiple Imputation by Chained Equation (MICE), rather than complete
case analysis or simple imputation, as the most appropriate approach to address missingness. Ten imputations
per variable were therefore applied for all missing data in the training and testing datasets to generate multiple
imputed training and testing datasets (SDC Fig.S1).
Machine learning models
We employed Random Forest, SVM, and XGBoost for the task of modeling the presence of uid overload39–41.
During the model training on each of the ten imputed training sets, vefold cross validation wasapplied for
Random Forest, SVM and XGBoost, using their most appropriate R package42–44, to choose the hyperparameters
for these machine learning models that resulted in the highest prediction accuracy. Each of these models was
Content courtesy of Springer Nature, terms of use apply. Rights reserved
3
Vol.:(0123456789)
Scientic Reports | (2023) 13:19654 | https://doi.org/10.1038/s41598-023-46735-3
www.nature.com/scientificreports/
then tted on the corresponding imputed training set, and predictions for probability of uid overload were
made on each of the ten imputed testing sets using the corresponding optimal model.During this phase, hyper-
parameters were tuned. For Random Forest, two hyperparameters were tuned (number of trees and number of
variables randomly sampled as candidates at each split). For SVM, linear kernel and cost of constraints violation
were tuned. For XGBoost, two hyperparameters were tuned (maximum depth of a tree and maximum number
of boosting iterations). For each model, ten dierent predictions were generated on ten dierent imputed test
sets. ese predictions of the probability for uid overload were averaged as the nal prediction.
For the degree of uid overload, we built models with the amount of uid overload at 72h. Since this is a
continuous variable, we employed their regression of the above machine learning models: Random Forest regres-
sion, SVM regression, and XGBoost regression. For XGBoost, feature importance was measured as the frequency
a feature was used in the trees. For Random Forest, feature importance was measured by mean decrease in node
impurity. Because ten dierent models were used on each imputed dataset, ten dierent feature importance lists
were generated for each. A subsequent analysis modeling uid overload as a continuous variable (percent of net
milliliters of uid by body weight) instead of dichotomous presence or absence of uid overload) was performed
(see SDC—Additional Methods S1).
Traditional regression models
Subsequently, a full logistic regression model was built for the presence of uid overload for each of the ten
complete training sets. We then applied backward elimination to select the nal model. e initial set of variables
for the variable selection were determined by the signicance of variables in the ten full models by multivariate
Wald testing45. We built our linear regression models so that the degree of uid overload was similar to that of
the ten completed training sets. In order to compare these models with the MRC-ICU only model, we also built
logistic regression and linear regression models with MRC-ICU as the sole predictor in the ten training sets.
Aer model tting, model ts were pooled using Rubin’s method46. Using the pooled models, odds ratios (OR)
and their 95% condence intervals (CI) were reported.
For each regression model, ten dierent predictions were generated on ten dierent imputed test sets as well.
ese predictions of the probability for uid overload were averaged as the nal prediction. We compared the
variables selected through backward selection with the top ve variables chosen by Random Forest (see SDC
Additional Methods S1). To further evaluate our results in those patients with high APACHE-2 (≥ 25) and high
SOFA (≥ 10) scores, we generated predictions using the backward section model (see SDC Additional Methods
S1).
Ethical approval
e protocol for this study was approved with waivers of informed consent and HIPAA authorization granted by
UNHCS Institutional Review Board (approval number: Project00001541; approval date: October 2021).
Results
A total of 49 (12.5%) of the 391 included patients had uid overload on ICU day 3. e degree of day 3 uid
overload was signicantly greater in the uid overload (vs non overload) patients (16.6% vs 2.2%, p < 0.01).
Overall, the mean APACHE II score was 15.7 ± 6.6, mean SOFA score was 8.3 ± 3.3, and MRC-ICU score was
11.8 ± 8.7. A signicantly greater proportion of uid overload patients (vs. those without) had an elevated serum
lactate ≥ 2mmol/L (32.7% vs. 14.9%, p = 0.01) and AKI (28.6% vs. 10.5%, p < 0.001) at 24h and positive uid
balance (1,840mL vs. 390mL, p < 0.001) on ICU day 3. All model covariates are summarized in Table1. At ICU
day 3, patients with uid overload (vs those without) were more likely to be dead (20.4% vs. 7.3%, p = 0.01), have
AKI (34.7% vs. 15.8%, p < 0.001), and remain on mechanical ventilation (12.7% vs. 4.2%, p = 0.05).
Among the machine learning models, XGBoost demonstrated the highest AUROC (0.78) compared to SVM
(0.69) and RF (0.76) and was associated with a PPV of 0.27 and NPV of 0.94. Notably, all models tested at
relatively poor PPV. In comparison, stepwise logistic regression had an AUROC of 0.70, PPV 0.26, and NPV
0.94. Full results are reported in Table2, and AUROC curves for all models are provided in SDC Supplemental
Fig.S2. Results of the full logistic regression are reported in SDC Supplemental TableS1. Stepwise regression
resulted in a more parsimonious model (7 variables vs. 31 variables) but demonstrated similar performance to
the machine learning models (SDC Supplementary TableS2). In the stepwise regression, presence of sepsis, male
sex, the SOFA score at 24h, and the 24h serum sodium and bicarbonate comprised the stepwise regression
model (Table2). In an analysis of MRC-ICU as a single predictor for uid overload, the model had an AUROC
of 0.74 (0.60–0.84), sensitivity 0.62 (0.35–0.85), specicity 0.70 (0.63–0.77), PPV 0.16 (0.08–0.27), and NPV
0.96 (0.90–0.98).
Feature importance graphs were plotted for XGBoost (Fig.1), RF (SDC Supplemental Fig.S3) and SVM (SDC
5 Supplemental Fig.S4). Among the 10 dierent feature importance lists generated for each model, dierences
between top features were noted. For example, for two of the machine learning models, XGBoost (Fig.2) and RF,
the top ve most important features were uid balance at 24h, SOFA score at 24h, MRC-ICU at 24h, APACHE
II at 24h, and the number of continuous infusions at 24h. While the stepwise regression model found uid
balance at 24h and APACHE II at 24h to be top features, the SOFA score at 24h, the MRC-ICU at 24h and the
number of continuous infusions were not found to be model features. e full regression results for predicting
the amount of uid overload at 72h are reported in SDC Supplemental TableS3. For stepwise regression, twelve
variables were included with uid balance, laboratory values, and severity of illness being signicant predictors
(SDC Supplemental TableS4). All models demonstrated similar performance as measured by MSE (SDC Sup-
plemental TableS5). Feature importance graphs are presented in SDC Supplemental Figs.S5–S7).
Content courtesy of Springer Nature, terms of use apply. Rights reserved
4
Vol:.(1234567890)
Scientic Reports | (2023) 13:19654 | https://doi.org/10.1038/s41598-023-46735-3
www.nature.com/scientificreports/
Table 1. Study cohort characteristics by presence of uid overload within 72h of ICU admission. Data are
presented as n (%) unless otherwise stated.
All (n = 391) Fluid overload (n = 49) No uid overload (n = 342) p-value
ICU baseline
Age ≥ 65years 202 (51.7) 19 (38.8) 183 (53.5) 0.08
Male sex 213 (54.5) 23 (46.9) 190 (55.6) 0.33
Chronic comorbidities
Chronic kidney disease 13 (3.3) 1 (2.0) 12 (3.5) 0.06
Heart failure 19 (4.9) 2 (4.1) 17 (4.9) 0.06
Admission to medical ICU 156 (39.9) 24 (48.9) 132 (38.6) 0.22
Primary ICU admission diagnosis
Cardiac 81 (20.7) 3 (6.1) 78 (22.8)
0.06
Chronic kidney disease 13 (3.3) 1 (2.0) 12 (3.5)
Hepatic 6 (1.5) 1 (2.0) 5 (1.5)
Pulmonary 58 (14.8) 8 (16.3) 50 (14.6)
Sepsis/septic shock 29 (7.4) 7 (14.3) 22 (6.4)
Trauma 10 (2.6) 3 (6.1) 7 (2.0)
24h aer ICU admission
Severity of illness, mean (SD)
APACHE II Score 15.7 (6.6) 17.5 (7.0) 15.4 (6.6) 0.06
SOFA Score 8.3 (3.3) 9.9 (4.6) 8.2 (3.1) 0.07
Supportive devices
Any renal replacement therapy 5 (1.3) 1 (2.0) 4 (1.2) 1.00
Any mechanical ventilation 140 (35.8) 21 (42.9) 119 (34.8) 0.53
Serum laboratory values
Albumin < 3mg/dL 88 (22.5) 18 (36.7) 70 (20.5) 0.02
Bicarbonate < 22mEq/L 74 (18.9) 14 (28.6) 60 (17.5) 0.16
Bicarbonate > 29mEq/L 64 (16.4) 6 (12.2) 58 (16.9)
Creatinine ≥ 1.5mg/dL 28 (7.2) 7 (14.3) 21 (6.1) 0.02
Chloride ≥ 110mEq/L 125 (31.9) 19 (38.8) 106 (30.9) 0.33
Potassium ≥ 5.5mEq/L 19 (4.9) 5 (10.2) 14 (4.1) 0.12
Lactate ≥ 2mmol/L 67 (17.1) 16 (32.7) 51 (14.9) 0.01
Sodium ≥ 148mEq/L 22 (5.6) 6 (12.2) 16 (4.7) 0.01
Sodium < 134mEq/L 33 (8.4) 4 (8.1) 29 (8.5)
Fluid balance (mL), mean (SD) 570 (1960) 1840 (301) 390 (168) < 0.001
Acute kidney injury 50 (12.8) 14 (28.6) 26 (10.5) < 0.001
Medications
MRC-ICU, mean (SD) 11.8 (8.7) 13.4 (8.4) 11.5 (8.7) 0.06
Any vasopressor 119 (30.4) 16 (32.6) 103 (30.1) 0.85
Any continuous infusions 249 (63.6) 34 (69.3) 215 (62.8) 0.47
Infusions/patient, mean (SD) 2.29 (3.3) 1.98 (2.2) 2.33 (3.4) 0.35
Table 2. Performance of presence of uid overload prediction models, mean (condence interval). AUROC
area under the receiver operating characteristic, PPV positive predictive value, NPV negative predictive value.
AURO C Accuracy Sensitivity Specicity PPV NPV
Traditional regression
All variables 0.70 (0.53, 0.82) 0.82 (0.76, 0.87) 0.43 (0.19, 0.70) 0.85 (0.79, 0.89) 0.20 (0.08, 0.37) 0.94 (0.89, 0.97)
Stepwise selected regression 0.70 (0.52, 0.82) 0.86 (0.80, 0.90) 0.43 (0.19, 0.70) 0.89 (0.84, 0.93) 0.26 (0.11, 0.47) 0.94 (0.90, 0.97)
Supervised machine learning models
Random forest 0.76 (0.62, 0.86) 0.83 (0.77, 0.88) 0.56 (0.29, 0.80) 0.8571 (0.80, 0.90) 0.25 (0.12, 0.43) 0.95 (0.91, 0.98)
Support vector machine 0.69 (0.51, 0.82) 0.80 (0.74, 0.86) 0.50 (0.24, 0.75) 0.82 (0.76, 0.88) 0.21 (0.09, 0.36) 0.94 (0.90, 0.97)
XGBoost 0.78 (0.62, 0.87) 0.87 (0.81, 0.91) 0.37 (0.15, 0.64) 0.91 (0.86, 0.94) 0.27 (0.10, 0.50) 0.94 (0.89, 0.97)
Content courtesy of Springer Nature, terms of use apply. Rights reserved
5
Vol.:(0123456789)
Scientic Reports | (2023) 13:19654 | https://doi.org/10.1038/s41598-023-46735-3
www.nature.com/scientificreports/
When variables selected through backward selection were compared with the variables chosen by the Ran-
dom Forest model, we found MRC-ICU at 24h to be highly correlated with sex-male, number of IV continuous
infusions to be highly correlated with sex-male and age—≥ 65), uid balance at 24h (mL) to be highly correlated
with admission diagnosis-sepsis/septic shock, laboratory values-serum bicarbonate, and age—≥ 65 (SDC Sup-
plemental TablesS6 and S7). ese results indicate high explanatory power exists between the backward selec-
tion and random forest variables. e vast majority of cases of uid overload occur in patients with both high
APACHE II and SOFA scores (SDC Supplemental TableS8).
Discussion
Although machine learning models have been shown to outperform traditional regression models in a variety
of settings47,48, the potential benets of machine learning in critical care remain an open eld of exploration, in
part due to a current lack of rigorous comparison in high quality ICU datasets29,49,50. Our analysis represents the
Figure1. Feature importance for presence of uid overload prediction with XGBoost.
Figure2. Most common features for presence of uid overload prediction with XGBoost imputations.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
6
Vol:.(1234567890)
Scientic Reports | (2023) 13:19654 | https://doi.org/10.1038/s41598-023-46735-3
www.nature.com/scientificreports/
rst published comparison of machine learning approaches with traditional regression methods to predict uid
overload using a novel dataset with granular medication data.
We report that machine learning and logistic regression analyses demonstrate a similar predictive power
to identify patients with uid overload on day 3 of their ICU stay. Although use of machine learning did not
appear to improve predictive performance over regression analysis, it expanded the number of variables critical
to uid overload prediction and highlights the importance of further articial intelligence-based exploration in
this area. is analysis of individual predictors may help bedside clinicians better understand how the machine
learning models work and may help overcome their ‘black box’ hesitancy to trust machine learning-generated
results51,52. For example, feature importance graphs for the machine learning analyses found complexity of the
daily ICU medication regimen (i.e., MRC-ICU score), which includes the number of intravenous medication
infusions (the primary method to administer medications in this population and a primary source of uids to a
patient), to be an important contributor to uid overload. In comparison, in the traditional multivariable regres-
sion, the MRC-ICU score was not associated with uid overload. is may be because machine learning analyses
better account for severity of illness and the response of clinicians to respond to this severity by administering
more medication infusions leading to a more complex daily medication regimen; however, the methods applied,
including feature importance, preclude causal inference at this juncture. As such, our results highlight the unique
power of machine learning to identify complex relationships that can be further elucidated via machine-learning
based causal inference modeling and other designs aimed at causation2,20.
Optimizing uid management (or uid stewardship) has been previously dened by the ROSE model of
Resuscitation, Optimization, Stabilization, and dE-resuscitation35. Aer an initial 24—48h period characterized
by overt volume resuscitation (e.g., a crystalloid bolus) and IV medication initiation (e.g., antibiotics), and the
associated uid administration, the care priority shis from volume administration to volume removal. While
comprehensive uid stewardship management strategies including reduced uid use and diuretic administra-
tion can eectively reduce uid overload and its sequelae, they are oen deployed too late1,2. Interestingly, some
reports have indicated ‘hidden uids’ (dened as blood products, enteralnutrition, ushes, and intravenous
medications) were signicantly associated with the development of uid overload. During critical illness many
of these ‘hidden uids’ are necessary (e.g., blood products), given that intravenous medications account for over
40% of total uid intake in this analysis, interventions such as concentrating intravenous medications, employ-
ing oral formulations when feasible, careful evaluation of maintenace uids, and antibiotic de-escalation are
potoentially still viable even in high illness severity that can reduce this complication. However, weighing risks
and benets associated with these interventions in thiscontext may be aided by more quantitative prediction
data56,57. Overall, de-resuscitation and uid stewardship can be deceptively complex53. In a patient with shock,
balancing the dueling forces of volume responsiveness assessment and timely volume resuscitation with the
risks associated with uid overload represents a highly complex Goldilocks scenario that requires clinicians to
have high clinical precision, essentially pivoting ‘on a dime’, from a strategy of aggressive volume expansion to
one of rapid volume removal36,54,55.
Despite the complexities of this decision process, limited prediction tools for uid overload are available
to assist clinicians at the ICU bedside. As such, real-time recognition identifying when to make the shi from
resuscitation to de-resuscitation has the potential to improve bedside management. However, to go beyond
the hourly assessment of ‘Ins and Outs’ would require accurate prediction of future uid overload risk and the
adverse events associated with it, in the time-dependent context of intervention delivery (e.g., diuretics). In such
a scenario, an algorithm would be able to accurately interpret a septic patient who is 3 L positive 24h aer uid
resuscitation initiation as being in a ‘green zone’ (i.e., appropriately resuscitated). However, 24h later, if the same
patient is 4 L positive while o vasopressors and with down-trending sepsis markers the algorithm could alert
clinicians that the patient is now in a ’yellow zone’ where interventions like diuretic therapy and uid reductions
are required to reduce acute kidney injury and intubation risk. is type of real-time predictive capability could
support continuous clinician decision-making but requires evaluation outside the scope of our current study.
Fluid overload also presents an important test case for exploring and adapting articial intelligence methods
to ICU problems, particularly those related to ICU medication use. Fluid overload represents a uniquely inter-
venable event in the ICU. Intervenable events share three key characteristics: they are predictable, preventable,
and otherwise associated with poor outcomes. e results of our study, and others, indicate that uid overload
can be predicted with modeling of some kind, especially given its ability to be quantitatively dened56–58. Fluid
overload has been associated with poor outcomes including acute kidney injury, delirium, poor respiratory out-
comes, prolonged length of stay, and potentially increasing mortality2,37,59–62. Evidence demonstrates the timely
recognition and management of uid overload is feasible and is associated with reduced mortality and time in
the ICU3,5,63,64. Notably, uid stewardship has been adapted by critical care pharmacists as key component of
comprehensive medication management5,6,65. As such, these results may support other investigations as they
identify patients in whom it is safe to initiate de-resuscitation or importantly never needed that degree of uid
volume initially and at the bedside may prompt clinicians to be more targeted in therapies initiated or aggres-
sive in curtailing early ‘hidden’ uids to avoid the complications of uid overload and/or the need for a highly
interventional period of de-resuscitation (e.g., diuretics, dialysis). Articial intelligence may be particularly well
suited to bolster these eorts, and thus while feature importance analyses cannot provide foundation for causal
inference, they may guide such future investigations.
Our study has limitations. Our patient sample may have been too small to demonstrate superiority of the
machine learning approaches compared to traditional regression, and no validation in a separate, external data-
set was undertaken at this juncture66. Future studies applying this approach to alternative, larger datasets (e.g.,
MIMIC-III) should be considered to examine the external validity of our ndings. Although MICE is the estab-
lished approach to address missingness in cohort studies that includes variables that are a composite of several
individual patient-specic values (e.g. SOFA), it is possible that some of the values in the imputed datasets that
Content courtesy of Springer Nature, terms of use apply. Rights reserved
7
Vol.:(0123456789)
Scientic Reports | (2023) 13:19654 | https://doi.org/10.1038/s41598-023-46735-3
www.nature.com/scientificreports/
represented our new ground truth may not have been accurate67. Bias may exist due to which patients had uid
balance data available. Other predictors for uid overload not included in our models may exist68. By relying on
prediction data derived in the rst 24h of ICU admission, we did not fully capture the dynamic nature of critical
illness over the entire three day ICU period before uid overload occurred. Future time-dependent evaluations
of changing features employing unsupervised learning techniques may yield novel insights.
Conclusion
Fluid overload is an important, intervenable event in the ICU population. Incorporation of medication-related
variables and articial intelligence has demonstrated promise to improve prediction that may ultimately guide
timely intervention and mitigation of this ICU complication; however, comparative advantages over traditional
modeling techniques may remain warranted.
Data availability
e datasets used and/or analyzed during the current study available from the corresponding author on reason-
able request.
Received: 1 June 2023; Accepted: 4 November 2023
References
1. Carr, J. R. et al. Fluid stewardship of maintenance Intravenous uids. J. Pharm. Pract. 897, 190 (2021).
2. Hawkins, W. A. et al. Fluid stewardship during critical illness: A call to action. J. Pharm. Pract. 33(6), 863–873 (2020).
3. Bissell, B. D. et al. Impact of protocolized diuresis for de-resuscitation in the intensive care unit. Crit. Care 24(1), 70 (2020).
4. Jones, T. W. et al. Early diuretics for de-resuscitation in septic patients with le ventricular dysfunction. Clin. Med. Insights Cardiol.
16, 11795468221095876 (2022).
5. Hawkins, W. A. et al. From theory to bedside: implementation of uid stewardship in a medical ICU pharmacy practice. Am. J.
Health Syst. Pharm. 79(12), 984–992 (2022).
6. Bissell, B. D. et al. A narrative review of pharmacologic de-resuscitation in the critically ill. J. Crit. Care 59, 156–162 (2020).
7. Messmer, A. S. et al. Fluid overload phenotypes in critical illness-a machine learning approach. J. Clin. Med. 11(2), 1 (2022).
8. Zhang, Z., Ho, K. M. & Hong, Y. Machine learning for the prediction of volume responsiveness in patients with oliguric acute
kidney injury in critical care. Crit. Care 23(1), 112 (2019).
9. R aghu, A., Komorowski, M., Celi, L. A., Szolovits, P., & Ghassemi, M. Continuous state-space models for optimal sepsis treatment:
A deep reinforcement learning approach. In Machine Learning for Healthcare Conference 2017; pp. 147–163.
10. Komorowski, M., Celi, L. A., Badawi, O., Gordon, A. C. & Faisal, A. A. e articial intelligence clinician learns optimal treatment
strategies for sepsis in intensive care. Nat. Med. 24(11), 1716–1720 (2018).
11. Olney, W. J. et al. Medication regimen complexity score as an indicator of uid balance in critically ill Patients. J. Pharm. Pract.
897, 190 (2021).
12. Sikora, A. et al. Impact of pharmacists to improve patient care in the critically ill: A large multicenter analysis using meaningful
metrics with the medication regimen complexity-ICU (MRC-ICU) score. Crit. Care Med. 50(9), 1318–1328 (2022).
13. Newsome, A. S. et al. Multicenter validation of a novel medication-regimen complexity scoring tool. Am. J. Health Syst. Pharm.
77(6), 474–478 (2020).
14. Newsome, A. S. et al. Characterization of changes in medication complexity using a modied scoring tool. Am. J. Health Syst.
Pharm. 76(Supplement 4), S92-s95 (2019).
15. Gwynn, M. E. et al. Development and validation of a medication regimen complexity scoring tool for critically ill patients. Am. J.
Health Syst. Pharm. 76(Suppl 2), S34–S40 (2019).
16. Al-Mamun, M. A., Brothers, T. & Newsome, A. S. Development of machine learning models to validate a medication regimen
complexity scoring tool for critically ill patients. Ann. Pharmacother. 55(4), 421–429 (2021).
17. Smith, S. E., Shelley, R. & Sikora, A. Medication regimen complexity vs patient acuity for predicting critical care pharmacist
interventions. Am. J. Health Syst. Pharm. 79(8), 651–655 (2022).
18. Webb, A. J., Rowe, S. & Newsome, A. S. A descriptive report of the rapid implementation of automated MRC-ICU calculations in
the EMR of an academic medical center. Am. J. Health Syst. Pharm. 79(12), 979–983 (2022).
19. Newsome, A. S. et al. Medication regimen complexity is associated with pharmacist interventions and drug-drug interactions: A
use of the novel MRC-ICU scoring tool. J. Am. Coll. Clin. Pharm. 3(1), 47–56 (2020).
20. Sanchez, P. et al. Causal machine learning for healthcare and precision medicine. R Soc. Open Sci. 9(8), 220638 (2022).
21. Iwase, S. et al. Prediction algorithm for ICU mortality and length of stay using machine learning. Sci. Rep. 12(1), 12912 (2022).
22. Beil, M. et al. On predictions in critical care: e individual prognostication fallacy in elderly patients. J. Crit. Care 61, 34–38
(2021).
23. Lovejoy, C. A., Buch, V. & Maruthappu, M. Articial intelligence in the intensive care unit. Crit. Care 23(1), 7 (2019).
24. Gutierrez, G. Articial intelligence in the intensive care unit. Crit. Care 24(1), 101 (2020).
25. Goh, K. H. et al. Articial intelligence in sepsis early prediction and diagnosis using unstructured data in healthcare. Nat. Com-
mun. 12(1), 711 (2021).
26. Hyland, S. L. et al. Early prediction of circulatory failure in the intensive care unit using machine learning. Nat. Med. 26(3), 364–373
(2020).
27. DeGrave, A. J., Janizek, J. D. & Lee, S. I. AI for radiographic COVID-19 detection selects shortcuts over signal. medRxiv 1, 1 (2020).
28. Nguyen, D., Ngo, B. & vanSonnenberg, E. AI in the intensive care unit: Up-to-date review. J. Intensive Care Med. 36(10), 1115–1123
(2021).
29. Yoon, J. H., Pinsky, M. R. & Clermont, G. Articial intelligence in critical care medicine. Crit. Care 26(1), 75 (2022).
30. Farion, K. J. et al. Comparing predictions made by a prediction model, clinical score, and physicians: Pediatric asthma exacerba-
tions in the emergency department. Appl. Clin. Inf. 4(3), 376–391 (2013).
31. Feng, J. Z. et al. Comparison between logistic regression and machine learning algorithms on survival prediction of traumatic
brain injuries. J. Crit. Care 54, 110–116 (2019).
32. World Medical Association. World Medical Association Declaration of Helsinki: Ethical principles for medical research involving
human subjects. JAMA 312, 2191–2194 (2023).
33. Von Elm, E. A. D. et al. STROBE Initiative: Strengthening the reporting of observational studies in epidemiology (STROBE) state-
ment: Guidelines for reporting observational studies. BMJ 335, 806–808 (2007).
34. Bouchard, J. et al. Fluid accumulation, survival and recovery of kidney function in critically ill patients with acute kidney injury.
Kidney Int. 76(4), 422–427 (2009).
Content courtesy of Springer Nature, terms of use apply. Rights reserved
8
Vol:.(1234567890)
Scientic Reports | (2023) 13:19654 | https://doi.org/10.1038/s41598-023-46735-3
www.nature.com/scientificreports/
35. Carr, J. R. et al. Fluid stewardship of maintenance intravenous uids. J. Pharm. Pract. 35(5), 769–782 (2022).
36. Malbrain, M. et al. Principles of uid management and stewardship in septic shock: It is time to consider the four D’s and the four
phases of uid therapy. Ann. Intensive Care 8(1), 66 (2018).
37. Claure-Del Granado, R. & Mehta, R. L. Fluid overload in the ICU: Evaluation and management. BMC Nephrol. 17(1), 109 (2016).
38. O’Connor, M. E. & Prowle, J. R. Fluid overload. Crit. Care Clin. 31(4), 803–821 (2015).
39. Chen, T., & Guestrin, C. XGBoost: A Scalable Tree Boosting System. In: Proceedings of the 22nd ACM SIGKDD International
Conference on Knowledge Discovery and Data Mining [Internet]. New York, NY, USA: ACM; 2016. p. 785–94. Available from:
https:// doi. org/ 10. 1145/ 29396 72. 29397 85.
40. Cortes, C. & Vapnik, V. Support-vector networks. Mach. Learn. 20(3), 273–297 (1995).
41. Ho, T. K. Random decision forests. In: Proceedings of 3rd international conference on document analysis and recognition. p. 278–82
(1995).
42. Liaw, A. & Wiener, M. Classication and regression by Random Forest. R News 2(3), 18–22 (2002).
43. Meyer, D., Dimitriadou, E., & Hornik, K., et al. Miscellaneous functions of the Department of Statistics and Probability eory
Group (2023). R Package version 1.7–13. https:// CRAN.R- proje ct. org/ packa ge= e1071.
44. Chen, T., He, T., Benesty, M., et al. XGBoost: Extreme gradient boosting. R package versions 1.7.5.1. https:// CRAN.R- proje ct. org/
packa ge= xgboo st.
45. Li, K. H., Raghunathan, T. E. & Rubin, D. B. Large-sample signicance levels from multiply imputed data using moment-based
statistics and an F reference distribution. J. Am. Stat. Assoc. 86, 1065–1073 (1991).
46. Rubin, D. B. Multiple imputation for nonresponse in surveys (Wiley-Interscience, Hoboken, NJ, 2004).
47. Topol, E. J. Deep medicine: how articial intelligence can make healthcare human again. First edition. pp 1 online resource (Basic
Books, New York, 2019).
48. Kahneman, D., Sibony, O., & Sunstein, C. R. Noise: A aw in human judgment. First edition. Edition (Little, Brown Spark, New
York, 2021)
49. D’Hondt, E. et al. Identifying and evaluating barriers for the implementation of machine learning in the intensive care unit. Com-
mun. Med. (Lond) 2(1), 162 (2022).
50. van de Sande, D. et al. Moving from bytes to bedside: A systematic review on the use of articial intelligence in the intensive care
unit. Intensive Care Med. 47(7), 750–760 (2021).
51. Moss, L. et al. Demystifying the black box: e importance of interpretability of predictive models in neurocritical care. Neurocrit.
Care 37(Suppl 2), 185–191 (2022).
52. e Lancet Respiratory M: Opening the black box of machine learning. Lancet Respir. Med. 6(11), 801 (2018).
53. Malbrain, M., Martin, G. & Ostermann, M. Everything you need to know about deresuscitation. Intensive Care Med. 48(12),
1781–1786 (2022).
54. Gelbart, B. et al. Fluid accumulation in mechanically ventilated, critically ill children: retrospective cohort study of prevalence and
outcome. Pediatr. Crit. Care Med. 23(12), 990–998 (2022).
55. National Heart L, Blood Institute Acute Respiratory Distress Syndrome Clinical Trials N, Wiedemann, H. P., et al. Comparison of
two uid-management strategies in acute lung injury. N. Engl. J. Med. 354(24), 2564–2575 (2006).
56. Gamble, K. C. et al. Hidden uids in plain sight: Identifying intravenous medication classes as contributors to intensive care unit
uid intake. Hosp. Pharm. 57(2), 230–236 (2022).
57. Branan, T. et al. Association of hidden uid administration with development of uid overload reveals opportunities for targeted
uid minimization. SAGE Open Med. 8, 2050312120979464 (2020).
58. Mitchell, K. H. et al. Volume Overload: prevalence, risk factors, and functional outcome in survivors of septic shock. Ann. Am.
orac. Soc. 12(12), 1837–1844 (2015).
59. Ouchi, A. et al. Association between uid overload and delirium/coma in mechanically ventilated patients. Acute Med. Surg. 7(1),
e508 (2020).
60. Murphy, C. V. et al. e importance of uid management in acute lung injury secondary to septic shock. Chest 136(1), 102–109
(2009).
61. Boyd, J. H. et al. Fluid resuscitation in septic shock: A positive uid balance and elevated central venous pressure are associated
with increased mortality. Crit. Care Med. 39(2), 259–265 (2011).
62. Woodward, C. W. et al. Fluid overload associates with major adverse kidney events in critically ill patients with acute kidney injury
requiring continuous renal replacement therapy. Crit. Care Med. 47(9), e753–e760 (2019).
63. Silversides, J. A., Perner, A. & Malbrain, M. Liberal versus restrictive uid therapy in critically ill patients. Intensive Care Med.
45(10), 1440–1442 (2019).
64. Goldstein, S. et al. Pharmacological management of uid overload. Br. J. Anaesth. 113(5), 756–763 (2014).
65. Silversides, J. A. et al. Fluid management and deresuscitation practices: A survey of critical care physicians. J. Intensive Care Soc.
21(2), 111–118 (2020).
66. Burkov, A. e hundred-page machine learning book (Quebec City, Canada, Andriy Burkov, 2019).
67. O’Keefe, A. G., Farewell, D. M., Tom, B. D. M. & Farewell, V. T. Multiple imputation of missing composite outcomes in longitudinal
data. Stat. Biosci. 8(2), 310–332 (2016).
68. Qin, X. et al. A deep learning model to identify the uid overload status in critically ill patients based on chest X-ray images. Pol.
Arch. Intern. Med. 133(2), 1 (2023).
Acknowledgements
Data acquisition were supported by NC TraCS, funded by Grant Number UL1TR002489 from the National
Center for Advancing Translations Sciences at the National Institutes of Health, and Data Analytics at the Uni-
versity of North Carolina Medical Center Department of Pharmacy.
Author contributions
A.S. was responsible for project execution, design, and initial manuscript writing. J.D., D.M., and R.K. provided
critical revisions of manuscript, data interpretation, and senior level oversight. M.Y., T.Z, and X.C. handled data
pre-processing and analysis (M.Y., T.Z.) and methodology support and data interpretation (X.C., R.K.). B.M.
served as site coordinator for all data validation and procurement as well as manuscript revisions and data inter-
pretation. S.S., M.B., and S.R. provided clinical interpretation, results interpretation, and manuscript revision.
Funding
Funding through Agency of Healthcare Research and Quality for Drs. Devlin, Murphy, Sikora, Smith, and
Kamaleswaran was provided through R21HS028485 and R01HS029009.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
9
Vol.:(0123456789)
Scientic Reports | (2023) 13:19654 | https://doi.org/10.1038/s41598-023-46735-3
www.nature.com/scientificreports/
Competing interests
e authors declare no competing interests.
Additional information
Supplementary Information e online version contains supplementary material available at https:// doi. org/
10. 1038/ s41598- 023- 46735-3.
Correspondence and requests for materials should be addressed to J.W.D.
Reprints and permissions information is available at www.nature.com/reprints.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and
institutional aliations.
Open Access is article is licensed under a Creative Commons Attribution 4.0 International
License, which permits use, sharing, adaptation, distribution and reproduction in any medium or
format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the
Creative Commons licence, and indicate if changes were made. e images or other third party material in this
article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the
material. If material is not included in the article’s Creative Commons licence and your intended use is not
permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from
the copyright holder. To view a copy of this licence, visit http:// creat iveco mmons. org/ licen ses/ by/4. 0/.
© e Author(s) 2023
Content courtesy of Springer Nature, terms of use apply. Rights reserved
1.
2.
3.
4.
5.
6.
Terms and Conditions
Springer Nature journal content, brought to you courtesy of Springer Nature Customer Service Center GmbH (“Springer Nature”).
Springer Nature supports a reasonable amount of sharing of research papers by authors, subscribers and authorised users (“Users”), for small-
scale personal, non-commercial use provided that all copyright, trade and service marks and other proprietary notices are maintained. By
accessing, sharing, receiving or otherwise using the Springer Nature journal content you agree to these terms of use (“Terms”). For these
purposes, Springer Nature considers academic use (by researchers and students) to be non-commercial.
These Terms are supplementary and will apply in addition to any applicable website terms and conditions, a relevant site licence or a personal
subscription. These Terms will prevail over any conflict or ambiguity with regards to the relevant terms, a site licence or a personal subscription
(to the extent of the conflict or ambiguity only). For Creative Commons-licensed articles, the terms of the Creative Commons license used will
apply.
We collect and use personal data to provide access to the Springer Nature journal content. We may also use these personal data internally within
ResearchGate and Springer Nature and as agreed share it, in an anonymised way, for purposes of tracking, analysis and reporting. We will not
otherwise disclose your personal data outside the ResearchGate or the Springer Nature group of companies unless we have your permission as
detailed in the Privacy Policy.
While Users may use the Springer Nature journal content for small scale, personal non-commercial use, it is important to note that Users may
not:
use such content for the purpose of providing other users with access on a regular or large scale basis or as a means to circumvent access
control;
use such content where to do so would be considered a criminal or statutory offence in any jurisdiction, or gives rise to civil liability, or is
otherwise unlawful;
falsely or misleadingly imply or suggest endorsement, approval , sponsorship, or association unless explicitly agreed to by Springer Nature in
writing;
use bots or other automated methods to access the content or redirect messages
override any security feature or exclusionary protocol; or
share the content in order to create substitute for Springer Nature products or services or a systematic database of Springer Nature journal
content.
In line with the restriction against commercial use, Springer Nature does not permit the creation of a product or service that creates revenue,
royalties, rent or income from our content or its inclusion as part of a paid for service or for other commercial gain. Springer Nature journal
content cannot be used for inter-library loans and librarians may not upload Springer Nature journal content on a large scale into their, or any
other, institutional repository.
These terms of use are reviewed regularly and may be amended at any time. Springer Nature is not obligated to publish any information or
content on this website and may remove it or features or functionality at our sole discretion, at any time with or without notice. Springer Nature
may revoke this licence to you at any time and remove access to any copies of the Springer Nature journal content which have been saved.
To the fullest extent permitted by law, Springer Nature makes no warranties, representations or guarantees to Users, either express or implied
with respect to the Springer nature journal content and all parties disclaim and waive any implied warranties or warranties imposed by law,
including merchantability or fitness for any particular purpose.
Please note that these rights do not automatically extend to content, data or other material published by Springer Nature that may be licensed
from third parties.
If you would like to use or distribute our Springer Nature journal content to a wider audience or on a regular basis or in any other manner not
expressly permitted by these Terms, please contact Springer Nature at
onlineservice@springernature.com