Uffe Kock Wiil’s research while affiliated with University of Melbourne and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (238)


Prescription data and demographics: An explainable machine learning exploration of colorectal cancer risk factors based on data from Danish national registries
  • Article

April 2025

·

3 Reads

Computer Methods and Programs in Biomedicine

·

Olav Sivertsen Garvik

·

·

[...]

·


Flow diagram of study selection (PRISMA chart).
Search criteria.
Studies' sample numbers.
Characteristics of included studies.
Summary of AI algorithms.
Applications of Artificial Intelligence for Metastatic Gastrointestinal Cancer: A Systematic Literature Review
  • Literature Review
  • Full-text available

February 2025

·

37 Reads

Background/Objectives: This systematic literature review examines the application of Artificial Intelligence (AI) in the diagnosis, treatment, and follow-up of metastatic gastrointestinal cancers. Methods: The databases PubMed, Scopus, Embase (Ovid), and Google Scholar were searched for published articles in English from January 2010 to January 2022, focusing on AI models in metastatic gastrointestinal cancers. Results: forty-six studies were included in the final set of reviewed papers. The critical appraisal and data extraction followed the checklist for systematic reviews of prediction modeling studies. The risk of bias in the included papers was assessed using the prediction risk of bias assessment tool. Conclusions: AI techniques, including machine learning and deep learning models, have shown promise in improving diagnostic accuracy, predicting treatment outcomes, and identifying prognostic biomarkers. Despite these advancements, challenges persist, such as reliance on retrospective data, variability in imaging protocols, small sample sizes, and data preprocessing and model interpretability issues. These challenges limit the generalizability, clinical application, and integration of AI models.

Download



Predicting patients’ sentiments about medications using artificial intelligence techniques

December 2024

·

16 Reads

The increasing development of technology has led to the increase of digital data in various fields, such as medication-related texts. Sentiment Analysis (SA) in medication is essential to give clinicians insights into patients’ feedback about the treatment procedure. Therefore, this study intends to develop Artificial Intelligence (AI) models to predict patients’ sentiments. This study used a large medication review dataset to perform a SA of medications. Three scenarios were considered for classification, including two, three, and ten classes. The Word2Vec algorithm and pre-trained word embeddings, including the general and clinical domains, were utilized in model development. Seven Machine Learning (ML) and Deep Learning (DL) models were developed for various scenarios. The best hyperparameters for all models were fine-tuned. Moreover, two ensemble learning models were developed from the proposed ML and DL models. For the first time, a technique was implemented to interpret the results for explainability and interpretability. The results showed that the developed deep ensemble model (DL_ENS), using PubMed and PMC, as pre-trained word embedding representation, achieved the best results, with accuracy and F1-Score of 92.96% and 92.27% in two classes, 92.18% and 88.50 in three classes, and 90.31% and 67.07% in ten classes, respectively. Combining DL models and developing a DL_ENS with clinical domain pre-trained word embedding representation can accurately predict classes and scores of patients’ sentiments about medications compared to previous studies on the same dataset. Due to the transparency in decision-making, our DL_ENS model can be used as an auxiliary tool to help clinicians prescribe medications.


Flowchart illustrating the LC detection from laboratory and smoking status data. (a) The composition of the study cohort. (b) The inclusion criteria for the data collection of patients who were suspicious of having LC. (c) The workflow of splitting the data into train, validation, and test sets. The train and validation sets are used for the learning process of the model and to minimize the prediction/detection error. The test set of 200 samples are utilized for the comparison between the model’s prediction and five pulmonologists diagnosis. (d) The collected data from different sources are concatenated to be used as inputs for the DES model and to be also provided for the pulmonologists in a fair manner for their diagnoses.
Comparison of evaluation metrics for the validation set using 5-fold cross-validation. (a) Models comparison using sensitivity metric. There is a significant difference between the two highest models (i.e., LGBM and SVM) and LR. (b) Models comparison using specificity metric. There is only significant difference between DES and LR. (c) Models comparison using ROC-AUC metric. There is only significant difference between DES and SVM. (d) Models comparison using F1-score metric. There is no significant difference between the models. The central marker represents mean values along with corresponding standard deviations. The horizontal brackets indicate significant differences in performance, as determined by the Nemenyi post-hoc test, with a two-sided p-value threshold of 0.05.
Assessment of the Dynamic Ensemble Selection Model (DES) through 5-fold cross-validation. (a) Average confusion matrix for 5-fold cross-validation. (b) Average ROC curve for 5-fold cross-validation. The highlighted pink area around the ROC curve represents the standard deviation of 5-fold cross-validation. (c) Calibration curve showing mean predicted probability against actual fraction of positives before and after calibration of the DES model. A decrease in overestimation is shown. (d) Decision curve analyses displaying the relationship between threshold probablilities and the net benefit when utilizing the DES-model for classification of patients at high risk of LC. This is compared to selecting all patients (grey line) or no patients (blue line). The DES-model demonstrates a higher net benefit across threshold probabilities ranging from approximately 7% to 70% compared to the other two clinical strategies. (e) SHAP summary plot with features listed in descending order of importance.
Assessment of the DES model on the 200 samples and the comparison with pulmonologists. (a) Confusion matrix representing the DES model’s prediction versus the actual diagnosis. (b) Confusion matrix of the predictions made by the averaged pulmonologists votes versus the actual diagnosis. (c) ROC curve with the individual pulmonologist’s performance marked by red marks and averaged performance marked by a green dot. (d) Correct predictions of the DES model and averaged pulmonologists in relation to the four stages of lung cancer, alongside the actual distribution of each stage.The numbers above the bars represent the total number of patients (n) in each stage, providing a reference for the proportional comparisons.
Pulmonologists-level lung cancer detection based on standard blood test results and smoking status using an explainable machine learning approach

December 2024

·

18 Reads

·

4 Citations

Lung cancer (LC) remains the primary cause of cancer-related mortality, largely due to late-stage diagnoses. Effective strategies for early detection are therefore of paramount importance. In recent years, machine learning (ML) has demonstrated considerable potential in healthcare by facilitating the detection of various diseases. In this retrospective development and validation study, we developed an ML model based on dynamic ensemble selection (DES) for LC detection. The model leverages standard blood sample analysis and smoking history data from a large population at risk in Denmark. The study includes all patients examined on suspicion of LC in the Region of Southern Denmark from 2009 to 2018. We validated and compared the predictions by the DES model with diagnoses provided by five pulmonologists. Among the 38,944 patients, 9,940 had complete data of which 2,505 (25%) had LC. The DES model achieved an area under the roc curve of 0.77±0.01, sensitivity of 76.2%±2.04%, specificity of 63.8%±2.3%, positive predictive value of 41.6%±1.2%, and F1-score of 53.8%±1.0%. The DES model outperformed all five pulmonologists, achieving a sensitivity 6.5% higher than their average. The model identified smoking status, lactate dehydrogenase, age, total calcium levels, low values of sodium, leucocytes, neutrophil count, and C-reactive protein as the most important factors for LC detection. The results highlight the successful application of the ML approach in detecting LC, surpassing pulmonologists’ performance. Incorporating clinical and laboratory data in future risk assessment models can improve decision-making and facilitate timely referrals.


Predicting stroke severity of patients using interpretable machine learning algorithms

November 2024

·

47 Reads

European Journal of Medical Research

Background Stroke is a significant global health concern, ranking as the second leading cause of death and placing a substantial financial burden on healthcare systems, particularly in low- and middle-income countries. Timely evaluation of stroke severity is crucial for predicting clinical outcomes, with standard assessment tools being the Rapid Arterial Occlusion Evaluation (RACE) and the National Institutes of Health Stroke Scale (NIHSS). This study aims to utilize Machine Learning (ML) algorithms to predict stroke severity using these two distinct scales. Methods We conducted this study using two datasets collected from hospitals in Urmia, Iran, corresponding to stroke severity assessments based on RACE and NIHSS. Seven ML algorithms were applied, including K-Nearest Neighbor (KNN), Decision Tree (DT), Random Forest (RF), Adaptive Boosting (AdaBoost), Extreme Gradient Boosting (XGBoost), Support Vector Machine (SVM), and Artificial Neural Network (ANN). Hyperparameter tuning was performed using grid search to optimize model performance, and SHapley Additive Explanations (SHAP) were used to interpret the contribution of individual features. Results Among the models, the RF achieved the highest performance, with accuracies of 92.68% for the RACE dataset and 91.19% for the NIHSS dataset. The Area Under the Curve (AUC) was 92.02% and 97.86% for the RACE and NIHSS datasets, respectively. The SHAP analysis identified triglyceride levels, length of hospital stay, and age as critical predictors of stroke severity. Conclusions This study is the first to apply ML models to the RACE and NIHSS scales for predicting stroke severity. The use of SHAP enhances the interpretability of the models, increasing clinicians’ trust in these ML algorithms. The best-performing ML model can be a valuable tool for assisting medical professionals in predicting stroke severity in clinical settings.


SPIRIT Figure. Template of content for the schedule of enrollment, interventions, and assessments. Abbreviations: PA: Physical activity; PROM: Patient-reported outcome measures; IPAQ: The International Physical Activity Questionnaire; OKS: Oxford Knee Score; EQ-5D-5L: Health-related quality of life; GPE: Global Perceived Effect
Flowchart of the study. Abbreviations: TKA: Total knee arthroplasty; mUKA: Medial uni-compartmental knee arthroplasty; PA: Physical activity; IPAQ: The International Physical Activity Questionnaire; OKS: Oxford Knee Score; EQ-5D-5L: Health-related quality of life; GPE: Global Perceived Effect
A Example of the graphical feedback screen (In Danish). B Example of the visual feedback screen. Pictures: Healthcare – SENS Innovation ApS, used with consent
Impact of motivational feedback on levels of physical activity and quality of life by activity monitoring following knee arthroplasty surgery—protocol for a randomized controlled trial nested in a prospective cohort (Knee-Activity)

October 2024

·

24 Reads

Background Evidence on how to improve daily physical activity (PA) levels following total knee arthroplasty (TKA) or medial uni-compartmental knee arthroplasty (mUKA) by motivational feedback is lacking. Moreover, it is unknown whether a focus on increased PA after discharge from the hospital improves rehabilitation, physical function, and quality of life. The aim of this randomized controlled trial (RCT) nested in a prospective cohort is (a) to investigate whether PA, physical function, and quality of life following knee replacement can be increased using an activity monitoring device including motivational feedback via a patient app in comparison with activity monitoring without feedback (care-as-usual), and (b) to investigate the potential predictive value of PA level prior to knee replacement for the length of stay, return to work, and quality of life. Methods The study is designed as a multicenter, parallel-group, superiority RCT with balanced randomization (1:1) and blinded outcome assessments. One hundred and fifty patients scheduled for knee replacement (TKA or mUKA) will be recruited through Odense University Hospital, Denmark, Vejle Hospital, Denmark and Herlev/Gentofte Sygehus, Denmark. Patients will be randomized to either 12 weeks of activity monitoring and motivational feedback via a patient app by gamification or 'care-as-usual,' including activity monitoring without motivational feedback. The primary outcome is the between-group change score from baseline to 12-week follow-up of cumulative daily accelerometer counts, which is a valid proxy for average objectively assessed daily PA. Discussion Improving PA through motivational feedback following knee replacement surgery might improve post-surgical function, health-related quality of life, and participation in everyday life. Trial registration ClinicalTrials.gov, ID: NCT06005623. Registered on 2023–08-22. Trial status Recruiting.



xECG-Beats: an explainable deep transfer learning approach for ECG-based heartbeat classification

August 2024

·

25 Reads

·

2 Citations

Network Modeling Analysis in Health Informatics and Bioinformatics

Early detection of abnormal heartbeats is of great importance for cardiologists for early diagnosis of cardiac diseases. This will help patients to receive in time diagnosis and prevention. Conventionally, physicians provide cardiac diagnoses by visual examination of electrocardiograms (ECGs). However, this can be a very time consuming and demanding task and, in some cases, may lead to overlooking and wrong diagnosis of life-threatening heart diseases. Therefore, an intelligent model can help to automatically analyze these huge amount of ECGs captured by different devices in clinical practice. A deep transfer learning approach is used to utilize the capability of different trained deep neural networks and to test them on new unseen datasets without the need to fully re-train the model. Two deep neural networks, namely, Visual Geometry Group (VGG) and Residual Network (ResNet) are utilized for classification of ECGs heartbeats. The models are evaluated using two unseen ECG datasets (i.e., SVDB and INCARTDB) by only optimizing their last classification layers. The overall area under curve for receiver operating characteristic (AUCROC) of two VGG and ResNet models are 0.961 and 0.966 on the SVDB dataset, respectively, and both models achieve 0.981 on the INCARTDB. This paper proposes an accurate and explainable model to classify ECG heartbeats into five categories recommended by the ANSI/AAMI standard. The proposed method paves the way to use pre-trained deep neural networks in real-time monitoring of heart patients using ECG data and to help clinicians understand the decision made by the models on each case using an explainable approach.


Citations (68)


... However, the required biomarkers are not assessed within routine blood testing, posing a challenge for the test's clinical utility. In contrast, our model is only based on routine laboratory markers and even performs comparable to models that integrate knowledge about smoking status or smoking history 35,36 . ...

Reference:

Deep learning-based identification of patients at increased risk of cancer using routine laboratory markers
Pulmonologists-level lung cancer detection based on standard blood test results and smoking status using an explainable machine learning approach

... Our findings reveal that ARR is the most frequently studied type of CVD, likely because it serves as an early warning sign for more severe cardiac conditions. Studies such as those by Peimankar et al. (2024) and H. Zhang et al. (2023) highlight the critical role of ARR detection in preventing adverse outcomes. Furthermore, advancements in systems capable of detecting multiple CVDs offer significant potential to improve diagnostic efficiency, particularly for patients with complex conditions. ...

xECG-Beats: an explainable deep transfer learning approach for ECG-based heartbeat classification

Network Modeling Analysis in Health Informatics and Bioinformatics

... Additionally, natural language processing was used to determine smoking status from unstructured electronic health records of patients with suspected lung cancer in Denmark, demonstrating its ability to convert unstructured text into structured data. 16 Healthcare data sources include electronic health records, wearable devices, and genetic databases. 17 Healthcare data analytics can minimize treatment costs, prevent diseases, slow disease progression, and improve quality of life, ultimately leading to life-saving outcomes. ...

Identification of patients’ smoking status using an explainable AI approach: a Danish electronic health records case study

... Desde la perspectiva de la Minería de Datos, esta contribuye significativamente a la educación, haciéndola más efectiva y eficiente al proporcionar herramientas y metodologías para extraer conocimiento útil de grandes conjuntos de datos educativos (García-Herrero et al., 2018). Esta disciplina emergente permite analizar y comprender los procesos de aprendizaje, identificar patrones de comportamiento estudiantil, evaluar la efectividad de recursos educativos y enfoques pedagógicos, y personalizar el aprendizaje (Al-Saggaf et al., 2024;Rohani et al., 2024). ...

Using data mining to discover new patterns of social media and smartphone use and emotional states

Social Network Analysis and Mining

... Table 8 shows explanations of the standard evaluation metrics of AI models considered in studies. The area under the curve (AUC) metric and other metrics such as ROC [60], Sensitivity, Precision, Specificity, and Accuracy were used in 37 articles. In six of these, AUC was the sole evaluation metric. ...

Survival prediction of glioblastoma patients using modern deep learning and machine learning techniques

... The XGBoost algorithm developed by Medial Early Sign ("MES" model) outperformed the PLCO2012 model, incorporating laboratory results and smoking history [15]. Based on the same type of variables from a Danish high-risk cohort [17], the Dynamic Ensemble Selection (DES) ML model was recently developed by Flyckt et al. [18]. The DES model demonstrated moderate performance, although not surpassing the PLCO2012 model [18]. ...

A collection of multiregistry data on patients at high risk of lung cancer—a Danish retrospective cohort study of nearly 40,000 patients

Translational Lung Cancer Research

... The current study is a secondary analysis of data from the national Danish ACQUIRE-ICD trial [NCT02976961] that compared usual care with a comprehensive 12-month web-based, and nurse-led intervention + usual care [16]. The web-based intervention included goal setting and support for instigating behavioral changes, and the possibility to have a dialogue with and continuous feedback from nursing staff. ...

Efficacy of a web-based health care innovation to AdvanCe the QUalIty of life and caRE of patients with an Implantable Cardioverter Defibrillator (ACQUIRE-ICD): A randomized controlled trial

Europace

... [16][17][18][19][20][21][22][23][24][25][26][27] Previous studies have developed models with excellent predictive performance using structured EHR or claims data. [28][29][30] However, a limitation to these previous efforts is that they focused on predicting outcomes, such as documented OUD or AUD diagnoses, which were obtained from the same EHR or claims data that was the source of the model inputs and may thus be subject to the same under-reporting. It is therefore unknown whether the same type of EHR data can be used to predict prevalent SUD, which could include patients who meet criteria but do not receive a diagnosis of SUD. ...

AUD-DSS: a decision support system for early detection of patients with alcohol use disorder

BMC Bioinformatics

... Then, from the perspective of the research object, similar research work should be conducted on the health risk assessment of different subway manufacturing plants in China (the world's factory). Furthermore, from the perspective of research methods, it is necessary to explore the application of artificial intelligence modeling methods in human health risk assessment, such as the artificial-intelligence-based risk assessment of older adults [45]. Finally, from the perspective of health risk assessment, a comprehensive evaluation index system for health issues in welding and grinding should be established based on existing research results. ...

Important steps for artificial intelligence-based risk assessment of older adults
  • Citing Article
  • August 2023

The Lancet Digital Health

... It is mainly predominant in the DRC and the rest of the Congo Basin. The clade can transmit better from person to person and is often associated with more severe clinical outcomes and higher complication rates [5]. The clinical severity and transmission rates might be higher for one of the clades compared to the other. ...

Monkeypox detection using deep neural networks

BMC Infectious Diseases