Charles Bernando’s research while affiliated with Multimedia Nusantara University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (6)


Dataset attributes.
Findings of classification model performance using scenario 1.
Findings of classification model performance using scenario 2.
Findings of classification model performance using scenario 3.
Findings of classification model performance using scenario 4.
The Impact of Augmentation and SMOTE Implementation on the Classification Models Performance: A Case Study on Student Academic Performance Dataset
  • Article
  • Full-text available

January 2024

·

7 Reads

Procedia Computer Science

Albert Verasius Dian Sano

·

Faqir M. Bhatti

·

·

[...]

·

Charles Bernando
Download

Figure 3. Receiver operating characteristic (ROC) curve and area under the curve (AUC) for (A) random forest, (B) XG­ Boost, and (C) AdaBoost.
Figure 4. Global interpretability: feature importance for the Ada­ Boost algorithm to detect and predict atherosclerotic heart disease (as visualized by summary_plot method with plot type bar in the Python library). MCH: mean cor­ puscular hemoglobin, MCHC: mean corpuscular hemoglo­ bin concentration, SHAP: Shapley Additive exPlanations.
Figure 5. Local interpretability for the AdaBoost algorithm to detect and predict atherosclerotic heart disease (as vi­ sualized by the plot method with plot type bar plot in the Python library). In data preprocessing, we converted female to 0 and male to 1. MCH: mean corpuscular hemoglobin, MCHC: mean corpuscular hemoglobin con­ centration SHAP: Shapley Additive exPlanations.
Predictor attributes and their baseline characteristics in the dataset (n = 6,837)
Predictive model performance for the test set
Understanding Arteriosclerotic Heart Disease Patients Using Electronic Health Records: A Machine Learning and Shapley Additive exPlanations Approach

July 2023

·

86 Reads

·

7 Citations

Healthcare Informatics Research

Objectives: The number of deaths from cardiovascular disease is projected to reach 23.3 million by 2030. As a contribution to preventing this phenomenon, this paper proposed a machine learning (ML) model to predict patients with arteriosclerotic heart disease (AHD). We also interpreted the prediction model results based on the ML approach and deployed modelagnostic ML methods to identify informative features and their interpretations. Methods: We used a hematology Electronic Health Record (EHR) with information on erythrocytes, hematocrit, hemoglobin, mean corpuscular hemoglobin, mean corpuscular hemoglobin concentration, leukocytes, thrombocytes, age, and sex. To detect and predict AHD, we explored random forest (RF), XGBoost, and AdaBoost models. We examined the prediction model results based on the confusion matrix and accuracy measures. We used the Shapley Additive exPlanations (SHAP) framework to interpret the ML model and quantify the contribution of features to predictions. Results: Our study included data from 6,837 patients, with 4,702 records from patients diagnosed with AHD and 2,135 records from patients without an AHD diagnosis. AdaBoost outperformed RF and XGBoost, achieving an accuracy of 0.78, precision of 0.82, F1-score of 0.85, and recall of 0.88. According to the SHAP summary bar plot method, hemoglobin was the most important attribute for detecting and predicting AHD patients. The SHAP local interpretability bar plot revealed that hemoglobin and mean corpuscular hemoglobin concentration had positive impacts on AHD prediction based on a single observation. Conclusions: ML models based on real clinical data can be used to predict AHD.


Top 10 Importance Scores of All Risk Factors Associated with Heart Disease.
Measurements of Random Forest model.
Prediction Models of Coronary Heart Disease Using Machine Learning and Deep Learning Algorithms

April 2022

·

425 Reads

·

5 Citations

Coronary heart disease (CHD), alternatively known as cardiovascular disease (CVD) is the number one cause of death in the world. Each person may develop different symptoms or no symptoms of CHD, which means that they do not know they have CHD until they experience a chest pain, a heart attack or cardiac arrest. These situations may be avoided if we are able to predict the early diagnosis of the heart disease and determine the most important risk factors associated with the disease. Currently, the accuracy of the prediction has remained inadequate and the most important risk factors have remained elusive. This research paper discusses many risk factors associated with the disease and presents the prediction models of coronary heart disease using supervised machine learning algorithms, namely Random Forest, XGBoost algorithms, as well as using Artificial Neural Network (ANN), a deep-learning-based algorithm. It uses the public dataset from the Cleveland database of UCI repository of coronary heart disease patients. The models are further optimized using Grid Search optimization algorithm. The results show that the Random Forest, XGBoost and ANN algorithms have accuracies of 81.11%, 82.22% and 86.67%, respectively. Equally important, the results of the feature importance signify the importance of maximum heart rate and nuclear stress test in predicting the early diagnosis of the disease.




Citations (5)


... While the study identifies the top 12 clinical features contributing to heart failure prediction, it does not provide detailed interpretability mechanisms to understand how these features influence the model's decisions. More transparent models or the inclusion of interpretability techniques such as SHAP (SHapley Additive exPlanations) [71,72] values could improve trust and usability among clinicians. The study primarily uses static EHR data for prediction. ...

Reference:

AI-Driven Technology in Heart Failure Detection and Diagnosis: A Review of the Advancement in Personalized Healthcare
Understanding Arteriosclerotic Heart Disease Patients Using Electronic Health Records: A Machine Learning and Shapley Additive exPlanations Approach

Healthcare Informatics Research

... RNNs can be used to predict disease Ensemble methods (EMs): EMs combine multiple ML/DL models to enhance performance. For example, XGBoost, a gradient-boosting algorithm and a widely used EM approach, has been successful in various medical applications, including cardiovascular disease prediction[83]. ...

Prediction Models of Coronary Heart Disease Using Machine Learning and Deep Learning Algorithms

... Valarmathi et al. [32] proposed a Hyper Parameter Optimization (HPO) to increase the efficiency of the RF model to predict the HD with the highest accuracy of 97.52%. The CART model is utilized to anticipate the early state of HD with an accuracy of 88.33% and recall of 84.62% to save human lives as proposed by Miranda et al. [33]. In order to diagnose CVD effectively, Velusamy et al. [19] suggested an ensemble approach by incorporating KNN, RF, and SVM classifiers. ...

Application for Early Heart Disease Prediction Based on Data Mining Approach

... A non-invasive method [24] used for predicting heart disease using LR and Stochastic Gradient Decent on 303 patient's record from UPI repository in which LR achieved accuracy of 91.67% and SDG with 80.0%. Classification methods of machine learning and image fusion are better in providing good accuracy for predicting heart diseases [25]. ...

Intelligent Computational Model for Early Heart Disease Prediction using Logistic Regression and Stochastic Gradient Descent (A Preliminary Study)