Afreen Khan’s research while affiliated with Integral University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (8)


Hybrid-clinical model architecture.
Workflow design for data cleansing framework.
Five-step process: step forward feature selection.
Ten-fold repeated stratified cross-validation.
3 × 3 confusion matrix.

+1

Development of a robust parallel and multi-composite machine learning model for improved diagnosis of Alzheimer's disease: correlation with dementia-associated drug usage and AT(N) protein biomarkers
  • Article
  • Full-text available

September 2024

·

71 Reads

·

1 Citation

Afreen Khan

·

·

·

[...]

·

Introduction Machine learning (ML) algorithms and statistical modeling offer a potential solution to offset the challenge of diagnosing early Alzheimer's disease (AD) by leveraging multiple data sources and combining information on neuropsychological, genetic, and biomarker indicators. Among others, statistical models are a promising tool to enhance the clinical detection of early AD. In the present study, early AD was diagnosed by taking into account characteristics related to whether or not a patient was taking specific drugs and a significant protein as a predictor of Amyloid-Beta (Aβ), tau, and ptau [AT(N)] levels among participants. Methods In this study, the optimization of predictive models for the diagnosis of AD pathologies was carried out using a set of baseline features. The model performance was improved by incorporating additional variables associated with patient drugs and protein biomarkers into the model. The diagnostic group consisted of five categories (cognitively normal, significant subjective memory concern, early mildly cognitively impaired, late mildly cognitively impaired, and AD), resulting in a multinomial classification challenge. In particular, we examined the relationship between AD diagnosis and the use of various drugs (calcium and vitamin D supplements, blood-thinning drugs, cholesterol-lowering drugs, and cognitive drugs). We propose a hybrid-clinical model that runs multiple ML models in parallel and then takes the majority's votes, enhancing the accuracy. We also assessed the significance of three cerebrospinal fluid biomarkers, Aβ, tau, and ptau in the diagnosis of AD. We proposed that a hybrid-clinical model be used to simulate the MRI-based data, with five diagnostic groups of individuals, with further refinement that includes preclinical characteristics of the disorder. The proposed design builds a Meta-Model for four different sets of criteria. The set criteria are as follows: to diagnose from baseline features, baseline and drug features, baseline and protein features, and baseline, drug and protein features. Results We were able to attain a maximum accuracy of 97.60% for baseline and protein data. We observed that the constructed model functioned effectively when all five drugs were included and when any single drug was used to diagnose the response variable. Interestingly, the constructed Meta-Model worked well when all three protein biomarkers were included, as well as when a single protein biomarker was utilized to diagnose the response variable. Discussion It is noteworthy that we aimed to construct a pipeline design that incorporates comprehensive methodologies to detect Alzheimer's over wide-ranging input values and variables in the current study. Thus, the model that we developed could be used by clinicians and medical experts to advance Alzheimer's diagnosis and as a starting point for future research into AD and other neurodegenerative syndromes.

Download


Development of a three tiered cognitive hybrid machine learning algorithm for effective diagnosis of Alzheimer’s disease

July 2022

·

31 Reads

·

33 Citations

Journal of King Saud University - Computer and Information Sciences

Alzheimer's disease (AD) is one of the most frequent neurodegenerative disorders in the elderly subjects. Since early detection can prevent or delay cognitive decline in the older subjects, it is desirable to develop effectual protocols for the diagnosis of the disease. Most of the existing diagnostic tools fail to improvise timely disease prognosis in susceptible patients. Keeping this fact into consideration, we developed a cognitive-based 3-tiered machine learning (ML) algorithm employing baseline characteristics to predict AD or mild cognitive impairment (MCI) to construct psychometric test results. Earlier machine learning based AD diagnosis methods used a binary or multinomial classification technique. We relied on the development of a sophisticated hybrid cognitive ML algorithm that provides an accurate and precise prediction of the disease. We built an ML model using cognitive and demographic data. The prediction method consisted of a three-step process. Alzheimer’s Disease Neuroimaging Initiative (ADNI) database was used to develop a novel prediction algorithm. Considering the fact that nineteen ML and deep learning classifiers could not adequately classify ADNI data, we created a 2-layer model stacking procedure. Model stacking outperformed six ML classifier combinations, including Logistic Regression, Naïve Bayes, Support Vector Machine, Decision Trees, Random Forest, and eXtreme Gradient Boosting. The performance of the as-proposed model was evaluated employing seven performance assessment measures and four classification error indicators. Each model was evaluated in three separate strategical assessment modules. In the first experiment, XGB, Random Forest, and SVM achieved 89.63% accuracy, while Random Forest achieved 93.90% accuracy in the second experiment. Experiment 2 improved the classification and performance of overall prediction. In the third experiment, hybrid modeling, the accuracy increased significantly, with experiment 1 giving 90.24% accuracy and experiment 2 yielding 95.12% accuracy. The as-proposed model successfully predicted early AD and MCI in an effective manner. We were able to reduce nineteen classifiers into four classifiers (from experiment-1) and six classifiers (from experiment-2) and subsequently into one meta-learner (19 → 4 → 1 and 19 → 6 → 1), with high predictive power. Finally, we performed a thorough comparative analysis of different ADNI datasets to validate our findings.


FIGURE 3. Partial Dependence for CDRSB
FIGURE 5. ML Explainability of Potential Biomarkers. (where A, B, C and D in RAVLT (Rey Auditory Verbal Learning Test) are the sum of 5 trials, trial 5-trial 1, trial 5-delayed and percent forgetting)
Prospectives of Big Data Analytics and Explainable Machine Learning in Identification of Probable Biomarkers of Alzheimer's disease

April 2021

·

166 Reads

·

1 Citation

The recent advancement in the healthcare domain results in the generation of a large amount of clinical, imaging, and medication data. The extensive analysis of such data targets employing big data analytics helps in the timely identification of various diseases which thereby aids in building precautionary measures. Alzheimer's disease (AD) is the most common neurodegenerative disorder worldwide. To find an effective management strategy for AD; clinical, biological, and behavioral data of various cohorts are gathered, managed, and broadcasted through various AD coordinating centers. In general, the collected data used to be imbalanced, incongruent, heterogeneous, and sparse. In the present study, we employed the big data technology and machine learning correlate for modeling the bulky Alzheimer's disease Neuroimaging Initiative (ADNI) dataset to identify the potential biomarkers of AD. A total of 12741 data values and 1907 clinical variables for 1738 subjects were used. About 20 variables out of this AD big data, were identified as the most suitable biomarkers for the prediction of AD respectively. Through ML explainability modeling, we identified the correlation and significance of various cognitive, MRI, PET, and CSF metrics in contrast to the risk factors i.e., age and APOE4. The approach used in this study could be beneficial for AD-based research enrichment in pre-clinical tests, where enrolling patients at the jeopardy of cognitive degeneration is critical for verifying the efficiency of the study.



Severity Model Based Prediction of Early Trend and Pattern Recognition of the COVID-19 Infection in India: Exploratory Data Analysis and Machine Learning Study (Preprint)

May 2020

·

16 Reads

UNSTRUCTURED Objective: Recent Coronavirus Disease 2019 (COVID-19) pandemic has inflicted the whole world critically. Despite the fact that India has not been listed amongst the top ten highly affected countries, one cannot rule out COVID-19 associated complications in the near future. The accumulative testing facilities has resulted in exponential increase in COVID-19 infection cases. In figures, the number of positive cases have risen up to 33,614 as of 30 April, 2020. Keeping into consideration the serious consequences of pandemic, we aim to establish correlations between the numerous features which was acquired from the various Indian-based COVID datasets, and the impact of the containment of the pandemic on the current state of Indian population using machine learning approach. We aim to build the COVID-19 severity model employing logistic function which determines the inflection point and help in prediction of the future number of confirmed cases. Methods: An empirical study was performed on the COVID-19 patient status in India. We performed the study commencing from 30 January, 2020 to 30 April, 2020 for the analysis. We applied the machine learning (ML) approach to gain the insights about COVID-19 incidences in India. Several diverse exploratory data analysis ML tools and techniques were applied to establish a correlation amongst the various features. Also, the acute stage of the disease was mapped in order to build a robust model. Results: We collected five different datasets to execute the study. The data sets were integrated extract the essential details. We found that men were more prone to get infected of the coronavirus disease as compared to women. Also, the age group was the middle-young age of patients. On 92-days based analysis, we found a trending pattern of number of confirmed, recovered, deceased and active cases of COVID-19 in India. The as-developed growth model provided an inflection point of 85.0 days. It also predicted the number of confirmed cases as 48,958.0 in the future i.e. after 30th April. Growth rate of 13.06 percent was obtained. We achieved statistically significant correlations amongst growth rate and predicted COVID-19 confirmed cases. Conclusion: This study demonstrated the effective application of exploratory data analysis and machine learning in building a mathematical severity model for COVID-19 in India.


Longitudinal MRI as a potential correlate of Exploratory Data Analysis in the diagnosis of Alzheimer disease (Preprint)

April 2019

·

15 Reads

·

15 Citations

JMIR Biomedical Engineering

Background Alzheimer disease (AD) is a degenerative progressive brain disorder where symptoms of dementia and cognitive impairment intensify over time. Numerous factors exist that may or may not be related to the lifestyle of a patient that result in a higher risk for AD. Diagnosing the disorder in its beginning period is important, and several techniques are used to diagnose AD. A number of studies have been conducted on the detection and diagnosis of AD. This paper reports the empirical study performed on the longitudinal-based magnetic resonance imaging (MRI) Open Access Series of Brain Imaging dataset. Furthermore, the study highlights several factors that influence the prediction of AD. Objective This study aimed to correlate the effect of various factors such as age, gender, education, and socioeconomic background of patients with the development of AD. The effect of patient-related factors on the severity of AD was assessed on the basis of MRI features, Mini-Mental State Examination (MMSE), Clinical Dementia Rating (CDR), estimated total intracranial volume (eTIV), normalized whole brain volume (nWBV), and Atlas Scaling Factor (ASF). Methods In this study, we attempted to establish the role of longitudinal MRI in an exploratory data analysis (EDA) of AD patients. EDA was performed on the dataset of 150 patients for 343 MRI sessions (mean age 77.01 [SD 7.64] years). The T1-weighted MRI of each subject on a 1.5-Tesla Vision (Siemens) scanner was used for image acquisition. Scores of three features, MMSE, CDR, and ASF, were used to characterize the AD patients included in this study. We assessed the role of various features (ie, age, gender, education, socioeconomic status, MMSE, CDR, eTIV, nWBV, and ASF) on the prognosis of AD. Results The analysis further establishes the role of gender in the prevalence and development of AD in older people. Moreover, a considerable relationship has been observed between education and socioeconomic position on the progression of AD. Also, outliers and linearity of each feature were determined to rule out the extreme values in measuring the skewness. The differences in nWBV between CDR=0 (nondemented), CDR=0.5 (very mild dementia), and CDR=1 (mild dementia) are significant (ie, P<.01). Conclusions A substantial correlation has been observed between the pattern and other related features of longitudinal MRI data that can significantly assist in the diagnosis and determination of AD in older patients.


Longitudinal MRI as a potential correlate of Exploratory Data Analysis in the diagnosis of Alzheimer disease (Preprint)

April 2019

·

46 Reads

BACKGROUND Alzheimer’s disease (AD) is a degenerative progressive brain disorder where symptoms of dementia and cognitive impairment intensify over time. Numerous factors exist which may or may not be related to the lifestyle of a patient, can trigger off a higher risk for AD. Diagnosing the disorder in its beginning period is of incredible significance and several techniques are used to diagnose AD. A number of studies have been conducted for the detection and diagnosis of AD. This paper reports the empirical study performed on the longitudinal-based MRI OASIS data set. Furthermore, the study highlights several factors which influence in the prediction of AD. OBJECTIVE This study aims to examine the effect of longitudinal MRI data in demented and non-demented older adults. The purpose of this study is to investigate and report the correlation among various MRI features, in particular, the role of different scores obtained while MR image acquisition. METHODS In this study, we attempted to establish the role of the longitudinal magnetic resonance imaging (MRI) in exploratory data analysis (EDA) of AD patients. EDA was performed on the dataset of 150 patients for 343 MRI sessions [Mean age ± SD = 77.01 ± 7.64]. T1-weighted MRI of each subject on a 1.5-T Vision scanner was used for the image acquisition. Scores of three features, viz.- mini-mental state examination (MMSE), clinical dementia rating (CDR), and atlas scaling factor (ASF) were used to characterize the AD patients included in this study. We assessed the role of various features i.e. age, gender, education, socioeconomic status, MMSE, CDR, estimated total intracranial volume, normalized whole brain volume and ASF in the prognosis of AD. RESULTS The analysis further establishes the role of gender in prevalence and development of AD in older people. Moreover, a considerable relationship has been observed between education and socioeconomic position on the progression of AD. Also, outliers and linearity of each feature were determined to rule out the extreme values in measuring the skewness. The differences in nWBV between CDR = 0 (non-demented), CDR = 0.5 (very mild dementia), CDR = 1 (mild dementia) comes out to be significant i.e. p<0.01. CONCLUSIONS A substantial correlation has been observed between pattern and other related features of longitudinal MRI data that can significantly assist in the diagnosis and determination of AD in older patients.

Citations (5)


... In terms of the model's performance, its ability to distinguish between AD and non-AD is excellent. Previous studies have rarely focused only on the contribution of Tau and Aβ proteins to constructing AD diagnostic models, often incorporating some other biomarkers (Gaetani et al., 2021;Ficiarà et al., 2021;Franciotti et al., 2023;Khan et al., 2024). These studies have their own innovations and strengths, but inevitably have some shortcomings. ...

Reference:

Association between Alzheimer's disease pathologic products and age and a pathologic product-based diagnostic model for Alzheimer's disease
Development of a robust parallel and multi-composite machine learning model for improved diagnosis of Alzheimer's disease: correlation with dementia-associated drug usage and AT(N) protein biomarkers

... The results obtained in this study align with the findings from previous research [47,48] to confirm the efficacy of machine learning models in predicting dementia progression [47]. reported high accuracy scores for Logistic Regression and Gaussian Naive Bayes, with SVM achieving slightly lower accuracy. ...

A Hybrid Approach for Weak Learners Utilizing Ensemble Technique for Alzheimer’s Disease Prognosis
  • Citing Article
  • August 2023

Indian Journal of Science and Technology

... It also finds its echo in the research work by Jieke Lim et al. [4], who integrated sheltering machine learning techniques to predict TCM patterns in PCOS patients and discussed precise feature selection, which is crucial for furthering diagnostic health studies. Afreen Khan and Swaleha Zubair [5] contributed to this domain by developing a three-layered cognitive hybrid machine learning algorithm to effectively diagnose Alzheimer's disease, thus significantly enhancing the accuracy of diagnosis through a sophisticated hybrid cognitive ML model. ...

Development of a three tiered cognitive hybrid machine learning algorithm for effective diagnosis of Alzheimer’s disease
  • Citing Article
  • July 2022

Journal of King Saud University - Computer and Information Sciences

... Multiple recent papers using interpretability techniques have provided compelling results and guidelines 35 for further medical expertise, including regular 36 and multi layer multi modal 37 interpretability of the Alzheimer's disease, interpretability of ensemble learning algorithms for predicting dementia 38 and extracting explainable assessments from MRI imaging scans 39 . Hypothesis. ...

Prospectives of Big Data Analytics and Explainable Machine Learning in Identification of Probable Biomarkers of Alzheimer's disease

... To this end, various biomarkers related to the physiological, pathological, or anatomical characteristics of AD have been studied [6,7]. Cerebrospinal fluid amyloid levels [8][9][10] and magnetic resonance imaging (MRI) results [11][12][13], which include representative AD biomarkers, have been used to objectively quantify the early clinical symptoms of patients with AD [14]. However, continuously monitoring these biomarkers is unfeasible as obtaining these data incurs either high cost (eg, MRI) or great inconvenience for the patients due to the invasive nature of the procedure [15][16][17]. ...

Longitudinal MRI as a potential correlate of Exploratory Data Analysis in the diagnosis of Alzheimer disease (Preprint)
  • Citing Article
  • April 2019

JMIR Biomedical Engineering