Sparse learning and stability selection for predicting MCI to AD conversion using baseline ADNI data

Center for Evolutionary Medicine and Informatics, The Biodesign Institute, Arizona, State University, Tempe, AZ, USA. .
BMC Neurology (Impact Factor: 2.04). 06/2012; 12(1):46. DOI: 10.1186/1471-2377-12-46
Source: PubMed


Patients with Mild Cognitive Impairment (MCI) are at high risk of progression to Alzheimer’s dementia. Identifying MCI individuals with high likelihood of conversion to dementia and the associated biosignatures has recently received increasing attention in AD research. Different biosignatures for AD (neuroimaging, demographic, genetic and cognitive measures) may contain complementary information for diagnosis and prognosis of AD.

We have conducted a comprehensive study using a large number of samples from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) to test the power of integrating various baseline data for predicting the conversion from MCI to probable AD and identifying a small subset of biosignatures for the prediction and assess the relative importance of different modalities in predicting MCI to AD conversion. We have employed sparse logistic regression with stability selection for the integration and selection of potential predictors. Our study differs from many of the other ones in three important respects: (1) we use a large cohort of MCI samples that are unbiased with respect to age or education status between case and controls (2) we integrate and test various types of baseline data available in ADNI including MRI, demographic, genetic and cognitive measures and (3) we apply sparse logistic regression with stability selection to ADNI data for robust feature selection.

We have used 319 MCI subjects from ADNI that had MRI measurements at the baseline and passed quality control, including 177 MCI Non-converters and 142 MCI Converters. Conversion was considered over the course of a 4-year follow-up period. A combination of 15 features (predictors) including those from MRI scans, APOE genotyping, and cognitive measures achieves the best prediction with an AUC score of 0.8587.

Our results demonstrate the power of integrating various baseline data for prediction of the conversion from MCI to probable AD. Our results also demonstrate the effectiveness of stability selection for feature selection in the context of sparse logistic regression.

Download full-text


Available from: Victor Lobanov,
30 Reads
  • Source
    • "Reported areas under the ROC curve (AUC) range from .647 to .859. Importantly , both the study that reports the highest accuracy (Devanand, et al., 2008) and the study that reports the highest AUC (Ye, et al., 2012) include both cognitive and biological measures in their classifiers. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Objective We constructed random forest classifiers employing either the traditional method of scoring semantic fluency word lists or new methods. These classifiers were then compared in terms of their ability to diagnose Alzheimer disease (AD) or to prognosticate among individuals along the continuum from cognitively normal (CN) through mild cognitive impairment (MCI) to AD. Method Semantic fluency lists from 44 cognitively normal elderly individuals, 80 MCI patients, and 41 AD patients were transcribed into electronic text files and scored by four methods: traditional raw scores, clustering and switching scores, “generalized” versions of clustering and switching, and a method based on independent components analysis (ICA). Random forest classifiers based on raw scores were compared to “augmented” classifiers that incorporated newer scoring methods. Outcome variables included AD diagnosis at baseline, MCI conversion, increase in Clinical Dementia Rating-Sum of Boxes (CDR-SOB) score, or decrease in Financial Capacity Instrument (FCI) score. ROC curves were constructed for each classifier and the area under the curve (AUC) was calculated. We compared AUC between raw and augmented classifiers using Delong’s test and assessed validity and reliability of the augmented classifier. Results Augmented classifiers outperformed classifiers based on raw scores for the outcome measures AD diagnosis (AUC 0.97 vs. 0.95), MCI conversion (AUC 0.91 vs. 0.77), CDR-SOB increase (AUC 0.90 vs. 0.79), and FCI decrease (AUC 0.89 vs. 0.72). Measures of validity and stability over time support the use of the method. Conclusion Latent information in semantic fluency word lists is useful for predicting cognitive and functional decline among elderly individuals at increased risk for developing AD. Modern machine learning methods may incorporate latent information to enhance the diagnostic value of semantic fluency raw scores. These methods could yield information valuable for patient care and clinical trial design with a relatively small investment of time and money.
    Cortex 06/2014; 55(1). DOI:10.1016/j.cortex.2013.12.013 · 5.13 Impact Factor
  • Source
    • "Neuroimages allow the identification of brain changes and have been used for automated diagnosis of AD and MCI (Silveira and Marques, 2010; Ye et al., 2012). Due to the high variability of the pattern of brain degeneration in AD and MCI, the analysis of brain images is a very difficult task. "
    [Show abstract] [Hide abstract]
    ABSTRACT: Alzheimer's disease is a type of dementia that mainly affects elderly people, with unknown causes and no effective treatment up to date. The diagnosis of this disease in an earlier stage is crucial to improve patients' life quality. Current techniques focus on the analysis of neuroimages, like FDG-PET or MRI, to find changes in the brain activity. While high accuracies can be obtained by combining the analysis of several types of neuroimages, they are expensive and not always available for medical analysis. Achieving similar results using only 3-D FDG-PET scans is therefore of huge importance. While directly applying classifiers to the FDG-PET scan voxel intensities can lead to good prediction accuracies, it results in a problem that suffers from the curse of dimensionality. This paper thus proposes a methodology to identify regions of interest by segmenting 3-D FDG-PET scans and extracting features that represent each of those regions of interest, reducing the dimensionality of the space. Experimental results show that the proposed methodology outperforms the one using voxel intensities despite only a small number of features is needed to achieve that result.
    BIOIMAGING 2014; 01/2014
  • Source
    • "For example, in our SLEP package (Liu et al., 2009), we add an ' 2 -norm regularization to the Lasso formulation. Our experience showed that it generally improved Lasso performance (Ye et al., 2012). "
    [Show abstract] [Hide abstract]
    ABSTRACT: Many methods have been proposed for computer-assisted diagnostic classification. Full tensor information and machine learning with 3D maps derived from brain images may help detect subtle differences or classify subjects into different groups. Here we develop a new approach to apply tensor-based morphometry to parametric surface models for diagnostic classification. We use this approach to identify cortical surface features for use in diagnostic classifiers. First, with holomorphic 1-forms, we compute an efficient and accurate conformal mapping from a multiply connected mesh to the so-called slit domain. Next, the surface parameterization approach provides a natural way to register anatomical surfaces across subjects using a constrained harmonic map. To analyze anatomical differences, we then analyze the full Riemannian surface metric tensors, which retain multivariate information on local surface geometry. As the number of voxels in a 3D image is large, sparse learning is a promising method to select a subset of imaging features and to improve classification accuracy. Focusing on vertices with greatest effect sizes, we train a diagnostic classifier using the surface features selected by an L1-norm based sparse learning method. Stability selection is applied to validate the selected feature sets. We tested the algorithm on MRI-derived cortical surfaces from 42 subjects with genetically confirmed Williams syndrome and 40 age-matched controls, multivariate statistics on the local tensors gave greater effect sizes for detecting group differences relative to other TBM-based statistics including analysis of the Jacobian determinant and the largest eigenvalue of the surface metric. Our method also gave reasonable classification results relative to the Jacobian determinant, the pair of eigenvalues of the Jacobian matrix and volume features. This analysis pipeline may boost the power of morphometry studies, and may assist with image-based classification.
    NeuroImage 02/2013; 74:209–230. DOI:10.1016/j.neuroimage.2013.02.011 · 6.36 Impact Factor
Show more