Optimized approach to decision fusion of heterogeneous data for breast cancer diagnosis.

Department of Biomedical Engineering, Duke University, Durham, North Carolina 27705, USA.
Medical Physics (Impact Factor: 2.91). 08/2006; 33(8):2945-54. DOI: 10.1118/1.2208934
Source: PubMed

ABSTRACT As more diagnostic testing options become available to physicians, it becomes more difficult to combine various types of medical information together in order to optimize the overall diagnosis. To improve diagnostic performance, here we introduce an approach to optimize a decision-fusion technique to combine heterogeneous information, such as from different modalities, feature categories, or institutions. For classifier comparison we used two performance metrics: The receiving operator characteristic (ROC) area under the curve [area under the ROC curve (AUC)] and the normalized partial area under the curve (pAUC). This study used four classifiers: Linear discriminant analysis (LDA), artificial neural network (ANN), and two variants of our decision-fusion technique, AUC-optimized (DF-A) and pAUC-optimized (DF-P) decision fusion. We applied each of these classifiers with 100-fold cross-validation to two heterogeneous breast cancer data sets: One of mass lesion features and a much more challenging one of microcalcification lesion features. For the calcification data set, DF-A outperformed the other classifiers in terms of AUC (p < 0.02) and achieved AUC=0.85 +/- 0.01. The DF-P surpassed the other classifiers in terms of pAUC (p < 0.01) and reached pAUC=0.38 +/- 0.02. For the mass data set, DF-A outperformed both the ANN and the LDA (p < 0.04) and achieved AUC=0.94 +/- 0.01. Although for this data set there were no statistically significant differences among the classifiers' pAUC values (pAUC=0.57 +/- 0.07 to 0.67 +/- 0.05, p > 0.10), the DF-P did significantly improve specificity versus the LDA at both 98% and 100% sensitivity (p < 0.04). In conclusion, decision fusion directly optimized clinically significant performance measures, such as AUC and pAUC, and sometimes outperformed two well-known machine-learning techniques when applied to two different breast cancer data sets.

  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: In this work the effects of simple imputations are studied, regarding the integration of multimodal data originating from different patients. Two separate datasets of cutaneous melanoma are used, an image analysis (dermoscopy) dataset together with a transcriptomic one, specifically DNA microarrays. Each modality is related to a different set of patients, and four imputation methods are employed to the formation of a unified, integrative dataset. The application of backward selection together with ensemble classifiers (random forests), followed by principal components analysis and linear discriminant analysis, illustrates the implication of the imputations on feature selection and dimensionality reduction methods. The results suggest that the expansion of the feature space through the data integration, achieved by the exploitation of imputation schemes in general, aids the classification task, imparting stability as regards the derivation of putative classifiers. In particular, although the biased imputation methods increase significantly the predictive performance and the class discrimination of the datasets, they still contribute to the study of prominent features and their relations. The fusion of separate datasets, which provide a multimodal description of the same pathology, represents an innovative, promising avenue, enhancing robust composite biomarker derivation and promoting the interpretation of the biomedical problem studied.
    BioMed research international. 01/2014; 2014:145243.
  • Source
    International Journal of Multimedia and Ubiquitous Engineering. 01/2013; 8(4):45-58.
  • [Show abstract] [Hide abstract]
    ABSTRACT: Mammography is the most effective screening tool for early diagnosis of breast cancer. Based on the mammography findings, radiologists need to choose from one of the following three alternatives: 1) take immediate diagnostic actions including prompt biopsy to confirm breast cancer; 2) recommend a follow-up mammogram; 3) recommend routine annual mammography. There are no validated structured guidelines based on a decision-analytical framework to aid radiologists in making such patient management decisions. Surprisingly, only 15-45% of the breast biopsies and less than 1% of short-interval follow-up recommendations are found to be malignant, resulting in unnecessary tests and patient-anxiety. We develop a finite-horizon discrete-time Markov decision process (MDP) model that may help radiologists make patient-management decisions to maximize a patient's total expected quality-adjusted life years. We use clinical data to find the policies recommended by the MDP model and also compare them to decisions made by radiologists at a large mammography practice. We also derive the structural properties of the MDP model, including sufficiency conditions that ensure the existence of a double control-limit type policy.
    Decision Analysis 09/2013; 10(3):200-224. · 2.14 Impact Factor


1 Download
Available from