Nonlinear Kernel-Based Approaches for Predicting Normal Tissue Toxicities
ABSTRACT Since the early demonstration of the curative potential of radiation therapy for tumor sterilization, normal tissue toxicity continues to be dose limiting. Accurate prediction of patientÂ¿s complication risk would allow personalization of treatment planning decisions. Nonlinear kernel methods can provide a robust framework for learning complex interactions between observed toxicities and treatment, anatomical, and patient-related variables. However, proper application of these powerful methods would require better understanding of a high-dimensional feature space that is spanned by all these variables. In this work, we investigate methods for visualization of this high-dimensional space and compare different approaches for extracting discriminant features. Our preliminary results demonstrate that principle component analysis is a valuable tool for visualizing high dimensional data and for determining proper kernel type. In addition, variable selection based on resampling methods within the logistic regression framework seemed to yield improved prediction performance compared to the recursive-feature elimination method.
SourceAvailable from: Joseph O Deasy[Show abstract] [Hide abstract]
ABSTRACT: Radiation-induced lung injury, radiation pneumonitis (RP), is a potentially fatal side-effect of thoracic radiation therapy. In this work, using an ensemble of support vector machines (SVMs), we build a binary RP risk model from clinical and dosimetric parameters. Patient/treatment data is partitioned into balanced subsets to prevent model bias. Forward feature selection, maximizing the area under the curve (AUC) for a cross-validated receiver operating characteristic (ROC) curve, is performed on each subset. Model parameter selection and construction occurs concurrently via alternating SVM and gradient descent steps to minimize estimated generalization error. We show that an ensemble classifier with a mean fusion function, five component SVMs, and limit of five features per classifier exhibits a mean AUC of 0.818—an improvement over previous SVM models of RP risk.Neurocomputing 06/2010; 73(10-12-73):1861-1867. DOI:10.1016/j.neucom.2009.09.023 · 2.01 Impact Factor
[Show abstract] [Hide abstract]
ABSTRACT: Patients undergoing thoracic radiation therapy can de- velop radiation pneumonitis (RP), a potentially fatal inflam- mation of the lungs. Support vector machines (SVMs), a sta- tistical machine learning method, have recently been used to build binary-outcome RP prediction models with promis- ing results. In this work, we (1) introduce a feature-ranking selection step to improve the parsimony of our previous en- semble SVM model (2) show that ensembles of SVMs pro- vide a statistically significant performance improvement in the area under the cross-validated receiver operating curve and (3) apply Platt's tuning to the component SVMs to gen- erate probability estimates in order to augment clinical rel- evance.International Conference on Machine Learning and Applications, ICMLA 2009, Miami Beach, Florida, USA, December 13-15, 2009; 01/2009
[Show abstract] [Hide abstract]
ABSTRACT: Purpose: Radiation pneumonitis (RP) is a potentially fatal side effect arising in lung cancer patients who receive radiotherapy as part of their treatment. For the modeling of RP outcomes data, several predictive models based on traditional statistical methods and machine learning techniques have been reported. However, no guidance to variation in performance has been provided to date. Materials and methods: In this study, we explore several machine learning algorithms for classification of RP data. The performance of these classification algorithms is investigated in conjunction with several feature selection strategies and the impact of the feature selection strategy on performance is further evaluated. The extracted features include patient's demographic, clinical and pathological variables, treatment techniques, and dose-volume metrics. In conjunction, we have been developing an in-house Matlab-based open source software tool, called dose-response explorer system (DREES), customized for modeling and exploring dose response in radiation oncology. This software has been upgraded with a popular classification algorithm called support vector machine (SVM), which seems to provide improved performance in our exploration analysis and has strong potential to strengthen the ability of radiotherapy modelers in analyzing radiotherapy outcomes data. These tools are demonstrated on an institutional non-small cell lung carcinoma (NSCLC) dataset of patients who received radiotherapy. Results: Our methods were applied to an NSCLC dataset that consists of 209 patients' information, each having 160 variables. Using several feature selection methods, relevant features were searched. Subsequently, with the selected features, various classification algorithms were tested. Through these experiments, we showed the usefulness of machine learning methods in the analysis of radiation oncology dataset. Conclusions: We have presented an open-source software tool and several machine learning algorithms for analyzing radiotherapy outcomes. We demonstrated the tool on a lung cancer patient dataset. We believe that the improved tool will provide radiation oncology modelers with new means to analyze radiation response data.