A Comparative Study of Microarray Data Classification Methods Based on Ensemble Biological Relevant Gene Sets

DOI: 10.1007/978-3-642-13214-8_4
Source: dx.doi.org


In this work we study the utilization of several ensemble alternatives for the task of classifying microarray data by using
prior knowledge known to be biologically relevant to the target disease. The purpose of the work is to obtain an accurate
ensemble classification model able to outperform baseline classifiers by introducing diversity in the form of different gene
sets. The proposed model takes advantage of WhichGenes, a powerful gene set building tool that allows the automatic extraction
of lists of genes from multiple sparse data sources. Preliminary results using different datasets and several gene sets show
that the proposal is able to outperform basic classifiers by using existing prior knowledge.

Keywordsmicroarray data classification-ensemble classifiers-gene sets-prior knowledge

Download full-text


Available from: Miguel Reboiro-Jato, Apr 07, 2014