Article

NEUROSVM: an architecture to reduce the effect of the choice of kernel on the performance of svm

Journal of Machine Learning Research (impact factor: 2.56). 01/2009; 10:591-622. pp.591-622

ABSTRACT In this paper we propose a new multilayer classifier architecture. The proposed hybrid architecture has two cascaded modules: feature extraction module and classification module. In the feature extraction module we use the multilayered perceptron (MLP) neural networks, although other tools such as radial basis function (RBF) networks can be used. In the classification module we use sup-port vector machines (SVMs)—here also other tool such as MLP or RBF can be used. The feature extraction module has several sub-modules each of which is expected to extract features capturing the discriminating characteristics of different areas of the input space. The classification module classifies the data based on the extracted features. The resultant architecture with MLP in feature extraction module and SVM in classification module is called NEUROSVM. The NEUROSVM is tested on twelve benchmark data sets and the performance of the NEUROSVM is found to be better than both MLP and SVM. We also compare the performance of proposed architecture with that of two ensemble methods: majority voting and averaging. Here also the NEUROSVM is found to perform better than these two ensemble methods. Further we explore the use of MLP and RBF in the classification module of the proposed architecture. The most attractive feature of NEUROSVM is that it practically eliminates the severe dependency of SVM on the choice of kernel. This has been verified with respect to both linear and non-linear kernels. We have also demonstrated that for the feature extraction module, the full training of MLPs is not needed.

0 0
 · 
0 Bookmarks
 · 
43 Views

Full-text (2 Sources)

View
5 Downloads
Available from
23 Jan 2013

Keywords

attractive feature
 
benchmark data sets
 
classification module
 
classification module classifies
 
different areas
 
ensemble methods
 
extracted features
 
feature extraction module
 
full training
 
input space
 
MLPs
 
multilayered perceptron
 
NEUROSVM
 
new multilayer classifier architecture
 
non-linear kernels
 
proposed architecture
 
proposed hybrid architecture
 
radial basis function
 
resultant architecture
 
two ensemble methods
 

Pradip Ghanty