Question
Asked 11th Mar, 2016

What is the difference between feature extraction and classification?

where does feature extraction and classification apply?

Most recent answer

11th Mar, 2016
Ebenezer R.H.P. Isaac
Ericsson
As Sachin mentioned, feature selection is part of preprocessing. Be wary of the difference between feature extraction and selection. Feature extraction is the process in which the each given sample is broken down to quantifiable (and sometimes qualitative) attributes. Feature selection is the process of choosing a subset of those extracted attributes for a domain specific application.
Both the above tasks are to be applied on the whole data set. Although technically, the analysis that forms the basis for feature selection is done only on the training set but the resulting outcome is applied to all training, validation and testing sets. 
2 Recommendations

Popular answers (1)

11th Mar, 2016
Ebenezer R.H.P. Isaac
Ericsson
Let's be clear what a feature is. Given a set of samples on which you want a make a decision on, a feature is something that would possibly be different among those samples, i.e, discriminate one sample from another.  This decision may be to classify the sample to a finite set, group the samples based on similarity (clustering) or predict a real value based on those features (regression).
Feature extraction is the process of collecting discriminative information from a set of samples. Feature classification is the grouping of features based on some criteria. Sometimes feature classification might also be related to feature selection which is to select a subset of the extracted features that would optimise the machine learning algorithm and possible reduce noise removing unrelated features.
To get further information regarding this, you may refer the following paper.
5 Recommendations

All Answers (5)

11th Mar, 2016
Ebenezer R.H.P. Isaac
Ericsson
Let's be clear what a feature is. Given a set of samples on which you want a make a decision on, a feature is something that would possibly be different among those samples, i.e, discriminate one sample from another.  This decision may be to classify the sample to a finite set, group the samples based on similarity (clustering) or predict a real value based on those features (regression).
Feature extraction is the process of collecting discriminative information from a set of samples. Feature classification is the grouping of features based on some criteria. Sometimes feature classification might also be related to feature selection which is to select a subset of the extracted features that would optimise the machine learning algorithm and possible reduce noise removing unrelated features.
To get further information regarding this, you may refer the following paper.
5 Recommendations
11th Mar, 2016
Archana J S
Amrita Vishwa Vidyapeetham
Thanks for the response Ebenezer R.H.P. Isaac
I would like to ask one more question on this.
Is Feature extraction on different samples of data set or on the entire data set ? If so, what difference does it make?
Feature selection is the data preprocessing task. In feature selection, we try to identify relevant attributes of the data which can be useful for the analysis and removing those attributes which are not so relevant for the analysis.
Classification is a data mining task, which can be applied to the data sets with class labels already known for the target attribute or dependent attribute (after preprocessing). If you don not know the class label information, then go for clustering.
All the Best
2 Recommendations
11th Mar, 2016
Archana J S
Amrita Vishwa Vidyapeetham
Thank you Sir...
11th Mar, 2016
Ebenezer R.H.P. Isaac
Ericsson
As Sachin mentioned, feature selection is part of preprocessing. Be wary of the difference between feature extraction and selection. Feature extraction is the process in which the each given sample is broken down to quantifiable (and sometimes qualitative) attributes. Feature selection is the process of choosing a subset of those extracted attributes for a domain specific application.
Both the above tasks are to be applied on the whole data set. Although technically, the analysis that forms the basis for feature selection is done only on the training set but the resulting outcome is applied to all training, validation and testing sets. 
2 Recommendations

Similar questions and discussions

How to select number of features selected from DWT decomposition ?
Question
7 answers
  • Wassim DiaiWassim Diai
Dear community , currently working on emotions recognition , as a first step I'm trying to extract features , I was checking some recources , I found that they used the SEED dataset , it contains EEG signals of 15 subjects that were recorded while the subjects were watching emotional film clips. Each subject is asked to carry out the experiments in 3 sessions. There are 45 experiments in this dataset in total. Different film clips (positive, neutral, and negative emotions) were chosen to receive highest match across participants. The length of each film clip is about 4 minutes. The EEG signals of each subject were recorded as separate files containing the name of the subjects and the date. These files contain a preprocessed, down-sampled, and segmented version of the EEG data. The data was down-sampled to 200 Hz. A bandpass frequency filter from 0–75 Hz was used. The EEG segments associated with every movie were extracted. There are a total of 45 .mat files, one for each experiment. Every person carried out the experiment three times within a week. Every subject file includes 16 arrays; 15 arrays include preprocessed and segmented EEG data of 15 trials in one experiment. An array named LABELS contains the label of the corresponding emotion- al labels (−1 for negative, 0 for neutral, and +1 for positive). I found that they loaded each dataset separately (negative , neutral , positive) , and they fixed length of signal at 4096 and number of signal for each class at 100 , and fixed number of features extracted from Wavelet packet decomposition at 83 , my question is why they selected 83 , 4096 and 100 exactly ?
I know that my question is a bit long but I tried to explain clearly the situation , I appreciate your help thank you .

Related Publications

Got a technical question?
Get high-quality answers from experts.