Conference Paper

Deep Learning for COPD Analysis Using Lung Sounds

Abstract

In this study, Hilbert-Huang Transform (HHT) was applied to the lung sounds from RespiratoryDatabase@TR and the statistical features were calculated from the different modulations of the HHT. The statistical features were fed into the DBN to classify the lung sounds from Chronic Obstructive Pulmonary Disease (COPD) and healthy subjects.
DEEP LEARNING FOR COPD ANALYSIS USING LUNG SOUNDS
GOKHAN ALTAN1, YAKUP KUTLU2, NOVRUZ ALLAHVERDI3
1,2Iskenderun Technical University, Computer Engineering Dept., Iskenderun, Hatay, Turkey
3KTO Karatay University, Computer Engineering Dept., Karatay, Konya, Turkey
e-mail: gokhan altan@hotmail.com, yakup.kutlu@iste.edu.tr, novruz.allahverdi@karatay.edu.tr
Keywords: Extreme Learning Machines, ELM, Hessenberg Decomposition, ELM Autoencoder,
RespiratoryDatabase@TR, COPD.
1. Introduction
Deep Learning (DL) algorithms have become popular in recent years owing to the detailed
analysis capabilities that many hidden layers provide. The analysis capability of a classifier
model is closely related to the number of its hidden layers and the number of neurons in them:
deeper and wider models assess different representations of the input, but at the cost of
longer training time [2]. Reducing the training time of DL algorithms is therefore a main focus
of recent research. Although a DL model is a neural network structure with many hidden layers,
the two differ in the back-propagation procedures performed during training and in how the
classification parameters (weights and biases) are defined in pre-training. Whereas the input
weights and the hidden node parameters are randomly initialized in a conventional neural
network model, DL algorithms pre-define the weights and biases using unsupervised learning
models, including Restricted Boltzmann Machines (RBM) [9], sparse autoencoders [8], and
Extreme Learning Machines (ELM) autoencoders [6].
DL algorithms differ in their unsupervised learning phase and in the feature learning models
used during training. The Deep Belief Network (DBN) is frequently used because fast, greedy
layer-wise pre-training of its layers helps it approach the global minimum and reach high
classification performance [8]. The DBN classifies in two stages. In the first stage, the input
weights and the hidden layer biases are defined using RBMs; in the second stage, the
pre-defined parameters are unfolded into a neural network model with the same structure and
optimized [9, 10, 8]. The main advantage of the DBN is its fast training: pre-defining the
parameters before optimization allows the optimization to approach the global minimum in a
small number of iterations [10].
The DBN has been applied to classify various biomedical signals, including asthma diagnosis
models [7], diagnosis of coronary artery disease [2], arrhythmia classification [3], and brain
activity detection [4]. In this study, the Hilbert-Huang Transform (HHT) was applied to the lung
sounds from RespiratoryDatabase@TR, and statistical features were calculated from the
different modulations of the HHT. The statistical features were fed into the DBN to classify the
lung sounds of subjects with Chronic Obstructive Pulmonary Disease (COPD) and healthy subjects.
2. Materials and Method
2.1. Classifier. The DBN algorithm uses RBMs to pre-define the classification parameters in an
unsupervised way, which addresses the long training time of deep models with multiple hidden
layers. The DBN performs layer-by-layer, top-down directed learning and defines generative
weights. The generative weights represent the relationship between adjacent layers, that is,
how the parameters in a layer depend on the parameters in the adjacent layer above. The upper
layers of the DBN represent more abstract features, whereas the lower layers learn simple
features. Each RBM in the DBN model generates a different representation of the input data [10].
The energy function (1) and the probability function (2) of the DBN model are:
$$E(v, h) = \sum_{i} f_i(v, h) = -b^{\top} v - c^{\top} h - v^{\top} W h \tag{1}$$
$$P(v, h) = \frac{1}{Z} e^{-E(v, h)} \tag{2}$$
v is the input layer vector and h is the hidden layer vector. b and c represent the biases of
the DBN model for the visible and hidden layers, respectively. Z is the normalization constant
of the RBM distribution, and W is the weight matrix for the pre-training phase of the model.
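As an illustration of equations (1) and (2), the following minimal sketch evaluates the energy and the joint probability of a very small binary RBM. It is not the authors' implementation: the function names, the binary-unit assumption, and the exhaustive summation used for Z (only feasible for a handful of units) are assumptions made for the example.

```python
import itertools

import numpy as np


def rbm_energy(v, h, b, c, W):
    """Energy of one joint configuration: E(v, h) = -b^T v - c^T h - v^T W h."""
    return -(b @ v) - (c @ h) - (v @ W @ h)


def rbm_joint_probability(v, h, b, c, W):
    """P(v, h) = exp(-E(v, h)) / Z, with Z summed over every binary state.

    The exhaustive sum has 2**(n_v + n_h) terms, so this only works for toy models.
    """
    n_v, n_h = W.shape
    states_v = [np.array(s, dtype=float) for s in itertools.product([0, 1], repeat=n_v)]
    states_h = [np.array(s, dtype=float) for s in itertools.product([0, 1], repeat=n_h)]
    Z = sum(np.exp(-rbm_energy(sv, sh, b, c, W)) for sv in states_v for sh in states_h)
    return np.exp(-rbm_energy(v, h, b, c, W)) / Z
```

With all parameters set to zero, every configuration has zero energy, so each of the 2^(n_v + n_h) states receives the same probability. In practice Z is intractable for realistic layer sizes, which is why RBM training relies on approximations rather than on this direct evaluation.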
2.2. Database. RespiratoryDatabase@TR is a multimedia respiratory database that contains
12-channel lung sounds, chest X-rays, 4-channel heart sounds, and spirometry metrics from
subjects with COPD and healthy subjects [5]. It offers wide potential for analysing COPD and
asthma using computerized signal analysis and machine learning approaches. 12-channel lung
sounds of 15 s duration from 30 subjects (15 COPD + 15 healthy) were utilized in the analysis.
2.3. Hilbert-Huang Transform. The HHT is an adaptive and efficient transformation method for
handling non-linear and non-stationary signals. The HHT enables extracting the
time-frequency-energy characteristics of the signal [11].
The HHT is a two-step transformation consisting of Empirical Mode Decomposition (EMD)
followed by the Hilbert Transform (HT). The EMD extracts Intrinsic Mode Functions (IMFs),
which are the orthogonal basis frequency modulations of the signal, without leaving the time
domain. The formulation of the EMD process is:
$$X(t) = \sum_{j=1}^{n} \mathrm{IMF}_j + r_n \tag{3}$$
r_n is the residual signal, X represents the input signal, and n is the number of sifted
IMFs. The HT is applied to the sifted IMF modulations to compute the instantaneous frequency
characteristics [1, 11]. The analytical function of the HT for a signal x(t) is formulated as follows:
$$x(t) = \Re\left( \sum_{i=1}^{n} a_i(t)\, e^{\, j \int \omega_i(t)\, dt} \right) \tag{4}$$
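The Hilbert step can be illustrated with a short sketch that builds the analytic signal of a single mode with the FFT and reads off its instantaneous amplitude and frequency, the quantities that appear in equation (4). This is a sketch under stated assumptions, not the authors' code: the function names, the sampling rate `fs`, and the synthetic 50 Hz tone standing in for an IMF are all hypothetical, and the EMD sifting step that would produce real IMFs is not shown.

```python
import numpy as np


def analytic_signal(x):
    """Analytic signal via the FFT: keep DC, double positive bins, zero negative bins."""
    n = len(x)
    spectrum = np.fft.fft(x)
    gain = np.zeros(n)
    gain[0] = 1.0
    if n % 2 == 0:
        gain[n // 2] = 1.0          # Nyquist bin is kept once for even lengths
        gain[1:n // 2] = 2.0
    else:
        gain[1:(n + 1) // 2] = 2.0
    return np.fft.ifft(spectrum * gain)


def instantaneous_amplitude_frequency(imf, fs):
    """Return a(t) and the instantaneous frequency in Hz for one mode."""
    z = analytic_signal(imf)
    amplitude = np.abs(z)
    phase = np.unwrap(np.angle(z))
    # The derivative of the phase, scaled by fs / (2*pi), gives frequency in Hz.
    frequency = np.diff(phase) * fs / (2.0 * np.pi)
    return amplitude, frequency


fs = 1000.0                               # assumed sampling rate (Hz), not from the paper
t = np.arange(0.0, 1.0, 1.0 / fs)
imf = np.cos(2.0 * np.pi * 50.0 * t)      # synthetic stand-in for a sifted IMF
amplitude, frequency = instantaneous_amplitude_frequency(imf, fs)
```

For this stationary test tone the amplitude stays at 1 and the frequency at 50 Hz; for a real IMF both vary over time, which is what makes the resulting time-frequency-energy description useful for non-stationary lung sounds.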
3. Experimental Results
The HHT was applied to the 12-channel lung sounds. HHT-based statistical features, including
standard deviation, mean, median, maximum, minimum, variance, mode, correlation coefficient,
kurtosis, moment, cumulant, and energy, were calculated for each IMF to form the dataset. The
dataset was fed into a DBN model with 2 hidden layers (340 and 580 neurons). The DBN was
trained for 50 epochs. The learning rate was set to 2, and the sigmoid activation function was
used as the output function of the DBN model. Accuracy, sensitivity, and specificity were
calculated from the contingency table of the classification to estimate the differences in the
distribution of lung sounds from COPD patients and healthy subjects, using 6-fold cross
validation. The achieved results are presented in Table 1 for the IMF-based feature sets and
the entire feature set.

Table 1. The classification performances (%) of the DBN

Feature set   Accuracy   Sensitivity   Specificity
IMF1          33.61      28.89         38.33
IMF2          62.78      66.67         58.89
IMF3          50.83      53.33         48.33
IMF4          47.50      54.44         40.56
IMF5          38.06      36.67         39.44
All           70.28      67.22         73.33
SFFS          90.83      94.44         87.22
The proposed DBN model has achieved high results for each IMF feature set and for the entire
feature set, except IMF1. IMF1 is the first modulation and still contains noise, which is why
it is the least discriminative feature set for the classifier. Using the entire feature set,
the DBN has separated the lung sounds of COPD and healthy subjects with classification
performance rates of 70.28%, 67.22%, and 73.33% for accuracy, sensitivity, and specificity,
respectively. The sequential forward feature selection (SFFS) algorithm was performed on the
Deep ELM classifier model and increased the classification performance rates to 90.83%,
94.44%, and 87.22% for accuracy, sensitivity, and specificity, respectively.
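The three reported metrics follow directly from the counts of a two-class contingency table. The sketch below shows the arithmetic; the function name and the example counts are hypothetical. The counts assume that each of the 12 channels of each subject is treated as one sample (180 COPD and 180 healthy samples), which the paper does not state; they were chosen only because they reproduce the percentages of the SFFS row.

```python
def classification_metrics(tp, fn, tn, fp):
    """Accuracy, sensitivity, and specificity (in %) from contingency-table counts.

    tp/fn: COPD samples classified correctly/incorrectly.
    tn/fp: healthy samples classified correctly/incorrectly.
    """
    total = tp + fn + tn + fp
    return {
        "accuracy": 100.0 * (tp + tn) / total,
        "sensitivity": 100.0 * tp / (tp + fn),
        "specificity": 100.0 * tn / (tn + fp),
    }


# Hypothetical counts: 180 samples per class is an assumption, not a reported figure.
metrics = classification_metrics(tp=170, fn=10, tn=157, fp=23)
```

Under 6-fold cross validation, the counts would be accumulated over the six held-out folds before the ratios are taken.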
4. Acknowledgements
This study is supported by the Scientific and Technological Research Council of Turkey
(TUBITAK-116E190). The authors express their thanks to TUBITAK for providing full support.
References
[1] Altan G., Kutlu Y., Allahverdi N., A new approach to early diagnosis of congestive heart failure disease by using Hilbert Huang transform, Computer Methods and Programs in Biomedicine, Vol.137, December 2016, p.23-34, ISSN 0169-2607, http://dx.doi.org/10.1016/j.cmpb.2016.09.003
[2] Altan G., Allahverdi N., Kutlu Y., Diagnosis of Coronary Artery Disease Using Deep Belief Networks, European Journal of Engineering and Natural Sciences, 2017, Vol.2, Issue 1, p.29-36, ISSN 2458-8156
[3] Altan G., Kutlu Y., Allahverdi N., A Multistage Deep Belief Networks Application on Arrhythmia Classification, International Journal of Intelligent Systems and Applications in Engineering, Vol.4, Special Issue 1, December 2016, p.23-34, ISSN 2147-6799, DOI: 10.18201/ijisae.270367
[4] Altan G., Kutlu Y., Allahverdi N., Deep Belief Networks Based Brain Activity Classification Using EEG from Slow Cortical Potentials in Stroke, International Journal of Applied Mathematics, Electronics and Computers, Vol.4, Issue 1, 2016, p.205-210, http://dx.doi.org/10.18100/ijamec.270307
[5] Altan G., Kutlu Y., Garbi Y., Pekmezci A.O., Nural S., Multimedia Respiratory Database (RespiratoryDatabase@TR): Auscultation Sounds And Chest X-rays, Natural and Engineering Sciences, 2017, Vol.2, Issue 3, p.59-72, ISSN 2458-8989, DOI: 10.28978/nesciences.349282
[6] Altan G., Kutlu Y., Pekmezci A.O., Yayik A., Diagnosis of Chronic Obstructive Pulmonary Disease using Deep Extreme Learning Machines with LU Autoencoder Kernel, 7th International Conference on Advanced Technologies, 2018, (Accepted)
[7] Altan G., Kutlu Y., Pekmezci A.O., Nural S., Asthma Analysis using Deep Learning, Full Text, 7th International Conference on Advanced Technologies (ICAT18), (Accepted)
[8] Bengio Y., Lamblin P., Popovici D., Larochelle H., Greedy layer-wise training of deep networks, NIPS'06 Proceedings of the 19th International Conference on Neural Information Processing Systems, 2006, p.153-160
[9] Hinton G.E., Osindero S., Teh Y.W., A fast learning algorithm for deep belief nets, Neural Computation, 2006, Vol.18, Issue 7, p.1527-1554, http://dx.doi.org/10.1162/neco.2006.18.7.1527
[10] Hinton G., Deep belief networks, Scholarpedia, 2009, Vol.4, Issue 5, p.5947, DOI: 10.4249/scholarpedia.5947
[11] Huang N.E., Shen Z., Long S.R. et al., The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis, Proceedings of the Royal Society of London, 1998, A(454), p.903-995