Conference PaperPDF Available

Blind separation of skewed signals in instantaneous mixtures

Authors:

Abstract and Figures

The problem of source separation of instantaneous mixtures has been addressed thoroughly in literature in the past. The assumption of statistical independence between the source signals, led to the introduction of independent component analysis (ICA). A number of methods, based on the ICA framework, can identify nonGaussian sources in instantaneous mixtures with robust convergence and performance. However, in several biomedical applications, there is a need to identify and separate signals that, apart from being nonGaussian, are not symmetric. In this article, the authors present a method for blind identification and separation of skewed (non-symmetric) signals in a linear instantaneous mixture.
Content may be subject to copyright.
Blind separation of skewed signals in instantaneous
mixtures
Nikolaos Mitianoudis, Tania Stathaki
Communications and Signal Processing Group
Imperial College
Exhibition Road, SW7 2AZ London, UK
Email: n.mitianoudis@imperial.ac.uk
Mike Davies
Centre for Digital Music
Queen Mary London
Mile End Road, E1 4NS, London, UK
Abstract The problem of source separation of instantaneous
mixtures has been addressed thoroughly in literature in the
past. The assumption of statistical independence between the
source signals, led to the introduction of Independent Component
Analysis (ICA). A number of methods, based on the ICA
framework, can identify nonGaussian sources in instantaneous
mixtures with robust convergence and performance. However,
in several biomedical applications, there is a need to identify
and separate signals that, apart from being nonGaussian, are
not symmetric. In this article, the authors present a method for
blind identification and separation of skewed (non-symmetric)
signals in a linear instantaneous mixture.
I. INTRODUCTION
Assume a set of Msensors monitoring a phenomenon
via the signals x(n) = [x1(n), x2(n), . . . , xM(n)]T. Let us
also assume that there are a number of underlying Nfactors
(sources) s(n)=[s1(n), s2(n), . . . , sN(n)]Tthat trigger the
phenomenon, observed by the sensors. We will assume that
the contribution of each factor is transmitted with insignificant
delay to the observing sensors, i.e. instantaneously. In addition,
possible corruption by additive noise is considered insignifi-
cant. The following model connects the observed signals with
the source signals via instantaneous mixing.
x=As (1)
where Ais a mixing matrix denoting instantaneous transmis-
sion. In this study, we will assume equal number of sources
and sensors (N=M). Although this linear instantaneous
model may seem unrealistic, there are a lot of real-life ap-
plications, where it can serve as a very good approximation.
In biomedical signal processing, there are a number of mon-
itoring signals that are considered instantaneous mixtures of
input sources, such as the electrocardiograph (ECG) and the
electroencephalogram (EEG) signals. The general non-linear
source separation problem is a more demanding problem,
which usually can not be addressed by the methods proposed
for linear mixtures.
A number of blind source separation approaches have
been proposed in the past [4]. Introducing the assumption
of statistical independence between the source signals, led to
the development of Independent Component Analysis (ICA).
Using this framework, one can separate nonGaussian sources
(in fact only one is allowed to be Gaussian) with a number of
different methodologies. Some approaches perform separation
by minimising the Kullback-Leibler (KL) divergence between
the separated sources and several probabilistic priors on the
source signals. Other approaches minimise the mutual infor-
mation conveyed by the separated sources or perform approx-
imate diagonalisation of a cumulant tensor of the mixtures.
Finally, some methods perform separation by estimating the
directions of the most nonGaussian components using kurtosis
or negentropy, as nonGaussianity measures. For more on these
techniques, one can refer to tutorial books on ICA, such as [1],
[4].
However, in some applications it is necessary to identify
signals with other statistical properties, apart from nonGaus-
sianity. One of these characteristics might be the symmetry
of the distribution. Several sources of interest in biomedical
signals are skewed. In [9], Stetson used ICA and skewness
as a criterion to identify the arterial pulse signal from noisy
mixtures. In [8], Sanei and Shoker proposed the use of
ICA and support-vector machines together with a number of
features (among which was skewness) to identify the eye-
blinking artifact in EEG signals. Consequently, this asymmetry
can be used as a tool to identify certain signals in biomedical
applications.
In this paper, we derive a FastICA [5] type of algorithm
for skewness explicitly. In a similar manner to optimising
kurtosis [5], one can optimise third-order moments, like
skewness, to separate non-symmetrical signals. The proposed
algorithm shows promising results on artificial data in terms
of convergence and separation quality. We also present some
preliminary results on Ventricular Activity separation from
ECG data and on eye-blinking identification in EEG data.
II. SOU RCE S EPARATI ON OF S KEW ED SI GNALS
A. Definition of skewness
In statistics, skewness is a measure of symmetry, or more
precisely, the lack of symmetry. A data set is symmetric if it
looks the same to the left and right of the center point (sample
mean). Skewness can also be considered a third-order moment.
Assuming that uis a random variable of non-zero mean, then
skewness can be defined as follows:
skew(u) = E{(u E{u})3}
E{(u E{u})2}3/2(2)
where E{·} represents the expectation operator. Skewness
takes positive values when the signal is asymmetrical to the
right and negative values, when it is skewed to the left. As
we are not interested in the sign of skewness, it would be
appropriate to optimise the absolute value of skewness. As
some of the input signals will be skewed, the mean should
not be considered negligible and should be taken into account
in our analysis.
B. Principal Component Analysis
The first task will be to “prewhiten” the data. A Principal
Component Analysis (PCA) step will orthogonalise (decor-
relate) and normalise the data to unit variance [4]. The
prewhitening matrix Vis formed by the eigenvectors of
the covariance matrix C=E{(x E {x})(x E {x})T}.
Assuming that His a matrix containing all the eigenvectors
of Cand Da diagonal matrix containing the eigenvalues
of C. The eigenvalue at the i-th diagonal element should
correspond to the eigenvector at the i-th column of H. Then,
the prewhitening matrix Vand the “whitened” data zare given
by the following equations:
V=D0.5HT(3)
z=V x (4)
where E{(z E{z})(z E{z})T}=I. As it is shown in
source separation literature [4], decorrelation is not a sufficient
condition to separate independent signals, as it will only
decorrelate (orthogonalise) the data. Therefore, in order to
isolate Lskewed components, we need to estimate Lpro-
jection operators withat will isolate the skewed components
ui, existing in the mixture (LN).
ui=wT
izi= 1, . . . , L (5)
C. Separation optimising skewness
In this section, we seek to identify a single skewed compo-
nent uin the mixture with the help of skewness. In order
to estimate u, we need to optimise the absolute value of
skewness to cater for both directions of skew. In this effort,
we will optimise the nominator only. Restricting the projection
operator wto perform rotation only, we have to impose the
constraint that ||w||2= 1, where || · || represents the L2-norm.
In this case, we have
E{(u E{u})2}=wTE{(z E {z})(z E {z})T}w
=wTIw =||w||2= 1 (6)
Consequently, the denominator in the definition of skewness
remains 1, as long as we prewhiten the input data and keep the
projection vector wnormalised to unit variance. As a result,
we need to optimise the following cost function:
J(w) = |G(w)|=¯
¯E{(u E{u})3}¯
¯(7)
Expanding J(w), we get the following simplified expression
J(w) = |E{u3} 3E{u2}E{u}+ 2E {u}3|(8)
The optimisation problem to be solved is set as follows:
max
wJ(w)(9)
subject to ||w||2= 1 (10)
The first step is to estimate the derivative of ∂J/∂w.
∂J
∂w =sgn (G(w))
·(3E{z(wTz)2} 6E{z(wTz)}E{wTz}
3E{(wTz)2}E{z}+ 2 ·3E{(wTz)}2E {z})(11)
∂J
∂w =sgn(G(w))[3(E {z(wTz)2} E {(wTz)2}E{z})
6(E{z(wTz)}E{wTz} E{wTz}2E {z})] (12)
One can perform gradient ascent optimisation to find the
optima of the cost function, as shown below.
w+w+η∂J (w)
∂w (13)
w+w+/||w+|| (14)
The new estimate for the projection operator w+is estimated
in terms of the previous estimate wand the derivative of the
cost function, weighted by the learning rate η. The second
step normalises the unmixing vector to unit L2-norm. These
two steps are repeated until convergence, i.e. |wTw+| 1.
However, gradient optimisation methods suffer from conver-
gence speed problems, as their convergence is controlled by
the learning rate η. In practice, a bad choice for the learning
rate ηcan inhibit or delay the convergence of the optimisation
considerably.
However, one can form a “fixed-point” rule to accelerate
the convergence of the algorithm [5]. At a stable point of
the algorithm, the gradient must point to the direction of
w. This implies that the gradient should be equal to w
multiplied by some scalar constant. Only in this case adding
wto the gradient is not changing its direction and we have
convergence. In addition, the arbitrary scaling (or sign) is
effectively removed by normalising to unit norm. Hence, the
“fixed-point” can be found at those points where the new
estimate is equal to the gradient.
w+∂J (w)
∂w (15)
To reduce the computational cost, one can remove the sgn(·)
expression from ∂J/∂w [5]. Effectively, the correct sign
identifies signals skewed to the left or to the right. However,
this information might as well be lost in the scale ambiguity of
the linear instantaneous model. As a result, there is no practical
need to maintain this costly term in the update algorithm.
Consequently, the following update algorithm is proposed:
w+ E{z(wTz)2} E{(wTz)2}E{z}
2E{wTz}(E{z(wTz)} E{wTz}E {z})(16)
w+w+/||w+|| (17)
The algorithm proposed above, extracts only one compo-
nent, which corresponds to the local optimum that is closest
to the random initial guess for w. The same rule can be
used to get other skewed components that exist in the linear
mixture. The above rule is randomly re-initialised to trace
other skewed components. However, the algorithm should not
converge to the same component. As all solutions lie in a
orthogonal structure, due to prewhitening, we must always
search for a solution in planes that are orthogonal to the planes
defined by the estimated components. In other words, the
new components should always be orthogonal to the already
estimated components. Hence, for the ith component, the
update for w+
ishould always be orthogonal to the space
spanned by the vectors w1, w2, . . . , wi1[5].
w+
iw+
iBBTw+
i(18)
where B= [w1w2. . . wi1].
If we are not interested in preserving the mean of the
original signals (which will be also affected by the scale am-
biguity), we can simplify the proposed update rule and reduce
the computational cost of the algorithm. We can normalise
the input data to zero mean, using the following prewhitening
step.
z=V(x E{x})(19)
Consequently, E{z}= 0 and E {wTz}= 0 and therefore the
update rule in (16) can be simplified to the following:
w+ E{z(wTz)2}(20)
This simplification decreases the computational cost of the
algorithm significantly, however, we lose possible bias infor-
mation of the input signals.
One benefit of the proposed algorithm is that it might be
a more appropriate tool for biomedical applications, where
skewness is more important to nonGaussianity for certain
categories of signals. Thus, one can gain all the benefits
of estimating lower-order moments. Also, skewness, along
with kurtosis, can be defined as a cumulant, unlike general
nonlinearities that can be used in ICA. Therefore, it should be
possible to make the proposed algorithm “blind” to additive
noise of known covariance [3].
III. EXPERIMENTS
For this experiment, we created four artificial mixtures of
two symmetrical and two skewed signals, in order to test
the algorithm’s performance. We used a uniformly distributed
source from -1 to 1 and a Gaussian source with zero mean
0 0.2 0.4 0.6 0.8 1
0
50
100
150
200
250
300 Source 1
−1 −0.5 0 0.5 1
0
50
100
150
200
250
300
350 Source 2
−1 −0.5 0 0.5 1
0
20
40
60
80
100
120 Source 3
−1 −0.8 −0.6 −0.4 −0.2 0
0
200
400
600
800 Source 4
Fig. 1. Histograms of the four input sources used in the first experiment.
Source 1 is skewed to the left and Source 4 is skewed to the right. Both
skewed signals have non-zero mean.
and unit variance as symmetrical signals. The skewed signals
were two Weibull distributed signals with different parameter
values, one skewed to the left and one skewed to the right.
The distributions of the four input signals are shown in figure
1. We used 5000 points of these randomly generated sources,
which were mixed with the following example mixing matrix
A.
A=
0.40 0.25 0.1 0.35
0.17 0.25 0.45 0.13
0.15 0.10 0.20 0.65
0.23 0.57 0.10 0.10
(21)
In figure 2, we can see the convergence of the update
rule in (16). The algorithm converged almost after 510
iterations using random initialisation. We also measured the
Signal-to-Noise Ratio (SNR), comparing the original input
signal siwith the corresponding separated signal uj, using
the definition in (22). To perform accurate measurement, the
scale and permutation ambiguity of the model should be taken
into account and the signals should be normalised accordingly.
SN R(dB) = 10 log10
E{s2
i}
E{(siuj)2}(22)
The algorithm managed to identify and separate both sig-
nals with promising performance. The histograms of the two
identified sources are depicted in figure 3. The algorithm has
isolated the two skewed signals from the mixture. The quality
of separation was also promising, giving SN R1= 25.4060dB
and SN R2= 40.4802dB for the two sources. If we are
searching for more than 2sources, then the algorithm will
not be able to unmix the other signals. Instead, the separation
results are Gaussian-like signals, i.e. the mixture of the two
symmetrical signals.
0 5 10 15 20 25 30
−1
−0.8
−0.6
−0.4
−0.2
0
0.2
0.4
0.6
0.8
1
Iterations
Fig. 2. Convergence of the four coefficients of the vector wusing the fixed-
point algorithm in (16) and random initialisation.
0123456789
0
50
100
150
200
250
300
350 Separated skewed source 1
0 1 2 3 4 5 6 7
0
50
100
150
200 Separated skewed source 2
Fig. 3. Histograms of the two skewed separated sources. Comparing to the
original histograms in figure 1, we can observe the scale and permutation
ambiguity in the estimation of the linear generative model in (1).
IV. APPLICATION TO BIOMEDICAL SIGNALS
The application of Independent Component Analysis (ICA)
on biomedical signal processing has been highlighted in the
literature during the last years [4]. In medical applications, a
number of sensors are used to observe the function of body
organs. Usually, these sensors capture the activity of the organs
by measuring the electric potential at several points of the
body. The medical doctor examines these measurements over
time series and can infer about the medical condition.
Let us briefly explore how the linear generative model
of (1) can be applied in this case. In fact, these sensors
capture the overall picture of a more complicated phenomenon
that can be usually analysed into contributions from different
underlying components. The contribution of each of these
components to the signal, observed by each sensor, may be
different in terms of amplitude and delay. However, as we are
referring to electrical current measurements and noting that
the sensors are usually placed relatively close, it is common
to assume that the contribution of each component arrives with
insignificant delay. Consequently, we can consider the mixing
to be instantaneous.
Assume that the individual components are also statistically
independent, ICA can be employed to separate these com-
ponents. A number of approaches have employed ICA for
separation for several types of biomedical signals. However,
there are several components that can be identified using other
statistical characteristics than statistical independence, i.e. non-
Gaussianity. In this paper, we will use skewness to identify
several known components in electrocardiograms (ECG) and
electroencephalograms (EEG) that are not symmetric.
A. Electrocardiogram (ECG) Signals
Electrocardiogram (ECG) is a test that measures the elec-
trical activity of the heart. It is used to measure the rate and
regularity of heartbeats, as well as the size and position of
the chambers, the presence of any damage to the heart and
the effects of drugs or devices used to regulate the heart
(e.g. a pacemaker). In literature, it has been shown that the
electric potential in one part of the body surface can be
obtained by adding partial contributions of the heart potentials,
each one scaled by a transfer coefficient [7]. As a result, the
instantaneous mixture model of (1) can be used to model the
ECG monitoring system. The voltages for the 12-lead ECG
can be expressed as an instantaneous mixture of the heart
potentials.
In figure 4, we can see twelve signals that are given from
a 12-lead ECG. The ECG can detect several parts of these
signals that are associated with Ventricular Activity (VA) and
also a number of more subGaussian signals that are associated
with Atrial Activity (AA) [7]. A number of methods have been
proposed for the identification of VA or AA signals, based on
either ICA or other methods. However, one can observe that
VA signals are not symmetrical, compared to AA signals or
other noise signals. Therefore, one can optimise skewness, as
shown earlier on, to identify and separate signals associated
with VA.
For this experiment, we used a 12-lead set of signals from
the PhysioNet database [11], sampled at 1 KHz (see figure
4). The signals were filtered initially using a notch filter to
remove mains interference and then with a band-pass filter
with cut-off frequencies of 0.5 and 60 Hz to remove baseline
wandering and thermal noise [7]. We applied the algorithm in
(16), using 5000 input samples of each electrode, to estimate
4 skewed components in the mixture. The separated signals
are depicted in figure 5. The algorithm managed to identify
three components that are associated with VA. The algorithm
also identified a fourth non-symmetrical component, but can
not be clearly associated with VA.
0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000
Samples
12−lead ECG
Fig. 4. A twelve-lead input ECG signal from the Physionet database [11].
0 500 1000 1500 2000 2500
−5
0
5
10
samples
0 500 1000 1500 2000 2500
−5
0
5
10
samples
0 500 1000 1500 2000 2500
−5
0
5
10
samples
0 500 1000 1500 2000 2500
−10
−5
0
5
samples
Fig. 5. Trying to separate four skewed components, the algorithm identified
the first three, which can be associated with VA.
Hence, the proposed algorithm can be used as a method of
isolating VA signals. It can also be used as a preprocessing step
to identify the AA signal. Using the proposed algorithm, one
can separate VA signals and then use second-order methods
to isolate the AA signal, as proposed by Rieta et al [7].
B. Electroencephalogram (EEG) Signals
The electroencephalogram (EEG) is a medical test used to
measure/monitor the electrical activity of the brain via elec-
trodes applied to the scalp. This safe and painless procedure
can help diagnose a number of conditions, including epilepsy,
sleep disorders and brain tumours. This method can provide
direct information about the neural dynamics on a millisecond
scale [6], [10]. The EEG are very sensitive, low-amplitude
signals (µV ) and are usually contaminated by noise. As a
result, a number of methods were proposed in literature to
0 200 400 600 800 1000 1200 1400 1600 1800 2000
0
20
40
60
80
100
120
Samples
32−lead EEG
Fig. 6. A thirty two-lead input EEG signal from the EEGLAB test data [2].
isolate meaningful signals from artifacts and noise signals in
an EEG recording.
The application of ICA to the study of EEG signals is valid
only if some conditions are at least approximately satisfied [4].
First of all, we have to assume the existence of independent
components (source signals), i.e. hidden neural centres that
can serve as sources. This assumption can be justified sta-
tistically, however, in general there is no physical evidence
confirming its validity in full. Secondly, the assumption of
linear instantaneous mixing should also hold. As most EEG
signals lie below 1 KHz, the propagation of the signals can
be considered immediate and hence, there is no need to
introduce time delays in the model. Finally, the mixing has to
be stationary, i.e. the mixing matrix Ashould not be changing.
Although, the underlying source signals are documented to be
non-stationary, the mixing matrix should remain stationary, as
long as the electrodes in the cask are not moving. In practice,
we are bound to have slight movement during the experiment,
however, we will assume that the change in the mixing matrix
is insignificant.
Using ICA, one can identify hidden underlying sources in
EEG [6], [10]. The estimated independent components can be
localised and projected on the sensors’ space and highlight
activity in certain parts of the brain. However, EEG signals
are contaminated by certain artifacts, i.e. signals that are not
generated by brain activity, but by some external disturbances
(e.g. muscle activity, heart beat and eye blinking). A number
of methods have been used to remove the possible artifacts
present in EEG [6], [8], [10]. The eye blinking signal is
an artifact, reported to be usually skewed to one side [8].
Consequently, one might use the update algorithm described
earlier to identify and remove the eye blinking signal from
EEG recordings.
We performed some preliminary experiments to test the
previous argument. We used the test EEG data set that is
provided with the EEG software package EEGLAB [2]. This
0 200 400 600 800 1000 1200 1400 1600 1800 2000
−5
0
5
10
Samples
0 200 400 600 800 1000 1200 1400 1600 1800 2000
−4
−2
0
2
4
Samples
0 200 400 600 800 1000 1200 1400 1600 1800 2000
−4
−2
0
2
4
Samples
Fig. 7. Identifying three skewed components from the 32-lead EEG. The
first choice for the algorithm was an artifact associated with eye-blinking.
consists of a 32-lead EEG recording, sampled at 500 Hz
(see figure 6). The whole available data set (61 secs) was
used in the update rule of (16). The algorithm was requested
to identify three skewed components. In figure 7, the first
identified component is the one usually associated with eye
blinking [6], [10]. This implies that skewness is a proper
criterion to identify the eye blinking artifacts. This preliminary
effort demonstrates a valid application of the proposed method
in the area of EEG analysis.
The algorithm’s convergence in the case of real biomedical
data was comparable to the performance with artificial data. In
either case, the algorithm required not more than 30 iterations
for convergence.
V. CONCLUSION
In this paper, we have proposed an algorithm to separate
skewed sources in an instantaneous mixture. After a prewhiten-
ing step, the algorithm optimises a third-order moment, i.e.
skewness, using a “fixed-point” optimisation scheme to per-
form identification and separation of the skewed sources.
The algorithm’s convergence and performance in an artificial
experiment seems very promising. In addition, we explored
the use of the proposed technique in several biomedical appli-
cations. More specifically, we have given some preliminary
examples of Ventricular Activity signal separation in ECG
signals and eye blinking removal from EEG signals.
In the future, the authors would like to quantify the conver-
gence of the proposed technique and investigate more solidly
its performance in the case of ECG and EEG signals.
ACK NO WL EDG MEN T
The authors would like to thank Dr. Stella S. Daskalopoulou
for the insightful advice and guidance on the medical part of
the paper.
REFERENCES
[1] A. Cichocki, S.I. Amari, Adaptive Blind Signal and Image Processing.
Learning algorithms and applications, John Wiley & Sons, 2002.
[2] A. Delorme, S. Makeig, EEGLAB: an open source toolbox for analysis of
single-trial EEG dynamics, Journal of Neuroscience Methods, 134:9-21,
(2004).
[3] A. Hyv¨
arinen. Independent component analysis in the presence of
Gaussian noise by maximizing joint likelihood. Neurocomputing, 22:49–
67, 1998.
[4] A. Hyv¨
arinen, J. Karhunen, E. Oja, Independent Component Analysis,
John Wiley & Sons, 2001.
[5] A. Hyv¨
arinen, E. Oja, A Fast Fixed-Point Algorithm for Independent
Component Analysis, Neural Computation, 9(7):1483-1492, 1997.
[6] T.-P. Jung, C. Humphries, T.-W. Lee, S. Makeig, M. J. McKeown,
V. Iragui, T. Sejnowski, Extended ICA removes artifacts from elec-
troencephalographic recordings, Psychophysiology, 37, 2000, 163178.
Cambridge University Press.
[7] J.J. Rieta, F. Castells, C. Sanchez, J. Igual, ICA applied to atrial
fibrillation analysis, 4-th Int. Symposium on Independent Component
Analysis and Blind Signal Separation, April 2003, Nara, Japan.
[8] S. Sanei, L. Shoker, Artefact Removal from EEGs Using a Hybrid
BSS-SVM Algorithm, Invited Talk, IEE Biomedical Signal Processing
Workshop, London, Dec. 1, 2004.
[9] P.F. Stetson, Independent Component Analysis of Pulse Oximetry Signals
based on Derivative Skew, 5-th Int. Symposium on Independent Com-
ponent Analysis and Blind Signal Separation, September 2004, Granada,
Spain.
[10] R. Vig´
ario, V. Jousm¨
aki, M. Ham¨
al¨
ainen, R. Hari, E. Oja, Independent
Component Analysis for identification of artifacts in Magnetoencephalo-
graphic recordings, Advances in Neural Information Processing System
10 (Proc. NIPS 97), pp. 229 - 235. MIT Press.
[11] PhysioBank website, The PTB Diagnostic ECG Database,
http://www.physionet.org/physiobank/database/ptbdb/
... In the results presented in this study, different contrast functions have been tested, ranging from the skewness G (x)=x 3 /3 (Hyvarinen 1999, Mitianoudis et al 2005), to functions more robust to outliers, such as G(x)=log(cosh(x)) and G(x)=exp(−x 2 /2), suggested for the fastICA algorithm. The skewness function offers very fast convergence, but is also sensitive to outliers and signal artifacts and, thus, less appropriate when the EMG signals are of poor recording quality. ...
Article
Full-text available
Objective. The study of motor unit behavior has been classically performed by selective recording systems of muscle electrical activity (EMG signals) and decomposition algorithms able to discriminate between individual motor unit action potentials from multi-unit signals. In this study, we provide a general framework for the decomposition of multi-channel intramuscular and surface EMG signals and we extensively validate this approach with experimental recordings. Approach. First, we describe the conditions under which the assumptions of the convolutive blind separation model are satisfied. Second, we propose an approach of convolutive sphering of the observations followed by an iterative extraction of the sources. This approach is then validated using intramuscular signals recorded by novel multi-channel thin-film electrodes on the Abductor Digiti Minimi of the hand and Tibilias Anterior muscles, as well as on high-density surface EMG signals recorded by electrode grids on the First Dorsal Interosseous muscle. The validation was based on the comparison with the gold standard of manual decomposition (for intramuscular recordings) and on the two-source method (for comparison of intramuscular and surface EMG recordings) for the three human muscles and contraction forces of up to 90% MVC. Main results. The average number of common sources identified for the validation was 14 ± 7 (averaged across all trials and subjects and all comparisons), with a rate of agreement in their discharge timings of 92.8 ± 3.2%. The average Decomposability Index, calculated on the automatic decomposed signals, was 16.0 ± 2.2 (7.3–44.1). For comparison, the same index calculated on the manual decomposed signals was 15.0 ± 3.0 (6.3–76.6). Significance. These results show that the method provides a solid framework for the decomposition of multi-channel invasive and non-invasive EMG signals that allows the study of the behavior of a large number of concurrently active motor units.
Article
Objective: The identification of individual neuronal activity from multielectrode arrays poses significant challenges, including handling data from numerous electrodes, resolving overlapping action potentials and tracking activity across long recordings. This study introduces NeuroNella, an automated algorithm developed to address these challenges. Approach: NeuroNella employs blind source separation to leverage the sparsity of action potentials in multichannel recordings. It was validated using three datasets, including two publicly available ones: (1) in vitro recordings (252 channels) of retinal ganglion cells from mice with simultaneous ground-truth loose patch data to assess accuracy; (2) a Neuropixel recording from an awake mouse, comprising 374 channels spanning different brain areas, to demonstrate scalability with dense multielectrode configurations in in vivo recordings; and (3) data (32 channels) recorded from the medullary reticular formation in a terminally anaesthetised macaque, to showcase decomposition over long periods of time. Main Results: The algorithm exhibited an error rate of less than 1% compared to ground-truth data. It reliably identified individual neurons, detected neuronal activity across a wide amplitude range, and tolerated minor probe shifts, maintaining robustness in prolonged experimental sessions. Significance: NeuroNella provides an automated and efficient method for neuronal activity identification. Its adaptability to diverse dataset, species, and recording configurations underscores its potential to advance studies of neuronal dynamics and facilitate real-time neuronal decoding systems.
Article
Full-text available
In this work we present a new biomedical application of independent component analysis (ICA) to solve the problem of atrial activity (AA) extraction from real electrocardiogram (ECG) recordings of atrial fibrillation (AF). The proper analysis and characterization of AA from ECG recordings requires, as a first step, the cancellation of ventricular activity (VA). The present contribution demonstrates the appropriateness of ICA to solve this problem based on three considerations: firstly AA and VA are generated by independent bioelectric sources, secondly AA and VA are subgaussian and supergaussian activities, respectively, and finally the surface ECG can be regarded as an instantaneous linear mixing process. After ICA algorithm application to recordings from 7 patients in AF, we prove that the AA source can be identified using a kurtosis-based reordering of the separated sources and a ulterior spectral analysis for those sources with subgaussian kurtosis.
Article
Full-text available
We introduce a novel fast algorithm for independent component analysis, which can be used for blind source separation and feature extraction. We show how a neural network learning rule can be transformed into a fixedpoint iteration, which provides an algorithm that is very simple, does not depend on any user-defined parameters, and is fast to converge to the most accurate solution allowed by the data. The algorithm finds, one at a time, all nongaussian independent components, regardless of their probability distributions. The computations can be performed in either batch mode or a semiadaptive manner. The convergence of the algorithm is rigorously proved, and the convergence speed is shown to be cubic. Some comparisons to gradient-based algorithms are made, showing that the new algorithm is usually 10 to 100 times faster, sometimes giving the solution in just a few iterations.
Conference Paper
Various artefacts such as eye-blinking, also called electrooculogram (EOG), and electrocardiograms (ECG) deteriorate the quality of electroencephalograms (EEGs) which are going to be used for clinical diagnosis or to be processed. Blind source separation (BSS) has been a powerful tool to separate different EEG and artefact sources. Identification of different sources from the independent components however, has been always under question. A robust technique for detection and removal of the artefacts requires an efficient classification of the independent components estimated by an effective BSS method. In this talk we will explore different schemes in EEG artefact removal. Then we will concentrate on a robust technique based on fusion of blind identification based BSS algorithm and an effective classification method using support vector machines (SVM). The classifier exploits the statistical properties of both the EEG and the artefact using certain features to identify the best separating hyperplane. We will show that the proposed supervised method is very effective, robust and computationally cost efficient. (19 pages)
Book
A comprehensive introduction to ICA for students and practitionersIndependent Component Analysis (ICA) is one of the most exciting new topics in fields such as neural networks, advanced statistics, and signal processing. This is the first book to provide a comprehensive introduction to this new technique complete with the fundamental mathematical background needed to understand and utilize it. It offers a general overview of the basics of ICA, important solutions and algorithms, and in-depth coverage of new applications in image processing, telecommunications, audio signal processing, and more.Independent Component Analysis is divided into four sections that cover:* General mathematical concepts utilized in the book* The basic ICA model and its solution* Various extensions of the basic ICA model* Real-world applications for ICA modelsAuthors Hyvarinen, Karhunen, and Oja are well known for their contributions to the development of ICA and here cover all the relevant theory, new algorithms, and applications in various fields. Researchers, students, and practitioners from a variety of disciplines will find this accessible volume both helpful and informative.
Article
We consider the estimation of the data model of independent component analysis when Gaussian noise is present. We show that the joint maximum likelihood estimation of the independent components and the mixing matrix leads to an objective function already proposed by Olshausen and Field using a different derivation. Due to the complicated nature of the objective function, we introduce approximations that greatly simplify the optimization problem. We show that the presence of noise implies that the relation between the observed data and the estimates of the independent components is non-linear, and show how to approximate this non-linearity. In particular, the non-linearity may be approximated by a simple shrinkage operation in the case of super-Gaussian (sparse) data. Using these approximations, we propose an efficient algorithm for approximate maximization of the likelihood. In the case of super-Gaussian components, this may be approximated by simple competitive learning, and in the case of sub-Gaussian components, by anti-competitive learning.
Conference Paper
We have studied the application of an independent component analysis (ICA) approach to the identification and possible removal of artifacts from a magnetoencephalographic(MEG) recording. This statistical technique separates components according to the kurtosis of their amplitude distributions over time, thus distinguishing between strictly periodical signals, and regularly and irregularly occurring signals. Many artifacts belong to the last category. In order to assess the effectiveness of the method, controlled artifacts were produced, which included saccadic eye movements and blinks, increased muscular tension due to biting and the presence of a digital watch inside the magnetically shielded room. The results demonstrate the capability of the method to identify and clearly isolate the produced artifacts.
Conference Paper
This paper describes the application of ICA to signals from a pulse oximeter, a device for measuring arterial blood oxygen content. Since the arterial pulse signal resembles a sawtooth wave, taking the time derivative enhances the skew of this component relative to that of the interference. Third-order ICA techniques can take advantage of this to aid the separation. This paper shows an example of ICA used on pulse oximetry signals and then presents simulations to demonstrate that, for pulse oximetry signals, ICA based on third-order statistics of the derivative is indeed superior to fourth-order ICA.