ArticlePDF Available

Heart rate from face videos under realistic conditions for advanced driver monitoring

Authors:
  • University of Augbsurg

Abstract and Figures

The role of physiological signals has a large impact on driver monitoring systems, since it tells something about the human state. This work addresses the recursive probabilistic inference problem in time-varying linear dynamic systems to incorporate invariance into the task of heart rate estimation from face videos under realistic conditions. The invariance encapsulates motion as well as varying illumination conditions in order to accurately estimate vitality parameters from human faces using conventional camera technology. The solution is based on the canonical state space representation of an Itô process and a Wiener velocity model. Empirical results yield to excellent real-time and estimation performance of heart rates in presence of disturbing factors, like rigid head motion, talking, facial expressions and natural illumination conditions making the process of human state estimation from face videos applicable in a much broader sense, pushing the technology towards advanced driver monitoring systems.
Content may be subject to copyright.
Current Directions in Biomedical Engineering 2017; 3(2): 483487
Christian S. Pilz*, Sebastian Zaunseder, Ulrich Canzler and Jarek Krajewski
Heart rate from face videos under realistic
conditions for advanced driver monitoring
Abstract: The role of physiological signals has a large
impact on driver monitoring systems, since it tells something
about the human state. This work addresses the recursive
probabilistic inference problem in time-varying linear
dynamic systems to incorporate invariance into the task of
heart rate estimation from face videos under realistic
conditions. The invariance encapsulates motion as well as
varying illumination conditions in order to accurately
estimate vitality parameters from human faces using
conventional camera technology. The solution is based on the
canonical state space representation of an Itô process and a
Wiener velocity model. Empirical results yield to excellent
real-time and estimation performance of heart rates in
presence of disturbing factors, like rigid head motion, talking,
facial expressions and natural illumination conditions making
the process of human state estimation from face videos
applicable in a much broader sense, pushing the technology
towards advanced driver monitoring systems.
Keywords: Photoplethysmography Imaging, Diffusion
Process, SDE, Invariance, Psychophysiology, Human State
Computing.
https://doi.org/10.1515/cdbme-2017-0102
1 Introduction
During the last years, the task of measuring skin blood
perfusion and heart rate measurements from facial images
became inherent part of several top conferences.
Interestingly, most contributions focus on how to cope with
motion like head pose variations and facial expressions since
any kind of motion on a specific skin region of interest will
destroy the underlying blood perfusion signal in a way that
no reliable information can be extracted anymore. Figure 1
illustrates the disturbing influence of head motions on the
raw pulse signal.
1.1 Related work
The term Photoplethysmography, short PPG, dates back to
the late first half of the 20th century, when Molitor and
Kniazak [1] recorded peripheral circulatory changes in
animals. One year later, Hertzmann [2] introduced the term
Photoelectric Plethysmograph as he observed "the amplitude
of volume pulse as a measure of the blood supply of the
skin". With the ongoing fast development of semiconductor
technology, the last three decades has seen large progress in
the PPG instrumentation. PPG sensors have been explored
extensively, including the ring finger, wrist, brachia, earlobe,
and external ear cartilage. Advancement to the classical PPG
is the camera based Photoplethysmography Imaging (PPGI)
method introduced by the pioneering work of Blazek [3].
Since his first published visualisation of pulsatile skin
perfusion patterns in the time and frequency domain,
classical signal processing methods are applied commonly to
extract reasonable information out of the perfusion signals
[4][5][6]. Hülsbusch [4] realized that motion of the skin area
of interest inherently induces artifacts into the extracted
signal. Therefore, canceling motion artifacts during signal
processing became an important aspect for reliable skin
______
* Corresponding author: Christian S. Pilz: CanControls GmbH,
Markt 45, 52064 Aachen, Germany, e-mail: pilz@cancontrols.com
Sebastian Zaunseder: TU Dresden, Institut für Biomedizinische
Technik, Fetscherstr. 29, 01307 Dresden, Germany
Ulrich Canzler: CanControls GmbH, Markt 45, 52064 Aachen,
Germany, e-mail: canzler@cancontrols.com
Jarek Krajewski: Bergische Universität Wuppertal, Institut für
Sicherheitstechnik HF - Arbeitsbereich Human Factors &
Diagnostik, Gaußstraße 20 42119 Wuppertal, Germany
krajewsk@uni-wuppertal.de
Figure 1: A Typical scenario where heart rate estimation becomes
challenging, rigid head motions. In the first 250 frames the user is
in a resting state and the fine pulsation of blood flow is visible on
the averaged green channel of skin pixels. After 300 frames the
user starts to move his head and the pulse signal gets lost.
484
blood perfusion measurements. From the basic early idea of
compensating the motion of the skin area of interest by
optical flow methods directly in the image plane [4], Poh et
al. [6] regarded the problem solution for facial videos as a
blind source separation task using Independent Component
Analysis (ICA) over the different color channels. De Haan
and Jeanne [10] proposed to map the PPGI-signals by linear
combination of RGB data to a direction that is orthogonal to
motion induced artifacts. A recent alternative, which does not
require skin-tone or pulse-related priors in contrast to the
channel mapping algorithms, determines the spatial subspace
of skin-pixels and measures its temporal rotation for signal
extraction [11]. We go beyond the state of the art and
propose a holistic classical interpretation of the blood
perfusion phenomena.
2 Methodology
The underlying system of measuring heart rates from face
regions using conventional camera technology is modelled
upon a diffusion process. The entire process itself is divided
into independent single processes; the heart frequency, the
illumination and the users head movement and facial motion.
The periodic event of heart frequency is expressed in form of
a stochastic resonator
󰇛󰇜
 󰇛󰇜󰇛󰇜 󰇛󰇜 (1)
with
󰇛󰇜 󰇛󰇜 󰇛󰇜 (2)
representing the solution of a second order differential
equation with respect to the classical mechanics of circular
motion [13]. The white noise component 󰇛󰇜 reflects small
changes in amplitude and phase. The major advantage of
such a stochastic representation of a resonator is, even when
the frequency has discontinuous the signal is always
continuous. Figure 2 shows a single stochastic oscillator with
time-varying frequency and amplitude. The illumination as
well as the head movement and facial motion are expressed
as a Wiener process
󰇛󰇜
 󰇛󰇜 (3)
whereby a violation of the smoothness criterion yields to a
generalized Poisson (e.q. Cox) process
󰇛󰇜  (4)
describing the time varying jump frequency and magnitude
of pixel intensities. Figure 3 shows a simulated trajectory of
a Wiener process and its realization modulated by a Poisson
process.
The general solution of the corresponding stochastic
differential equations

󰇍
󰇛󰇜

󰇍
󰇛󰇜 
󰇍
󰇍
󰇍
󰇛󰇜 (5)
is given by Itô's lemma [12].
Figure 2: A simulated trajectory of a stochastic oscillator with
frequency trace in a range typical for a human in resting state.
Figure 1: A simulated trajectory of a Wiener process and its
realization modulated by a Poisson process with jump frequency
1.95 and magnitude 35.
The discrete-time approximation yields to [14]

󰇍
󰇛 󰇜 󰇛󰇜
󰇍
󰇛󰇜
 󰇛 󰇜
󰇍
󰇍
󰇍
󰇛󰇜

 (6)
with   , the Wiener process 󰇛󰇜 with spectral
density and the covariance of the stochastic integral


 (7)
with 󰇛󰇜, which results to the discrete-time model

󰇍
󰇛󰇜
󰇍
󰇛󰇜
󰇍
󰇛󰇜 (8)

󰇍
󰇛󰇜
󰇍
󰇛󰇜
󰇍
󰇛󰇜 (9)
with process noise
󰇍
󰇛󰇜  (10)
and measurement noise
󰇍
󰇛󰇜 (11)
If the resonator's fundamental frequency is known, the
solution yields to a general time-discrete linear dynamic
system [9]. However, since the resonator's fundamental
frequency is unknown, the problem is given as latent state of
the frequency. This results in a Markov process, whereby the
latent states are time-discrete linear dynamic systems. The
closed form solution to this problem is described by Bloom
and Bar-Shalom [8]. The advantage of this kind of
formulation is that the case of non-uniform sampling as well
as missing observations is naturally included in the model.
The basic idea of methodology is inspired by the work of
Särrkä [7].
3 Experiments
To evaluate the proposed model, empirical data is collected
under natural environmental conditions with a typical low-
cost opto-electronical sensor device, a Logitech HD C270
webcam, as well as reference ground truth measurements
using a common finger pulseoximeter, a CMS50E PPG
device. 25 users were asked to perform video recordings in
two sessions resulting in a total amount of 50 videos. The
first session is selected to be even-tempered without any kind
of larger head or body movements and facial expressions.
During the second session, participants were free to move
their head naturally while remaining seated. Typical
movements included tilting the head sideways, nodding the
head, looking up/down and leaning forward/backward. Some
participants also made facial expressions, or started to talk.
There reflecting typcial driver behaviour. The recording
illumination environment was chosen as daylight scenario
without any additional lighting. The duration of each session
is approximately one minute. The frame rate was fixed to 15
fps in average and the corresponding time stamps for each
frame were captured too. The finger pulseoximeter data for
each session and participant was stored for later comparison.
For every video recording a standard face finder was used to
determine the analysis region of interest. The extracted
averaged gray intensity feature was feed into the vector
valued representation of the diffusion process on a frame by
frame basis. On every estimated pulse trace a spectral peak is
determined by the Lomb periodogram. The frame duration
was set to 10 seconds with 90 percent overlap. The
correlation and Bland-Altman plots for the resting and head
motion condition are reported in the following figure 5 and
figure 6 respectively. To obtain further insides about the
potential strength of the diffusion process model, the
approach is compared against the recently published Spatial
Subspace Rotation (SSR) [11] and the baseline ICA approach
Figure 2: Comparison of an estimated pulse signal under rigid
head motions and the corresponding spectrogram for the ICA [6],
the SSR [11] and the diffusion process method. These estimates
are based upon the video illustrated in figure 1.
Figure 5: Correlation and Bland-Altman plots of PPGI diffusion
process estimated heart rate against CMS50E PPG reference of
25 users in resting state.
486
[6]. Figure 4 compares an users estimated pulse signal under
rigid head motions and the corresponding spectrogram for the
three methods. The heart rate for the ICA methods nearly
gets lost completely. For the SSR method the frequency trace
is better visible but cannot compete against the diffusion
process model where the heart rate is very clear over the
entire sequence of head movements. The detailed correlation
coefficients and squared errors of prediction for all
approaches are provided for the two data sessions in Table 1.
ICA performs worst and is not able to provide reliable heart
rate information during head motion. Although SSR performs
better it cannot compete against the robustness of the
diffusion process.
Table 1: Pearson's correlation coefficient and squared errors
of prediction of ICA, the SSR and the diffusion process (DP)
method under different scenarios.
Type
Resting
Head Rotation
ICA
0.61/7.8
0.21/14.6
SSR
0.78/4.8
0.47/7.6
DP
0.95/3.6
0.87/3.9
4 Conclusion
In this work, we have presented a holistic signal
interpretation of heart rate estimation from face videos under
realistic simulated driving conditions. The closed form
solution of the corresponding stochastic differential equations
yields to a diffusion process where the exact estimate of the
source separated heart rate signal is obtained via the posterior
distribution of the process. We compared the model against
two common approaches on face videos under resting as well
as head and facial motion scenarios under natural
illumination conditions. Measurements on a 25 user
experiment showed clearly superior robustness of the
diffusion process modelling, although the uncertainty of
prediction still gets slightly increased during natural head
motion. We conclude that an entirely invariant process model
still depends on a more robust feature representation.
Author’s Statement
Research funding: The research leading to these results has
received funding from the German Federal Ministry of
Education and Research (BMBF) under grant agreement
01|S15024 (VIVID - IKT2020/2015-2018). The opinions
expressed here are those of the authors and may or may not
reflect those of the sponsoring parties. Conflict of interest:
Authors state no conflict of interest. Informed consent:
Informed consent has been obtained from all individuals
included in this study. Ethical approval: The research related
to human use complies with all the relevant national
regulations, institutional policies and was performed in
accordance with the tenets of the Helsinki Declaration, and
has been approved by the authors' institutional review board
or equivalent committee.
References
[1] H. Molitor and M. Knaizuk, A new bloodless method for
continuous recording of peripheral change. Jour. Phar. Expr.
Ther., 27: 5-16.1936.
[2] A.B. Hertzman. Photoelectric Plethysmography of the
Fingers and Toes in Man. Exp. Biol. Med.,37:,529-534.1937.
[3] V. Blazek. Optoelektronische Erfassung und
rechnerunterstützte Analyse der Mikrozirkulations-Rhythmik.
Biomed. Techn. 30 (1):121-122.1985.
[4] M. Hülsbusch. A functional imaging technique for opto-
electronic assessment of skin perfusion. PhD thesis, RWTH
Aachen University.2008
[5] W. Verkruysse, L.O. Svaasand and J.S. Nelson. Remote
plethysmographic imaging using ambient light. Optics
Express, 16 (16):21434-21445. 2008.
[6] M.Z. Poh, J.D. McDuff and R.W. Picard. Non-contact,
automated cardiac pulse measurements using video imaging
and blind source separation. Optics Express, 18 (10): 10762-
10774. 2010.
[7] S. Särkkä. Recursive Bayesian Inference on Stochastic
Differential Equations. PhD thesis, Helsinki University of
Technology. 2006
[8] H.A.P Bloom and Y. Bar-Shalom. The interacting multiple
model algorithm for systems with Markovian switching
coefficients. IEEE Transactions on Automatic Control, 33
(8):780-783.1988.
[9] R. Kalman and R. Bucy. New results in linear filtering and
prediction theory. Transactions of the ASME-Journal of Basic
Engineering, 83:95-108.1961.
[10] G. de Haan and V. Jeanne. Robust pulse-rate from
chrominance-based rPPG. IEEE Transactions on Biomedical
Engineering, 60 (10): 2878-2886.2014.
Figure 6: Correlation and Bland-Altman plots of PPGI diffusion
process estimated heart rate against CMS50E PPG reference of
25 users performing head rotations.
[11] W. Wang, S. Stuijk and G. de Haan. A Novel Algorithm for
Remote Photoplethysmography: Spatial Subspace Rotation.
IEEE Transactions on Biomedical Engineering, 63 (9):1974-
1984.2016
[12] K. Itô. On Stochastic Differential Equations. Memoris Of The
American Mathematical Society, 4.1951.
[13] R. Feynman, R. Leighton and M. Sands. The Feynman
Lectures on Physics Vol. 1. Chapter 21. Addison-Wesley.
1963.
[14] B. Øksendal. Stochastic Differential Equations. Springer,
2003.
... In recent years, the development of noncontact heart rate detection and estimation has become a research hotspot in the field of physiological information monitoring to overcome the limitations of traditional instruments. Scholars have proposed a number of methods for noncontact heart rate detection and estimation, including ultrasonic detection, thermodynamic imaging, and photoplethysmography (PPG) [3,4,5]. Recent studies have shown that heart rate can be determined by analyzing skin color changes in video signals. ...
... Results show that the energy of the blood volume signal is rearranged in vector space with a more concentrated distribution. Moreover, the recursive probabilistic inference problem in time-varying linear dynamic systems has been addressed to incorporate the invariance into the task of heart rate estimation from face videos under realistic conditions [5]. ...
Article
Full-text available
Video-based noncontact detection of heart rate has a wide range of applications in the field of medicine and health. However, this method is susceptible to noise interference, making it difficult to effectively extract blood volume pulse (BVP) signals. To overcome this problem, a new method of noncontact heart rate estimation that can suppress noise interference is proposed in this paper. First, the established data acquisition system conducts video collection, and the captured videos are divided into multiple small regions. Subsequently, the initial signals of BVP are extracted in accordance with the chrominance features extracted through multi-channel data fusion. The BVP signals are separated using the FastICA algorithm. The kurtosis value and signal-to-noise ratios of the power spectrum of the separated signals are analyzed to determine the effective separation component. Results show that this method can extract and process pulse signals, effectively suppressing non-periodic interference. The experiment also proves that the method has good consistency with the measurement of pulse oximeter and has good stability and accuracy in the detection of heart rate of the human body.
... The number of sensors must also match the number of individuals who are required to be measured. Therefore, wearable technology is unsuitable for certain individuals and environments, including newborns, drivers, exercisers, skindamaged patients and disaster-affected patients [10]. 2 of 21 To solve this problem, researchers are examining non-contact methods for measuring the pulse using facial video, which minimizes user intervention and enables monitoring of the pulse rate when driving [11], sleeping [12], or stress monitoring [13] is required. ...
Article
Full-text available
Pulse wave and pulse rate are important indicators of cardiovascular health. Technologies that can check the pulse by contacting the skin with optical sensors built into smart devices have been developed. However, this may cause inconvenience, such as foreign body sensation. Accordingly, studies have been conducted on non-contact pulse rate measurements using facial videos focused on the indoors. Moreover, since the majority of studies are conducted indoors, the error in the pulse rate measurement in outdoor environments, such as an outdoor bench, car and drone, is high. In this paper, to deal with this issue, we focus on developing a robust pulse measurement method based on facial videos taken in diverse environments. The proposed method stably detects faces by removing high-frequency components of face coordinate signals derived from fine body tremors and illumination conditions. It optimizes for extracting skin color changes by reducing illumination-caused noise using the Cg color difference component. The robust pulse wave is extracted from the Cg signal using FFT–iFFT with zero-padding. It can eliminate signal-filtering distortion effectively. We demonstrate that the proposed method relieves pulse rate measurement problems, producing 3.36, 5.81, and 6.09 bpm RMSE for an outdoor bench, driving car, and flying drone, respectively.
... A significant milestone were the works of [4,5] which transformed contact PPG into a completely contact-free method of investigating cutaneous perfusion using a camera system. The PPGI technology finds its application in the monitoring of many physiological or pathophysiological processes, among which we can include vital signs monitoring [6][7][8][9][10][11], heart rate variability [12], oximetry [13], pulse transit time assessment [14], wound assessment [15,16], driver state estimation [17][18][19], or pain evaluation [20]. Tissue perfusion changes are also related to respiratory activity. ...
Article
Full-text available
Perfusion patterns of cutaneous tissue represent a valuable source of information about the state of the patient’s cardiovascular system and autonomic nervous system (ANS). This concept aims to observe the perfusion changes in the foot sole in two healthy individuals and two subjects affected by diabetes mellitus (DM). We use photoplethysmography imaging (PPGI) to monitor cutaneous perfusion changes. This method, in contrast to conventional contact photoplethysmography (PPG), allows the monitoring of skin perfusion with spatial distribution. We use a machine vision camera and an illumination system using the green light. To induce the perfusion changes, we perform an experiment in the form of a deep breathing test (DBT). The experiment consists of three stages, with the middle stage being the DBT. To evaluate spatial perfusion changes, we use a normalized measure of the correlation of PPGI signals with a reference PPG signal obtained from the foot’s little toe. This method also increases the signal-to-noise ratio (SNR). Subjects with DM shows different patterns of tissue perfusion changes compared to healthy subjects. The DM subjects show increased perfusion after DBT compared to the pre-DBT state, whereas in healthy subjects, the tissue perfusion does not reach the level of the pre-DBT phase. This work can be considered as proof of concept in developing a non-contact and non-intrusive monitoring system that allows a different view of microcirculatory damage in patients with diabetes mellitus, focusing on its spatial distribution.
... Over the last 20 years, camera-based sensing of photoplethysmographic signals became the focus of research in many fields of applications, such as vital sign monitoring (Blazek and Schultz-Ehrenburg 1998, Blazek 1999, Verkruysse et al 2008, Tarassenko et al 2014, Kumar et al 2015, Blanik et al 2016, Wang et al 2017, wound assessment (Huelsbusch andBlazek 2002, Thatcher et al 2016), driver state estimation (Kuo et al 2015, Pilz et al 2017 or pain evaluation (Koeny et al 2014). With the broader availability of sensitive sensors, fast interfaces and processing capabilities the modality is now emerging to the field of promising remote, non-contact and multi-site measurements. ...
Article
Objective: Photoplethysmography Imaging (PPGI) is a promising contactless camera-based method of non-invasive cardiovascular diagnostics. To achieve the best results, it is important to choose the most suitable camera for a specific application. The settings of the camera influence the quality of the detected signal. Approach: The standard (EMVA 2016) for evaluating the imaging performance of machine vision cameras (MVC) helps at the initial decision of the sensor, but the camera should always be tested in terms of usability for a specific application. So far, PPGI lacks a standardized measurement scenario for evaluating the performance of individual cameras. For this, we realized a controllable optoelectronic phantom with artificial silicone skin allowing reproducible tests of cameras to verify their suitability for PPGI. The entire system is housed in a light-tight box. We tested an MVC, a digital single-lens reflex camera (DSLR) camera and a webcam. Each camera varies in used technology and price. Main results: We simulated real PPGI measurement conditions simulating the ratio of pulse (AC) and non-pulse (DC) component of the photoplethysmographic signal and achieved AC/DC ratios of 0.5% on average. An additional OLED panel ensures proper DC providing reproducible measurement conditions. We evaluated the signal morphological features, amplitude spectrum, signal-to-noise ratio (SNR) and spatially dependent changes of simulated subcutaneous perfusion. Here, the MVC proved to be the most suitable device. A DSLR is also suitable for PPGI, but a larger smoothing kernel is required to obtain a perfusion map. The webcam, as the weakest contender, proved to be very susceptible to any inhomogeneous illumination of the examined artificial skin surface. However, it is still able to detect cardiac rhythm. Significance: The result of our work is an optoelectronic phantom for reproducible testing of PPGI camera performance in terms of signal quality and measurement equipment costs.
... Many works show the dramatic effect of nonstation ary conditions on the performance of processing methods [e.g. a drop of signaltonoise ratio (SNR) from over 10 dB to approximately 5 dB [33], a reduction of more than 50% in the correlation of iPPG HR and reference HR [55], etc.]. Notably, in many cases, algorithms, which were previ ously shown to produce stable results even under nonsta tionary conditions, do not yield satisfactory results when applied to other data. ...
Article
Full-text available
Over the last few years, the contactless acquisition of cardiovascular parameters using cameras has gained immense attention. The technique provides an optical means to acquire cardiovascular information in a very convenient way. This review provides an overview on the technique’s background and current realizations. Besides giving detailed information on the most widespread application of the technique, namely the contactless acquisition of heart rate, we outline further concepts and we critically discuss the current state.
Article
Full-text available
In this paper, we propose a conceptually novel algorithm, namely “Spatial Subspace Rotation” (2SR), that improves the robustness of remote photoplethysmography. Based on the assumption of (1) spatially redundant pixel-sensors of a camera and (2) a well-defined skin mask, our core idea is to estimate a spatial subspace of skin-pixels and measure its temporal rotation for pulse extraction, which does not require skin-tone or pulse-related priors in contrast to existing algorithms. The proposed algorithm is thoroughly assessed on a large benchmark dataset containing 54 videos, which includes challenges of various skin-tones, body-motions in complex illuminance conditions, and pulse-rate recovery after exercise. The experimental results show that given a well-defined skin mask, 2SR outperforms the popular ICA-based approach and two state-of-the-art algorithms (CHROM and PBV). When comparing the pulse frequency spectrum, 2SR improves on average the SNR of ICA by 2.22 dB, CHROM by 1.56 dB, and PBV by 1.95 dB. When comparing the instant pulse-rate, 2SR improves on average the Pearson correlation and precision of ICA by 47% and 65%, CHROM by 22% and 23%, PBV by 21% and 39%. ANOVA confirms the significant improvement of 2SR in peak-to-peak accuracy. The proposed 2SR algorithm is very simple to use and extend, i.e., the implementation only requires a few lines Matlab code.
Article
Full-text available
Remote measurements of the cardiac pulse can provide comfortable physiological assessment without electrodes. However, attempts so far are non-automated, susceptible to motion artifacts and typically expensive. In this paper, we introduce a new methodology that overcomes these problems. This novel approach can be applied to color video recordings of the human face and is based on automatic face tracking along with blind source separation of the color channels into independent components. Using Bland-Altman and correlation analysis, we compared the cardiac pulse rate extracted from videos recorded by a basic webcam to an FDA-approved finger blood volume pulse (BVP) sensor and achieved high accuracy and correlation even in the presence of movement artifacts. Furthermore, we applied this technique to perform heart rate measurements from three participants simultaneously. This is the first demonstration of a low-cost accurate video-based method for contact-free heart rate measurements that is automated, motion-tolerant and capable of performing concomitant measurements on more than one person at a time.
Article
Full-text available
This thesis is concerned with recursive Bayesian estimation of non-linear dynamical systems, which can be modeled as discretely observed stochastic differential equations. The recursive real-time estimation algorithms for these continuous-discrete filtering problems are traditionally called optimal filters and the algorithms for recursively computing the estimates based on batches of observations are called optimal smoothers. In this thesis, new practical algorithms for approximate and asymptotically optimal continuous-discrete filtering and smoothing are presented. The mathematical approach of this thesis is probabilistic and the estimation algorithms are formulated in terms of Bayesian inference. This means that the unknown parameters, the unknown functions and the physical noise processes are treated as random processes in the same joint probability space. The Bayesian approach provides a consistent way of computing the optimal filtering and smoothing estimates, which are optimal given the model assumptions and a consistent way of analyzing their uncertainties. The formal equations of the optimal Bayesian continuous-discrete filtering and smoothing solutions are well known, but the exact analytical solutions are available only for linear Gaussian models and for a few other restricted special cases. The main contributions of this thesis are to show how the recently developed discrete-time unscented Kalman filter, particle filter, and the corresponding smoothers can be applied in the continuous-discrete setting. The equations for the continuous-time unscented Kalman-Bucy filter are also derived. The estimation performance of the new filters and smoothers is tested using simulated data. Continuous-discrete filtering based solutions are also presented to the problems of tracking an unknown number of targets, estimating the spread of an infectious disease and to prediction of an unknown time series. Helsinki University of Technology Laboratory of Computational Engineering publications. Report B, ISSN 1457-0404; 54
Article
Full-text available
Plethysmographic signals were measured remotely (> 1m) using ambient light and a simple consumer level digital camera in movie mode. Heart and respiration rates could be quantified up to several harmonics. Although the green channel featuring the strongest plethysmographic signal, corresponding to an absorption peak by (oxy-) hemoglobin, the red and blue channels also contained plethysmographic information. The results show that ambient light photo-plethysmography may be useful for medical purposes such as characterization of vascular skin lesions (e.g., port wine stains) and remote sensing of vital signs (e.g., heart and respiration rates) for triage or sports purposes.
Article
Photoelectric plethysmographs for the fingers and toes are described which use electrocardiographs for the recording and which have definite advantages in routine clinical observations on the circulation. The validity of the technique is established (1) by comparison of the photoelectric records with simultaneous records obtained with transmission plethysmographs, (2) by comparison of the photoelectric records in instances of circulatory disturbances with independent directional confirmation by other methods in the literature.
Article
Remote photoplethysmography (rPPG) enables contactless monitoring of the blood volume pulse using a regular camera. Recent research focused on improved motion robustness, but the proposed blind source separation techniques (BSS) in RGB color space show limited success. We present an analysis of the motion problem, from which far superior chrominance-based methods emerge. For a population of 117 stationary subjects, we show our methods to perform in 92% good agreement (1:96) with contact PPG, with RMSE and standard deviation both a factor of two better than BSS-based methods. In a fitness setting using a simple spectral peak detector, the obtained pulse-rate for modest motion (bike) improves from 79% to 98% correct, and for vigorous motion (stepping) from less than 11% to more than 48% correct. We expect the greatly improved robustness to considerably widen the application scope of the technology.