A preview of this full-text is provided by Springer Nature.
Content available from Applied Intelligence
This content is subject to copyright. Terms and conditions apply.
S3D-CNN: skeleton-based 3D consecutive-low-pooling neural
network for fall detection
Xin Xiong
1
&Weidong Min
2,3
&Wei-Shi Zheng
4
&Pin Liao
1
&Hao Yang
1
&Shuai Wang
1
#Springer Science+Business Media, LLC, part of Springer Nature 2020
Abstract
Most existing deep-learning-based fall detection methods use either 2D neural network without considering movement repre-
sentation sequences, or whole sequences instead of only those in the fall period. These characteristics result in inaccurate
extraction of human action features and failure to detect falls due to background interferences or activity representation beyond
the fall period. To alleviate these problems, a skeleton-based 3D consecutive-low-pooling neural network (S3D-CNN) for fall
detection is proposed in this paper. In the S3D-CNN, an activity feature clustering selector is designed to extract the skeleton
representation in depth videos using pose estimation algorithm and form optimized skeleton sequence of fall period. A 3D
consecutive-low-pooling (3D-CLP) neural network is proposed to process these representation sequences by improving network
in terms of layer number, pooling kernel size, and single input frame number. The proposed method is evaluated on public and
self-collected datasets respectively, outperforming the existing methods.
Keywords Fall detection .Optimized skeleton representation .Depth video .Pose estimation .3D-CLP network
1 Introduction
Falls are a leading cause of injury and the most common
reason for non-fatal hospitalization among the elderly. The
World Health Organization reports that more than 28% of all
persons aged over 65 years fall every year; it also projects the
global incidence of falls among those aged 70 years and over
to rise to 32%–42% [1]. Falls are a main cause of death from
injury-related or unintentional injuries, second only to road
traffic injuries. Among 37.3 million patients referred to their
doctors for falls each year, 646,000 have died and over 80%
reside in low- to middle-income countries. As the population
ages, these figures are expected to worsen; falls are the leading
cause of accidental death among persons aged 79 years and
older [2]. According to the National Institutes of Health, about
1.6 million elderly Americans are injured by falling each year
[3]. Over half of the seniors that lay on the floor for more than
1 h after a fall has been reported to have died within 6 months
[4]. As China has a large elderly population, the fall problem
is of great importance. If falls can be detected in a timely and
automatic manner, then rapid delivery of medical services to
the injured may be achieved. The existing common action
recognition methods do not detect fall well due to the lack of
fall datasets and the poor fall feature extraction by the complex
network which is easy to over-fitting in training. As such,
development of an intelligent system for automatic fall detec-
tion is a crucial undertaking.
In general, fall behaviors can be identified by using several
approaches, such as wearable sensors, detection via traditional
geometric and movement features, and deep-learning
methods. Most existing deep-learning-based fall detection
methods use either 2D neural network without considering
movement representation sequences or whole sequences in-
stead of only those in the fall period. These characteristics
result in inaccurate extraction of human action features and
failure to detect falls due to background interferences or ac-
tivity representation beyond the fall period. Modeling based
on depth videos and apply the 3D feature extraction of a 3D-
CLP neural network to eliminate major interferences is a rea-
sonable approach to focus on fall features. Unfortunately, re-
search on this topic is limited.
*Weidong Min
minweidong@ncu.edu.cn
1
School of Information Engineering, Nanchang University,
Nanchang 330031, China
2
School of Software, Nanchang University, Nanchang 330047, China
3
Jiangxi Key Laboratory of Smart City, Nanchang 330047, China
4
School of Data and Computer Science, Sun Yat-sen University,
Guangzhou 510006, China
https://doi.org/10.1007/s10489-020-01751-y
Published online: 13 June 2020
Applied Intelligence (2020) 50:3521–3534
Content courtesy of Springer Nature, terms of use apply. Rights reserved.