Poster: WiFi-based Device-Free Human Activity Recognition
via Automatic Representation Learning
Han Zou
University of California, Berkeley
Berkeley, CA
hanzou@berkeley.edu
Yuxun Zhou
University of California, Berkeley
Berkeley, CA
yxzhou@berkeley.edu
Jianfei Yang
Nanyang Technological University
Singapore
yang0478@ntu.edu.sg
Weixi Gu
Tsinghua University
China
guweixigavin@gmail.com
Lihua Xie
Nanyang Technological University
Singapore
elhxie@ntu.edu.sg
Costas Spanos
University of California, Berkeley
Berkeley, CA
spanos@berkeley.edu
ABSTRACT
Existing human activity recognition approaches require either the deployment of extra infrastructure or the cooperation of occupants to carry dedicated devices, which are expensive, intrusive and inconvenient for pervasive implementation. In this paper, we propose SmartSense, a device-free human activity recognition system based on a novel machine learning algorithm with existing commercial off-the-shelf (COTS) WiFi routers. By exploiting the prevalence of existing WiFi infrastructure in buildings, we developed a novel OpenWrt-based firmware for COTS WiFi routers to collect the CSI measurements from regular data frames. To identify different human activities, an automatic kernel representation learning method, namely auto-HSRL, is established to select informative Hilbert space patterns from the time, frequency, wavelet, and shape domains. A new information fusion tool based on multi-view kernel learning is proposed to combine the representations extracted from diverse perspectives and build a robust and comprehensive activity classifier. Extensive experiments were conducted in an office, and the experimental results demonstrate that SmartSense outperforms existing methods and achieves a 98% activity recognition accuracy.
CCS CONCEPTS
• Human-centered computing → Ubiquitous and mobile computing;
KEYWORDS
WiFi, human activity recognition, representation learning
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s).
MobiCom ’17, October 16–20, 2017, Snowbird, UT, USA
© 2017 Copyright held by the owner/author(s).
ACM ISBN 978-1-4503-4916-1/17/10.
https://doi.org/10.1145/3117811.3131266
1 INTRODUCTION
Human activity recognition plays an indispensable role in a myriad of emerging applications in smart buildings. Various human activity recognition systems, based on cameras, Passive Infra-Red (PIR) sensors, and sensors embedded in mobile and wearable devices, have been proposed in recent years [1]. However, certain limitations hinder their practical and pervasive implementation: they either require occupants to carry or wear the devices, or need extra infrastructure for activity monitoring. With the wide availability of WiFi infrastructure, leveraging WiFi signals to estimate the location of occupants and distinguish human activities becomes feasible [4, 5]. Channel State Information (CSI), a fine-grained channel measurement from the physical layer that describes how WiFi signals propagate from the transmitter (TX) to the receiver (RX) through multiple paths at the granularity of OFDM subcarriers, has become available from commodity WiFi NICs [2]. CSI is able to reveal human activity in a non-intrusive manner because body movements during different activities change the signal propagation paths and lead to high variation in the CSI. It is therefore feasible to identify activities with high recognition accuracy by analyzing the CSI measurements at the RX. Although some CSI-based activity recognition systems have been proposed in recent years [1], they only explore features in the time and frequency domains and require expert feature engineering, and these features may not be transferable due to environmental and temporal dynamics. Therefore, it is still challenging to realize human activity recognition in an accurate, automatic and non-invasive manner.
In this paper, we propose SmartSense, a device-free human activity recognition scheme that is able to accurately distinguish common activities using only COTS WiFi routers. We developed a novel OpenWrt-based firmware for WiFi routers so that the CSI measurements from regular data frames can be obtained directly from them. For the purpose of human activity recognition, a novel machine learning algorithm, namely automatic kernel representation learning (auto-HSRL), is established. The algorithm starts with a library-based generation of Hilbert space patterns from the time, frequency, wavelet, and shape domains. Then an optimal kernel string representation is learned with an efficient greedy method. Finally, the representations obtained from the diverse domains are combined via a new information fusion tool in the form of multi-view kernel learning. Extensive experiments were conducted in a typical indoor environment and demonstrated that SmartSense can distinguish a number of daily human activities with 97.9% recognition accuracy using only two COTS WiFi routers.
Figure 1: System architecture of auto-HSRL.
2 SYSTEM DESIGN
2.1 CSI enabled WiFi Router Platform
Conventional CSI-based sensing systems adopt either the Intel 5300 NIC tool or the Atheros 9390 tool to extract CSI data from a modified WiFi NIC installed in a laptop or PC. The need for laptops as receivers severely hinders large-scale implementation. To overcome this bottleneck, we upgrade the Atheros CSI Tool [2] and develop a new OpenWrt-based firmware for COTS WiFi routers so that the CSI measurements from regular data frames transmitted in the existing traffic can be obtained directly from routers instead of from a laptop or PC with an external WiFi NIC adapter. Moreover, the traditional Intel 5300 NIC tool only provides CSI for 30 out of the 56 subcarriers. Our platform reports CSI data on all 114 subcarriers for 40 MHz bandwidth on each TX-RX pair, which provides much more information than conventional CSI tools. SmartSense only requires two routers, one serving as TX and the other as RX, to perform device-free activity recognition. TX continuously transmits data packets, while RX monitors the channels and captures and analyzes these packets. Suppose $N_{TX}$ and $N_{RX}$ represent the number of transmitting and receiving antennas, respectively. At each time instant, $N_{TX} \times N_{RX} \times 114$ CSI streams are available to analyze the variations of WiFi communication links caused by human presence and movement.
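As a rough illustration of the stream count above, the following sketch computes the number of CSI streams per packet and the resulting number of complex CSI values per second. The function names are hypothetical, the example values (1 TX antenna, 3 RX antennas, 700 packets/s) are the ones reported in the evaluation setup, and the sketch assumes that every transmitted packet yields a CSI report, which may not hold in practice.

```python
# Subcarriers reported per TX-RX antenna pair at 40 MHz (value taken from the text).
SUBCARRIERS_40MHZ = 114


def csi_streams(n_tx: int, n_rx: int, n_subcarriers: int = SUBCARRIERS_40MHZ) -> int:
    """Number of CSI streams per packet: N_TX x N_RX x subcarriers."""
    if n_tx < 1 or n_rx < 1 or n_subcarriers < 1:
        raise ValueError("antenna and subcarrier counts must be positive")
    return n_tx * n_rx * n_subcarriers


def csi_values_per_second(n_tx: int, n_rx: int, packets_per_second: int,
                          n_subcarriers: int = SUBCARRIERS_40MHZ) -> int:
    """Complex CSI values per second, assuming one CSI report per packet."""
    if packets_per_second < 0:
        raise ValueError("packet rate must be non-negative")
    return csi_streams(n_tx, n_rx, n_subcarriers) * packets_per_second


if __name__ == "__main__":
    print(csi_streams(1, 3))                 # 342 streams per packet
    print(csi_values_per_second(1, 3, 700))  # 239400 values per second
```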
2.2 Automatic Hilbert Space Representation Learning
In order to recognize various human activities in an accurate, automatic and robust manner, we propose auto-HSRL, a novel automatic representation learning and multi-view learning scheme. Figure 1 illustrates its system architecture, which consists of three stages. Firstly, we calculate various features based on the CSI measurements from the time (mean, variance, skewness, kurtosis), frequency (FFT), wavelet (DWT), and shape (DTW) domains. Previous WiFi-based activity recognition approaches often resort to manual inspection, or simply combine all information together as the input of an ML algorithm, which leads to degraded recognition performance due to the suboptimal usage of extracted patterns. We propose auto-HSRL, a simple yet powerful tool to learn useful representations from data. Consider a possibly nonlinear mapping $\varphi : \mathcal{X} \to \mathcal{H}$ from the raw data space $x \in \mathcal{X}$ to a feature space $\mathcal{H}$. Mercer's theorem allows the expression of the inner product between the features via a positive definite (PD or Mercer's) kernel function, i.e., $k(x, x') = \langle \varphi(x), \varphi(x') \rangle_{\mathcal{H}}$. The kernel function $k(\cdot,\cdot)$ is uniquely determined by the respective reproducing kernel Hilbert space $\mathcal{H}$. Multiple Mercer's kernels are calculated and leveraged to offer more modeling capacity and improved performance due to increased modeling flexibility. Then, auto-HSRL learns a multiple kernel information representation for each domain. The combination structure is called a “string kernel”, which contains the multiplication and addition operations of base kernels. The Hilbert-Schmidt Independence Criterion (HSIC) is leveraged to learn the optimal string kernel. HSIC can be computed with statistical estimation techniques, such as
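$$\mathrm{HSIC}(K, L) = \frac{1}{m(m-1)}\left[\operatorname{tr}(KL) + \frac{\mathbf{1}^{T}K\mathbf{1}\,\mathbf{1}^{T}L\mathbf{1}}{(m-1)(m-2)} - \frac{2}{m-2}\,\mathbf{1}^{T}KL\mathbf{1}\right],$$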
where $K$ is the kernel matrix calculated by $K_{i,j} = k(x_i, x_j)$, and $L$ is the kernel matrix of the target output space. In the current case dealing with activity recognition, the output kernel is computed with $L_{i,j} = m_{y_i}^{-1}\,\delta(y_i, y_j)$. We propose a greedy algorithm to learn the optimal string kernel based on HSIC. After that, we propose a multi-view learning based information fusion through multiple kernel learning (MKL). The key idea is to perform an overall regularized empirical risk minimization over the combined Hilbert space identified from the previous step, in order to find the classifier that takes all information sources into consideration. We substantiate information fusion by solving the following MKL problem:
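$$
\begin{aligned}
\min_{\{f_m\},\, b,\, \xi,\, d} \quad & \frac{1}{2}\sum_{m}\frac{1}{d_m}\lVert f_m\rVert_{\mathcal{H}_m}^{2} + C\sum_{i}\xi_i \\
\text{s.t.} \quad & y_i\sum_{m} f_m(x_i) + y_i b \ge 1 - \xi_i \quad \forall i, \\
& \sum_{m} d_m = 1, \quad d_m \ge 0 \;\; \forall m, \quad \xi_i \ge 0 \;\; \forall i.
\end{aligned}
\tag{Primal}
$$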
The above Primal problem can be rewritten into a dual form
that directly uses kernel representations learned from the last
step. The Primal problem is equivalent to
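$$\min_{d}\; J(d) \quad \text{s.t.} \quad \sum_{m} d_m = 1, \;\; d_m \ge 0,$$
where
$$J(d) = \max_{\alpha}\; -\frac{1}{2}\sum_{i,j}\alpha_i\alpha_j y_i y_j \sum_{m} d_m (K_m)_{i,j} + \sum_{i}\alpha_i \quad \text{s.t.} \quad \sum_{i}\alpha_i y_i = 0, \;\; 0 \le \alpha_i \le C \;\; \forall i.$$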
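The dual depends on the base kernels only through the weighted sum $\sum_m d_m K_m$, with the weights $d$ constrained to the probability simplex. A minimal sketch of forming that combined Gram matrix is given below, assuming each base kernel is stored as a square list of lists. `combine_kernels` is a hypothetical helper, and the toy matrices and weights are illustrative values rather than results from the paper.

```python
from typing import List, Sequence

Matrix = List[List[float]]


def combine_kernels(kernels: Sequence[Matrix], d: Sequence[float],
                    tol: float = 1e-9) -> Matrix:
    """Return sum_m d[m] * kernels[m] after checking the simplex constraints."""
    if not kernels or len(kernels) != len(d):
        raise ValueError("need one weight per base kernel")
    if any(w < 0.0 for w in d) or abs(sum(d) - 1.0) > tol:
        raise ValueError("weights must be non-negative and sum to 1")
    n = len(kernels[0])
    for K in kernels:
        if len(K) != n or any(len(row) != n for row in K):
            raise ValueError("all base kernels must be n x n")
    return [[sum(w * K[i][j] for w, K in zip(d, kernels)) for j in range(n)]
            for i in range(n)]


# Toy example with two 2 x 2 base Gram matrices (illustrative values only).
K_a = [[1.0, 0.5], [0.5, 1.0]]
K_b = [[1.0, 0.0], [0.0, 1.0]]
K_combined = combine_kernels([K_a, K_b], [0.25, 0.75])
print(K_combined)
```

Because a non-negative weighted sum of positive semidefinite matrices is itself positive semidefinite, the combined matrix can be passed to any kernel classifier that accepts a precomputed Gram matrix.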
$J(d)$ is a convex function of $d$, so the problem can be solved with gradient-based methods [3]. The kernel representations $K_T$, $K_F$, $K_W$, $K_S$ obtained from auto-HSRL can be combined within the above MKL learning framework.
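To make the kernel-selection criterion from earlier in this section more concrete, the sketch below evaluates the HSIC expression as it is printed there, together with the class-normalized output kernel. It is an illustration under stated assumptions, not the authors' implementation: $m$ is taken to be the number of samples, $m_{y_i}$ is interpreted as the number of samples sharing label $y_i$, the RBF base kernel and the toy one-dimensional data are chosen only for the example, and all function names are hypothetical.

```python
import math
from typing import List, Sequence

Matrix = List[List[float]]


def rbf_kernel(x: Sequence[float], sigma: float = 1.0) -> Matrix:
    """Gaussian (RBF) Gram matrix for scalar samples (illustrative base kernel)."""
    return [[math.exp(-((a - b) ** 2) / (2.0 * sigma ** 2)) for b in x] for a in x]


def label_kernel(y: Sequence[int]) -> Matrix:
    """Output kernel L[i][j] = delta(y_i, y_j) / m_{y_i} (assumed reading of m_{y_i})."""
    counts = {}
    for label in y:
        counts[label] = counts.get(label, 0) + 1
    return [[1.0 / counts[yi] if yi == yj else 0.0 for yj in y] for yi in y]


def hsic(K: Matrix, L: Matrix) -> float:
    """HSIC(K, L) following the expression printed in Section 2.2."""
    m = len(K)
    if m < 3 or len(L) != m:
        raise ValueError("K and L must be m x m with m >= 3")
    trace_kl = sum(K[i][j] * L[j][i] for i in range(m) for j in range(m))
    sum_k = sum(sum(row) for row in K)            # 1^T K 1
    sum_l = sum(sum(row) for row in L)            # 1^T L 1
    col_k = [sum(K[i][j] for i in range(m)) for j in range(m)]
    row_l = [sum(L[j]) for j in range(m)]
    cross = sum(c * r for c, r in zip(col_k, row_l))  # 1^T K L 1
    bracket = (trace_kl
               + sum_k * sum_l / ((m - 1) * (m - 2))
               - 2.0 * cross / (m - 2))
    return bracket / (m * (m - 1))


# Two well-separated clusters whose labels match the cluster membership.
x = [0.0, 0.1, 0.2, 5.0, 5.1, 5.2]
y = [0, 0, 0, 1, 1, 1]
L_out = label_kernel(y)
score_rbf = hsic(rbf_kernel(x), L_out)
score_const = hsic([[1.0] * len(x) for _ in x], L_out)
print(score_rbf > score_const)  # the label-aligned kernel scores higher
```

In this toy setting the RBF kernel, whose block structure matches the labels, receives a higher score than a constant kernel that carries no label information, which is the property a greedy search over candidate string kernels would rely on.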
(a) Confusion matrix of human activity recognition accuracy (rows: ground truth; columns: estimation).
Ground truth    sit      stand    walk     run
sit             96.9%    1.3%     0.9%     0.9%
stand           0.0%     98.1%    0.0%     1.9%
walk            0.0%     0.0%     99.0%    1.0%
run             0.0%     0.0%     2.0%     98.0%
(b) Occupant behavior profile in the office on one weekday (x-axis: Time [min]; y-axis: Empty, Sitting, Standing, Walking, Running).
(c) Occupant behavior profile in the office on one weekday.
Figure 2: Performance of SmartSense.
3 PERFORMANCE EVALUATION
We implemented SmartSense on two TP-Link N750 WiFi routers: one serves as TX and the other acts as RX. We upgraded their firmware to our CSI OpenWrt version so that the CSI measurements from regular data frames are reported directly from them. TX was operated on the 5 GHz frequency band and we leveraged the 40 MHz channel bandwidth. We used 1 TX antenna to send data packets to 3 RX antennas at a transmission rate of 700 packets/s. Four common activities, i.e., sitting, standing, walking and running, were performed by 10 volunteers to validate the activity recognition performance of SmartSense in an office (50 m$^2$). Training and testing data were collected on different days to evaluate the performance of SmartSense under both temporal and environmental dynamics. Figure 2(a) depicts the confusion matrix of activity recognition accuracy using SmartSense. It achieves an average cross-validation activity recognition accuracy of 97.9% across all four activities. As shown in Figure 2(a), its recognition accuracy for sitting, standing, walking and running is 96.9%, 98.1%, 99.0% and 98.0%, respectively. SmartSense considers the features from four domains and combines them with an optimal representation, which can reveal the nature of each activity more clearly than existing approaches.
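The per-class figures quoted above can be read directly off the confusion matrix. The sketch below does this with the values transcribed from Figure 2(a), under the assumption that rows are ground truth and columns are estimates (each row then sums to 100%); the helper names are hypothetical.

```python
ACTIVITIES = ["sit", "stand", "walk", "run"]

# Row = ground truth, column = estimate, values in percent (from Figure 2(a)).
CONFUSION = [
    [96.9, 1.3, 0.9, 0.9],
    [0.0, 98.1, 0.0, 1.9],
    [0.0, 0.0, 99.0, 1.0],
    [0.0, 0.0, 2.0, 98.0],
]


def per_class_accuracy(conf):
    """Diagonal of a row-normalized confusion matrix (per-class recall)."""
    return [conf[i][i] for i in range(len(conf))]


def most_confused(conf, labels):
    """Largest off-diagonal entry as (true label, estimated label, percent)."""
    best = None
    for i, row in enumerate(conf):
        for j, value in enumerate(row):
            if i != j and (best is None or value > best[2]):
                best = (labels[i], labels[j], value)
    return best


# Sanity check on the transcription: every ground-truth row sums to 100%.
assert all(abs(sum(row) - 100.0) < 1e-6 for row in CONFUSION)

accuracies = per_class_accuracy(CONFUSION)
unweighted_mean = sum(accuracies) / len(accuracies)
print(dict(zip(ACTIVITIES, accuracies)))
print(round(unweighted_mean, 1))
print(most_confused(CONFUSION, ACTIVITIES))
```

The unweighted mean of the four diagonal entries is 98.0%, close to the reported 97.9% average; the small gap may come from rounding or from unequal per-class sample counts, which the text does not specify. The largest single confusion in the matrix is running estimated as walking (2.0%).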
To validate its performance for practical implementation, we leveraged SmartSense to monitor one occupant’s activities and analyzed his activity patterns in an office during one weekday. As presented in Figure 2(b), we can infer from the measurements of SmartSense when the occupant arrived at the office, the duration of his lunch break, and when he left work. With this fine-grained occupant activity information, building management systems can adjust the lighting and ventilation accordingly to reduce the energy consumption of commercial buildings. As shown in Figure 2(c), the occupant spent more than 53% of the time sitting in the office. Thus, we can recommend that he exercise more in order to have a healthier lifestyle and also improve his productivity.
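A behavior profile of this kind can be summarized from a sequence of per-interval activity labels. The sketch below is a minimal illustration, assuming one predicted label per fixed-length interval; the timeline is synthetic data made up for the example (not the measurements behind Figure 2), the label names follow the Figure 2(b) axis, and the helper names are hypothetical.

```python
from collections import Counter
from typing import Dict, Optional, Sequence, Tuple


def activity_shares(labels: Sequence[str]) -> Dict[str, float]:
    """Percentage of intervals spent in each activity."""
    if not labels:
        raise ValueError("empty timeline")
    counts = Counter(labels)
    return {activity: 100.0 * n / len(labels) for activity, n in counts.items()}


def occupied_interval(labels: Sequence[str],
                      empty: str = "Empty") -> Optional[Tuple[int, int]]:
    """Indices of the first and last interval in which the room is occupied."""
    occupied = [i for i, label in enumerate(labels) if label != empty]
    return (occupied[0], occupied[-1]) if occupied else None


# Synthetic 10-interval timeline (illustrative only).
timeline = (["Empty"] * 2 + ["Sitting"] * 5 + ["Walking"]
            + ["Standing"] + ["Empty"])

shares = activity_shares(timeline)
print(shares["Sitting"])            # share of time spent sitting, in percent
print(occupied_interval(timeline))  # (arrival index, departure index)
```

The first and last occupied indices correspond to the arrival and departure times discussed above once each index is mapped back to its timestamp.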
4 CONCLUSION
In this paper, we proposed SmartSense, a device-free human activity recognition scheme using only COTS WiFi routers. We designed a CSI-enabled WiFi router platform, calculated various features from the time, frequency, wavelet, and shape domains based on the de-noised CSI data, and developed a multi-view kernel learning model to select the most representative subset of features and build a robust activity classifier. Extensive experiments were conducted and demonstrate that SmartSense can distinguish a number of daily human activities with 97.9% recognition accuracy by leveraging only two WiFi routers. It has great potential to serve as a fundamental service to facilitate a broad range of emerging applications in smart buildings.
ACKNOWLEDGEMENT
This research is funded by the Republic of Singapore National
Research Foundation (NRF) through a grant to the Berkeley
Education Alliance for Research in Singapore (BEARS) for
the Singapore-Berkeley Building Efficiency and Sustainability
in the Tropics (SinBerBEST) Program. BEARS has been
established by the UC Berkeley as a center for intellectual
excellence in research and education in Singapore.
REFERENCES
[1] Wei Wang, Alex X Liu, Muhammad Shahzad, Kang Ling, and Sanglu Lu. 2015. Understanding and modeling of wifi signal based human activity recognition. In MobiCom 2015. ACM, 65–76.
[2] Yaxiong Xie, Zhenjiang Li, and Mo Li. 2015. Precise power delay profiling with commodity wifi. In MobiCom 2015. ACM, 53–64.
[3] Yuxun Zhou, Ninghang Hu, Costas J Spanos, et al. 2016. Veto-Consensus Multiple Kernel Learning. In AAAI. 2407–2414.
[4] Han Zou, Baoqi Huang, Xiaoxuan Lu, Hao Jiang, and Lihua Xie. 2016. A robust indoor positioning system based on the procrustes analysis and weighted extreme learning machine. IEEE Transactions on Wireless Communications 15, 2 (2016), 1252–1266.
[5] Han Zou, Yuxun Zhou, Hao Jiang, Baoqi Huang, Lihua Xie, and Costas Spanos. 2016. A transfer kernel learning based strategy for adaptive localization in dynamic indoor environments: poster. In Proceedings of the 22nd Annual International Conference on Mobile Computing and Networking. ACM, 462–464.
Poster
MobiCom’17, October 16-20, 2017, Snowbird, UT, USA
608
... The mm-Vital system [19] utilizes the RSS of 60 GHz millimeter wave signals to estimate breathing rates with a larger bandwidth around 7 GHz, but RSS measurements provide coarse-grained channel information, which can only help to derive the breathing rate, and is easy to be affected by ambient interferences. Recently, channel state information (CSI) has been widely adopted to facili-tate many wireless sensing capabilities, such as suspicious object detection, indoor localization [20], fall detection [21], human activity recogniztion [22], and RF imaging [23]. It can provide fine-grained channel information with multiple OFDM subcarriers of 802.11n ...
... WiFi systems, and is readily available on many commercial WiFi network interface cards (NIC), e.g., Intel WiFi Link 5300 NIC [24] and the Atheros AR9380/9390, AR9580/9590 chipset [25]. Intel 5300 NIC tool only provides CSI for 30 out of the 56 subcarriers, while OpenWrt can capture the CSI measurements from regular data frames [22] which can report CSI data on all the 114 subcarriers. Compared with RSS in MAC layer, CSI depicts fine-grained physical layer information and is more sensitive to environments changes, which makes it more suitable to detect human breathing. ...
Article
Full-text available
This paper opens up a new pathway of the utility of breathing pattern for user verification. We demonstrate that it is possible to capture people's breathing pattern leveraging commodity WiFi devices. While prior solutions for biometrics-based user recognition usually require dedicated devices (e.g., video cameras or IR sensors), this paper introduces the first general, low-cost breathing-based user verification system using commodity WiFi devices. The proposed system is based on the fact that the breathing pattern always keeps consistent for the same user but distinct among different people. Our innovative method successfully extracts the breathing pattern of different people based on channel state information of WiFi signal to facilitate user verification. The prototype study using two commodity WiFi devices can differentiate people with an average verification accuracy over 90%, suggesting that our breathing-based user verification system using commerical off-the-shelf (COTS) WiFi is promising to be one of the most critical methods in biometrics.
... In the real world, the sensing environment is usually dynamic due to changes of environment settings, such as the layout of furniture and equipment, positions of surrounding people, etc. Having the training data collected, a model will be trained to learn the mapping relationship between training data and corresponding human activities [16]- [21]. However, training data are usually collected from limited environment settings. ...
Article
Full-text available
Channel State Information (CSI) based human activity recognition has received great attention in recent years due to its advantages in privacy protection, insensitivity to illumination, and no requirement for wearable devices. In this paper, we propose a Multimodal Channel State Information Based Activity Recognition (MCBAR) system that leverages existing WiFi infrastructures and monitors human activities from CSI measurements. MCBAR aims to address the performances degradation of WiFi-based human recognition systems due to environmental dynamics. Specifically, we address the issue of non-uniformly distributed unlabelled data with rarely-performed activities by taking advantages of the generative adversarial network (GAN) and semi-supervised learning. We apply a multimodal generator to approximate the CSI data distribution in different environment settings with limited measured CSI data. The generated CSI data using the multimodal generator can provide better diversity for knowledge transfer. This multimodal generator improves the ability of MCBAR to recognize specific activities with various CSI patterns caused by environmental dynamics. Compared to state-of-the-art CSI-based recognition systems, MCBAR is more robust as it is able to handle the non-uniformly distributed CSI data collected from a new environment setting. In addition, diverse generated data from the multimodal generator improves the stability of the system. We have tested MCBAR under multiple experimental settings at different places. The experimental results demonstrate that our algorithm overcomes environmental dynamics and outperforms existing human activity recognition systems.
... It was collected by IoTbased WiFi sensors and the Channel State Information (CSI) data [32] was extracted that represents the states of the WiFi propagation [33]. When persons perform actions, such CSI data will perform different patterns [26] that can be modeled by traditional models [34] or deep neural network [35] for activity recognition [36], [37], [38], [25], gesture recognition [13], and crowd counting [39]. However, the environment changes leads to the domain difference, and the model trained in the original domain cannot be directly used in the new domain. ...
Article
Full-text available
Deep neural networks (DNNs) have made significant advances in computer vision and sensor-based smart sensing. DNNs achieve prominent results based on standard datasets and powerful servers, whereas in real applications with domain-shift data and resource-constrained environments such as Internet of Things (IoT) devices in the edge computing, DNNs are likely to have degraded performance in terms of accuracy and efficiency. To this end, we develop the MobileDA framework that learns transferable features while keeping the simple structure of the deep model. Our method allows a novel teacher network trained in the server to distill the knowledge for a student network running in the edge device, which is achieved by cross-domain distillation. Leveraging unlabeled data in the new environment, our student model amends the feature learning to be domain-invariant, then being our objective model running in the edge device. Our approach is evaluated on a challenging IoT-based WiFi gesture recognition scenario, and three classic visual adaptation benchmarks. The empirical studies corroborate the effectiveness of distillation for domain transfer, and the overall results show that our model achieves state-of-the-art performance merely using a simple network.
... We can measure RSS using most wireless devices easily because the RSS collection is supposed by almost all wireless chips. As for CSI, we have to modify NIC driver to measure the CSI using COTS devices, including Intel 5300 NIC [15], Atheros AR9382 [16], Atheros AR9462 and AR9480 [17], Atheros 9580 [18], [19], and Atheros 9390 [20], [21]. Alternatively, we can measure CSI using a special device such as SDR [22] or WARP [23], [24]. ...
Article
Full-text available
Recently, device-free human behavior recognition has become a hot research topic and has achieved significant progress in the field of ubiquitous computing. Among various implementation, behavior recognition based on WiFi CSI (channel state information) has drawn wide attention due to its major advantages. This paper investigates more than 100 latest CSI based behavior recognition applications within the last 6 years and presents a comprehensive survey from every aspect of human behavior recognition. Firstly, this paper reviews general behavior recognition applications using the WiFi signal and presents the basic concept of CSI and the fundamental principle of CSI-based behavior recognition. This paper analyzes the key components and core characteristics of the system architecture of human behavior recognition using CSI. Afterward, we divide the sensing procedures into many steps and summarize the typical studies from these steps, including base signal selection, signal preprocessing, and identification approaches. Next, based on the recognition technique, we classify the applications into three groups, including pattern-based, model-based, and deep learning-based approach. In every group, we categorize the state-of-the-art applications into three groups, including coarse-grained specific behavior recognition, fine-grained specific behavior recognition, and activity inference. It elaborates the typical behavior recognition applications from five aspects, including experimental equipment, experimental scenario, behavior, classifier, and system performance. Then, this paper presents comprehensive discussions about representative applications from the implementation view and outlines the major consideration when developing a recognition system. Finally, this article concludes by analyzing the open issues of CSI-based behavior recognition applications and pointing out future research directions.
Article
Wi-Fi sensing technology has shown superiority in smart homes among various sensors for its cost-effective and privacy-preserving merits. It is empowered by channel state information (CSI) extracted from Wi-Fi signals and advanced machine learning models to analyze motion patterns in CSI. Many learning-based models have been proposed for kinds of applications, but they severely suffer from environmental dependency. Though domain adaptation methods have been proposed to tackle this issue, it is not practical to collect high-quality, well-segmented, and balanced CSI samples in a new environment for adaptation algorithms, but randomly captured CSI samples can be easily collected. In this article, we first explore how to learn a robust model from these low-quality CSI samples, and propose AutoFi, an annotation-efficient Wi-Fi sensing model based on a novel geometric self-supervised learning algorithm. The AutoFi fully utilizes unlabeled low-quality CSI samples that are captured randomly, and then transfers the knowledge to specific tasks defined by users, which is the first work to achieve cross-task transfer in Wi-Fi sensing. The AutoFi is implemented on a pair of Atheros Wi-Fi APs for evaluation. The AutoFi transfers knowledge from randomly collected CSI samples into human gait recognition and achieves state-of-the-art performance. Furthermore, we simulate cross-task transfer using public data sets to further demonstrate its capacity for cross-task learning. For the UT-HAR and Widar data sets, the AutoFi achieves satisfactory results on activity recognition and gesture recognition without any prior training. We believe that AutoFi takes a huge step toward automatic Wi-Fi sensing without any developer engagement. Our codes have been included in https://github.com/xyanchen/Wi-Fi-CSI-Sensing-Benchmark .
Article
Device-free activity recognition plays a crucial role in smart building, security, and human–computer interaction, which shows its strength in its convenience and cost-efficiency. Traditional machine learning has made significant progress by heuristic hand-crafted features and statistical models, but it suffers from the limitation of manual feature design. Deep learning overcomes such issues by automatic high-level feature extraction, but its performance degrades due to the requirement of massive annotated data and cross-site issues. To deal with these problems, transfer learning helps to transfer knowledge from existing datasets while dealing with the negative effect of background dynamics. This paper surveys the recent progress of deep learning and transfer learning for device-free activity recognition. We begin with the motivation of deep learning and transfer learning, and then introduce the major sensor modalities. Then the deep and transfer learning techniques for device-free human activity recognition are introduced. Eventually, insights on existing works and grand challenges are summarized and presented to promote future research.
Article
Nowadays, human behavior recognition research plays a pivotal role in the field of human-computer interaction. However, comprehensive approaches mainly rely on video camera, ambient sensors or wearable devices, which either require arduous deployment or arouse privacy concerns. In this paper, we propose WiAct, a passive WiFi-based human activity recognition system, which explores the correlations between body movement and the amplitude information in Channel State Information (CSI) to classify different activities. The system designs a novel Adaptive Activity Cutting Algorithm (AACA) based on the difference in signal variance between the action and non-action parts, which adjusts the threshold adaptively to achieve the best trade-off between performance and robustness. The Doppler shift correlation value is used as classification features, which is extracted by using the correlation of the WiFi device’s antennas. Extreme Learning Machine (ELM) is utilized for activity data classification because of its strong generalization ability and fast learning speed. We implement the WiAct prototype using commercial WiFi equipment and evaluate its performance in real-world environments. In the evaluation, WiAct achieves an average accuracy of 94.2% for distinguishing ten actions. We compare different experimental conditions and classification methods, and the results demonstrate its robustness.
Article
Full-text available
WiFi received signal strength indicator seem to be the basis of the most widely used method for indoor positioning systems driven by the growth of deployed WiFi access points, especially within urban areas. However, there are still several challenges to be tackled: its accuracy is often 2–3 m, it is prone to interference and attenuation effects, and the diversity of radio frequency receivers, for example, smartphones, affects its accuracy. Received signal strength indicator fingerprinting can be used to mitigate against interference and attenuation effects. In this article, we present a novel, more accurate, received signal strength indicator ranking–based method that consists of three parts. First, an access point selection based on a genetic algorithm is applied to reduce the positioning computational cost and increase the positioning accuracy. Second, Kendall tau correlation coefficient and a convolutional neural network are applied to extract the ranking features for estimating locations. Third, an extended Kalman filter is then used to smooth the estimated sequential locations before multi-dimensional dynamic time warping is used to match similar trajectories or paths representing activities of daily living from different or the same users that vary in time and space. In order to leverage and evaluate our indoor positioning system, we also used it to recognise activities of daily living in an office-like environment. It was able to achieve an average positioning accuracy of 1.42 m and a 79.5% recognition accuracy for nine location-driven activities.
Conference Paper
Full-text available
Existing WiFi fingerprinting-based Indoor Positioning System (IPS) suffers from the vulnerability of environmental dynamics. To address this issue, we propose TKL-WinSMS as a systematic strategy, which is able to realize robust and adaptive localization in dynamic indoor environments. We developed a WiFi-based Non-intrusive Sensing and Monitoring System (WinSMS) that enables COTS WiFi routers as online reference points by extracting real-time RSS readings among them. With these online data and labeled source data from the offline calibrated radio map, we further combine the RSS readings from target mobile devices as unlabeled target data, to design a robust localization model using an emerging transfer learning algorithm, namely transfer kernel learning (TKL). It can learn a domain-invariant kernel by directly matching the source and target distributions in the reproducing kernel Hilbert space instead of the raw noisy signal space. By leveraging the resultant kernel as input for the SVR training, the trained localization model can inherit the information from online phase to adaptively enhance the offline calibrated radio map. Extensive experimental results verify the superiority of TKL-WinSMS in terms of localization accuracy compared with existing solutions in dynamic indoor environments.
Conference Paper
We propose Veto-Consensus Multiple Kernel Learning (VCMKL), a novel way of combining multiple kernels such that one class of samples is described by the logical intersection (consensus) of base kernelized decision rules, whereas the other classes are described by the union (veto) of their complements. The proposed configuration is a natural fit for domain description and learning with hidden subgroups. We first provide a generalization risk bound in terms of the Rademacher complexity of the classifier, and then formulate a large margin multi-ν learning objective with a tunable training error bound. Seeing that the corresponding optimization is non-convex and existing methods severely suffer from local minima, we establish a new algorithm, namely the Parametric Dual Descent Procedure (PDDP), that can approach the global optimum with guarantees. The bases of PDDP are two theorems that reveal the global convexity and local explicitness of the parameterized dual optimum, for which a series of new techniques for parametric programs have been developed. The proposed method is evaluated on an extensive set of experiments, and the results show significant improvement over the state-of-the-art approaches.
Article
Indoor Positioning Systems (IPSs) have become one of the most attractive research fields due to the increasing demand for Location Based Services (LBSs) in indoor environments. Various IPSs have been developed under different circumstances, and most of them adopt the fingerprinting technique to mitigate pervasive indoor multipath effects. However, the performance of the fingerprinting technique severely suffers from device heterogeneity existing across commercial off-the-shelf mobile devices (e.g. smart phones, tablet computers, etc.) and indoor environmental changes (e.g. the number, distribution and activities of people, the placement of furniture, etc.). In this paper, we transform the Received Signal Strength (RSS) to a standardized location fingerprint based on the Procrustes analysis, and introduce a similarity metric, termed Signal Tendency Index (STI), for matching standardized fingerprints. An analysis of the capability of the proposed STI in handling device heterogeneity and environmental changes is presented. We further develop a robust and precise IPS by integrating the merits of both the STI and Weighted Extreme Learning Machine (WELM). Finally, extensive experiments are carried out and a performance comparison with existing solutions verifies the superiority of the proposed IPS in terms of robustness to device heterogeneity.
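The idea of a standardized fingerprint can be sketched as follows. The abstract does not give the exact STI formula, so this example only assumes a Procrustes-style standardization (remove the mean, scale to unit norm) followed by a Euclidean distance; the function names, the RSS values, and the linear device model (gain 0.9, offset -5 dB) are hypothetical and not taken from the paper.

```python
import math


def standardize(rss):
    """Remove translation and scale: zero mean, unit Euclidean norm."""
    mean = sum(rss) / len(rss)
    centered = [r - mean for r in rss]
    norm = math.sqrt(sum(c * c for c in centered))
    if norm == 0:
        raise ValueError("constant RSS vector has no shape to standardize")
    return [c / norm for c in centered]


def shape_distance(a, b):
    """Euclidean distance between standardized fingerprints (assumed metric)."""
    sa, sb = standardize(a), standardize(b)
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(sa, sb)))


# Hypothetical radio-map entry (dBm) for one location, five access points.
reference = [-48.0, -61.0, -70.0, -55.0, -83.0]
# Same location seen by a different device, modeled as gain plus offset.
other_device = [0.9 * r - 5.0 for r in reference]
# A different location with a different RSS pattern.
other_location = [-60.0, -50.0, -71.0, -58.0, -80.0]

d_device = shape_distance(reference, other_device)
d_location = shape_distance(reference, other_location)
print(d_device, d_location)
```

Under this assumed device model the standardized vectors coincide, so the distance between the two devices at the same location is numerically zero, while the different location remains clearly separated.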
Conference Paper
Power delay profiles characterize multipath channel features, which are widely used in motion- or localization-based applications. Recent studies show that the power delay profile may be derived from the CSI traces collected from commodity WiFi devices, but the performance is limited by two dominating factors. First, the resolution of the derived power delay profile is determined by the channel bandwidth, which is limited on commodity WiFi. Second, the collected CSI reflects the signal distortions due to both the channel attenuation and the hardware imperfection. A direct derivation of power delay profiles using raw CSI measures, as has been done in the literature, results in significant inaccuracy. In this paper, we present Splicer, a software-based system that derives high-resolution power delay profiles by splicing the CSI measurements from multiple WiFi frequency bands. We propose a set of key techniques to separate the mixed hardware errors from the collected CSI measurements. Splicer adapts its computations within stringent channel coherence time and thus can perform well in the presence of mobility. Our experiments with commodity WiFi NICs show that Splicer substantially improves the accuracy in profiling multipath characteristics, reducing the errors of multipath distance estimation to less than 2 m. Splicer can immediately benefit upper-layer applications. Our case study with recent single-AP localization achieves a median localization error of 0.95 m.
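The bandwidth limit on delay resolution can be made concrete with a small sketch. It synthesizes ideal CSI for a hypothetical two-path channel and recovers the power delay profile with an inverse DFT. This is the direct derivation the abstract refers to, not Splicer's splicing or error-removal method. The 64 subcarriers at 312.5 kHz spacing are a simplification of a 20 MHz channel, the path amplitudes and delays are made up, and hardware distortion is ignored.

```python
import cmath

SPEED_OF_LIGHT = 299_792_458.0  # m/s


def synth_csi(paths, n_sub, spacing_hz):
    """Ideal CSI per subcarrier for paths given as (amplitude, delay_s)."""
    return [
        sum(a * cmath.exp(-2j * cmath.pi * k * spacing_hz * tau) for a, tau in paths)
        for k in range(n_sub)
    ]


def power_delay_profile(csi):
    """Squared magnitude of the inverse DFT of the CSI vector."""
    n = len(csi)
    pdp = []
    for t in range(n):
        tap = sum(csi[k] * cmath.exp(2j * cmath.pi * k * t / n) for k in range(n)) / n
        pdp.append(abs(tap) ** 2)
    return pdp


n_sub, spacing_hz = 64, 312_500.0      # assumed layout, 20 MHz in total
bandwidth_hz = n_sub * spacing_hz
resolution_s = 1.0 / bandwidth_hz      # one delay bin = 1 / bandwidth
resolution_m = resolution_s * SPEED_OF_LIGHT

# Hypothetical channel: a direct path and a weaker reflection,
# placed exactly on delay bins 1 and 4 to keep the output clean.
paths = [(1.0, 1 * resolution_s), (0.5, 4 * resolution_s)]
pdp = power_delay_profile(synth_csi(paths, n_sub, spacing_hz))
strongest = sorted(range(n_sub), key=lambda t: pdp[t], reverse=True)[:2]
print(strongest, round(resolution_m, 2))
```

With 20 MHz of bandwidth each delay bin spans 50 ns, or about 15 m of path length, which is why combining measurements from several bands is attractive for resolving closely spaced paths.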
Conference Paper
Some pioneering WiFi-signal-based human activity recognition systems have been proposed. Their key limitation lies in the lack of a model that can quantitatively correlate CSI dynamics and human activities. In this paper, we propose CARM, a CSI-based human Activity Recognition and Monitoring system. CARM has two theoretical underpinnings: a CSI-speed model, which quantifies the correlation between CSI value dynamics and human movement speeds, and a CSI-activity model, which quantifies the correlation between the movement speeds of different human body parts and a specific human activity. With these two models, we quantitatively build the correlation between CSI value dynamics and a specific human activity. CARM uses this correlation as the profiling mechanism and recognizes a given activity by matching it to the best-fit profile. We implemented CARM using commercial WiFi devices and evaluated it in several different environments. Our results show that CARM achieves an average accuracy of greater than 96%.