ArticlePDF Available

Variable Accuracy of Wearable Heart Rate Monitors during Aerobic Exercise

Authors:

Abstract and Figures

Purpose: Athletes and members of the public increasingly rely on wearable HR monitors to guide physical activity and training. The accuracy of newer, optically based monitors is unconfirmed. We sought to assess the accuracy of five optically based HR monitors during various types of aerobic exercise. Methods: Fifty healthy adult volunteers (mean ± SD age = 38 ± 12 yr, 54% female) completed exercise protocols on a treadmill, a stationary bicycle, and an elliptical trainer (±arm movement). Each participant underwent HR monitoring with an electrocardiogaphic chest strap monitor (Polar H7), forearm monitor (Scosche Rhythm+), and two randomly assigned wrist-worn HR monitors (Apple Watch, Fitbit Blaze, Garmin Forerunner 235, and TomTom Spark Cardio), one on each wrist. For each exercise type, HR was recorded at rest, light, moderate, and vigorous intensity. Agreement between HR measurements was assessed using Lin's concordance correlation coefficient (rc). Results: Across all exercise conditions, the chest strap monitor (Polar H7) had the best agreement with ECG (rc = 0.996) followed by the Apple Watch (rc = 0.92), the TomTom Spark (rc = 0.83), and the Garmin Forerunner (rc = 0.81). Scosche Rhythm+ and Fitbit Blaze were less accurate (rc = 0.75 and rc = 0.67, respectively). On treadmill, all devices performed well (rc = 0.88-0.93) except the Fitbit Blaze (rc = 0.76). While bicycling, only the Garmin, Apple Watch, and Scosche Rhythm+ had acceptable agreement (rc > 0.80). On the elliptical trainer without arm levers, only the Apple Watch was accurate (rc = 0.94). None of the devices was accurate during elliptical trainer use with arm levers (all rc < 0.80). Conclusion: The accuracy of wearable, optically based HR monitors varies with exercise type and is greatest on the treadmill and lowest on elliptical trainer. Electrode-containing chest monitors should be used when accurate HR measurement is imperative.
Content may be subject to copyright.
Variable Accuracy of Wearable Heart Rate
Monitors during Aerobic Exercise
STEPHEN GILLINOV, MUHAMMAD ETIWY, ROBERT WANG, GORDON BLACKBURN, DERMOT PHELAN,
A. MARC GILLINOV, PENNY HOUGHTALING, HODA JAVADIKASGARI, and MILIND Y. DESAI
The Heart and Vascular Institute, Cleveland Clinic, Cleveland, OH
ABSTRACT
GILLINOV, S., M. ETIWY, R. WANG, G. BLACKBURN, D. PHELAN, A. M. GILLINOV, P. HOUGHTALING, H.
JAVADIKASGARI, and M. Y. DESAI. Variable Accuracy of Wearable Heart Rate Monitors during Aerobic Exercise. Med. Sci. Sports
Exerc., Vol. 49, No. 8, pp. 1697–1703, 2017. Purpose: Athletes and members of the public increasingly rely on wearable HR monitors to
guide physical activity and training. The accuracy of newer, optically based monitors is unconfirmed. We sought to assess the accuracy of
five optically based HR monitors during various types of aerobic exercise. Methods: Fifty healthy adult volunteers (mean TSD age = 38 T
12 yr, 54% female) completed exercise protocols on a treadmill, a stationary bicycle, and an elliptical trainer (Tarm movement). Each
participant underwent HR monitoring with an electrocardiogaphic chest strap monitor (Polar H7), forearm monitor (Scosche Rhythm+), and
two randomly assigned wrist-worn HR monitors (Apple Watch, Fitbit Blaze, Garmin Forerunner 235, and TomTom Spark Cardio), one on
each wrist. For each exercise type, HR was recorded at rest, light, moderate, and vigorous intensity. Agreement between HR measurements
was assessed using Lin"s concordance correlation coefficient (r
c
). Results: Across all exercise conditions, the chest strap monitor (Polar H7)
had the best agreement with ECG (r
c
= 0.996) followed by the Apple Watch (r
c
= 0.92), the TomTom Spark (r
c
= 0.83), and the Garmin
Forerunner (r
c
= 0.81). Scosche Rhythm+ and Fitbit Blaze were less accurate (r
c
= 0.75 and r
c
= 0.67, respectively). On treadmill, all devices
performed well (r
c
= 0.88–0.93) except the Fitbit Blaze (r
c
= 0.76). While bicycling, only the Garmin, Apple Watch, and Scosche Rhythm+
had acceptable agreement (r
c
90.80). On the elliptical trainer without arm levers, only the Apple Watch was accurate (r
c
= 0.94). None of
the devices was accurate during elliptical trainer use with arm levers (all r
c
G0.80). Conclusion: The accuracy of wearable, optically
based HR monitors varies with exercise type and is greatest on the treadmill and lowest on elliptical trainer. Electrode-containing chest
monitors should be used when accurate HR measurement is imperative. Key Words: WEARABLE HEART RATE MONITORS,
ACCURACY, RANDOMIZED TRIAL
Over the last decade, there has been a proliferation of
commercially available HR monitors and wearable
fitness devices. Targeting a larger audience than the
elite athletes who use HR monitoring to inform their training
and assess aerobic fitness, companies have entered the market
of population health, offering a variety of wearable HR and
activity monitoring systems to the public. Annual worldwide
sales of such devices are projected to reach 100,000,000 units
and $50 billion by 2019 (5,13,14).
Although many consumers purchase these wearable fitness
trackers to catalog their HR response to exercise, others use
them with the hope that they will improve health via weight
loss and/or increased aerobic fitness (3,4,10,12,13). However,
surveys document substantial attrition in the use of fitness
wearables, with up to one-third of individuals discontinuing
their use within 6 months of purchase (12).
In clinical practice, physicians and trainers frequently see
patients who report physiologic and behavioral data obtained
from their wearable devices; such data often include energy
expenditure, steps taken, sleep/wake times, and HR. Controlled
studies demonstrate variable accuracy of activity trackers, with
error margins approaching 25% for some devices (3,7,10,12,17).
A previous study suggests a somewhat lower error margin with
selected HR monitors (19).
Current questions regarding the accuracy of wearable HR
monitors are particularly relevant, given the recent consumer
shift from HR monitors that rely on chest straps with electrodes
that measure cardiac electrical activity toward more convenient
wrist-worn monitors that use optical sensing technology similar
to that used for pulse oximetry. Although the accuracy of chest
strap monitors has been confirmed in various previous reports
(6,18), there is a paucity of data validating the accuracy of
wrist-worn, optically based HR monitors (12). A recent study
from our group suggests that wrist-worn monitors fail to pro-
vide accurate readings during treadmill exercise; however, that
study examined only one form of exercise (treadmill walking/
running) and included first generation wrist-worn monitors,
one of which is no longer commercially available (19). Broad
Address for correspondence: Milind Y. Desai, M.D., Department of Car-
diovascular Medicine, Cleveland Clinic/Desk J1-5, Cleveland, OH 44195;
E-mail: desaim2@ccf.org.
Stephen Gillinov and Muhammad Etiwy are co-first authors and contributed
equally to the work.
Submitted for publication January 2017.
Accepted for publication March 2017.
0195-9131/17/4908-1697/0
MEDICINE & SCIENCE IN SPORTS & EXERCISE
Ò
Copyright Ó2017 by the American College of Sports Medicine
DOI: 10.1249/MSS.0000000000001284
1697
APPLIED SCIENCES
Copyright © 2017 by the American College of Sports Medicine. Unauthorized reproduction of this article is prohibited.
assessment of the monitors"accuracy is important both for the
individuals who rely on these monitors to guide their athletic,
physical, and rehabilitative activity and for the physicians to
whom these individuals report their HR readings for the pur-
pose of potentially guiding therapy.
The objective of this study was to assess the accuracy of five
commonly used, currently commercially available, optically
based wearable HR monitors in an appropriately powered study
under various forms of aerobic exercise conditions.
METHODS
Participants
This prospective study recruited 50 healthy adults 18 yr or
older through fliers and Internet notices at Cleveland Clinic from
June 2016 through August 2016. Participants were screened to
ensure that they were able to safely perform an 18-min exercise
protocol, including a treadmill, an elliptical trainer, and a sta-
tionary bicycle. The screening tool was adapted from the Na-
tional Academy of Sports Medicine"s screening questionnaire
(11). Study exclusion criteria included known cardiovascular or
lung disease, presence of a cardiac pacemaker, treatment with
beta-blockers or heart rhythm medications, and self-reported
chest pain, dizziness, or loss of balance. The protocol was ap-
proved by the Institutional Review Board of the Cleveland
Clinic, and all subjects provided written informed consent. The
study was registered at clinicaltrials.gov (NCT02818244).
HR Monitors
All participants wore standard ECG leads (Mason-Likar
electrode placement of torso-mounted limb leads), a Polar H7
chest strap monitor, and a Scosche Rhythm+ on the forearm. In
addition, each participant was randomly assigned by a com-
puter program to wear two different wrist-worn HR monitors,
one on each wrist; this enabled the assessment of each type of
wrist-worn monitor in 25 subjects. The wrist-worn monitors
assessed included Fitbit Blaze (Fitbit), Apple Watch (Apple),
Garmin Forerunner 235 (Garmin), and TomTom Spark Cardio
(TomTom). Four units of each type of monitor were pur-
chased from retail outlets and studied in random order. Each of
these optically based wearable monitors measures HR via an
optically obtained plethysmogram that is processed according
to proprietary algorithms.
Exercise Protocol
In each subject, HR was assessed using five different moni-
toring modalities (ECG, Polar H7 chest strap, Scosche
Rhythm+, and two different wrist-worn monitors). The Mason–
Likar electrode placement allowed the assessment of modified
leads I, II, and III on ECG. An aggressive electrode preparation
was performed at each site, which included cleansing with al-
cohol and light abrasion to reduce resistance and optimizing
signal quality. ECG was monitored on a Quinton Q-tel RMS
telemetry system, and hard copy rhythm strips were obtained to
measure HR. ECG-based HR was determined by visual as-
sessment under direct supervision by a cardiologist. In addi-
tion, the HR was measured when performing four different
types of exercise at varying intensities; these included tread-
mill, stationary bicycle, elliptical trainer with arm levers, and
elliptical trainer without arm levers. The order of exercises
wasassignedrandomly.
Exercise protocols for each piece of equipment were as
follows:
Treadmill
)2 mph for 1.5 min
)3.5 mph for 1.5 min
)6 mph for 1.5 min
Stationary bicycle
)25 W for 1.5 min
)55 W for 1.5 min
)125 W for 1.5 min
Elliptical (without arm levers)
)Light for 1.5 min: crossramp = 1, resistance = 1, cadence =
60–70 min
j1
)Moderate for 1.5 min: crossramp = 1, resistance = 5,
cadence = 90–100 min
j1
)Vigorous for 1.5 min: crossramp = 10, resistance = 10,
cadence = 90–100 min
j1
Elliptical (with arm levers)
)Light for 1.5 min: crossramp = 1, resistance = 1, cadence =
60–70 min
j1
)Moderate for 1.5 min: crossramp = 1, resistance = 5,
cadence = 90–100 min
j1
)Vigorous for 1.5 min: crossramp = 10, resistance = 10,
cadence = 90–100 min
j1
The treadmill settings of 2, 3.5, and 6 mph correspond to
workloads of 2.5, 3.7, and 10.2 METs, respectively. For a 70-kg
individual, the bicycle settings of 25, 55, and 125 W correspond
to 2.4, 3.7, and 8.8 METs, respectively. Because there are no
standard workload settings for elliptical trainers, we identified
three settings that were judged to represent light, moderate, and
vigorous activity.
Each subject spent 4.5 min at each of the four exercise sta-
tions and then rested for 2 min between different exercise sta-
tions; therefore, total exercise time was 18 min, and total time of
each trial was 24 min. HR signals for all devices were checked
at the beginning of each exercise/rest segment to ensure device
function. HR was recorded from HR monitors at the completion
of each 1.5 min exercise segment and at the end of each 2 min
rest period; preliminary studies in three subjects confirmed that
HR had reached a steady state at these time points. At each
timepoint,HRwasrecordedbytwotrainedresearchper-
sonnel (SMG and ME), one situated on each side of the sub-
ject. HR recordings from all devices and ECG were obtained
for a period of approximately 5 s. Values were entered into an
IRB-approved database.
http://www.acsm-msse.org1698 Official Journal of the American College of Sports Medicine
APPLIED SCIENCES
Copyright © 2017 by the American College of Sports Medicine. Unauthorized reproduction of this article is prohibited.
Statistical Methods
Sample size. Sample size was based on the use of Lin"s
concordance correlation coefficient (r
c
) to compare HR
measurements with wearable, optically based HR monitors
to those obtained with the ECG, which is considered the
standard (10). On the basis of previous work, we deemed an
r
c
90.8 to represent acceptable accuracy in HR measure-
ment (20). Generation of 25 pairs of data for each device
(i.e., device and ECG) was necessary to provide 90% power
to determine a difference from r
c
of 0.82 to r
c
of 0.93.
Analysis plan. Paired differences. Using the ECG-
determined HR as the standard, each of the HR monitoring
systems was assessed for accuracy by calculation of the
difference between the measures and compared. The paired
differences, both relative and absolute, were calculated as
(HR
ecg
–HR
device
) for each device under the various con-
ditions. The absolute percent differences were calculated as
([HR
ecg
–HR
device
]/HR
ecg
100).
Agreement. The Bland–Altman analysis was performed
to assess agreement for each device with ECG (2). In addition,
Lin"s concordance correlation coefficients (r
c
) and associated
95% confidence intervals were calculated to provide a mea-
sure of agreement for each device with ECG. The concordance
correlation coefficient (r
c
) measures the degree to which the
paired observations fall on the identity line (9).
Multivariable testing. Repeated-measures mixed model
ANOVA was used to test the overall effect of the fitness
devices while adjusting for covariates and taking into account
multiple measurements for each subject. In addition to HR
device and exercise condition (activity type and intensity), factors
in the final adjusted model included age, gender, body mass in-
dex, wrist size, and days of typical aerobic exercise per week.
Data were analyzed using SAS version 9.4 (SAS Institute
Inc., Cary, NC) and R software version 3.2.3 (15).
Presentation. Continuous variables are reported as mean T
SD, with median and percentile values. Categorical variables
are reported as percent and frequency.
RESULTS
Subjects. The study randomized 50 subjects (mean TSD
age = 38 T12 yr, 27 [54%] females, 6 [12%] non-Whites)
(Table 1, Fig. 1). Subjects were examined for the presence of
tattoos on the wrist; none had tattoos in this location. All
subjects engaged in regular aerobic exercise (including
walking), and 82% reported that they exercised regularly to
the point of perspiration. Subjects"mean TSD resting HR on
ECG was 86 T18 bpm.
Aggregate results. Of the 4000 possible HR mea-
surements, 3985 were recorded (99.6%). Across all ECG
tracings, there was minimal artifact and in no situation did
ECG artifact interfere with visual HR determination. Miss-
ing data were attributable to failure of the device to record
HR (eight for Apple Watch, four for Fitbit, two for Scosche
Rhythm+, and one for Garmin Forerunner 235.
Measured HR ranged from 51 to 184 bpm. Average dif-
ferences from the ECG standard were less than 1 bpm for the
Polar H7 under all exercise conditions but extended to
nearly 20 bpm for other monitors (Table 2). The average
differences from the ECG standard were calculated as both
relative error (which averages positive and negative differ-
ences from the ECG standard) and the absolute value of error,
TABLE 1. Baseline characteristics of participants.
Characteristic Mean TSD 15th/50th/85th Percentiles Min, Max
Age (yr) 38 T12 26/33/52 21, 64
Height (cm) 171 T11 160/170/181 155, 198
Weight (kg) 73 T16 57/70/88 46, 118
Body mass index (kgIm
j2
)25T3.5 21.5/24/29 19, 33
Days per week of aerobic exercise (including walking) 4.6 T1.7 3/5/7 1, 7
Left wrist circumference (cm) 16 T1.4 14.5/15.4/17.5 13.4, 19.2
Right wrist circumference (cm) 16 T1.4 14.6/15.5/17.5 13.7, 19.6
Male 23 (46%)
Nonwhite race 6 (12%)
Perspire during aerobic exercise 41 (82%)
FIGURE 1—Flow of participants through the study.
VARIABLE ACCURACY OF HR MONITORS Medicine & Science in Sports & Exercise
d
1699
APPLIED SCIENCES
Copyright © 2017 by the American College of Sports Medicine. Unauthorized reproduction of this article is prohibited.
regardless of direction. HR values on the wrist-worn monitors
varied from the ECG standard by approximately 2% to nearly
20%, depending on the monitor and the activity (Table 2).
Bland–Altman analysis revealed that all monitors had some
measurements that did not reflect HR accurately (Fig. 2);
however, this variation was not linked to specific HR values,
meaning that variability was not influenced by the HR magni-
tude. The Apple Watch had 95% of differences fall within j17
and 20 bpm of the ECG, whereas TomTom Spark Cardio and
Garmin Forerunner 235 had 95% of values fall within j24 and
31 bpm and j27 and 33 bpm, respectively. The corresponding
values for Scosche Rhythm+ and Fitbit Blaze were j31 and
38 bpm and j30 and 45 bpm, respectively.
Under all conditions combined, when compared with
ECG, the Polar chest strap had the highest agreement with
ECG with an r
c
of 0.99. Among wrist-worn monitors, the
Apple Watch performed best with r
c
= 0.92, followed by the
TomTom Spark Cardio (r
c
= 0.83) and Garmin Forerunner
235 (r
c
= 0.81). The Scosche Rhythm+ and Fitbit Blaze had
r
c
= 0.75 and r
c
= 0.67, respectively (Fig. 3).
The results of the mixed model confirmed that among the
optically based HR monitors, the Apple Watch was the most
accurate, with no statistical difference from ECG (P= 0.22),
even after adjustment for all other factors. The other opti-
cally based HR monitors often underestimated the true HR
(PG0.0001). Subject factors (age, gender, body mass index,
wrist circumference, and days of typical aerobic exercise per
week) were not associated with HR monitor accuracy.
Agreement with ECG during various types of
exercise. The Polar H7 Chest Strap performed well during
all different aerobic exercise modalities (r
c
= 0.99), but other
HR monitors"agreement with ECG varied with the type of ex-
ercise (Table 2). At rest, all monitors had r
c
90.88. With the
treadmill, all devices provided acceptable agreement (r
c
=
0.88–0.93) except the Fitbit Blaze (r
c
= 0.76). While biking,
Garmin Forerunner 235, Apple Watch, and Scosche Rhythm+
had the highest agreement with ECG (r
c
90.80). On the el-
liptical trainer without using the arm levers, only the Apple
Watch provided readings that agreed with the ECG (r
c
=0.94).
None of the optically based HR monitors provided good
agreement with ECG during elliptical trainer use with the arm
levers engaged (r
c
G0.80).
Although HR monitor agreement with ECG varied with
the type of exercise, it did not vary with the intensity of
exercise from easy to moderate on each piece of equipment.
However, when moving to vigorous exercise, only the Ap-
ple Watch had readings with similar agreement to ECG to
those obtained with less intense exercise; all other monitors
had less agreement during vigorous activity (PG0.003).
DISCUSSION
The results of this study demonstrate that optically based
wearable HR monitors are less accurate than electrode-
containing chest strap monitors. In addition, the accuracy of
these monitors varies with the type of aerobic activity. These
TABLE 2. HR monitor differences from ECG according to activity.
Activity nDevice
HR (bpm) Differences from ECG
Agreement
r
c
Paired Relative
Difference
Paired Absolute
Difference
Absolute Percent
(%) Difference
Mean SD Mean SD Mean SD
Treadmill 150 Polar Chest Strap 0.4 3.1 1.2 2.9 1.1 2.4 0.99
149 Scosche Rhythm+ j2.8 11.3 6.3 9.8 5.9 9.7 0.92
71 Apple Watch j1.7 9.6 5.6 8.0 4.9 6.7 0.93
74 Fitbit 5.9 17.5 12.4 13.7 10.4 10.8 0.76
74 Garmin j0.3 11.2 7.3 8.5 6.1 7.0 0.92
75 TomTom 1.4 13.3 7.2 11.2 6.2 9.5 0.88
Bike 150 Polar Chest Strap 0.1 1.9 0.6 1.8 0.6 1.6 0.99
149 Scosche Rhythm+ 3.9 12.6 5.5 11.9 4.8 8.8 0.84
75 Apple Watch 3.1 10.2 4.6 9.6 4.1 7.8 0.88
73 Fitbit 18.2 25.2 18.9 24.6 15.9 18.2 0.41
75 Garmin 3.4 9.2 4.8 8.5 4.6 7.7 0.91
75 TomTom 6.7 14.3 7.0 14.2 5.9 10.7 0.77
Elliptical (no arms) 150 Polar Chest Strap 0.1 2.0 0.6 1.9 0.6 2.3 0.99
150 Scosche Rhythm+ 14.1 24.7 16.0 23.5 13.1 17.6 0.27
74 Apple Watch 0.6 6.6 3.5 5.6 3.2 4.9 0.94
74 Fitbit 10.0 15.9 11.6 14.8 9.8 11.5 0.58
75 Garmin 7.7 18.1 11.0 16.2 9.7 13.6 0.54
75 TomTom 7.3 19.4 8.0 19.1 6.4 13.4 0.55
Elliptical (with arms) 150 Polar Chest Strap 0.5 2.7 1.0 2.6 0.9 2.4 0.99
150 Scosche Rhythm+ 2.9 22.5 14.0 17.9 12.4 15.6 0.41
72 Apple Watch 5.5 13.6 7.4 12.7 6.5 10.8 0.75
75 Fitbit 2.1 20.2 13.8 14.9 11.7 12.1 0.48
75 Garmin 2.2 23.3 14.8 18.0 13.7 16.8 0.31
75 TomTom 1.4 14.0 8.0 11.6 6.7 9.6 0.76
Rest 200 Polar Chest Strap j0.3 1.0 0.7 0.8 0.8 1.0 0.99
200 Scosche Rhythm+ 1.1 6.2 3.9 4.9 4.6 6.0 0.93
100 Apple Watch j0.4 4.2 3.0 3.0 3.5 3.4 0.96
100 Fitbit 2.8 8.2 5.4 6.8 5.6 6.4 0.89
100 Garmin 1.1 8.1 5.0 6.5 5.9 7.5 0.88
100 TomTom 1.3 5.6 3.8 4.2 4.5 5.3 0.94
http://www.acsm-msse.org1700 Official Journal of the American College of Sports Medicine
APPLIED SCIENCES
Copyright © 2017 by the American College of Sports Medicine. Unauthorized reproduction of this article is prohibited.
findings raise questions concerning the role of such monitors
in individuals"management of their health, assessment of
their fitness, and guidance of their fitness regimens.
Introduced in the 1980s, chest strap–based HR monitors
function much like an ECG, sensing cardiac electrical ac-
tivity. Several studies confirm the accuracy of most of these
HR monitors under conditions of both rest and moderate
exercise (6,8,18). Although chest strap–based HR monitors
have been favored by elite athletes because of their proven
accuracy, they are relatively inconvenient and have not been
widely adopted by the public. By contrast, the recent intro-
duction of convenient, wrist-worn HR monitors that include
the capability for wireless transmission has stirred widespread
public interest in HR monitoring. However, as reported by
major media outlets, individuals"experiences with the
newer class of HR monitors suggest that their accuracy may
be poor, particularly during exercise (16). This controversy
has reached the courtroom in the form of a class action
lawsuit alleging that the Fitbit device is inaccurate and
potentially harmful (14).
The new wrist-worn HR monitors do not measure cardiac
electrical activity; rather, they rely on photoplethysmography.
The monitor illuminates the skin with an LED and then mea-
sures the amount of light reflected back to a photodiode
FIGURE 2—Bland–Altman plots and 95% limits of agreement with electrocardiographically measured HR. A, Polar H7; B, Apple Watch; C,
TomTom Spark; D, Garmin Forerunner 235; E, Scosche Rhythm+; F, Fitbit Blaze.
VARIABLE ACCURACY OF HR MONITORS Medicine & Science in Sports & Exercise
d
1701
APPLIED SCIENCES
Copyright © 2017 by the American College of Sports Medicine. Unauthorized reproduction of this article is prohibited.
sensor; this enables detection of variations in blood volume
associated with the pulse of blood caused by each cardiac
contraction. Potential sources of error with optically based
monitors include motion artifact from physical movement,
misalignment between the skin and the optical sensor, varia-
tions in skin color/tone, ambient light, and poor tissue perfu-
sion (1). The accuracy of such monitors during exercise is
controversial, some studies suggesting that wrist-worn HR
monitors perform best at rest or slow walking, and others
asserting assert good accuracy even during vigorous exercise
(1,5). In a recent study examining subjects on a treadmill, we
found variable accuracy between different optically based HR
monitors; however, when compared with an ECG, the tested
monitors all had a concordance correlation coefficient ex-
ceeding 0.80 (19).
Extending that work, the current study assessed the per-
formance of wearable HR monitors using varying aerobic
exercise modalities (treadmill, stationary bicycle, and elliptical
trainer with and without arms) and at different levels of in-
tensity. Recognizing that people engage in a variety of types of
FIGURE 3—Concordance correlation coefficients depicting agreement of device-measured HR with ECG. A, Polar H7; B, Apple Watch; C, TomTom
Spark; D, Garmin Forerunner 235; E, Scosche Rhythm+; F, Fitbit Blaze.
http://www.acsm-msse.org1702 Official Journal of the American College of Sports Medicine
APPLIED SCIENCES
Copyright © 2017 by the American College of Sports Medicine. Unauthorized reproduction of this article is prohibited.
exercise beyond walking on a treadmill, the primary purpose
of the current study was to assess the monitors"agreement with
ECG during different forms of aerobic activity. Distinct from
the previous study, the current study enrolled a new cadre of
subjects and assessed several monitors that had not previously
been tested. Although all monitors performed well in subjects
at rest, their accuracy varied with different exercise modalities.
Certain monitors were better suited for the stationary bicycle
and the elliptical trainer (without arm motion), and this may be
a result of variable tolerances for motion artifact associated
with different exercises. In particular, none of the optical
monitors performed well when assessing HR in subjects using
the elliptical trainer with arm motion, likely a result of motion
artifact related to arm movement (1). By contrast, a chest strap
containing an electrically based monitor provided accurate
measurements, regardless of exercise intensity or modality.
Although this study is the largest of its kind and included
nearly 4000 HR measurements, it has limitations. The cur-
rent study methodology (e.g., visual recording of HR on
ECG) may have contributed to some error as compared with
a more rigorous approach wherein time stamped raw device
data were extracted. The results apply only to the HR
monitors tested. These monitors were chosen because of
their apparent popularity with the public, and each monitor
was the manufacturer"s most recent offering at the time of
the study; however, they represent an opportunistic sample
of the wide range of available HR monitors. Continuous HR
assessment, which is currently not feasible with all devices,
would enable more detailed comparisons. The devices were
assessed in young, healthy volunteers exercising in a labo-
ratory setting. Results may vary for different subsets of in-
dividuals, including cardiac patients. Although we accounted
for participant factors including age and BMI, the relatively
narrow distribution of age and BMI in this study of young,
healthy volunteers does not enable us to rule out a potential
effect of these factors on the accuracy of HR measurement. In
addition, these results may not be representative of those
obtained during more vigorous exercise or during different
activities (e.g., running on pavement, swimming, or other
sports participation).
CONCLUSION
This study demonstrates that optically based wrist-worn HR
monitors vary in their accuracy and that their accuracy is ac-
tivity dependent. Individuals who use such monitors should be
aware of the possibility of inaccurate measurements and that
some monitors (i.e., the Apple Watch) provide greater agree-
ment with ECG than do other monitors. Apparently, spurious
HR measurements should be confirmed by simple palpation to
measure HR or, if readily available, by ECG. When accurate
HR monitoring is essential, an electrically based chest strap
monitor should be used.
This study was supported by the Mary Elizabeth Holdsworth Fund
at the Cleveland Clinic.
The Mary Elizabeth Holdsworth Fund had no role in the design
and conduct of the study; collection, management, analysis, and
interpretation of the data; preparation, review, or approval of the
manuscript; and decision to submit the manuscript for publication.
There are no relevant conflicts of interest to disclose.
The results of the present study do not constitute endorsement by
the American College of Sports Medicine. The results of the study
are presented clearly, honestly, and without fabrication, falsification,
or inappropriate data manipulation.
REFERENCES
1. Alzahrani A, Hu S, Azorin-Peris V, et al. A multi-channel opto-
electronic sensor to accurately monitor heart rate against motion
artefact during exercise. Sensors (Basel). 2015;15(10):25681–702.
2. Bland JM, Altman DG. Statistical methods for assessing agree-
ment between two methods of clinical measurement. Lancet. 1986;
1(8476):307–10.
3. Case MA, Burwick HA, Volpp KG, Patel MS. Accuracy of
smartphone applications and wearable devices for tracking physi-
cal activity data. JAMA. 2015;313(6):625–6.
4. Diaz KM, Krupka DJ, Chang MJ, et al. FitbitÒ: an accurate and
reliable device for wireless physical activity tracking. Int J Cardiol.
2015;185:138–40.
5. El-Amrawy F, Nounou MI. Are currently available wearable devices
for activity tracking and heart rate monitoring accurate, precise, and
medically beneficial? Healthc Inform Res. 2015;21(4):315–20.
6. Laukkanen RM, Virtanen PK. Heart rate monitors: state of the art.
J Sports Sci. 1998;16(Suppl):S3–7.
7. Lee JM, Kim Y, Welk GJ. Validity of consumer-based physical
activity monitors. Med Sci Sports Exerc. 2014;46(9):1840–8.
8. Le
´ger L, Thivierge M. Heart rate monitors: validity, stability, and
functionality. Phys Sportsmed. 1988;16(5):143–51.
9. Lin LI. A concordance correlation coefficient to evaluate repro-
ducibility. Biometrics. 1989;45(1):255–68.
10. Murakami H, Kawakami R, Nakae S, et al. Accuracy of wearable
devices for estimating total energy expenditure: comparison with
metabolic chamber and doubly labeled water method. JAMA Intern
Med;176(5):702–3.
11. National Academy of Sports Medicine Data Collection Sheet
[cited 2016 April 5]. Available from: http://www.nasm.org/docs/
pdf/nasm:par-q-(pdf-21k).pdf.
12. Patel MS, Asch DA, Volpp KG. Wearable devices as facilitators,
not drivers, of health behavior change. JAMA. 2015;313(5):459–60.
13. Piwek L, Ellis DA, Andrews S, Joinson A. The rise of consumer
health wearables: promises and barriers. PLoS Med;13(2):e1001953.
14. Profils S. Do wristband heart trackers actually work? A checkup
[cited 2016 April 4]. Available from: http://www.cnet.com/news/
how-accurate-are-wristband-heart-rate-monitors/.
15. R package epiR for calculating concordance correlation coefficients
[cited 2016 April 4]. Available from: http://cran.r-project.org/web/
packages/epiR.
16. Stern J. Fitness bands with heart-rate tracking are missing a beat.
Wall Street Journal. December 16, 2014.
17. Swan M. Emerging patient-driven health care models: an examination
of health social networks, consumer personalized medicine and quan-
tified self-tracking. Int J Environ Res Public Health. 2009;6:492–525.
18. Terbizan DJ, Dolezal BA, Albano C. Validity of seven commer-
cially available heart rate monitors. Measurement in Physical Ed-
ucation and Exercise Science. 2002;6(4):243–7.
19. Wang R, Blackburn G, Desai M, et al. Accuracy of wrist-worn
heart rate monitors. JAMA Cardio. 2016;2(1):104–6.
VARIABLE ACCURACY OF HR MONITORS Medicine & Science in Sports & Exercise
d
1703
APPLIED SCIENCES
Copyright © 2017 by the American College of Sports Medicine. Unauthorized reproduction of this article is prohibited.
... Commercially available fitness monitors have become an incredibly popular form of wearable technology for tracking health and performance at rest and during exercise [1][2][3][4]. In fact, the American College of Sports Medicine has named "wearable technology" as the top fitness trend for seven of the last nine years [1], and the global industry of these products is currently valued at approximately 100 billion USD [4], with an estimated 100 million wearables sold annually [5,6]. These wearable devices are used to estimate numerous cardiovascular (heart rate, heart rate variability, percent oxygen saturation, and irregular rhythms); pulmonary (breathing rate); metabolic (oxygen uptake, energy expenditure, lactate threshold, and body temperature); performance (step count, distance covered, duration of exercise, training volume, pace, and acceleration); recovery; sleep; and stress variables [1]. ...
... Of these features, the monitoring of the heart rate is often highly beneficial due to its utility in prescribing optimal exercise intensity zones specifically designed for cardiorespiratory fitness, weight loss, and rehabilitation strategies in athletic and clinical populations [1,[7][8][9][10]. Previous studies, however, have established that the validity of measuring the heart rate in these devices can be influenced by the mode and intensity of exercise, location of placement on the body, sensor technology, and other factors [6,[11][12][13]. Moreover, wide-ranging accuracy values have been reported, with some devices exhibiting measurement errors exceeding 20% or up to 50 beats per minute compared to criterion methods (i.e., electrocardiogram, ECG) [6,8,12,[14][15][16][17]. ...
... Previous studies, however, have established that the validity of measuring the heart rate in these devices can be influenced by the mode and intensity of exercise, location of placement on the body, sensor technology, and other factors [6,[11][12][13]. Moreover, wide-ranging accuracy values have been reported, with some devices exhibiting measurement errors exceeding 20% or up to 50 beats per minute compared to criterion methods (i.e., electrocardiogram, ECG) [6,8,12,[14][15][16][17]. Concerns over these issues continue to be reported [3,4,12], and many authors have proposed more research should be completed on the accuracy, as well as reliability, of these devices [1,[16][17][18][19]. ...
Article
Full-text available
The purpose of this study was to determine the accuracy and inter-device reliability of the OTBeat BurnTM heart rate monitor during an incremental test to exhaustion on a cycle ergometer. Twenty males (mean ± SD age = 21.1 ± 1.9 years) volunteered to complete a test to exhaustion on a cycle ergometer with OTBeat BurnTM devices placed on the forearm and upper arm, with a 12-lead electrocardiogram used as the criterion. The heart rate was recorded every 30 s and averaged across each two-minute stage. Accuracy was assessed through calculation of the mean absolute percent error (MAPE), Bland–Altman plot, and Lin’s concordance correlation coefficient (CCC). An intraclass correlation coefficient (ICC) was used to assess the inter-device reliability. Statistical significance was set at α < 0.05. The MAPE (±SD), Bland–Altman regression analyses, and Lin’s CCC values were 0.9 (±0.6)% and 0.8 (±0.5)%, r = 0.107 and 0.303, and rc = 0.998 and 0.998 for the forearm and the upper arm monitors, respectively. The ICC for inter-device reliability was R = 0.999. Our findings indicated the OTBeat BurnTM monitors placed on the forearm and upper arm provided highly accurate and reliable values when compared to an electrocardiogram from low to maximal exercise intensities.
... The exercise test was terminated when the participant reached 80% HRR or by voluntary termination (example, lower limb fatigue). The recumbent bike was selected to minimize motion artefact from arm movement [11,22,40]. Participants were instructed to relax their hands at their sides throughout testing, as gripping handles and upper body movement may contribute to measurement error [11]. ...
... Motion artefact is considered a primary factor associated with HR measurement error in PPG devices during exercise [22]. We used a recumbent cycle to minimize upper body motion and its subsequent impact on motion artefact [11,40]. Nonetheless, we continued to observe measurement errors with increasing exercise intensity, predominantly in people with darker skin tones. ...
Article
Full-text available
Purpose To evaluate the accuracy of a wrist-based heart rate (HR) monitor at different exercise intensities across different skin tones. Methods Using a cross-sectional design, we compared HR measures from the wrist-based photoplethysmography Fitbit Charge 5 to the Polar H10 chest strap at rest and during the YMCA Protocol using a recumbent cycle ergometer. Participant were grouped into three skin tone categories: light (Fitzpatrick Scale Skin Types 1+2), medium (Types 3+4), and darker skin tone (Types 5+6). HR measures using the Polar chest strap during the exercise test were categorized as <40%, 40–60%, or >60% HR reserve (HRR). Absolute error in beats per minute (bpm) between the two devices was calculated for each measure. A linear mixed effects model was used to assess interaction effects between skin tone and exercise intensity, with participants as the random effect. Bland-Altman plots were used for visual analyses. Results Twenty-five participants [mean (SD): 25.8 (1.9) years old; 64% female] were included with 495 observations of simultaneous Fitbit and Polar HR recordings collected during exercise. During exercise, we observed a statistically significant interaction effect between skin tone and exercise intensity. Compared with light skin tone at <40% HRR, mean error was greater for medium skin tone at >60% HRR [mean error (95%CI): 11.8 (5.6–17.9) bpm, p<0.001] and darker skin tone at 40–60% HRR [7.6 (1.7–13.5) bpm, p = 0.011] and >60% HRR [11.7 (5.3–18.0) bpm, p<0.001]. Conclusion HR measurement error using a wrist-based device was greater with increasing exercise intensity for people with darker skin tones.
... Previous studies indicate that the accuracy of optical HR sensors may suffer due to high-intensity, excessive motion artifacts and/or rapid HR fluctuations (Gillinov et al., 2017;Horton et al., 2017;Merrigan et al., 2023;Müller et al., 2019;Spierer et al., 2015). Coincidentally, in our study, when focusing solely on HR measurements above the HIIT threshold, the correlation between PVS and Polar H10 decreased from 0.93 (very strong) to 0.84 (strong). ...
... The integration of wearable devices provided valuable real-time data that were instrumental in monitoring and assessing the patient's recovery progress. These devices [30] are widely accessible and known for their reliability in tracking physical activity [31], captured key metrics such as heart rate [32,33], swimming stroke efficiency [34], cycling cadence [35], training duration, and running biomechanics. This data enabled a detailed analysis of the patient's physiological responses to rehabilitation exercises and sports-specific activities. ...
Article
Full-text available
Introduction: Meniscus injuries are common among endurance athletes, requiring structured rehabilitation to restore function and facilitate a safe return to sport. Traditional in-person rehabilitation may not always be accessible or feasible for high-performance athletes. Telerehabilitation, incorporating virtual reality, motion tracking, and telemonitoring, offers an innovative approach to guided recovery. However, evidence supporting its effectiveness in elite athletes remains limited. Case presentation: This case report explores the application of an innovative telerehabilitation program for a 49-year-old triathlete recovering from partial meniscectomy following a medial meniscus tear. The program was structured into three progressive phases over 12 weeks, focusing on restoring range of motion (ROM), muscle strength, and functional stability while gradually reintroducing sports-specific activities. Results: By the end of the rehabilitation, the patient achieved full ROM and muscle strength (scoring 5/5 on the Medical Research Council scale for the vastus medialis), along with a pain-free state in both static and dynamic conditions. The integration of telemonitoring devices facilitated detailed monitoring and feedback, enabling personalized adjustments to the rehabilitation protocol. Key milestones included a return to swimming and cycling in Phase 2, reintroduction of running in Phase 3, and a full resumption of triathlon training by week 12. Conclusion: Despite the positive results, the study highlights the need for further research to validate these findings across larger cohorts and establish standardized telerehabilitation protocols for athletes. This case underscores the potential of digital health technologies in enhancing recovery trajectories for high-demand athletes post-meniscus surgery, paving the way for supervised, accelerated, and effective sports reintegration.
... Similarly, wearable heart rate monitors can provide valuable insights into cardiovascular health and help in the early detection of arrhythmias [3]. However, the success of these technologies depends not only on their technical accuracy but also on their integration into the daily lives of users. ...
Article
Full-text available
Non-invasive health monitoring technologies, such as continuous glucose monitors and wearable heart rate sensors, have revolutionized the way we manage health by offering real-time data without the need for invasive procedures. However, the success and accuracy of these devices are not solely dependent on technological advancements. Behavioral human factors, including user-centered design, psychological drivers, and behavioral aptitudes, play a crucial role in their effectiveness. This opinion paper explores the intersection of technology and human behavior, emphasizing how user habits, health literacy, and engagement influence the reliability and adoption of non-invasive health monitors
Article
Full-text available
Wearable technology like the Apple Watch is increasingly important for monitoring health metrics. Accurate measurement is crucial, as inaccuracies can impact health outcomes. Despite extensive research, findings on the Apple Watch’s accuracy vary across different conditions. While previous reviews have summarized findings, few have utilized a meta-analytic approach. This study aims to quantitatively evaluate the accuracy of the Apple Watch in measuring health metrics. The accuracy of the Apple Watch was assessed in measuring energy expenditure (EE), heart rate (HR), and step counts (steps). We searched Embase, PubMed, Scopus, and SPORTDiscus for studies on adults using the Apple Watch compared to reference measures. The Bland-Altman framework was applied to assess mean bias and limits of agreement (LoA), with robust variance estimation to address within-study correlations. Heterogeneity was assessed across variables such as age, health status, device series, activity intensity, and activity type. Additionally, the mean absolute percentage error (MAPE) reported in the included studies was summarized by subgroups. This review included 56 studies, comprising 270 effect sizes on EE (71), HR (148), and steps (51). The meta-analysis showed a mean bias of 0.30 (LoA: -2.09 to 2.69) for EE (kcal/min), -0.12 (LoA: -11.06 to 10.81) for HR (beats/min), -1.83 (LoA: -9.08 to 5.41) for steps (steps /min). The forest plots showed variability in LoA across subgroups. For MAPE, all subgroups for EE exceeded the 10% validity threshold, while none of the subgroups for HR exceeded this threshold. For steps, some subgroups exceeded 10%, highlighting variability in accuracy based on different conditions. This study demonstrates that while the Apple Watch generally provides accurate HR and step measurements, its accuracy for EE is limited. Although HR and step measurements showed acceptable accuracy, variability was observed across different user characteristics and measurement conditions. These findings highlight the importance of considering such factors when evaluating validity.
Article
Full-text available
Monitoring heart rate (HR) is vital for health management and athletic performance, and wearable technology enables scientists to obtain real-time cardiovascular insights. This study compares Machine Learning (ML) techniques, including Long Short-Term Memory (LSTM) networks, Physics-Informed Neural Networks (PINNs), and 1D Convolutional Neural Networks (1D CNNs). Then, we develop a hybrid Singular Spectrum Analysis (SSA)-Augmented ML technique to predict HR using wearable sensor data. Additionally, we investigate the impact of incorporating auxiliary physiological inputs, such as breathing rate (BR) and RR intervals, on predictive accuracy. The study utilizes the cardiorespiratory data acquired through wearable sensors while practising sports, including 126 recordings from 81 participants (53 males, 28 females) engaged in 10 different sports. Physiological signals were collected at 1 Hz using the BioHarness 3.0 (Zephyr Technology, Mangaluru, India). The dataset includes individuals with varied levels of sports experience (beginner, intermediate, and advanced), allowing for a more comprehensive evaluation of HR variability across different expertise levels. Our results demonstrate that the hybrid SSA-LSTM model reaches the lowest prediction error by effectively capturing HR dynamics. Furthermore, integrating HR, BR, and RR data significantly enhances accuracy over single or dual parameter inputs. These findings support adopting multivariate machine learning models for health monitoring, improving HR prediction accuracy for fitness and preventive healthcare.
Article
Full-text available
Background Consumer-grade wearables are becoming increasingly popular in research and in clinical contexts. These technologies hold significant promise for advancing digital medicine, particularly in remote and rural areas in low-income settings like sub-Saharan Africa, where climate change is exacerbating health risks. This study evaluates the data agreement between consumer-grade and research-established devices under standardized conditions. Methods Twenty-two participants (11 women, 11 men) performed a structured protocol, consisting of six different activity phases (sitting, standing, and the first four stages of the classic Bruce treadmill test). We collected heart rate, (core) body temperature, step count, and energy expenditure. Each variable was simultaneously tracked by consumer-grade and established research-grade devices to evaluate the validity of the consumer-grade devices. We statistically compared the data agreement using Pearson’s correlation r, Lin’s concordance correlation coefficient (LCCC), Bland-Altman method, and mean absolute percentage error. Results A good agreement was found between the wrist-worn Withings Pulse HR (consumer-grade) and the chest-worn Faros Bittium 180 in measuring heart rate while sitting, standing, and slow walking on a treadmill at a speed of 2.7 km/h (r ≥ 0.82, |bias| ≤ 3.1 bpm), but this decreased with increasing speed (r ≤ 0.33, |bias| ≤ 11.7 bpm). The agreement between the Withing device and the research-established device worn on the wrist (GENEActiv) for measuring the number of steps also decreased during the treadmill phases (first stage: r = 0.48, bias = 0.6 steps/min; fourth stage: r = 0.48, bias = 17.3 steps/min). Energy expenditure agreement between the Withings device and the indirect calorimetry method was poor during the treadmill test (|r| ≤ 0.29, |bias | ≥ 1.7 MET). The Tucky thermometer under the armpit (consumer-grade) and the Tcore sensor on the forehead were found to be in poor agreement in measuring (core) body temperature during resting phases (r ≤ 0.53, |bias| ≥ 0.8°C) and deteriorated during the treadmill test. Conclusion The Withings device showed adequate performance for heart rate at low activity levels and step count at higher activity levels, but had limited overall accuracy. The Tucky device showed poor agreement with the Tcore in all six different activity phases. The limited accuracy of consumer-grade devices suggests caution in their use for rigorous research, but points to their potential utility in capture general physiological trends in long-term field monitoring or population-health surveillance.
Article
Full-text available
Wrist-worn fitness and heart rate (HR) monitors are popular.¹,2 While the accuracy of chest strap, electrode-based HR monitors has been confirmed,³,4 the accuracy of wrist-worn, optically based HR monitors is uncertain.⁵,6 Assessment of the monitors’ accuracy is important for individuals who use them to guide their physical activity and for physicians to whom these individuals report HR readings. The objective of this study was to assess the accuracy of 4 popular wrist-worn HR monitors under conditions of varying physical exertion.
Article
Full-text available
Accurate estimation of energy expenditure is a key element in determining the relationships between aspects of human behavior , physical activity, and overall health. 1,2 Although wear-able devices for estimating energy expenditure are becoming increasingly popular, there is little evidence regarding their validity. 3,4 This study was performed to examine the validity of total energy expenditure estimates made by several wearable devices compared with gold standard measurements for a standardized day (metabolic chamber method) and free-living days (doubly labeled water [DLW] method).
Article
Full-text available
Lukasz Piwek and colleagues consider whether wearable technology can become a valuable asset for health care.
Article
Full-text available
Objectives The new wave of wireless technologies, fitness trackers, and body sensor devices can have great impact on healthcare systems and the quality of life. However, there have not been enough studies to prove the accuracy and precision of these trackers. The objective of this study was to evaluate the accuracy, precision, and overall performance of seventeen wearable devices currently available compared with direct observation of step counts and heart rate monitoring. Methods Each participant in this study used three accelerometers at a time, running the three corresponding applications of each tracker on an Android or iOS device simultaneously. Each participant was instructed to walk 200, 500, and 1,000 steps. Each set was repeated 40 times. Data was recorded after each trial, and the mean step count, standard deviation, accuracy, and precision were estimated for each tracker. Heart rate was measured by all trackers (if applicable), which support heart rate monitoring, and compared to a positive control, the Onyx Vantage 9590 professional clinical pulse oximeter. Results The accuracy of the tested products ranged between 79.8% and 99.1%, while the coefficient of variation (precision) ranged between 4% and 17.5%. MisFit Shine showed the highest accuracy and precision (along with Qualcomm Toq), while Samsung Gear 2 showed the lowest accuracy, and Jawbone UP showed the lowest precision. However, Xiaomi Mi band showed the best package compared to its price. Conclusions The accuracy and precision of the selected fitness trackers are reasonable and can indicate the average level of activity and thus average energy expenditure.
Article
Full-text available
This study presents the use of a multi-channel opto-electronic sensor (OEPS) to effectively monitor critical physiological parameters whilst preventing motion artefact as increasingly demanded by personal healthcare. The aim of this work was to study how to capture the heart rate (HR) efficiently through a well-constructed OEPS and a 3-axis accelerometer with wireless communication. A protocol was designed to incorporate sitting, standing, walking, running and cycling. The datasets collected from these activities were processed to elaborate sport physiological effects. t-test, Bland-Altman Agreement (BAA), and correlation to evaluate the performance of the OEPS were used against Polar and Mio-Alpha HR monitors. No differences in the HR were found between OEPS, and either Polar or Mio-Alpha (both p > 0.05); a strong correlation was found between Polar and OEPS (r: 0.96, p < 0.001); the bias of BAA 0.85 bpm, the standard deviation (SD) 9.20 bpm, and the limits of agreement (LOA) from −17.18 bpm to +18.88 bpm. For the Mio-Alpha and OEPS, a strong correlation was found (r: 0.96, p < 0.001); the bias of BAA 1.63 bpm, SD 8.62 bpm, LOA from −15.27 bpm to +18.58 bpm. These results demonstrate the OEPS to be capable of carrying out real time and remote monitoring of heart rate.
Article
Corresponding Author: Mitesh S. Patel, MD, MBA, MS, University of Pennsylvania, 13th Floor Blockley Hall, 423 Guardian Dr, Philadelphia, PA 19104 (mpatel@upenn.edu). Author Contributions: Ms Case had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis. Study concept and design: All authors. Acquisition, analysis, or interpretation of data: Case, Patel. Drafting of the manuscript: All authors. Critical revision of the manuscript for important intellectual content: Case, Patel. Statistical analysis: Case, Patel. Administrative, technical, or material support: Case, Burwick, Patel. Study supervision: Volpp, Patel. Conflict of Interest Disclosures: The authors have completed and submitted the ICMJE Form for Disclosure of Potential Conflicts of Interest. Dr Volpp reported receiving research funding from Humana, Merck, Discovery, Weight Watchers, and CVS; consulting income from CVS and VALhealth; and being a principal at VALhealth. No other disclosures were reported. Funding/Support: This study was funded in part through grant RC4 AG039114-01 from the National Institute on Aging. Dr Patel was supported by the US Department of Veteran Affairs and the Robert Wood Johnson Foundation. Role of the Funder/Sponsor: The National Institute on Aging, the US Department of Veteran Affairs, and the Robert Wood Johnson Foundation had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.
Article
This Viewpoint discusses issues that prevent wearable medical devices from effectively bridging the gap between recording information and changing health-related behavior. Several large technology companies including Apple, Google, and Samsung are entering the expanding market of population health with the introduction of wearable devices. This technology, worn in clothing or accessories, is part of a larger movement often referred to as the “quantified self.” The notion is that by recording and reporting information about behaviors such as physical activity or sleep patterns, these devices can educate and motivate individuals toward better habits and better health. The gap between recording information and changing behavior is substantial, however, and while these devices are increasing in popularity, little evidence suggests that they are bridging that gap.
Article
Many consumer-based monitors are marketed to provide personal information on the levels of physical activity and daily energy expenditure (EE), but little or no information is available to substantiate their validity. To examine the validity of EE estimates from a variety of consumer-based, physical activity monitors under free-living conditions. Sixty (26.4 ± 5.7 years) healthy males (n = 30) and females (n = 30) wore eight different types of activity monitors simultaneously, while completing a 69-minute protocol. The monitors included the BodyMedia FIT armband worn on the left arm, DirectLife monitor around the neck, Fitbit One, Fitbit Zip, and ActiGraph worn on the belt, as well as the Jawbone Up and Basis B1 band monitor on the wrist. The validity of the EE estimates from each monitor was evaluated relative to criterion values concurrently obtained from a portable metabolic system (i.e., Oxycon Mobile). Differences from criterion measures were expressed as a mean absolute percent error (MAPE) and were evaluated using 95% equivalence testing. For overall group comparisons, MAPE values (computed as the average absolute value of the group level errors) were 9.3, 10.1, 10.4, 12.2, 12.6, 12.8, 13.0, and 23.5% for the BodyMedia FIT, Fitbit Zip, Fitbit One, Jawbone Up, ActiGraph, DirectLife, NikeFuel Band, and Basis B1 band, respectively. The results from the equivalence testing showed the estimates from the BodyMedia FIT, Fitbit Zip, and NikeFuel Band (90% CI: 341.1, 359.4) were each within the 10% equivalence zone around the indirect calorimetry estimate. The indicators of the agreement clearly favored the BodyMedia FIT armband but promising preliminary findings were also observed with the Fibit Zip.
Article
Heart rate monitors have been marketed as an accurate way to measure heart rate. This investigation was designed to determine the validity of seven different types of heart rate monitors. Fourteen men (19.6 ± 2.3 years, 77.7 ±7.1 kg) participated in the study. Heart rate was measured simultaneously for 10 sec during minute 9 -10 of rest or during exercise on a treadmill at 85.7 m · min-1, 107.3 m · min-1, and 160.8 m · min-1. Pearson product-moment correlation (r) and standard error of estimate (SEE) were calculated between an electrocardiogram (ECG) measurement and the measurement of heart rate from each of the 7 heart rate monitors. The POLAR Vantage XL, POLAR Accurex II, Cardiochamp, and the Cateye-PL 6000 heart rate monitors accurately assessed heart rates (r 0.90, SEE £ 5 beats · min-1) during rest and at moderate activity. At the highest speed, the accuracy decreased, leading us to caution the heart rate monitor user regarding the validity of any of these monitors at high heart rates and motion levels.