Content uploaded by Clayton Miller
Author content
All content in this area was uploaded by Clayton Miller on Dec 10, 2022
Content may be subject to copyright.
Indoor Air. 2022;32:e13160.
|
1 of 16
https://doi.org/10.1111/ina.13160
wileyonlinelibrary.com/journal/ina
Received:5July202 2
|
Revised:2 0Septem ber2022
|
Accepted :15Octob er2022
DOI:10.1111/ina.13160
ORIGINAL ARTICLE
Personal comfort models based on a 6- month experiment using
environmental parameters and data from wearables
Federico Tartarini1 | Stefano Schiavon2 | Matias Quintana3 | Clayton Miller3
This is an op en access arti cle under the ter ms of the CreativeCommonsAttribution License, which permits use, distribution and reproduction in any medium,
provide d the original wor k is properly cited.
©2022TheAuthor s.Indoor AirpublishedbyJohnWiley&So nsLtd.
1BerkeleyEducat ionAllianceforResearch
inSingapore,Singapore,Singapore
2Center for the Built Environment,
University of C alifor nia, Berkeley,
Califo rnia,US A
3Department of the Built Environment,
Nationa lUniver sityofSingapore,
Singapore,Singapore
Correspondence
Federico Tartarini, Berkeley Education
AllianceforRese archinSingapore,
Singapore,Singapore.
Email: federicotartarini@berkeley.edu
Funding information
NationalResearchFoundationSingapore
Abstract
Personal thermal comfort models are a paradigm shift in predicting how building oc-
cupants perceive their thermal environment. Previous work has critical limitations
related to the length of the data collected and the diversity of spaces. This paper
outlines a longitudinal field study comprising 20 participants who answered Right-
Here-Right-Nowsurveysusing a smartwatchfor 180 days.Wecollected more than
1080field-basedsurveysperparticipant.Surveyswerematchedwithenvironmental
and physiological measured variables collected indoors in their homes and offices.
We then trained and tested seven machine learning models per participant to predict
their thermal preferences. Participants indicated 58% of the time to want no change in
their thermal environment despite completing 75% of these surveys at temperatures
higherthan26.6°C.Allbutonepersonalcomfortmodelhadamedianpredictionaccu-
racyof0.78(F1-score).Skin,indoor,nearbodytemperatures,andheartratewerethe
mostvaluablevariablesforaccurateprediction.Wefoundthat≈250–300datapoints
per participant were needed for accurate prediction. We, however, identified strate-
giestosignificantlyreducethisnumber.Ourstudyprovidesquantitativeevidenceon
how to improve the accuracy of personal comfort models, prove the benefits of using
wearable devices to predict thermal preference, and validate results from previous
studies.
KEYWORDS
ecologicalmomentaryassessment,internetofthings(IoT),machinelearning,personalthermal
comfort model, skin temperature
1 | INTRODUC TION
Occupantthermalcomfor tsignificantlyaffectshowpeopleperceive
their indoor environment, and thermal dissatisfaction is an ongoing
challeng e. Evidence shows th at approximatel y 40% of the 90 0 00
surveye d occupant s in North A merica wer e dissatisf ied with the ir
thermal environment.1 Thermal comfort models are designed to
predic t comfort toward add ressing this challe nge. All major ther-
mal comfort standards have models that are considered aggregate
in nature.2,3 All mainstream aggregate models aim to predict how
a “typical” person or a group of people would perceive their ther-
mal environment in terms of given environmental (e.g., relative hu-
midity, indoor air temperature [ti]), and personal (i.e., clothing and
metabolic rate) parameters. For example, the Predicted Mean Vote
(PMV) predicts the average thermal sensation of a group of people
sharing the same environment, as an outcome of the heat transfer
balance model bet ween the human body and its surrounding envi-
ronment. The PMV was developed through laboratory experiments
16000668, 2022, 11, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/ina.13160, Wiley Online Library on [10/12/2022]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
2 of 16
|
TARTAR I NI et al .
by Fanger,4andisnowincludedinboththeISO7730:2005
2 and
ASHR AE55-2020Standards.3
1.1 | Limitations of aggregate models
Both the PMV and the adaptive models have several limitations
when used to control the temperature in buildings,5- 7 despite
their successful adoption into international standards. (1) Required
inputs—In real buildings, it is extremely challenging to accurately
measure some input variables needed to calculate PMV, such as
metabolic rate, clothing, airspeed, and mean radiant temperature.8
(2) Prediction accuracy— Even when all input variables are accurately
measured, these models have poor accuracy both in predicting
group and individual thermal comfort.9 (3) Training—A ggregatemod-
els do not adapt or re- learn.6 They were developed using fixed and
limited dat asets and did not benefit from new feedback provided
by people. They do not learn and adapt to specific conditions.5 (4)
Limited inputs—Aggregatemodelsonlyuse asmallsetof inputvari-
ables. They do not use variables, such as skin temperature (tsk), heart
rate (HR), age, or health status, that may affect the thermal percep-
tions of people.5
1.2 | The emergence of personal comfort models
Personal comfort models challenge the one- size- fits- all approach of
aggregatemodels.Instead of an averageresponsefrom agroup of
people, a single model is trained and tested for each participant.
Personal comfort models are, however, not limited to predicting one
person's thermal preference. Their aggregated outputs can be used
to predict the thermal preference of a large group of people sharing
the same environment.5Since theirintroduction,personalcomfort
models have been expanded to leverage data collected using a wide
array of sensors, including portable sensors and devices,10,11 build-
ing management systems,12,13 personal comfort systems,14 as well
as onboard sensors in wearable devices and smartphones. This net-
work of sensors can remotely and non- intrusively measure, log and
store spatiotemporal environmental and physiological data.
Wearable devices have increased the viability of personal model
development due to the use of physiological sensors to improve
model accuracy. For example, skin temperature (
tsk
) reflects the va-
somotor tone15 while heart rate correlates with activity levels. This
is suppor ted by previous research that has shown that the use of
tsk
as an independent variable c an improve the prediction accuracy of
thermal comfort models.16- 21Incer tainapplications,
tsk
may be even
determined using non- contact sensors like infrared.22- 24 However, it
is essential to emphasize that non- contact sensors are less accurate
than those that are in direct contact with the skin; they can only
monitor
tsk
from body areas that are in the line of sight to the camera
and are expensive to install.6They,however,donotrequirehavinga
sensor to be worn by people. Experimental methodologies collect-
ing
tsk
are common and iButtons sensors are of ten used. They can
accurately measure and log
tsk
.
25- 2 7 Currently, most smartwatches
on the market can measure HR with suf ficient accuracy for thermal
comfort research; however, none incorporate sufficiently accurate
skin temperature sensors.18
1.3 | Limitations of personal comfort models
Despite the momentum of personal comfort models, there are still
several unknowns and limitations as outlined in a recent review.28
This analysis pinpoints a lack of diversity in space types, climates,
and conditions used to train personal comfort models. The review
showed that only 3 out of 37 studies selected for analysis included
data collection outside office spaces or lab- based thermal chambers
used to emulate an office environment. 28Another limitationisthat
there was a wide range of the amount of longitudinal data collected
in the studies, with anywhere between 8 and 416 points collec ted
per person. Researchers placed little emphasis on whether the
length and data amount were exhaustive in capturing the predic t-
abilit y of an individu al. In addit ion, in pers onal comfor t model ex-
periments, it is not common or easy to log and measure information
about the participant's dynamic personal factors such as clothing or
activity levels.29Addressingthelackofdiversityandtheamountof
data is not easy due to experimental constraints.
One of the bi ggest challenge s that researche rs currently fa ce
is recording how people perceive their thermal environment over
a long period of time while minimizing the fatigue of completing a
Right- Here- Right- Now (RHRN) thermal comfor t survey. To partially
solve this issue, Kim et al.30 tried to infer occupants' thermal pref-
erences by analyzing specific behaviors, such as turning on and off
heating and cooling devices. They then coupled these data with
environmental readings to infer a user's preferences without them
having to complete a sur vey. However, thermal actions may be trig-
gered by other reasons besides thermal discomfor t; for example,
Practical implications
Inadditiontodemonstratingtheadvantagesofemploying
wearable technology to gather subjective feedback from
people, our study validates the findings from earlier re-
searchandoffersquantitativeevidenceonhowtoincrease
theprecisionof personal comfor tmodels.Ourmethodol-
ogy and results c an be used in buildings to develop and im-
plement occupants centric controls. This enables building
operators to enhance thermal comfort conditions indoors
while possibly reducing the overall energy consumption of
the building. We made the decision to openly publish our
data so that others might use it to test various assump-
tions or create personal comfort models utilizing various
methodologies.
16000668, 2022, 11, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/ina.13160, Wiley Online Library on [10/12/2022]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
|
3 of 16
TAR TAR I NI et al.
Kim et al.30 found that users turn on the heating element in their
chair to mitigate back pain.
1.4 | Improving personal comfort models through
larger and more diverse longitudinal data
To address the limitations mentioned above, an emerging method-
ology focuses on the use of wearable devices to collect physiologi-
cal data and act as the subjective feedback collection interface. This
method builds upon research in the area of Ecological Momentary
Assessm en ts(EM A),af or mof colle ctingsubjectiveinfor mationi ndi-
verse field- based settings.31Ast y leoft hism et hodol og yemergingas
ap o p ul a r wa y to r ed u ce t h ei n ci d e nc e of s ur v e yf a t ig u e is m ic r o- EM A ,
in which smartwatches are used to prompt a research participant to
leave feedback in a fast and time- efficient manner.32Micro-EMAhas
been shown to deliver higher response rates with a lower burden
on research participants than a smartphone or computer- based sur-
vey. 33 To build upon this foundation and help solve the issue of col-
lecting perception data from people, our team has contributed to the
developmentofthemicro-EM ACozieprojectthattargetsindo oroc-
cupant data collection.34,35 Cozie is an open- source application that
onecaninstallon Fitbit(Versa2andIonic)orApplesmart watches.
The platform has been utilized in previous studies to test the im-
plementation and modelling of smartwatch- based subjective data
collection,36- 38 study thermal preference, imbalanced classes,39 and
create personal comfort models using building information model
components as inputs.40OnecanfindmoreinformationaboutCozie
and the official documentation at https://cozie.app and https://cozie
- apple.com. Cozie allows people to conveniently complete an RHRN
survey v ia their smar twatches. S ubjects' p erceptions, p references,
and behaviors collected via Cozie can then be coupled with environ-
mental data collected from wireless sensing devices and physiologi-
cal data collected by the smartwatch.
1.5 | Aim and objectives
Ourr esearchaimstoresolvegapsinper son alther malcomfor tmo d-
elsbycollectingfield-basedthermalpreferencedata.Ourmethodol-
ogyisdesignedtoenableustoaddressthefollowingquestionswith
resulting novel insights:
• How many data points per user must be collected to develop a
reliable and robust personal comfor t model? We collected data
for180 daysresultinginmorefeedbackresponsesperperson(up
to 1080) than in any previous study.28
• Areenvironmental and physiologicaldata sufficient to trainper-
sonal thermal comfort models while minimizing the impact on
users? The methodology of this paper utilizes a novel framework
of simple-to-use non-intru sivetechniques to collect physiologi-
cal, environmental, and geospatial data using smart watch- based
micro-EMA.
• C an increasing the diversity of space t ypes and conditions im-
prove the accuracy of personal comfort models? How can differ-
ent variables contribute to the overall model accuracy? This study
is designed to collect data from diverse spaces, including the par-
ticipant s' homes, where there is a lack of data in previous studies.
Inaddition,thispaper is novel in accurately monitoring whether
the RHRN was completed during transitory conditions.
Inaddition,wedecidedtopubliclyshareourdatasootherpeo-
ple can use it to test different hypotheses or develop personal com-
fort models using a different methodology.
2 | METHODOLOGY
We collected subjective responses and physiological dat a from
human subjects using wearable devices, per sonal data using sur veys,
and environmental data using data loggers. We then applied super-
vised machine learning algorithms to train personal thermal comfort
models for each study participant. Thermal preference votes from
the RHRN survey (i.e., Q.1 Cozie Survey—Thermal preference—
please see Section 2.4) were utilized as the ground truth labels for
model training and evaluation. The methodology and sensors we
used to measure and log data are summarized in Figure 1, while a
flowchar t depicting the methodology we used to analyze the data
is shown in Figure A.2. The human subject experiment for this
study wa s approved by the Unive rsity of Califo rnia Berkeley IRB
(Institutional Review Board: 2020-01-12899). We compensated
participants who completed the study with gift vouchers for a total
amountofSGD400.
2.1 | Subjects
Participants were recruited through online posting. The inclu-
sion criteria were that the participant must: have lived for at least
3 months in Si ngapore, be at lea st 21 years old, and b e fluent in
English. Personal information (e.g., sex, age, and education) about
participants was collected using a web- based survey at the begin-
ning of the study.
2.2 | Wearable sensors
Each par ticipant received a Fitbit Versa (v1 or v2) and was asked to
wear it daily for the whole duration of the study.
To measure and log wrist skin temperature (
t
sk
,w
) and wrist near
body temperature (
t
nb
,w
)we installedtwo iButtons, model DS1925,
ontheFitbitwristband.OneiButtonwasinstalledontheinnerside
of the wristband and measured
t
sk
,w
in the front part of the wrist. The
other was installed above the watch display and was used to mea-
sure
t
nb
,w
. Figure A .1 shows the exact location of where the iButtons
were installed.
16000668, 2022, 11, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/ina.13160, Wiley Online Library on [10/12/2022]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
4 of 16
|
TARTAR I NI et al .
More information about the rationale on why we used Fitbit and
iButtoncanbefoundinSection1oftheAppendix.
Participants were asked to complete the RHRN no sooner than
10 min after either wearing the Fitbit or changing clothes or activities.
This further limits the error in the measurement of
t
sk
,w
and ensured
that they did not complete an RHRN survey during a transitory.
2.3 | Environmental sensors
Environmental data were monitored and logged using three sensors.
Onewasinstalledintheroomoftheirhouse,wheretheyspentthe
majority of their time indoors. This room corresponds to the “Home”
locationinquestionthreeoftheCoziesur veyasshowninFigure 2.
Anotherwasusedtomeasureandlog
ti
and relative humidit y at the
participant's workplace. This room corresponds to the “Work” loca-
tion in the Cozie survey. The workstation could be in their office or
home if they were working from home. Finally, the third sensor on a
bag/backpack of their choice. Participants were instructed to select
“Portable”inquestionthreewhenwithina2mradiusofthissensor.
DetailedinformationabouteachsensorusedispresentedinSection
1 of the Appendix and Table A.1.
2.4 | Sur veys
Participants were asked to complete, on average, a total of 42 RHRN
surveysperweekoveraperiodof180 daysusingtheCozieclockface.
Figure 2showstheflowofquestionsthatwereincludedintheRHRN
survey.
Q.1— “Would you prefer to be?” assesses the thermal preference
usingathree-pointscale.Q.2—“Areyou?”logsifparticipantscompleted
thesurveyeitherindoorsoroutdoors.Q.3—“Areyounearasensor?”de-
termines if a participant is in proximity to one of the three environmen-
tal sensors. Q.4— “What are you wearing?” participants reported their
clothing level using a 4- point ordinal scale. Q.5— “Can you perceive air
movement around you?” assesses if the air surrounding the participant
wasstill.Q.6—“Activitylast10-min?”participantsreportedtheiractivity
level over the last 10 min. Q.7— logged if the survey is answered during
a transitory situation or in a near “steady- state” environment.
FIGURE 1 Methodologyusedtocollectdatainourstudy.Par ticipantsansweredtheRHRNsur veysusingtheFitbitCozieclockface.
Physiological data and RHRN responses were first sent to the Fitbit companion applic ation and then synced with a cloud database.
The HR data were downloaded from the Fitbit accounts.
t
sk
,w
and
t
nb
,w
were measured using t wo iButtons which were installed on the
Fitbitwristband.IndoorlocationwasmonitoredusingtwoBLEbeaconscommunicatingwiththeBEARSAndroidapplicationwheneach
participant's phone was in their proximity. Environmental data were uploaded to the cloud database using Wi- Fi. Finally, participants were
reminded to complete the RHRN surveys using Telegram, a messaging application.
16000668, 2022, 11, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/ina.13160, Wiley Online Library on [10/12/2022]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
|
5 of 16
TAR TAR I NI et al.
The questions flow was always displayed in the same order.A
custom- made algorithm analyzed real- time environmental data and
occupants' indoor location that was logged by an application we de-
veloped. Participants received a message when in the proximity of the
two environmental sensors, and they had completed less than 10% of
the total RHRN surveys in those environmental conditions.
2.5 | Weather data
WeatherdatawereobtainedfromtheSingaporeGovernmentweb-
site that provides 1- min interval data.41 Weather data was merged
withtheGPSinformationcollectedbytheCozieappandanswersto
questiontwooftheRHRNsur vey.
2.6 | Data analysis
The source code we used to analyze the data and the full dataset are
publicly available at this URL: https://github.com/Feder icoTa rtari ni/
dorn- longi tudin al- tc- study.
2.6.1 | Datapreparation
Participants completed surveys while performing a wide range of
activities, wearing different clothing, being in multiple locations, and
being exposed to a broad range of environmental conditions.
We aimed to develop a personal thermal comfort model for each
participant, which could potentially be used to better control and op-
eratebuildings. Consequently, we decided to excludethe responses
that participants provided: (i) while exercising, (ii) when not in the
proximity of either of the environmental sensors provided (answered
“No” to Q.3), (iii) during a transitory situation (answered “Yes” to Q.7),
(iv) when outdoors, and (v) while not wearing the smartwatch cor-
rectly. The rationale behind our decisions was that personal comfort
models could mainly be used indoors to improve thermal comfort con-
ditions where environmental conditions can be controlled. We provide
a detailed description of how we implemented the above- mentioned
selectioncriteriainSection2oftheAppendix.
2.6.2 | Supervisedmachinelearningalgorithms
We used seven supervised machine learning classifiers to predict
thermal preferences: Logistic Regression (LR), Random Forest (RDF),
ExtremeGradientBoosting( XGB), Support Vector Machine(SVM),
K-NearestNeighbors(KN),GaussianNaiveBayes(GNB),andMulti-
LayerPerceptron(MLP).We used the Kruskal–WallisH- test to test
the null hypothesis that the population median of all the groups is
equal. The Kruskal–Wallis H-tes t was used since the ANOVA as-
sumptions were not satisfied, and it is a non- parametric version of
ANOVA.Therejectionsofthenullhypothesisdonotindicatewhich
ofthegroups differs.Comparisonsbetweengroupsarerequiredto
determine which groups are different.
2.6.3 | Trainingdatasize
Oneofourobjectiveswastodeterminehowthenumberoftraining
data points would af fect the model accuracy. This has practical ap-
plicationssinceitwouldinformusoftheminimumrequirednumber
FIGURE 2 Right-Here-Right-Now(RHRN)surveyquestionsdisplayedusingtheCozieclockface
16000668, 2022, 11, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/ina.13160, Wiley Online Library on [10/12/2022]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
6 of 16
|
TARTAR I NI et al .
of RHRN to be collected from each participant. The hypothesis is
that a higher number of data for each participant would lead to more
accurate results. To test this, we randomly selected 100 data points
for testing and then trained the models using the first 42 RHRN sur-
veys(approximately1 weekofdata)eachparticipantcompleted.We
then iteratively trained a new model for each increment which com-
prised additional 84 training data points.
2.6.4 | Independentvariableselection
The independent variables we used to train our models are shown
in Table 1. Each column represents a sub- set of variables and each
row the respective model. The variables were grouped as follows:
environmental— outdoor air temperature, outdoor humidity ratio,
indoor air temperature (
ti
), and humidity ratio indoors (Wi); clo–
met—self-reportedclothingandactivityasexplainedinSection2.4;
wearable— location, heart rate (HR), wrist skin temperature (
t
sk
,w
), and
wrist near body temperature (
t
nb
,w
); time— hour of the day, weekday
or weekend, and day of the week.
We also computed some variables (hist) to take into account how
thermal history may have influenced how participants perceived their
environment at the time of completing the RHRN survey. For each of
the time- series data included in either the environmental or the wear-
able variable sets, we calculated the following additional variables:
exponentially weighted moving average and gradient over a 20 and
60 minperiodprecedingthesurvey.Theaverageandgradientforthe
weather data were calculated using timeframes of 1 and 8 h.
We used the SHapley Additive exPlainations (SHAP) method
to determine how much each variable influences the output of the
model.TheprimaryideabehindShapley'svalue-basedexplanations
of machine learning models is to divide the credit for a model's out-
put among its input variables using fair allocation outcomes from co-
operative game theory.42,43Theuseof theSHAPapproachallowed
us to understand and interpret how and why our complex models
made specific predictions.
We included env, time, and wearable in all models since previous
research has demonstrated that the inclusion of these variables into
personal comfort models significantly increases their prediction ac-
cura cy.18 We, therefore, decided only to test whether the use of his-
torical and self- reported clothing and activity would have improved
the prediction accuracy in our case.
We have shared the data we collected publicly so other research-
ers may test different hypotheses or use a different approach from
the one described in this paper.
Including indoor air temperature (
ti
), wrist skin temperature
(
t
sk
,w
),andwristnearbodytemperature(
t
nb
,w
) in all models may in-
troduce multicollinearity. The environment to which a person is
exposed, the clothing they wear, and the actions they perform,
together which several other factors that affect how indoor air
temperature (
ti
), wrist skin temperature (
t
sk
,
w), and wrist near body
temperature (
t
nb
,w
) are correlated. We, therefore, decided to keep
them all in the models since they allowed us to potentially capture
all the above- mentioned interactions that cannot be measured but
still play a significant role in how people perceive their thermal en-
vironment. For example, the near- body temperature may approxi-
mate the air temperature when a person is exposed to elevated air
speeds.On theotherhand, itwillbemoreinfluenced bytheskin
temperature when the person is resting and the air in the room is
still. Itis worthmentioningthat Applein their latestsmartwatch,
the AppleWatch 8 released in October 2022, also included t wo
temperature sensors, one that measures the skin temperature
and one below the screen to isolate the body temperature from
the out side environme nt. Apple claim s that this allows t hem to
get a more accurate estimate of the variables that they want to
predict.44
2.7 | PMV estimation
We used the measured environmental variables and personal fac-
tors, qualitatively logged by the par ticipants to calculate the PMV
using the following assumptions. The activit y levels reported by the
participants were mapped using the following values resting = 0.8
met, sitting = 1.1 met, and standing = 1.4 met. While repor ted cloth-
ing values were mapped as follows very light = 0.3 clo, light = 0.5
clo, medium = 0.7 clo, and heavy = 1.0 clo. These numbers were de-
termined by asking each participant which clothes on average they
wore when selecting one of the above options. The mean radiant
temperature was assumedtobeequal to
ti
.
45 The relative airspeed
value was calculated assuming theairspeedtobeequal to 0.1 m/s
and using the self- reported activit y levels. We are fully aware that
these assumptions have limitations and do affect PMV prediction
accuracy; however, similar assumptions have been previously used
TAB LE 1 Independentvariablesusedtotraintherespectivemodel
Variable sets
Model env time wear clo– met env- hist wear- hist
Thermal preference PCM xXx
ThermalpreferencePCMclo–met xXx x
ThermalpreferencePCMclo–methist xXx x x x
Note:Weusedthefollowingabbreviationsinthetable:self-reportedclothingandac tivity(clo–met),environmental(env),wearable(wear),and
historical (hist).
16000668, 2022, 11, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/ina.13160, Wiley Online Library on [10/12/2022]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
|
7 of 16
TAR TAR I NI et al.
in thermal comfort research.30 Finally, we mapped the PMV val-
ues into thermal preference votes using the following assumptions:
“Warmer ” for PMV <1.5, “Cooler” for PMV >1.5, and “No Change”
for −1.5 ≤ PMV ≤ 1.5. Th is is the same as sumption ma de by Fanger
who considers dissatisfied those people who reported their abso-
lute value of thermal sensation to be either 2 or 3.4 This is based on
the assumption that , for example, people who have a thermal sensa-
tion of “Warm” or “Hot” is highly probable that they may want to be
“Cooler.”Inthispaper,wedidnotdrawconclusionsontheaccuracy
of the PMV model, but we only used it as a benchmark value to as-
sess the accuracy of the thermal personal comfort models.
2.7.1 | Evaluationcriteria
The model prediction accuracy was evaluated using the following
metrics: F1- micro, F1- macro, and Cohen's kappa. We calculated
all these metrics for a more precise interpretation of the results,
however, we only reported the F1- micro scores unless there was a
significant disagreement between the prediction accuracy scores
of different metrics. F1- micro ranges between 0 and 1 where 1
represents the optimal prediction value. F1- micro measures the
predic tion accura cy and gives eq ual impor tance to prec ision (true
positives divided by all positive result s) and recall (true positives di-
vided by the number of samples that should have been identified as
positives).Inmultilabelclassification,(i.e.,in our case since thermal
preference assumes three values) the F1- micro is calculated globally
across all classes.
2.7.2 | Trainingandtesting
Hyper- parameters optimization is done using a random search and
5- fold cr oss- validation . We tested 10 ran dom combinations of hy per-
parameters in each of the 5- fold, and the best performing model,
intermsof objec tivemetricasspecifiedinSection 2 . 7.1, is chosen.
Table A.2 shows the parameters chosen for training the models and
performing the random search. We repeated this entire process 100
times for each model.
3 | RESULTS
The longitudinal study commenced in April 2020 and ended in
December 2020 in Singapore. A total of 20 participants (10 males
and 10 females) took part in our study. Key information about each
participant is presented in Table 2.
3.1 | Dataset preparation and cleaning
Participantscompletedatotalof22212RHRN.Ofthetotalsurveys
collected, participants completed 2% of them while exercising, 6%
while outdoors, and 12% while in transitory conditions. These sur-
veys were not included in the data analysis as previously explained
inSection2.6.1.
The
t
sk
,w
and
t
nb
,w
data we measured while the par ticipants com-
pleted the RHRN are depicted in Figure 3A.Inapproximately97%of
the total completed surveys, the value of
t
sk
,w
was higher than
t
nb
,w
.
This result was expected since the maximum value of
ti
that partici-
pants experienced throughout the study never exceeded 34°C . For
example, the delta between
t
sk
,w
and
t
nb
,w
in participant 10 was as
low as −0.7°C, while the average value across all participants was
−3.2°C .Weconsequently remove thedata usingthemethodology
detailedinSection2oftheAppendix. This removed more than 15%
of the total number of surveys collected by the following partici-
pants 05, 10 (73% excluded), 12, 14, and 18.
This sub-set of theoriginal dataset,which included 13 073sur-
vey responses, was used in the data analysis. The filtered number of
surveys for each participant is shown in Figure 3B.
3.2 | Dataset overview
The 13 073survey responsesare summarized in Figure 4. Votes in
Q.1— “Thermal preference” were mostly “No Change” (58%) followed
by“Cooler ”(35%).This study took part duringthe COVID-19pan-
demic, and most of the participant s had to work from home for the
whole study duration. Participants in their homes had full control of
the air- conditioning set- point and could use electric fans to increase
airspeed in their surroundings.
Most of the participants reported being involved in sedentary
activities in 77% of the cases. Par ticipants perceived air movement
only less t han 30% of the time , and 69% of them wore “Light ” clothes.
To better depict how participants perceive their thermal environ-
ment, in Figure 5 we plotted the distribution of the thermal prefer-
ence votes (Q.1) grouped by the participant. While the great majority
voted “No Change,” two wanted to be “Cooler” more than 90% of the
time. Even if participants had similar distributions of thermal pref-
erence votes, such as participants 05 and 13, they might have dif-
ferentthermalcomfortneeds, requirements,andpreferences.This
situation can be explained by the fact that the participants wore dif-
ferent clothes, engaged in different activities, and were exposed to
different environmental conditions. The values of
ti
recorded when a
participant completed the survey are shown in Figure 6. The Figure
alsodepictstheoutdoortemperaturemeasuredinSingaporeduring
the entir e study peri od. Singapo re is charac terized by a tropi cally
hot and humid climate with limited seasonal temperature variation.
Temperature variation mainly occurs intra- day.
The thermal preference votes grouped by the self- reported
clothing and metabolic rates are shown in Figure 7. Participants ac-
tively adjusted clothing to improve their thermal comfort . They wore
“Very light” clothes to compensate for warm indoor air tempera-
tures. Participants also actively increased their clothing levels when
exposed to temperatures they deemed to be “Cold.” Thus, 67% of
participants wearing “Heavy” clothing felt comfortable. Wearing
16000668, 2022, 11, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/ina.13160, Wiley Online Library on [10/12/2022]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
8 of 16
|
TARTAR I NI et al .
more clothes alone did not always suffice to compensate for cold
indoor conditions.Overcooling indoors was theleading cause that
27% of them wanted to be “Warmer,” even though participants wore
“Heavy” clothing in a tropical climate. This is a common issue for
buildings located in the tropics.46Overcoolingdoesnotonlyneg-
atively impact building energy consumption, but in the tropic s has
also been shown to worsen occupants' cognitive per formance.47
Approximately74%oftheparticipantswhorepor tedtobe“Resting”
voted“NoChange”inquestionQ.1.
3.3 | Thermal preference personal comfort models
The prediction accuracy of the personal comfort model we devel-
oped is depicted in Figure 8. The figure shows the F1- micro scores
for the three sets of variables grouped by the supervised machine
learning model we used to train the personal comfort models. We
also report the PMV model results.
The prediction accuracy of all the personal comfort models de-
veloped with the supervised machine learning algorithms was sig-
nificantly (p < 0.01) andsubstantially(excludingXGB)higher(≈37%)
thantheresultsobtainedfromthePMVmodel.Inourstudy,weonly
qualitatively loggedclothing levels and metabolic rates,andwe did
not measur e airspeed as d etailed in Sec tion 2. Hence, we do not
have sufficient evidence to prove that the PMV has low predic-
tion accuracy. We simply report the results of the PMV to provide
a benchmark to show the increase in accuracy that personal com-
fort models can achieve. This is, however, a common issue in real
buildings, hence these values must also be assumed to calculate the
PM V.
Oneofthe main objectives ofthisstudy was todeterminehow
different sub- sets of variables would affec t the accuracy of the mod-
els .Addi ngani ncrease dnumberofv ar iablest othem od eldidnot al-
waysimproveitsaccuracy.Insomecases,ithadtheoppositeeffect
andledtoadecreasedF1-microscore.Similarresultswerealsoob-
tained in previous studies.18 This can be partially explained because
participants completed surveys in near- steady- state conditions.
Hence, including historical data is not always beneficial. Moreover,
self- reported clothing and activity may not have accurately enough
represented participants' actual clothing ensembles or metabolic
rates since their selection was limited to four choices. This is a pos-
itive result since in a real- life scenario we would not have access to
this information.
The distribution of the F1- micro scores was significantly dif-
ferent when we compared the results of the following models:
XGB,SVM, RDF,LR ,MLPusingdifferentvariablesets. However,
the significant increase in model complexit y would not justify the
modest increase in prediction accuracy in most practical applica-
tions. On average,trainingonemodel once withthefull variable
set for eac h 20 users resul ted took 83, 6, 62 0, 11, and 67 s for
XGB,SVM,RDF,LR,MLPmodels,respectively.Weconsequently
decided to present only theresults fromtheSVM modeltrained
with the environmental— wearable— time independent sets of vari-
ables in Figures 9 and 10. Firstly,becausetheSVMmodel isless
computationally intensive to train and secondly because it is a lin-
ear model, hence it is better suited to predict thermal preference
which is an ordinal variable. We are providing supporting evidence
onthisinSection4. Linearmodelsuseamultidimensionalhyper-
plane to classify the data, this may lead to lower prediction ac-
curacy if compared with non- linear models. Nevertheless, linear
models ensure that as
ti
increases, all other variables being fixed,
the prediction does not switch back and forth between “Warmer,”
“No Change,” and “Cooler.” This issue is particularly relevant when
personal comfort models are used in real- life applications to op-
erate buildings. Non- linear model predictions may be the cause of
instabilities inthe HVACcontrollerand limit theuse of personal
comfort models to control buildings.
3.3.1 | Influenceofdatasizeonpredictionpower
Figure 9A depicts how the F1- micro score varies as a function of
the number of training data points for each participant. The figure
also shows the F1 mean score (black line) and its standard deviation
(shaded area) across all participants.
The sample average accuracy mean score plateaued at around
≈300datapoints.Thissuggest sthatthismaybetheoptimalnumber
of points we may need to collect when training personalized comfort
models.It should be noted thatthere was high variabilit ywhen the
curve plateaued for each individual. This is due to the inherited dif fer-
ences across the personal preferences of subjec ts and the conditions
TAB LE 2 Informationaboutthesubjects
ID Sex Age Education BMI (kg/
m2
)
1 M 38 Doctoral degree 23 .51
2 M 36 Doctoral degree 29. 4 0
3 M 30 Doctoral degree 25.54
4 F 40 Master's degree 18.29
5 M 31 Doctoral degree 25.39
6 M 44 Doctoral degree 21.22
7 F 30 Bachelor's degree 25.93
8 M 35 Doctoral degree 25.10
9 F 24 Master's degree 23.24
10 M24 High school graduate 23.05
11 F29 Master's degree 20.20
12 M34 Doctoral degree 28.20
13 M31 Bachelor's degree 25.34
14 M35 Bachelor's degree 23.03
15 F33 Doctoral degree 18.34
16 F26 Bachelor's degree 20.45
17 F36 Doctoral degree 18.37
18 F26 Bachelor's degree 22.04
19 F24 Bachelor's degree 16.44
20 F32 Doctoral degree 20.96
16000668, 2022, 11, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/ina.13160, Wiley Online Library on [10/12/2022]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
|
9 of 16
TAR TAR I NI et al.
they were exposed to. Figure 9B shows the overall accuracy of each
personalcomfort modeloverthe100iterations.Itcanbeobserved
that each personal comfort model converged to a st able value across
all 100 iterations. The standard deviation of all 20 personal comfor t
models over all 100 iterations was similar across different partici-
pants, with a mean value of 0.035 and a standard deviation of 0.011.
The same cannot be said about the overall accuracy of each personal
comfort model, where the median F1 score for participant 14 was
0.99 while for participant 7 was 0.56. This, in other words, means
thatnotallpersonalcomfortmodelsperformedequally.Somealmost
always correctly predicted the thermal preference vote reported by
the participants, while others had a significantly lower accuracy.
3.3.2 | Importanceofindependentvariables
The absolute mean SHAP values across all six best-performing
supervised machine learning models are shown in Figure 10. Sub-
variablesgroupsdefinedinSection2.6.4 are color- coded. While in-
door air temperature (
ti
), wrist near body temperature (
t
nb
,w
), heart
FIGURE 3 Wristskintemperature(
tsk,w
) and wrist near body temperature (
tnb,w
) measured when the participants completed the RHRN
survey.(A)Showsallthedatacollectedfromtheparticipantswhile(B)showsthesub-setoftheoriginaldatasetthatwasusedinthedata
analysis.TheinclusioncriteriaweusedtofiltertheoriginaldatasetaredetailedinSection3 .1. The number above each violin plot is the
number of RHRN surveys completed by each participant.
16000668, 2022, 11, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/ina.13160, Wiley Online Library on [10/12/2022]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
10 of 16
|
TARTAR I NI et al .
FIGURE 4 Distributionoftheanswers
provided by all the par ticipants.
FIGURE 5 Distributionofthethermal
preference responses (Q.1) provided by
each par ticipant throughout the study
period.
FIGURE 6 Indoorairtemperature(
ti
) measured when participants completed
the RHRN survey. Data have been
grouped by the participant. The last violin
plot (purple) shows the average outdoor
airtemperaturemeasuredinSingapore
(SG)throughoutthewholedurationofthe
stud y.
16000668, 2022, 11, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/ina.13160, Wiley Online Library on [10/12/2022]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
|
11 of 16
TAR TAR I NI et al.
rate (HR), wrist skin temperature (
t
sk
,w
), and humidity ratio indoors
(Wi) contributed the most to the models' final predictions, we ob-
serve d a significant d ifference of SH AP values bet ween differ ent
parti cipants an d across dif ferent mod els. In Figu re A.3, we report
the mean S HAP values a cross all par ticipant s for each sup ervised
machine-learning model. A detailed discussion of these results is
presentedinSection4.2.
4 | DISCUSSION
The results of our study enabled us to draw several connections to
the existing literature, discuss the usefulness and limitations of the
methodology and result s, and motivate future work.
4.1 | Impact of training data size on
model prediction
Onenovelaspectof our studywastheduration ofthedatacollec-
tion, which enabled us to gather the longest longitudinal data set
so far among studies that aimed to develop personal thermal com-
fort models.28 We collected more than double the amount of points
per participant and we made the dataset publicly available. Personal
comfort models necessitate data for both testing and training.
Hence, a suf ficiently large number of data points from each par-
ticipantisrequiredforthemachine-learningalgorithmtoconverge.
Figure 9 illustrates how increasing the size of trained data improves
themodelpredictionpowerbasedonthecollecteddataset.Across
all participants, the model prediction accuracy (F1- micro) stabilized
toa plateauataround 30 0data points.Individual personal models
show varying degrees of sensitivit y to dataset size. This insight high-
lightsthediminishingreturnofcollectingmore than250–300data
points for most test participant s. This result is specific to our study
and other authors may find a dif ferent range based on their study
methodology. Our results agree and provide additional suppor t-
ing evidence to validate those obt ained by Liu et al.18Arguably,the
amount of dat a needed to characterize thermal comfort could be re-
duced even further with the development of targeted sampling that
strategicallyrequestsfeedbackonlywhenrequiredtoincreasethe
model prediction power.48Inou rst ud y,wea lre ad yimpl emented th is
strateg y. Participants received a text message when exposed to en-
vironmental conditions that they rarely experienced before, to maxi-
mize the chances of obtaining a balanced dataset. However, we still
asked them to complete, on average, a total of six surveys per day.
Thisrequirementcanbesignificantlyreducedorremovedaltogether
in future studies thanks to t argeted surveys. For some participants,
the prediction accuracy slightly decreased as the trained data size
increased from 42 to 126. This situation is expected since, as time
passes, they may be exposed to a broader range of environmental
FIGURE 7 Distributionofthethermal
preference responses (Q.1) provided by
all participant s throughout the study
period grouped by their reported clothing
insulation (Q.4) and metabolic rate (Q.6).
The number above each bar shows the
total number of responses collected for
that specific answer.
FIGURE 8 F1-microscoresforthe
thermal preference personal comfort
models determined using the full dataset
for each participant over 100 iterations.
The light blue shaded area depicts the
interquartilerangeforthePMVmodel.
We used the following abbreviations:
MLP, Multi- Layer Perceptron; RDF,
RandomForest;SVM,SupportVector
Machine;KN,K-NearestNeighbors;GNB,
GaussianNaiveBayes;XGB,Extreme
GradientBoosting;LR,LogisticRegression
16000668, 2022, 11, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/ina.13160, Wiley Online Library on [10/12/2022]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
12 of 16
|
TARTAR I NI et al .
factors and conditions that they did not experience before, and the
model needs to learn how to predict par ticipants' thermal prefer-
ences under these new sets of conditions. This result is a signifi-
cant advantage that personal comfor t models have over aggregate
models since they can be re- trained as new data are collected. This
situation may be partially alleviated by the use of transfer learning,
ensemble strategies, and domain adaptation which can be used to
predict individual thermal preference even when there is a lack of
data regarding a specific person.49, 50
We also obser ved that, for some participants, the F1- micro curves
did not var y much as a function of the data size (e.g., participants 9
and10).Somepossiblecausesofthisarethatparticipantswerecon-
stantly expose d to warm temperatures and t hat some did not mainta in
compliance with experimental guidelines. The latter point is discussed
in Sect ion 4.3. For example, participant 10 was always exposed to
temperatures above 27.5°C when completing the RHRN survey and
reported wanting to be “cooler” 98% of the time. This scenario is ex-
pectedinSingapore,wheretherecordedoutdoor temperatureover
the 6- month study period was higher than 26.5°C for 75% of the time.
4.2 | Independent variables' importance in thermal
preference prediction
WeusedSHAPvalue stoqua ntifyoft hei mpactthateachindep end-
ent variable had on the accuracy of the personal models. While the
average magnitude for each variable varied in different models in-
door air temperature (
ti
), wrist near body temperature (
t
nb
,w
), heart
rate (HR), wrist skin temperature (
t
sk
,w
), and humidity ratio indoors
(Wi) contributed the most to the models' final predic tions. This in-
sight is in line with the existing body of knowledge since
ti
is the
primar y driver of sensible heat loss or gain from the environment to
FIGURE 9 F1-microscoresforthethermalpreferencepersonalcomfortmodelsdeterminedusingtheSuppor tVectorMachine(SVM)
algorithm.(A)ShowsthemeanF1-microscoreforeachparticipant,aswellasthemeanscore(blackline)andst andarddeviationacross
(shaded area) the whole study sample. The markers show the participant's mean F1- micro scores calculated by averaging the mean scores
obtainedacrossthe100iterations,forthatspecificnumberoftrainingdatapoints.Adifferentnumberofvalidsurveyswerecompletedby
differentparticipants.Thebarplot,in(A)overthechart,showsthenumberofanswersthatwereusedtocalculatethesamplemeanscore
andtherespectivestandarddeviation.(B)ShowsalltheF1-microscoresdeterminedusingthefulldatasetforeachparticipantover100
iterations
FIGURE 10 AbsolutemeanSHAP
value of the six best- performing
supervised machine learning models.
Variables are color- coded, environmental—
using shades of gray, wearable— using
shades of purple, and time— using shades
of orange. Where
tout
stands for outdoor
air temperature and
Wout
stands for
humidity ratio outdoors.
16000668, 2022, 11, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/ina.13160, Wiley Online Library on [10/12/2022]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
|
13 of 16
TAR TAR I NI et al.
thehumanbody.Ourresultsreinforcepreviouswork.18 The HR is a
proxy for the level of ac tivit y of the person, and it is positively corre-
lated with the metabolic rate. The value of
t
sk
,
w reflects the vasomo-
tor tone. The human body uses vasoconstriction and vasodilation for
thermoregulation.15 Fina lly, Wi influences the latent heat loss toward
theenvironment.On theotherhand,theoutdoorairtemperature,
occupant location, and outdoor humidity ratio only had a marginal
contribution to the final prediction, which can be explained by the
fact that these variables do not directly influence people's thermal
sensation or preference, in particular during steady- state conditions.
The value of the outdoor air temperature only indirectly affec ts oc-
cupants' thermal preferences since they may influence the t ype of
clothing that participants decide to wear before leaving their homes.
This result may, however, only be applicable to climates similar to the
oneinSingaporethatarecharacterizedbylimitedvariability.
4.2.1 | Self-reportedclothingandactivity
We found that including self- reported clothing and activity in some
models did not significantly increment the model prediction accu-
racy. While this seems to be counterintuitive since both clothing and
metabolic rate play a significant role in human thermoregulation, we
believed that they did not increase the model prediction accuracy
sincetheywerereportedqualitativelybyparticipantswhoonlyhad
fouroptionstochoosefrom.OthermeasuredvariableslikeHR may
better correlate with the participant's actual metabolic rate than
self- reported activity. This result has positive implications since, in a
real- world application, the building controller would not have access
to information about clothing and activity levels.
4.2.2 | Near-bodytemperature
While our results showed that
t
nb
,w
significantly contributed to the
model prediction, it should be noted that
t
nb
,w
was strongly corre-
lated with both
ti
and
t
sk
,w
.Consequently, it would besufficient to
measurethesetwolattervariablesinmostcases.Ontheotherhand,
only using
t
nb
,w
as a proxy for
ti
would decrease the complexity of
the data collection, but at the same time, it would reduce the over-
all model accuracy. We decided to measure, log, and include in the
models
t
nb
,w
since many people in warm climates use fans to cool
themselves. Measuring airspeed in the proximity of the occupants
in longitudinal studies is impractical, very expensive, and inaccurate.
Battery- powered anemometers would need to be recharged fre-
quently,areveryexpensive,andaresensitivetodirec tion.Airspeed
variessignificantlybothspatiallyandtemporally;consequently,ac-
curate readings can only be obtained in laboratories using scientific-
grade sensors installed on stands mounted near the subject. The
value of wrist near body temperature c an then be used as a proxy
to partially compensate for the lack of airspeed data. When airspeed
is low,
t
nb
,w
is significantly af fected by the thermal plume of the par-
ticipant and in turn by
tsk
.
51 On the other hand, when participants
are cooling themselves using electric fans, the airflow disrupts the
thermal plume, and
t
nb
,w
is mainly influenced by
ti
.
4.2.3 | Skintemperature
Participants did not report any significant discomfort by wearing the
iButto n for an extende d period. At the e nd of the study, 16 par-
ticipant sansweredpositivelytothefollowingquestion:“Wouldyou
wear the Fitbit and complete a few surveys p er day for two weeks for
no financial reward, if you knew that the information would improve
your well- being indoors?” However, measuring
t
sk
,w
using an iBut-
ton adds complexit y and maybe still a source of mild discomfort for
some people. iButton cannot communicate wirelessly; hence data
cannot be accessed in real time. There have been several announce-
ments from the leading smar twatch manufacturers to include a skin
temperaturesensor intheirdevices.Still,atthetimeof writingthis
manuscript, no smartwatch available on the market could measure
it accurat ely.H owever, in September 2 022 at the time of r eview-
ingthismanuscript,Appleannouncedthattheyhavereleasedanew
AppleWatchthatcanaccuratelymeasureskintemperature.
4.2.4 | Historicalvariables
The increases in model accuracy when historical variables were added
to the model did not justify the increased complexity. This situation
can be par tially explained by the fact that we carefully chose to ana-
lyze data collected when participants were in near “steady- state” con-
ditions. This choice was driven by the fac t that people in thei r office, on
average, spend most of their time at their desks in near “steady- state”
conditions. Predicting how people perceive their thermal environment
during transitor y conditions goes beyond the scope of our research.
4.3 | The compliance rate of participants and data
quality considerations
Sixmonthsofthedailylongitudinalcollectionisachallengeinterms
of ensuring that participants maint ain compliance with experimental
guidelines. The Cozie smartwatch- based methodology turned out
to facilitate high compliance with none of the participant s dropping
out from the study, and all completed at least 1080 surveys. This
result reinforces previous work inmicro-EMA and its ease of de-
ployment in collecting longitudinal data with less sur vey fatigue.32
Compliance maintenance was enhanced with notifications sent
through a messaging app that would remind the par ticipants about
notable achievements or deficiencies in the experimental process.
Despite the compliance rate, some participants were not fully
cognizant of their perspective on each response given over the
6 monthsduetosurveyfatigue.Thisriskcouldbemitigatedinfuture
work through early detection, incentives, and by significantly re-
ducing the number of surveys that each participant has to complete
16000668, 2022, 11, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/ina.13160, Wiley Online Library on [10/12/2022]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
14 of 16
|
TARTAR I NI et al .
every week. This risk is significant for data- driven models, which are
highlysusceptibleto“bad”data.Onepossibleothersolutiontothis
problem is utilizing the model to control their environment actively.
4.4 | Limitations
One notable limitation ofthe deployment is that the Singapore cli-
matehaslittle diversityacrosstheyear.Seasonalityinotherclimates
mayresultinlongitudinaldataneedingmoretrainingbeyondthe200–
300pointsfoundinthisstudy.Studiesinotherclimatesmayneedto
spread data collection into phases that account for different seasons.
In addition, the experimental deployment for this study began
in April 2020, just as Singapore entereda lockdown period due to
COVID-19restrictions.Throughoutthestudy,thelockdownsituation
was dynamic, but overall there was less diversity of data collection
locations than intended. Most of the occupants were forced to work
from home for the whole duration of the study, while those who were
allowed to resume going to the office were required to wear face
masks at all times. We started this study before the pandemic started,
hencewedidnotincludeanyquestionsaboutfacemasks.
Another notable limitation category relates to the nature of
black- box machine learning models in the application of thermal
comfort prediction. The lack of conversion of model output or ac-
curacy into the physical understanding of what makes people feel
comfort able or not is troublesome in the context of improving com-
fort, particularly for facility operators. Future work should focus
on the conversion of the accuracy of predic tion to the applicabil-
ity to system and occupant interaction. The previously mentioned
personal comfort review found similar insight in the literature
of such models.28 Among the different models tested, Random
Forest is one of the most widely adopted in the literature and it s
performance justifies its adoption (Figure 8). Nevertheless, when
comparedto a regression-based model like SVM with similar pre-
dictionperformance,RandomForestrequired100timesmorecom-
putational time for model training, i.e., 620 and 6 s, respectively.
Coincid entally, XGB and MLP also a chieve a similar per formance
but requi re roughly 12 ti mes the compu tational tim e of SVM, 83
and67 s,respectively.TheseresultsreinforcetheselectionofSVM
since it does not sacrifice prediction accuracy; as a regression- based
model ,itismorei nterpre tabl eandr eq uiresle sscompu tati onalc os t.
It should a lso be noted that s ince some machi ne learning m odels
are not linear, like RDF, this may cause the personal comfort model
may still predict thermal preference to vary back and forward from
“warmer” to “cooler” as the temperature increases, despite all other
inputs being fixed. This situation has several issues. Firstly, it does
not provide an accurate representation of how people perceive their
thermal environment nor take into account that thermal preference
isanordinal variable. Secondly,it may be the causeofinstabilities
if the model is used to actively control a space. We believe that this
issue has had very lit tle coverage in previous studies that aimed to
develop personal thermal comfort models, and it should be further
investigated.
5 | CONCLUSIONS
We conducted a longitudinal thermal comfort study that aimed to
develop personal thermal comfort models. Twenty participants took
part in it, and they completed on average at least six RHRN sur veys
per day for a period of 6 months. We developed an effective meth-
odology that simplified the life of the participants, and none of them
dropped from the study. We measured and logged environmental
parameters, physiological signals, outdoor weather data, and partici-
pants' location outdoors and indoors. We used these dat a to train
and test a personal thermal comfort model for each par ticipant. We
were able to determine that:
• Cozie,amicro-EMAopen-sourceFitbitandAppleapplication,isa
reliable and robust solution to non- intrusively collect participants’
feedback in field studies.
• Personal comfort models were able to accurately predict (median
F1- micro score 0.78) occupants’ thermal preferences. With the
limitations in data collection posed by the study methodolog y,
they could outperform the PMV model.
• Indoor air temperature(
ti
), wrist near body temperature (
t
nb
,w
),
heart rate (HR), wrist skin temperature (
t
sk
,w
), and humidity ratio
indoors (Wi), listed in decreasing order of importance, had the
highest average marginal contribution to the overall model
prediction.
• The thermal personal comfort model prediction accuracy (F1-
micro) plateaued at around 300 data points across all par ticipants.
Individualpersonal modelsare sensitiveto datasetsizetovary-
ing degre es. The amoun t of data require d to characte rize ther-
mal comfort could potentially be reduced with the development
oftargetedsampling,whichstrategicallyrequestsfeedbackonly
when it is necessary.
• We made available publicly the data we collected and open-
sourced the Python code we used to analyze them to enable
other researchers to test different hypotheses utilizing our data.
NOMENCLATURE
HR heart rate, beat s per minute
PMV Predicted Mean Vote
RHRN Right- Here- Right- Now
SVM SupportVectorMachine
ti indoor air temperature, °C
tnb,w wrist near body temperature, °C
tsk skin temperature, °C
tsk,w wrist skin temperature, °C
Wi humidity ratio indoors, kgwater vapor/kgdry a ir
ACKNOWLEDGEMENTS
This rese arch has been sup ported by the Re public of Singap ore's
National Research Foundation through a grant to the Berkeley
Education Alliance for Research in Singapore (BEARS) for the
Singapore-Berkeley Building Efficiency and Sustainability in the
Tropics(SinBerBEST )Program.
16000668, 2022, 11, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/ina.13160, Wiley Online Library on [10/12/2022]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
|
15 of 16
TAR TAR I NI et al.
FUNDING INFORMATION
This research has been supported by the Republic of Singapore's
National Research Foundation through a grant to the Berkeley
Education Alliance for Research in Singapore (BEARS) for the
Singapore-Berkeley Building Efficiency and Sustainability in the
Tropics(SinBerBEST)Program.
CONFLICT OF INTEREST
The authors declare that they have no known competing financial
interests or personal relationships that could have appeared to influ-
ence the work repor ted in this paper.
DATA AVA ILAB ILITY STATE MEN T
The source code we used to analyze the data and the full dataset are
publicly available at this URL: https://github.com/Feder icoTa rtari ni/
dorn- longi tudin al- tc- study.
ORCID
Federico Tartarini https://orcid.org/0000-0002-8739-5062
Stefano Schiavon https://orcid.org/0000-0003-1285-5682
Matias Quintana https://orcid.org/0000-0002-0486-221X
Clayton Miller https://orcid.org/0000-0002-1186-4299
REFERENCES
1. GrahamLT,ParkinsonT,SchiavonS.Lessonslearnedfrom20 years
of CBE's occupant sur veys. Build Cities. 2021;2(1):166- 184.
2. ISO. ISO 7730 – Ergonomics of the thermal environment –
Analy tical determination and interpretation of thermal comfort
using calculation of the PMV and PPD indices and local thermal
comfort criteria. 2005.
3 . ANSI and A SHRAE . Standar d – 55 t hermal env ironment al condi-
tionsforhumanoccupancy.Atlanta,2020.
4. FangerPO.Thermal comfort: analysis and applications in environmen-
tal engineering.McGraw-Hill;1970.
5. Kim J, Sch iavon S, Brager G. P ersonal comfor t models – a new
paradigm in thermal comfort for occupant- centric environ-
mental control. Build Environ. 2018;132:114- 124. doi:10.1016/j.
buildenv.2018.01.023
6. XieJ, LiH ,LiC,ZhangJ,Luo M.Reviewonoccupant-centricther-
mal comfor t sensing, predicting, and controlling. Energy Build.
2020;226:110392. doi:10.1016/j.enbuild.2020.110392
7. deDearRJ,AkimotoT,ArensEA,etal.Progressinthermalcomfor t
research over the last twenty year s. Indoor Air. 2013;23:442- 461.
doi:10.1111/in a.1 20 46
8. Van Hoof J. Forty years of Fanger's model of thermal
comfort: comfort for all? Indoor Air. 2008;18:182- 201.
doi:10.1111/j.1600- 0668.2007.00516.x
9. Cheung T,Schiavon S, Parkinson T,LiP,Brager G. Analysis of the
accuracyonPMV– PPD modelusingtheA SHR AEglobal thermal
comfortdatabaseII.Build Environ. 2019;153:205- 217. doi:10.1016/j.
buildenv.2019.01.055
10. LeeS,BilionisI,KaravaP,Tzempelikos A.ABayesianapproach for
probabilistic classification and inference of occupant thermal pref-
erences in office buildings. Build Environ. 2017;118:323- 343.
11. KonisK ,AnnavaramM.Theoccupant Mobile gateway:apar tici-
patory sensing and machinelearning approach for occupant- aware
energy management. Build Environ. 201 7;118: 1- 13.
12. LeeS,Kar avaP,TzempelikosA,BilionisI.Inferenceofthermalpref-
erence profiles for personalized thermal environments with actual
building occupants. Build Environ. 2019;148:714- 729.
13. GuentherJ,SawodnyO.Featureselectionandgaussianprocessre-
gression for personalized thermal comfor t prediction. Build Environ.
2019;148:448- 458.
14. Aryal A , Bece rik-Gerbe r B. Therm al comfor t modelin g when per-
sonalized comfort systems are in use: comparison of sensing and
learning methods. Build Environ. 2020;185:107316.
15. Romanovsky A A. The thermoregulation system and how it
works. Vol 156. 1st ed. Elsevier B.V.; 2018:3- 43. doi:10.1016/
B978- 0- 444- 63912- 7.00001- 1
16. Choi JH, Yeom D. Study of data-driven thermal sensation pre-
diction model as a function of loc al body skin temperatures in a
built environment. Build Environ. 2017;121:13 0- 147. doi:10.1016/j .
buildenv.2017.05.004
17. Choi J-H, Loftness V.Investig ation of human body skin tempera-
tures as a bio- signal to indicate overall thermal sensations. Build
Environ. 2012;58:258- 269. doi:10.1016/j.buildenv.2012.07.003
18. LiuS,SchiavonS,dasHP,JinM,SpanosCJ.Personalthermalcom-
fort mo dels with weara ble sensors. Build Environ. 2019;162:106281.
doi:10.1016/j.buildenv.2019.106281
19. JiW, Cao B, GengY,Zhu Y,Lin B. Studyonhuman skin tempera-
ture and thermal evaluation in step change conditions: from non-
neutrality to neutrality. Energy Build. 2017;156:29- 39. d oi:10.1016/j.
enbuild.2017.09.037
20. Xiong J, Zhou X, Lian Z, You J, Lin Y. Thermal perception and
skin temperature in different transient thermal environments
in summer. Energy Build. 2016;128:155 - 163. do i:10.1016/j.
enbuild.2016.06.085
21. Kat ić K, Li R , Zeiler W. Machin e learning a lgorithm s applied to
a predic tion of personal overall thermal comfort using skin
temperatures and occupants' heating behavior. Appl Ergon.
2020;85:103078.
22. Ghahra mani A, Cas tro G, Becer ik-G erber B, Yu X. Inf rared ther-
mography of human face for monitoring thermoregulation per-
formance and estimating personal thermal comfort. Build Environ.
20 16; 10 9 :1- 11 . doi:10.1016/j.buildenv.2016.09.005
23. Li D, Menassa CC, Kamat VR. Robust non- intrusive interpret a-
tion of occupant thermal comfort in built environments with low-
cost net worked thermal c ameras. Appl Energy. 2019;251:113336.
doi:10.1016/j.apenergy.2019.113336
24. CosmaAC ,SimhaR.Thermalcomfortmodelingintransientcondi-
tions using real- time local body temperature extraction with a ther-
mographic camera. Build Environ. 2018;143:36- 47. doi:10.1016/j.
buildenv.2018.06.052
25. van Marken Lichtenbelt WD, Daanen HAM, Wouters L , et al.
Evaluation of wireless determination of skin temperature using
iButtons. Physiol Behav. 2 0 0 6 ; 8 8 ( 4 – 5 ) : 4 8 9 - 4 9 7 . doi:10.1016/j.
physbeh.2006.0 4.026
26. Hasselberg MJ, McMahon J, Par ker K. The validity, reliability, and
utility of the iButton® for measurement of body temperature cir-
cadian r hythms in sleep/wake research. Sleep Med. 2013;14(1):5- 11.
doi:10.1016/j.sleep.2010.12.011
27. TartariniF and Schiavon S. Skin temperature sampling period for
longitu dinal ther mal comfor t studies. I n: 16th Conferen ce of the
Internat ional Socie ty of Indoor ai r Quality an d Climate: Crea tive
and Smar t Solutions for B etter Built Environments, Indoor Air
2020.2020;1–20.doi: 10.5281/zenodo.4289851
28. MartinsL A, Soebarto V,WilliamsonT.Asystematicreviewofper-
sonal thermal comfort models. Build Environ.2022;207.PA:108502.
doi:10.1016/j.buildenv.2021.108502
29. NgarambeJ,YunGY,KimG.Predictionofindoorclothinginsulation
levels: a deep learning approach. Energy Build. 2019;202:109402.
30. KimJ,ZhouY,Schiavon S,et al.Personalcomfor tmodels:predic t-
ing individuals' thermal preference using occupant heating and
cooling behavior and machine learning. Build Environ. 2018;129:96-
106. doi:10.1016/j.buildenv.2017.12.011
16000668, 2022, 11, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/ina.13160, Wiley Online Library on [10/12/2022]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
16 of 16
|
TARTAR I NI et al .
31. ShiffmanS,StoneAA,HuffordMR .Ecologicalmomentar yassess-
ment. Annu Rev Clin Psychol. 2 0 0 8; 4:1- 3 2.
32. Intille S , Haynes C, M aniar D, Ponnad a A, Manjo urides J. μEMA :
microinteraction- based ecological momentary assessment
(EMA) using a smartwatch. Proc ACM Int Conf Ubiquitous Comput.
2016;2016:1124- 1128.
33. Ponnada A , Haynes C, Maniar D, Manjourides J, Intille S.
Microinteraction ecological momentary assessment response
rates: ef fect of microinteractions or the smartwatch? Proc ACM
Interact Mob Wearable Ubiquitous Technol. 2017;1(3):1- 16.
34. Jayathiss a P, Q uintana M, So od T, Nazari an N, Miller C. Is yo ur
clock-facecozie?Asmartwatchmethodologyforthein-situcollec-
tion of occupant comfort dat a. J Phys Conf Ser. 2019;1343(1):12145.
doi:10.10 88/1742- 6596/13 43/1/012145
35. Tartarini, F.,Miller,C., & Schiavon, S.(2022). CozieApple:aniOS
mobileandsmartwatchapplicationforenvironmentalqualitysatis-
faction and physiological data collection (Version 1). a rXiv. h t tp s : //
doi.org/10.48550/ARXIV.2210.13977
36. JayathissaP,QuintanaM,AbdelrahmanM,MillerC.Humans-as-a-
sensor for buildings— intensive longitudinal indoor comfort mod-
els. Buildings. 2020;10(10):174. doi:10.3390/buildings10100174
37. Quintana M, Abdelrahman M, Frei M, Tartarini F, Miller C.
Longitudinal personal thermal comfor t preference dat a in the
wild. In: Proceedings of the 19thACMConference on Embedded
Networ ked Sensor Systems. SenSys ‘21. Coimbra, Por tugal:
Association for Computing Machinery, 2021, pp. 556–559. doi:
10.1145/3485730.3493693
38. Quintana, M., Schiavon, S., Tart arini, F., Kim, J., & Miller, C.
(2023). Cohort comfort models — Using occupant’s similarity
to predic t personal thermal preference with less data. Building
and Environment, 227, 109685. https://doi.org/10.1016/j.build
env.2022.109685
39. Quintan a M, Schiavon S, Th am KW, Miller C. B alancing the rmal
comfortdat asets:we GAN,butshould we?In:Proceedingsofthe
7thACMInternationalConferenceonSystemsforEnergy-Efficient
Buildin gs, Citie s, and Transpor tation. B uildSys ‘20. V irtual Even t,
Japan:Association forComputing Machinery.2020;120–129.doi:
10.1145/3408308.3427612
40. Abdelrahman MM , Chong A , Miller C. Per sonal thermal com-
fort mo dels using digi tal twins: p reference pre diction wit h BIM-
extr acted spatial–temp oral proximit y data from Bui ld2Vec. Build
Environ. 2021;207:108532.
41. NationalEnvironmentAgencySingapore. Realtimeweather read-
ingsacrossSingapore.2020.https://data.gov.sg/datas et/realt ime-
weath er- readings.Accessed:2020-09-03.
42. Lundber g SM, Lee S-I . A unified app roach to interp reting mode l
predic tions.In: GuyonIet al., eds. Advances in Neural Information
Processing Systems 30.CurranAssociates,Inc.;2017:4765-4774.
43. RibeiroMT,SinghS ,andGuestrinC.“WhyshouldItrustYou?”:ex-
plainingthepredictionsofanyclassifier.In:CoRRabs/1602.04938
2016.arXiv:1602.04938.url:http://arxiv.org/abs/1602.04938.
44. AppleInc.Apple reveals AppleWatch Series8 andthenewApple
Watch SE. 2022. https://www.apple.com/newsr oom/2022/09/
apple - revea ls- apple - watch - serie s- 8- and- the- new- apple - watch
- se/.Accessed:2022-09-13.
45. DaweM, Raftery P,WoolleyJ,SchiavonS,BaumanF.Comparison
of mean radiant and air temperatures in mechanically- conditioned
commercial buildings from over 200,00 0 field and laborator y
measurements. Energy Build. 2020;206:109582. doi:10.1016/j.en-
build.2019.109582. https://linki nghub.elsev ier.com/retri eve/pii/
S 0 3 7 8 7 7 8 8 1 9 3 1 4 8 5 9
46. Sekhar SC. Thermal comfort in air-conditioned buildings in hot
and humid climates - why are we not get ting it right? Indoor Air.
2016;26:138- 152. doi :10.1111 /ina.1218 4
47. SchiavonS,YangB,DonnerY,ChangVW-C,NazaroffWW.Thermal
comfor t, perceived air qu ality, and cognitive performance when
personally controlled air movement is used by tropically acclima-
tized persons. Indoor Air. 2016;27:690- 702. doi:10.1111 /ina.1 2352
48. Roa CD, Schiavon S, Park inson T. Targeted occupant sur veys: a
novel method to effectively relate occupant feedback with envi-
ronmental conditions. Build Environ. 2020;184:107129.
49. Park H, Park DY. Prediction of individual thermal comfort based on
ensemble transfer learning method using wearable and environ-
mental sensors. Build Environ. 2022;207.PB:108492. doi:10.1016/ j.
buildenv.2021.108492
50. Das HP, Schiavon S, Spanos CJ. Unsupervise d personal the rmal
comfort prediction via adversarial domain adaptation. Proceedings
of the 8th ACM International Conference on Systems for Energy-
Efficient Buildings, Cities, and Transportation.Vol1.ACM;2021:230-
231. doi:10.1145/3486611.3492231
51. ZhangY,LiuJ,PeiJ,WangC. Statisticalanalysisof turbulentther-
mal convec tion in a cabin mockup. Build Environ. 2017;115:34- 41.
doi:10.1016/j.buildenv.2017.01.012 https://linki nghub.elsev ier.
c o m / r e t r i e v e / p i i / S 0 3 6 0 1 3 2 3 1 7 3 0 0 1 2 4
SUPPORTING INFORMATION
Additional supporting information can be found online in the
Suppor tingInformationsectionattheendofthisarticle.
How to cite this article: TartariniF,SchiavonS,QuintanaM,
Miller C. Personal comfort models based on a 6- month
experiment using environmental parameters and data from
wearables. Indoor Air. 2022;32:e13160. doi:10.1111/
ina.13160
16000668, 2022, 11, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/ina.13160, Wiley Online Library on [10/12/2022]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License