ArticlePDF Available

Probabilistic Load Forecasting for Building Energy Models

Authors:

Abstract and Figures

In the current energy context of intelligent buildings and smart grids, the use of load forecasting to predict future building energy performance is becoming increasingly relevant. The prediction accuracy is directly influenced by input uncertainties such as the weather forecast, and its impact must be considered. Traditional load forecasting provides a single expected value for the predicted load and cannot properly incorporate the effect of these uncertainties. This research presents a methodology that calculates the probabilistic load forecast while accounting for the inherent uncertainty in forecast weather data. In the recent years, the probabilistic load forecasting approach has increased in importance in the literature but it is mostly focused on black-box models which do not allow performance evaluation of specific components of envelope, HVAC systems, etc. This research fills this gap using a white-box model, a building energy model (BEM) developed in EnergyPlus, to provide the probabilistic load forecast. Through a Gaussian kernel density estimation (KDE), the procedure converts the point load forecast provided by the BEM into a probabilistic load forecast based on historical data, which is provided by the building’s indoor and outdoor monitoring system. An hourly map of the uncertainty of the load forecast due to the weather forecast is generated with different prediction intervals. The map provides an overview of different prediction intervals for each hour, along with the probability that the load forecast error is less than a certain value. This map can then be applied to the forecast load that is provided by the BEM by applying the prediction intervals with their associated probabilities to its outputs. The methodology was implemented and evaluated in a real school building in Denmark. The results show that the percentage of the real values that are covered by the prediction intervals for the testing month is greater than the confidence level (80%), even when a small amount of data are used for the creation of the uncertainty map; therefore, the proposed method is appropriate for predicting the probabilistic expected error in load forecasting due to the use of weather forecast data.
Content may be subject to copyright.
sensors
Article
Probabilistic Load Forecasting for Building
Energy Models
Eva Lucas Segarra , Germán Ramos Ruiz *,† and Carlos Fernández Bandera
School of Architecture, University of Navarra, 31009 Pamplona, Spain; elucas@unav.es (E.L.S.);
cfbandera@unav.es (C.F.B.)
*Correspondence: gramrui@unav.es; Tel.: +34-948-425-600 (ext. 802751)
These authors contributed equally to this work.
Received: 12 October 2020; Accepted: 12 November 2020 ; Published: 15 November 2020


Abstract:
In the current energy context of intelligent buildings and smart grids, the use of load
forecasting to predict future building energy performance is becoming increasingly relevant.
The prediction accuracy is directly influenced by input uncertainties such as the weather forecast,
and its impact must be considered. Traditional load forecasting provides a single expected value for
the predicted load and cannot properly incorporate the effect of these uncertainties. This research
presents a methodology that calculates the probabilistic load forecast while accounting for the
inherent uncertainty in forecast weather data. In the recent years, the probabilistic load forecasting
approach has increased in importance in the literature but it is mostly focused on black-box models
which do not allow performance evaluation of specific components of envelope, HVAC systems, etc.
This research fills this gap using a white-box model, a building energy model (BEM) developed in
EnergyPlus, to provide the probabilistic load forecast. Through a Gaussian kernel density estimation
(KDE), the procedure converts the point load forecast provided by the BEM into a probabilistic load
forecast based on historical data, which is provided by the building’s indoor and outdoor monitoring
system. An hourly map of the uncertainty of the load forecast due to the weather forecast is generated
with different prediction intervals. The map provides an overview of different prediction intervals for
each hour, along with the probability that the load forecast error is less than a certain value. This map
can then be applied to the forecast load that is provided by the BEM by applying the prediction
intervals with their associated probabilities to its outputs. The methodology was implemented and
evaluated in a real school building in Denmark. The results show that the percentage of the real
values that are covered by the prediction intervals for the testing month is greater than the confidence
level (80%), even when a small amount of data are used for the creation of the uncertainty map;
therefore, the proposed method is appropriate for predicting the probabilistic expected error in load
forecasting due to the use of weather forecast data.
Keywords:
probabilistic load forecasting; white-box models; building energy models; weather
forecast; uncertainty analysis; monitoring; reliability
1. Introduction
In the era of the Internet of Things (IoT), virtual and physical environments are being closely
linked and widely used in various areas of the industry. Among these areas, building operation and
management is receiving more attention due to the growing development of intelligent buildings
and smart grids [
1
,
2
]. Smart buildings integrate connected objects through systems that monitor
and control a great variety of variables, such as indoor temperature, weather data, airflow rates and
CO
2
concentration. Sensors are fundamental devices in this new generation of buildings because
they connect the simulation model with the real world and they enable appropriate management
Sensors 2020,20, 6525; doi:10.3390/s20226525 www.mdpi.com/journal/sensors
Sensors 2020,20, 6525 2 of 20
and adequate decision making [
3
]. In this field, the energy efficiency of buildings is one of the most
important research areas: buildings represent almost 40% of the world’s total energy consumption
and thus hold great energy-saving potential [
4
]. Load forecasting is one of the key elements of this
new intelligent building and smart grid environment, where network solutions are used to optimize
energy sources.
Accurate load forecasts for buildings allow the optimum management of buildings’ energy
systems and low-voltage networks in different contexts, such as energy management systems [
5
,
6
],
energy storage system control [
7
], demand response (DR) and demand-side management (DSM) [
8
]
and the integration of distributed energy resources [9].
The prediction of building energy is key for the optimization of its management, and it falls into
three general categories in the literature [
10
]: black-box or data-driven models, white-box or physical
models and gray-box or hybrid models. Black-box models are mathematical models constructed
from historical data and lack explicit link between model inputs and physical building parameters.
In recent years, several reviews articles have studied the growing use of this type of prediction models
for building energy prediction [
11
13
]. Following the classification proposed in the most recent
review from Sun et al. [
13
], data driven approaches can be divided into statistical, which derive
correlations between the variable of study and influential parameters, such as linear regression (LR) or
time series analysis (ARMA and ARIMA) [
14
,
15
]; and machine learning (ML) approaches, which are
a more advanced statistical methods and use prediction algorithms. ML includes, among others,
support vector machine (SVM) [
16
], ensemble methods [
17
], deep learning [
5
] and the increasing and
most common in the recent literature [
13
,
18
], artificial neural networks (ANN) [
19
21
]. White-box or
physical models are based on physical principles and predict loads with detailed heat and mass transfer
equations using simulation software such as EnergyPlus and TRNSYS. These software packages
calculate building energy prediction based on building construction details, heating, ventilation and
air conditioning (HVAC) design information, operation schedules and climate information
[2224].
Finally, gray-box or hybrid models, which are a combination of data-driven and physics-based
models, use simplified physical descriptions but also require parameter estimation based on measured
data [25,26].
Among these building energy prediction models, black-box-based models lack an understanding
of the underlying parameters of the energy prediction and its behavior so they are not transparent [
18
].
However, white-box models allow to monitor the modeling and analyze the process step by step
interpreting the results for different scales (whole building, thermal zones, etc.) and link them with
the physics and architectural parameters of the buildings. While black-box models, such as ANNs,
are commonly used for small-scale modeling tasks or assuming that the zone temperature distribution
is uniform [
27
], white-box models are able to characterize large multi-thermal zones buildings. On the
other hand, white-box models are more flexible to changes in the buildings characteristics or operation
since they do not require the re-training of the model, avoiding problems of input data quality
[28,29]
.
Once the model is developed, it can be more easily used for other applications like retrofit analysis or
fault detection and diagnostics [
30
], or conversely, it is easy to exploit a white-box model to provide
load forecast previously developed for other purposes.
The common factor among building load forecasting techniques is that their accuracy depends not
only on the accuracy of the model itself but also on the accuracy of the predicted external inputs [
31
].
Among these driving factors, weather parameter forecasting is a fundamental element because it
has a great influence on the building’s actual energy consumption but has inherent uncertainty.
The literature recognizes the significant influence of the weather forecast on a building’s energy
performance, especially outdoor temperature [
32
]. However, the impact of the uncertainty due to
forecast weather data on building load forecasting is not well represented in the literature [
33
35
],
and few studies have directly investigated its effect [31,32,3638].
On the other hand, traditionally, a load forecast is generated with a point or deterministic approach,
which means that a single expected value for the predicted load is provided. The problem is that this
Sensors 2020,20, 6525 3 of 20
point load forecast is not able to properly consider and quantify the effect of its inherent uncertainties.
Therefore, it is necessary to develop a tool to quantitatively describe the uncertainties of load prediction
and to assess the risk of relying on these forecasts. Uncertainties in load forecasting can be addressed
through probabilistic load forecasting (PLF), an approach that can provide future predictions with the
associated prescribed probabilities. This probabilistic approach is more adaptable to the current energy
context, where the dependence of load forecasting on its inherent uncertainties complicates reliable
and efficient energy management [39].
Probabilistic load forecasting is gradually increasing in importance in the literature, especially
after the Global Energy Forecasting Competition 2014 [
40
], since it can provide more comprehensive
information for the energy management decision-making process. Hong and Fan [
39
] provided a
review of the state-of-the-art in probabilistic electric load forecasting where they stated that it can
be implemented in practically the same cases in which single-valued load forecasts are applied.
For example, it has been used for electricity consumption prediction in buildings [
41
43
], but also for
distributed renewable energy production forecasts, such as photovoltaic power generation or wind
speed forecasts [
44
,
45
]; applications related to electric vehicles [
46
]; and the quantification of the power
reserve of a microgrid [47].
The literature includes several studies with different approaches that incorporate weather
uncertainty into PLF development to forecast the building’s load. Xu et al. developed a probabilistic
load forecasting model using an artificial neural network (ANN) and probabilistic temperature
forecasts. Their results showed that the probabilistic normal load forecasts had satisfactory accuracies,
and the load forecasts based on one-day-ahead probabilistic weather forecasts were the best [
48
].
Dahl et al. presented an autoregressive heat forecast model with weather prediction input and
concluded that ensemble weather predictions could improve supply temperature control in district
heating area substations [
49
]. Zhao et al. used the Monte Carlo method (MCM) to pre-process
meteorological forecast data to improve the accuracy of load forecasts provided by a support
vector machine (SVM) model, and the forecasting results became closer to the actual data [
34
].
Similarly, Fan et al. also employed the MCM to calibrate the input variables of their proposed
SVM cooling load prediction model with the aim of reducing the influence of the uncertainty of the
input variables (weather parameters, among others). With calibrated inputs, this approach produced
a more accurate prediction, which was closer to the load prediction based on measured data [
50
].
Although probabilistic load forecasting studies are increasing in the literature, they are mainly based
on black-box models which cannot clarify the link between inputs and the forecasted building loads.
This research aims to fill this gap by providing a probabilistic load forecasting methodology that
considers the weather prediction uncertainty using white-box models (building energy models, BEMs)
instead of black-box models. The proposed methodology converts the point load forecast provided by
a BEM into a probabilistic load forecast using historical data based on indoor and outdoor building
monitoring. An hourly map of the uncertainty of the load forecast due to the weather forecast is
provided for a specific building and weather forecast source. After applying the uncertainty map
on the BEM outputs, the hourly load forecast is obtained with the probabilistic error due to weather
forecast data, providing a tool that can help the building’s energy managers and network operators to
make more informed decisions.
The load forecast is provided by a BEM (physics-based model), which is fed weather files that
are generated using a methodology that respects the thermal history of the building. Different time
horizons, from 1 day to 6 days ahead, were employed in this study to assess their influence on the
probabilistic load prediction. The methodology was applied to a real case study, a building located in
Gedved (Denmark), which is equipped with indoor temperature sensors and an on-site weather station.
Following the recommendations of Agüera-Pérez [
35
], this research used a real weather forecast from
an external provider instead of synthetic data [
51
] and six weather parameters instead of using only
temperature [52] or temperature and humidity [36].
Sensors 2020,20, 6525 4 of 20
The main contributions of this research are: (1) a probabilistic load forecasting approach is
provided based on white-box models, instead of black-box models; (2) an hourly uncertainty map
is provided as an easy tool to represent the expected hourly probabilistic load forecast error due to
weather forecast for a specific building and weather prediction source; and (3) a dedicated script is
developed to generate the daily weather files that feed the building energy model.
This paper is organized as follows: Section 2shows the proposed PLF methodology, including the
simulation and the probabilistic process performed on the data generated by the BEM. Section 3focuses
on the description of the case study for which the methodology was implemented, and Section 4shows
the results, including the evaluation of the methodology. Finally, Sections 5and 6present the discussion
and conclusions, respectively.
2. Methodology
This section presents the methods for the proposed probabilistic load forecast technique,
which uses a BEM and accounts for weather forecast uncertainty. This technique requires two
procedures, which are detailed in the following sections: the simulation process to determine the
historical impact of the weather forecast data on the load provided by the BEM and the probabilistic
processing of the simulation outputs. The overall approach of the proposed probabilistic load
forecasting methodology is illustrated in the Figure 1.
Combined
weather files
Building
Energy Model
Kernel
density
estimation
(KDE)
Hourly
Uncertainty
Map
Measured
weather data
Forecast
weather data
Actual
building's load
Forecast
building's load
Indoor measured
temperature
Data from on-site sensors Probabilistic load forecast (PLF)
Simulation Process Probabilistic Process
(Point load forecast)
Figure 1.
Components and steps of the proposed probabilistic load forecasting methodology based on
white-box models (building energy model (BEM)).
Since the effect of the weather forecast on the energy model is considered in this methodology,
all uncertainties related to the simulation must be minimized so that the results of the probabilistic
analysis are valid. Section 2.1 discusses aspects such as the thermal history, the creation of weather
files, the accuracy of the building energy model used, which accounts for the building’s internal
loads and the preparation of outputs for the probabilistic analysis. Then, Section 2.2 details the
probabilistic processing of the historical differences between the loads obtained from the observed and
forecast weather data employed to obtain the uncertainty map, which is subsequently applied to the
point forecast.
2.1. Simulation Process through a BEM
In the field of forecasting building loads, many studies have used statistical or machine learning
approaches and reported low errors and good prediction accuracy. However, since these models were
trained using historical data, their ability to adapt to changes in the building is limited. Physical models,
which are based on physical and universal laws and equations, are more adaptive to changes [
53
].
These models facilitate an understanding of the thermodynamic aspects and interactions with the
internal and external environments. The proposed methodology employs calibrated building energy
models (BEMs) based on EnergyPlus simulation software [
54
,
55
] as the best representation of the
behavior of the real building.
In order to compare the load differences using observed and forecast weather data, it is
necessary to explain how the forecast and observed data are implemented in the BEM. In this regard,
important roles are played by weather files, generated with both observed and forecast information;
Sensors 2020,20, 6525 5 of 20
the indoor temperature, as the best representation of the internal loads of the building; and the
simulation periods, which are very relevant when processing the thermal history in the simulation.
To ensure the appropriate initial conditions in the simulation process, the weather files must be
created with respect to the thermal history of the building for both the internal and external conditions.
In this research, a procedure was developed to generate weather files in
EPW
(EnergyPlus weather
file) format to meet this requirement. The Weather Converter [
56
] tool, provided as an auxiliary
program by EnergyPlus, was used for the creation of all of these files by translating and extending
typical weather data into the
EPW
format and making the necessary calculations for unavailable
data. The source of the observed weather data is an on-site weather station installed in the building
surroundings. Following the recommendations made by Agüera et al. in [
35
], real weather forecast
data supplied by an external provider were used in this methodology, instead of using arbitrary
forecasts based on synthetic data [
57
] or historical forecasts, which can be treated as perfect forecasts or
modified by adding variations [
36
] that do not provide the real context of the building’s performance.
The process of creating the weather file started with the collection of daily forecast weather data from
the external provider. Then, one weather file was generated for each day with the measured weather
information (historical data) and forecast data. The resulting file is called the combined weather
file and contains both observed and forecast data. The process was implemented using a dedicated
script that injects the forecast weather data into the historical weather data. Therefore, the combined
daily weather files contain
n
days of forecast weather data, and the rest of the data correspond to the
observed data provided by an on-site weather station. Figure 2depicts the methodology.
day 1
day 2
day 3
day 4
day 5
day n
forecast
data 1
Forecast data from external provider
MEASURED DATA
Script to
inject data
Weather creation
day 1
day 2
day 3
day 4
day 5
day n
forecast data 1
weather file
MEASURED
*.epw 1
day 1
day 2
day 3
day 4
day 5
day n
forecast
data 2
day 1
day 2
day 3
day 4
day 5
day n
forecast
data n
day 1
day 2
day 3
day 4
day 5
day n
forecast data 2
weather file
MEASURED
*.epw 2
day 1
day 2
day 3
day 4
day 5
day n
forecast data n
weather file
MEASURED
*.epw n
WEATHER FILE
Combined weather files - One per day of analysis
Figure 2. Weather file creation methodology.
It is crucial that the BEM correctly reflects the past thermal behavior by taking into account all
loads of the building (HVAC system, people, lighting, electric equipment, etc.). The best way to
account for these loads is to use the building’s indoor temperature measurements via the sensors of
the building management system (BMS). An external file with the actual indoor temperature is used
by the simulation model as a dynamic set-point for the HVAC system. The simulation output is the
energy demand required by the model to follow it. This is shown in Figure 3.
Figure 3. Simulation process methodology.
The probabilistic analysis was carried out by comparing the energy differences in the models
when using the observed and forecast weather data. To obtain accurate energy differences, the use of
the correct simulation period is very important. One simulation per day of analysis was executed since
each day has its own weather file. For each day, the simulation was configured to run 15 days before
the baseline day (day 0) with the measured weather data to capture the thermal history of the model.
Sensors 2020,20, 6525 6 of 20
The loads on day 0 and
n
days of the forecast were obtained as the outputs of each simulation.
These results (ordered and classified) were subsequently used in the probabilistic analysis.
The error of the point load prediction provided by the BEM when using weather forecast can be
evaluated using three error metrics commonly employed in the forecasting literature: mean absolute
error (MAE) (Equation
(1)
), which measures the average magnitude of the error in the units of
the variable of interest; mean absolute percentage error (MAPE) (Equation
(2)
), which is a relative
error measure that allows comparing the forecasts performance on different data sets; and the
coefficient of determination (
R2
) (Equation
(3)
), which allows to measure the linear relationship
of the two patterns [58].
MAE =1
n
n
i=1|yiˆ
yi|, (1)
MAPE =1
n
n
i=1
|yiˆ
yi|
ˆ
yi×100%, (2)
R2=
n·n
i=1yi·ˆ
yn
i=1yi·n
i=1ˆ
y
q(n·n
i=1y2
i(n
i=1yi)2)·(n·n
i=1ˆ
y2(n
i=1ˆ
y)2)
2
(3)
2.2. Probabilistic Load Forecast
The probability load forecast (PLF) proposed for this methodology was produced by applying the
probability density function of residuals to the point forecast. This approach to producing the PLF
was classified as output by Hong et al. in [
39
]. The point forecast provided by the BEM was converted
to the PLF using historical data based on the observed and forecast weather data and the building’s
indoor temperature.
First, the distribution of the residuals, which are the energy load differences provided by the BEM
when it is fed the observed and forecast weather data, was studied through a probabilistic histogram.
This is useful because it provides a straightforward visualization of the spread and the skewness of
the data, the presence of outliers and the presence of multiple modes in the data. Second, to obtain a
smooth curve that represents the data, a probability density estimation was performed. Many studies
have used the normal distribution to estimate the density function, but it performs well only when the
data follow a bell-shaped distribution. To avoid making assumptions about the distribution of the data,
kernel density estimate (KDE) method was employed in this methodology, which is a non-parametric
representation of the probability density function [
45
]. If sequence Xconsists of N1-dimensional
observations
x1
,
x2
,
. . .
,
xN
, the KDE method estimates the actual probability density function fthrough
the following function (4):
ˆ
fh(x) = 1
n
n
i=1
K(x,xi)(4)
where Kis the kernel function, which is a non-negative symmetric function that integrated to one and
has mean zero. There are many types of kernels but empirically is it not very relevant which one is
employed [
59
]. In this case, Gaussian kernel is used to realize the KDE. It replaces each sample point
with a Gaussian-shaped kernel and then obtains the resulting estimate for the density by adding these
Gaussians. It can be expressed by (5) [45]:
K(x1,x2,σ) = 1
2πσ
e(x1x2)2
2σ2(5)
The
σ>
0 is the bandwidth, a smoothing parameter that influences the shape of the distribution.
There are many bandwidth selection methods [
60
] and the advantages of using Gaussian kernel KDE
is that it can calculate the bandwidth by a rule of thumb automatically [61].
Sensors 2020,20, 6525 7 of 20
The objective of this methodology is to obtain the expected probability that the load forecast
error is below a certain value. The cumulative distribution function (CDF) or S-curve is an easily
interpretable representation of the probability that the variable (here, the load forecast error due to
the weather forecast) will be less than or equal to a certain value. Finally, from the CDF plot of each
forecast hour, prediction intervals (PIs) of load forecast errors are extracted and transformed into an
hourly map of uncertainty. The schema in Figure 4shows the complete process.
Probability of occurrence
Energy differences (kWh)
Probability density
Energy differences (kWh)
Kernel Density
Estimation (KDE)
Cumulative probability
1.0
0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0.0
Energy differences (kWh)
Quantiles
Hourly
Uncertainty Map
Hours of forecast day
Energy demand uncertainty
Prediction
Intervals (PI)
Probability
Histogram
Probability Density
Function (PDF)
Cumulative Distribution
Function (CDF)
Figure 4. Process of the probabilistic load forecast.
The map of the uncertainty of the load forecast due to weather forecast data shows an overview
of the probability that the load error is below a certain value for each hour. The map was constructed
with the available hours ahead of the forecast time, and it is read as follows: for hour
n
, there is an
x
% probability that the energy demand error is less than
y
kWh. This map of uncertainty can then
be applied to the load forecast provided by the BEM by applying the intervals of the energy demand
error with their probabilities to the load forecast outputs of the model. In this way, the load forecast
is obtained with the probability error due to weather forecast data, similar to a risk map. It can be
used, for example, by the building’s energy manager to make a more informed choice according to the
bearable risk.
Finally, for the evaluation of the methodology, two indicators commonly used in the related
literature were employed for the prediction interval assessment: the prediction interval coverage
probability (PICP) and the mean prediction interval width (MPIW) [
62
]. PICP measures the reliability
of the predictions and shows the percentage of the real values that will be covered by the upper and
lower bounds. The larger the PICP, the more likely that the real values are within the prediction interval.
It can be defined as:
PI CP =1
H
H
i=1
Ci, (6)
in which His the number of samples, and Ciis a Boolean variable defined as follows:
Ci=(1, yi[Li,Ui]
0, yi/[Li,Ui],(7)
where
Li
and
Ui
are the lower and upper PI bounds of target
yi
, respectively. PICP ranges between 0
and 100%. The prediction interval is considered valid if the PICP value is greater than the prediction
interval nominal confidence (PI NC =100(1α)%), where αrepresents the probability of error.
High PICP values can be easily reached when the width of the prediction intervals (PIs) is large.
However, large PIs have higher levels of uncertainty, and, thus, they are useless for decision making.
Therefore, a complementary metric is required to assess the prediction interval widths: this metric is
the Mean Prediction Interval Width (MPIW), which is defined as:
MPIW =1
H
H
i=1
(UiLi). (8)
In conclusion, to make a suitable decision, small MPIW and high PICP values are desirable.
Sensors 2020,20, 6525 8 of 20
As mentioned in the methodology explanation, the validity of the results of the probabilistic
analysis is closely related to the data selection and processing used to create the uncertainty map.
The following section describes a case study in which the methodology was implemented, and it
illustrates the importance of sensors for obtaining both the weather file and the indoor temperatures.
3. Description of the Case Study
In this section, the case study that was used to apply the proposed methodology is presented.
The test site is a public elementary school in Gedved, Denmark. This building is part of the EU-funded
H2020 research and innovation project SmArt BI-directional multi eNergy gAteway (SABINA) [
63
].
Gedved School consists of 6 buildings and was built in 1979 and renewed in 2007. For this case study,
the library was selected, which is a one-story building with a total surface area of 1138 m
2
. The main
characteristics of the building are as follows: there are two brick layers with 150 mm insulation in
between them for the facades; the windows are two-layer double-glazed with cold frames; the ceiling
is insulated with 200–250 mm mineral wool for sloping and flat ceilings, respectively; and the floor
is made up of concrete and contains 150 mm insulation underneath. Regarding HVAC systems,
only heating is provided to the building, and it is connected for 24 h every day from October to May.
Figure 5shows an outdoor photograph of the library.
Figure 5.
Library building from Gedved School, Denmark. Left: Outdoor image.
Middle: Weather station
installed on the building’s roof. Right: The building energy model
(OpenStudio plugin for SketchUp [64]).
The building energy model used in this study was provided by the SABINA project and was
developed using the EnergyPlus engine. In the load forecasting field, when using BEMs, it is necessary
to take into account three main sources of uncertainty: BEM accuracy, building use and external
conditions. In order to minimize the first uncertainty, this case study employed a calibrated BEM,
obtained using a calibration methodology explained in the authors’ previous papers [
58
,
65
68
].
Regarding the building’s use, no uncertainty was consider in the indoor conditions since the model
used indoor temperatures measured in each thermal zone by the BMS.
For the creation of weather files, the external conditions are required for both the observed
and the forecast weather data. The observed weather data were obtained from a weather station
installed on the building’s roof, which provides measurements with hourly intervals for atmospheric
pressure, temperature, humidity, direct and diffuse irradiation, wind speed, wind direction and rainfall.
Figure 5shows the weather station location. The forecast weather information used in this study is a real
forecast supplied by the commercial service Meteoblue [
69
]. This company uses a multimodel/machine
learning approach to calculate the forecast weather data using both Meteoblue weather models
(Nonhydrostatic Meso-Scale Modeling) and third-party models for the simulations. More information
about Meteoblue’s forecast weather data process is available on its web page [
69
]. For this study,
the weather forecast data were gathered at 09:00 on each day. For the time horizon, 6 days ahead of the
weather forecast data were available for the present study.
The period in which all of the required data were available is from December 2018 to April
2020. The summer months (from June 2019 to September 2019) were not useful for the present study
since no cooling system is installed in the building, and, therefore, only heating load forecasting was
Sensors 2020,20, 6525 9 of 20
considered. Thus, the final period of the study is composed of 13 months, from December 2018 to
May 2019 and from October 2019 to April 2020, so two complete winter campaigns (2018–2019 and
2019–2020) were analyzed.
First, the methodology is illustrated using a map of uncertainty generated from all 13 months
of data to show how this probabilistic load forecast (PLF) method is implemented using a BEM.
Then, the application of the uncertainty map to the heating load forecast provided by the BEM is
presented for a random day. Finally, it is evaluated the ability of the proposed PLF methodology to
predict the expected error in the heating load using the 12 first months of the period of the study to
generate the map of uncertainty and the last month (April 2020) to test the method.
4. Results
In this section, the results of applying the proposed probabilistic load forecasting (PLF)
methodology to a real test site are presented. As mentioned before, the methodology converts the
point load forecast provided by a BEM, which is a single-value, into a probabilistic load forecast.
Table 1presents the quantitative errors of the point load forecast provided by the BEM for the different
time horizons (from 1 to 6 day-ahead). It shows the error metrics MAE, MAPE and
R2
between
the forecast and real heating load when using the forecast and actual weather data, respectively.
The results showed, as expected, the increase in the error as the day ahead grows (MAE and MAPE
values increased and R2decreased).
Table 1.
Error metrics for the point load forecast. Comparison between forecast and real heating load.
Index Forecast Day 1 Forecast Day 2 Forecast Day 3 Forecast Day 4 Forecast Day 5 Forecast Day 6
MAE (kWh) 9.91 10.50 11.04 11.57 12.43 13.74
MAPE (%) 38.99 41.26 43.56 47.39 51.10 55.84
R2(%) 74.48 70.82 65.99 59.65 49.72 45.66
In order to show how the methodology is implemented, the whole period of study (13 months of
2 winter campaigns) was employed for the construction of the map of uncertainty. The differences in the
hourly heating load between simulations with observed and forecast weather data were transformed
into a probability histogram. This process was first performed for each full day-ahead in order to show
the influence of the forecast time horizon on the heating load provided by the BEM. Figure 6shows
the histograms for the 6 forecast days. The highlighted gray area shows the spread of the variables
for each forecast day. The graphs show that this gray area grows, which means that there are larger
errors, as the forecast days increase. The errors in all the forecast days do not follow a symmetrical
distribution with respect to the zero, and they all tend to skew toward the right, which means that the
heating energy demand simulated with the forecast weather data is mainly overestimated. The graphs
also present the probability density functions (PDFs) based on the Gaussian kernel density estimation
(KDE) (red line).
The information in Figure 6was transformed into a cumulative density function (CDF),
which depicts the probability that an error in the heating load (in kWh) is below a particular value.
Figure 7shows the representation of the CDF for the six forecast days. For example, in this case,
forecast day 1 has a 90% probability that the hourly load forecast error is below 18.5 kWh or a 10%
probability that it is below 1.2 kWh (blue lines). The plot also shows that the heating load error
increases as the forecast days increase. In this example, for the same 90% probability, the heating load
error for forecast day 6 grows to 27 kWh.
Sensors 2020,20, 6525 10 of 20
30 20 10 0 10 20 30 40 50 60 70 80 90 100
0.00
0.01
0.02
0.03
0.04
0.05
0.06
0.07
Probability
30 20 10 0 10 20 30 40 50 60 70 80 90 100
0.00
0.01
0.02
0.03
0.04
0.05
0.06
0.07
Probability
30 20 10 0 10 20 30 40 50 60 70 80 90 100
0.00
0.01
0.02
0.03
0.04
0.05
0.06
0.07
Probability
Forecast day 1 Forecast day 2
Forecast day 3 Forecast day 4
Forecast day 5 Forecast day 6
Heating load difference (kWh) Heating load difference (kWh)
Heating load difference (kWh) Heating load difference (kWh)
Heating load difference (kWh) Heating load difference (kWh)
30 20 10 0 10 20 30 40 50 60 70 80 90 100
0.00
0.01
0.02
0.03
0.04
0.05
0.06
0.07
Probability
30 20 10 0 10 20 30 40 50 60 70 80 90 100
0.00
0.01
0.02
0.03
0.04
0.05
0.06
0.07
Probability
30 20 10 0 10 20 30 40 50 60 70 80 90 100
0.00
0.01
0.02
0.03
0.04
0.05
0.06
0.07
Probability
N: 7493
Bandwidth: 1.4
N: 7493
Bandwidth: 1.14
N: 7493
Bandwidth: 1.84
N: 7493
Bandwidth: 1.52
N: 7493
Bandwidth: 1.25
N: 7493
Bandwidth: 2.05
Figure 6.
Probability histogram of the 6 forecast days and the Gaussian kernel density estimation
(red line).
30 25 20 15 10 5 0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 100
0.0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1.0
Cumulative probability F(x)
CDF: Forecast day: 1
CDF: Forecast day: 2
CDF: Forecast day: 3
CDF: Forecast day: 4
CDF: Forecast day: 5
CDF: Forecast day: 6
Heating load difference (kWh)
90% = 18.5 kWh
10% = 1.2 kWh
Figure 7. Cumulative distribution function (CDF) for the 6 forecast days.
The previous plots graphically show the hourly heating load error probability as a function of
the full forecast day. In many applications of load forecasting, such as demand response or model
predictive control, an hour-by-hour load error approach is more useful since most applications are
implemented on an hourly basis. Thus, the same procedure was performed but based on an hourly
approach from hour 1 to hour 144 with respect to the forecast time (09:00 h). The probability histogram,
Sensors 2020,20, 6525 11 of 20
the fitted PDFs (KDE), and the CDF were calculated for all of the data that correspond to the first
forecast hour, the second forecast hour and so on, with respect to the forecast time.
With the hourly CDF, the heating load error probability can be extracted as prediction intervals
(in this case, intervals of 10%), making the probabilistic load forecast information more useful
and applicable. For this purpose, a map that summarizes all of the hourly information was computed
in order to present an overview of the heating load error due to the weather forecast as a function of
the hours ahead of the forecast time. This generates the hourly uncertainty map of the heating load
forecast due to weather data uncertainty for this specific building and weather forecast provider.
The bound margins are obtained by a normal inverse cumulative distribution function for
a desired probability index
x
, which, in this case, ranges from 10% to 90%. Figure 8shows the
uncertainty map for the first 24 h ahead of the forecast hour (09:00 h). For each hour, the expected
heating load error due to weather forecast uncertainty can be extracted. For example, at 10:00 on
forecast day 1, there is a 50% probability that the error in the heating load is less than 8.1 kWh, or,
being more conservative, there is a 90% probability that the error is less than 17.1 kWh. In other
words, the use of the forecast weather data leads to an overestimation of the heat load of 8.1 kWh
and 17.1 kWh, respectively. The map shows that the heating load predicted with the forecast weather
is likely to be overestimated with respect to the observed weather data for all of the forecast hours
since the prediction intervals are mainly above 0. When the map is applied to the forecast heat load
provided by the BEM, this overestimation must be considered inverted: there is a 50% probability that
the simulation with forecast weather data provides 8.1 kWh in excess.
This map can be computed for as many forecast hours as there are available. Figure 9shows
the uncertainty map for the 144 h forecast hours available in this case study, divided into the
six forecast days. As expected, the load uncertainty increases as the forecast hours increase, and this
is reflected in these maps: for example, at 10:00 on forecast day 6 (121 h ahead of the forecast time),
there is a 50% probability that the heat load error is lower than 10 kWh, which is 1.9 kWh higher than
that for the same probability at 10:00 on forecast day 1 (1 h ahead of the forecast time), and there is a
90% probability that the heat load error is lower than 27.9 kWh, which is 10.8 kWh higher than that for
the same probability at 10:00 on forecast day 1. The graphs also show that the width of the prediction
intervals in the results grows as the forecast hours increase since, as seen in the histograms for the
daily analysis (see Figure 6), as the forecast days increase, the dispersion of the results also increases.
Once the map of uncertainty is obtained, it can be applied to the heating point load forecast
provided by the BEM. To illustrate this, the previous map is shown for a random day (2 January 2020).
Figure 10 shows the heating load forecast with the uncertainty map for prediction intervals from 10%
to 90% implemented for 2 January 2020 and for 1 day ahead (from 1 to 24 h ahead). In the graph,
the red dots represent the load forecast provided by the BEM for this specific day. The load uncertainty
is represented as prediction intervals around the heating load forecast. When the predictions intervals
(PIs) are below the heating load forecast, the energy demand is overestimated, and if they are above,
it is underestimated. For the case study, the graph shows that the prediction intervals tend to be below
the forecast energy demand, which means that it is much more likely that the model overestimates
the heating load due to the weather forecast. In order to clarify how the graph is read, we provide
the following example. At 10:00, which is 1 h ahead of the forecast time, the forecast heating load
provided by the BEM is 47.6 kWh (red dot), and there is a 90% probability that the real heating load
(with the observed weather data) is more than 30.5 kWh; as another way to analyze it, there is an
80% probability (prediction intervals between 10% and 90%) that the real heating load is higher than
30.5 kWh and lower than 46.1 kWh. According to the risk that the energy manager can accept, higher or
lower probability intervals can be used for decision making. In the graph, the blue dots represent
the real heating load, which is the result of the simulation with the observed weather data. For this
example, all the dots are within the PI, which means that, for this case, this procedure provided a good
prediction of the error in the heating load forecast due to the weather forecast data.
Sensors 2020,20, 6525 12 of 20
The evaluation of the ability of the methodology to predict the load forecast error due to the
weather forecast was performed for the 1 day-ahead data since it is the most commonly used time
horizon in load forecasting. PICP and MPIW indicators were used to perform the assessment,
and prediction intervals from 10% to 90% were considered. The validation of the methodology
was performed using the first 12 months of the data (from December 2018 to May 2019 and from
October 2019 to March 2020) to generate the uncertainty map and the most recent month, April 2020,
to test the prediction interval accuracy. This month has a range of actual hourly heating loads from 0
on the mild days to 45.8 kWh on the coolest days. The PICP result is 83.1%, which is greater than the
confidence level (PINC = 80%), and the MPIW is 17.5 kWh. Figure 11 shows the graphs of the results of
the testing period (April 2020). In order to facilitate its interpretation, the graph for the whole month is
divided into three parts. The actual loads are indicated by the blue line, the heating load forecast is the
red line and the gray area represents the 80% prediction interval (from 10% to 90%). It can be seen that
the prediction intervals generated with the proposed method cover the actual value most of the time.
Previous evaluation is performed using the 12 previous months prior to the testing month
(April 2020) to generate the uncertainty map, but, how does the amount of data affect the uncertainty
map and the results? In order to analyze this influence an analysis is performed gradually increasing
the amount of data used for the creation of the uncertainty map. Maintaining April 2020 as the testing
period, the uncertainty map is created using only the previous month (March 2020), the previous two
(February 2020–March 2020) and so on. Table 2presents the results. It shows that in all the cases the
PICP values are higher than the confidence level (80%). It is remarkable that PICP values increase as
months closer to the testing period are used for the creation of the uncertainty map, despite having a
lesser amount of data, but MPIW values increase as well providing higher prediction intervals widths.
Therefore, the results are robust and similar despite the number of months employed to build the
uncertainty map.
Table 2.
Prediction interval coverage probability (PICP) and mean prediction interval width (MPIW)
results when using a gradually increasing amount of data for the creation of the uncertainty map.
April 2020 is maintained as the testing period.
Months
Uncertainty
Map
March 2020 February 2020
March 2020
January 2020
March 2020
December 2019
March 2020
November 2019
March 2020
October 2019
March 2020
May 2019
March 2020
April 2019
March 2020
March 2019
March 2020
February 2019
March 2020
January 2019
March 2020
December 2018
March 2020
Nº of months 1 2 3 4 5 6 7 8 9 10 11 12
PICP (%)91.5 89.9 84.4 82.2 83.1 84.2 83.8 84.3 84.4 84.2 84.0 83.1
MPIW (kWh) 27.1 23.1 18.8 18.4 18.3 17.8 17.5 17.2 17.7 17.4 17.7 17.5
-5
0
5
10
15
20
25
30
Energy demand uncertainty (kWh)
Hours
Hours
ahead
10
+1
11
+2
12
+3
13
+4
14
+5
15
+6
16
+7
17
+8
18
+9
19
+10
20
+11
21
+12
22
+13
23
+14
24
+15
01
+16
02
+17
03
+18
04
+19
05
+20
06
+21
07
+22
08
+23
09
+24
PI 90% = 17.1 KWh
PI 50% = 8.1 KWh
80–90%
70–80%
60–70%
50–60%
40–50%
30–40%
20–30%
10–20%
Prediction intervals
Figure 8.
Probabilistic heating load forecast results: Hourly uncertainty map of the heating load
forecast due to weather forecast data for the first 24 h ahead of the forecast hour (09:00 h).
Sensors 2020,20, 6525 13 of 20
-5
0
5
10
15
20
25
30
Energy demand uncertainty (kWh)
10
+1
11
+2
12
+3
13
+4
14
+5
15
+6
16
+7
17
+8
18
+9
19
+10
20
+11
21
+12
22
+13
23
+14
24
+15
01
+16
02
+17
03
+18
04
+19
05
+20
06
+21
07
+22
08
+23
09
+24
10
+25
11
+26
12
+27
13
+28
14
+29
15
+30
16
+31
17
+32
18
+33
19
+34
20
+35
21
+36
22
+37
23
+38
24
+39
01
+40
02
+41
03
+42
04
+43
05
+44
06
+45
07
+46
08
+47
09
+48
Hours
Hours
ahead
Hours
Hours
ahead
-5
0
5
10
15
20
25
30
Energy demand uncertainty (kWh)
10
+49
11
+50
12
+51
13
+52
14
+53
15
+54
16
+55
17
+56
18
+57
19
+58
20
+59
21
+60
22
+61
23
+62
24
+63
01
+64
02
+65
03
+66
04
+67
05
+68
06
+69
07
+70
08
+71
09
+72
Hours
Hours
ahead
10
+73
11
+74
12
+75
13
+76
14
+77
15
+78
16
+79
17
+80
18
+81
19
+82
20
+83
21
+84
22
+85
23
+86
24
+87
01
+88
02
+89
03
+90
04
+91
05
+92
06
+93
07
+94
08
+95
09
+96
Hours
Hours
ahead
-5
0
5
10
15
20
25
30
Energy demand uncertainty (kWh)
10
+97
11
+98
12
+99
13
+100
14
+101
15
+102
16
+103
17
+104
18
+105
19
+106
20
+107
21
+108
22
+109
23
+110
24
+111
01
+112
02
+113
03
+114
04
+115
05
+116
06
+117
07
+118
08
+119
09
+120
Hours
Hours
ahead
-5
0
5
10
15
20
25
30
Energy demand uncertainty (kWh)
10
+121
11
+122
12
+123
13
+124
14
+125
15
+126
16
+127
17
+128
18
+129
19
+130
20
+131
21
+132
22
+133
23
+134
24
+135
01
+136
02
+137
03
+138
04
+139
05
+140
06
+141
07
+142
08
+143
09
+144
Hours
Hours
ahead
-5
0
5
10
15
20
25
30
Energy demand uncertainty (kWh)
-5
0
5
10
15
20
25
30
Energy demand uncertainty (kWh)
Uncertainty map forecast day 2 (25–48 h)
Uncertainty map forecast day 4 (73–96 h)
Uncertainty map forecast day 6 (121–144 h)
Uncertainty map forecast day 1 (1–24 h)
Uncertainty map forecast day 3 (49–72 h)
Uncertainty map forecast day 5 (97–120 h)
80–90%
70–80%
60–70%
50–60%
40–50%
30–40%
20–30%
10–20%
Prediction intervals
Figure 9.
Probabilistic heating load forecast results: Hourly uncertainty map of the heating load
forecast due to weather forecast data for 1 to 6 days ahead.
10
15
20
25
30
35
40
45
50
55
Energy demand (kWh)
10
+1
11
+2
12
+3
13
+4
14
+5
15
+6
16
+7
17
+8
18
+9
19
+10
20
+11
21
+12
22
+13
23
+14
24
+15
01
+16
02
+17
03
+18
04
+19
05
+20
06
+21
07
+22
08
+23
09
+24
Hours
Hours
ahead
Forecast hours
Real energy demand
Forecast energy demand
10–20%
20–30%
30–40%
40–50%
50–60%
60–70%
70–80%
80–90%
PI 10% = 46.1 KWh
PI 90% = 30.5 KWh
Figure 10.
Application of the map of the heating load forecast uncertainty due to forecast weather data
using the whole period of the study for a random day: 2 January 2020.
Sensors 2020,20, 6525 14 of 20
0
5
10
15
20
25
30
35
40
45
50
55
60
65
70
1 2 3 4 5 6 7 8 9 10
0
5
10
15
20
25
30
35
40
45
50
55
60
65
70
11 12 13 14 15 16 17 18 19 20
0
5
10
15
20
25
30
35
40
45
50
55
60
65
70
21 22 23 24 25 26 27 28 29 30
Heating Load (kWh)
Heating Load (kWh)
Heating Load (kWh)
Date
Forecast heating load (kWh)
Actual heating load (kWh)
Prediction interval PINC 80
% (10%–90%)
Figure 11.
Results for the validation of the probabilistic load forecasting methodology for the testing
period (April 2020). Above: from 1 to 10 April 2020; middle: from 11 to 20 April 2020; and below:
from 21 to 30 April 2020.
5. Discussion
In the current energy context, load forecasting is becoming increasingly widespread, and the
influence of the uncertainties that affect the prediction of future energy demand must be taken
into consideration. One of the most important uncertainties is the weather forecast, but traditional
point load forecast cannot properly reflect the effect of this uncertainty. This paper aims to provide a
methodology that calculates the probabilistic load forecast while accounting for the inherent uncertainty
in forecast weather data. Recently, PLF approaches have gained importance in the literature but focused
mainly in black-box models. In order to fill this gap, this research provides a PLF methodology based
on white-box models (BEMs). This methodology generates an hourly map of the uncertainty of
the load forecast, which allows the point load forecasting provided by the BEM to be converted
into a probabilistic load forecast. The map of uncertainty is created through a probabilistic analysis
of the impact that the weather forecast has on the building’s load, which is provided by the BEM
using data from the past. This analysis is possible thanks to the use of an accurately calibrated BEM
and data gathered from different sources: the indoor conditions are provided by the sensors of the
Sensors 2020,20, 6525 15 of 20
building’s monitoring; the observed outdoor conditions are supplied by an on-site weather station;
and the weather forecast is provided by an external service.
A case study in a real school building in Gedved, Denmark, is presented. Thirteen months
of data from the building’s monitoring, weather station and weather forecast were available,
which corresponds to two complete winter campaigns. As a result that this building lacks a cooling
system, the study was performed for the heating load forecast, and only the months in which the
systems were actuated were considered.
Unlike other studies from the literature, this research employed a physics-based model, which is
a calibrated BEM developed with EnergyPlus. Following the recommendations of Agüera et al.
in [
35
], this research used the real weather forecast from an external provider instead of synthetic
data. Agüera et al. also found that many papers that employ weather forecast for load forecasting
employed test periods of one day or shorter, and they recommended the use of longer testing periods.
In the present study, the methodology was validated using a whole month. Regarding the load
forecasts, Agüera et al. [
35
] recommended the introduction of at least outdoor temperature forecasts,
and they considered humidity estimations to be valuable. Studies that have accounted for weather
forecasts have usually incorporated only outdoor temperature [
52
], outdoor temperature and relative
humidity [
34
,
36
] or temperature and solar irradiation [
32
] but not all influential weather parameters.
Previous studies from the authors have shown that weather parameters such as wind speed can be
very influential on the building’s load [
70
]. For this reason, this research introduced forecast data
for outdoor temperature, relative humidity, direct normal irradiation, diffuse horizontal irradiation,
wind speed and wind direction.
Regarding the results from the case study, first, the probabilistic load forecast was generated
based on all 13 months of data. The hourly map of the uncertainty of the heating load for this building
and weather provider is presented, and an example of its application is shown. This map shows the
probability of having
x
error in the load forecast for each forecast hour (from 1 to 144 h) for different
prediction intervals from 10% to 90%. In this case, the map shows clearly that the forecast weather
data of the weather provider and location of the study generate an overestimation of the heating load.
The validation of the methodology was performed for the one-day-ahead weather forecast,
which is the most common time horizon used, using the PICP and MPIW indicators. The evaluation
was performed using April 2020 as the testing period, and the previous months were used to generate
the prediction intervals. The results reveal a good performance: the PICP is 83.1%, which was greater
than the confidence level (PINC = 80%), with an MPIW of 17.5 kWh. In order to evaluate the influence
in the results of the size of the sample used for the creation of the uncertainty map, an analysis was
performed gradually increasing the number of months employed and maintaining April 2020 as the
testing period. It showed robust and similar results despite the amount of data employed in the
uncertainty map creation. In all the cases, PICP values were above the confidence level ranging from
82.2 to 91.5%.
6. Conclusions
The present study shows a probabilistic load forecasting methodology based on the use of white-box
models developed in EnergyPlus instead of data-driven or black-box models, which are the commonly
used prediction models employed in the literature for PLF approaches. White-box models allow to link
the results to the physical and architectural building parameters, evaluate the thermal indoor conditions in
different time and spatial scales and are more flexible and robust to changes in the building or operation
schedules. The proposed methodology allows to build an uncertainty map of the building load prediction,
which considers the uncertainty due to the use of weather forecast data, and then apply it to the point load
forecast provided by the building simulation model.
The proposed methodology was applied in a case study showing, through the uncertainty map, an easy
way to represent the expected hourly probabilistic load forecast error, which is very useful for interpretation
by the building’s manager, aggregators, microgrids operators, etc. The results of the evaluation of this
Sensors 2020,20, 6525 16 of 20
methodology showed that for the testing period (April 2020), morethan 80% of the actual hourly heating
loads values were within the prediction intervals. The influence of the amount of data used for the creation
of the uncertainty map was analyzed gradually increasing the number of months employed. It showed
that the methodology is robust since similar results, always with PICP values above the confidence level,
were obtained despite the amount of data employed. Therefore, the proposed methodology and the resulting
map of uncertainty shown in this research are useful tools for applications in which the future load forecast is
required, since future predictions always imply weather forecast uncertainties. For example, when predictive
control strategies are implemented in a microgrid and the consumer has to provide a day-ahead demand plan
with an hourly resolution, which is supposed to be granted within a given confidence interval, the probabilistic
load forecast of the building allows the network operators to better make decisions and plan for future smart
grids; or when the building participates in a demand response (DR) program, the aggregator that manages
the DR events can use a probabilistic load forecast that accounts for uncertainties such as the weather forecast
to make an informed decision about which buildings to rely on for the event.
A limitation of the study was that only heating load forecast, and therefore only cold months,
were considered since no cooling system was available in the building. In future research, this methodology
will be applied and tested in other buildings with both heating and cooling systems in order to assess how
the seasonality of the weather data influences the results when the whole year is employed. Furthermore,
we are currently analyzing whether the proposed map of uncertainty can be generated for a cluster of
buildings with similar architectural characteristics, use patterns and weather conditions. This could be a
simple and useful tool for network managers or aggregators in the DR context.
Author Contributions:
Conceptualization, methodology, formal analysis and visualization, E.L.S. and G.R.R.;
writing—original draft, E.L.S.; writing—review and editing, G.R.R. and C.F.B.; software, G.R.R.; and supervision, C.F.B.
All authors have read and agreed to the published version of the manuscript.
Funding:
This project received funding from the European Union’s Horizon 2020 research and innovation
program under Grant Agreement No. 731211, project SABINA.
Acknowledgments:
We would like to thank the Insero for providing the data of the Gedved school located in
Gedved (Denmark).
Conflicts of Interest: The authors declare no conflicts of interest.
Abbreviations
The following abbreviations are used in this manuscript:
ANN Artificial neural network
ARIMA Autoregressive integrated moving average
ARMA Autoregressive moving average
BEM Building energy model
BMS Building management system
CDF Cumulative distribution function
DR Demand response
DSM Demand-side management
EPW EnergyPlus weather file
HVAC Heating, ventilation, and air conditioning
Iot Internet of things
KDE Kernel density estimation
kWh Kilowatt hour
LR Linear regression
MAE Mean absolute error
MAPE Mean absolute percentage error
MCM Monte Carlo method
ML Machine learning
MPC Model predictive control
Sensors 2020,20, 6525 17 of 20
MPIW Mean prediction interval width
PDF Probability density function
PICP Prediction interval coverage probability
PINC Prediction interval nominal confidence
PLF Probabilistic load forecast
PI Prediction intervals
SVM Support vector machine
R2Coefficient of determination
References
1.
Marinakis, V.; Doukas, H. An advanced IoT-based system for intelligent energy management in buildings.
Sensors 2018,18, 610. [CrossRef] [PubMed]
2.
Jia, M.; Komeily, A.; Wang, Y.; Srinivasan, R.S. Adopting Internet of Things for the development of smart
buildings: A review of enabling technologies and applications. Autom. Constr.
2019
,101, 111–126. [CrossRef]
3.
Lu, X.; O’Neill, Z.; Li, Y.; Niu, F. A novel simulation-based framework for sensor error impact analysis
in smart building systems: A case study for a demand-controlled ventilation system. Appl. Energy
2020,263, 114638. [CrossRef]
4.
2019 Global Status Report for Buildings and Construction: Towards a Zero Emissions, Efficient and Resilient
Buildings and Construction Sector. Available online: https://wedocs.unep.org/bitstream/handle/20.500.
11822/30950/2019GSR.pdf?sequence=1&isAllowed=y (accessed on 15 November 2020).
5.
Khan, Z.A.; Hussain, T.; Ullah, A.; Rho, S.; Lee, M.; Baik, S.W. Towards Efficient Electricity Forecasting in
Residential and Commercial Buildings: A Novel Hybrid CNN with a LSTM-AE based Framework. Sensors
2020,20, 1399. [CrossRef] [PubMed]
6.
El Jaouhari, S.; Jose Palacios-Garcia, E.; Anvari-Moghaddam, A.; Bouabdallah, A. Integrated Management of
Energy, Wellbeing and Health in the Next Generation of Smart Homes. Sensors 2019,19, 481. [CrossRef]
7.
Lee, S.; Choi, D.H. Energy Management of Smart Home with Home Appliances, Energy Storage System and
Electric Vehicle: A Hierarchical Deep Reinforcement Learning Approach. Sensors
2020
,20, 2157. [CrossRef]
8.
Kerk, S.G.; Hassan, N.U.; Yuen, C. Smart Distribution Boards (Smart DB), Non-Intrusive Load Monitoring
(NILM) for Load Device Appliance Signature Identification and Smart Sockets for Grid Demand Management.
Sensors 2020,20, 2900. [CrossRef]
9.
Lee, S.; Choi, D.H. Reinforcement learning-based energy management of smart home with rooftop solar
photovoltaic system, energy storage system, and home appliances. Sensors 2019,19, 3937. [CrossRef]
10.
Foucquier, A.; Robert, S.; Suard, F.; Stéphan, L.; Jay, A. State of the art in building modelling and energy
performances prediction: A review. Renew. Sustain. Energy Rev. 2013,23, 272–288. [CrossRef]
11.
Amasyali, K.; El-Gohary, N.M. A review of data-driven building energy consumption prediction studies.
Renew. Sustain. Energy Rev. 2018,81, 1192–1205. [CrossRef]
12.
Bourdeau, M.; Zhai, X.; Nefzaoui, E.; Guo, X.; Chatellier, P. Modeling and forecasting building energy
consumption: A review of data-driven techniques. Sustain. Cities Soc. 2019,48, 101533. [CrossRef]
13.
Sun, Y.; Haghighat, F.; Fung, B.C. A Review of the-State-of-the-Art in Data-driven Approaches for Building
Energy Prediction. Energy Build. 2020,221, 110022. [CrossRef]
14.
Chou, J.S.; Ngo, N.T. Time series analytics using sliding window metaheuristic optimization-based machine
learning system for identifying building energy consumption patterns. Appl. Energy
2016
,177, 751–770.
[CrossRef]
15.
Nepal, B.; Yamaha, M.; Yokoe, A.; Yamaji, T. Electricity load forecasting using clustering and ARIMA model
for energy management in buildings. Jpn. Archit. Rev. 2020,3, 62–76. [CrossRef]
16.
Moradzadeh, A.; Mansour-Saatloo, A.; Mohammadi-Ivatloo, B.; Anvari-Moghaddam, A. Performance Evaluation
of Two Machine Learning Techniques in Heating and Cooling Loads Forecasting of Residential Buildings. Appl. Sci.
2020,10, 3829. [CrossRef]
17.
Khoshrou, A.; Pauwels, E.J. Short-term scenario-based probabilistic load forecasting: A data-driven approach.
Appl. Energy 2019,238, 1258–1268. [CrossRef]
18.
Runge, J.; Zmeureanu, R. Forecasting energy use in buildings using artificial neural networks: A review.
Energies 2019,12, 3254. [CrossRef]
Sensors 2020,20, 6525 18 of 20
19.
Cox, S.J.; Kim, D.; Cho, H.; Mago, P. Real time optimal control of district cooling system with thermal energy
storage using neural networks. Appl. Energy 2019,238, 466–480. [CrossRef]
20.
Kim, J.H.; Seong, N.C.; Choi, W. Forecasting the Energy Consumption of an Actual Air Handling Unit and
Absorption Chiller Using ANN Models. Energies 2020,13, 4361. [CrossRef]
21.
Sadeghian Broujeny, R.; Madani, K.; Chebira, A.; Amarger, V.; Hurtard, L. Data-driven living spaces’ heating
dynamics modeling in smart buildings using machine learning-based identification. Sensors 2020,20, 1071.
[CrossRef]
22.
Kwak, Y.; Huh, J.H. Development of a method of real-time building energy simulation for efficient
predictive control. Energy Convers. Manag. 2016,113, 220–229. [CrossRef]
23.
Kwak, Y.; Huh, J.H.; Jang, C. Development of a model predictive control framework through real-time
building energy management system data. Appl. Energy 2015,155, 1–13. [CrossRef]
24.
Kampelis, N.; Papayiannis, G.I.; Kolokotsa, D.; Galanis, G.N.; Isidori, D.; Cristalli, C.; Yannacopoulos, A.N.
An Integrated Energy Simulation Model for Buildings. Energies 2020,13, 1170. [CrossRef]
25.
Ghosh, S.; Reece, S.; Rogers, A.; Roberts, S.; Malibari, A.; Jennings, N.R. Modeling the thermal dynamics of
buildings: A latent-force-model-based approach. ACM Trans. Intell. Syst. Technol.
2015
,6, 1–27. [CrossRef]
26.
Gray, F.M.; Schmidt, M. A hybrid approach to thermal building modelling using a combination of Gaussian
processes and grey-box models. Energy Build. 2018,165, 56–63. [CrossRef]
27.
Huang, H.; Chen, L.; Hu, E. A neural network-based multi-zone modelling approach for predictive control
system design in commercial buildings. Energy Build. 2015,97, 86–97. [CrossRef]
28.
Luo, J.; Hong, T.; Fang, S.C. Benchmarking robustness of load forecasting models under data integrity attacks.
Int. J. Forecast. 2018,34, 89–104. [CrossRef]
29.
Zhang, Y.; Lin, F.; Wang, K. Robustness of Short-Term Wind Power Forecasting Against False Data
Injection Attacks. Energies 2020,13, 3780. [CrossRef]
30.
Henze, G. Model predictive control for buildings: A quantum leap? J. Build. Perform. Simul.
2013
,
doi:10.1080/19401493.2013.778519. [CrossRef]
31.
Petersen, S.; Bundgaard, K.W. The effect of weather forecast uncertainty on a predictive control concept for
building systems operation. Appl. Energy 2014,116, 311–321. [CrossRef]
32.
Sandels, C.; Widén, J.; Nordström, L.; Andersson, E. Day-ahead predictions of electricity consumption in
a Swedish office building from weather, occupancy, and temporal data. Energy Build.
2015
,108, 279–290.
[CrossRef]
33.
Thieblemont, H.; Haghighat, F.; Ooka, R.; Moreau, A. Predictive control strategies based on weather forecast
in buildings with energy storage system: A review of the state-of-the art. Energy Build.
2017
,153, 485–500.
[CrossRef]
34.
Zhao, J.; Duan, Y.; Liu, X. Uncertainty analysis of weather forecast data for cooling load forecasting based on
the Monte Carlo method. Energies 2018,11, 1900. [CrossRef]
35.
Agüera-Pérez, A.; Palomares-Salas, J.C.; González de la Rosa, J.J.; Florencias-Oliveros, O. Weather forecasts
for microgrid energy management: Review, discussion and recommendations. Appl. Energy
2018
,
228, 265–278. [CrossRef]
36.
Wang, Z.; Hong, T.; Piette, M.A. Building thermal load prediction through shallow machine learning and
deep learning. Appl. Energy 2020,263, 114683. [CrossRef]
37.
Henze, G.P.; Kalz, D.E.; Felsmann, C.; Knabe, G. Impact of forecasting accuracy on predictive optimal control
of active and passive building thermal storage inventory. HVAC R Res. 2004,10, 153–178. [CrossRef]
38.
Oldewurtel, F.; Parisio, A.; Jones, C.N.; Gyalistras, D.; Gwerder, M.; Stauch, V.; Lehmann, B.; Morari, M.
Use of model predictive control and weather forecasts for energy efficient building climate control.
Energy Build. 2012,45, 15–27. [CrossRef]
39.
Hong, T.; Fan, S. Probabilistic electric load forecasting: A tutorial review. Int. J. Forecast.
2016
,32, 914–938.
[CrossRef]
40.
Hong, T.; Pinson, P.; Fan, S.; Zareipour, H.; Troccoli, A.; Hyndman, R.J. Probabilistic energy forecasting:
Global energy forecasting competition 2014 and beyond. Int. J. Forecast. 2016,32, 896–913. [CrossRef]
41.
Gerossier, A.; Girard, R.; Kariniotakis, G.; Michiorri, A. Probabilistic day-ahead forecasting of household
electricity demand. CIRED-Open Access Proc. J. 2017,2017, 2500–2504. [CrossRef]
Sensors 2020,20, 6525 19 of 20
42.
van der Meer, D.W.; Shepero, M.; Svensson, A.; Widén, J.; Munkhammar, J. Probabilistic forecasting of
electricity consumption, photovoltaic power generation and net demand of an individual building using
Gaussian Processes. Appl. Energy 2018,213, 195–207. [CrossRef]
43.
Rouleau, J.; Ramallo-González, A.P.; Gosselin, L.; Blanchet, P.; Natarajan, S. A unified probabilistic model
for predicting occupancy, domestic hot water use and electricity use in residential buildings. Energy Build.
2019,202, 109375. [CrossRef]
44.
El-Baz, W.; Tzscheutschler, P.; Wagner, U. Day-ahead probabilistic PV generation forecast for buildings
energy management systems. Sol. Energy 2018,171, 478–490. [CrossRef]
45.
Zhao, X.; Liu, J.; Yu, D.; Chang, J. One-day-ahead probabilistic wind speed forecast based on optimized
numerical weather prediction data. Energy Convers. Manag. 2018,164, 560–569. [CrossRef]
46.
Huber, J.; Dann, D.; Weinhardt, C. Probabilistic forecasts of time and energy flexibility in battery electric
vehicle charging. Appl. Energy 2020,262, 114525. [CrossRef]
47.
Yan, X.; Abbes, D.; Francois, B. Uncertainty analysis for day ahead power reserve quantification in an urban
microgrid including PV generators. Renew. Energy 2017,106, 288–297. [CrossRef]
48.
Xu, L.; Wang, S.; Tang, R. Probabilistic load forecasting for buildings considering weather forecasting
uncertainty and uncertain peak load. Appl. Energy 2019,237, 180–195. [CrossRef]
49.
Dahl, M.; Brun, A.; Andresen, G.B. Using ensemble weather predictions in district heating operation and
load forecasting. Appl. Energy 2017,193, 455–465. [CrossRef]
50.
Fan, C.; Liao, Y.; Zhou, G.; Zhou, X.; Ding, Y. Improving cooling load prediction reliability for HVAC system
using Monte-Carlo simulation to deal with uncertainties in input variables. Energy Build.
2020
,226, 110372.
[CrossRef]
51.
Luna, A.C.; Meng, L.; Diaz, N.L.; Graells, M.; Vasquez, J.C.; Guerrero, J.M. Online energy management
systems for microgrids: Experimental validation and assessment framework. IEEE Trans. Power Electron.
2017,33, 2201–2215. [CrossRef]
52.
Lamoudi, M.Y.; Béguery, P.; Alamir, M. Use of simulation for the validation of a model predictive
control strategy for energy management in buildings. In Proceedings of the Building Simulation 2011,
11th international IBPSA conference, Sydney, Australia, 14–16 November 2011; pp. 2703–2710.
53.
Lazos, D.; Sproul, A.B.; Kay, M. Optimisation of energy management in commercial buildings with weather
forecasting inputs: A review. Renew. Sustain. Energy Rev. 2014,39, 587–603. [CrossRef]
54.
Crawley, D.B.; Lawrie, L.K.; Winkelmann, F.C.; Buhl, W.F.; Huang, Y.J.; Pedersen, C.O.; Strand, R.K.;
Liesen, R.J.; Fisher, D.E.; Witte, M.J.; et al. EnergyPlus: Creating a new-generation building energy
simulation program. Energy Build. 2001,33, 319–331. [CrossRef]
55.
Crawley, D.B.; Lawrie, L.K.; Pedersen, C.O.; Winkelmann, F.C.; Witte, M.J.; Strand, R.K.; Liesen, R.J.;
Buhl, W.F.; Huang, Y.J.; Henninger, R.H.; et al. EnergyPlus: An update. Proc. Simbuild 2004,1, 1.
56.
DOE, E. Auxiliary Programs: EnergyPlus
TM
Version 8.9.0 Documentation; US Department of Energy:
Washington, DC, USA, 2018.
57.
Romero-Quete, D.; Cañizares, C.A. An affine arithmetic-based energy management system for isolated
microgrids. IEEE Trans. Smart Grid 2018,10, 2989–2998. [CrossRef]
58.
González, V.G.; Colmenares, L.Á.; Fidalgo, J.F.L.; Ruiz, G.R.; Bandera, C.F. Uncertainy’s Indices Assessment
for Calibrated Energy Models. Energies 2019,12, 2096. [CrossRef]
59. Rao, B.P. Nonparametric Function Estimation; Academic Press: London, UK, 1983.
60.
Bashtannyk, D.M.; Hyndman, R.J. Bandwidth selection for kernel conditional density estimation.
Comput. Stat. Data Anal. 2001,36, 279–298. [CrossRef]
61.
Scott, D.W. Multivariate Density Estimation: Theory, Practice, and Visualization; John Wiley & Sons: New York,
NY, USA, 2015.
62.
Shrivastava, N.A.; Panigrahi, B.K. Point and prediction interval estimation for electricity markets with
machine learning techniques and wavelet transforms. Neurocomputing 2013,118, 301–310. [CrossRef]
63.
SABINA SmArt BI-directional multi eNergy gAteway. Available online: https://sabina-project.eu/
(accessed on 20 April 2020).
64.
Guglielmetti, R.; Macumber, D.; Long, N. OpenStudio: An Open Source Integrated Analysis Platform;
Technical Report; National Renewable Energy Laboratory (NREL): Golden, CO, USA, 2011.
65.
Ruiz, G.R.; Bandera, C.F.; Temes, T.G.A.; Gutierrez, A.S.O. Genetic algorithm for building
envelope calibration. Appl. Energy 2016,168, 691–705. [CrossRef]
Sensors 2020,20, 6525 20 of 20
66.
Ruiz, G.R.; Bandera, C.F. Analysis of uncertainty indices used for building envelope calibration. Appl. Energy
2017,185, 82–94. [CrossRef]
67.
Fernández Bandera, C.; Ramos Ruiz, G. Towards a new generation of building envelope calibration. Energies
2017,10, 2102. [CrossRef]
68.
Gutiérrez González, V.; Ramos Ruiz, G.; Fernández Bandera, C. Empirical and Comparative Validation for a
Building Energy Model Calibration Methodologya. Sensors 2020,20, 5003. [CrossRef] [PubMed]
69. Meteoblue. Available online: https://meteoblue.com/ (accessed on 20 April 2020).
70.
Segarra, E.L.; Ruiz, G.R.; González, V.G.; Peppas, A.; Bandera, C.F. Impact Assessment for Building Energy
Models Using Observed vs. Third-Party Weather Data Sets. Sustainability 2020,12, 6788. [CrossRef]
Publisher’s Note:
MDPI stays neutral with regard to jurisdictional claims in published maps and institutional
affiliations.
©
2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
... To achieve sustainability in a Building Management System (BMS), the goal is to minimize energy consumption while maintaining the occupants' comfort at a satisfactory level [8]. The optimum management of building energy systems is achieved by implementing accurate load forecasting models [9]. In literature, the buildings' load forecasting models are categorized into physical, data-driven and hybrid. ...
... Different models have been proposed in the literature for buildings' load forecasting and can be divided into three types: physical, data-driven and hybrid models [9]. The physical models use mathematical equations for forecasting. ...
Article
Full-text available
The smart buildings’ load forecasting is necessary for efficient energy management, and it is easily possible because of the data availability based on widespread use of Internet of Things (IoT) devices and automation systems. The information of buildings’ occupancy is directly associated with energy consumption. Therefore, we present a hybrid model consisting of a Long Short-Term Memory (LSTM) network, Extreme Gradient Boosting (XgBoost), Random Forest (RF) and Linear Regression (LR) for commercial and academic buildings’ load forecasting. The correlation between occupants’ count and total load of the building is calculated using Pearson Correlation Coefficient (PCC). The comparative analysis of the proposed approach with LSTM, XgBoost, RF and Gated Recurrent Unit (GRU) is also performed. Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Mean Square Error (MSE) and Normalized Root Mean Square Error (NRMSE) are used as performance indicators for evaluating performance. Findings indicate that the proposed hybrid approach outperforms other models. The RMSE and MAE of 2.99 and 2.18, respectively, are recorded by the proposed model for commercial building dataset while for academic building the RMSE and MAE are 4.48 and 2.85, respectively. Occupancy and load consumption have a positive correlation as evident from PCC analysis. Therefore, we have scheduled the forecasted load based on occupancy patterns for two different cases. Cost is reduced by 17.42% and 33.40% in case 1 and case 2, respectively. Moreover, the performance of the proposed hybrid approach is compared with different techniques presented in literature for buildings load forecasting.
... For its winter operation set up, running from January to April 2019 and from November to December 2019, the HVAC system is shifted and locked into heating mode. Table 2 shows that the site temperatures Since weather directly influences the building's energy demand, stressing its envelope and affecting the performance of HVAC production systems [32,40], it is one of the main aspects to carefully observe in order to perform any building energy strategy [41]. This is particularly evident when analyzing cooling/heating performance of outdoor production units like heat pumps and variable refrigerant flow units. ...
Article
Full-text available
To face the challenge of climate change and achieve the decarbonization target set by the European Union, the current trend is to electrify building services, replacing the use of fossil fuels for renewable energy sources. The installation of grid-connected photovoltaic (PV) systems is becoming a popular strategy. However, the widespread application of PV solutions carries certain concerns about grid-network security and stability, since intermittent renewable energy excess pouring into the grid may exceed voltage limits. Therefore, an optimization of the consumption of a building's own PV production (self-consumption) to reduce the excess output is vital. The following paper performs a demand side optimization strategy of the building's thermostatic controllable loads (heating and cooling), which represent at least 50% of the total energy consumed by the building. The process is applied in a previously calibrated building energy model (BEM) that describes a fully operational building under a typical Mediterranean climate (Greece). The site contains a PV plant and a multi-split Variable Refrigerant Flow (VRF) system dedicated to maintain indoor comfort conditions. The technology used is simple, able to perform 15 minute time-step yearly optimizations while saving a large amount of computational time. It performs a bi-dimensional optimization of both: indoor thermal-zone set-points and ventilation air supply temperature. The optimization process performed is based on 2019 data gathered from European Project SABINA, resulting in a self-consumption improvement of 11.6% for summer scenario (reaching 69.16%) and 78.7% for winter (reaching 57.47%) in comparison to a non-optimized “business as usual” base model.
... Examples of probabilistic household load forecasting for the individual dwelling can be found in the literature, either with black box models [38,55,56], or with white box models, appropriately postprocessed with uncertainty analysis techniques [57], in contrast, forecasting at low levels of aggregation appears to be poorly studied and lacks an approach to probabilistic forecasting of domestic energy demand that combines the power and accuracy of modern data-driven approaches with the interpretability of the physical and linear models. Such an approach is necessary to enable forecasting not only the overall energy demand of a community, but also that of the load components such as those related to indoor heating and cooling systems. ...
Article
Full-text available
Indoor heating and cooling systems largely influence the power demand of residential buildings and can play a significant role in the Demand Side Management for energy communities. We propose a novel method for probabilistic forecasting of the total load of a residential community and its base and thermal components, combining conformalized quantile regression and causal machine learning techniques, using only aggregate consumption and environmental conditions data. We applied the proposed methods to the dataset of a residential community in Germany, which includes separate measurements of the total electricity demand and of domestic heating system consumption. The results show that the proposed method produces probabilistic day-ahead hourly forecasts of total energy demand that outperform benchmarks and forecasts of electricity consumption components that are not only more accurate than benchmarks but also close to the accuracy achievable with models trained directly on individual load component data. The T-learner method resulted the most effective among the causal methods for load disaggregation in terms of accuracy, simplicity, and potential for extension.
... Several heating demand forecasting techniques have been investigated in the literature [35]. Traditional approaches are based on white-box models [36,37] in which the system is described with equations that reflect its physics, producing easily interpretable results. When a large amount of historical data is available, like weather forecasts in the past, calendar conditions, and past heat demand, more complex and accurate data-driven models based on regression [38] can be applied to evaluate future consumption. ...
Article
Planning and managing the operation of cogenerative plants (CHP) is increasingly becoming an industrial necessity because of the participation of CHP plants in the day-ahead energy market and the need to deal with heat demand fluctuations. Short-term operational planning usually considers power and heat demand forecast, which can widely fluctuate daily and seasonally, to maximise the net revenue and fulfil the heat requirements. In this framework, a Thermal Storage (TS) can balance day-night fluctuations due to outdoor temperature, as well as unexpected energy surplus and deficiencies caused by heat demand forecast errors, thus giving the CHP plant more operational flexibility. In this paper, the capability of a TS to balance thermal load fluctuations and forecast errors is investigated when a Machine Learning (ML) thermal load forecast for short-term predictions up to 48 h is used. The TS is modelled as a layered storage tank with perfectly mixed layers. Weather and consumption data from 2018 to 2020 related to a large greenhouse powered by a CHP plant located in Tuscany were used as a case study to perform the training and validation of the forecast model as well as to analyse the TS capability in balancing fluctuations with volumes ranging from 500 up to 10,000 m 3. Several ML algorithms were used and compared against a naive prediction based on load persistency. Support Vector Machine (SVM) resulted as the best-performing algorithm. Using SVM leads to better exploitation of the TS capacity, compared to persistence, leading to a more regular State of Charge (SOC) trend and allowing the system to operate within expected conditions up to 80 % of the year. In contrast, a more naïve forecast approach brings to relevant volume size increase to achieve equal performance. Finally, a more accurate forecast reduced the TS size to 50 %, potentially cutting the investment and operational costs compared to the load persistency forecast strategy.
... This information can be used by energy grid operators to make more informed decisions and reduce the risk of energy shortages or surpluses. Probabilistic load forecasting considers the uncertainty associated with load demand predictions [82]. Bayesian networks can capture the probabilistic dependencies between different variables and estimate the probability distribution of load demand [83]. ...
Preprint
Full-text available
The optimization of the energy grid is a critical task for ensuring a sustainable and efficient energy future. Deep machine learning techniques have the potential to improve energy grid optimization by predicting energy demands and supplies, optimizing energy production and distribution, and detecting and preventing fraud. However, there are also several challenges associated with the use of deep machine learning in energy grid optimization. These include the lack of standardized datasets and data quality issues, interpretability and explain-ability of machine learning models, ethical and social implications of using machine learning, and integration with existing energy infrastructure and regulatory frameworks. Having a clear understanding that continued research and developments in deep learning applications to energy field are crucial for achieving a sustainable and efficient energy future. This paper therefore reviewed existing literature for challenges and opportunities associated with deep machine learning in energy grid optimization; and highlights the importance of continued research and development in the field. The paper found out that opportunities for future research in deep machine learning for energy grid optimization include advancements in machine learning algorithms and techniques, development of new datasets and data collection methods, integration of machine learning with other emerging technologies. It also established needs for collaborative research and public-private partnerships.
Article
The study searches for a probabilistic forecasting model for electric power demand based on the correlation between weather temperature and electric power demand using hourly time series data spanning a year. The model is based on the temperature and power demand data's cumulative probability distribution functions or CDFs. In probabilistic forecasting, a range of potential values and their associated probabilities are predicted in addition to a single value for a given range. The qualities of CDF modeling allow it to cover these uncertainties. For scenario 1, case 2, and case 3, the mean absolute percentage error (MAPE)-based forecasting accuracy for the hourly load demand is 3.93, 4.64, and 2.56, respectively. The Gaussian distribution-based CDFs' capacity to forecast outside of the present data range is an extra advantage. The world's electric power networks are growing increasingly intricate and unpredictable through the integration of several distributed energy resources (DERs) and cutting-edge technology from renewable energy resources (RERs). Other variables, such as variations in weather patterns, shifts in the status of the economy, or adjustments to energy policies, can also add to uncertainty and not be foreseen. A body of literature demonstrates a shortage of research on LTLF and general-purpose forecasting methods. The proposed approach is flexible and might establish a standard for many different forecasting tasks. The proposed method is intended to be a flexible strategy that may be used for different utilities with varying time horizons.
Article
With the increasing global demand for new energy sources, Photovoltaic (PV) is increasingly emphasized as a renewable energy source globally. Consequently, the assessment of PV resources has become crucial. Existing single frameworks and algorithms for PV resource assessment lead to low assessment accuracy. To alleviate the deficiency, this study proposes a two-layer network algorithm based on Xception and VGG19 for the evaluation of PV resources. The proposed method combines Xception convolutional neural network and VGG19 convolutional neural network. In addition, this study constructs a two-layer network framework based on the two-layer network algorithm. The feasibility and reliability of the proposed method are verified by simulating the proposed method under the case. Compared with existing algorithms, the proposed method and framework can improve the accuracy of PV resource assessment.
Article
Forecasting renewable energy efficiency significantly impacts system management and operation because more precise forecasts mean reduced risk and improved stability and reliability of the network. There are several methods for forecasting and estimating energy production and demand. This paper discusses the significance of artificial neural network (ANN), machine learning (ML), and Deep Learning (DL) techniques in predicting renewable energy and load demand in various time horizons, including ultra-short-term, short-term, medium-term, and long-term. The purpose of this study is to comprehensively review the methodologies and applications that utilize the latest developments in ANN, ML, and DL for the purpose of forecasting in microgrids, with the aim of providing a systematic analysis. For this purpose, a comprehensive database from the Web of Science was selected to gather relevant research studies on the topic. This paper provides a comparison and evaluation of all three techniques for forecasting in microgrids using tables. The techniques mentioned here assist electrical engineers in becoming aware of the drawbacks and advantages of ANN, ML, and DL in both load demand and renewable energy forecasting in microgrids, enabling them to choose the best techniques for establishing a sustainable and resilient microgrid ecosystem.
Article
Full-text available
The digital world is spreading to all sectors of the economy, and Industry 4.0, with the digital twin, is a reality in the building sector. Energy reduction and decarbonization in buildings are urgently required. Models are the base for prediction and preparedness for uncertainty. Building energy models have been a growing field for a long time. This paper proposes a novel calibration methodology for a building energy model based on two pillars: simplicity, because there is an important reduction in the number of parameters (four) to be adjusted, and cost-effectiveness, because the methodology minimizes the number of sensors provided to perform the process by 47.5%. The new methodology was validated empirically and comparatively based on a previous work carried out in Annex 58 of the International Energy Agency (IEA). The use of a tested and structured experiment adds value to the results obtained.
Article
Full-text available
Air conditioning in buildings accounts for 60% of the total energy consumption. Therefore, accurate predictions of energy consumption are needed to properly manage the energy consumption of buildings. For this purpose, many studies have been conducted recently on the prediction of energy consumption of buildings using machine learning techniques. The energy consumption of the air handling unit (AHU) and absorption chiller in an actual building’s air conditioning system is predicted in this paper using prediction models that are based on artificial neural networks (ANNs), which simply and accurately allow us to forecast energy consumption with limited variables. Using these ANN models, the energy usage of the AHU and chiller could be predicted by collecting a month’s worth of driving data during the summer cooling period. After the forecast models had been verified, the AHU prediction model showed performance in the ranges of 13.27% to 15.25% and 19.42% to 19.53% for the training period and testing period, respectively, and the mean bias error (MBE) ranges were 4.03% to 4.97% and 3.48% to 4.39% for the training period and testing period, respectively. The chiller prediction model satisfied the energy consumption forecast performance criteria presented by American Society of Heating, Refrigerating and Air-Conditioning Engineers (ASHRAE) guideline 14 (the measurement of energy and demand savings), with a performance of 24.64~25.58% and 7.12~29.39% in the training period and testing period, respectively, and MBE ranges of 2.59~3.40% and 1.35~2.87% in the training period and testing period, respectively. When the training period and testing period were combined for the AHU data, the actual energy usage forecast showed a lower error rate range of 0.22% to 1.11% for the training period and 0.17% to 2.44% for the testing period. For the chiller data, the error rate range was 0.22% to 2.12% for the entire training period, but was somewhat higher at 11.67% to 15.18% for the testing period. The study found that, even if the performance criteria were met, high accuracy results were not obtained, which was due to the poor data set quality. Although the forecast model based on artificial neural network can achieve relatively high-accuracy results with sufficient amounts of data, it is believed that this will require a thorough verification of the data used, as well as improvements in the predictive model to avoid overfitting and underfitting, to achieve such good results.
Article
Full-text available
The use of building energy models (BEMs) is becoming increasingly widespread for assessing the suitability of energy strategies in building environments. The accuracy of the results depends not only on the fit of the energy model used, but also on the required external files, and the weather file is one of the most important. One of the sources for obtaining meteorological data for a certain period of time is through an on-site weather station; however, this is not always available due to the high costs and maintenance. This paper shows a methodology to analyze the impact on the simulation results when using an on-site weather station and the weather data calculated by a third-party provider with the purpose of studying if the data provided by the third-party can be used instead of the measured weather data. The methodology consists of three comparison analyses: weather data, energy demand, and indoor temperature. It is applied to four actual test sites located in three different locations. The energy study is analyzed at six different temporal resolutions in order to quantify how the variation in the energy demand increases as the time resolution decreases. The results showed differences up to 38% between annual and hourly time resolutions. Thanks to a sensitivity analysis, the influence of each weather parameter on the energy demand is studied, and which sensors are worth installing in an on-site weather station are determined. In these test sites, the wind speed and outdoor temperature were the most influential weather parameters.
Article
Full-text available
The accuracy of wind power forecasting depends a great deal on the data quality, which is so susceptible to cybersecurity attacks. In this paper, we study the cybersecurity issue of short-term wind power forecasting. We present one class of data attacks, called false data injection attacks, against wind power deterministic and probabilistic forecasting. We show that any malicious data can be injected to historical data without being discovered by one of the commonly-used anomaly detection techniques. Moreover, we testify that attackers can launch such data attacks even with limited resources. To study the impact of data attacks on the forecasting accuracy, we establish the framework of simulating false data injection attacks using the Monte Carlo method. Then, the robustness of six representative wind power forecasting models is tested. Numerical results on real-world data demonstrate that the support vector machine and k-nearest neighbors combined with kernel density estimator are the most robust deterministic and probabilistic forecasting ones among six representative models, respectively. Nevertheless, none of them can issue accurate forecasts under very strong false data attacks. This presents a serious challenge to the community of wind power forecasting. The challenge is to study robust wind power forecasting models dealing with false data attacks.
Article
Full-text available
Nowadays, since energy management of buildings contributes to the operation cost, many efforts are made to optimize the energy consumption of buildings. In addition, the most consumed energy in the buildings is assigned to the indoor heating and cooling comforts. In this regard, this paper proposes a heating and cooling load forecasting methodology, which by taking this methodology into the account energy consumption of the buildings can be optimized. Multilayer perceptron (MLP) and support vector regression (SVR) for the heating and cooling load forecasting of residential buildings are employed. MLP and SVR are the applications of artificial neural networks and machine learning, respectively. These methods commonly are used for modeling and regression and produce a linear mapping between input and output variables. Proposed methods are taught using training data pertaining to the characteristics of each sample in the dataset. To apply the proposed methods, a simulated dataset will be used, in which the technical parameters of the building are used as input variables and heating and cooling loads are selected as output variables for each network. Finally, the simulation and numerical results illustrates the effectiveness of the proposed methodologies.
Article
Full-text available
Traditionally, the choices to balance the grid and meet its peaking power needs are by installing more spinning reserves or perform load shedding when it becomes too much. This problem becomes worse as more intermittent renewable energy resources are installed, forming a substantial amount of total capacity. Advancements in Energy Storage System (ESS) provides the utility new ways to balance the grid and to meet its peak demand by storing un-used off peak energy for peak usage. Large sized ESS—mega watt (MW) level—are installed by different utilities at their substations to provide the high speed grid stabilization to balance the grid to avoid installing more capacity or triggering any current load shedding schemes. However, such large sized ESS systems and their required inverters are costly to install, require much space and their efficacy could also be limited due to network fault current limits and impedances. In this paper, we propose a novel approach and trial for 3000+ homes in Singapore of achieving a large capacity of demand management by developing a smart distribution board (DB) in each home with the high speed metering sensors (>6 kHz sampling rate) and non-intrusive load monitoring (NILM) algorithm, that can assist home users to perform the load/appliance profile identification with daily usage patterns and allow targeted load interruption using the smart sockets/plugs provided. By allowing load shedding at device or appliance level, while knowing their usage profile and preferences, this can allow such an approach to become part of a new voluntary interruptible load management system (ILMS) that requires little user intervention, while minimizing disruption to them, allowing ease of mass participation and thus achieving the intended MW demand management capacities for the grid. This allows for a more cost effective way to better balance the grid without the need for generation capacity growth, large ESS investment while improving the way to perform load shedding without disruptions to entire districts. Simply, home users can now know and participate with the grid in interruptible load (IL) schemes to target specific home appliance, such as water heaters or air conditioning, allowing interruptions during certain times of the day, instead of the entire house, albeit with the right incentives. This allows utilities to achieve MW capacity load shedding with millions of appliances with their preferences, and most importantly, with minimal disruptions to their consumers quality of life. In our paper, we will also consider coupling a small sized Home Energy Storage System (HESS) to amplify the demand management capacity. The proposed approach does not require any infrastructure or wiring changes and is highly scalable. Simulation results demonstrate the effectiveness of the NILM algorithm and achieving high capacity grid demand management. This approach of taking user preferences for appliance level load shedding was developed from the results of a survey of 500 households that indicates >95% participation if they were able to control their choices, possibly allowing this design to be the most successful demand management program than any large ESS solution for the utility. The proposed system has the ability to operate in centralized as part of a larger Energy Management System (EMS) Supervisory Control And Data Acquisition (SCADA) that decide what to dispatch as well as in autonomous modes making it simpler to manage than any MW level large ESS setup. With the availability of high-speed sampling at the DB level, it can rely on EMS SCADA dispatch or when disconnected, rely on the decaying of the grid frequency measured at the metering point in the Smart DB. Our simulation results demonstrate the effectiveness of our proposed approach for fast grid balancing.
Article
Full-text available
This paper presents a hierarchical deep reinforcement learning (DRL) method for the scheduling of energy consumptions of smart home appliances and distributed energy resources (DERs) including an energy storage system (ESS) and an electric vehicle (EV). Compared to Q-learning algorithms based on a discrete action space, the novelty of the proposed approach is that the energy consumptions of home appliances and DERs are scheduled in a continuous action space using an actor–critic-based DRL method. To this end, a two-level DRL framework is proposed where home appliances are scheduled at the first level according to the consumer’s preferred appliance scheduling and comfort level, while the charging and discharging schedules of ESS and EV are calculated at the second level using the optimal solution from the first level along with the consumer environmental characteristics. A simulation study is performed in a single home with an air conditioner, a washing machine, a rooftop solar photovoltaic system, an ESS, and an EV under a time-of-use pricing. Numerical examples under different weather conditions, weekday/weekend, and driving patterns of the EV confirm the effectiveness of the proposed approach in terms of total cost of electricity, state of energy of the ESS and EV, and consumer preference.
Article
Full-text available
Due to industrialization and the rising demand for energy, global energy consumption has been rapidly increasing. Recent studies show that the biggest portion of energy is consumed in residential buildings, i.e., in European Union countries up to 40% of the total energy is consumed by households. Most residential buildings and industrial zones are equipped with smart sensors such as metering electric sensors, that are inadequately utilized for better energy management. In this paper, we develop a hybrid convolutional neural network (CNN) with an long short-term memory autoencoder (LSTM-AE) model for future energy prediction in residential and commercial buildings. The central focus of this research work is to utilize the smart meters’ data for energy forecasting in order to enable appropriate energy management in buildings. We performed extensive research using several deep learning-based forecasting models and proposed an optimal hybrid CNN with the LSTM-AE model. To the best of our knowledge, we are the first to incorporate the aforementioned models under the umbrella of a unified framework with some utility preprocessing. Initially, the CNN model extracts features from the input data, which are then fed to the LSTM-encoder to generate encoded sequences. The encoded sequences are decoded by another following LSTM-decoder to advance it to the final dense layer for energy prediction. The experimental results using different evaluation metrics show that the proposed hybrid model works well. Also, it records the smallest value for mean square error (MSE), mean absolute error (MAE), root mean square error (RMSE) and mean absolute percentage error (MAPE) when compared to other state-of-the-art forecasting methods over the UCI residential building dataset. Furthermore, we conducted experiments on Korean commercial building data and the results indicate that our proposed hybrid model is a worthy contribution to energy forecasting.
Article
Accurate and reliable cooling load prediction is the basis for model predictive control (MPC) of HVAC systems. Cooling load prediction process is constrained by many input variables. These input variables contain different kinds of uncertainties that should not be ignored in pursuing a reliable prediction result. However, previous load prediction methods usually improve the prediction accuracy by optimizing model structure but fail to consider the influence of input variables uncertainties. To reduce the influence of uncertainty of input variables forecast data scientifically, this paper proposes an improved cooling load prediction reliability method in which the input variables are calibrated offline with Monte-Carlo simulations and stochastic treatment before inputting them into the prediction model. A machine learning algorithm (support vector machine, SVM) is used as the prediction model. Uncertainties in weather parameters, indoor heat gains, and historical cooling load were taken into consideration. Uncertainty analysis results showed that prediction accuracy with calibrated input variables was much better (R² = 0.9627) than those predicted with forecast input data directly (R² = 0.9488), which was closer to load prediction obtained by using measured input data (R² = 0.9632). The coefficient of variation (CV) value using the calibrated input data was less than 15%. The reliable cooling load prediction results are beneficial for more informed, reliable decisions on MPC application, thereby promoting energy conservation potential of HVAC systems.
Article
Building energy prediction plays a vital role in developing a model predictive controller for consumers and optimizing energy distribution plan for utilities. Common approaches for energy prediction include physical models, data-driven models and hybrid models. Among them, data-driven approaches have become a popular topic in recent years due to their ability to discover statistical patterns without expertise knowledge. To acquire the latest research trends, this study first summarizes the limitations of earlier reviews: seldom present comprehensive review for the entire data-driven process for building energy prediction and rarely summarize the input updating strategies when applying the trained data-driven model to multi-step energy prediction. To overcome these gaps, this paper provides a comprehensive review on building energy prediction, covering the entire data-driven process that includes feature engineering, potential data-driven models and expected outputs. The distribution of 105 papers, which focus on building energy prediction by data-driven approaches, are reviewed over data source, feature types, model utilization and prediction outputs. Then, in order to implement the trained data-driven models into multi-step prediction, input updating strategies are reviewed to deal with the time series property of energy related data. Finally, the review concludes with some potential future research directions based on discussion of existing research gaps.