ArticlePDF Available

Deep Learning Neural Networks for Short-Term PV Power Forecasting via Sky Image Method

MDPI
Energies
Authors:

Abstract and Figures

Solar photovoltaic (PV) power generation is prone to drastic changes due to cloud cover. The power is easily affected within a very short period of time. Thus, the accuracy of grasping cloud distribution is important for PV power forecasting. This study proposes a novel sky image method to obtain the cloud coverage rate used for short-term PV power forecasting. The authors developed an image analysis algorithm from the sky images obtained by an on-site whole sky imager (WSI). To verify the effectiveness of cloud coverage rate as the parameter for PV power forecast, four different combinations of weather features were used to compare the accuracy of short-term PV power forecasting. In addition to the artificial neural network (ANN) model, long short-term memory (LSTM) and the gated recurrent unit (GRU) were also introduced to compare their applicability conditions. After a comprehensive analysis, the coverage rate is the key weather feature, which can improve the accuracy by about 2% compared to the case without coverage feature. It also indicates that the LSTM and GRU models revealed better forecast results under different weather conditions, meaning that the cloud coverage rate proposed in this study has a significant benefit for short-term PV power forecasting.
This content is subject to copyright.
Citation: Kuo, W.-C.; Chen, C.-H.;
Chen, S.-Y.; Wang, C.-C. Deep
Learning Neural Networks for
Short-Term PV Power Forecasting via
Sky Image Method. Energies 2022,15,
4779. https://doi.org/10.3390/
en15134779
Academic Editor: Antonino
Laudani
Received: 30 May 2022
Accepted: 27 June 2022
Published: 29 June 2022
Publisher’s Note: MDPI stays neutral
with regard to jurisdictional claims in
published maps and institutional affil-
iations.
Copyright: © 2022 by the authors.
Licensee MDPI, Basel, Switzerland.
This article is an open access article
distributed under the terms and
conditions of the Creative Commons
Attribution (CC BY) license (https://
creativecommons.org/licenses/by/
4.0/).
energies
Article
Deep Learning Neural Networks for Short-Term PV Power
Forecasting via Sky Image Method
Wen-Chi Kuo 1,* , Chiun-Hsun Chen 2, Sih-Yu Chen 1and Chi-Chuan Wang 1
1Department of Mechanical Engineering, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan;
h860731abc.me08g@nctu.edu.tw (S.-Y.C.); ccwang@nycu.edu.tw (C.-C.W.)
2Department of Aerospace and Systems Engineering, Feng Chia University, Taichung 407, Taiwan;
chiunhchen@fcu.edu.tw
*Correspondence: wenchi.en08@nycu.edu.tw
Abstract:
Solar photovoltaic (PV) power generation is prone to drastic changes due to cloud cover.
The power is easily affected within a very short period of time. Thus, the accuracy of grasping
cloud distribution is important for PV power forecasting. This study proposes a novel sky image
method to obtain the cloud coverage rate used for short-term PV power forecasting. The authors
developed an image analysis algorithm from the sky images obtained by an on-site whole sky imager
(WSI). To verify the effectiveness of cloud coverage rate as the parameter for PV power forecast,
four different combinations of weather features were used to compare the accuracy of short-term PV
power forecasting. In addition to the artificial neural network (ANN) model, long short-term memory
(LSTM) and the gated recurrent unit (GRU) were also introduced to compare their applicability
conditions. After a comprehensive analysis, the coverage rate is the key weather feature, which can
improve the accuracy by about 2% compared to the case without coverage feature. It also indicates
that the LSTM and GRU models revealed better forecast results under different weather conditions,
meaning that the cloud coverage rate proposed in this study has a significant benefit for short-term
PV power forecasting.
Keywords: deep learning (DL); forecasting; neural network; renewable energy; solar power genera-
tion; sky image
1. Introduction
Solar photovoltaics (PVs) have an intermittent characteristic and are easily affected by
the climate environment, cloud cover, and sunlight intensity. These weather conditions will
cause the instability of renewable energy power generation [
1
]. Therefore, it is imperative
to provide good weather forecasting to master the power generation of renewable energy.
Furthermore, the weather parameters in different forecasting time horizons are also slightly
different. The input meteorological factors of short-term forecasting are usually cloud cover
and wind direction. The weather parameters of medium-term forecasting are temperature,
humidity, irradiance intensity, wind speed, and wind direction from the site of some
meteorological equipment in the local area. For the long-term forecasting, the long-term
observation data of the weather forecast needs to be considered [
2
,
3
]. There have been
many studies that have explored the influence of weather parameters on the accuracy of
PV power forecasting. This research includes finding the best weather influencing factor,
comparing the forecasting deviation under different weather, or adding parameters that
worsen the forecasting results [47].
The cloud coverage and solar irradiation prediction are the important weather param-
eters that directly influence the variety of solar photovoltaic power from minutes to hours.
The cloud coverage and solar irradiation can be obtained from the analysis of the sky im-
age. In solar irradiance prediction, image pixels are processed to identify clear/thin/thick
Energies 2022,15, 4779. https://doi.org/10.3390/en15134779 https://www.mdpi.com/journal/energies
Energies 2022,15, 4779 2 of 17
clouds, the function of the solar pixel angle (SPA), image zenith angle (IZA), and solar
zenith angle (SZA) [
8
10
]. In addition, the solar irradiation prediction value is obtained
through analyzing the difference in the picture pixel brightness, the difference in the contin-
uous picture pixel and cloud image trajectory, and used as one of the learning parameters
for solar photovoltaic forecasting [1115].
However, pyrheliometer equipment is installed in most of the PV plants, which can
easily obtain the ultra-violet index (UVI) value, which represents the solar radiation index.
Therefore, the prior research directly used the sky image to analyze the cloud trajectory
or cloud cover and directly used it as a forecasting method for PV power generation. In
particular, Zhang et al. [
16
] used CNN, integrating sky images by stacking four exposures in
grayscale images for the 60 s before the current time t with a 15 s interval. The result showed
that using the LSTM-based model yielded a 21% RMSE skill score that outperformed other
approaches. Kong et al. [
17
] proposed a novel approach based on deep whole-sky-image
learning architectures for 4 to 20 min of solar photovoltaic generation forecasting. The
experimental results showed that the hybrid static image forecaster provided superior
performance, up to an 8.3% improvement in general and up to 32.8% improvement in
the cases of ramp events when compared to that without sky images. Both Wang et al.
and Zhen et al. [
18
,
19
] proposed a hybrid mapping model for solar PV power forecasting,
and their models yielded higher accuracy than the CNN, LSTM, and ANN models. In
the data processing, they filtered the noises of the original sky image and reduced the
image resolution. However, they trained the dataset in three different ways including
color images with a 3D form, stitching of the three R, G, B channels, and all pixel values
of the image. On the other hand, Wen et al. [
20
] used the sky image parameter in solar
photovoltaic prediction for industrial applications. The results showed that it could be used
as a reference basis for smooth photovoltaic power generation control upon cloudy and
sunny days, thereby reducing the energy curtailment. Furthermore, the sky image analysis
methods they used are complicated and difficult to implement. Therefore, the present sky
image analysis method is much simpler and easier to implement in this paper.
This paper proposes a new and interesting image pixel calculation formula for image
processing. It obtains the cloud coverage rate per minute based on the defined threshold as
one of the PV power forecasting parameters. The advantage of the proposed method in
comparison with the previous literature is that these values are easy to parse and sequence
for the time series model. Furthermore, the collected data can be easily integrated with
other weather data quickly and introduced into the PV forecasting deep learning model
to learn and forecast. In particular, the cloud coverage rate can improve the PV power
forecasting accuracy of this study. The structure of this article is as follows. Section 2
describes the solar photovoltaic forecasting framework and the local weather equipment.
Section 3focuses on the proposed sky image coverage analysis method, and the deep
learning models are introduced. Section 4presents and discusses the experimental results
with different weather combinations. Finally, Section 5concludes this article.
2. Forecasting Procedures and Data Set
In this section, the PV forecasting operation procedures is introduced, as shown in
Figure 1, which contains four main processes including the data collection, data processing,
model building and training, and forecasting processes. The details are described as
follows.
Energies 2022,15, 4779 3 of 17
Energies 2022, 15, x FOR PEER REVIEW 3 of 17
Figure 1. The PV power forecasting procedures, introducing Data Collection, Data Preprocessing,
Model Building & Training, and Forecasting Processes.
2.1. Data Collection
The equipment used for data collection in this paper is shown in Figure 2 including
the power data of the 8 kW PV plants, the UVI data from the pyrheliometer, local micro
weather station (contains wind speed and direction, temperature, relative humidity), and
the whole sky imager (WSI). Furthermore, in the overall dataset in this study, the collec-
tion period of the weather and PV data was from 21 February 2021 to 13 June 2021.
Figure 1.
The PV power forecasting procedures, introducing Data Collection, Data Preprocessing,
Model Building & Training, and Forecasting Processes.
2.1. Data Collection
The equipment used for data collection in this paper is shown in Figure 2including
the power data of the 8 kW PV plants, the UVI data from the pyrheliometer, local micro
weather station (contains wind speed and direction, temperature, relative humidity), and
the whole sky imager (WSI). Furthermore, in the overall dataset in this study, the collection
period of the weather and PV data was from 21 February 2021 to 13 June 2021.
Energies 2022, 15, x FOR PEER REVIEW 3 of 17
Figure 1. The PV power forecasting procedures, introducing Data Collection, Data Preprocessing,
Model Building & Training, and Forecasting Processes.
2.1. Data Collection
The equipment used for data collection in this paper is shown in Figure 2 including
the power data of the 8 kW PV plants, the UVI data from the pyrheliometer, local micro
weather station (contains wind speed and direction, temperature, relative humidity), and
the whole sky imager (WSI). Furthermore, in the overall dataset in this study, the collec-
tion period of the weather and PV data was from 21 February 2021 to 13 June 2021.
Figure 2.
The use of the equipment (PV inverter, Weather Station, Whole Sky Imagers) and data
collection schema (using Modbus TCP/IP).
Energies 2022,15, 4779 4 of 17
2.2. Data Processing
The purpose of the present work was to calculate the cloud coverage rate through
the sky image. For simplicity, the authors preprocessed the sky image to exclude the
unnecessary objects in the image including trees, buildings, and the outside black areas
of the image. Then, the information was fed into the proposed RGB sky image processing
method. This pretreatment converts each image into a numerical data type. It calculates
the cloud coverage rate as the weather characteristic value. In the meantime, the proposed
method also preprocesses the weather data from the local small weather station and
examines whether it contains missing data to ensure the photovoltaic data integrity.
2.3. Model Building and Training Processes
Establishing a deep learning model: The deep learning neural network model based
on the ANN, LSTM, and GRU model were applied in this study. In addition, determining
the model hyperparameters is essential. Thus, the parameter turning of hidden units,
training steps, and epochs are incorporated to ensure that there is no overfitting of these
models to maintain the suitability and predictive accuracy of hyperparameters.
2.4. Forecasting Processes
This part uses the model that has already trained hyperparameter and weights for
short-term PV power forecasting. On the other hand, the authors used four different
weather feature combinations and verified the prediction performance of cloud coverage
rate as the parameter for short-term PV power forecasting including Case 1: Six weather
values (wind speed and direction, UVI, temperature, relative humidity, coverage rate); Case
2: Five weather value (without coverage rate); Case 3: Five weather values (without UVI);
and Case 4: Only coverage rate and relative humidity. The predictive abilities of these three
models for short-term PV power forecasting were compared in the final result.
3. Methodology
A new sky image analysis method for short-term PV power generation forecasting
is proposed in this article. In this section, sky image coverage processing, the cloud
computing method, deep learning models (ANN, LSTM, GRU), and evaluation indices will
be introduced separately.
3.1. Sky Image Coverage Processing Method
The sky image processing of this research focuses on RGB pixel composition. In RGB,
the grayscale color refers to converting a color picture into a grayscale picture that is still
not far from the original color, as shown in Equation (1). Since each picture is composed
of different color components, the authors defined
(pixel composition) to represent the
three combined color difference results, and is expressed in Equation (2):
RGB (1)
=(RG)2+(RB)2+(GB)2(2)
Furthermore, to perform the RGB distribution analysis of each element image under
different weather patterns and to analyze the sunny, mostly clear, and cloudy images before
performing the sky image analysis, the authors adopted the Open-Source Computer Vision
(OpenCV) library in python to analyze the sky image and determine the RGB distribution
ratio. The analysis and statistical results are shown in Figure 3.
Energies 2022,15, 4779 5 of 17
Energies 2022, 15, x FOR PEER REVIEW 5 of 17
Figure 3. The results of the average distribution of each element.
After analyzing the color composition, the difference between the blue and green
pixels is about one δ variable. Moreover, the difference between blue and red pixels is
about two δ variables, and the mathematical relationship between the three combined col-
ors is listed in Equation (3). Substituting Equation (3) into Equation (2), subject to some
mathematical manipulation, yields Equation (4).
∀δ:(B−G)(B−R)≅2δ (3)
= (δ)+()+(δ) (4)
The statistical analysis shows that the change in is mainly because of the color dif-
ference between the red and blue pixels. Therefore, the deviation in the green pixel in the
sky image analysis occurs. In order to maintain the same importance among the three
pixels (R, G, B), the authors employed the weighting coefficient method to adjust the re-
lationship of the three pixels. Finally, this study proposes the general mathematical for-
mula of the RGB analysis method as follows:
= 4 × (R G)+(RB)+4×(G−B) (5)
Figure 3. The results of the average distribution of each element.
After analyzing the color composition, the difference between the blue and green
pixels is about one
δ
variable. Moreover, the difference between blue and red pixels is
about two
δ
variables, and the mathematical relationship between the three combined
colors is listed in Equation (3). Substituting Equation (3) into Equation (2), subject to some
mathematical manipulation, yields Equation (4).
δN:(BG)=δ(BR)
=2δ(3)
=(δ)2+(4δ)2+(δ)2(4)
The statistical analysis shows that the change in
is mainly because of the color
difference between the red and blue pixels. Therefore, the deviation in the green pixel
in the sky image analysis occurs. In order to maintain the same importance among the
three pixels (R, G, B), the authors employed the weighting coefficient method to adjust
the relationship of the three pixels. Finally, this study proposes the general mathematical
formula of the RGB analysis method as follows:
=4×(RG)2+(RB)2+4×(GB)2(5)
3.2. Cloud Covering Analysis Structure and Calculation Method
The cloud covering analysis structure is shown in Figure 4. This process can be divided
into the following four parts:
(1)
Calculate the Pixel Composition
Energies 2022,15, 4779 6 of 17
Energies 2022, 15, x FOR PEER REVIEW 6 of 17
3.2. Cloud Covering Analysis Structure and Calculation Method
The cloud covering analysis structure is shown in Figure 4. This process can be di-
vided into the following four parts:
(1) Calculate the Pixel Composition
Calculate the pixel composition and set up the initial threshold (TS) value: The pixel
composition is determined by Equation (5). The initial value of TS is random, and it will
increase by one after each cycle until the coverage condition is reached.
(2) Define the Sky Image Covering Limiting
Define the sky image covering limiting condition: By analyzing the pixel composi-
tion, the authors defined the limit condition of the sun, clouds, and sky as listed in Table
1. Sunny conditions prevail when the pixel composition exceeds TS, and the R, G, B ele-
ments are greater than 760. Conversely, it is regarded as cloudy if the value is less than
760. Furthermore, sky conditions occur if the pixel composition is less than or equal to TS
and the blue value is larger than the red and green values at the same time.
Figure 4. The cloud covering process.
Figure 4. The cloud covering process.
Calculate the pixel composition and set up the initial threshold (TS) value: The pixel
composition is determined by Equation (5). The initial value of TS is random, and it will
increase by one after each cycle until the coverage condition is reached.
(2)
Define the Sky Image Covering Limiting
Define the sky image covering limiting condition: By analyzing the pixel composition,
the authors defined the limit condition of the sun, clouds, and sky as listed in Table 1.
Sunny conditions prevail when the pixel composition exceeds TS, and the R, G, B elements
are greater than 760. Conversely, it is regarded as cloudy if the value is less than 760.
Furthermore, sky conditions occur if the pixel composition is less than or equal to TS and
the blue value is larger than the red and green values at the same time.
(3)
Cloud Covering Calculation
The authors defined the cloud coverage rate in Equation (6). In addition, there will be
a similar coverage rate in a specific range by analyzing the time-series sky images through
testing. Therefore, when the coverage rate increases and the slope decreases, the program
will record the coverage rate as the current value in this cycle. Table 2depicts the result of
the automatic convergence example. From this table, when TS is 16, the cloud coverage
rate is 49.95%, and when TS is 18, the cloud coverage rate changes by 15%. In addition,
Energies 2022,15, 4779 7 of 17
the sky image shows that it will be over-covered when the coverage rate = 61.53% and the
TS = 18, thereby revealing the non-cloud situations.
Cloud coverage rate(%) = [1all pixels mask0s pixels cloud0s pixels
all pixels cloud0s pixels](6)
Table 1. The sky image covering limiting condition.
Item Limit Condition Pixel Status Pixel Status
(pixel composition)
Threshold (TS)
(R,G,B)>760 Sun Remain
(R,G,B)760 Cloud Cover
(pixel composition)
<
Threshold (TS)
B>R
B>G
Sky Remain
Others Ex: Tree Cover
Table 2. The results of the automatic convergence.
TS Sky Image Cover Result Coverage Rate
12
Energies 2022, 15, x. https://doi.org/10.3390/xxxxx www.mdpi.com/journal/energies
TS Sky Image Cover Result Coverage Rate
12
32.52%
14
44.83%
16
45.95%
18
61.53%
(1) Update the Threshold Value
Update the threshold value: In this study, updating the threshold value for every 15
min was adopted to accelerate the cloud image analysis. Through the testing experience,
each coverage analysis was about 6 to 7 s, which was sufficient for short-term photovol-
taics forecasting.
32.52%
14
Energies 2022, 15, x. https://doi.org/10.3390/xxxxx www.mdpi.com/journal/energies
TS Sky Image Cover Result Coverage Rate
12
32.52%
14
44.83%
16
45.95%
18
61.53%
(1) Update the Threshold Value
Update the threshold value: In this study, updating the threshold value for every 15
min was adopted to accelerate the cloud image analysis. Through the testing experience,
each coverage analysis was about 6 to 7 s, which was sufficient for short-term photovol-
taics forecasting.
44.83%
16
Energies 2022, 15, x. https://doi.org/10.3390/xxxxx www.mdpi.com/journal/energies
TS Sky Image Cover Result Coverage Rate
12
32.52%
14
44.83%
16
45.95%
18
61.53%
(1) Update the Threshold Value
Update the threshold value: In this study, updating the threshold value for every 15
min was adopted to accelerate the cloud image analysis. Through the testing experience,
each coverage analysis was about 6 to 7 s, which was sufficient for short-term photovol-
taics forecasting.
45.95%
18
Energies 2022, 15, x. https://doi.org/10.3390/xxxxx www.mdpi.com/journal/energies
TS Sky Image Cover Result Coverage Rate
12
32.52%
14
44.83%
16
45.95%
18
61.53%
(1) Update the Threshold Value
Update the threshold value: In this study, updating the threshold value for every 15
min was adopted to accelerate the cloud image analysis. Through the testing experience,
each coverage analysis was about 6 to 7 s, which was sufficient for short-term photovol-
taics forecasting.
61.53%
(4)
Update the Threshold Value
Update the threshold value: In this study, updating the threshold value for every
15 min
was adopted to accelerate the cloud image analysis. Through the testing experience,
each coverage analysis was about 6 to 7 s, which was sufficient for short-term photovoltaics
forecasting.
3.3. Choice of The Deep Learning Model
The authors used artificial neural networks and recurrent neural network models
to train and predict the PV power, which are the ANN, LSTM, and GRU models. The
ANN model has a strong nonlinear fitting ability and strong adaptive ability. The LSTM
can control the transmission state through the gated state, remembering the unimportant
Energies 2022,15, 4779 8 of 17
information that needs to be remembered for a long time. The GRU has the same effect as
LSTM, but needs fewer parameters to build models with a large amount of training [
21
23
].
The models have their advantages in training and forecasting. During the training
process, all choices were performed by trying different hyperparameters on the same
training set including hidden units, training steps (Epochs), and input time intervals. Thus,
the authors will discuss which model is more suitable for this field based on the short-term
PV power forecast results in this paper.
3.4. Evaluation Indices
The forecasting performance of PV power prediction models is evaluated using three
statistical indicators, which are the mean absolute error (MAE), root mean squared error
(RMSE), and mean absolute percentage error (MAPE) [
24
]. Their corresponding formulas
are given by Equations (7)–(9):
MAE =1
N
N
i=1
|ˆ
XiXi|(7)
RMSE =v
u
u
t
1
N
N
i=1
(ˆ
XiXi)2(8)
MAPE =1
N
N
i=1
|Xiˆ
Xi
Xi
| × 100% (9)
where
ˆ
Xi
and
Xi
represent the ith forecasted and actual value, respectively, and Nis the
size of the test dataset.
4. Numerical Results
This section evaluates the influence of the coverage rate under different weather
conditions on the performance of one-hour ahead PV power generation forecasting. The
numerical results were comprehensively compared and subsequently discussed.
4.1. Results of Sky Image Processing by RGB Formula
This study proposed two RGB formulas (Equations (2) and (5)) for the sky image
processing. In this section, the authors simulated and compared the threshold ratio of
Equations (2) and (5), which is (1 + 1 + 1):(4 + 1 + 4) = 1:3. Therefore, in the sky image
analysis, the threshold values will represent TS = n
×
30 and TS = n
×
90, where n is a
constant 1, 2, 3
. . .
. Note that it is quite challenging to analyze the difference between the
two RGB analysis methods on sunny days because there were fewer clouds under sufficient
sunlight. However, there was a noticeable difference in cloudy days. Specifically, when
n increased continuously, the difference in cloud covering will become more apparent, as
shown in Tables 3and 4. The result indicates that Equation (5) is superior to Equation (2),
especially on a cloudy day. From the result, Equation (5) is more sensitive and accurate,
and it is the main RGB formula used in this article.
Energies 2022,15, 4779 9 of 17
Table 3. The results of the RGB comparison on a sunny day.
Index Equations (2): TS = n ×30 Equations (5): TS = n ×90
n=1
Energies 2022, 15, x FOR PEER REVIEW 10 of 19
Table 3. The results of the RGB comparison on a sunny day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
Table 4. The results of the RGB comparison on a cloudy day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
4.2. Results of Adjust Hyperparameters
The adjustment of hyperparameters in deep learning has a specific impact on the
training and predictive results. Therefore, the authors first used 21 days of weather infor-
mation (starting from 24February) for training and testing including hidden units, train-
ing steps (Epochs), and input time intervals to find the most suitable hyperparameter com-
binations for the three models (ANN, LSTM, GRU). All data were sorted by the minute.
The fixed parameters were learning rate (learning rate: 1 × 103) and batch size (batch size:
16). Finally, three evaluation indices (MAE, RMSE, and MAPE) were used to evaluate the
model. The results showed that the ANN model used five hidden layers, the number of
Epochs was 3000, and data input interval was one hour; in the LSTM model, it used seven
hidden layers, the number of Epochs was 2500, and the data input interval was 2 h; in the
GRU model, it used six hidden layers, the number of Epochs was 1500, and data input
interval was 3 h. The results of the hyperparameter adjustment are shown in Table 5.
Energies 2022, 15, x FOR PEER REVIEW 10 of 19
Table 3. The results of the RGB comparison on a sunny day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
Table 4. The results of the RGB comparison on a cloudy day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
4.2. Results of Adjust Hyperparameters
The adjustment of hyperparameters in deep learning has a specific impact on the
training and predictive results. Therefore, the authors first used 21 days of weather infor-
mation (starting from 24February) for training and testing including hidden units, train-
ing steps (Epochs), and input time intervals to find the most suitable hyperparameter com-
binations for the three models (ANN, LSTM, GRU). All data were sorted by the minute.
The fixed parameters were learning rate (learning rate: 1 × 103) and batch size (batch size:
16). Finally, three evaluation indices (MAE, RMSE, and MAPE) were used to evaluate the
model. The results showed that the ANN model used five hidden layers, the number of
Epochs was 3000, and data input interval was one hour; in the LSTM model, it used seven
hidden layers, the number of Epochs was 2500, and the data input interval was 2 h; in the
GRU model, it used six hidden layers, the number of Epochs was 1500, and data input
interval was 3 h. The results of the hyperparameter adjustment are shown in Table 5.
n=2
Energies 2022, 15, x FOR PEER REVIEW 10 of 19
Table 3. The results of the RGB comparison on a sunny day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
Table 4. The results of the RGB comparison on a cloudy day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
4.2. Results of Adjust Hyperparameters
The adjustment of hyperparameters in deep learning has a specific impact on the
training and predictive results. Therefore, the authors first used 21 days of weather infor-
mation (starting from 24February) for training and testing including hidden units, train-
ing steps (Epochs), and input time intervals to find the most suitable hyperparameter com-
binations for the three models (ANN, LSTM, GRU). All data were sorted by the minute.
The fixed parameters were learning rate (learning rate: 1 × 103) and batch size (batch size:
16). Finally, three evaluation indices (MAE, RMSE, and MAPE) were used to evaluate the
model. The results showed that the ANN model used five hidden layers, the number of
Epochs was 3000, and data input interval was one hour; in the LSTM model, it used seven
hidden layers, the number of Epochs was 2500, and the data input interval was 2 h; in the
GRU model, it used six hidden layers, the number of Epochs was 1500, and data input
interval was 3 h. The results of the hyperparameter adjustment are shown in Table 5.
Energies 2022, 15, x FOR PEER REVIEW 10 of 19
Table 3. The results of the RGB comparison on a sunny day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
Table 4. The results of the RGB comparison on a cloudy day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
4.2. Results of Adjust Hyperparameters
The adjustment of hyperparameters in deep learning has a specific impact on the
training and predictive results. Therefore, the authors first used 21 days of weather infor-
mation (starting from 24February) for training and testing including hidden units, train-
ing steps (Epochs), and input time intervals to find the most suitable hyperparameter com-
binations for the three models (ANN, LSTM, GRU). All data were sorted by the minute.
The fixed parameters were learning rate (learning rate: 1 × 103) and batch size (batch size:
16). Finally, three evaluation indices (MAE, RMSE, and MAPE) were used to evaluate the
model. The results showed that the ANN model used five hidden layers, the number of
Epochs was 3000, and data input interval was one hour; in the LSTM model, it used seven
hidden layers, the number of Epochs was 2500, and the data input interval was 2 h; in the
GRU model, it used six hidden layers, the number of Epochs was 1500, and data input
interval was 3 h. The results of the hyperparameter adjustment are shown in Table 5.
n=3
Energies 2022, 15, x FOR PEER REVIEW 10 of 19
Table 3. The results of the RGB comparison on a sunny day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
Table 4. The results of the RGB comparison on a cloudy day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
4.2. Results of Adjust Hyperparameters
The adjustment of hyperparameters in deep learning has a specific impact on the
training and predictive results. Therefore, the authors first used 21 days of weather infor-
mation (starting from 24February) for training and testing including hidden units, train-
ing steps (Epochs), and input time intervals to find the most suitable hyperparameter com-
binations for the three models (ANN, LSTM, GRU). All data were sorted by the minute.
The fixed parameters were learning rate (learning rate: 1 × 103) and batch size (batch size:
16). Finally, three evaluation indices (MAE, RMSE, and MAPE) were used to evaluate the
model. The results showed that the ANN model used five hidden layers, the number of
Epochs was 3000, and data input interval was one hour; in the LSTM model, it used seven
hidden layers, the number of Epochs was 2500, and the data input interval was 2 h; in the
GRU model, it used six hidden layers, the number of Epochs was 1500, and data input
interval was 3 h. The results of the hyperparameter adjustment are shown in Table 5.
Energies 2022, 15, x FOR PEER REVIEW 10 of 19
Table 3. The results of the RGB comparison on a sunny day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
Table 4. The results of the RGB comparison on a cloudy day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
4.2. Results of Adjust Hyperparameters
The adjustment of hyperparameters in deep learning has a specific impact on the
training and predictive results. Therefore, the authors first used 21 days of weather infor-
mation (starting from 24February) for training and testing including hidden units, train-
ing steps (Epochs), and input time intervals to find the most suitable hyperparameter com-
binations for the three models (ANN, LSTM, GRU). All data were sorted by the minute.
The fixed parameters were learning rate (learning rate: 1 × 103) and batch size (batch size:
16). Finally, three evaluation indices (MAE, RMSE, and MAPE) were used to evaluate the
model. The results showed that the ANN model used five hidden layers, the number of
Epochs was 3000, and data input interval was one hour; in the LSTM model, it used seven
hidden layers, the number of Epochs was 2500, and the data input interval was 2 h; in the
GRU model, it used six hidden layers, the number of Epochs was 1500, and data input
interval was 3 h. The results of the hyperparameter adjustment are shown in Table 5.
Table 4. The results of the RGB comparison on a cloudy day.
Index Equations (2): TS = n ×30 Equations (5): TS = n ×90
n=1
Energies 2022, 15, x FOR PEER REVIEW 10 of 19
Table 3. The results of the RGB comparison on a sunny day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
Table 4. The results of the RGB comparison on a cloudy day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
4.2. Results of Adjust Hyperparameters
The adjustment of hyperparameters in deep learning has a specific impact on the
training and predictive results. Therefore, the authors first used 21 days of weather infor-
mation (starting from 24February) for training and testing including hidden units, train-
ing steps (Epochs), and input time intervals to find the most suitable hyperparameter com-
binations for the three models (ANN, LSTM, GRU). All data were sorted by the minute.
The fixed parameters were learning rate (learning rate: 1 × 103) and batch size (batch size:
16). Finally, three evaluation indices (MAE, RMSE, and MAPE) were used to evaluate the
model. The results showed that the ANN model used five hidden layers, the number of
Epochs was 3000, and data input interval was one hour; in the LSTM model, it used seven
hidden layers, the number of Epochs was 2500, and the data input interval was 2 h; in the
GRU model, it used six hidden layers, the number of Epochs was 1500, and data input
interval was 3 h. The results of the hyperparameter adjustment are shown in Table 5.
Energies 2022, 15, x FOR PEER REVIEW 10 of 19
Table 3. The results of the RGB comparison on a sunny day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
Table 4. The results of the RGB comparison on a cloudy day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
4.2. Results of Adjust Hyperparameters
The adjustment of hyperparameters in deep learning has a specific impact on the
training and predictive results. Therefore, the authors first used 21 days of weather infor-
mation (starting from 24February) for training and testing including hidden units, train-
ing steps (Epochs), and input time intervals to find the most suitable hyperparameter com-
binations for the three models (ANN, LSTM, GRU). All data were sorted by the minute.
The fixed parameters were learning rate (learning rate: 1 × 103) and batch size (batch size:
16). Finally, three evaluation indices (MAE, RMSE, and MAPE) were used to evaluate the
model. The results showed that the ANN model used five hidden layers, the number of
Epochs was 3000, and data input interval was one hour; in the LSTM model, it used seven
hidden layers, the number of Epochs was 2500, and the data input interval was 2 h; in the
GRU model, it used six hidden layers, the number of Epochs was 1500, and data input
interval was 3 h. The results of the hyperparameter adjustment are shown in Table 5.
n=2
Energies 2022, 15, x FOR PEER REVIEW 10 of 19
Table 3. The results of the RGB comparison on a sunny day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
Table 4. The results of the RGB comparison on a cloudy day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
4.2. Results of Adjust Hyperparameters
The adjustment of hyperparameters in deep learning has a specific impact on the
training and predictive results. Therefore, the authors first used 21 days of weather infor-
mation (starting from 24February) for training and testing including hidden units, train-
ing steps (Epochs), and input time intervals to find the most suitable hyperparameter com-
binations for the three models (ANN, LSTM, GRU). All data were sorted by the minute.
The fixed parameters were learning rate (learning rate: 1 × 103) and batch size (batch size:
16). Finally, three evaluation indices (MAE, RMSE, and MAPE) were used to evaluate the
model. The results showed that the ANN model used five hidden layers, the number of
Epochs was 3000, and data input interval was one hour; in the LSTM model, it used seven
hidden layers, the number of Epochs was 2500, and the data input interval was 2 h; in the
GRU model, it used six hidden layers, the number of Epochs was 1500, and data input
interval was 3 h. The results of the hyperparameter adjustment are shown in Table 5.
Energies 2022, 15, x FOR PEER REVIEW 10 of 19
Table 3. The results of the RGB comparison on a sunny day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
Table 4. The results of the RGB comparison on a cloudy day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
4.2. Results of Adjust Hyperparameters
The adjustment of hyperparameters in deep learning has a specific impact on the
training and predictive results. Therefore, the authors first used 21 days of weather infor-
mation (starting from 24February) for training and testing including hidden units, train-
ing steps (Epochs), and input time intervals to find the most suitable hyperparameter com-
binations for the three models (ANN, LSTM, GRU). All data were sorted by the minute.
The fixed parameters were learning rate (learning rate: 1 × 103) and batch size (batch size:
16). Finally, three evaluation indices (MAE, RMSE, and MAPE) were used to evaluate the
model. The results showed that the ANN model used five hidden layers, the number of
Epochs was 3000, and data input interval was one hour; in the LSTM model, it used seven
hidden layers, the number of Epochs was 2500, and the data input interval was 2 h; in the
GRU model, it used six hidden layers, the number of Epochs was 1500, and data input
interval was 3 h. The results of the hyperparameter adjustment are shown in Table 5.
n=3
Energies 2022, 15, x FOR PEER REVIEW 10 of 19
Table 3. The results of the RGB comparison on a sunny day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
Table 4. The results of the RGB comparison on a cloudy day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
4.2. Results of Adjust Hyperparameters
The adjustment of hyperparameters in deep learning has a specific impact on the
training and predictive results. Therefore, the authors first used 21 days of weather infor-
mation (starting from 24February) for training and testing including hidden units, train-
ing steps (Epochs), and input time intervals to find the most suitable hyperparameter com-
binations for the three models (ANN, LSTM, GRU). All data were sorted by the minute.
The fixed parameters were learning rate (learning rate: 1 × 103) and batch size (batch size:
16). Finally, three evaluation indices (MAE, RMSE, and MAPE) were used to evaluate the
model. The results showed that the ANN model used five hidden layers, the number of
Epochs was 3000, and data input interval was one hour; in the LSTM model, it used seven
hidden layers, the number of Epochs was 2500, and the data input interval was 2 h; in the
GRU model, it used six hidden layers, the number of Epochs was 1500, and data input
interval was 3 h. The results of the hyperparameter adjustment are shown in Table 5.
Energies 2022, 15, x FOR PEER REVIEW 10 of 19
Table 3. The results of the RGB comparison on a sunny day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
Table 4. The results of the RGB comparison on a cloudy day.
Index Equations (2): TS = n × 30 Equations (5): TS = n × 90
n = 1
n = 2
n = 3
4.2. Results of Adjust Hyperparameters
The adjustment of hyperparameters in deep learning has a specific impact on the
training and predictive results. Therefore, the authors first used 21 days of weather infor-
mation (starting from 24February) for training and testing including hidden units, train-
ing steps (Epochs), and input time intervals to find the most suitable hyperparameter com-
binations for the three models (ANN, LSTM, GRU). All data were sorted by the minute.
The fixed parameters were learning rate (learning rate: 1 × 103) and batch size (batch size:
16). Finally, three evaluation indices (MAE, RMSE, and MAPE) were used to evaluate the
model. The results showed that the ANN model used five hidden layers, the number of
Epochs was 3000, and data input interval was one hour; in the LSTM model, it used seven
hidden layers, the number of Epochs was 2500, and the data input interval was 2 h; in the
GRU model, it used six hidden layers, the number of Epochs was 1500, and data input
interval was 3 h. The results of the hyperparameter adjustment are shown in Table 5.
4.2. Results of Adjust Hyperparameters
The adjustment of hyperparameters in deep learning has a specific impact on the train-
ing and predictive results. Therefore, the authors first used 21 days of weather information
(starting from 24 February) for training and testing including hidden units, training steps
(Epochs), and input time intervals to find the most suitable hyperparameter combinations
for the three models (ANN, LSTM, GRU). All data were sorted by the minute. The fixed
parameters were learning rate (learning rate: 1
×
10
3
) and batch size (batch size: 16).
Finally, three evaluation indices (MAE, RMSE, and MAPE) were used to evaluate the model.
The results showed that the ANN model used five hidden layers, the number of Epochs
was 3000, and data input interval was one hour; in the LSTM model, it used seven hidden
layers, the number of Epochs was 2500, and the data input interval was 2 h; in the GRU
model, it used six hidden layers, the number of Epochs was 1500, and data input interval
was 3 h. The results of the hyperparameter adjustment are shown in Table 5.
Table 5. The results of the adjustment of the hyperparameters.
Index Model
Adjust Parameters Fixed Parameters Evaluation
Input Time Layers Epochs Learning Rate Batch Size MAE RMSE MAPE
1 ANN 1 h 5 3000 1×10316 0.026 0.046 18.94%
2 LSTM 2 h 7 1500 1×10316 0.027 0.049 14.33%
3 GRU 3 h 6 2500 1×10316 0.015 0.030 10.54%
Energies 2022,15, 4779 10 of 17
4.3. Performance Comparison with Different Weather Features on Sunny and Cloudy Days
The four different weather feature combinations for short-term solar photovoltaic
forecast analysis are as follows. Case 1: Six weather features (wind speed and direction,
UVI, temperature, relative humidity, coverage rate); Case 2: Five weather features (without
coverage rate); Case 3: Five weather features (without UVI), and Case 4: Only coverage rate
and relatively humidity. These cases aim to study whether the cloud coverage proposed
in this paper can reduce the use of features and promote the accuracy of the PV power
generation forecasts.
4.3.1. Case 1: Results of Six Weather Features
In this case, there were six weather features of the dataset: wind speed and direction,
UVI, temperature, relative humidity, and coverage rate. Figure 5intuitively shows the PV
forecast results of the three models on sunny and mostly cloudy days. The best forecasting
model was ANN on sunny days, which is more consistent with the actual tracked solar
photovoltaic power generation. In addition, the GRU and LSTM yielded large predictive
deviations between 3 and 4 PM, respectively. The preliminary assessment may be the
interaction between the weather parameters. On the other hand, it can be found that the
predictive result of ANN varied widely on mostly cloudy days. In contrast, the prediction
trend of LSTM and GRU was better. There are two possible reasons for this difference. First,
the time series forecasting model was more adaptable in the large ramp up and ramp down
situation. Second, there were less data of cloudy days and poor weather. As a result, the
ANN model showed insufficient experience upon prediction.
Energies 2022, 15, x FOR PEER REVIEW 10 of 17
4.3. Performance Comparison with Different Weather Features on Sunny and Cloudy Days
The four different weather feature combinations for short-term solar photovoltaic
forecast analysis are as follows. Case 1: Six weather features (wind speed and direction,
UVI, temperature, relative humidity, coverage rate); Case 2: Five weather features (with-
out coverage rate); Case 3: Five weather features (without UVI), and Case 4: Only coverage
rate and relatively humidity. These cases aim to study whether the cloud coverage pro-
posed in this paper can reduce the use of features and promote the accuracy of the PV
power generation forecasts.
4.3.1. Case 1: Results of Six Weather Features
In this case, there were six weather features of the dataset: wind speed and direction,
UVI, temperature, relative humidity, and coverage rate. Figure 5 intuitively shows the PV
forecast results of the three models on sunny and mostly cloudy days. The best forecasting
model was ANN on sunny days, which is more consistent with the actual tracked solar
photovoltaic power generation. In addition, the GRU and LSTM yielded large predictive
deviations between 3 and 4 PM, respectively. The preliminary assessment may be the in-
teraction between the weather parameters. On the other hand, it can be found that the
predictive result of ANN varied widely on mostly cloudy days. In contrast, the prediction
trend of LSTM and GRU was better. There are two possible reasons for this difference.
First, the time series forecasting model was more adaptable in the large ramp up and ramp
down situation. Second, there were less data of cloudy days and poor weather. As a result,
the ANN model showed insufficient experience upon prediction.
(a) (b)
Figure 5. Case 1—(a) Forecasting results of 16 March 2021(sunny). (b) Forecasting results of 21 May
2021(mostly cloudy). The frequency of data collection was once per minute.
Further evaluation and analysis results are shown in Table 6. The result indicates that
there was a better prediction effect of ANN under sunny conditions, with a MAPE of
6.54%, MAE of 0.017, and RMSE of 0.029. In contrast, the prediction effect of GRU was
better under cloudy conditions with a MAPE of 10.25%, MAE of 0.039, and RMSE of 0.078.
Based on the average of the two-day forecast results, it was found that GRU performed
better, with a MAPE of 9.02%, MAE of 0.029, and RMSE of 0.055.
Figure 5.
Case 1—(
a
) Forecasting results of 16 March 2021 (sunny). (
b
) Forecasting results of
21 May 2021
(mostly cloudy). The frequency of data collection was once per minute.
Further evaluation and analysis results are shown in Table 6. The result indicates that
there was a better prediction effect of ANN under sunny conditions, with a MAPE of 6.54%,
MAE of 0.017, and RMSE of 0.029. In contrast, the prediction effect of GRU was better
under cloudy conditions with a MAPE of 10.25%, MAE of 0.039, and RMSE of 0.078. Based
on the average of the two-day forecast results, it was found that GRU performed better,
with a MAPE of 9.02%, MAE of 0.029, and RMSE of 0.055.
Energies 2022,15, 4779 11 of 17
Table 6. Case 1. The results of the six weather features.
Index Model
Six Weather Values
Sunny Cloudy Average
MAPE% MAE RMSE MAPE% MAE RMSE MAPE% MAE RMSE
1 ANN 6.544 0.017 0.029 11.763 0.052 0.115 9.154 0.035 0.072
2 LSTM 8.869 0.024 0.046 12.439 0.046 0.088 10.654 0.035 0.067
3 GRU 7.779 0.019 0.032 10.253 0.039 0.078 9.016 0.029 0.055
4.3.2. Case 2: The Results of Five Weather Features (without Coverage Rate)
In this case, there were five weather features of the dataset: wind speed and direction,
UVI, temperature, relative humidity. The predictive results are shown in Figure 6. In the
sunny condition, it can be found that the solar photovoltaic power curve predicted by ANN
significantly deviated from the actual one due to a lack of the cloud coverage rate. The
deviation occurred in the interval of 9 to 10 AM and 2 to 3 PM. Furthermore, the GRU
model also revealed a significant forecast departure at 1 PM. However, the prediction trend
of LSTM was close to the actual PV power curve. Under mostly cloudy conditions, the
forecast trends of the ANN and GRU models deviated significantly from the actual curve,
especially between 1:30 to 2:30 PM. In contrast, the predictive results of the LSTM model
were only slightly different from Case 1.
Energies 2022, 15, x FOR PEER REVIEW 11 of 17
Table 6. Case 1. The results of the six weather features.
Index Model
Six Weather Values
Sunny Cloudy Average
MAPE% MAE RMSE MAPE% MAE RMSE MAPE% MAE RMSE
1 ANN 6.544 0.017 0.029 11.763 0.052 0.115 9.154 0.035 0.072
2 LSTM 8.869 0.024 0.046 12.439 0.046 0.088 10.654 0.035 0.067
3 GRU 7.779 0.019 0.032 10.253 0.039 0.078 9.016 0.029 0.055
4.3.2. Case 2: The Results of Five Weather Features (without Coverage Rate)
In this case, there were five weather features of the dataset: wind speed and direction,
UVI, temperature, relative humidity. The predictive results are shown in Figure 6. In the
sunny condition, it can be found that the solar photovoltaic power curve predicted by
ANN significantly deviated from the actual one due to a lack of the cloud coverage rate.
The deviation occurred in the interval of 9 to 10 AM and 2 to 3 PM. Furthermore, the GRU
model also revealed a significant forecast departure at 1 PM. However, the prediction
trend of LSTM was close to the actual PV power curve. Under mostly cloudy conditions,
the forecast trends of the ANN and GRU models deviated significantly from the actual
curve, especially between 1:30 to 2:30 PM. In contrast, the predictive results of the LSTM
model were only slightly different from Case 1.
(a) (b)
Figure 6. Case 2—(a) Forecasting results of 16 March 2021 (sunny). (b) Forecasting results of 21 May
2021 (mostly cloudy). The frequency of data collection was once per minute.
Table 7 describes the forecasting results of the three models on sunny and mostly
cloudy days. In the sunny day, the ANN still outperformed other models where MAPE
was about 7.85%, MAE was 0.019, and RMSE was 0.032. Under the mostly cloudy condi-
tion, the accuracy of the LSTM model was about 2 to 3% higher than the others.
Table 7. Case 2. The results of five weather values (without coverage rate).
Index Model
Five Weather Value (without Coverage Rate)
Sunny Cloudy Average
MAPE% MAE RMSE MAPE% MAE RMSE MAPE% MAE RMSE
1 ANN 7.857 0.019 0.032 16.251 0.071 0.135 12.054 0.045 0.084
2 LSTM 7.233 0.020 0.035 12.861 0.049 0.101 10.047 0.035 0.068
Figure 6.
Case 2—(
a
) Forecasting results of 16 March 2021 (sunny). (
b
) Forecasting results of
21 May 2021
(mostly cloudy). The frequency of data collection was once per minute.
Table 7describes the forecasting results of the three models on sunny and mostly
cloudy days. In the sunny day, the ANN still outperformed other models where MAPE was
about 7.85%, MAE was 0.019, and RMSE was 0.032. Under the mostly cloudy condition,
the accuracy of the LSTM model was about 2 to 3% higher than the others.
Table 7. Case 2. The results of five weather values (without coverage rate).
Index Model
Five Weather Value (without Coverage Rate)
Sunny Cloudy Average
MAPE% MAE RMSE MAPE% MAE RMSE MAPE% MAE RMSE
1 ANN 7.857 0.019 0.032 16.251 0.071 0.135 12.054 0.045 0.084
2 LSTM 7.233 0.020 0.035 12.861 0.049 0.101 10.047 0.035 0.068
3 GRU 7.628 0.022 0.039 15.953 0.0614 0.122 11.791 0.042 0.081
Energies 2022,15, 4779 12 of 17
4.3.3. Case 3: Results of Five Weather Feature (without UVI)
Five weather features except for UVI were used in this case. The results of the predic-
tion are shown in Figure 7. Compared with the actual curve, the prediction trend of the
three models was roughly the same when the UVI was removed under sunny conditions,
which was quite different from those of removing the coverage rate. Therefore, the coverage
rate was the dominant feature that influences the predictive performance in the sunny
condition. On the other hand, under a mostly cloudy condition, the performance of the
three models was not significantly different from that in Case 2.
Energies 2022, 15, x FOR PEER REVIEW 12 of 17
3 GRU 7.628 0.022 0.039 15.953 0.0614 0.122 11.791 0.042 0.081
4.3.3. Case 3: Results of Five Weather Feature (without UVI)
Five weather features except for UVI were used in this case. The results of the pre-
diction are shown in Figure 7. Compared with the actual curve, the prediction trend of the
three models was roughly the same when the UVI was removed under sunny conditions,
which was quite different from those of removing the coverage rate. Therefore, the cover-
age rate was the dominant feature that influences the predictive performance in the sunny
condition. On the other hand, under a mostly cloudy condition, the performance of the
three models was not significantly different from that in Case 2.
The further evaluation and analysis results are summarized in Table 8. The forecast
evaluation indices were significantly improved when the coverage rate was included, and
this is applicable even without the UVI feature. Based on the average forecasting results
of the different weather conditions, the overall forecasting effect of ANN was better than
the other models, with a MAPE of 7.7%, a MAE of 0.025, and an RMSE of 0.058, respec-
tively.
(a) (b)
Figure 7. Case 3—(a) Forecasting results of 16 March 2021 (sunny). (b) Forecasting results of 21 May
2021 (mostly cloudy). The frequency of data collection was once per minute.
Table 8. Case 3. The results of five weather values (without UVI).
Index Model
Five Weather Values (without UVI)
Sunny Cloudy Average
MAPE% MAE RMSE MAPE% MAE RMSE MAPE%% MAE RMSE
1 ANN 4.001 0.009 0.019 11.492 0.041 0.096 7.747 0.025 0.058
2 LSTM 6.023 0.019 0.033 15.749 0.057 0.114 10.886 0.038 0.074
3 GRU 8.799 0.021 0.036 15.902 0.587 0.112 12.351 0.304 0.074
4.3.4. Case 4: The Results of Only Coverage Rate and Relative Humidity
In this case, only the relative humidity and coverage rate features of the dataset were
used in the prediction. Figure 8 shows the forecast result of the three models subjected to
sunny and mostly cloudy weather. In the sunny condition, either ANN, LSTM, or GRU
all showed good predictive ability, in particular, the prediction curve of ANN almost co-
incided with the trajectory of the real PV power curve. The prediction curves of ANN and
GRU were close to the actual curve between 1:30 PM to 2:10 PM.
Figure 7.
Case 3—(
a
) Forecasting results of 16 March 2021 (sunny). (
b
) Forecasting results of
21 May 2021
(mostly cloudy). The frequency of data collection was once per minute.
The further evaluation and analysis results are summarized in Table 8. The forecast
evaluation indices were significantly improved when the coverage rate was included, and
this is applicable even without the UVI feature. Based on the average forecasting results of
the different weather conditions, the overall forecasting effect of ANN was better than the
other models, with a MAPE of 7.7%, a MAE of 0.025, and an RMSE of 0.058, respectively.
Table 8. Case 3. The results of five weather values (without UVI).
Index Model
Five Weather Values (without UVI)
Sunny Cloudy Average
MAPE% MAE RMSE MAPE% MAE RMSE MAPE%% MAE RMSE
1 ANN 4.001 0.009 0.019 11.492 0.041 0.096 7.747 0.025 0.058
2 LSTM 6.023 0.019 0.033 15.749 0.057 0.114 10.886 0.038 0.074
3 GRU 8.799 0.021 0.036 15.902 0.587 0.112 12.351 0.304 0.074
4.3.4. Case 4: The Results of Only Coverage Rate and Relative Humidity
In this case, only the relative humidity and coverage rate features of the dataset were
used in the prediction. Figure 8shows the forecast result of the three models subjected to
sunny and mostly cloudy weather. In the sunny condition, either ANN, LSTM, or GRU all
showed good predictive ability, in particular, the prediction curve of ANN almost coincided
with the trajectory of the real PV power curve. The prediction curves of ANN and GRU
were close to the actual curve between 1:30 PM to 2:10 PM.
Energies 2022,15, 4779 13 of 17
Energies 2022, 15, x FOR PEER REVIEW 13 of 17
Table 9 summarizes the further analysis results of three models in the sunny and
mostly cloudy conditions. In the sunny condition, the forecast result of ANN still pre-
vailed over the LSTM and GRU model. The MAPE, MAE, and RMSE were 3.93%, 0.008,
and 0.015, respectively.
(a) (b)
Figure 8. Case 4—(a) Forecasting results of 16 March 2021 (sunny). (b) Forecasting results of 21 May
2021 (mostly cloudy). The frequency of data collection was once per minute.
Table 9. Case 4. The results of the only coverage rate and relatively humidity.
Index Model
Only Coverage Rate and Relatively Humidity
Sunny Cloudy Average
MAPE% MAE RMSE MAPE% MAE RMSE MAPE% MAE RMSE
1 ANN 3.933 0.008 0.015 10.342 0.036 0.069 7.138 0.022 0.042
2 LSTM 7.195 0.021 0.035 15.377 0.069 0.129 11.286 0.045 0.082
3 GRU 7.130 0.021 0.036 13.939 0.046 0.087 10.535 0.034 0.062
In summary of the aforementioned four forecasting results, it can be concluded that
the ANN model is more accurate in terms of the short-term PV forecasting in the sunny
condition. However, the advantage of ANN gradually decreased in the mostly cloudy
condition. From another point of view, to balance the predictive ability of the sunny and
mostly cloudy, the average MAPE change in different cases is shown in Figure 9. The
statistical results show that in contrast to other cases, the average change in the MAPE of
case 1 was smaller. Therefore, Case 1 (using six weather features) can adapt better when
subjected to the different weather conditions upon PV prediction.
Furthermore, Case 1 showed that the average degree of change in the MAPE of ANN
was about 6%, which was significantly higher than that of LSTM and GRU, while the av-
erage degree of change in MAPE in the GRU model was about 2%. Thus, when the pre-
diction time is lengthened, more weather conditions will accumulate during photovoltaic
power forecasting, so the prediction effect of the GRU model may have better adaptability
than the LSTM and ANN models. This can be subsequently seen from the analysis results
of one-week forecasting.
Figure 8.
Case 4—(
a
) Forecasting results of
16 March 2021
(sunny). (
b
) Forecasting results of
21 May 2021
(mostly cloudy). The frequency of data collection was once per minute.
Table 9summarizes the further analysis results of three models in the sunny and
mostly cloudy conditions. In the sunny condition, the forecast result of ANN still prevailed
over the LSTM and GRU model. The MAPE, MAE, and RMSE were 3.93%, 0.008, and 0.015,
respectively.
Table 9. Case 4. The results of the only coverage rate and relatively humidity.
Index Model
Only Coverage Rate and Relatively Humidity
Sunny Cloudy Average
MAPE% MAE RMSE MAPE% MAE RMSE MAPE% MAE RMSE
1 ANN 3.933 0.008 0.015 10.342 0.036 0.069 7.138 0.022 0.042
2 LSTM 7.195 0.021 0.035 15.377 0.069 0.129 11.286 0.045 0.082
3 GRU 7.130 0.021 0.036 13.939 0.046 0.087 10.535 0.034 0.062
In summary of the aforementioned four forecasting results, it can be concluded that
the ANN model is more accurate in terms of the short-term PV forecasting in the sunny
condition. However, the advantage of ANN gradually decreased in the mostly cloudy
condition. From another point of view, to balance the predictive ability of the sunny and
mostly cloudy, the average MAPE change in different cases is shown in Figure 9. The
statistical results show that in contrast to other cases, the average change in the MAPE of
case 1 was smaller. Therefore, Case 1 (using six weather features) can adapt better when
subjected to the different weather conditions upon PV prediction.
Furthermore, Case 1 showed that the average degree of change in the MAPE of ANN
was about 6%, which was significantly higher than that of LSTM and GRU, while the aver-
age degree of change in MAPE in the GRU model was about 2%. Thus, when the prediction
time is lengthened, more weather conditions will accumulate during photovoltaic power
forecasting, so the prediction effect of the GRU model may have better adaptability than
the LSTM and ANN models. This can be subsequently seen from the analysis results of
one-week forecasting.
Energies 2022,15, 4779 14 of 17
Energies 2022, 15, x FOR PEER REVIEW 14 of 17
Figure 9. The comprehensive analysis and average degree of the MAPE change with different
weather feature combinations.
4.4. Performance Comparison with Different Weather Features in One Week
The forecast performance evaluation of the three deep learning models with four
cases for one week (from 7 June to 13 June 2021 including sunny, mostly clear, cloudy,
and short showers weather) is shown in Figure 10 and Table 10. The results showed that
using the six weather features yielded better forecasting accuracy. Among them, 7 June
and 8 June were days of short showers. In contrast with the sunny day, adding the cloud
coverage rate to the forecast for the PV power showed that the effect was not significant.
The main reason for the poor prediction results on rainy days is that the raindrop attaches
to the sky imager, which will cause misjudgment when the equation calculates each pixel,
resulting in more errors when calculating the coverage rate. This is a shortcoming of the
RGB method in calculating the coverage rate. As addressed in the previous section, upon
longer prediction period and under more weather conditions, the prediction effect of the
GRU model showed better adaptability than that of the LSTM and ANN models. How-
ever, the corresponding MAPE was about 16.9%, the MAE was 0.048, and the RMSE was
0.098. On the other hand, from the comparison results, the proposed coverage rate as a
weather feature can improve the accuracy by about 2% when compared with the case
without coverage rate, meaning that the coverage rate is an imperative feature for short-
term PV forecasting.
(a)
Figure 9.
The comprehensive analysis and average degree of the MAPE change with different weather
feature combinations.
4.4. Performance Comparison with Different Weather Features in One Week
The forecast performance evaluation of the three deep learning models with four cases
for one week (from 7 June to 13 June 2021 including sunny, mostly clear, cloudy, and short
showers weather) is shown in Figure 10 and Table 10. The results showed that using the six
weather features yielded better forecasting accuracy. Among them, 7 June and 8 June were
days of short showers. In contrast with the sunny day, adding the cloud coverage rate to
the forecast for the PV power showed that the effect was not significant. The main reason
for the poor prediction results on rainy days is that the raindrop attaches to the sky imager,
which will cause misjudgment when the equation calculates each pixel, resulting in more
errors when calculating the coverage rate. This is a shortcoming of the RGB method in
calculating the coverage rate. As addressed in the previous section, upon longer prediction
period and under more weather conditions, the prediction effect of the GRU model showed
better adaptability than that of the LSTM and ANN models. However, the corresponding
MAPE was about 16.9%, the MAE was 0.048, and the RMSE was 0.098. On the other hand,
from the comparison results, the proposed coverage rate as a weather feature can improve
the accuracy by about 2% when compared with the case without coverage rate, meaning
that the coverage rate is an imperative feature for short-term PV forecasting.
Energies 2022, 15, x FOR PEER REVIEW 14 of 17
Figure 9. The comprehensive analysis and average degree of the MAPE change with different
weather feature combinations.
4.4. Performance Comparison with Different Weather Features in One Week
The forecast performance evaluation of the three deep learning models with four
cases for one week (from 7 June to 13 June 2021 including sunny, mostly clear, cloudy,
and short showers weather) is shown in Figure 10 and Table 10. The results showed that
using the six weather features yielded better forecasting accuracy. Among them, 7 June
and 8 June were days of short showers. In contrast with the sunny day, adding the cloud
coverage rate to the forecast for the PV power showed that the effect was not significant.
The main reason for the poor prediction results on rainy days is that the raindrop attaches
to the sky imager, which will cause misjudgment when the equation calculates each pixel,
resulting in more errors when calculating the coverage rate. This is a shortcoming of the
RGB method in calculating the coverage rate. As addressed in the previous section, upon
longer prediction period and under more weather conditions, the prediction effect of the
GRU model showed better adaptability than that of the LSTM and ANN models. How-
ever, the corresponding MAPE was about 16.9%, the MAE was 0.048, and the RMSE was
0.098. On the other hand, from the comparison results, the proposed coverage rate as a
weather feature can improve the accuracy by about 2% when compared with the case
without coverage rate, meaning that the coverage rate is an imperative feature for short-
term PV forecasting.
(a)
Figure 10. Cont.
Energies 2022,15, 4779 15 of 17
Energies 2022, 15, x FOR PEER REVIEW 15 of 17
(b)
(c)
(d)
Figure 10. The forecasting results of the four cases from 7 June to 13 June 2021. The frequency of
data collection was once per minute. (a) One week forecasting results of Case 1 (six weather values).
Figure 10.
The forecasting results of the four cases from 7 June to 13 June 2021. The frequency of
data collection was once per minute. (
a
) One week forecasting results of Case 1 (six weather values).
(b) One
week forecasting results of Case 2 (five weather values (without coverage rate)). (
c
) One
week forecasting results of Case 3 (five weather values (without UVI)). (
d
) One week forecasting
results of Case 4 (only coverage rate and relatively humidity).
Energies 2022,15, 4779 16 of 17
Table 10. The results of the one week PV prediction evaluation.
Index Model
Case1 1Case2 2Case3 3Case4 4
MAPE
(%) MAE RMSE MAPE
(%) MAE RMSE MAPE
(%) MAE RMSE MAPE
(%) MAE RMSE
1 ANN 19.635 0.048 0.095 22.295 0.054 0.106 20.928 0.055 0.102 21.517 0.056 0.114
2 LSTM 16.969 0.048 0.098 18.856 0.054 0.109 24.536 0.058 0.107 20.745 0.061 0.119
3 GRU 16.679 0.049 0.098 18.565 0.529 0.105 17.962 0.057 0.106 18.731 0.056 0.106
1
Case 1: Six weather values;
2
Case 2: Five weather values (without coverage rate);
3
Case 3: Five weather values
(without UVI); 4Case 4: Only coverage rate and relatively humidity.
5. Conclusions
The accuracy of grasping cloud distribution has greatly helped short-term PV power
forecasting. The RGB calculation formula and limitation conditions as sky image analysis
are proposed in this paper. Furthermore, the authors defined the threshold for obtaining
the best coverage rate per minute and as the deep learning model training feature. The
novelty of this method is that the calculation is fast and simple. By comparing the one-week
forecast results of the four cases, it can be seen that considering all six weather features
yielded a complementary effect to improve the forecast accuracy. In particular, adding
the cloud coverage rate as a key feature for the PV power forecast increased up to a 2%
improvement in accuracy. On the other hand, the result shows that the application of
the cloud coverage rate with the deep learning models LSTM and GRU is beneficial to
the short-term PV power forecasting. In this regard, adding the cloud coverage rate as a
feature parameter to forecast the renewable energy power generation of the microgrid is
reliable and robust. In addition, the forecasting result of PV power can also be provided as
a reference for the EMS of the regional microgrid to power dispatch.
However, the forecasting result shows that this method is ineffective on rainy days,
mainly because the raindrop attaches to the sky imager; the RGB pixels cannot be distin-
guished. Therefore, the authors will introduce a hybrid model into a micro-grid energy
management system for further research. If the weather forecast is specifically a rainy day,
a hybrid model will be utilized instead of using the model run by the cloud coverage rate.
In contrast, other weather forecast results will be estimated with the model run by the
cloud coverage rate.
Author Contributions:
This paper is a collaborative work of all authors. Conceptualization, C.-H.C.
and C.-C.W.; methodology, W.-C.K. and S.-Y.C.; software, W.-C.K. and S.-Y.C.; validation, W.-C.K.,
C.-H.C. and S.-Y.C.; formal analysis, W.-C.K.; investigation, W.-C.K. and C.-H.C.; resources, C.-H.C.;
data curation, W.-C.K.; writing—original draft preparation, W.-C.K.; writing—review and editing,
C.-C.W.; supervision, C.-H.C.; project administration, C.-H.C., C.-C.W. and W.-C.K. All authors have
read and agreed to the published version of the manuscript.
Funding:
This research was funded by the Ministry of Science and Technology, Taiwan, R.O.C., grant
number MOST 110-2221-E-035-091 and MOST 110-2221-E-A49-079.
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Data Availability Statement: Not applicable.
Conflicts of Interest: The authors declare no conflict of interest.
References
1.
Shah, R.; Mithulananthan, N.; Bansal, R.C.; Ramachandaramurthy, V.K. A review of key power system stability challenges for
large-scale PV integration. Renew. Sustain. Energy Rev. 2015,41, 1423–1436. [CrossRef]
2.
Zhang, Y.; Beaudin, M.; Taheri, R.; Zareipour, H.; Wood, D. Day-ahead power output forecasting for small-scale solar photovoltaic
electricity generators. IEEE Trans. Smart Grid 2015,6, 2253–2262. [CrossRef]
Energies 2022,15, 4779 17 of 17
3.
Verma, T.; Tiwana, A.; Reddy, C.; Arora, V.; Devanand, P. Data Analysis to Generate Models Based on Neural Network and
Regression for Solar Power Generation Forecasting. In Proceedings of the 2016 7th International Conference on Intelligent
Systems, Modelling and Simulation (ISMS), Bangkok, Thailand, 25–27 January 2016; IEEE: New York, NY, USA, 2016; pp. 97–100.
4.
Alluhaidah, B.M.; Shehadeh, S.; El-Hawary, M. Most Influential Variables for Solar Radiation Forecasting Using Artificial Neural
Networks. In Proceedings of the 2014 IEEE Electrical Power and Energy Conference, Toronto, ON, Canada, 10–11 October 2014;
IEEE: New York, NY, USA, 2014; pp. 71–75.
5.
Nitisanon, S.; Hoonchareon, N. Solar Power Forecast with Weather Classification Using Self-Organized Map. In Proceedings of the
2017 IEEE Power & Energy Society General Meeting, Chicago, IL, USA, 16–20 July 2017; IEEE: New York, NY, USA, 2017; pp. 1–5.
6.
Agoua, X.G.; Girard, R.; Kariniotakis, G. Short-term spatio-temporal forecasting of photovoltaic power production. IEEE Trans.
Sustain. Energy 2017,9, 538–546. [CrossRef]
7.
Suksamosorn, S.; Hoonchareon, N.; Songsiri, J. Influential Variable Selection for Improving Solar Forecasts from Numerical
Weather Prediction. In Proceedings of the 2018 15th International Conference on Electrical Engineering/Electronics, Computer,
Telecommunications and Information Technology (ECTI-CON), Chiang Rai, Thailand, 18–21 July 2018; IEEE: New York, NY, USA,
2018; pp. 333–336.
8.
Ghonima, M.; Urquhart, B.; Chow, C.; Shields, J.; Cazorla, A.; Kleissl, J. A method for cloud detection and opacity classification
based on ground based sky imagery. Atmos. Meas. Tech. 2012,5, 2881–2892. [CrossRef]
9.
Shields, J.E.; Karr, M.E.; Burden, A.R.; Johnson, R.W.; Mikuls, V.W.; Streeter, J.R.; Hodgkiss, W.S. Research toward Multi-Site
Characterization of Sky Obscuration by Clouds; Scripps Institution of Oceanography La Jolla Ca Marine Physical Lab: La Jolla, CA,
USA, 2009.
10.
Ghanbarzadeh, A.; Noghrehabadi, A.; Assareh, E.; Behrang, M. Solar Radiation Forecasting Based on Meteorological Data Using
Artificial Neural Networks. In Proceedings of the 2009 7th IEEE International Conference on Industrial Informatics, Cardiff, UK,
23–36 June 2009; IEEE: New York, NY, USA, 2009; pp. 227–231.
11. Pawar, P.; Cortés, C.; Murray, K.; Kleissl, J. Detecting clear sky images. Sol. Energy 2019,183, 50–56. [CrossRef]
12.
Zhao, X.; Wei, H.; Wang, H.; Zhu, T.; Zhang, K. 3D-CNN-based feature extraction of ground-based cloud images for direct normal
irradiance prediction. Sol. Energy 2019,181, 510–518. [CrossRef]
13.
Mammoli, A.; Terren-Serrano, G.; Menicucci, A.; Caudell, T.P.; Martínez-Ramón, M. An experimental method to merge far-field
images from multiple longwave infrared sensors for short-term solar forecasting. Sol. Energy 2019,187, 254–260. [CrossRef]
14.
Kamadinata, J.O.; Ken, T.L.; Suwa, T. Sky image-based solar irradiance prediction methodologies using artificial neural networks.
Renew. Energy 2019,134, 837–845. [CrossRef]
15.
Quesada-Ruiz, S.; Chu, Y.; Tovar-Pescador, J.; Pedro, H.; Coimbra, C. Cloud-tracking methodology for intra-hour DNI forecasting.
Sol. Energy 2014,102, 267–275. [CrossRef]
16. Zhang, J.; Verschae, R.; Nobuhara, S.; Lalonde, J.-F. Deep photovoltaic nowcasting. Sol. Energy 2018,176, 267–276. [CrossRef]
17.
Kong, W.; Jia, Y.; Dong, Z.Y.; Meng, K.; Chai, S. Hybrid approaches based on deep whole-sky-image learning to photovoltaic
generation forecasting. Appl. Energy 2020,280, 115875. [CrossRef]
18.
Wang, F.; Zhang, Z.; Chai, H.; Yu, Y.; Lu, X.; Wang, T.; Lin, Y. Deep Learning Based Irradiance Mapping Model for Solar PV Power
Forecasting Using Sky Image. In Proceedings of the 2019 IEEE Industry Applications Society Annual Meeting, Baltimore, MD,
USA, 29 September–3 October 2019; IEEE: New York, NY, USA, 2019; pp. 1–9.
19.
Zhen, Z.; Liu, J.; Zhang, Z.; Wang, F.; Chai, H.; Yu, Y.; Lu, X.; Wang, T.; Lin, Y. Deep learning based surface irradiance mapping
model for solar PV power forecasting using sky image. IEEE Trans. Ind. Appl. 2020,56, 3385–3396. [CrossRef]
20.
Wen, H.; Du, Y.; Chen, X.; Lim, E.; Wen, H.; Jiang, L.; Xiang, W. Deep learning based multistep solar forecasting for PV ramp-rate
control using sky images. IEEE Trans. Ind. Inform. 2020,17, 1397–1406. [CrossRef]
21.
Srivastava, N.; Hinton, G.; Krizhevsky, A.; Sutskever, I.; Salakhutdinov, R. Dropout: A simple way to prevent neural networks
from overfitting. J. Mach. Learn. Res. 2014,15, 1929–1958.
22.
Greff, K.; Srivastava, R.K.; Koutník, J.; Steunebrink, B.R.; Schmidhuber, J. LSTM: A search space odyssey. IEEE Trans. Neural Netw.
Learn. Syst. 2016,28, 2222–2232. [CrossRef] [PubMed]
23.
Cho, K.; Van Merriënboer, B.; Gulcehre, C.; Bahdanau, D.; Bougares, F.; Schwenk, H.; Bengio, Y. Learning phrase representations
using RNN encoder-decoder for statistical machine translation. arXiv 2014, arXiv:1406.1078.
24. Chen, Z.; Yang, Y. Assessing forecast accuracy measures. Prepr. Ser. 2004,2010, 2004–2010.
... Traditional forecasting approaches, which primarily leverage historical data and meteorological predictions, often struggle to maintain accuracy under dynamic and complex weather conditions. In recent years, sky imagers have gained prominence as effective tools for monitoring cloud cover, offering valuable insights for solar energy forecasting [4][5][6][7]. However, the application of sky imager data in PV power forecasting still encounters several significant limitations. ...
Article
Full-text available
Accurate photovoltaic (PV) power forecasting is crucial for stable grid integration, particularly under rapidly changing weather conditions. This study presents an ultra-short-term forecasting model that integrates sky imager data and meteorological radar data, achieving significant improvements in forecasting accuracy. By dynamically tracking cloud movement and estimating cloud coverage, the model enhances performance under both clear and cloudy conditions. Over an 8-day evaluation period, the average forecasting accuracy improved from 67.26% to 77.47% (+15%), with MSE reduced by 39.2% (from 481.5 to 292.6), R2 increased from 0.724 to 0.855, NSE improved from 0.725 to 0.851, and Theil’s U reduced from 0.42 to 0.32. Notable improvements were observed during abrupt weather transitions, particularly on 1 July and 3 July, where the combination of MCR and sky imager data demonstrated superior adaptability. This integrated approach provides a robust foundation for advancing ultra-short-term PV power forecasting.
... This procedure is repeated until the loss reaches its minimum value. Since the process of backward propagation of Conv1D-LSTM requires a lengthy explanation, we recommend reading [66][67][68]. The final output of the detector is a set of prediction packets F f uture det based on (3); F s is the sequence of output h of the model: ...
Article
Full-text available
In modern network security setups, Intrusion Detection Systems (IDS) are crucial elements that play a key role in protecting against unauthorized access, malicious actions, and policy breaches. Despite significant progress in IDS technology, two of the most major obstacles remain: how to avoid false alarms due to imbalanced data and accurately forecast the precise type of attacks before they even happen to minimize the damage caused. To deal with two problems in the most optimized way possible, we propose a two-task regression and classification strategy called Hybrid Regression–Classification (HRC), a deep learning-based strategy for developing an intrusion detection system (IDS) that can minimize the false alarm rate and detect and predict potential cyber-attacks before they occur to help the current wireless network in dealing with the attacks more efficiently and precisely. The experimental results show that our HRC strategy accurately predicts the incoming behavior of the IP data traffic in two different datasets. This can help the IDS to detect potential attacks sooner with high accuracy so that they can have enough reaction time to deal with the attack. Furthermore, our proposed strategy can also deal with imbalanced data. Even when the imbalance is large between categories. This will help significantly reduce the false alarm rate of IDS in practice. These strengths combined will benefit the IDS by making it more active in defense and help deal with the intrusion detection problem more effectively.
... Compared to satellite imagery, it boasts superior temporal and spatial resolution, allowing for minute-by-minute calculations and forecasts of photovoltaic power output. For instance, features extracted from ground-based cloud imagery, such as cloud-coverage [12] and cloud-motion trends [13,14], can serve as inputs for predictive models. However, there has been no prior research in the existing research focused on using ground-based cloud imagery for forecasting PV power generation during hazy conditions. ...
Article
Full-text available
Haze constitutes a pivotal meteorological variable with notable implications for photovoltaic power forecasting. The presence of haze is anticipated to lead to a reduction in the output power of photovoltaic plants. Therefore, achieving precise forecasts of photovoltaic power in hazy conditions holds paramount significance. This study introduces a novel approach to forecasting photovoltaic power under haze conditions, leveraging ground-based cloud images. Firstly, the aerosol scattering coefficient is introduced as a pivotal parameter for characterizing photovoltaic power fluctuations influenced by haze. Additionally, other features, such as sky cloud cover, color attributes, light intensity, and texture characteristics, are considered. Subsequently, the Spearman correlation coefficient is applied to calculate the correlation between feature sequences and photovoltaic power. Effective features are then selected as inputs and three models—LSTM, SVM, and XGBoost—are employed for training and performance analysis. After comparing with existing technologies, the predicted results have achieved the best performance. Finally, using actual data, the effectiveness of the aerosol scattering coefficient is confirmed, by exhibiting the highest correlation index, as a pivotal parameter for forecasting photovoltaic output under the influence of haze. The results demonstrate that the aerosol scattering coefficient enhances the forecast accuracy of photovoltaic power in both heavy and light haze conditions by 1.083% and 0.599%, respectively, while exerting minimal influence on clear days. Upon comprehensive evaluation, it is evident that the proposed forecasting method in this study offers substantial advantages for accurately predicting photovoltaic power output in hazy weather scenarios.
Article
Full-text available
Solar energy is largely dependent on weather conditions, resulting in unpredictable, fluctuating, and unstable photovoltaic (PV) power outputs. Thus, accurate PV power forecasts are increasingly crucial for managing and controlling integrated energy systems. Over the years, advanced artificial neural network (ANN) models have been proposed to increase the accuracy of PV power forecasts for various geographical regions. Hence, this paper provides a state-of-the-art review of the five most popular and advanced ANN models for PV power forecasting. These include multilayer perceptron (MLP), recurrent neural network (RNN), long short-term memory (LSTM), gated recurrent unit (GRU), and convolutional neural network (CNN). First, the internal structure and operation of these models are studied. It is then followed by a brief discussion of the main factors affecting their forecasting accuracy, including forecasting horizons, meteorological conditions, and evaluation metrics. Next, an in-depth and separate analysis of standalone and hybrid models is provided. It has been determined that bidirectional GRU and LSTM, whether used as a standalone model or in a hybrid configuration, offer greater forecasting accuracy. Furthermore, hybrid and upgraded metaheuristic algorithms have demonstrated exceptional performance when applied to standalone and hybrid ANN models. Finally, this study discusses various limitations and shortcomings that may influence the practical implementation of PV power forecasting.
Article
Recent advancements in data science and artificial intelligence, as well as the development of clean and sustainable energy sources, have created numerous opportunities for energy researchers to conduct their studies. Artificial Intelligence (AI) and Machine Learning (ML) techniques have been applied to Renewable Energy Systems (RES) for several years, and their intensity and scope have grown in recent years. Since artificial intelligence and machine learning have a wide range of applications, it may be difficult to select and implement suitable methods for future research. In order to address this issue, this study examines several of the most popular and well-known AI techniques in renewable energy. This paper describes over ten of the most prevalent RES modeling and optimization algorithms, including Artificial Neural Networks (ANN), Long and Short-Term Memory (LSTM), Recurrent and Convolutional Neural Networks (RNNs and CNNs), the Genetic Algorithm (GA), and the Particle Swarm Optimization algorithm (PSO). More than a hundred different studies between 2020 and 2022 have been compiled and organized in the results section according to the method used and the field of application. At the end, the results are discussed, and the limitations of the current study will be mentioned, followed by some suggestions to complete our work and make it more useful.
Article
Full-text available
Solar forecasting is becoming increasingly important due to the exponential growth in total global solar capacity each year. More photovoltaic (PV) penetration in the grid poses problems for grid stability due to the inherent intermittent and variable nature of PV power production. Therefore, forecasting of solar quantities becomes increasingly important to grid operators and market participants. This review presents the most recent relevant studies focusing on short-term forecasting of solar irradiance and PV power production. Recent research has increasingly turned to machine learning to address this challenge. The paper provides a discussion about building a solar forecasting model, including evaluation measures and machine learning method selection through analysed literature. Given that machine learning is data-driven, the focus of this review has been placed on data sources referenced in the literature. Open-access data sources have been compiled and explored. The main contribution of this paper is the establishment of a benchmark for assessing the performance of solar forecasting models. This benchmark utilizes the mentioned open-source datasets, offering a standardized platform for future research. It serves the crucial purpose of streamlining investigations and facilitating direct comparisons among different forecasting methodologies in the field of solar forecasting.
Article
Full-text available
Solar forecasting is one of the most promising approaches to address the intermit PV power generation by providing predictions before upcoming ramp events. In this paper, a novel multi-step forecasting (MSF) scheme is proposed for PV power ramp-rate control (PRRC). This method utilizes an ensemble of deep ConvNets without additional time-series models and exogenous variables, thus more suitable for industrial applications. The MSF strategy can make multiple predictions in comparison with a single forecasting point produced by a conventional method while maintaining the same high temporal resolution. Besides, stacked sky images that integrate temporal-spatial (ST) information of cloud motions are used to further improve the forecasting performance. The results demonstrate a favorable forecasting accuracy in comparison to the existing forecasting models with the highest skill score of 17.7%. In the PRRC application, the MSF-based PRRC can detect more ramp-rates violations with a higher control rate of 98.9% compared with the conventional forecasting based control. Thus, the PV generation can be effectively smoothed with less energy curtailment on both clear and cloudy days using the proposed approach.
Article
Full-text available
Many solar forecast algorithms based on ground based sky imagery apply the red-blue ratio (RBR) method to classify image pixels as clear or cloudy, by comparing the current image with the corresponding image from a clear sky library (CSL). The CSL needs to be updated regularly due to changes in clear sky readings over time caused by aerosols and imager dome properties. This clear sky library is typically created by visually scrutinizing daily sky videos and selecting appropriate clear sky periods. This practice takes a significant amount of time and manual intervention can result in human errors. To avoid this, an automated CSL algorithm (ACSL) was developed which filters each image for clear sky features including maximum green pixel brightness, average RBR, and red channel difference by pixel with respect to the previous image. The root mean square difference (RMSD) between the image RBR of the manually created CSL and the ACSL for November and April 2013 at UC San Diego were observed to be less than 6% over the full range of solar zenith angles. The ACSL was found to be more representative of clear conditions than its manual counterpart.
Article
With the ever-increased penetration of solar energy in the power grid, solar photovoltaic forecasting has become an indispensable aspect in maintaining power system stability and economic operation. At the operating stage, the forecasting accuracy of renewables has a direct influence on energy scheduling and dispatching. In this paper, we propose a series of novel approaches based on deep whole-sky-image learning architectures for very short-term solar photovoltaic generation forecasting, of which the lookahead windows concern the scales from 4 to 20 min. In particular, multiple deep learning models with the integration of both static sky image units and dynamic sky image stream are explicitly investigated. Extensive numerical studies on various models are carried out, through which the experimental results show that the proposed hybrid static image forecaster provides superior performance as compared to the benchmarking methods (i.e. the ones without sky images), with up to 8.3% improvement in general, and up to 32.8% improvement in the cases of ramp events. In addition, case studies at multiple time scales reveal that sky-image-based models can be more robust to the ramp events in solar photovoltaic generation.
Article
With the increase of solar photovoltaic (PV) permeability in power system, the impact of random fluctuation of PV power on the safe and stable operation of power grid becomes more and more serious. High-precision PV power forecasting can effectively promote the grid’s absorption of PV power generation. Cloud is the most important factor affecting the surface irradiance and PV power. For the ultra- short-term solar PV power forecast considering the influence of cloud movement, it is necessary to be able to obtain the surface irradiance according to the sky cloud observation data. Therefore, in order to accurately achieve the real-time mapping relationship between sky image and surface irradiance, a hybrid mapping model based on deep learning applied for solar PV power forecasting is proposed in this paper. First, the sky image data is clustered based on the feature extraction of convolutional autoencoder and k-means clustering algorithm after preprocess stage. Second, a hybrid mapping model based on deep learning methods are established for surface irradiance. Finally, the simulation results are compared and evaluated with different deep learning methods (CNN, LSTM and ANN). The results show that the proposed model in this paper has higher accuracy and can maintain robustness under different weather conditions.
Article
In a system used for short-term forecasting of solar irradiance, multiple longwave infrared sensors are used to acquire an image of a large, continuous section of the sky dome. The field of view of each sensor is directed at a particular, fixed portion of the sky dome, with some overlap between adjacent edges of each field of view to ensure that a continuous image can be acquired. Because of unavoidable imperfections in the optical components and in the alignment of the sensors, and because of the complex optics, it is difficult to pre-determine adjustable parameters in the geometric transformations required to merge the multiple images into a single one. Instead, it is possible to do this experimentally, by rotating the imaging sensor array via a two-axis rotating platform, using the sun itself as a convenient far-field reference object, and using the images collected to generate relations that map each sensor pixel into an altitude-azimuth direction, α and ϕ respectively. The experimental proof of concept of this method is described here.
Article
Cloud cover and cloud motion have a large impact on solar irradiance. One of the effective ways for direct normal irradiance (DNI) prediction is to use cloud features, which has been extensively studied. A Convolutional Neural Network (CNN) has the advantage of automatic features extraction by using strong computing capabilities. In this paper, a novel 3D-CNN method is proposed by processing multiple consecutive ground-based cloud (GBC) images in order to extract cloud features including texture and temporal information. The resulting features and the DNI data are then used to establish a DNI forecasting model. The experiments are carried out to evaluate the performance of the proposed forecasting method by using the data from January 1, 2013 to December 31, 2014. The experimental results show that the proposed method coupled with the multilayer perceptron (MLP) model achieves forecast skill of 17.06% for 10-minute ahead DNI prediction.
Article
Predicting the short-term power output of a photovoltaic panel is an important task for the efficient management of smart grids. Short-term forecasting at the minute scale, also known as nowcasting, can benefit from sky images captured by regular cameras and installed close to the solar panel. However, estimating the weather conditions from these images—sun intensity, cloud appearance and movement, etc.—is a very challenging task that the community has yet to solve with traditional computer vision techniques. In this work, we propose to learn the relationship between sky appearance and the future photovoltaic power output using deep learning. We train several variants of convolutional neural networks which take historical photovoltaic power values and sky images as input and estimate photovoltaic power in a very short term future. In particular, we compare three different architectures based on: a multi-layer perceptron (MLP), a convolutional neural network (CNN), and a long short term memory (LSTM) module. We evaluate our approach quantitatively on a dataset of photovoltaic power values and corresponding images gathered in Kyoto, Japan. Our experiments reveal that the MLP network, already used similarly in previous work, achieves an RMSE skill score of 7% over the commonly-used persistence baseline on the 1-min future photovoltaic power prediction task. Our CNN-based network improves upon this with a 12% skill score. In contrast, our LSTM-based model, which can learn the temporal dependencies in the data, achieves a 21% RMSE skill score, thus outperforming all other approaches.
Article
In order to decelerate global warming, it is important to promote renewable energy technologies. Solar energy, which is one of the most promising renewable energy sources, can be converted into electricity by using photovoltaic power generation systems. Whether the photovoltaic power generation systems are connected to an electrical grid or not, predicting near-future global solar radiation is useful to balance electricity supply and demand. In this work, two methodologies utilizing artificial neural networks (ANNs) to predict global horizontal irradiance in 1 to 5 minutes in advance from sky images are proposed. These methodologies do not require cloud detection techniques. Sky photo image data have been used to detect the clouds in the existing techniques, while color information at limited number of sampling points in the images are used in the proposed methodologies. The proposed methodologies are able to capture the trends of fluctuating solar irradiance with minor discrepancies. The minimum root mean square errors of 143 W/m², which are comparable with the existing prediction techniques, are achieved for both of the methodologies. At the same time, the proposed methodologies require much less image data to be handled compared to the existing techniques.