ArticlePublisher preview available

LSTM auto-encoder based representative scenario generation method for hybrid hydro-PV power system

Wiley
IET Generation, Transmission & Distribution
Authors:
To read the full-text of this research, you can request a copy directly from the authors.

Abstract and Figures

The increasing penetration of renewable energy sources causes complex uncertainties of the power system. To capture such uncertainties in power system planning, an important step is to generate representative scenarios. In this work, a long short term memory (LSTM) auto‐encoder based approach is proposed to generate representative scenarios in an integrated hydro‐photovoltaic (PV) power generation system, which consists of feature extraction by LSTM Encoder, scenario clustering in feature domain by combining gap statistics method and K‐means++, and representative scenario reconstruction by using LSTM Decoder. Compared with traditional scenario selection and generation methods, the proposed method can better capture the patterns of multivariate time‐series data in both temporal and spatial dimensions. A case study in southwest China is used to demonstrate the effectiveness of the proposed method, which outperforms other existing methods by achieving the lowest SSE and DBI indices of 0.89 and 0.12, respectively, and obtaining the best SIL and CHI scores of 0.93 and 2.30, respectively, In addition, the case study shows the proposed model setup works more stable for scenario generation.
This content is subject to copyright. Terms and conditions apply.
IET Generation, Transmission & Distribution
Special Issue: Advanced Data-Analytics for Power System Operation,
Control and Enhanced Situational Awareness
LSTM auto-encoder based representative
scenario generation method for hybrid hydro-
PV power system
ISSN 1751-8687
Received on 21st April 2020
Revised 23rd June 2020
Accepted on 14th July 2020
E-First on 4th August 2020
doi: 10.1049/iet-gtd.2020.0757
www.ietdl.org
Jingxian Yang1, Shuai Zhang1, Yue Xiang1,2 , Jichun Liu1, Junyong Liu1, Xiaoyan Han3, Fei Teng2
1College of Electrical Engineering, Sichuan University, Chengdu 610065, People's Republic of China
2Department of Electrical and Electronic Engineering, Imperial College London, London SW7 2AZ, UK
3State Grid Sichuan Electric Power Company, Chengdu 610041, People's Republic of China
E-mail: xiang@scu.edu.cn
Abstract: The increasing penetration of renewable energy sources causes complex uncertainties of the power system. To
capture such uncertainties in power system planning, an important step is to generate representative scenarios. In this work, a
long short term memory (LSTM) auto-encoder based approach is proposed to generate representative scenarios in an
integrated hydro-photovoltaic (PV) power generation system, which consists of feature extraction by LSTM Encoder, scenario
clustering in feature domain by combining gap statistics method and K-means++, and representative scenario reconstruction by
using LSTM Decoder. Compared with traditional scenario selection and generation methods, the proposed method can better
capture the patterns of multivariate time-series data in both temporal and spatial dimensions. A case study in southwest China
is used to demonstrate the effectiveness of the proposed method, which outperforms other existing methods by achieving the
lowest SSE and DBI indices of 0.89 and 0.12, respectively, and obtaining the best SIL and CHI scores of 0.93 and 2.30,
respectively, In addition, the case study shows the proposed model setup works more stable for scenario generation.
1 Introduction
Power generation using renewable energy sources gaining
momentum among power sectors in recent years owing to the fast
depletion of fossil fuels and consideration to reduce greenhouse
gases emissions [1]. Of all renewable sources, solar energy has
achieved significant market share and been recognised as a
competitive resource because of its simple construction process,
declining prices, and low operating costs [2]. As the ‘Renewable
Energy Statistic’ stated, by the end of 2019, the global solar
photovoltaic (PV) installation reached 580 GW, a 97.6 GW annual
increase and accounting for nearly 55% of newly increased
renewable energy capacity in 2019. However, the randomness and
intermittency nature of PV generation affects the security and
reliability of the power system. A promising approach is to operate
PV generation together with other renewable energy resources as
hybrid energy systems (HESs) to provide greater balance in energy
supply [3, 4]. The hybrid hydro-PV system is among the most
widely used HES as hydropower can be regulated rapidly to
complement solar energy [5]. Along with the successful
commissioning of the world's largest complementary hydro-PV
plant-Longyangxia hydro-PV plant (850 MW PV arrays + 1280 
MW hydropower units) in northwestern China in 2013,
increasing hydro-PV systems are being developed in regions that
are rich in both hydro and solar resources, such as west China [6–
8]. However, hydro-PV power generation may not completely
follow the local load demand, leading to a growing probability of
load shedding and reduced productivity of hybrid power plants. So
with the increase in penetration of PV resources, it is important for
system planner and operator to analyse and manage the uncertainty
of PV power generation to maintain the balance of integrated
hydro-PV power system and accommodate the expected load
increasing over the planning horizon [9–11].
The economic planning in power system involves the analysis
and evaluation of different scenarios and conditions. The multi-
scenario operating and planning models have been widely used to
capture system operating variations in power system planning [12–
14]. Solar radiation, hydropower and load may display daily or
seasonal patterns over longer time periods, although they follow a
stochastic process in the short term [15]. Therefore, generating
scenarios that characterise the uncertainty and reflect the periodic
variation of regional load, PV power generation and hydropower
generation is an essential step in the planning of hybrid hydro-PV
power system [16].
There are two widely used methods for scenario generation: (i)
probabilistic distribution based approach [17, 18]; and (ii) data
mining-based approach. The former approach mainly follows two
steps, which includes obtaining the probabilistic distribution of
power generation or errors and sampling the scenarios from the
statistical distribution. For example, Diaz et al. [19] established the
cumulative distribution function of wind power and generated
scenarios with Latin Hypercube Sampling method. The scenario
for wind units and cascaded hydro generation is generated with the
Monte Carlo method via the statistical distribution of forecast
errors in [20]. The wind speed scenarios are formulated with hourly
time series data based on the transition matrix approach of the
Markov Chain process [21]. However, large-scale power systems
usually involve different generating components and loads, it is
extremely challenging to develop a general probabilistic model that
captures both spatial–temporal and across-component correlations.
For the latter approach, clustering analysis, as an unsupervised data
mining technique, has been applied to efficiently reflect the
diversity of scenarios [22]. Baringo and Conejo [23] adopted K-
means clustering to characterise the scenarios of electric load and
wind production for investment decisions. Different clustering
techniques, such as Fuzzy C-Means, Gaussian mixture models and
Dynamic Time Warping (DTW), have been applied in daily
clearness index profiles to obtain typical scenarios of PV power
generation [24]. Hierarchical clustering and density-based
clustering are also adopted to cluster load profiles to recognise
different underline patterns [25, 26].
Traditional clustering algorithms face significant challenges
when applied to multivariate time sequence hydro/PV/load data: (i)
traditional clustering methods, which often cluster single variable,
are lack of effective uniform similarity measurement structure for
multivariate hydro/PV/load data; (ii) massive and high dimensional
data may cause complexity and heavy computational burden; (iii)
the spatial and temporal correlations between uncertain
hydropower, PV and load cannot be fully captured by traditional
clustering technologies [27, 28].
IET Gener. Transm. Distrib., 2020, Vol. 14 Iss. 24, pp. 5935-5943
© The Institution of Engineering and Technology 2020
5935
... However, the dimensionality of feature combinations may exceed 1000. Directly inputting such high-dimensional feature combinations into deep learning models for training, while effective, is often extremely time-consuming [28,29]. ...
Article
Full-text available
Recognition of partial discharge (PD) patterns is essential for insulation diagnosis of covered conductors in overhead lines. Current research has not sufficiently addressed the complex background noise in real environments, and most detection methods depend primarily on feature engineering or deep learning, suggesting potential for improvement in accuracy and efficiency. This has led the authors to propose a PD pattern recognition algorithm that integrates feature selection and deep learning. This algorithm incorporates the design of a discrete wavelet denoising function specifically tailored to the characteristics of PD for data preprocessing. It employs Bayesian optimization algorithms and light gradient boosting machines for characterizing corona discharge features. Furthermore, it develops multi‐scale clustering features and phase‐resolved features for feature fusion, and constructs insightful features based on the light gradient boosting machine. Finally, a novel deep learning model is formulated, demonstrating exceptional detection performance for early faults in covered conductors. Experimental results show that this algorithm attains an Matthews correlation coefficient score of 0.814, a 13.2% improvement over the baseline algorithm's 0.719, and a speed increase of 39.18%. The final accuracy amounts to 97.85%. This algorithm demonstrates exceptional performance in detecting early insulation faults in conductors.
Article
Full-text available
This paper reviews scenario generation techniques for modeling uncertainty in wind and photovoltaic (PV) power generation, a critical component as renewable energy integration into power systems grows. Scenario generation enables the simulation of variable power outputs under different weather conditions, serving as essential inputs for robust, stochastic, and distributionally robust optimization in system planning and operation. We categorize scenario generation methods into explicit and implicit approaches. Explicit methods rely on probabilistic assumptions and parameter estimation, which enable the interpretable yet parameterized modeling of power variability. Implicit methods, powered by deep learning models, offer data-driven scenario generation without predefined distributions, capturing complex temporal and spatial patterns in the renewable output. The review also addresses combined wind and PV power scenario generation, highlighting its importance for accurately reflecting correlated fluctuations in multi-site, interconnected systems. Finally, we address the limitations of scenario generation for wind and PV power integration planning and suggest future research directions.
Article
Full-text available
Featured Application This article developed an improved InfoGAN wind–solar–load extreme scenario generation approach that can inform power system evaluation in extreme scenarios. Abstract In recent years, extreme events have frequently occurred, and the extreme uncertainty of the source-demand side of high-ratio renewable energy systems poses a great challenge to the safe operation of power systems. Accurately generating extreme scenarios related to the source-demand side under a high percentage of new power systems is vital for the safe operation of power systems and the assessment of their reliability. However, at this stage, methods for extreme scenario generation that fully consider the correlation between wind power, solar power, and load are lacking. To address these problems, this paper proposes a method for extreme scenario generation based on information-maximizing generative adversarial networks (InfoGANs) for high-proportion renewable power systems. The example analysis shows that the method for extreme scenario generation proposed in this paper can fully explore the correlation between historical wind–solar–load data, greatly improve the accuracy with which extreme scenarios are generated, and provide effective theories and methodologies for the safe operation of a new type of power system.
Article
Full-text available
Businesses are experiencing an ever-growing problem of how to identify and guard in opposition to insider threats. Users with legal access to sensitive organizational data are positioned in a role of power that can be abused and could do harm to an enterprise. This can range from monetary and intellectual property theft to the destruction of assets and enterprise reputation. Traditional intrusion detection structures are neither designed nor able to figure out those who act maliciously inside a business enterprise. In this paper, we describe an automated system capable of detecting insider threats within an enterprise. We outline a tree-shape profiling technique that includes the information on activities conducted by each user and every task after which we use this to obtain a consistent representation of functions that provide a rich description of the user's behavior. The deviation may be assessed based on the amount of variance that each user exhibits across multiple attributes, compared in opposition to their peers. The primary function of User and Entity behavior Analysis(UEBA) is to track normal user behaviors. UEBA defines a baseline for each entity in the environment, and actions will be evaluated by comparing with pr-defined baselines.
Technical Report
Businesses are experiencing an ever-growing problem of how to identify and guard in opposition to insider threats. Users with legal access to sensitive organizational data are positioned in a role of power that can be abused and could do harm to an enterprise. This can range from monetary and intellectual property theft to the destruction of assets and enterprise reputation. Traditional intrusion detection structures are neither designed nor able to figure out those who act maliciously inside a business enterprise. In this paper, we describe an automated system capable of detecting insider threats within an enterprise. We outline a tree-shape profiling technique that includes the information on activities conducted by each user and every task after which we use this to obtain a consistent representation of functions that provide a rich description of the user's behavior. The deviation may be assessed based on the amount of variance that each user exhibits across multiple attributes, compared in opposition to their peers. The primary function of User and Entity behavior Analysis(UEBA) is to track normal user behaviors. UEBA defines a baseline for each entity in the environment, and actions will be evaluated by comparing with pr-defined baselines.
Article
Full-text available
Hybrid systems composed of solar photovoltaic (PV) and battery storage units are reliable and clean technologies for utilization in off-grid cases. Optimal sizing of these systems results in more cost-effective units, which is the main subject of various researches. The current work focuses on the assessment of an optimization approach for finding the optimum size of PV/battery hybrid unit in order to provide the required electricity of the case study and reach the minimum Total Life Cycle Cost (TLCC). In this regard, the components of the designed systems are modelled; afterwards, the objective function is established on the basis of TLCC. In the optimization procedure, a constraint is considered for the highest possible loss of power in order to have a system with acceptable reliability. In addition, Improved Harmony Search (IHS) algorithm is applied to determine the variables with optimal quantities for satisfying the required electricity in the most cost-effective condition. The calculated results are compared with harmony search and simulated annealing algorithms to evaluate the reliability of the applied method. The outcomes show that employing IHS leads to more promising results and it has higher robustness compared with the HS and simulated annealing algorithms. Moreover, the number of units in the hybrid system reduces by increment in the loss of power supply probability.
Article
Full-text available
In this study, a turn‐to‐turn short circuit of motor stator fault diagnosis system based on deep auto‐encoder and soft‐max classifier is proposed. It is also considered in the proposed system that an unbalanced power supply has similar current characteristics and is possible to be misidentified. The determination of neural network parameters and their effect in training are given systematically, which improves the performance compared with traditional auto‐encoder. The proposed fault diagnosis system can map the preprocessed three‐phase motor current signals to a two‐dimensional vector, corresponding to certain area of the feature plane to identify different fault types. The proposed system is verified via simulation experiment using MATLAB and physical experiment using a motor in laboratory. The conclusion illustrates that when signals originating from discrete motor speed, load, and fault severity are used in training, the fault diagnosis is still effective in continuous state. The accuracy is 89% for speed verification and 95% for load verification in simulation and is above 99% in the physical experiment.
Article
Off-grid solar/diesel systems have been widely utilized in remote and rural settings. Some of the main challenges for solar-diesel energies are determining the appropriate capacity and location in rural areas. Therefore, an efficient framework is needed for using off-grid solar/diesel systems. For the optimal sizing and location of off-grid photovoltaic (PV)-diesel schemes in rural areas, a new framework is proposed. In this framework, a geographic information system module is utilized to identify the best location based on technical, economic, reliability, social, and environmental criteria. Then, a hybrid optimization algorithm is utilized to determine the appropriate capacity for continuously meeting of the load via total life cycle cost minimization. A real case study in South Khorasan is considered, in order to illustrate the proposed framework and its effectiveness. The effectiveness of the applied approach is investigated by comparing the outcomes with the results obtained by other heuristic methods. Also, the impacts are investigated of fuel cost variations, initial PV and battery costs, and interest rate on the economics of the hybrid scheme. The simulation results demonstrate that the hybrid algorithm obtains results that are more accurate (by 14.1%) than other applied algorithms, and show that the use of solar energy with a diesel generator, compared to the diesel only system, significantly reduces greenhouse gas emissions (by 59.6%) and supply costs (by 22.2%). The results thus demonstrate the benefits of utilizing the proposed framework for the hybrid system. These results are expected to help enhance acceptance and utilization of cleaner production systems.
Article
Nanofluids are attractive alternatives for the current heat transfer fluids due to their remarkably higher thermal conductivity which leads to the improved thermal performance. Nanofluids are applicable in porous media for improving their heat transfer. Proposing accurate models for forecasting this feature of nanofluids can facilitate and accelerate the design and modeling of nanofluids’ thermal mediums with porous structure. In the present study, three methods including MARS, artificial neural network (ANN) with Levenberg–Marquardt for training and GMDH are employed for thermal conductivity of the nanofluids containing ZnO particles. The confidence of the models is compared according to various criteria. It is observed that the most accurate model is obtained by using ANN with Levenberg–Marquardt followed by GMDH and MARS. R2 of the mentioned models are 0.9987, 0.9980 and 0.9879, respectively. Finally, sensitivity analysis is performed to find the importance of the input variables and it is concluded that the thermal conductivity of the base fluids has the highest importance followed by volume fraction of solid phase, size of particles and temperature.
Article
Nanofluids are employed in different thermal devices due to their enhanced thermophysical features which lead to noticeable heat transfer augmentation. One of the major reasons of the heat transfer improvement by using the nanofluids is their increased thermal conductivity. Several methods have been applied to estimate this property of nanofluids such as correlations and artificial neural networks (ANNs). In the present paper, group method of data handling (GMDH) and a mathematical correlation are proposed for forecasting the thermal conductivity of nanofluids containing CuO nanoparticles. The inputs of the both models are the base fluids’ thermal conductivities, concentration, temperature and nanoparticle dimension. Comparison of the forecasted data by these two approaches revealed more favorable performance of GMDH. The values of R-squared in the cases where polynomial and ANN were utilized were 0.9862 and 0.9996, respectively. Moreover, the average absolute relative deviation values were 5.25% and 0.881% for the indicated methods, respectively. According to these statistical values, it is concluded that employing the ANN-based regression leads to more confident model for forecasting the TC of the nanofluids containing CuO nanoparticles.
Article
Nanofluids are broadly applied in energy systems such as solar collector, heat exchanger and heat pipes. Dynamic viscosity of the nanofluids is among the most important features affecting their thermal behavior and heat transfer ability. Several predictive models, by employing various methods such as Artificial Neural Network, Support Vector Machine and mathematical correlations, have been proposed for estimating dynamic viscosity based on the influential factors such as size, type and volume fraction of nano particles and their temperature. The precision of the models depends on different elements such as the employed approach for modeling, input variables and the structure of the model. In order to have an accurate model for estimating the dynamic viscosity, it is necessary to consider all of the affecting factor. In this regard, the current study aim to review the researches concerns the applications of machine learning methods for dynamic viscosity modeling of nanofluids in order to provide deeper insight for the scientists. According to the reviewed scientific sources, the structure of model, such as number of neurons and layers in artificial neural network (ANN), the applied activation function, and utilized algorithm are the most influential factors on the accuracy of the model. Moreover, based on the studies considered both ANN and mathematical correlations, ANNs are more accurate and confident for estimating the nanofluids' dynamic viscosity. The majority of the studies in this field used temperature and concentration of nanofluids as input data for their models, while size of nanostructures and shear rate are considered in some researches in addition to mentioned variables.
Article
In recent years the t-distributed Stochastic Neighbor Embedding (t-SNE) algorithm has become one of the most used and insightful techniques for exploratory data analysis of high-dimensional data. It reveals clusters of high-dimensional data points at different scales while only requiring minimal tuning of its parameters. However, the computational complexity of the algorithm limits its application to relatively small datasets. To address this problem, several evolutions of t-SNE have been developed in recent years, mainly focusing on the scalability of the similarity computations between data points. However, these contributions are insufficient to achieve interactive rates when visualizing the evolution of the t-SNE embedding for large datasets. In this work, we present a novel approach to the minimization of the t-SNE objective function that heavily relies on graphics hardware and has linear computational complexity. Our technique decreases the computational cost of running t-SNE on datasets by orders of magnitude and retains or improves on the accuracy of past approximated techniques. We propose to approximate the repulsive forces between data points by splatting kernel textures for each data point. This approximation allows us to reformulate the t-SNE minimization problem as a series of tensor operations that can be efficiently executed on the graphics card. An efficient implementation of our technique is integrated and available for use in the widely used Google TensorFlow.js, and an open-source C++ library.
Article
The high penetration of variable renewable energy sources (RESs)has greatly increased the difficulty in power system scheduling and operation. To fully utilize the complementary characteristics of various RESs, a stochastic optimization model considering the strong regulation capacity of cascade hydropower stations and the uncertainty of wind and photovoltaic (PV)power is presented. Based on the improved generative adversarial networks, the spatial and temporal correlation characteristics between wind farms and PV plants are accurately captured via measured data. Due to the nonlinear features of the hydroelectric plants, linearization methods are adopted to reformulate the original model into a standard mixed integer linear programming (MILP)formulation. Then, the model is solved with a proposed two-stage approach, in which a heuristic algorithm is used to solve the first-stage unit commitment optimization. The cascade hydraulic connection and time delay of the water flow are established in the second stage to exploit the considerably controllable adjustment capability of hydropower generation. A renewable energy base in southwest China is chosen as a detailed case study. The simulation results reveal the potential of the large-scale application of only a hydro-wind-solar hybrid system to satisfy the power transmission demand with the guidance of the coordinated operation strategy, and the performance of the hybrid system can be further enhanced with high-quality scenarios from the proposed deep neural network.