Lab
Forecasting
Institution: University of Venda
About the lab
Probabilistic Load and Renewable Energy Modelling
Featured research (24)
In today’s world, where sustainable energy is essential for the planet’s survival, accurate solar energy forecasting is crucial. This study focused on predicting short-term Global Horizontal Irradiance (GHI) using data from the Southern African Universities Radiometric Network (SAURAN) at the Univen Radiometric Station in South Africa. Various techniques were evaluated for their predictive accuracy, including Recurrent Neural Networks (RNN), Support Vector Regression (SVR), Gradient Boosting (GB), Random Forest (RF), Stacking Ensemble, and Double Nested Stacking (DNS). The results indicated that RNN performed the best in terms of Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE) among the machine learning models. However, Stacking ensembles with XGBoost as the meta-model outperformed all individual models, improving accuracy by 67.06% in MAE and 22.28% in RMSE. DNS further enhanced accuracy, achieving a 93.05% reduction in MAE and an 88.54% reduction in RMSE compared to the best machine learning model, as well as a 78.89% decrease in MAE and an 85.27% decrease in RMSE compared to the best single stacking model. Furthermore, experimenting with the order of the DNS meta-model revealed that using RF as the first-level meta-model followed by XGBoost yielded the highest accuracy, showing a 47.39% decrease in MAE and a 61.35% decrease in RMSE compared to DNS with RF at both levels. These findings underscore the potential of advanced stacking techniques to significantly improve GHI forecasting.
The main source of electricity worldwide stems from fossil fuels, contributing to air pollution, global warming, and associated adverse effects. This study explores wind energy as a potential alternative. Nevertheless, the variable nature of wind introduces uncertainty in its reliability. Thus, it is necessary to identify an appropriate machine learning model capable of reliably forecasting wind speed under various environmental conditions. This research compares the effectiveness of Dynamic Architecture for Artificial Neural Networks (DAN2), convolutional neural networks (CNN), random forest and XGBOOST in predicting wind speed across three locations in South Africa, characterised by different weather patterns. The forecasts from the four models were then combined using quantile regression averaging models, generalised additive quantile regression (GAQR) and quantile regression neural networks (QRNN). Empirical results show that CNN outperforms DAN2 in accurately forecasting wind speed under different weather conditions. This superiority is likely due to the inherent architectural attributes of CNNs, including feature extraction capabilities, spatial hierarchy learning, and resilience to spatial variability. The results from the combined forecasts were comparable with those from the QRNN, which was slightly better than those from the GAQR model. However, the combined forecasts were more accurate than the individual models. These results could be useful to decision-makers in the energy sector.
Given that cryptocurrencies are now involved in nearly every financial transaction due to their widespread acceptance as an alternative method of payment and currency exchange, researchers and economists have increased opportunities to analyze cryptocurrency prices. Over time, predicting the daily closing price of Ethereum has been challenging for investors, traders, and investment banks because of its significant price volatility. The daily closing price of cryptocurrency is crucial for trading or investing in Ethereum. This report aims to conduct a comparative analysis of the predictive performance of deep machine learning algorithms within a stacking ensemble modeling framework, utilizing daily historical price data of Ethereum from Coindesk, tweets from Twitter spanning from August 1, 2022, to August 8, 2022, and five additional covariates (closing price lag1, closing price lag2, noltrend, daytype, and month) derived from Ethereum's closing price. Seven models are employed to forecast the daily closing price of Ethereum: recurrent neural network, ensemble stacked recurrent neural network, gradient boosting machine, generalized linear model, distributed random forest, deep neural networks, and a stacked ensemble of gradient boosting machine, generalized linear model, distributed random forest, and deep neural networks. The primary evaluation metric is the mean absolute error (MAE). Based on MAE, the RNN forecasts outperform the other models in this study, achieving an MAE of 0.0309.
The widespread use of fossil fuels for global energy production significantly contributes to global warming. This study presents a comparative analysis of various machine learning models, which are the long short-term memory (LSTM) network, support vector regression (SVR), and gradient boosting method (GBM). Gaussian process regression (GPR) is a benchmark model across different forecasting horizons. The study uses South African wind speed data from 1 January 2018 to 31 December 2021, sourced from the Western Cape province. The dataset underwent preprocessing, and diverse feature selection techniques were implemented
to enhance model accuracy. Performance evaluation of the models was done using mean absolute error (MAE), root mean squared error (RMSE), and mean absolute scaled error (MASE). Results indicate that SVR exhibits superior accuracy to other models for two distinct forecast horizons (h = 670 and h = 1339), respectively. Additionally, GPR surpasses other models for the forecasting horizon h = 224. This study provides insights into the comparative strengths and weaknesses of different machine learning models for wind speed prediction, which could be useful in selecting an appropriate model for future applications in renewable energy and
weather forecasting. Potential areas for future research include improving prediction accuracy via ensemble deep learning algorithms and incorporating additional meteorological variables. Moreover, investigating temporal dynamics, broadening geographical coverage and integrating uncertainty quantification methods can improve wind speed prediction, thereby facilitating more effective renewable
energy planning and decision-making processes
In recent years, there has been increasing interest in the joint modelling of compound extreme events such as high temperatures and low rainfall. The increase in the frequency of occurrence of these events in many regions has necessitated the development of models for estimating the concurrent probabilities of such compound extreme events. The current study discusses an application of copula models in predicting the concurrent probabilities of compound low rainfall and high-temperature events using data from the Lowveld region of the Limpopo province in South Africa. The second stage discussed two indicators for monitoring compound high temperature and low rainfall events. Empirical results from the study show that elevations ranging from 100–350 m, 350–700 m and 700–1200 m exhibit varying probabilities of experiencing drought, with mild droughts having approximately 64%, 66%, and 65% chances, moderate droughts around 36%, 39%, and 38%, and severe droughts at approximately 16%, 19%, and 18%, respectively. Furthermore, the logistic regression models incorporating the southern oscillation index as a covariate yielded comparable results of copula-based models. The methodology discussed in this paper is robust and can be applied to similar datasets in any regional setting globally. These findings could be useful to disaster management decision makers, helping them formulate effective mitigation strategies and emergency response plans.
Lab head

Department
- Department of Mathematical and Computational Sciences
About Caston Sigauke
- Caston is a statistical modeller with research interests in probabilistic forecasting with applications in energy and environmental systems. Caston’s research is on probabilistic load and renewable energy (solar and wind) forecasting, including the optimization of grid integration of renewable energies