ArticlePDF Available

Modeling and Trading the EUR/USD Exchange Rate Using Machine Learning Techniques

Authors:

Abstract

The present paper aims in investigating the performance of state-of-the-art machine learning techniques in trading with the EUR/USD exchange rate at the ECB fixing. For this purpose, five supervised learning classification techniques (K-Nearest Neighbors algorithm, Naïve Bayesian Classifier, Artificial Neural Networks, Support Vector Machines and Random Forests) were applied in the problem of the one day ahead movement prediction of the EUR/USD exchange rate with only autoregressive terms as inputs. For comparison reasons, the performance of all machine learning techniques was benchmarked by two traditional techniques (Naïve Strategy and moving average convergence/divergence model). Trading strategies produced by the machine learning techniques of Support Vector Machines and Random Forests clearly outperformed all other strategies in terms of annualized return and sharp ratio. To the best of our knowledge, this is the first application of Random Forests in the problem of trading with the EUR/USD exchange rate providing extremely satisfactory results.
ETASR - Engineering, Technology & Applied Science Research
Vol. 2, No. 5, 2012, 269-272
269
www.etasr.com Theofilatos et al: Modeling and Trading the EUR/USD Exchange Rate Using Machine Learning Techniques
Modeling and Trading the EUR/USD Exchange Rate
Using Machine Learning Techniques
Konstantinos Theofilatos
Dpt of Computer Engineering and
Informatics
University of Patras
Patras, Greece
theofilk@ceid.upatras.gr
Spiros Likothanassis
Dpt of Computer Engineering and
Informatics
University of Patras
Patras, Greece
likothan@ceid.upatras.gr
Andreas Karathanasopoulos
London Metropolitan Business School
London,
United Kingdom
A.Karathanasopoulos@londonmet.ac.uk
Abstract— The present paper aims in investigating the
performance of state-of-the-art machine learning techniques in
trading with the EUR/USD exchange rate at the ECB fixing. For
this purpose, five supervised learning classification techniques
(K-Nearest Neighbors algorithm, Naïve Bayesian Classifier,
Artificial Neural Networks, Support Vector Machines and
Random Forests) were applied in the problem of the one day
ahead movement prediction of the EUR/USD exchange rate with
only autoregressive terms as inputs. For comparison reasons, the
performance of all machine learning techniques was
benchmarked by two traditional techniques (Naïve Strategy and
moving average convergence/divergence model). Trading
strategies produced by the machine learning techniques of
Support Vector Machines and Random Forests clearly
outperformed all other strategies in terms of annualized return
and sharp ratio. To the best of our knowledge, this is the first
application of Random Forests in the problem of trading with the
EUR/USD exchange rate providing extremely satisfactory results.
Keywords- EUR/USD Exchange Rate; future direction prediction;
Naïve strategy; MACD strategy; Naïve Bayesian Classifier; K-
Nearest neighbors classifier; SVM; Random Forests; leverage;
transaction costs
I. I
NTRODUCTION
The application of machine learning techniques for market
predictions has been widely established in the scientific
community. This paper deals with the application of a variety
of state-of-the-art machine learning techniques in the problem
of predicting the one day ahead movement direction of the
EURO-USD exchange rates. Developing high accuracy
techniques for predicting financial time series is a very crucial
problem for economists, investigators and analysts. The
traditional statistical methods, used by economists in the past
years, seem to fail to capture the discontinuities, the
nonlinearities and the high complexity of financial time series.
Complex machine learning techniques like Artificial Neural
Networks, Support Vector Machines (SVM) and Random
Forests provide enough learning capacity and are more likely to
capture the complex non-linear models which are dominant in
the financial markets.
Some approaches examining the performance of machine
learning techniques in trading with the EURO-USD exchange
rate have already been developed. In [1], Dunis and Williams
demonstrated the ability of Multi Layer Perceptron (MLP)
Artificial Neural Networks in modeling and trading with the
EUR/USD exchange rate. Their empirical results showed that
the MLP outperformed all other benchmark models used. Next,
in [2], Ullrich et al. used SVMs to trade with a variety of
foreign exchange rates including EUR/USD. Their results
indicated that SVMs outperformed Artificial Neural Networks
and all other traditional techniques used in this paper for
comparative reasons. Finally, in [3], Dunis et al. compared
Higher Order Neural Networks, Psi Sigma Networks,
Recurrent Networks and MLP in the task of trading with the
EURO/USD exchange rate. MLP was proved to outperform all
other neural networks variation.
The rest of the paper is organized as follows: In section II,
the dataset of EURO-USD exchange rates is presented. In
section III, all the traditional and machine learning techniques
used in the present paper are briefly described. In section IV
the comparative results are presented and in section V they are
discussed and some future research directions are proposed.
II. T
HE EUR
/
USD EXCHANGE RATES AND RELATED
FINANCIAL DATA
The European Central Bank (ECB) publishes a daily fixing
for selected EUR exchange rates: these reference mid-rates are
based on a daily concentration procedure between central banks
within and outside the European System of Central Banks,
which normally takes place at 2.15 p.m. ECB time. The
reference exchange rates are published both by electronic
market information providers and on the ECB’s website shortly
after the concentration procedure has been completed.
Although only a reference rate, many financial institutions are
ready to trade at the EUR fixing and it is therefore possible to
leave orders with a bank for business to be transacted at this
level.
The ECB daily fixing of the EUR/USD is therefore a
tradable level which makes using of a more realistic alternative
to, say, London closing prices and this is the series that we
ETASR - Engineering, Technology & Applied Science Research
Vol. 2, No. 5, 2012, 269-272
270
www.etasr.com Theofilatos et al: Modeling and Trading the EUR/USD Exchange Rate Using Machine Learning Techniques
investigate in the present paper. EUR/USD is quoted as the
number of USD per Euro.
Specifically, the data which are used for our problem were
downloaded from [4], and they were split as shown in Table 1
in order to train and evaluate our models.
TABLE I.
EURO
/
YSD DATASETS
.
Name of
period
Trading days Beginning End
Total dataset 2294 17 January 2002 30 December 2010
Training
dataset 1363 17 January 2002 16 May 2007
Validation
dataset 459 17 May 2007 2 March 2009
Testing dataset
[out of sample
dataset]
471 3 March 2009 30 December 2010
The graph in Figure 1 shows the total dataset for the
EUR/USD. As inputs to our models we selected a set of
autoregressive terms of the EUR/USD exchange rate returns
presented in Table 2.
Fig. 1. Euro/USD daily fixing prices (total dataset)
TABLE II.
EXPLANATORY VARIABLES
.
Number Input variable Lag
1 EUR/USD exchange rate return 1
2 EUR/USD exchange rate return 2
3 EUR/USD exchange rate return 3
4 EUR/USD exchange rate return 4
5 EUR/USD exchange rate return 5
6 EUR/USD exchange rate return 6
7 EUR/USD exchange rate return 7
8 EUR/USD exchange rate return 8
9 EUR/USD exchange rate return 9
10 EUR/USD exchange rate return 10
III. F
ORECASTING MODELS
A. Benchmark Models
In the present paper, the machine learning methods
presented in section 3.2 were benchmarked with 2 traditional
strategies, namely Naïve strategy [3] and a moving average
convergence/divergence technical model (MACD) [3].
The Naïve strategy takes the most recent period change as
the best prediction of the future change. The MACD strategy
used is quite simple. Two moving average series are created
with different moving average lengths. The decision rule for
taking positions in the market is straightforward. Positions are
taken if the moving averages intersect. If the short-term moving
average intersects the long term moving average from below a
‘long’ position is taken. Conversely, if the long-term moving
average is intersected from above a ‘short’ position is taken.
B. Machine Learning Models
Some of the state-of-the-art machines learning
classification techniques were applied in the problem of the one
day ahead prediction of the direction movement of the
EUR/USD exchange rate. These machine learning techniques
are: K-nearest neighbor classifier (KNN), Naïve Bayesian
classifier, Support Vector Machines (SVM) and Random
Forests. In order to find the optimal parameters for each
machine learning technique we used only the training and
validation datasets leaving the test set for the final evaluation of
the algorithms. Doing this we avoid specializing in our dataset
and getting misleading results. The parameters were optimized
using genetic algorithms [5]. All implementations were done
using Matlab R2009a edition.
K-nearest neighbors [6] is a method for classifying objects
based on closest training examples in the feature space. The
KNN method is considered one of the simplest machine
learning techniques. An object is classified by a majority vote
of its neighbors. If K=1, then the object is assigned to the class
of its nearest neighbor. In our case study the optimal K found
was 8.
The Naïve Bayesian classifier [7] is a simple probabilistic
classifier with strong assumptions of independence among
input variable. It is the classifier derived from the use of Bayes’
theorem. Bayes’ Theorem expresses the conditional
probability, or “posterior probability”, of a hypothesis H, (i.e.
its probability after evidence E is observed) in terms of the
“prior probability” of H, the prior probability of E and the
conditional probability of E given H. In implies that evidence
has a stronger confirming effect if it was more likely before
being observed. Bayes’ theorem is valid in all common
interpretations of probabilities, and it is commonly applied in
science and engineering. The main disadvantages of Naïve
Bayesian classifiers are their assumptions about the
independency of the input variables, which is not usually the
case in real problems.
Neural networks [8] exist in several forms in the literature.
The most popular architecture is the Multi-Layer Perceptron
(MLP). A MLP consists of at least three layers of nodes. The
network processes information as follows: the input nodes
ETASR - Engineering, Technology & Applied Science Research
Vol. 2, No. 5, 2012, 269-272
271
www.etasr.com Theofilatos et al: Modeling and Trading the EUR/USD Exchange Rate Using Machine Learning Techniques
contain the value of the explanatory variables. Each node of the
hidden layers passes incoming information through a nonlinear
activation function and passes it to the output layer if the
calculated value is above a threshold. Each node of one layer
has weighted connections to all other nodes of the next layer.
The training of the network (which is the adjustment of its
weights in the way that the network maps the input value of the
training data to the corresponding output value) starts with
randomly chosen weights and proceeds by applying a
supervised learning algorithm, called back propagation of
errors. Since networks with sufficient hidden nodes are able to
learn the training data (as well as their outliers and their noise)
by heart, it is crucial to stop the training procedure at the right
time to prevent overfitting (this is called ‘early stopping’). In
the present paper the best network’s architecture found was the
one using one hidden layer with 19 hidden neurons.
Support vector machines (SVM) are a group of supervised
learning methods that can be applied in classification and
regression problems. SVMs represent an extension to non
linear models of the generalized algorithm developed by
Vapnik [9]. They have already been applied in many scientific
problems. Specifically, SVM have already been used in many
prediction and classification problems in finance and
economics although they are still far from mainstream. The few
financial applications so far have only been published in
statistical learning and artificial intelligence journals. SVM
models were originally defined for the classification of linearly
separable classes of objects. For any original separable set of
two-class objects SVM are able to find the optimal hyperplanes
that separates them providing the bigger margin area between
the two hyperplanes. Furthermore they can also be used to
separate classes that cannot be separated with a linear classifier.
In such cases, the coordinates of the objects are mapped into a
feature space using nonlinear functions. The feature space in
which every object is projected is a high dimensional space in
which the two classes can be separated with the linear
classifier. In the present work we used the Radial Basis
Function (RBF) as Kernel function for the SVM models
because of its efficiency in providing very high performance
classification results. The optimal RBF parameters C and
gamma were found to be 64 and 2 respectively re-assuring that
the model does not over fit.
Another sophisticated machine learning method, such as the
support vector machines (SVM), is the random forest method.
Random forests [10] are ensemble classifiers that "grow" many
decision trees simultaneously where each node uses a random
subset of the features considered. Specifically, Random forests
are a combination of tree predictors such that each tree depends
on the values of a random vector sampled independently and
with the same distribution for all trees in the forest. The idea of
growing an ensemble of trees and letting them vote for the
most popular class has led to significant improvements in
classification accuracy. The generalization error for the forest
converges to a limit as the number of trees in the forest
becomes large. The generalization error for the forest of tree
classifiers depends on the strength of the individual trees in the
forest and the correlation between them. The optimal number
of classification trees found for our case study was 51 which
seem to be enough for achieving a very good generalization
performance. The applications of Random Forests in predicting
the movement direction of financial time series remains
nowadays quit limited despite of their high classification
performance and their ability to generalize in data that have not
been used to train their classifiers [11].
IV. E
MPIRICAL TRADING SIMULATION RESULTS
The trading performance of all models considered in the
validation subset is presented in Table 3. Due to the stochastic
nature of the machine learning techniques, every method was
executed 10 times and the results presented in Table 3 present
the mean performance of these executions. The trading strategy
derived for all the machine learning classification techniques
used in the present paper is simple and identical for all of them:
go or stay long if the classification model forecasts a positive
movement and go or stay short if the classification model
forecasts a negative movement. Since some of our models trade
quite often, taking transaction costs into account might change
the whole picture.
The transaction costs for a tradable amount, say USD 5-10
million, are about 1 pip (0.0001 EUR/USD) per trade (one
way) between market makers. But since we consider the
EUR/USD time series as a series of bid rates, we have to pay
the costs only one and not two times per position taken. With
an average rate of EUR/USD of 1.332 for the testing period, a
cost of 1 pip is equivalent to an average cost of 0.008% per
position.
TABLE III.
TRADING PERFORMANCE RESULTS
.
Naïve
Strategy
MACD KNN Naïve
Bayes
Information Ratio
(excluded costs) -0.48 0.27 -0.12 -0.21
Annualized Volatility
(excluded costs) 11.44% 11.46% 11.46% 11.46%
Annualized Return
(excluding costs) -5.51% 3.08% -1.36% -2.46%
Maximum Drawdown
(excluding costs) -26.54% -16.96% -13.98% -15.14%
Correct Directional
Prediction 49.24% 50.11% 48.83%
Transaction costs 1.04% 0.28% 0.90% 0.62
Annualized Return
(including costs) -6.59% 2.80% -2.26% -3.08
Artificial Neural
Networks (MLP)
SVM Random
Forests
Information Ratio
(excluded costs) 0.22 0.43 0.72
Annualized Volatility
(excluded costs) 11.46% 11.46% 11.45%
Annualized Return
(excluding costs) 2.51% 4.90% 8.29%
Maximum Drawdown
(excluding costs) -18.89% -14.74% -9.94%
Correct Directional
Prediction 50.12% 52.65% 53.50%
Transaction costs 0.92 0.92% 1.01%
Annualized Return
(including costs) 1.59% 3.98% 7.28
ETASR - Engineering, Technology & Applied Science Research
Vol. 2, No. 5, 2012, 269-272
272
www.etasr.com Theofilatos et al: Modeling and Trading the EUR/USD Exchange Rate Using Machine Learning Techniques
As observed in Table 3, SVM and Random Forests
outperform all models in terms of annualized return and
information ratio even when transaction costs are considered.
Random Forests are the dominant model presenting the higher
minimum drawdown and thus reducing the trading risk.
Furthermore, as a classifier Random Forests demonstrated the
highest correct directional prediction in the out of sample
period and thus being the most accurate one.
V. C
ONCLUSIONS
In the present work, we applied a variety of machine
learning techniques in the problem of modeling and trading
with the EURO/USD exchange rate. From all the applied
machine learning techniques, Random Forests has not been
applied in this problem again while being one of the most
accurate classifiers. The machine learning techniques were
benchmarked with two traditional trading strategies: Naïve
strategy and MACD strategy.
Except from the simple methods of KNN and Naïve
Bayesian classifiers all other machine learning techniques
outperformed the traditional strategies that are even until now
used by economists. Thus, our empirical results encourage
future research in applying machine learning techniques in
trading with financial time series. From all the machine
learning techniques applied in the present paper, Random
Forests indicated the best trading performance in terms of
annualized return and information ration even when the
transaction costs were considered.
As a future direction we propose the application of random
forests and other machine learning techniques in trading with
other financial time series in order to test their performance and
establish them as reliable quantitative trading tools.
R
EFERENCES
[1] C. Dunis, M. Williams, “Modeling and trading the Euro/Us Dollar
exchange rate: do neural networks perform better?”, Derivatives Use,
Trading and Regulation, Vol. 8, No. 3, pp. 211-240, 2002.
[2] C. Ullrich, D. Seese, S. Chalup, “Foreign exchange trading with support
vector machines”, Advances in Data Analysis: Studies in Classification,
Data Analysis and Knowledge Organization, Part VII, pp. 539-546,
2007.
[3] C. Dunis, J. Laws, G. Sermpinis, “Modelling and trading the EUR/USD
exchange rate at the ECB fixing”, The European Journal of Finance,
Vol. 16, No. 6, pp. 641-561, 2010.
[4] Thomson Reuters Datastream: http://online.thomsonreuters.com/
datastream/ .
[5] J. Holland, Adaptation in natural and artificial systems: an introductory
analysis with applications to biology, control, and artificial intelligence,
Cambridge: Mass, MIT Press, 1995.
[6] T. Cover, P. Hart, “Nearest neighbor pattern classification”, IEEE Trans.
Inform. Theory, Vol. 13, No. 1, pp. 21-27, 1967.
[7] C. Howson, P. Urbach, Scientific Reasoning: The Bayesian Approach,
Third Edition, Open Course Publishing Company, 1993.
[8] S. Haykin, Neural Networks: A Comprehensive Foundation, Prentice
Hall, 1998.
[9] V. Vapnik, The Nature of Statistical Learning Theory, Springer 2000.
[10] L. Breiman, “Random Forests”, Machine Learning, Vol. 45, No. 1, pp.
5-32 2001.
[11] K. Manish, M. Thenmozhi, “Forecasting stock index movement: A
comparison of support vector machines and random forest”. In
Proceedings of ninth Indian institute of capital markets conference,
Mumbai, India, 2005.
AUTHORS
PROFILE
Konstantinos Theofilatos is a PhD candidate in the Department of Computer
Engineering and Informatics of the University of Patras, Greece. In 2009, he
received a Master's degree from the Department of Computer Engineering and
Informatics of the University of Patras. His research interests include
computational and artificial intelligence, evolutionary computation, time
series modeling and forecasting, bioinformatics, data mining and web
technologies.
Spiros Likothanassis is an Electrical Engineer with a Ph.D. in Computer
Engineering and Informatics. He currently is a professor in the Department of
Computer Engineering and Informatics of the University of Patras. He is a
reviewer in many scientific journals and conferences including IEEE Neural
Networks, Journal of Knowledge and Information Systems, Information
Sciences Journal and Journal of Computational and Applied Mathematics. He
has a very good knowledge on Adaptive Signal Processing, Adaptive Control,
Computational and Artificial Intelligence, Intelligent Agents and
Bioinformatics. He has also experience in designing Information Systems, E-
commerce Applications, Distance Learning Systems and Teleworking
Applications
Andreas Karathanasopoulos is a senior Lecturer in London Metropolitan
Business School. In 2008 he received the Master of Science in International
Banking and Finance from the Department of Banking and Finance of
Liverpool John Moore’s University. His research interests include financial
forecasting, trading strategies, time series prediction, artificial and
computational intelligence and neural networks.
... This study fills this research gap by investigating water consumption in the northern area of Saudi Arabia using different algorithms, including RF, GB, SVM, and LR. Similar frameworks to these methods can be found in [18][19][20][21][22][23][24]. The novel contributions of this study are that it is carried out for a special area with special characteristics, it also merges two datasets for efficient water consumption prediction, including water and weather data, and finally applies different ML algorithms to the merged datasets. ...
... SVMs consist of a group of supervised learning methods that are usually used in classification and regression problems. SVMs are extended to nonlinear models of the generalized algorithm developed for linear regressions [20]. ...
Article
Full-text available
Water scarcity is considered a major problem in dry regions, such as the northern areas of Saudi Arabia and especially the city of Hail. Water resources in this region come mainly from groundwater aquifers, which are currently suffering from high demand and severe climatic conditions. Forecasting water consumption as accurately as possible may contribute to a high level of sustainability of water resources. This study investigated different Machine Learning (ML) algorithms, namely Support Vector Machine (SVM), Random Forest (RF), Linear Regression (LR), and Gradient Boosting (GB), to efficiently predict water consumption in such areas. These models were evaluated using a set of performance measures, including Mean Squared Error (MSE), R-squared (R2), Mean Absolute Error (MAE), Explained Variance Score (EVS), Mean Absolute Percentage Error (MAPE), and Median Absolute Error (MedAE). Two datasets, water consumption and weather data, were collected from different sources to examine the performance of the ML algorithms. The novelty of this study lies in the integration of both weather and water consumption data. After examining the most effective features, the two datasets were merged and the proposed algorithms were applied. The RF algorithm outperformed the other models, indicating its robustness in capturing water usage behavior in dry areas such as Hail City. The results of this study can be used by local authorities in decision-making, water consumption analysis, new project construction, and consumer behavior regarding water usage habits in the region.
... Bayes' Theorem is used to update prior distributions with observed data to obtain posterior distributions for the coefficients. This Theorem expresses the posterior probability in terms of the prior probability and the conditional probability [38,39]. ...
Article
Full-text available
Background Under-five mortality in Tanzania remains a persistent issue, significantly affecting both the health and economic sectors. Despite various interventions, the under-five mortality rate (U5MR) remains high, impeding progress toward global health targets. This study investigates the factors influencing under-five mortality in Tanzania, focusing on the gross domestic product (GDP), malaria incidence, access to water, and access to sanitation. Methods The study employed data sets for Tanzania U5MR, GDP, access to water and sanitation, and malaria incidences from the World Bank for the years 1960–2020. Missing values are generated through the linear trend at point method. To analyses the data, correlation analysis and Bayesian linear regression are employed. Results The analysis reveals significant relationships between the gross domestic product (GDP), malaria incidence, access to water, access to sanitation and under-five mortality. Furthermore, an increase in malaria incidences increase under-five mortality by 0.14 (14%), while access to water and sanitation exhibit an uncertain relationship. On the other hand, results show that an increase in the GDP lowers the likelihood of U5MR. Conclusion These findings underscore the importance of economic development and public health interventions in reducing the child mortality rate. The study provides valuable insights for policymakers aiming to achieve Sustainable Development Goal (SDG) Target 3.2 by 2030, which aims for all countries to accomplish U5MR of 25 or less deaths per 1000 live births. By highlighting the relationships between these variables, the study contributes practical evidence to support efforts towards SDG 3.2, emphasizing the need for targeted interventions in both health and infrastructure sectors.
... They concluded that the seasonal autoregressive integrated moving average is the most effective model to forecast the exchange rate of the Jordanian Dinar against the United States dollar. On the other hand, the work in [8] tested various machine learning models to forecast EUR/USD exchange rates. Their findings indicated that Random Forest is the most accurate model in predicting such phenomena. ...
... MRMR is a feature selection technique particularly suitable for high-dimensional datasets like gene expression profiles in leukemia [5,6]. This paper use four widely used ML algorithms: Naïve Bayes (NB), K-Nearest Neighbors (KNN), Support Vector Machine (SVM), and Artificial Neural Networks (ANNs) [3]. The present study utilizes a publicly available dataset originating from a foundational work published in 1999 [7]. ...
Article
Full-text available
Early and accurate diagnosis of leukemia is crucial for effective treatment. Machine Learning (ML) offers promising tools for leukemia diagnosis classification, but the required high-dimensional datasets pose challenges. This study explores the effectiveness of ML algorithms for leukemia disease classification and investigates the impact of feature selection with the Minimum Redundancy Maximum Relevance (MRMR ) technique. MRMR was implemented to select informative features and evaluate four ML algorithms (Naïve Bayes (NB), K-Nearest Neighbors (KNN), Support Vector Machine (SVM), and Artificial Neural Networks (ANNs)) using feature subsets with varying levels of relevance based on MRMR scores. Our results demonstrate that MRMR effectively reduced dimensionality while maintaining and even improving classification accuracy. KNN and SVM achieved the highest accuracy (100% for 67, 30, and 24 feature subsets), suggesting the benefit of focusing on highly relevant features. NB exhibited consistent accuracy across all feature sets.
... There are many factors affecting apartment sales and rental prices, namely the interest rates, inflation rates, economic growth rate, economic development level, industrialization rate, financial market developments, environmental factors, and building characteristics [1][2][3][4][5]. Recently, Artificial Intelligence (AI) algorithms have been widely applied in engineering techniques [6][7][8][9][10][11][12] and, in particular, the application of AI models in apartment price prediction has been the subject of many studies. In [13], Machine Learning (ML) was used to predict the real-time apartment prices in Tokyo metropolitan area. ...
Article
Full-text available
This study aims to propose an efficient Machine Learning (ML) model, namely Gradient Boosting Regression Trees (GBRT), to predict apartment prices considering the fluctuation of construction material prices and the annual inflation index. For developing the ML model, 480 apartments in Vinh City (Vietnam) were considered. The input parameters employed while training the ML model were the area of the apartments, the number of bedrooms/restrooms, the apartment class, nearby health or education services, investment potential, and parking, whereas the apartment price was the output of the model. The results show that the GBRT model predicts the apartment price accurately with a high value of 0.997 and a small RMSE of 0.26. Additionally, the obtained a20-index is very high, almost 1.0. Finally, a practical graphical user interface was developed to facilitate the prediction of the apartment price in terms of usability.
... ML is widely utilized across a diverse range of applications and disciplines due to their ability to make informed decisions and predictions using recently acquired data [14][15][16][17]. ML algorithms undergo repeated training iterations to adapt and identify patterns. ...
Article
Full-text available
Coastal defense structures are of paramount importance in protecting coastal communities from the adverse impacts of severe weather events and flooding. This study uses machine learning techniques, specifically Decision Tree (DT), Gradient Boosted Tree (GBT), and Support Vector Machine (SVM) models, to estimate wave overtopping discharge at coastal structures with combined slopes employing the recently built EurOtop database. The models were evaluated by deploying statistical metrics and Taylor diagram visualization. The GBT model demonstrated a high level of accuracy in predicting wave-overtopping discharge. Compared to the other models, the scatter index of GBT (0.392) was lower than that of DT (0.512) and SVM (0.823). In terms of the R-index, GBT (0.991) was superior to DT (0.977) and SVM (0.943). The GBT results were also compared with those of previous works. The findings showed that the GBT model significantly decreased the overall error and provided accurate estimations of the wave-overtopping discharge.
... However, the study used data from the NASDAQ stock exchange, which is a US stock market, and did not state whether the resulting model was tested on actual data from East African stock markets. In [24][25][26], several methods were incorporated to predict future stock prices and currency exchange rates. ...
Article
Full-text available
Stock price prediction models help traders to reduce investment risk and choose the most profitable stocks. Machine learning and deep learning techniques have been applied to develop various models. As there is a lack of literature on efforts to utilize such techniques to predict stock prices in Tanzania, this study attempted to fill this gap. This study selected active stocks from the Dar es Salaam Stock Exchange and developed LSTM and GRU deep learning models to predict the next-day closing prices. The results showed that LSTM had the highest prediction accuracy with an RMSE of 4.7524 and an MAE of 2.4377. This study also aimed to examine whether it is significant to account for the outstanding shares of each stock when developing a joint model for predicting the closing prices of multiple stocks. Experimental results with both models revealed that prediction accuracy improved significantly when the number of outstanding shares of each stock was taken into account. The LSTM model achieved an RMSE of 10.4734 when the outstanding shares were not taken into account and 4.7524 when they were taken into account, showing an improvement of 54.62%. However, GRU achieved an RMSE of 12.4583 when outstanding shares were not taken into account and 8.7162 when they were taken into account, showing an improvement of 30.04%. The best model was implemented in a web-based prototype to make it accessible to stockbrokers and investment advisors.
... In recent years, the ML based Text Classification (TC) techniques have been widely employed in various domains [10] such as Sentiment analysis [11,12], E-Commerce portals [13,14], Email classification [15], Human Resource Management [2], Banking and Exchange Stock Markets [16,17], and bioinformatics [18,19]. In this study, ML-based text classification techniques are employed in the Human Resource Management domain. ...
Article
Full-text available
The selection of a suitable job applicant from the pool of thousands applications is often daunting job for an employer. The categorization of job applications submitted in form of Resumes against available vacancy(s) takes significant time and efforts of an employer. Thus, Resume Classification System (RCS) using the Natural Language Processing (NLP) and Machine Learning (ML) techniques could automate this tedious process. Moreover, the automation of this process can significantly expedite and transparent the applicants’ screening process with mere human involvement. This experimental study presents an automated NLP and ML-based RCS that classifies the Resumes according to job categories with performance guarantees. This study employs various ML algorithms and NLP techniques to measure the accuracy of RCS and proposes a solution with better accuracy and reliability in different settings. To demonstrate the significance of NLP and ML techniques for RCS, the extracted features were evaluated on nine ML classification models namely Support Vector Machine - SVM (Linear, SGD, SVC and NuSVC), Naïve Bayes (Bernoulli, Multinomial & Gaussian), K-Nearest Neighbor (KNN), and Logistic Regression (LR). The Term-Frequency-Inverse-Document-Frequency (TF-IDF) feature representation scheme was proved suitable for RCS. The developed models were evaluated using the Confusion Matrix, F-Score, Recall, Precision, and overall Accuracy. The experimental results indicate that using the One-Vs-Rest-Classification strategy for this multi-class Resume classification task, the SVM class of Machine Learning classifiers performed better on the study dataset of over nine hundred sixty plus parsed resumes with more than 96% accuracy. The promising results suggest that NLP and ML techniques employed in this study could be used for developing an efficient RCS.
Chapter
In this study, the philosophy of fundamental and technical investment analysis is analyzed; focusing on how the nature of the technical analysis can be combined with new technologies such as machine learning techniques to developed advanced trading strategies. Finally, we show how trading strategies can be used as a hedging tool in new engineering technologies like 3D printing.KeywordsTechnical analysisFundamental analysisMachine learningQuantitative trading strategies
Article
Full-text available
This research examines and analyses the use of Neural Network Regression (NNR) models in foreign exchange (FX) forecasting and trading models. The NNR models are benchmarked against traditional forecasting techniques to ascertain their potential added value as a forecasting and quantitative trading tool. In addition to evaluating the various models using traditional forecasting accuracy measures, such as root mean squared errors, they are also assessed using financial criteria, such as risk-adjusted measures of return. Having constructed a synthetic EUR/USD series for the period up to 4 January 1999, the models were developed using the same in-sample data, leaving the remainder for
Article
Full-text available
The motivation for this paper is to investigate the use of alternative novel neural network (NN) architectures when applied to the task of forecasting and trading the euro/dollar (EUR/USD) exchange rate, using the European Central Bank (ECB) fixing series with only auto-regressive terms as inputs. This is done by benchmarking four different NN designs representing a higher-order neural network (HONN), a Psi Sigma Network and a recurrent neural network with the classic multilayer perception (MLP) and some traditional techniques, either statistical such as an auto-regressive moving average model, or technical such as a moving average convergence/divergence model, plus a naive strategy. More specifically, the trading performance of all models is investigated in a forecast and trading simulation on the EUR/USD ECB fixing time series over the period 1999-2007 using the last one and half years for out-of-sample testing, an original feature of this paper. We use the EUR/USD daily fixing by the ECB as many financial institutions are ready to trade at this level and it is therefore possible to leave orders with a bank for business to be transacted on that basis. As it turns out, the MLP does remarkably well and outperforms all other models in a simple trading simulation exercise. However, when more sophisticated trading strategies using confirmation filters and leverage are applied, the HONN network produces better results and outperforms all other NN and traditional statistical models in terms of annualized return.
Article
Random forests are a combination of tree predictors such that each tree depends on the values of a random vector sampled independently and with the same distribution for all trees in the forest. The generalization error for forests converges a.s. to a limit as the number of trees in the forest becomes large. The generalization error of a forest of tree classifiers depends on the strength of the individual trees in the forest and the correlation between them. Using a random selection of features to split each node yields error rates that compare favorably to Adaboost (Y. Freund & R. Schapire, Machine Learning: Proceedings of the Thirteenth International conference, ***, 148–156), but are more robust with respect to noise. Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the splitting. Internal estimates are also used to measure variable importance. These ideas are also applicable to regression.
Article
There exists vast research articles which predict the stock market as well pricing of stock index financial instruments but most of the proposed models focus on the accurate forecasting of the levels (i.e. value) of the underlying stock index. There is a lack of studies examining the predictability of the direction/sign of stock index movement. Given the notion that a prediction with little forecast error does not necessarily translate into capital gain, this study is an attempt to predict the direction of S&P CNX NIFTY Market Index of the National Stock Exchange, one of the fastest growing financial exchanges in developing Asian countries. Random forest and Support Vector Machines (SVM) are very specific type of machine learning method, and are promising tools for the prediction of financial time series. The tested classification models, which predict direction, include linear discriminant analysis, logit, artificial neural network, random forest and SVM. Empirical experimentation suggests that the SVM outperforms the other classification methods in terms of predicting the direction of the stock market movement and random forest method outperforms neural network, discriminant analysis and logit model used in this study.
Conference Paper
This paper analyzes and examines the general ability of Support Vector Machine (SVM) models to correctly predict and trade daily EUR exchange rate directions. Seven models with varying kernel functions are considered. Each SVM model is benchmarked against traditional forecasting techniques in order to ascertain its potential value as out-of-sample forecasting and quantitative trading tool. It is found that hyperbolic SVMs perform well in terms of forecasting accuracy and trading results via a simulated strategy. This supports the idea that SVMs are promising learning systems for coping with nonlinear classification tasks in the field of financial time series applications.
Book
Genetic algorithms are playing an increasingly important role in studies of complex adaptive systems, ranging from adaptive agents in economic theory to the use of machine learning techniques in the design of complex devices such as aircraft turbines and integrated circuits. Adaptation in Natural and Artificial Systems is the book that initiated this field of study, presenting the theoretical foundations and exploring applications. In its most familiar form, adaptation is a biological process, whereby organisms evolve by rearranging genetic material to survive in environments confronting them. In this now classic work, Holland presents a mathematical model that allows for the nonlinearity of such complex interactions. He demonstrates the model's universality by applying it to economics, physiological psychology, game theory, and artificial intelligence and then outlines the way in which this approach modifies the traditional views of mathematical genetics. Initially applying his concepts to simply defined artificial systems with limited numbers of parameters, Holland goes on to explore their use in the study of a wide range of complex, naturally occuring processes, concentrating on systems having multiple factors that interact in nonlinear ways. Along the way he accounts for major effects of coadaptation and coevolution: the emergence of building blocks, or schemata, that are recombined and passed on to succeeding generations to provide, innovations and improvements. Bradford Books imprint
Article
The nearest neighbor decision rule assigns to an unclassified sample point the classification of the nearest of a set of previously classified points. This rule is independent of the underlying joint distribution on the sample points and their classifications, and hence the probability of error R of such a rule must be at least as great as the Bayes probability of error R^{\ast} --the minimum probability of error over all decision rules taking underlying probability structure into account. However, in a large sample analysis, we will show in the M -category case that R^{\ast} \leq R \leq R^{\ast}(2 --MR^{\ast}/(M-1)) , where these bounds are the tightest possible, for all suitably smooth underlying distributions. Thus for any number of categories, the probability of error of the nearest neighbor rule is bounded above by twice the Bayes probability of error. In this sense, it may be said that half the classification information in an infinite sample set is contained in the nearest neighbor.
Modeling and trading the Euro/Us Dollar exchange rate: do neural networks perform better?”, Derivatives Use
  • C Dunis
  • M Williams