ArticlePDF Available

Constructing Equity Investment Strategies Using Analyst Reports and Regime Switching Models

Authors:

Abstract and Figures

This study demonstrates whether analysts' sentiments toward individual stocks are useful for stock investment strategies. This is achieved by using natural language processing to create a polarity index from textual information in analyst reports. In this study, we performed time series forecasting for the created polarity index using deep learning, and clustered the forecasted values by volatility using a regime switching model. In addition, we constructed a portfolio from stock data and rebalanced it at each change point of the regime. Consequently, the investment strategy proposed in this study outperforms the benchmark portfolio in terms of returns. This suggests that the polarity index is useful for constructing stock investment strategies.
Content may be subject to copyright.
ORIGINAL RESEARCH
published: 18 May 2022
doi: 10.3389/frai.2022.865950
Frontiers in Artificial Intelligence | www.frontiersin.org 1May 2022 | Volume 5 | Article 865950
Edited by:
Ronald Hochreiter,
Vienna University of Economics and
Business, Austria
Reviewed by:
Bowei Chen,
University of Glasgow,
United Kingdom
Bertrand Kian Hassani,
University College London,
United Kingdom
*Correspondence:
Rei Taguchi
s5abadiee@g.ecc.u-tokyo.ac.jp
Specialty section:
This article was submitted to
Artificial Intelligence in Finance,
a section of the journal
Frontiers in Artificial Intelligence
Received: 30 January 2022
Accepted: 06 April 2022
Published: 18 May 2022
Citation:
Taguchi R, Watanabe H, Sakaji H,
Izumi K and Hiramatsu K (2022)
Constructing Equity Investment
Strategies Using Analyst Reports and
Regime Switching Models.
Front. Artif. Intell. 5:865950.
doi: 10.3389/frai.2022.865950
Constructing Equity Investment
Strategies Using Analyst Reports and
Regime Switching Models
Rei Taguchi 1
*, Hikaru Watanabe 2, Hiroki Sakaji 1, Kiyoshi Izumi 1and Kenji Hiramatsu 3
1School of Engineering, The University of Tokyo, Tokyo, Japan, 2Faculty of Engineering, The University of Tokyo, Tokyo,
Japan, 3IFIS Japan Limited, Tokyo, Japan
This study demonstrates whether analysts’ sentiments toward individual stocks are
useful for stock investment strategies. This is achieved by using natural language
processing to create a polarity index from textual information in analyst reports. In
this study, we performed time series forecasting for the created polarity index using
deep learning, and clustered the forecasted values by volatility using a regime switching
model. In addition, we constructed a portfolio from stock data and rebalanced it at each
change point of the regime. Consequently, the investment strategy proposed in this study
outperforms the benchmark portfolio in terms of returns. This suggests that the polarity
index is useful for constructing stock investment strategies.
Keywords: BERT, Hidden Markov Model, trading strategy, financial market, regime switching model
1. INTRODUCTION
In recent years, the explosive development of artificial intelligence has brought worldwide attention
to the use of alternative data, which is particularly prominent in the financial and economic fields
and is beginning to be widely used for economic forecasting and investment strategies together with
traditional data. In Japan, the Japan Association for the Promotion of Alternative Data (JADAA)1
and other cross-industry organizations have been established to promote its use actively.
This study is expected to expand the application possibilities of alternative data. Additionally,
this study aims to encourage the expansion of artificial intelligence techniques in investment
technology by providing the same reliability criteria as existing economic statistics in this area
using indexes produced from alternative data. Among them, text data is particularly versatile and
can extract predictions and classifications based on the writing style of the author and the content
bais (Izumi and Sakaji, 2019; Sakaji et al., 2019).
Moreover, this study is expected to expand the possibilities of data linkage. The purpose of
data linkage in this study is to encourage the expansion of artificial intelligence technology in the
investment field by providing the same reliability criteria as existing statistical information in the
investment field through alternative data analysis.
In addition, analyst reports are considered to have a very high information value among text data
(Hiramatsu et al., 2021). Analyst reports contain rating information on individual stocks; therefore,
the polarity index generated from these reports is assumed to affect stock prices, Return on Equity
and Price Earnings Ratio. The polarity index is a time series plot of the polarity assigned to the text
data. In recent years, it has been found that polarity index generated from analyst reports are ahead
1https://alternativedata.or.jp
Taguchi et al. Constructing Equity Investment Strategies
of macroeconomic indexes such as exchange rates and
government bonds issuance (Taguchi et al., 2021) and are
expected to be applied to investment technology.
Although there are many studies on portfolio management
methods using machine learning (Wang et al., 2020; Yun et al.,
2020; Zhang et al., 2020; Chen et al., 2021; Ma et al., 2021), this
study is novel as we utilize the method of sentiment analysis into
portfolio management.
This study shows that sentiment toward the future of
individual stocks in analyst reports is useful in investment. The
expected outcome is that the investment strategy proposed in this
study will generate returns exceeding the benchmark portfolio.
Consequently, it is envisioned that changes in the regime of polar
indicators generated from analyst reports will serve as signals to
change the investment allocation of individual stocks.
This study’s contribution is to demonstrate that an investment
strategy can generate returns by using analysts’ sentiments
toward individual stocks to signal portfolio asset allocation.
From this, we demonstrate that analyst reports are useful
for investment.
2. RELATED WORKS
There is considerable literature on portfolio management
methods using machine learning and deep learning. Ma et al.
(2021) performed return prediction with several opportunity
learning models, including deep multilayer perceptron, and
incorporated the prediction results into the advanced mean-
variance model for portfolio optimization. Yun et al. (2020)
proposed a two-stage deep learning model for a forecast-based
Exchange Traded Fund (ETF) portfolio management approach.
Wang et al. (2020) proposed a portfolio management method
that considers the long-term dependence of time series variability
using mean-variance models and deep learning theories such as
Long Short Term Memory (LSTM). Zhang et al. (2020) proposed
a deep learning framework for maximizing the Sharpe ratio of a
portfolio. Chen et al. (2021) proposed a new model to optimize
XGBoost with an improved firefly algorithm (IFA) and perform
portfolio optimization.
Some studies have been conducted on creating polarity
indicators by machine learning. In Yono et al. (2020), Financial
and Economic Statistics Monthly were trained using LSTM, and
indicators for each topic such as consumer spending, capital
investment, and inventory were created using Latent Dirichlet
Allocation (LDA). They also analyzed which macro factors
are more influential by determining how much each topic’s
sentiment contributes to the overall sentiment. Katayama et al.
(2019) developed a sentiment polarity identification model for
finance by processing the Japan Economic Watcher Survey with
Word2vec and training it with LSTM.
The above studies use texts with similar contents, such as the
Financial and Economic Statistics Monthly and the Economic
Watcher Survey. In contrast, our study differs in that we use
analyst reports as an indicator by summing up the sentiment
of individual stocks. Some studies have been conducted as an
example of research on text mining using analyst reports. In
Hiramatsu et al. (2021), stock prices respond strongly to the
sentiment of the report, and drift in stock prices is observed after
the report is issued, suggesting that the textual information in
analyst reports is useful in asset management practice. Further,
Asquith et al. (2005) investigates the association between market
returns and analyst reports content using regression analysis. In
addition, Suzuki et al. (2020) opinion and non-opinion text are
extracted from analyst reports using LSTM and other methods,
and forecasts of net income and stock prices are made.
Our study’s use of Bidirectional Encoder Representations from
Transformers (BERT) to train analyst reports is also different
from other studies. BERT is a Transformer-based language
model. An example of a study using BERT is Hiew et al. (2019)
where text data is given polarity by BERT using Weibo, a Chinese
Social Networking Service, and stock price predictions are made
using LSTM.
Furthermore, our study differs from other studies in that we
use Bi-directional Gated Recurrent Unit (BiGRU) and Hidden
Markov Model with Gaussian Mixture Model emissions (GMM-
HMM) to trigger the portfolio rebalancing. An example of a study
using BiGRU in the financial field is Chen et al. (2020) which used
labeled financial tweet data and BiGRU to perform sentiment
analysis. Liu et al. (2021) used GMM-HMM and LSTM to predict
stock prices. BiGRU and LSTM have several applications outside
the financial sector (Talha et al., 2017; Yan et al., 2021).
3. METHODS
In this study, we performed asset allocation for a stock portfolio
composed of two stocks, using signals as change points in the
polar indicators regime created from analyst reports. We used
analyst reports to develop an investment strategy using natural
language processing and artificial intelligence techniques in the
following four steps. We demonstrate that analyst reports are
useful for investment by comparing this investment strategy with
the benchmark strategy.
Step 1: BERT is used to learn the analyst report text and
classify the polarity into three values: “positive, “negative, and
“neutral.” A detailed description of the method is given in section
3.1, and the results are shown in section 5. This step aims to create
a polarity index.
Step 2: We perform time series forecasting for the created
polarity index (by industry) using a neural network. The
algorithm used is Bi-directional Gated Recurrent Unit (BiGRU).
A detailed description of the method is given in section 3.2, and
the results are shown in section 5. This step in forecasting polar
indicators aims to make a preliminary step in creating signals for
future asset allocations.
Step 3: A three-month moving average is taken for the
forecast value calculated in Step 2, and the volatility is clustered
into “high, “medium,” and “low” states using the regime-
switching model. The algorithm used is the Hidden Markov
Model with GMM-HMM. A detailed description of the method is
provided in section 3.3 and the results are presented in section 5.
This step creates a signal for asset allocation at a future point that
Frontiers in Artificial Intelligence | www.frontiersin.org 2May 2022 | Volume 5 | Article 865950
Taguchi et al. Constructing Equity Investment Strategies
FIGURE 1 | Architecture.
has not yet manifested by dividing the polarity index predicted in
Step 2 into three regimes.
Step 4: We retrieved monthly stock price data from Yahoo!
Finance2. We selected two stocks for each of the 33 industries and
created a portfolio for each industry using the model selected in
Steps 2 and 3. Then we rebalanced the portfolio at the regime’s
turn assigned in Step 3 above. In addition, we compare this
strategy with the benchmark strategy and tabulate the results. A
detailed description of the method is given in section 3.4, and the
results are shown in section 5.
The architecture used in this study can be described as shown
in Figure 1.
We created a polarity index as discussed. The assumption is
that the sentiment of individual stocks is included in this polarity
index, which is used for the investment experiment.
3.1. Bidirectional Encoder Representations
From Transformers (BERT)
BERT is a Transformer-based language model proposed by
Devlin et al. (2019). The reason we chose BERT for this study
is because it has demonstrated its effectiveness against other
models in eight benchmarks in Devlin et al. (2019). BERT enables
successful bidirectional learning by masking some tokens in the
input and predicting the masked words. This study used the
model3published by Inui and Suzuki Laboratory at Tohoku
University as a trained model for Japanese BERT. Additionally,
we use the method of Taguchi et al. (2021) to create two types
of models (UP Model and DOWN Model) using BERT. In the
Taguchi et al. (2021) method, a report with a change in rating
to “buy” is considered to be 1, and 0 otherwise, and a report
with a change in rating to “sell” is considered to be 1. The final
2https://finance.yahoo.co.jp/
3https://github.com/cl-tohoku/bert- japanese
TABLE 1 | Polarity judgment method.
UP model DOWN model Final judgment
0 (no change) 1 (sell) 1 (sell)
1 (buy) 0 (no change) 1 (buy)
1 (buy) 1 (sell) High likelihood
0 (no change) 0 (no change) 0 (no change)
judgment method is shown in Table 1. Details of this method can
be obtained by referring to Taguchi et al. (2021). In addition, this
architecture replaces the model in Sakaji et al. (2008) with BERT.
See section 5 for the calculation results.
3.2. Time Series Forecasting Using
Bi-Directional Gated Recurrent Unit
(BiGRU)
Section 3.2 aims to take a preliminary step in creating a
signal for future asset allocation. Time series forecasting creates
pseudo-sentiment information not apparent. Gated Recurrent
Unit (GRU) is a neural network consisting of Reset and Update
gates, proposed by Cho et al. (2014). GRU is a neural network
that can deal with the gradient vanishing problem of Recurrent
Neural Network (RNN) and has a low computational cost.
This point was corroborated by the verification of performance
in section 5.
The architecture of GRU is represented by Figure 2.
BiGRU is a model that can add past and future information
to GRU.
The architecture of BiGRU is represented by Figure 3. Details
of the GRU and BiGRU algorithms are given in Appendix A.
Time series forecasting was evaluated using Root Mean
Squared Error (RMSE). RMSE is a measure to evaluate the
Frontiers in Artificial Intelligence | www.frontiersin.org 3May 2022 | Volume 5 | Article 865950
Taguchi et al. Constructing Equity Investment Strategies
model’s error, and the smaller it is, the better the model’s
performance. The equation for RMSE is as follows.
RMSE =q1
nPn
i=1yi ˆyi2(1)
RMSE is calculated by dividing the observed values by yi(i=
1, 2, 3, ...,n), and the calculated value (predicted value) from the
model is ˆyi.
FIGURE 2 | Architecture of GRU.
3.3. Regime Assignment Using Hidden
Markov Model With Gaussian Mixture
Model Emissions (GMM-HMM)
In section 3.3, we divide the polarity index predicted in section
3.2 into three regimes to create a signal for asset allocation in
the future not yet manifested. The changes in the regimes are
assumed changes in the environment and are used as signals.
This study used the Hidden Markov Model (HMM) with the
Gaussian Mixture Model (GMM) state output distribution to
assign regimes to the predicted values of the polarity index. GMM
is a statistical model that can model N populations following
a normal distribution, while HMM is a Markov model based
on a Markov process with hidden states. The architecture of
GMM-HMM is represented by Figure 4.
Details of the GMM-HMM algorithms are given in
Appendix B.
3.4. Investment Simulation
In this section, we design an investment simulation using the
models in sections 3.2 and 3.3. As a proposed method, when the
volatility of the polarity index predicted by the GMM-HMM with
three regimes is “low, we increase the allocation to stocks with
high monthly volatility. In addition, when the predicted volatility
of the polarity index is “medium, the investment allocation is
set at 50%. Meanwhile, if the forecasted volatility of the polarity
index is “high, we increase the allocation to stocks with low
monthly volatility. The monthly volatility of a stock is defined
as the maximum value among the current month’s True Range4
÷Typical Price for the current month5×100, which is calculated
below. We assume that the above portfolio rebalancing strategy
generates higher returns than the benchmark strategy.
4Defined in this article as high-low, high-close, and close-low.
5Average of the high, low, and close prices.
FIGURE 3 | Architecture of BiGRU.
Frontiers in Artificial Intelligence | www.frontiersin.org 4May 2022 | Volume 5 | Article 865950
Taguchi et al. Constructing Equity Investment Strategies
FIGURE 4 | Architecture of GMM-HMM.
TABLE 2 | Notation of investment simulations.
Notation
RoPI Regime of polarity index (volatility low =0, medium =1, high =2)
Am Amount of stock purchased
NoR Number of rebalancing (1,...,n)
rInvestment ratio (0 r1)
BLBuying price for stocks with “low” volatility
SLSelling price for stocks with “low” volatility
BHBuying price for stocks with “high” volatility
SHSelling price for stocks with “high” volatility
BαBuying price at the begin of the investment simulation for α
SαSelling price at the end of the investment simulation for α
BβBuying price at the begin of the investment simulation for β
SβSelling price at the end of the investment simulation for β
See Table 2 for Notation in the investment simulation. The
total profit and loss of the portfolio Pare defined below. We
assume that the portfolio consists of two stocks types.
P=PL+PN+PH(2)
PLis the sum of the portfolio’s profit and loss when the polarity
index volatility is “low, PNis the sum of the portfolio’s profit
and loss when the polarity index volatility is “medium, and PH
is the sum of the portfolio’s profit and loss when the polarity
index volatility is “high.” The total of each profit and loss can be
formulated as follows.
PL=Am PNoR
n=1{(1 rn)(SL
nBL
n)+rn(SH
nBH
n)},
if RoPI is 0 (3)
PN=Am PNoR
n=1{0.5 (SL
nBL
n)+0.5 (SH
nBH
n)},
if RoPI is 1 (4)
PH=Am PNoR
n=1{rn(SL
nBL
n)+(1 rn)(SH
nBH
n)},
if RoPI is 2 (5)
Conversely, the total profit and loss of the benchmark strategy
to be compared, PBM, is the profit and loss of the two stocks
purchased at the beginning of the experiment with the investment
ratio of r=0.5 and sold at the end of the experiment period. PBM
is formulated as follows:
PBM =Am {0.5 (SαBα)+0.5 (SβBβ)}(6)
In comparing the benchmark and proposed strategies, we assume
that both portfolios are composed of two similar stocks.
The overall picture of the investment simulation is shown in
Figure 5. In Figure 5, we set the investment ratio r=0.9. For the
results of the investment simulation, see section 5.
Frontiers in Artificial Intelligence | www.frontiersin.org 5May 2022 | Volume 5 | Article 865950
Taguchi et al. Constructing Equity Investment Strategies
FIGURE 5 | Example of investment simulations.
FIGURE 6 | All-industry polarity index with 3-month moving average.
4. EXPERIMENTS
Analyst report data (373,050) was used to create the BERT
polarity index. Utilizing the technologies of IFIS Japan Ltd,
charts and unnecessary texts in research reports were removed.
The data period is from Jan 2010 to December 2020. Of the
data used in BERT, the train, valid, and test ratios are 7:1:2,
respectively. In this study, five models were used for time series
forecasting. The first is Bi-directional GRU (BiGRU); the second
is GRU; the third is Bi-directional LSTM (BiLSTM); the fourth
is Recurrent Neural Network (RNN), and the fifth is LSTM. We
take three-month moving averages for the all-industry polarity
Frontiers in Artificial Intelligence | www.frontiersin.org 6May 2022 | Volume 5 | Article 865950
Taguchi et al. Constructing Equity Investment Strategies
index in preprocessing the data. We use monthly stock price data
obtained from Yahoo! Finance for the investment simulation. See
Appendix C for the stocks used in the experiments.
5. RESULTS
We created a polarity index for all industries and each industry.
The all-industry polarity index was used for model selection, and
the industry-specific polarity index for investment simulation.
The all-industry polarity index is the sum of all outputs (1, 0, 1)
of the industry-specific polarity index.
The 3-month moving averages of the all-industry polarity
index, sorted by time series, are shown in Figure 6.
The results of the comparison experiments are shown in
Table 3. We chose BiGRU, the smallest RMSE among the five in
this study.
TABLE 3 | Time series prediction.
Model RMSE
BiGRU 0.1376
GRU 0.1600
BiLSTM 0.2011
RNN 0.2115
LSTM 0.2156
TABLE 4 | Assigning regimes.
Model Log likelihood
GMM-HMM 72.3536
Gaussian-HMM 72.3544
MS-DRM 84.0420
In this study, we used three models for clustering. The first
is GMM-HMM, the second is Gaussian Hidden Markov Model
(Gaussian-HMM), and the third is Markov witching Dynamic
Regression Models (MS-DRM). In pre-processing the data, three-
month moving averages are taken for the predicted values of the
all-industry polarity index. The log-likelihood was used as the
evaluation index; the larger the model’s log-likelihood, the better
the model’s fit.
The results of the comparison experiments are shown in Table 4.
We chose the GMM-HMM with the largest log-likelihood among
the three in this study.
The results of assigning regimes to the predictions of the
polarity indexes using GMM-HMM are shown in Figure 7.
The investment simulation results in section 3.4 are shown.
Monthly stock price data is obtained from Yahoo! Finance.
For details of the investment simulation, see section 3.4. This
experiment assumes that the amount of stocks Am =12, 000.
See Appendix C for the constituent stocks of each portfolio.
The investment simulation results are shown in Table 5. In the
table, refers to the victory of the proposed strategy against the
benchmark, and refers to the defeat of the proposed strategy
against the benchmark. The NoR in the table refers to the number
of times the 33-industry portfolio is rebalanced.
6. DISCUSSION
First, Table 5 reveals that the 15
33 industry wins when the
investment ratio is r=0.8, and the 16
33 industry wins when
r=0.9. This indicates that the strategy of increasing the
allocation to stocks with high monthly stock volatility when the
predicted values volatility of the polarity index is “low, setting the
investment allocation to 50% when the predicted values volatility
of the polarity index is “medium, increasing the allocation to
stocks with low monthly stock volatility when the predicted
values volatility of polarity index is “high, is somewhat effective
against the benchmark strategy.
FIGURE 7 | Predicted values for all-industry polarity index with regimes.
Frontiers in Artificial Intelligence | www.frontiersin.org 7May 2022 | Volume 5 | Article 865950
Taguchi et al. Constructing Equity Investment Strategies
TABLE 5 | Winners and losers of proposed strategies in benchmarking strategies.
Investment ratio (r)
Industry name NoR r= 0 r= 1 r= 2 r= 3 r= 4 r= 5 r= 6 r= 7 r= 8 r= 9 r= 10
Fisheries, Agriculture and Forestry 10 •••◦◦◦◦◦◦◦
Mining 5 •••◦◦◦◦◦◦◦
Construction 5 ••••••◦◦◦◦
Foodstuffs 8 ◦◦◦◦◦◦◦◦◦◦
Textiles 7 ••••••••••
Pulp and Paper 5 ◦◦◦◦◦◦◦◦◦◦
Chemistry 5 ••••◦◦◦◦◦◦
Pharmaceuticals 4 ••••••••◦◦
Oil and Coal Products 3 ••••••••••
Rubber Products 6 ◦◦◦◦◦◦◦◦◦◦
Glass, Clay and Stone Products 5 ◦◦◦◦◦•••••
Steel 5 ◦◦◦◦◦◦••••
Nonferrous Metals 4 ••••••••••
Metal Products 5 ••••••••◦◦
Machinery 7 ◦◦◦◦◦◦◦◦◦◦
Electronic Equipment 3 ◦◦◦◦◦•••••
Transport Equipment 7 ••••••••••
Precision Equipment 4 ••••••••••
Other Products 8 •••••••◦◦◦
Electricity and Gas 4 ◦•••••••••
Land Transportation 3 ••••••◦◦◦◦
Marine Transportation 6 ••••••••••
Air Transportation 5 ••••••••••
Warehousing and Transportation 5 ◦•••••••••
Information and Communication 4 ••••◦◦◦◦◦◦
Wholesale business 6 ••••••◦◦◦◦
Retailing 6 ••••••••••
Banking 7 •••••••••◦
Securities and Commodity Futures 10 ••••••••••
Insurance 5 ◦◦◦◦◦◦◦◦◦◦
Other Financial Services 5 ••••••••••
Real Estate 7 ••••••••••
Service 7 ◦◦◦◦••••••
Second, among the portfolios of 33 industries, we observed
seven industries in which the profit and loss of the benchmark
strategy were negative. These include pulp and paper, glass and
stone products, and electric equipment. For these industries,
even when the investment ratio rwas changed, many of them
showed extreme trends such as winning all patterns and losing
all patterns. In some cases, such as glass and stone products
and electrical equipment, the results contradict our hypothesis.
Although it is impossible to determine whether the tendency is
industry-specific or market-influenced only from the results of
this experiment, we would like to explore a method to determine
the investment ratio based on these results when constructing a
portfolio among industries.
Third, we found that the number of NoR varies significantly
among industries. For example, the number of NoR in the
fisheries, agriculture, and forestry industry is ten, whereas the
land transportation industry is three. This indicates that the
tendency to change the volatility of polar indexes is remarkably
different among industries.
Fourth, in Table 3, we expected BiLSTM to have the lowest
RMSE because its mechanism is richer among the five; however,
BiGRU has the lowest RMSE. Chung et al. (2014) shows that
the superiority or inferiority of LSTM and GRU cannot be
determined in general but depends on the data set and the given
task. The same can be said for BiLSTM, LSTM, BiGRU, and GRU
in this experiment.
Fifth, there is still room for improvement in the duration of
signal creation, as shown in the GMM-HMM results (Table 4).
Since, in practice, the asset allocation of portfolios is often
done monthly, we used monthly data for both text and
stocks. It is necessary to use frequent data to create more
detailed signals.
Sixth, although the investment experiment in this study was
conducted with two stocks, there are many cases where three or
Frontiers in Artificial Intelligence | www.frontiersin.org 8May 2022 | Volume 5 | Article 865950
Taguchi et al. Constructing Equity Investment Strategies
more stocks are used in actual operations. In future research,
we would like to extend the investment model to multiple
dimensions and give priority to experiments with three or
more stocks. The following points are specific implementation
methods for portfolios of three or more stocks. First, we need
to ensure the diversification effect of stocks by using correlation
coefficients. Second, as the number of stocks increases, the timing
of rebalancing increases in proportion to the number of stocks, it
is necessary to obtain higher frequency data on text and stock
price data. It is also important to set a quantitative threshold
for the volatility of stocks. In addition, it is desirable to use
methods such as the mean-variance model proposed by Ma et al.
(2021) and Wang et al. (2020) when determining the maximum
investment ratio.
Finally, although the experiments in this study were limited
to stock investment strategies, we would like to extend
this study to future corporate bonds, Foreign Exchange and
virtual currencies.
7. CONCLUSION
In this study, we created a polarity index using BERT, performed
time series forecasting using BiGRU, and assigned a regime to
the forecasted values using GMM-HMM to create a signal for
portfolio rebalancing. Consequently, the strategy proposed in
this study proved effective, indicating that analyst reports are
useful for investment. In the future, we would like to create
a signal that replaces the architecture of the regime-switching
model with an anomaly detection algorithm. In addition, we
would like to conduct investment experiments with three or
more stocks in one portfolio since we have created two stocks
in one portfolio in this experiment. We seek to calculate the
investment ratio using reinforcement learning in conjunction
with this. Furthermore, we would like to conduct investment
simulations using polar indexes created by other financial texts
and compare the current results.
DATA AVAILABILITY STATEMENT
The data analyzed in this study is subject to the following
licenses/restrictions: this data can be used for academic
authorized researchers. Requests to access these datasets should
be directed to KH, kenji.hiramatsu@ifis.co.jp; HS, sakaji@sys.t.u-
tokyo.ac.jp.
AUTHOR CONTRIBUTIONS
RT, HW, HS, and KI contributed to the conception and design
of the study. KH organized the database. RT and HW performed
the statistical analysis. RT wrote the first draft of the manuscript
and sections of the manuscript. All authors contributed to
the revision of the manuscript and read and approved the
submitted versions.
FUNDING
This study was partially supported by JST-Mirai Program Grant
Number JPMJPI20B1, Japan. This study was not funded by IFIS
Japan Limited.
SUPPLEMENTARY MATERIAL
The Supplementary Material for this article can be found
online at: https://www.frontiersin.org/articles/10.3389/frai.2022.
865950/full#supplementary-material
REFERENCES
Asquith, P., Mikhail, M. B., and Au, A. S. (2005). Information content of
equity analyst reports. J. Fin. Econ. 75, 245–282. doi: 10.1016/j.jfineco.2004.
01.002
Chen, C.-C., Huang, H.-H., and Chen, H.-H. (2020). “Issues and perspectives
from 10,000 annotated financial social media data, in Proceedings of the 12th
Language Resources and Evaluation Conference (Marseille: European Language
Resources Association), 6106–6110.
Chen, W., Zhang, H., Mehlawat, M. K., and Jia, L. (2021). Mean-variance portfolio
optimization using machine learning-based stock price prediction. Appl. Soft
Comput. 100, 106943. doi: 10.1016/j.asoc.2020.106943
Cho, K., van Merrienboer, B., Gülçehre, Ç., Bougares, F., Schwenk, H., and
Bengio, Y. (2014). Learning phrase representations using RNN encoder-
decoder for statistical machine translation. arXiv preprint arXiv:1406.1078.
doi: 10.3115/v1/D14-1179
Chung, J., Gulcehre, C., Cho, K., and Bengio, Y. (2014). Empirical evaluation
of gated recurrent neural networks on sequence modeling. arXiv preprint
arXiv:1412.3555. doi: 10.48550/ARXIV.1412.3555
Devlin, J., Chang, M.-W., Lee, K., and Toutanova, K. (2019). BERT: pre-training
of deep bidirectional transformers for language understanding. arXiv preprint
arXiv:1810.04805. doi: 10.48550/ARXIV.1810.04805
Hiew, J. Z. G., Huang, X., Mou, H., Li, D., Wu, Q., and Xu, Y. (2019). BERT-based
financial sentiment index and LSTM-based stock return predictability. arXiv
preprint arXiv:1906.09024. doi: 10.48550/ARXIV.1906.09024
Hiramatsu, K., Kotaro, M., Sakai, H., and Sakaji, H. (2021). [Analyst report tone
information value] Anarisutorepoto no ton no Joho Kachi (in Japanese). Securit
Anal. J. 59, 86–97. Available online at: https://ci.nii.ac.jp/naid/40022479821/
Izumi, K., and Sakaji, H. (2019). “Economic causal-chain search using text mining
technology, in Proceedings of the First Workshop on Financial Technology and
Natural Language Processing (Macao), 61–65.
Katayama, D., Kino, Y., and Tsuda, K. (2019). A method of sentiment polarity
identification in financial news using deep learning. Proc. Comput. Sci. 159,
1287–1294. doi: 10.1016/j.procs.2019.09.298
Liu, M., Huo, J., Wu, Y., and Wu, J. (2021). Stock market trend analysis
using hidden Markov model and long short term memory. arXiv preprint
arXiv:2104.09700.
Ma, Y., Han, R., and Wang, W. (2021). Portfolio optimization with return
prediction using deep learning and machine learning. Expert Syst. Appl. 165,
113973. doi: 10.1016/j.eswa.2020.113973
Sakaji, H., Kuramoto, R., Matsushima, H., Izumi, K., Shimada, T., and Sunakawa,
K. (2019). “Financial text data analytics framework for business confidence
indices and inter-industry relations, in Proceedings of the First Workshop
on Financial Technology and Natural Language Processing (Macao, China),
40–46.
Sakaji, H., Sakai, H., and Masuyama, S. (2008). “Automatic extraction of basis
expressions that indicate economic trends, in Advances in Knowledge Discovery
and Data Mining. PAKDD 2008, eds T. Washio, E. Suzuki, K. M. Ting, and A.
Inokuchi (Berlin; Heidelberg: Springer), 977–984. doi: 10.1007/978-3-540-681
25-0_102
Frontiers in Artificial Intelligence | www.frontiersin.org 9May 2022 | Volume 5 | Article 865950
Taguchi et al. Constructing Equity Investment Strategies
Suzuki, M., Sakaji, H., Izumi, K., Matsushima, H., and Ishikawa, Y. (2020).
Forecasting net income estimate and stock price using text mining from
economic reports. Information 11, 292. doi: 10.3390/info11060292
Taguchi, R., Watanabe, H., Hirano, M., Suzuki, M., Sakaji, H., Izumi, K., et al.
(2021). “Market trend analysis using polarity index generated from analyst
reports, in 2021 IEEE International Conference on Big Data (Big Data)
(Orlando, FL), 3486–3494. doi: 10.1109/BigData52589.2021.9671702
Talha, S. A. W., Fleury, A., and Ambellouis, S. (2017). “Human action recognition
from body-part directional velocity using hidden Markov models, in 2017 16th
IEEE International Conference on Machine Learning and Applications (ICMLA),
1035–1040. doi: 10.1109/ICMLA.2017.00-14
Wang, W., Li, W., Zhang, N., and Liu, K. (2020). Portfolio formation with
preselection using deep learning from long-term financial data. Expert Syst.
Appl. 143, 113042. doi: 10.1016/j.eswa.2019.113042
Yan, J., Liu, J., Yu, Y., and Xu, H. (2021). Water quality prediction in the Luan river
based on 1-DRCNN and BIGRU hybrid neural network model. Water 13, 1273.
doi: 10.3390/w13091273
Yono, K., Sakaji, H., Matsushima, H., Shimada, T., and Izumi, K. (2020).
Construction of macroeconomic uncertainty indices for financial market
analysis using a supervised topic model. J. Risk Fin. Manage. 13, 79.
doi: 10.3390/jrfm13040079
Yun, H., Lee, M., Kang, Y. S., and Seok, J. (2020). Portfolio management via
two-stage deep learning with a joint cost. Expert Syst. Appl. 143, 113041.
doi: 10.1016/j.eswa.2019.113041
Zhang, Z., Zohren, S., and Roberts, S. (2020). Deep learning for
portfolio optimization. J. Fin. Data Sci. 2, 8–20. doi: 10.3905/jfds.2020.
1.042
Conflict of Interest: KH is employed by IFIS Japan Limited.
The remaining authors declare that the research was conducted in the absence of
any commercial or financial relationships that could be construed as a potential
conflict of interest.
Publisher’s Note: All claims expressed in this article are solely those of the authors
and do not necessarily represent those of their affiliated organizations, or those of
the publisher, the editors and the reviewers. Any product that may be evaluated in
this article, or claim that may be made by its manufacturer, is not guaranteed or
endorsed by the publisher.
Copyright © 2022 Taguchi, Watanabe, Sakaji, Izumi and Hiramatsu. This is an
open-access article distributed under the terms of the Creative Commons Attribution
License (CC BY). The use, distribution or reproduction in other forums is permitted,
provided the original author(s) and the copyright owner(s) are credited and that the
original publication in this journal is cited, in accordance with accepted academic
practice. No use, distribution or reproduction is permitted which does not comply
with these terms.
Frontiers in Artificial Intelligence | www.frontiersin.org 10 May 2022 | Volume 5 | Article 865950
... Regimes in finance theory refer to invisible states of the market, such as expansion, recession, bull, and bear. Some studies have attempted to capture market alpha by incorporating these regime changes into investment strategies [4][5][6]. ...
... There is also research on using text data to predict stock closing prices [18,19]. In addition, research has been conducted on the creation of an economic index using rate information from analyst reports [6,10]. Several studies use text mining of analyst reports to predict stock prices [20,21]. ...
... Daily financial news was used, but not necessarily only when textual information is issued with high frequency. For example, Taguchi et al. [6] predicted the polarity index once using Bidirectional LSTM as an alternative and simulated the investment after regime splitting. If individual polarity indices are created from quarterly earnings reports or analyst reports that are published irregularly, or if a combination of these texts is used to create polarity indices, the method of predicting a polarity index once and then using it may be effective in some extent. ...
Article
Full-text available
This study demonstrates whether financial text is useful for the tactical asset allocation method using stocks. This can be achieved using natural language processing to create polarity indexes in financial news. We perform clustering of the created polarity indexes using the change point detection algorithm. In addition, we construct a stock portfolio and rebalanced it at each change point using an optimization algorithm. Consequently, the proposed asset allocation method outperforms the comparative approach. This result suggests that the polarity index is useful for constructing the equity asset allocation method.
... Regimes in finance theory are the invisible market states, such as expansion, recession, bulls, and bears. Some studies attempted to capture market alpha by incorporating these regime changes into investment strategies [3,4]. We will delve one step further and focus on measuring future regime changes. ...
... Studies in this area include EcoRevs and a new architecture using long short-term memory (LSTM) [9], along with a study of polarity index creation using newspaper articles, economic trend surveys, and bi-directional LSTM [10]. In addition, research has been conducted on the creation of economic indexes using rate information from analyst reports [4,[11][12][13]. The novelty of this study is that the masked language model (MLM) score was used to create a polarity index and index the tone of financial texts. ...
... Wang [23] tested a combination optimization model using BiLSTM and BIGRUs for simultaneous prediction on four stock groups, demonstrating superior performance compared to other benchmark models. Taguchi [24] simulated stock investments in a proposed portfolio strategy, and found that BIGRU had the lowest error, although its generalizability varied for different datasets and given tasks. Liu [25] proposed an EMD-BIGRU model for predicting missing data, and an evaluation of the model's performance indicated that data decomposition improved the prediction accuracy of BIGRUs and enabled capturing more data variations. ...
... Spiral update of position: The distance between the whale position and the prey position is first calculated, and then a spiral equation is created to mimic the humpback whale approaching the prey in a spiral pattern. The equations are shown in Equations (24) and (25). ...
Article
Full-text available
In order to obtain better prediction results, this paper combines improved complete ensemble EMD (ICEEMDAN) and the whale algorithm of multi-objective optimization (MOWOA) to improve the bidirectional gated recurrent unit (BIGRU), which makes full use of original complex stock price time series data and improves the hyperparameters of the BIGRU network. To address the problem that BIGRU cannot make full use of the stationary data, the original sequence data are processed using the ICEEMDAN decomposition algorithm to derive the non-stationary and stationary parts of the data and modeled with the BIGRU and the autoregressive integrated moving average model (ARIMA), respectively. The modeling process introduces a whale algorithm for multi-objective optimization, which improves the probability of finding the best combination of parameter vectors. The R2, MAPE, MSE, MAE, and RMSE values of the BIGRU algorithm, ICEEMDAN-BIGRU algorithm, MOWOA-BIGRU algorithm, and the improved algorithm were compared. An average improvement of 14.4% over the original algorithm’s goodness-of-fit value will greatly improve the accuracy of stock price predictions.
... In financial markets, a considerable quantity of documental data can be accessed via the internet, such as news articles, social media, earning reports and answers given to the inquiry of securities. It may range from the MACD line in conventional stock charts to the volume of traded shares (Taguchi et al., 2022). Traded prices and volume of shares in American stock data include such information. ...
Article
Full-text available
Analysis and evaluation of return opportunities presented by equity markets have always been in the core of investors' interest. As technology advances and markets evolve, new ways to approach the task of straightforward valuation emerge. Judging companies and their assets by traditional yardsticks has kept its relevance throughout decades, however, technological change brings into existence new markets and industries whose value is uncertain and difficult to measure with traditional tools. Because of this, investors striving to make informed decisions need to engage with a growing complexity of financial and technological tools at their disposal.
Article
Full-text available
The current global water environment has been seriously damaged. The prediction of water quality parameters can provide effective reference materials for future water conditions and water quality improvement. In order to further improve the accuracy of water quality prediction and the stability and generalization ability of the model, we propose a new comprehensive deep learning water quality prediction algorithm. Firstly, the water quality data are cleaned and pretreated by isolation forest, the Lagrange interpolation method, sliding window average, and principal component analysis (PCA). Then, one-dimensional residual convolutional neural networks (1-DRCNN) and bi-directional gated recurrent units (BiGRU) are used to extract the potential local features among water quality parameters and integrate information before and after time series. Finally, a full connection layer is used to obtain the final prediction results of total nitrogen (TN), total phosphorus (TP), and potassium permanganate index (COD-Mn). Our prediction experiment was carried out according to the actual water quality data of Daheiting Reservoir, Luanxian Bridge, and Jianggezhuang at the three control sections of the Luan River in Tangshan City, Hebei Province, from 5 July 2018 to 26 March 2019. The minimum mean absolute percentage error (MAPE) of this method was 2.4866, and the coefficient of determination (R²) was able to reach 0.9431. The experimental results showed that the model proposed in this paper has higher prediction accuracy and generalization than the existing LSTM, GRU, and BiGRU models.
Article
Full-text available
This paper proposes and analyzes a methodology of forecasting movements of the analysts’ net income estimates and those of stock prices. We achieve this by applying natural language processing and neural networks in the context of analyst reports. In the pre-experiment, we applied our method to extract opinion sentences from the analyst report while classifying the remaining parts as non-opinion sentences. Then, we performed two additional experiments. First, we employed our proposed method for forecasting the movements of analysts’ net income estimates by inputting the opinion and non-opinion sentences into separate neural networks. Besides the reports, we inputted the trend of the net income estimate to the networks. Second, we employed our proposed method for forecasting the movements of stock prices. Consequently, we found differences between security firms, which depend on whether analysts’ net income estimates tend to be forecasted by opinions or facts in the context of analyst reports. Furthermore, the trend of the net income estimate was found to be effective for the forecast as well as an analyst report. However, in experiments of forecasting movements of stock prices, the difference between opinion sentences and non-opinion sentences was not effective.
Article
Full-text available
The uncertainty in the financial market, whether the US—China trade war will slow down the global economy or not, Federal Reserve Board (FRB) policy to increase the interest rates, or other similar macroeconomic events can have a crucial impact on the purchase or sale of financial assets. In this study, we aim to build a model for measuring the macroeconomic uncertainty based on the news text. Further, we proposed an extended topic model that uses not only news text data but also numeric data as a supervised signal for each news article. Subsequently, we used our proposed model to construct macroeconomic uncertainty indices. All these indices were similar to those observed in the historical macroeconomic events. The correlation was higher between the volatility of the market and uncertainty indices with larger expected supervised signal compared to uncertainty indices with the smaller expected supervised signal. We also applied the impulse response function to analyze the impact of the uncertainty indices on financial markets.
Conference Paper
This study demonstrates whether analysts' sentiment toward individual stocks is useful in predicting the macroeconomic index. This can be achieved by using natural language processing to create polarity indexes from analyst reports. In this study, the created polarity indexes were analyzed using the Vector Autoregressive model with various macroeconomic indexes. Consequently, it was confirmed that the polarity indexes do have an impact on indexes such as prices, exchange rates, and government bonds.
Article
The success of portfolio construction depends primarily on the future performance of stock markets. Recent developments in machine learning have brought significant opportunities to incorporate prediction theory into portfolio selection. However, many studies show that a single prediction model is insufficient to achieve very accurate predictions and affluent returns. In this paper, a novel portfolio construction approach is developed using a hybrid model based on machine learning for stock prediction and mean–variance (MV) model for portfolio selection. Specifically, two stages are involved in this model: stock prediction and portfolio selection. In the first stage, a hybrid model combining eXtreme Gradient Boosting (XGBoost) with an improved firefly algorithm (IFA) is proposed to predict stock prices for the next period. The IFA is developed to optimize the hyperparameters of the XGBoost. In the second stage, stocks with higher potential returns are selected, and the MV model is employed for portfolio selection. Using the Shanghai Stock Exchange as the study sample, the obtained results demonstrate that the proposed method is superior to traditional ways (without stock prediction) and benchmarks in terms of returns and risks.
Article
Integrating return prediction of traditional time series models in portfolio formation can improve the performance of original portfolio optimization model. Since machine learning and deep learning models have shown overwhelming superiority than time series models, this paper combines return prediction in portfolio formation with two machine learning models, i.e., random forest (RF) and support vector regression (SVR), and three deep learning models, i.e., LSTM neural network, deep multilayer perceptron (DMLP) and convolutional neural network. To be specific, this paper first applies these prediction models for stock preselection before portfolio formation. Then, this paper incorporates their predictive results in advancing mean–variance (MV) and omega portfolio optimization models. In order to present the superiority of these models, portfolio models with autoregressive integrated moving average’s return prediction are used as benchmarks. Evaluation is based on historical data of 9 years from 2007 to 2015 of component stocks of China securities 100 index. Experimental results show that MV and omega models with RF return prediction, i.e., RF+MVF and RF+OF, outperform the other models. Further, RF+MVF is superior to RF+OF. Due to the high turnover of these two models, this paper discusses their performance after deducting the transaction fee cased by turnover. Experiments present that RF+MVF still performs the best among MVF models and omega model with SVR prediction (SVR+OF) performs the best among OF models. Moreover, RF+MVF performs better than SVR+OF and high turnover erodes nearly half of their total returns especially for RF+OF and RF+MVF. Therefore, this paper recommends investors to build MVF with RF return prediction for daily trading investment.
Chapter
In this research, we extract causal information from textual data and construct a causality database in the economic field. We develop a method to produce causal chains starting from phrases representing specific events. The proposed method can offer possible ripple effects and factors of particular events or situations. Using our approach to Japanese textual data, we have implemented a prototype system that can display causal chains for user-entered words. A user can interactively edit the causal chains by selecting appropriate causalities and deleting inappropriate causalities. The economic causal-chain search algorithm can be applied to various financial information services.
Article
Portfolio management is a series of processes that maximize returns and minimize risk by allocating assets efficiently. Along with the developments in machine learning technology, it has been studied to apply machine learning methods to prediction-based portfolio management. However, such methods have a few limitations. First, they do not consider the relations between assets for the prediction. In addition, the studies commonly focus on the prediction accuracy, neglecting the construction of portfolios. Furthermore, the methods have usually been evaluated with index data, which hardly represent actual prices to buy or sell an asset. To overcome these problems, Exchange Traded Funds (ETFs) are employed for base assets for the evaluation, and we propose a two-stage deep learning framework, called Grouped-ETFs Model (GEM), with a joint cost function. The GEM is designed to learn the features of inter-asset and groups in each stage. Also, the proposed joint cost can consider relative returns for the training while the relative returns are a crucial factor to construct a portfolio. The results of a rigorous evaluation with global ETF data indicate that the proposed GEM with the joint cost outperforms the equally weighted portfolio and the ordinary deep learning model by 33.7% and 30.1%, respectively. An additional experiment using sector ETFs verifies the generality of the proposed model where the results accord with those of the previous experiment.
Article
Portfolio theory is an important foundation for portfolio management which is a well-studied subject yet not fully conquered territory. This paper proposes a mixed method consisting of long short-term memory networks and mean-variance model for optimal portfolio formation in conjunction with the asset preselection, in which long-term dependences of financial time-series data can be captured. The experiment uses a large volume of sample data from the UK Stock Exchange 100 Index between March 1994 and March 2019. In the first stage, long short-term memory networks are used to forecast the return of assets and select assets with higher potential returns. After comparing the outcomes of the long short-term memory networks against support vector machine, random forest, deep neural networks, and autoregressive integrated moving average model, we discover that long short-term memory networks are appropriate for financial time-series forecasting, to beat the other benchmark models by a very clear margin. In the second stage, based on selected assets with higher returns, the mean-variance model is applied for portfolio optimisation. The validation of this methodology is carried out by comparing the proposed model with the other five baseline strategies, to which the proposed model clearly outperforms others in terms of the cumulative return per year, Sharpe ratio per triennium as well as average return to the risk per month of each triennium. i.e. potential returns and risks.