Financial Market - A Network Perspective
Jukka-Pekka Onnela1, Jari Saramäki1, Kimmo Kaski1, and János Kertész2
1 Laboratory of Computational Engineering, Helsinki University of Technology, P.O. Box 9203, FIN-02015 HUT. jonnela@lce.hut.fi
2 Department of Theoretical Physics, Budapest University of Technology and Economics, Budafoki út 8, H-1111 Budapest, Hungary.
We construct a weighted financial network for a subset of NYSE traded stocks, in which the nodes correspond to stocks and edges to interactions between them. We identify clusters of stocks in the network, based on the Forbes business sector classification, and study their intensity and coherence. Our approach indicates to what extent the business sector classifications are visible in market prices, enabling us to gauge the extent of group-behaviour exhibited by stocks belonging to a given business sector.
1 Introduction
Complex networks provide a very general framework, based on the concepts of statistical physics, for studying systems with large numbers of interacting agents [1]. The nodes of the network represent the agents and a link connecting two nodes indicates an interaction between them. In the complex networks framework, interactions have typically been considered to be binary in nature, meaning that either two nodes interact (are connected) or they do not (are not connected). Imposing a binary interaction requires setting a threshold value for interaction strength, such that interactions falling below it are discarded. Although this approach is a suitable first approximation, thresholding can lead to a loss of information. Consequently, a natural step forward is to assign weights on the links to reflect the strengths of interactions.
In a financial market the performance of a company is compactly characterised by a single number, the stock price, which results from a large number of interactions between different market participants. Although the exact nature of these interactions is not known, they are certainly reflected in the equal-time return correlations. In this paper we study a financial network in which the nodes correspond to stocks and links to return correlation based interactions between them. Mantegna [2] was the first to construct such networks and the idea was followed and extended by others [3, 4, 5, 6, 7].
2 Methods
2.1 Constructing the Network
We start by considering a price time series for a set of $N$ stocks and denote the daily closing price of stock $i$ at time $\tau$ (an actual date) by $P_i(\tau)$. Since investors work in terms of relative as opposed to absolute returns, logarithmic returns are commonly used in studies, and thus we denote the daily logarithmic return of stock $i$ by $r_i(\tau) = \ln P_i(\tau) - \ln P_i(\tau - 1)$. We extract a time window of width $T$, measured in days and in this paper set to $T = 1000$ (equal to four years, assuming 250 trading days a year), and obtain a return vector $r_i^t$ for stock $i$, where the superscript $t$ enumerates the time window under consideration. Then the equal-time correlation coefficients between assets $i$ and $j$ can be written as
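
\[
\rho_{ij}^t = \frac{\langle r_i^t r_j^t \rangle - \langle r_i^t \rangle \langle r_j^t \rangle}{\sqrt{\left[ \langle (r_i^t)^2 \rangle - \langle r_i^t \rangle^2 \right] \left[ \langle (r_j^t)^2 \rangle - \langle r_j^t \rangle^2 \right]}}, \tag{1}
\]
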
where $\langle \ldots \rangle$ indicates a time average over the consecutive trading days included in the return vectors. These correlation coefficients between $N$ assets form a symmetric $N \times N$ correlation matrix $C^t$ with elements $\rho_{ij}^t$. The different time windows are displaced by $\delta T$, where we have used a step size of one week, i.e. $\delta T = 5$ days.
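
The construction above can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' code: the function names `log_returns` and `windowed_correlations` are hypothetical, the price data are synthetic, and `np.corrcoef` is used as a stand-in for Eq. (1), to which it is numerically equivalent because the normalisation of the moments cancels in the ratio.

```python
import numpy as np

def log_returns(prices):
    """Daily log returns r_i(tau) = ln P_i(tau) - ln P_i(tau - 1).

    prices: array of shape (n_days, n_stocks) of closing prices.
    Returns an array of shape (n_days - 1, n_stocks).
    """
    return np.diff(np.log(prices), axis=0)

def windowed_correlations(returns, width=1000, step=5):
    """Yield (start_index, C_t) for each time window t.

    C_t is the N x N matrix of equal-time correlation coefficients
    over `width` consecutive returns; windows are displaced by `step`.
    """
    n_obs = returns.shape[0]
    for start in range(0, n_obs - width + 1, step):
        window = returns[start:start + width]
        # rowvar=False: each column (stock) is one variable.
        yield start, np.corrcoef(window, rowvar=False)

# Synthetic prices for 4 hypothetical stocks (assumption: geometric random walk).
rng = np.random.default_rng(0)
prices = np.exp(np.cumsum(rng.normal(0.0, 0.01, size=(1200, 4)), axis=0))
returns = log_returns(prices)
windows = list(windowed_correlations(returns))
```

With 1199 returns, a width of 1000 and a step of 5, the sketch produces 40 windows, each with a symmetric 4 x 4 correlation matrix.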
Next we define interaction strengths, or link weights, based on the correlation coefficients. One of the simplest alternatives is to use the absolute values of the correlation coefficients, in which case the interaction strength reflects the strength of linear coupling between the logarithmic returns of stocks $i$ and $j$ in time window $t$. If we use $w_{ij}^t$ to denote the weight on the link connecting node $i$ and node $j$, with this choice we have $w_{ij}^t = |\rho_{ij}^t|$, or in matrix form $W^t = |C^t|$. Because the correlation coefficients $\rho_{ij}^t$ vary between $-1$ and $1$, the interaction strengths $w_{ij}^t$ are naturally limited to the $[0, 1]$ interval. In the correlation matrix $C^t$ we have estimated the correlations between all the assets. Thus, the resulting network will be fully connected, consisting of $N$ nodes and $N(N-1)/2$ links, corresponding to the elements in the upper (or lower) triangular part of the weight matrix (see footnote 3).
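
As a rough illustration of this step, the weight matrix and its list of links might be obtained as follows. The helper names `weight_matrix` and `link_weights` are hypothetical, the small correlation matrix is made up, and zeroing the diagonal (no self-links) is an assumption of this sketch rather than something the text specifies.

```python
import numpy as np

def weight_matrix(C):
    """Link weights W = |C| for a correlation matrix C."""
    W = np.abs(C)
    np.fill_diagonal(W, 0.0)  # assumption: a stock has no link to itself
    return W

def link_weights(W):
    """Weights of the N(N-1)/2 links, read from the upper triangle of W."""
    i, j = np.triu_indices_from(W, k=1)
    return W[i, j]

# Made-up 3 x 3 correlation matrix for illustration only.
C = np.array([[1.0, -0.5, 0.2],
              [-0.5, 1.0, 0.1],
              [0.2, 0.1, 1.0]])
W = weight_matrix(C)
w = link_weights(W)          # weights of links (0,1), (0,2), (1,2)
n_links_116 = 116 * 115 // 2  # link count for N = 116 nodes
```

For the 116 stocks considered in the Results section, the fully connected network has 116 x 115 / 2 = 6670 links.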
Footnote 3: It is possible, using some heuristic, to insert only a fraction of all the links in the network, but this would result in an additional parameter to be determined.
2.2 Characterising Network Clusters
Let us now consider any cluster or subgraph $g$ in the above defined network. To characterise how compact or tight the subgraph is, we use the concept of subgraph intensity $I(g)$ introduced in [8]. Put differently, subgraph intensity allows us to characterise the interaction patterns within clusters. If we use $v_g$ to denote the set of nodes and $\ell_g$ the set of links in the subgraph with weights $w_{ij}$, we can express subgraph intensity as the geometric mean of its weights:

\[
I(g) = \left( \prod_{(ij) \in \ell_g} w_{ij} \right)^{1/|\ell_g|}. \tag{2}
\]

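
A minimal sketch of Eq. (2), assuming the link weights of the subgraph are supplied as a flat sequence. The function name `subgraph_intensity` is hypothetical. The geometric mean is evaluated in the log domain, a numerical choice made here to avoid underflow when many small weights are multiplied.

```python
import numpy as np

def subgraph_intensity(weights):
    """Geometric mean of the link weights of a subgraph, Eq. (2)."""
    w = np.asarray(weights, dtype=float)
    if np.any(w == 0.0):
        return 0.0  # a single zero weight makes the product vanish
    return float(np.exp(np.mean(np.log(w))))

uniform = subgraph_intensity([0.5, 0.5, 0.5])    # all weights equal
one_low = subgraph_intensity([0.9, 0.9, 0.001])  # one very low weight
broken = subgraph_intensity([0.9, 0.9, 0.0])     # one missing link
```

The second case shows how a single weak link pulls the intensity far below the other two weights, which is the sensitivity the geometric mean is chosen for.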
Due to the nature of the geometric mean, the subgraph intensity $I(g)$ may be low because one of the weights is very low, or it may result from all of the weights being low. In order to distinguish between these two extremes, we use the concept of subgraph coherence $Q(g)$ [8]. It assumes values from the interval $[0, 1]$ and is close to unity only if the subgraph weights do not differ much, i.e. are internally coherent. Subgraph coherence is defined as the ratio of the geometric to the arithmetic mean of the weights as

\[
Q(g) = \frac{I(g)\,|\ell_g|}{\sum_{(ij) \in \ell_g} w_{ij}}. \tag{3}
\]

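
The distinction between the two extremes can be illustrated with a short sketch of Eq. (3). The function name `subgraph_coherence` and the example weights are hypothetical; returning NaN when all weights are zero is a choice of this sketch, since the ratio is undefined in that case.

```python
import numpy as np

def subgraph_coherence(weights):
    """Ratio of the geometric to the arithmetic mean of link weights, Eq. (3)."""
    w = np.asarray(weights, dtype=float)
    arithmetic = w.mean()
    if arithmetic == 0.0:
        return float("nan")  # assumption: undefined when every weight is zero
    if np.any(w == 0.0):
        return 0.0           # geometric mean vanishes
    geometric = np.exp(np.mean(np.log(w)))
    return float(geometric / arithmetic)

all_low = subgraph_coherence([0.1, 0.1, 0.1])    # low intensity, coherent
one_low = subgraph_coherence([0.9, 0.9, 0.001])  # low intensity, incoherent
```

Both example subgraphs have an intensity of roughly 0.1, yet the first has a coherence of one while the second falls well below it, so the pair $(I, Q)$ separates the two situations.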
In order to compare intensity and coherence values, we need to establish a reference. A very natural reference system is obtained by considering the entire market. In other words, we take all of the $N$ nodes and $N(N-1)/2$ links making up the network $G$, and then using the above definitions compute $I(G)$ and $Q(G)$. We can also use relative cluster intensity for cluster $g$, given by $I(g)/I(G)$, and relative cluster coherence, given by $Q(g)/Q(G)$, if instead of absolute values we wish to examine the cluster intensity or coherence relative to the reference system.
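
The relative measures can be sketched as below for a single weight matrix. All names (`cluster_weights`, `relative_measures`) and the toy 4-node weight matrix are hypothetical, and the sketch assumes strictly positive weights so that the logarithm is finite.

```python
import numpy as np

def cluster_weights(W, members):
    """Weights of all links among `members` (fully connected subgraph)."""
    members = np.asarray(members)
    sub = W[np.ix_(members, members)]
    return sub[np.triu_indices(len(members), k=1)]

def intensity_and_coherence(w):
    """Return (I, Q) for strictly positive link weights w (assumption)."""
    intensity = np.exp(np.mean(np.log(w)))
    return intensity, intensity / w.mean()

def relative_measures(W, members):
    """Return (I(g)/I(G), Q(g)/Q(G)) with the whole network G as reference."""
    I_g, Q_g = intensity_and_coherence(cluster_weights(W, members))
    I_G, Q_G = intensity_and_coherence(cluster_weights(W, np.arange(W.shape[0])))
    return float(I_g / I_G), float(Q_g / Q_G)

# Toy market: nodes 0 and 1 are strongly coupled, all other links are weak.
W = np.full((4, 4), 0.2)
W[0, 1] = W[1, 0] = 0.8
np.fill_diagonal(W, 0.0)
rel_I, rel_Q = relative_measures(W, [0, 1])
```

In this toy case both ratios exceed unity, meaning the cluster is both more intense and more coherent than the market as a whole; taking the whole market as the cluster gives ratios of exactly one.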
3 Results
In this section we consider a subset of 116 NYSE-traded stocks from the S&P 500 index from 1.1.1982 to 31.12.2000. We deal with the closing price, resulting in a total of 4787 price quotes for each stock. To divide the stocks into clusters, we obtained the Forbes business sector labels for each stock [9]. The stocks in our dataset fall into 12 business sectors, such as Energy and Utilities. Given these labels for each stock, we use the concepts of subgraph intensity and coherence to gauge how similarly stocks belonging to a given business sector behave as a function of time.
Let us consider a cluster $g$, constructed such that all of its nodes $v_g$ belong to the same business sector, and let $n$ denote the number of nodes in this cluster. Then we add all the $n(n-1)/2$ links corresponding to the interaction strengths between any pair of nodes within $g$. In one extreme, if all the link weights are equal to unity, every node participating in $g$ interacts maximally with its $n-1$ neighbours. In the other extreme, if one or more of the weights are zero, the subgraph intensity for the fully connected subgraph $g_n$ tends to zero because the original topological structure no longer exists.
In Figure 1, we show the relative cluster intensity as a function of time for selected business sector clusters. Values above unity indicate that the intensity of the cluster is higher than that of the market. This implies that in most cases stocks belonging to a given business sector are tied together in the sense that intra-cluster interaction strengths are considerably stronger than those of the market on the whole. It is also worth noting the high value for the absolute cluster intensity for the market roughly between 1986 and 1990. This elevated value is due to the 1987 stock market crash (Black Monday), which caused the market to behave in a unified manner (see footnote 4). The crash also compresses the relative cluster intensities, which means that the cluster-specific behaviour is temporarily suppressed by the crash, and after the market recovers the clusters regain their characteristic behaviour.
[Figure 1: relative cluster intensity versus time (1984 to 2000) for the Basic Materials, Conglomerates, Energy, Financial and Utilities clusters, with the market as reference. Inset: cluster intensity of the market versus time.]
Fig. 1. Relative (to the market) cluster intensity as a function of time for select clusters. Inset: The (absolute) cluster intensity for the market used for normalisation.
Business sector clusters are also more coherent than the market, as shown in Figure 2, except for Basic Materials (BM). One explanation is obtained from the industry classification, a finer classification scheme, of the stocks comprising the BM cluster. These industries include Metal Mining, Paper, Gold & Silver and Forestry & Wood Products. Therefore, it is clear that the Basic Materials business sector is extremely diverse. Also, the prices of some of these commodities are determined, at least partially, outside the stock market. Consequently, it is not so surprising that the cluster intensity remains low, at times even falling below the market reference. Similarly, the low coherence values indicate that this cluster contains pairs of stocks with very high correlations (those belonging to the same industry, such as gold mining), but also pairs with very low correlations (companies belonging to different industries). In conclusion, our results indicate that, in most cases, stocks belonging to the same business sector have higher intensity and more coherent intra-cluster than inter-cluster interactions.
Footnote 4: The length of this elevated period is related to the window width parameter.
[Figure 2: relative cluster coherence versus time (1984 to 2000) for the Basic Materials, Conglomerates, Energy, Financial and Utilities clusters, with the market as reference.]
Fig. 2. Relative (to the market) cluster coherence as a function of time.
References
1. Albert R, Barabási A-L (2002) Statistical mechanics of complex networks. Reviews of Modern Physics 74, 47-97
2. Mantegna R N (1999) Hierarchical structure in financial markets. European Physical Journal B 11, 193-197
3. Vandewalle N, Brisbois F, Tordoir X (2001) Non-random topology of stock markets. Quantitative Finance 1, 372-374
4. Marsili M (2002) Dissecting financial markets: Sectors and states. Quantitative Finance 2, 297-302
5. Caldarelli G, Battiston S, Garlaschelli D, Catanzaro M (2004) In: Ben-Naim E, Frauenfelder H, Toroczkai Z (eds) Complex Networks. Springer
6. Onnela J-P, Chakraborti A, Kaski K, Kertész J, Kanto A (2003) Dynamics of market correlations: Taxonomy and portfolio analysis. Physical Review E 68, 056110
7. Onnela J-P, Chakraborti A, Kaski K, Kertész J, Kanto A (2003) Asset trees and asset graphs in financial markets. Physica Scripta T106, 48-54
8. Onnela J-P, Saramäki J, Kertész J, Kaski K Intensity and coherence of motifs in weighted complex networks. cond-mat/0408629
9. The website of Forbes at www.forbes.com