Risk-dependent centrality in economic and financial networks
Paolo Bartesaghi, Michele Benzi, Gian Paolo Clemente, Rosanna Grassi, and Ernesto Estrada

TL;DR
This paper introduces risk-dependent centrality measures for economic and financial networks, accounting for external risk levels and their impact on node importance, revealing phenomena like ranking interlacement.
Contribution
It develops a novel framework for risk-sensitive centrality in networks, extending traditional measures to include external risk factors and analyzing their mathematical properties.
Findings
High risk-centrality correlates with sensitivity during crises.
Ranking interlacement phenomenon observed due to external risk changes.
Validates measures on real-world financial and corporate networks.
Abstract
Node centrality is one of the most important and widely used concepts in the study of complex networks. Here, we extend the paradigm of node centrality in financial and economic networks to consider the changes of node "importance" produced not only by the variation of the topology of the system but also as a consequence of the external levels of risk to which the network as a whole is submitted. Starting from the "Susceptible-Infected" (SI) model of epidemics and its relation to the communicability functions of networks we develop a series of risk-dependent centralities for nodes in (financial and economic) networks. We analyze here some of the most important mathematical properties of these risk-dependent centrality measures. In particular, we study the newly observed phenomenon of ranking interlacement, by means of which two entities may interlace their ranking positions in terms of…
| Density | ||||||
|---|---|---|---|---|---|---|
| 0.1 | 0.3 | 0.5 | 0.7 | 0.9 | ||
| 0.1 | 0.9947 | 0.9967 | 0.9971 | 0.9994 | 0.9998 | |
| 0.5 | 0.9844 | 0.9950 | 0.9966 | 0.9994 | 0.9998 | |
| 1.0 | 0.9813 | 0.9950 | 0.9966 | 0.9994 | 0.9998 | |
| Plot | Corporate 1 | Corporate 2 | calculated | observed |
|---|---|---|---|---|
| (a) | J.P. Morgan&Co Inc. | Bank of America Corp. | 0.375 | 0.37 |
| (b) | Pfizer Inc. | Ashland Inc. | 0.441 | 0.41 |
| (c) | Morgan Stanley & Co. | Bank One Corp. | 0.176 | 0.17 |
| (d) | AT&T Corp. | Airtouch Communications | 0.273 | 0.27 |
| (e) | Union Carbide Corp. New | AON Corp. | 0.353 | 0.32 |
| (f) | General Motors Corp. | Boeing Co. | 0.214 | 0.14 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Complex Systems and Time Series Analysis · Opinion Dynamics and Social Influence
\headers
Risk-dependent centrality in economic and financial networksP. Bartesaghi, M. Benzi, G. P. Clemente, R. Grassi and E. Estrada
\newsiamremarkremarkRemark \newsiamremarkhypothesisHypothesis
\newsiamthmclaimClaim
Risk-dependent centrality in economic and financial networks
Paolo Bartesaghi Department of Statistics and Quantitative Methods, University of Milano - Bicocca, Via Bicocca degli Arcimboldi 8, 20126, Milano, Italy (, ). [email protected]
Michele Benzi Scuola Normale Superiore, Piazza dei Cavalieri 7, 56126, Pisa, Italy (). [email protected]
Gian Paolo Clemente Department of Mathematics for Economics, Financial and Actuarial Sciences, Università Cattolica del Sacro Cuore, Largo Gemelli 1, 20123, Milano (). [email protected]
Rosanna Grassi22footnotemark: 2
Ernesto Estrada Institute of Mathematics and Applications (IUMA), Universidad de Zaragoza, Pedro Cerbuna 12, E-50009 Zaragoza, Spain; ARAID Foundation, Government of Aragón, 50018 Zaragoza, Spain (). [email protected]
Abstract
Node centrality is one of the most important and widely used concepts in the study of complex networks. Here, we extend the paradigm of node centrality in financial and economic networks to consider the changes of node “importance” produced not only by the variation of the topology of the system but also as a consequence of the external levels of risk to which the network as a whole is submitted. Starting from the “Susceptible-Infected” (SI) model of epidemics and its relation to the communicability functions of networks we develop a series of risk-dependent centralities for nodes in (financial and economic) networks. We analyze here some of the most important mathematical properties of these risk-dependent centrality measures. In particular, we study the newly observed phenomenon of ranking interlacement, by means of which two entities may interlace their ranking positions in terms of risk in the network as a consequence of the change in the external conditions only, i.e., without any change in the topology. We test the risk-dependent centralities by studying two real-world systems: the network generated by collecting assets of the S&P 100 and the corporate board network of the US top companies, according to Forbes in 1999. We found that a high position in the ranking of the analyzed financial companies according to their risk-dependent centrality corresponds to companies more sensitive to the external market variations during the periods of crisis.
keywords:
Communicability, Complex networks, SI models, Centrality measures, Risk propagation
{AMS}
05C82, 90B15, 91G80
1 Introduction
Modern economic and financial systems are characterized by a vast collection of interacting agents [1, 6, 12, 37, 47, 55]. In economic systems, for instance, the interdependence among entities characterizes the trade and exchange of goods in non-anonymous markets as well as in risk sharing agreements in developing countries [1]. In this framework, the agents’ interaction is responsible for the nature of the relations between the individual behavior and the aggregate behavior [55].
The human factor which underlies these economic and financial systems is also characterized by the interconnectivity. The existence of networks of interpersonal relations has been empirically observed to constitute a fundamental factor in shaping the inter-institution networks, or in accounting for the networks of risk-sharing agreements [33, 34], the formation of buyer-seller networks [18, 40, 60], product adoption decisions [23, 54], diffusive processes [41, 46, 64], industrial organization [47], trade agreements [37] and even for the existence of interbank networks [1]. This is not surprising as humans are responsible for the execution of deals between the institutions to which they belong to [16, 59, 71].
From a mathematical perspective all these interdependencies between economic and financial entities can be captured by the formal concept of network, in which nodes represent the entities (individuals, firms, countries, etc.) and edges account for the relations between such entities, ranging from social relations to trade agreements [29]. Hence, it is possible to use the tools of network theory to analyze the structure, the evolution and the dynamic processes that take place on these systems. On one side, researchers have studied the topological properties of these networks (sometimes called static analysis), which do not assume mechanisms of transmission of effects through the economic and financial entities [38, 52, 79]. Among such studies, it is frequent to find analyses of clusters formed by groups of institutions, as well as the centrality of individual nodes in the networks [8, 13, 78]. Specifically, centrality measures (see Chapter 5 in [29] for a detailed analysis) are topological characterizations of the nodes and their neighborhood in a network. In the analysis of financial and economic networks, the use of centrality measures is not so effective, as the classical ones provide a static view of the network and even other measures based on dynamic processes, such as random walks based centralities [69], do not capture the changing conditions to which these networks could be submitted in relatively short periods of time. As an illustrative example, let us consider a hypothetical interbank network for which we are interested in analyzing the risk-dependent exposure of the various entities of the system. Any centrality measure will point out a specific and static ranking of the nodes. However, a bank which is very central at a low-level of external risk is not necessarily central when such level of external risk increases, and vice versa. On the other hand, the propagation of shocks through these networks is considered and it is usually known as dynamic analysis [3, 15, 17, 25, 39, 48, 43]. In these studies, a specific way of transmission of these shocks through the network is assumed – as in the case of “Susceptible-Infected” and “Susceptible-Infected-Recovered” epidemiological models [63, 67, 73] – and then a systemic risk analysis is based on the contagion effects observed through such models.
In this work we develop a mathematical model to account for the risk exposure of an entity in a networked (economic or financial) system. This model is based on the relation between the Susceptible-Infected epidemiological model and the so-called communicability functions of a network [30]. Using this connection we derive new centrality indices that quantify the level of risk at which an entity is exposed to as a function of the global external level of risk. Our approach takes advantage of the benefits of both static and dynamic analyses. Indeed, unlike the standard approaches followed in the literature, these risk-dependent centralities are not static indices, as most of centrality indices are, but they vary with the change of the external global risk level at which the system is submitted to. More importantly, the ranking of the nodes in these networks also depends on this global external level of risk. This means that an entity – a node in the network – which is at low (high) level of risk under external conditions can be at high (low) level under different conditions.
We test our model by using two different systems, a network of assets based on the daily returns of the components of the S&P 100 for the period ranging from January 2001 to December of 2017 and a network representing the interconnection between companies in the US top corporates according to Forbes in 1999. In the first case we extract the essential information about asset correlations through the minimum spanning tree. We measure how the centrality of the assets changes at different values of the external risk. What emerges is a high volatility in the rankings during the financial crisis of 2007-2008, when the node centrality proves to be more sensitive to the external risk. In the case of the corporate network we analyze a sample of significant companies, looking for a correlation between the shareholders value creation (SVC) and their behavior during and after the crisis period at which data were collected. We find that a remarkable increase in their risk-centrality ranking during a crisis corresponds to a less resilient reaction to the external market turmoil.
The paper is structured as follows. In Subsection 1.1 we recall the main literature about the use of epidemiological models for modeling financial contagion and we motivate the choice of a Susceptible-Infected model. The necessary mathematical preliminaries are given in Section 2. Therefore, we describe a Susceptible-Infected (SI) model on a financial network (Section 3) and we define the risk-dependent centrality proving some mathematical properties (Section 4). We perform numerical analyses of the proposed centrality for random networks (Section 5), then we apply the proposed measure to real-world financial networks (Section 6) and we analyze the ranking interlacement problem (Section 7). Section 8 remarks how the proposed model could provide additional insights in the analysis of the economic and financial impacts of the crisis related to the diffusion of the new coronavirus named SARS-COV-2. Conclusions follow in Section 9.
1.1 Related literature and motivations
The process in which one financial institution spreads negative effects to another institution resembles very much the propagation of epidemics on networks [66, 48]. The fact that such processes are known as “financial contagion” already captures part of these similarities. Then, it is not strange that epidemiological models are frequently used to capture the subtleties of financial contagion processes. There are many of such compartmental models in epidemiology, but the most widely used for modeling financial contagion are the Susceptible-Infected-Recovered (SIR) [21, 36, 74, 58, 76, 42] and the Susceptible-Infected-Susceptible (SIS) [5, 14] ones. They are not only used to model financial contagion per se, but also for the propagation of rumors and innovations of interest for financial institutions [70, 50]. These models are well-suited in depicting financial contagion because they do not require arbitrary assumption on loss rates and balance sheets. As remarked by Toivanen [80], they capture the psychological aspects of contagion process “by relating a bank’s relative financial strength with the perceived counterparty risk and expectations”.
The previously mentioned SIS/SIR models and their variants are mainly used in studying the dynamics of contagion in a system in a post-mortem way. As it is well-known, both SIS and SIR models are characterized by the presence of a threshold , which is defined as the reciprocal of the principal eigenvalue of the adjacency matrix. The below-the-threshold or above-the-threshold behavior of the spreading process depends on whether the effective infection rate is less than or greater than such a threshold. Below the threshold, we have the extinction of the contagion and above the threshold a non-zero fraction of infected nodes persists in the network even over a wide range of timescales. The effective infection rate depends on both the infection rate per link and on the curing or recovering rate . For instance in Figure 1(a) we illustrate the evolution of a contagion dynamics for an Erdős-Rényi graph with 100 nodes and connection probability 0.1 by using the SIS model. The principal eigenvalue of the adjacency matrix is so that the epidemic threshold is . The infectivity rate per link is 0.002 for both curves and the initial infection probability is 0.2 (20 nodes over 100 initially infected). The curing rate is 0.001 for the dashed red line (epidemic) and 0.04 for the solid blue line (extinction). Then, the effective infection rate is for the the dashed red line (epidemic) and for the solid blue line (extinction).
In this work we are interested in the very early signals that the system can provide for alerting about a propagation of a financial contagion. In this case it is very important to consider the window of vulnerability between the time the contagion phenomenon is firstly recognized and the time an action is taken to face the infection. This window could be arbitrarily wide. In any real condition, there is a non negligible time interval in which a recovery tool is not available yet and the recovering rate is equal to zero. It has been recently shown by Lee et al. [62] that, within this window, the spreading phenomenon is better described by a SI model than by any other model with a non-null recovering rate, e.g., SIR and SIS (see Figure 1(b)). In this framework, a key point is to predict the “most at risk” nodes in the network. Therefore, we are interested in the early times of the epidemic where it is possible to limit or avoid the distress propagation by introducing specific measures on the risky nodes in the network.
Moreover, in order to be effective in reducing the spreading phenomenon, the curing rate has to be large enough. More precisely, since (being the mean degree and the maximum degree), the curing rate has to be at least to get a below-the-threshold behavior. But for a big real network can be very large, even if the mean degree is small. This implies that has to be significantly bigger than , or in other words, the infection significantly weaker than the self-recovering process. This fact could be totally unlikely in a real contagion process on a real network and it makes the use of the SIS or SIR model extremely unrealistic as remarked by Lee et al. [62]. Even when is small and the node infection process is dominant, the corresponding epidemic dynamic is better captured by the SI model. From an application point of view, this is possibly true over a wide range of timescales under constrained environments where applying massive action to limit contagion is practically infeasible.
As mentioned before, the detection of risky nodes in a network could be relevant for limiting the risk propagation effects (see, e.g., [44, 7]). Hence, centrality of a given institution as best spreader node in a contagion process has been widely explored (see, for instance, [63]) in order to identify the most dangerous crisis epicenter. The idea of best spreader node has also been studied in [81] in terms of topological centralities, which was previously investigated under the name of “vibrational centrality”(see, e.g., [31]). Centralities have been also used as measures to assess contagion in the interbank market. In this framework, Dimitros and Vasileios [22] recommended the use of well-established centrality measures as a way to identify the most important variables in a network. Battiston et al. [8] introduce DebtRank, a centrality measure that accounts for distress in one or more banks, based on the possibility of losses occurring prior to default. The concept that some banks might be too central to fail originates from this work (see, e.g., [8]).
2 Preliminaries
Here we use indistinctly the terms graphs and networks. Most of the network theoretic concepts defined hereafter can be found in [29]. A graph is defined by a set of nodes (vertices) and a set of edges between the nodes. is a loop starting and ending in . The degree of a node, denoted by , is the number of edges incident to in . The adjacency matrix of the graph with entries if or zero otherwise. We consider here simple graphs, i.e. without loops and multiedges. The theoretical model will be developed for unweighted networks, we also recall here the definition of weighted graphs, as we consider in the paper two empirical real examples for which the network is weighted. A weighted graph is a graph in which is a positive number assigned to the corresponding edge . In this case the sum of the weights for all edges incident to a node is known as the weighted degree or strength. We consider here only undirected networks, such that implies that . In this case the matrix can be expressed as where is an orthogonal matrix of the eigenvectors of and is the diagonal matrix of eigenvalues . The entries of are denoted by .
An important quantity for studying communication processes in networks is the communicability function [30], defined for a pair of nodes and as
[TABLE]
It counts the total number of walks starting at node and ending at node , weighted in decreasing order of their length by a factor of . A walk of length in is a set of nodes such that for all , . A closed walk is a walk for which . Therefore, is considering shorter walks as more influential than longer ones. The matrix exponential is an example of a general class of matrix functions which are expressible as
[TABLE]
where are coefficients giving more weight to the shorter than to the longer walks, and making the series converge. The term , which counts the number of closed walks starting at the node giving more weight to the shorter than to the longer ones, is known as the subgraph centrality of the node .
We also consider here a Susceptible-Infected (SI) model over an undirected network. Each susceptible node becomes infected at the infection rate per link times the number of infected neighboring nodes. Let be the instant in which a node is infected. Node remains in this state and does not come back susceptible. Let us introduce a random variable denoting the state of a node at time
[TABLE]
Then we define
[TABLE]
which is the probability that node is infected at time . In other words, node is healthy at time with probability . For the whole network, we define the vector of probabilities:
[TABLE]
3 Model
Let us consider a SI model on a financial network. The nodes of a graph represent financial institutions and the edges connecting them represent an interaction that can transmit a “disease” from one institution to another. A node can be susceptible and then get infected from a nearest neighbor or it is infected and can transmit the infection to other susceptible nodes. Let be the infection rate and let be the probability that node get infected at time from any infected nearest neighbor. Then,
[TABLE]
which in matrix-vector form becomes:
[TABLE]
with initial condition
It is well-known that on a strongly connected network111A graph is strongly connected if and only if for each pair of nodes there is a directed walk starting at and ending at , and a directed walk starting at and ending at [67]:
if then for all ; 2. 2.
is monotonically non-decreasing in ; 3. 3.
there are two equilibrium points: , i.e. no epidemic, and (the vector of all ones), i.e. full contagion; 4. 4.
the linearization of the model around the point is given by
[TABLE]
and it is exponentially unstable; in fact, since, in a non-empty undirected graph, has at least one positive eigenvalue, any solution component in the direction of the corresponding eigenvector grows unboundedly as increases; 5. 5.
each trajectory with converges asymptotically to , i.e. the epidemic spreads monotonically to the entire network.
In particular, the linearized problem comes from the following observation. It can be checked that
[TABLE]
or
[TABLE]
and . Then, we can use the linear dynamical system
[TABLE]
as an upper-bound for the original non-linear dynamical system, that has been used in the literature (see [67]) as an approximation of the exact problem. One of its main advantages is that it can be solved analytically and its solution can be written as:
[TABLE]
which using the spectral decomposition of can be written as
[TABLE]
This solution to the linearized model is affected by the following main problems:
grows quickly without bound, in spite of the fact that is a vector of probabilities which should not exceed the unit; 2. 2.
is an accurate solution to the nonlinear SI problem only if and .
The mathematical properties of the linear dynamical system 11 as well as of the solution 15 have been extensively studied by Mugnolo in [68]. We direct the reader to this reference for the details.
Hereafter we will follow the recent work of Lee et al. [62], who proposed the following change of variable to avoid the aforementioned problems with the solution of the linearized SI model:
[TABLE]
which is an increasing convex function. Then, as is the probability that node is not infected at a given time , the new variable can be interpreted as the information content of the node or surprise of not being infected (see, e.g., [19]). According to [62], the SI model Eq. 9 can be now written as
[TABLE]
or
[TABLE]
The approximate solution to the SI model provided by [62] is then given by
[TABLE]
where is the vector in which the th entry is and
[TABLE]
As stressed by [62], the interesting case of the dynamics is when , in which case the solution simplifies to
[TABLE]
Now, we can make the further assumption that the initial probabilities of being infected are equal for every node, i.e. that at the beginning every node has the same probability to be infected and to be the one from which the epidemic starts. This means that we are asking for
[TABLE]
for some scalar constant . In this case . If we set , the approximate solution of the SI on the network becomes:
[TABLE]
and since ,
[TABLE]
The component is called total communicability of node and it will be denoted by . Hence, component-wise we have:
[TABLE]
Keeping in mind that and , we can write the previous equation also as
[TABLE]
which means that at time is proportional to the variation in the information content of node from time 0 to time . Finally, the probability of node of being infected at time can be expressed in terms of as
[TABLE]
When the parameter is fixed, the number of infected nodes depends only on the term and then on the total communicabilities . It is worth noticing that the probability given by 28 for a node represents an upper bound for the exact solution of the SI model. Hence, in this way we do not underestimate the contagion probabilities. Let us consider, for instance, the time evolution of an infection propagation on an Erdős-Rényi network with 100 nodes and edge density . Results are illustrated in Fig. 2 for two different values of the infectivity rate, (left) and (right). The dashed red lines represent the mean probability that a node is infected at time as given by equation 28. The solid blue lines represent the same probability as given by the exact solution of the Kermack-McKendrick SI model with the same mean degree. In both plots, the initial probability is .
4 Risk-dependent centrality
Let us designate , which determines the level of risk to which the whole network is submitted at time . For instance, for , i.e. , there is no risk of infection on the network as a node cannot transmit the disease to a nearest neighbor. This situation corresponds to the case of isolated nodes (no edges). When the risk of infection is very high due to the fact that for a fixed value of the infectivity is infinite. Therefore, we call the risk-dependent centrality of the node . That is, the values of reflects how central a node is in “developing” the epidemics on the network. As the networks considered are undirected, this centrality accounts for both the facility with which the node gets infected as well as the propensity of this node to infect other nodes. The index can be expressed as
[TABLE]
which indicates that it counts the number of walks of different lengths, that have started at the corresponding node, weighted by a factor . It is straightforward to realize from the definition of the risk-dependent centrality that it can be split into two contributions. That is, is composed by a weighted sum of all closed walks that start and end at , and by the weighted sum of walks that start at the node and end elsewhere,
[TABLE]
where the first term in the right-hand side represents the circulability of the disease around a given node and the second one represents the transmissibility of the disease from the given node to any other in the network. The circulability is very important because it accounts for the ways the disease has to become endemic. For instance, a large circulability for a node implies that the disease can infect its nearest neighbors and will keep coming back to over and over again in a circular way. We start now by proving some results about these risk-dependent centralities. The following theorem is a special case of results found, for instance, in [11].
Theorem 4.1**.**
*The node ranking given by the risk dependent centralities , with , reduces to the ranking given by the degree in the limit as the risk , and to the ranking given by eigenvector centrality as . *
Proof 4.2**.**
We begin by observing that the ranking of nodes, in terms of their risk-dependent centrality, is unaffected if all the centralities are shifted and rescaled by the same amount. That is, the same ranking is obtained using either or the equivalent measure
[TABLE]
where . Now, we have
[TABLE]
Hence, in the limit of , the ranking given by is identical to degree ranking.
To study the limit for large we write
[TABLE]
We note again that for ranking purposes we can use the equivalent measure obtained by dividing all risk-dependent centralities by the same quantity, , which is strictly positive. That is, we can use
[TABLE]
*Since the network is connected, the Perron–Frobenius Theorem insures that . Hence, each term for vanishes in the limit as , and we see from Eq. 33 that the risk-dependent centrality measure gives the same ranking as eigenvector centrality for large. *
It is interesting to observe that the risk-dependent centrality of every node also depends on the (strictly positive) quantity
[TABLE]
see Equation Eq. 32. The larger this quantity is, the higher is the risk-dependent centrality of each node. Assuming that the dominant eigenvector is normalized so as to have Euclidean norm equal to 1, it is well known that this quantity is always between 1 and . The value 1 is never attained for a connected graph. It can only be approached in the limit as all the eigenvector centrality is concentrated on one node, say node , where it takes values arbitrarily close to 1, with the values for all taking arbitrarily small values. An example of this would be the star graph222We recall that the star graph consists of nodes , each attached to a central node by an edge. for . The maximum value is attained in the case where all nodes have the same eigenvector centrality: (i.e., in the case of regular graphs).
Let us return to the decomposition of the risk-dependent centrality of a node into its two components, circulability and transmissibility. Similar considerations apply to these quantities. We summarize them in the following result.
Theorem 4.3**.**
*The node rankings given by the degree and the eigenvector centrality are obtained as limiting cases of the risk-dependent circulability as the external level of risk decreases to zero or increases to infinity, respectively. The same is true for the risk dependent transmissibility . *
Proof 4.4**.**
The proof for the circulability is a straightforward adaptation of that for the total communicability; see also [11].
We give the details for the transmissibility, which has not been analyzed before. We have for that
[TABLE]
where denotes the number of walks of length two between node and node . Dividing by , summing over all and taking the limit as , we find
[TABLE]
where we have used the fact that , for all . Hence, transmissibility is equivalent to node degree in the small limit. For the large limit we write
[TABLE]
*Dividing by the positive constant and taking the limit as , the second part of the right-hand side vanishes and we obtain again the eigenvector centrality of node . *
Remark 4.5**.**
A natural question is how rapidly the degree (for ) and eigenvector (for ) centrality limits are approached if the number of nodes in the network goes to infinity. From the Taylor expansions (see for example equation 31) we see that the degree limit is reached more slowly if the row sums of grow as . In this case, as increases must be taken smaller and smaller before the ranking reduces to the one given by the degree. On the other hand, if the network grows in such a way that the maximum degree of any node remains uniformly bounded, then the rate of convergence is independent of the number of nodes, at least asymptotically.
The rate of convergence to the eigenvector centrality ranking is largely determined by the spectral gap, . If the gap remains bounded below by a positive constant as , the value of necessary to reach the eigenvector centrality limit is easily seen to grow at most like , and in practice the rate of convergence is scarcely affected by the size of the network. If, on the other hand, the gap closes as , then the rate of convergence to the eigenvector centrality will become arbitrarily slow. The faster the gap closes for , the more rapidly the rate of convergence deteriorates.
We conclude this section with some comments on the measures , and . While they all display the same limiting behavior and provide identical rankings in the small and large limits, they provide different insights on the network structure (and therefore on node risk). For instance, it is well known that subgraph centrality (which is the same as circulability, see [32, 29]) can discriminate between the nodes of certain regular graphs, that is, graphs in which all the nodes have the same degree. The same holds for transmissibility. Total communicability, on the other hand, is unable to discriminate between the nodes of regular graphs (and neither are degree and eigenvector centrality, of course). These measures are also different from a computational viewpoint. One advantage of the risk centrality based on total communicability is that it only requires the computation of the action of the matrix exponential on the vector . The entries of the resulting vector can be computed efficiently without having to compute any entry of , see [10]. Modern Krylov-type iterative methods (like those based on the Lanczos or Arnoldi process) can handle huge networks (with many millions of nodes) without any difficulty. In contrast, the computation of the circulability requires the explicit computation of the diagonal entries of (the node transmissibility is then easily obtained by subtracting the circulability from the total communicability). Although there are techniques that can handle fairly large graphs (see [9]), these calculations are much more expensive than those for the total communicability. This limits the size of the networks that they can be applied to. However, for most financial networks the computation of the circulability is still feasible.
A final consideration regards the values assumed by the external risk parameter . Although, in principle, it can vary between [math] and infinity, for the purposes of most of the applications that follow, it may be sufficient to vary between [math] and . The rationale for using the interval relies on the fact that, at , the rankings given by are already stabilizing around those provided by eigenvector centrality and therefore no more interlacings between rankings are possible. As we will show, we typically observe a single point of interlacement and it usually occurs before reaching the value . Furthermore, this choice is equivalent to fix in the epidemic model solution 25, and, already as approaches , all the probabilities involved in that model become completely negligible or equal to .
5 Risk-dependent centrality on a random network
For the analysis of real-world (financial and economic) networks it is necessary to investigate how informative the results obtained are with respect to the real system under analysis. This significance is typically addressed by comparing to those properties obtained from network null models. As such null models we consider here Erdős-Rényi (ER) random networks with nodes and wiring probability (see [27, 28]), for which, in this section, we provide a series of analytical results. We start by generating a family of simulated ER graphs and discarding simulations for which the obtained graph is not connected.
In particular, we aim at testing how the external risk and the probability , and hence the graph density. , affect the results. For this purpose, we generate graphs with at different values of . For each graph, we compute the main measures for alternative values of . Firstly, we report in Fig. 3 the behavior of risk-dependent centrality , circulability and transmissibility as functions of the density, assuming a fixed high level of external risk, . Since the values of are significantly increasing when the density of the graph increases, we display, in Fig. 3(a), the distributions of the ratio between the risk-dependent centrality of each node and its average value .
As might be expected, the centralities of nodes tend to be similar when and we move towards the complete graph, i.e. we observe a lower variability of the distribution of the ratios. Similar behaviors are also observed for and , with an higher volatility for the circulability (see Fig. 3(b) and Fig. 3(c)).
In Fig. 3(d), we show the distributions of the incidence of the circulability on the risk-dependent centrality , that is the distribution of the ratio again as a function of the density . When , for all the graphs analyzed, the average value is around , implying that the transmissibility has an average incidence of on . It is noteworthy to look at the variability of the distributions. When the density is extremely low, i.e. we refer to a very sparse graph, the heterogeneity of the nodes degree affects the ratio . For instance, when , the circulability of a node ranges approximately from 0.15% to 2.5% of the risk-dependent centrality for the same node. A lower variability is observed for higher densities. For instance, for , the ratio varies between 0.6% and 1.3%. For , we observe a ratio between 0.9% and 1.15%.
In Fig. 4, we show the corresponding behaviors of risk-dependent centrality , circulability and transmissibility as functions of the density, but assuming a fixed low level of external risk, . Again all Figures are based on 1000 randomly generated ER networks with varying between 0.1 and 0.9.
Focusing on the risk-dependent centrality ratio , we observe that the standard deviation between nodes is lower in the low-risk framework () than in the high-risk one (). For instance, when the density is equal to , the standard deviation of the ratio moves from for to for . At a phenomenological level, this behavior can be justified by the fact that differences between nodes tend to be enhanced when the network is highly risk-exposed.
Furthermore, the pattern of for is very peculiar. In this case, when the network is very sparse, nodes show a similar circulability, while higher differences are observed when the density is around 0.5.
Lastly, in Fig. 5, we focus on the ratio and we report the incidence of the circulability on the risk-dependent centrality as a function of the external risk . In case of sparse networks (Fig. 5(a)), when the external risk is low, we have that the infection remains in larger part circulating in a loopy way around the nodes, while only a lower proportion of risk tends to be transmitted to other nodes. This is due to the fact that, for sparse and small, the matrix is strongly diagonally dominant. When the external risk is high, as already observed, we have an average incidence of the circulability on the risk-dependent centrality around . On the contrary, when a very dense network is considered, the ratio is very little affected by the external risk. In this case, both and increase on average at the same rate when increases. However, the decreasing behavior of is noticeable for very low values of .
In what follows we provide an exhaustive proof of the behaviors observed so far. Let us start with the pattern of the ratio at high density (see Figures 3(d), 4(d) and 5(b)).
The asymptotic behavior of this ratio can be explained as a consequence of Theorem A.1 in the Appendix, where we derive the close expressions of the three risk-dependent centrality measures for a complete graph. In fact, as , the ER network approaches a complete network and, for increasing, the ratio approaches , as shown in 44.
Nonetheless, this result can be generalized. In fact, for an ER network which is dense enough, the following property holds for any .
Theorem 5.1**.**
Let be an Erdős-Rényi random graph with nodes and probability . If the edge density of the graph is and , then for any node
[TABLE]
*independently of . *
Proof 5.2**.**
Let us consider as usual that in a connected graph. It is known that in an ER graph the spectral gap . Indeed, as proved in [51], , while and grow more slowly as and for every , respectively.
Then, if , all but the largest eigenvalue lie with high probability in the interval (see [82] and [57]). Therefore,
[TABLE]
The edge density of an ER graph is In [26], it was proved that for , there exists a positive constants such that the following inequality holds
[TABLE]
*which in plain words means that an ER graph of density is “almost” regular when . That is for every node . Thus, the result immediately follows. *
It is worth pointing out that, when the density of an ER network is very low, the standard deviation of the ratio is very large with respect to that of ER networks with large densities (as shown in Figure 3(d)). As we have proved before, the convergence of this ratio to the value takes place only when the density of the graph is relatively large. Let us now analyze what happens when the edge density is very small for large graphs. In this case, we observe a slower decay of the ratio as a function of the external risk in the range (see Figure 5(a)). This fact can be easily proven as follows. In general, both the numerator and denominator of this ratio can be expressed as infinite series of the type:
[TABLE]
[TABLE]
where counts the number of closed walks of length starting and ending at node and counts all the open walks of length starting at and ending at any node . Let us consider
[TABLE]
Then, for certain the numerator of the previous expression is negative, which means that the ratio is monotonically decreasing with . For instance, let us make a second order approximation to the polynomials and . Then, we have
[TABLE]
where is the number of paths of length 2 (wedges) starting at node . In an ER graph and . Thus,
[TABLE]
where is the mean degree. The first derivative of this rational function is
[TABLE]
which is always negative for any and as can be seen in Figure 6. Moreover, the absolute value of this derivative increases as decreases, implying a slower decay in the function for lower densities.
To conclude this section, we want to focus on the rankings produced by the two main centrality measures and and on the similarities between them. In particular, we are interested in determining if, or for what type of networks, the different centrality measures provide similar rankings. To this end, we display in Table 1 the Spearman correlation coefficient between the risk dependent centrality and the circulability for different graph densities and for various values of . On average, we observe a strong positive monotonic dependence between the two centrality measures. As expected, the two measures tend towards the perfect monotonicity as the density arises. It is noteworthy the behavior with respect to . The higher dependence is observed in a low-risk framework (), while a slight reduction is noticeable when higher risk contexts are analyzed, providing again an empirical evidence of the fact that differences between nodes are increased in stressed conditions. Furthermore, this result is in line with the higher incidence of on as vanishes, discussed in the previous lines. For the sake of brevity, we do not report the Spearman correlation between and . However, in all cases, the coefficient is larger than 0.9999.
6 Analysis of real-world financial networks
In this section, we perform some empirical studies in order to assess the effectiveness of the proposed approaches. We consider two different networks. In the first one, we collected daily returns of a dataset referred to the time-period ranging from January 2001 to December 2017, that includes leading U.S. stocks constituents of the 100 index at the end of 2017. Data have been downloaded from Bloomberg. Returns have been split by using monthly stepped six-months windows. It means that the data of the first in-sample window of width six-month are used to build the first network. The process is repeated rolling the window one month forward until the end of the dataset is reached, obtaining a total of 199 networks. The first network, denoted as “1-2001”covers the period of January 2001 to of June 2001. The latter one (“7-2017”) covers the period of July 2017 to of December 2017.
Hence, for each window, we have a network (with ), where assets are nodes and links are weighted by computing the correlation coefficient between the empirical returns of each couple of assets. Notice that the number of assets can vary over time. Indeed, as mentioned, we have considered the 102 assets constituents of the 100 index at the end of 2017. Some of these assets have no information available for some specific time periods. Therefore, in each window, we have considered only assets, whose observations are sufficiently large to assure a significant estimation of the correlation coefficient. However, it is not the aim of this paper to deal with the effects of alternative estimation methods. As a consequence, the number of nodes in the 199 networks varies from 83 to 102 during the time-period.
Then, we follow the methodology proposed in [65, 72] and we use the non-linear transformation, based on distances : . The distance matrix , with elements , becomes the weighted adjacency matrix of the graph . As proposed in [72], we extract the minimum spanning tree . This is a simple connected graph that connects all nodes of the graph with edges such that the sum of all edge weights is minimum. As shown in [72], this minimum spanning tree, as a strongly reduced representative of the whole correlation matrix, bears the essential information about asset correlations. Furthermore, the study of the centrality of nodes and the analysis of the evolution of the tree over time are two critical issues in portfolio selection problem (see [72, 75, 77]).
The second dataset consists of a network of the top corporates in US in 1999 according to Forbes magazine. The network is constructed as follows. First we consider a bipartite network in which one set of nodes consists of companies and the other of Chief Executive Officer (CEO)’s of such companies. As one CEO can be in more than one company, we make a projection of this bipartite graph into the company-company space. In this way, the nodes represent corporations and two corporations are joined by an edge if they share at least one director. We consider two versions of this network, in the first we use the number of directors shared by two companies as an edge weight, and in the second we use the binary version of the first. We will refer to these as to the weighted and binary network, respectively. The network has nodes, made up of one giant component of nodes. We selected the giant component, with its binary and weighted adjacency matrices. For a comprehensive description of this network see, for instance, [20].
Networks, derived by both datasets, have been studied by computing the total communicability, circulability and transmissibility for each node with varying in with step .
6.1 Network of assets
Starting from the asset trees , we measure the relevance of each node by using the risk-dependent centrality and by testing different values of . We consider in Fig. 7 the rankings’ distribution of each asset. Different outcomes of each distribution have been obtained by computing the rankings based on for alternative values of in the interval with step . These results regard the first network “1-2001”, namely, the network based on data that cover the period of January 2001 to of June 2001. We observe that some nodes show a significant variability according to different values of . Indeed, some assets have climbed more than 20 positions in the ranking when increases. For instance, Amazon (node 7 in Fig. 7) moved from position 66 to 41 in case of low and high risk, respectively. Vice versa, Exelon Group (node 32 in Fig. 7) lowered its ranking from 15 to 46. On the other hand, the most central nodes in the network remain very central also when external risk is very high. We have indeed that the top 6 is quite stable for different values of . Top assets only exchange a bit their position, preserving their central role. For instance, United Technologies Corporation (node number 79 in Fig. 7) is at the top of the ranking, independent of .
If we consider the period of the global financial crisis of 2007-2008 (see Fig. 9 and 10), we observe an increase in the rankings’ volatility. In shock periods, centrality of nodes is more affected by the value of . In particular, to catch rankings’ volatility, we report in Figure 8 the standard deviations of rankings of each asset computed varying . In shocks periods, results confirm higher average volatility as well as positive skewed distributions because of a greater number of assets whose ranking is highly affected by the value of . We also tested that differences in average volatility are significant by means of a paired t-test, useful for comparing the same sample of assets at different time periods. When the network 1-2001 is compared with the two networks covering period of crisis (End 2007 or End 2008), we obtain -values around and that confirm strong evidence against the null hypothesis that the average difference between the two samples is zero. As expected, the test is not statistically significant (-value is 0.31) when networks covering period of the global financial crisis are compared.
Concerning the behavior of specific assets, we observe, for instance, that some assets move down by approximately 60 positions from a low risk to an high risk framework. Two examples are represented by Danaher Corporation and Honeywell International (assets 28 and 43, respectively, in Fig. 9). Instead, Accenture PLC (node 3 in Fig. 9) increased its ranking from position 61 to 11.
Even top central nodes are affected by as the volatilities of their rankings show. It is instead confirmed the relevance of United Technologies Corporation (node number 82 in Fig. 9 and 83 in Fig. 10) that is again at the top of the ranking at the end of 2017, independent of . At the end of 2008, the centrality of this asset is also confirmed, although, a bit of variability in the ranking is observed for this firm.
6.2 US corporate network
We now analyze the network of US top corporates in 1999 according to Forbes magazine. Before starting our analysis let us explain the importance of studying a spreading dynamics on this network. According to this network, the board of directors of a given corporation is formed by a few members, some of which are also present in the board of other corporations. Then, such directors serving on more than one board can act as spreaders of information between the corresponding corporations. Such information can be about future (favorable or unfavorable) economic situations, alarms, market opportunities, or anything that could be of interest to the companies in which the director is. Due to the global connectivity of the system, such “information” can be spread across the whole network “infecting” all the corporations in a relatively short time. As we have mentioned before, epidemiological models have also being used for modeling such propagation dynamics (see Section 1.1.).
Hence, we devote this section to the investigation about whether a significant increase of the risk-dependent centrality is a proxy of the vulnerability of the corporate to financial infections propagating on the network. At first, we should remark the fact that the network we are considering here was built based on data corresponding to year 1999. At this year the level of stress of the international economic system was relatively high due to the fact that the East Asian financial crisis occurred in the years 1997–1998, which was also followed by the Russian default of 1998. The two aforementioned financial crises had a ripple effect on the US market. In the literature, for instance, it is well-documented the so-called “fire-sale” FDI (Foreign Direct Investment) phenomenon, that is, the surge of massive foreign acquisitions of domestic firms during a financial crisis ([2, 24]).Thus, the level of stress and infectability of the system for the next few years after 1999 (we will eventually see that these correspond to the period 2000-2002) is expected to be significantly larger than in the subsequent years when the effects of these crises gradually relaxed. Therefore, we proceed our analysis by considering that the level of infectability in 1999 is high and we investigate the effects of relaxing such a condition to lower levels of stress. That is, we start by assuming that in 1999 the external market turmoil could be represented by a value of and we want to find out how the companies change their ranking positions in term of risk-dependent centrality333The analysis has been also developed for circulability and transmissibility, but, since the significantly high rank correlation between and (with Spearman correlation coefficients larger than 0.99), we focus here only on . as vanishes. To this purpose, we set up different initial conditions in the contagion model described by Eq. 18, assigning to each year a different value of the infectability parameter , according to the environmental conditions of the market. Therefore, we let factors reduce year by year in order to reflect a reduction in the overall stress on the network. In particular, we decrease linearly from 1 to 0 in the period 1999-2003. Therefore, rankings based on the risk-dependent centrality computed for allow to assess the relevance of each corporate in . Lowering , we test how the positions of firms vary over time when the external risk reduces. It is noteworthy that the connection between this parameter and the risk could be quite loose but as provided by the following analysis the model seems to work quite well in describing firms that reduce their SVC in the period.
The variation of rankings is then compared with the pattern of the shareholder value creation (SVC) over time. According to the OECD Principles of Corporate Governance, corporations should be run, first and foremost, in the interests of shareholders (OECD 1999). Therefore, companies should work to increase their shareholder values. Increasing shareholders value cannot be done without risk. It is known [61] that in the shareholder value model, companies usually take more risk than needed in order to maximize SVC. As a consequence of this additional risk, companies acquire debts which could make them unstable and more exposed to the risk of bankruptcy. Acquiring large debts is seen as conductive to increasing shareholder value, due to the potential of the company to increase value when it has started from a low baseline. Thus, there is a relation between SVC and risk, because in searching for large SVC the companies increase their risks to attract more investors and increasing potential value gain, but, at the same time, the risk also puts the company in a more vulnerable position to bankruptcy and collapse.
To support our interpretation, we make use of SVCs of the companies444In particular, we use a sample of 337 companies in our network whose SVCs are made available in the dataset available in [35]. in the S&P500 for the period 1999-2003, that have been collected by Fernández and Reinoso (see [35]). Hence, we use SVC as a proxy for risk. Indeed, the global average of SVC reflects very well what happens for the period 1999-2003. After the financial crisis of 1998 the world was at a higher level of risk which is reflected by a dramatic drop of the SVC in year 2000 from a positive value in 1999 to a negative one in 2000. This situations remained until 2002, but eventually recovered to positive in 2003 (see Figure 11). It is noteworthy that the data for SVC was reported by Fernandez and Reinoso for the years 1993-2003. From this long period we select the segment 1999-2003 which contains exactly the valley produced from the financial crisis of 1998 and also because the data used for building the corporate network is of 1999. That is, it corresponds to a segment in time in which the world economy drop due to a crisis and then eventually recovered from it.
We focus our statistical analysis on the predictability of the risk-dependent centrality on the evolution of the SVC. We consider the evolution of the SVC of a company for the period 1999-2003, which is the period immediately after the network of corporate elite in US was built. As a proxy for the evolution of the SVC of a company we consider the Pearson correlation coefficient of the ranking position of the company based on SVC versus the reciprocal of the year. In this case, a negative (positive) value of indicates that the corresponding company decreases (increases) its SVC from 1999 to 2003 when the global external infectability decreases. Therefore, we apply a Linear Discriminant Analysis (LDA) to classify the companies into two groups: (i) those with negative trend in the SVC for this period, and (ii) those with a positive one. The only predictor used for this classification is the parameter . This parameter is the difference between the ranking position of the company when and the ranking position of the same company when . In other words, a negative (positive) value of means that the company dropped (increased) its exposure to risk when the infectability of the system is lower.
Before proceeding with the application of the LDA on the whole sample at disposal, we eliminate a few companies whose correlation coefficient between SVC and the reciprocal of the year is marginal (i.e., close to zero). We test empirically the effect produced by the removal of companies for which for different values of the threshold , e.g., . The best classification of the companies into the two groups analyzed is obtained by eliminating those companies for which . In this case the total accuracy of the LDA model is 60.5%. That is, 200 out of 332 of the companies are classified correctly in their respective groups representing their trends in shrinking SVC or expanding it. In particular, the fitted LDA model is , where is the predicted response variable of our analysis that allows to classify companies in their respective group. The positive coefficient of the variable indicates that: (i) increasing the exposure to risk () tends to expand the SVC of the company, and (ii) decreasing the exposure to risk () tends to shrink the SVC of the company. For both groups, we report in Figure 12(a) a comparison between the predicted value with the LDA and the observed value for each firm. Red squares below the line and blue circles over the line are well classified, while blue circles below the line and red squares over the line are wrongly classified. Furthermore, in Figure 12(b) we report the related confusion plot,where the number of true negative and true positive are on the anti-diagonal (bottom and upper parts, respectively) and the number of false negative and false positive are on the main diagonal (bottom and upper parts, respectively). It is noticeable the low classification performance of the model, when only companies that expand their SVC are considered (in this regard, see in Figure 12(b) companies that belong to the observed class denoted with the sign “+”). For instance, from 147 companies in the network which increase their SVC in the period 1999-2003 only 36 are correctly predicted by in their class. On the contrary, from the 175 companies that shrink their SVC in the period 1999-2003, the variable correctly predicts 157 companies in this class. That is, the risk-dependent centrality of the companies clearly identifies about 90% of the companies which will shrink their SVC in the period 1999-2003, using only data referring to the year 1999. In plain words, our results indicate that diminishing the exposure to risk when the external conditions of infectability are low, with high probability, reduces the SVC of a company.
Let us conclude with the following remark. Even if “good” companies increase their risk-centrality ranking as vanishes, it is worth noting that this occurs when the global stress in the market is very low. When the infectability rate is very low, the absolute probability of getting infected also remains very low for both “good” and “bad” companies. To show this fact, let us consider that, according to our model, the probability that a given corporate is not affected by a crisis propagating inside the network is given by , where again and are the initial probabilities to have infected and not-infected nodes, respectively. Hence, the ratio between the probabilities of two nodes and to pass successfully through a crisis is given by . We compute these ratios for different couples of corporates operating in a similar sector, a “good” one and a “bad” one (Fig. 13).
As expected, at low the probability of not being infected by a crisis is the same for both high and low risk-centrality companies. But this ratio decreases very quickly as increases and this means that for companies that reduced their risk (e.g., Lucent Technologies, Morgan Stanley, Union Carbide and American Express) the probabilities to stay safe during a crisis are very small if compared with the analogous probabilities for companies that increased their risk (e.g., General Electric, Bank One, Ashland and Bank of America).
7 Ranking interlacement
During the analysis of the two real-world networks studied above, we have noticed that with the change of some nodes vary their ranking significantly, to the point of changing their positions relative to each other. For instance, in Fig. 14 we illustrate six pairs of corporates that interlace their positions with the change of the global infectability in the network. In the first pair, Fig. 14(a), we see that at low levels of infectability, i.e., , J.P. Morgan&Co Inc. (red) occupies a position in the ranking of more at the bottom than Bank of America Corp. (blue). That is, at low global infectability J.P. Morgan&Co is exposed to less risk than Bank of America. However, when the global infectability in the network increases (), Bank of America is exposed to less risk than J.P. Morgan&Co. A similar interlacement is observed between the other couples in Fig. 14. For instance, in Fig. 14(f), the interlacement between rankings for General Motors Corp. (red) and Boeing Co. (blue) occurs at a smaller value of than for the previous cases. Before proceeding with the analysis of this phenomenon, we would like to remark that the existence of ranking interlacement means that the ranking of the nodes in a network based on the risk-dependent centralities is not unique and fixed as in the case of other classical centrality measures, e.g., degree, eigenvector, closeness, betweenness. Here instead the ranking of nodes depends on the global external conditions to which the network is submitted.
In order to shed light on the issue of ranking interlacement we will make use of different representations of the risk-dependent total communicability and circulability measures (the transmissibility is obtained as the difference of these two and can be treated accordingly). First, expanding the matrix exponential in a power series gives the representation
[TABLE]
where denotes the number of walks of length starting from the node , with . In particular, , the degree of node . Similarly,
[TABLE]
where now is the number of closed walks of length through node ; in particular, , , , and , where is the number of triangles node participates in.
Second, we recall that the spectral theorem yields the formulas
[TABLE]
Using Eq. 37-Eq. 38, we readily see that both functions of are absolutely monotonic for , i.e. they are positive and infinitely differentiable on , with all the derivatives being nonnegative. In particular, both functions are strictly increasing and strictly convex.
Definition 7.1**.**
*We say that the rankings of node and node based on the circulability interlace at if and there exists an such that changes sign exactly once in . *
In other words, nodes and interlace at if the plots of and cross for . We note that, in principle, it is possible to have for some value of without interlacing taking place. Two cases are possible: in the first one, the two curves touch at the isolated point (without crossing), and in the second one the two functions are identical on an open neighborhood of and, therefore, for all since they are analytic functions. In practice, either scenario is very unlikely to occur, at least for real world networks. Note that points of tangency must satisfy the additional condition .
An analogous definition can be given for the ranking based on other -dependent measures, like the total communicability . In the following we limit our discussion to the interlacing of rankings according to the circulability, but analogous observations hold for the total communicability and transmissibility functions.
Identifying the interlacing points (if they exist) requires to find the roots of the transcendental equation , or
[TABLE]
Even if we knew the eigenvalues and eigenvectors of explicitly, there is no general closed form expression for the roots of the transcendental function . Of course one could resort to numerical root-finding techniques, but this would be impractical for large networks. Here and below we give a qualitative discussion followed by a heuristic approach that yields approximations that seem to work well in practice.
We begin with the following result. It applies to both circulability and total communicability-based rankings, and in fact for a much larger class of parameter-dependent centrality ranking functions, including Katz centrality [53]. We remind the reader that we restrict the risk rate to positive values.
Theorem 7.2**.**
*Let and be two nodes with different eigenvector centrality: . Then the number of interlacing points for and is necessarily finite (possibly zero). *
Proof 7.3**.**
Let us assume that there is at least one pair of nodes, and , whose rankings interlace, so that has at least one positive root. Observe that the ranking of node provided by is identical to that obtained using
[TABLE]
*As this quantity tends monotonically to for , there exists a such that no rank interlacing with node can occur for , since all the node rankings must stabilize on the eigenvector rankings in the large limit. Hence, all interlacing points must fall within the compact interval . Suppose that the number of interlacing points is infinite. By the Bolzano-Weierstrass Theorem, this set has a point of accumulation. But since is analytic, and zero on this set, it must be identically zero everywhere, which contradicts the assumption that there is at least one interlacing point in . *
As a consequence:
Corollary 7.4**.**
*If all nodes in the network have different eigenvector centralities, the total number of interlacing points is finite (possibly zero). *
A sufficient condition for the existence of at least one interlacing point for the pair of nodes and is that (or ) while (resp., ). This follows from Theorem 4.3: since interpolates smoothly between degree centrality and eigenvector centrality, the only way that a node with higher degree can have lower eigenvector centrality than another node is that the corresponding circulabilities interlace at some value . If more than one interlacing point exists, this number must be odd, for otherwise the node with higher degree would also have higher eigenvector centrality than the other node. That the above condition is not necessary is made clear considering the possibility of an even number of interlacing points. A necessary condition for the existence of at least one interlacing point is that there exist at least two values of , say and , for which and have different sign. Indeed, it is obvious from Eq. 37-Eq. 38 that if (say) for all , then no rank interlacing point exists. That this condition may not be sufficient is suggested by the fact that the series expansions contain an infinity of terms.
We mention that the same problem has been studied, for a different centrality function (the Katz resolvent), by [56] independently of us.
7.1 A back of envelop approach
We now consider heuristics based on truncated series expansions. Let be the smallest value of such that the sequence of values undergoes a sign change (here zero is considered positive). If no such exists, then no interlacing can take place, as we already observed. We consider approximating with its truncation to an order :
[TABLE]
where we recall that . We emphasize that this polynomial approximation assumes that is small, since the error in it is . In alternative we can also use as a surrogate for the same polynomial shifted by 1 and divided by :
[TABLE]
where now the error is . We can now use these polynomial approximations to try to locate, approximately, any interlacing points sufficiently small in magnitude. This requires finding the (positive) roots, if any, of the polynomial equation of degree :
[TABLE]
It is well known that for degree greater than or equal to 5 there is no closed form expression of the solutions of an algebraic equation involving only arithmetic operations and root extractions, so in general if we will have to resort to numerical methods for solving Eq. 41. Evaluation of the coefficients requires computing the diagonal entries of powers of the adjacency matrix , which can be expensive for very large graphs and large values of .
As the simplest possible example, we consider the case where and (or vice-versa), i.e., . Taking , equation Eq. 41 becomes the linear equation
[TABLE]
which admits the unique solution , which is of course positive. In terms of the degree of the nodes and the number of triangles in which they take place, this can be written in the form:
[TABLE]
In the case of weighted networks, the degree is replaced by the weighted degree or strength, and the number of triangles is replaced by the weighted number of cycles of length 3, i.e., the weight of a cycle of length 3 is the product of the weights at its three edges. A priori, there is no reason to expect that this value is close to an actual interlacing point (assuming it even exists), since the behavior of higher order terms may more than offset the influence of the negative term involving . Better approximations might be obtained by considering higher order approximations; for example using leads to an easily solved quadratic equation in , leads to a cubic, and so forth. In any case, these are heuristics whose usefulness can only be assessed experimentally on concrete examples. We emphasize that the use of power series truncation requires knowledge of , since truncating the series at orders lower than would lead to an equation devoid of positive solutions and therefore to concluding that no interlacing points exist for a given pair of nodes, even if such points do exist.
It is also worth recalling Descartes’s Rule of Signs, according to which the number of positive real roots of a polynomial (counted with their multiplicities) is equal to the number of sign changes in the (nonzero) coefficients or less than that by an even whole number, when the powers are ordered in descending order. If, moreover, the polynomial is known to have only real roots (as in the case of a symmetric adjacency matrix, i.e., of undirected networks) then the number of sign changes is exactly equal to the number of positive roots. It is then obvious that if the power series is truncated at order , i.e., as soon as we observe the first sign change in the coefficients, then there will be exactly one positive root and therefore only one (approximate) interlacing point can be found by this method. A polynomial truncation of higher degree may have more than one positive root, depending on the number of changes in the coefficients (assuming the network is undirected). We will come back to this case shortly.
To exemplify the previous finding let us consider a pair of nodes with a small difference in their degree, e.g., , then , such that if, for instance, and we let vary from [math] to we obtain the plot given in Fig. 15(a). As can be seen there are certain values of for which we can obtain positive and negative values of . This is illustrated in Fig. 15(b) where we can see that when there are both positive and negative values of . In other words, it is possible to find pairs of nodes for which and then , which means that these nodes will change their ranking position in terms of the risk-dependent centrality when the values of change even for a relatively narrow window. Notice that if , and such change is not observed for the corresponding range of analyzed.
If we now consider a large difference in the node degrees, e.g., , and the same range of change for the difference in the number of triangles, e.g., we do not observe any variation in the ranking of pairs of nodes as can be seen in Fig. 16(a). In this case the range of must be increased dramatically to obtain inversions in the ranking of pairs of nodes (see Fig. 16(b)).
To illustrate how well the estimate Eq. 42 performs, we use it for approximating the interlacement point for several pairs of corporates and compare them with the observed values in Table 2 for the weighted version of the US corporate network.
A few more general considerations on the validity of the power series truncation heuristic can be made. The size of the interval containing any interlacing points is dictated to a large extent by how quickly the rankings based on the measures (or ) stabilize near the rankings obtained using eigenvector centrality. This, in turn, depends on the spectral gap : the larger the gap, the faster the eigenvector centrality rankings are approached for increasing values of . Hence, in the case of relatively large gaps, we expect any interlacing values to occur for fairly small values of . In this case, the heuristics based on polynomial approximations may be justified, since interlacing is likely to occur already for small values of . As is well known, however, it is not easy to determine when the spectral gap is “sufficiently large”. On the other hand, when the spectral gap is tiny, then the interval is going to be larger and therefore there is “more room” for the occurrence of interlacing. Unfortunately, in this case it is not clear that polynomial truncation will be effective in approximately locating the interlacing points. In this case, a possible solution is to expand the functions not around the value , but also around a few values . This strategy can also be used to find a possible second point of interlacing after having found a first such point . Expanding around leads to
[TABLE]
Dividing by and setting the result equal to zero leads to an algebraic equation of degree for ; the smallest positive root of this equation, if there are any, leads to the approximation for the next interlace point, and so forth.
Completely analogous considerations apply to the approximation of interlacing points when the ranking of nodes is done according to the risk-based total communicability measure . In this case the transcendental equation to be solved is given by
[TABLE]
Let . Then, truncating the series expansion Eq. 37 and dividing by leads to the approximation
[TABLE]
for the equation whose smallest positive solution approximates the first interlacement value for the rankings of nodes and , assuming it exists; here again where now is the smallest integer value for which the sequence changes sign. The simplest possible case is when , which occurs when and have different sign. In this case Eq. 43 reduces to the linear equation
[TABLE]
with the unique root
[TABLE]
8 Risk prediction and COVID-19
Since we submitted the first versions of this work, a pandemic has been expanding from the city of Wuhan, Hubei province of China [83, 84] starting on December 2019. This disease is produced by a new coronavirus named SARS-CoV-2 [45] and has affected in about three months more than 200 countries around the world. The main problem right now is of a medical nature, but as stated by Balwing and Weder di Mauro this coronavirus is “as contagious economically as it is medically” [4]. One of the most important characteristics of this pandemic in comparison with recent ones is that it is hitting very strongly the most important economies in the world: China, USA, Germany, Italy, Spain. There are some preliminaries studies about the macroeconomic impacts of this pandemic (see for instance [4]). However, it is important to apply mathematical and computational techniques to forecast, at regional, national and international level, the impact of this crisis on financial institutions, corporations and small companies. All of them are highly interconnected in a globally dependent economy, forming series of complex networks. In this new scenario the current work represents an opportunity for modelers to advance predictions on the potential risks to which different institutions are submitted to in the current situation. This modeling scenario consists of the networks of interactions between the institutions under analysis assuming a high infectability in the network. Using the transmissibility and circulability measures defined here, the modeler can understand how at risk of transmitting the crisis to others or, respectively, of staying in a cycle of repeated economic difficulties, a company is. At the same time, the current work allows to model how different palliative measures taken by regional or global financial institutions in the European Union, USA or China can impact these companies. In this case, the modeler should drop the infectivity of the system and analyze how the ranking of risk for the different companies changes to gain insights about their potential recovery or bankruptcy.
9 Conclusions
In general, node centrality in networks are of either of two types: (i) node centrality in networks of time-invariant topology [29], or (ii) node centrality in networks of time-dependent topology (aka temporal networks) [49]. In this work we have developed a new concept of node centrality, depending on both the topology of the network and the external conditions to which the network as a whole is submitted. In particular, we have focused on global risk as the external factor by which an economic and financial network is affected. We started by considering the “Susceptible-Infected” model and its connection to the communicability functions of nodes and edges in a network. Then, we developed a few centrality measures which depend not only on the local and global topological environment of a node but also on the level of infectivity stressing the system as a whole. In this way we have been able to make predictions in financial and economic systems about the changes in the risk-dependent centralities of nodes as a function of the global level of infectivity in the system. We observe that without altering the topology of the network, i.e., without varying any connection between the nodes, the ranking of the nodes, according to these new centrality measures, changes significantly as the infectivity rate changes. In the real-world networks studied here we have been able to associate those changes in the risk-dependent centrality of nodes with events of the real financial and economic worlds in which these networks are embedded. In closing, we provide here both theoretical, computational and empirical evidences that the node centrality is not a static function even when the topology of the system is not varying at all. This new paradigm is expected to play a fundamental role in assessing the robustness of financial and economic systems to the variation of the external conditions which they are submitted to.
Appendix A Risk-dependent centrality measures for complete graphs
The following theorem provides a close expression for , and for a complete network.
Theorem A.1**.**
The risk-dependency for each node in a complete graph is given by
[TABLE]
and the circulability and transmissibility are given by
[TABLE]
Proof A.2**.**
For a complete graph, , because of the mutual orthogonality between and the principal eigenvector of constant components. That is, is completely determined by the eigenvector centralities which of course are equal for every node and equal to . Since , we obtain:
[TABLE]
Subgraph centrality close expression for a complete graph is provided in [32]:
[TABLE]
Multiplying each entry in by and summing up the power series, we get
[TABLE]
*By difference, we get . *
An important remark concerns the ratio . Indeed:
[TABLE]
Similarly,
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] F. Allen and A. Babus , Networks in finance in The network challenge: strategy, profit and risk in an interlinked world, Wharton School Publishing 2009.
- 2[2] R. Alquist, R. Mukherjee and L. Tesar , Fire-sale FDI or Business as Usual? , Working Paper, National Bureau of Economic Research, 2013.
- 3[3] H. Amini, R. Cont and A. Minca , Resilience to contagion in financial networks , Math. Financ, 26(2), 2016, pp. 329-365.
- 4[4] R. Balwin and B. Weder di Mauro , (Eds). Economics in the Time of COVID-19. CEPR Press, London, 2020.
- 5[5] A. Barja, A. Martínez, A. Arenas, P. Fleurquin, J. Nin, JJ. Ramasco, E. Tomás , Assessing the risk of default propagation in interconnected sectoral financial networks EPJ Data Science, Dec 1, 8(1) (2019), 32.
- 6[6] S. Battiston, D. Delli Gatti, M. Gallegati, B. Greenwald and J.E. Stiglitz , Liaisons dangereuses: Increasing connectivity, risk sharing, and systemic risk J. Econ. Dyn. Control, 36(8) (2012), pp. 1121-1141.
- 7[7] S. Battiston, D. Delli Gatti, M. Gallegati, B. Greenwald and J.E. Stiglitz , Default cascades: When does risk diversification increase stability? , J. Financ. Stabil., 8(3) (2012), pp. 138-149.
- 8[8] S. Battiston, M. Puliga, R. Kaushik, P. Tasca and G. Caldarelli Debt Rank: too central to fail? financial networks, the FED and systemic risk , Sci. Rep. 2, 541 (2012) pp.1-6.
