Approximate Identification of the Optimal Epidemic Source in Complex Networks
S. Jalil Kazemitabar, Arash A. Amini

TL;DR
This paper develops a statistical framework to accurately identify the source of an epidemic in complex networks, outperforming traditional methods especially in social and community-rich networks.
Contribution
It introduces a Bayesian optimal solution and two scalable algorithms for epidemic source detection applicable to general network topologies.
Findings
The proposed methods outperform geometric and spectral measures on real social networks.
The algorithms are scalable to large sparse networks.
Performance is robust across networks with high transitivity and community structure.
Abstract
We consider the problem of identifying the source of an epidemic, spreading through a network, from a complete observation of the infected nodes in a snapshot of the network. Previous work on the problem has often employed geometric, spectral or heuristic approaches to identify the source, with the trees being the most studied network topology. We take a fully statistical approach and derive novel recursions to compute the Bayes optimal solution, under a susceptible-infected (SI) epidemic model. Our analysis is time and rate independent, and holds for general network topologies. We then provide two tractable algorithms for solving these recursions, a mean-field approximation and a greedy approach, and evaluate their performance on real and synthetic networks. Real networks are far from tree-like and an emphasis will be given to networks with high transitivity, such as social networks…
| Network | mean | max | clust. | |
|---|---|---|---|---|
| deg. | deg. | coeff. | ||
| Internet | 10670 | 4.0 | 2312 | 0.01 |
| Power | 4941 | 3.0 | 19 | 0.1 |
| Wiki vote | 7066 | 29.0 | 1065 | 0.13 |
| UCSC68 | 8979 | 50.0 | 454 | 0.17 |
| UC64 | 6810 | 46.0 | 660 | 0.19 |
| DC-SBM | 1962 | 66.0 | 897 | 0.3 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Approximate Identification of the Optimal Epidemic Source in Complex Networks
S. Jalil Kazemitabar
Department of Statistics
University of California, Los Angeles
&Arash A. Amini
Department of Statistics
University of California, Los Angeles
Abstract
We consider the problem of identifying the source of an epidemic, spreading through a network, from a complete observation of the infected nodes in a snapshot of the network. Previous work on the problem has often employed geometric, spectral or heuristic approaches to identify the source, with the trees being the most studied network topology. We take a fully statistical approach and derive novel recursions to compute the Bayes optimal solution, under a susceptible-infected (SI) epidemic model. Our analysis is time and rate independent, and holds for general network topologies. We then provide two tractable algorithms for solving these recursions, a mean-field approximation and a greedy approach, and evaluate their performance on real and synthetic networks. Real networks are far from tree-like and an emphasis will be given to networks with high transitivity, such as social networks and those with communities. We show that on such networks, our approaches significantly outperform geometric and spectral centrality measures, most of which perform no better than random guessing. Both the greedy and mean-field approximation are scalable to large sparse networks.
1 Introduction
Modern transportation networks have had profound effects on geographical spread of infectious diseases [1, 2] giving rise to complicated epidemic evolutions [3]. These evolutions can be modeled as dynamic processes on transportation networks. The epidemic spread on networks can take other forms, such as outbreaks of foodborne diseases [4], intercontinental cascade of failures among financial institutions [5, 6], computer malware propagation on the interent and mobile networks [7, 8] spread of targeted fake news [9, 10] and rumors [11] on social media, especially during presidential elections [12, 13, 14]. In response to an adverse diffusion on a network, it is critical to trace back sources to enable appropriate prevention and containment of the spread [15]. Inferential methods have been developed to locate the source of foodborne diseases [16, 17] and influenza pandemics [18, 19]. In the context of online social networks, the spread of misinformation can be limited by the identification of influential users [20, 21]. Source recovery can also be used to assess the power of diffusions in generating anonymity in network protocols [22].
The epidemic source identification problem has received considerable attention in the past decade. Given a snapshot of the infected nodes in a network, the task is to discover who has originated the epidemics. Since the seminal work of Shah and Zaman [23], numerous attempts have been made to address the question and its extensions [24, 25, 26, 27, 28, 29]. By now, there are multiple methods that show satisfactory results in limited experimental setups or have proven guarantees in restricted network topologies [30]. However, identifying the source under general conditions still remains a difficult task. Even under fairly simple models such as the Independent Cascade (IC) dynamics, the problem of optimal recovery appears to be NP-hard in infection size [31]. The theoretical guarantees for optimal and consistent recovery are restricted to regular infinite trees [23, 26], and as we show in this paper, the popular and well-cited methods are quite unreliable in a wide range of real networks.
Source identification has remained largely unsolved and poorly understood for real complex networks. As we will show through experiments in Section 5, in real networks, even the optimal Bayes estimator applied to small infected sets has difficulty narrowing down to the true source. It is thus important to recover as much information from the likelihood of the model as possible. We develop techniques for computing the full likelihood of the infection, as opposed to identifying the most likely sample-path [26]. Moreover, we fully exploit the information from the boundary of the infection set, in addition to the structure inside the infected subgraph. This idea has been pointed out before [32], but has been mostly neglected by subsequent work; cf. [29, 24]. We develop all these ideas without restricting the structure of the network to trees. Our framework also easily extends to the case where there are multiple infecting sources (Appendix A).
In this paper, we develop statistical algorithms that outperform the state-of-the-art in a wide range of network topologies. Our contributions are distinct in several ways:
Our methods are parameter-free, meaning that they do not require knowing the duration of the epidemic or how fast it grows. 2. 2.
We show that the exact maximum likelihood estimator (MLE) of the source—or equivalently the Bayes optimal solution under uniform prior—can be written as a dynamic programming (DP), with easily computable coefficients based on the adjacency matrix of the network. 3. 3.
We develop two schemes to approximate the DP: an efficient greedy elimination (GE), and a novel mean-field approximation (MFA) of the likelihood, computed by solving a linear system. 4. 4.
Our approximations are more disciplined than existing approaches. They do not impose restrictions on the topology of the network. Nor do they appeal to the partial likelihood of the candidate infecting sets. This is in contrast to the use of spanning trees to deal with general topologies [23] or the path-based approaches that rely on the likelihood of individual paths from potential sources to the infected set [26].
We will show that when applied to real networks, both approximation schemes (MFA and GE) outperform various geometric and spectral approaches, most of which perform no better than random guessing. We also show that even for basic models of real networks, e.g., models with community structure, most existing methods dramatically fail. The improvement in performance is most significant for the networks with many cycles, including social networks that are known to have high transitivity. In terms of computational efficiency, both the greedy and mean-field approximations are superior to the state-of-the-art likelihood-based and spectral approaches and comparable to centrality-based methods. In addition, the mean-field algorithm is easily parallelizable through standard linear algebraic routines and can be used to tackle very large-scale epidemics on real networks.
Related work.
Most of the existing literature on the source identification problem are based on a SIR dynamic where the infection spreads with an exponential rate proportional to the number of infected neighbors. All nodes are susceptible to the infection and once infected may recover with a fixed exponential rate [33]. Moreover, the spread of infection through edges are mutually independent. Different variations of SIR may assume that no recovery is possible (SI) or the recovered is not immune to iterated infections (SIS).
Shah and Zaman [23] considered the SI dynamics and proposed the Rumor Centrality (RC), which counts the permitted permutations, a.k.a. infection paths, inside the infected subgraph. Their linear time algorithm is an optimal estimator in regular trees and enjoys strong theoretical properties in such idealized settings [34]. Zhou and Ying [26] consider SIR dynamics on a tree and show that the most likely infection path is rooted at a Jordan center (JC) of the infected set , that is, a node with minimum eccentricity (i.e., maximum distance to others). It has been shown [26, 34] that in regular trees, eccentricity ranking generates, with high probability, a confidence set containing the true source, whose size does not grow with the infection size.
The Dynamic Message Passing (DMP) was proposed in [25] as an approximation of the maximum likelihood estimator in discrete SIR epidemics, by assuming that the marginal probabilities of infection for each node are independent. Despite compelling performance, DMP is computationally intensive and impractical for large networks with moderately dense structures, even for small infection sets. A spectral algorithm, called Dynamical Age (DA) was introduced in [24], based on measuring the sensitivity of the maximum eigenvalue of the Laplacian matrix to the elimination of each node in the infection set. The algorithm was mainly developed to discover the initial node in a growing preferential attachment model. Another spectral method for the discrete SI model is proposed in [29].
2 Source detection in SI epidemics
We consider a continuous-time susceptible-infected (SI) epidemic [33] with rate of infection , on a static undirected network with known edge set and . At time zero, all nodes but the source are in the susceptible state. Infection is a terminal state and susceptible nodes are exposed to the infection at an exponential rate proportional to the number of their infected neighbors. More precisely, given that nodes are infected at some time , we run exponential clocks for all and the first to expire determines the next infected node: If , then the dynamics move to the infected set at time . It is clear that the contagion will eventually spread through the entire graph.
The infection source or patient zero, denoted as , is unknown. What we observe is a snapshot of the contagion at some time , meaning the entire set of infected nodes at that time, which we denote by . The objective is to find or form a confidence set for with desired false exclusion probability. Our focus here will be on the single source setting, but the analysis is extensible to the multi-source setting (cf. Section A).
Notation. We write for the adjacency matrix of the network and for the volume of a cut in the network between subsets of nodes. For singleton subsets, we often drop the braces, e.g., and .
2.1 Time and rate invariant analysis
We start by examining the probability of observing a particular set of infected nodes given a starting source. Let us introduce a parameter-free formulation of the problem (i.e. not dependent on rate and time ) that will be the foundation for our analysis of the continuous SI dynamics. The idea has been introduced in [32]. We generalize it to multiple sources and find forward and backward dynamic programming formulations that allow us to derive efficient approximations.
Suppose that, at some point in time, the infection reaches . Let be some superset of . We are interested in computing , the chance that all the nodes in are infected before any node outside. More precisely, let
[TABLE]
We refer to as the transition probabilities. Note that these transition probabilities are independent of the infection source. Given that in a snapshot of the contagion, nodes are infected, determines how likely it is that in some future snapshot, is the set of infected nodes. The Markov property of (continuous-time) SI dynamics allows us to define without reference to the source, or the time of the first snapshot. We will also show that these probabilities do not depend on the infection rate or the time of the second snapshot.
2.2 Statistical Inference
Given the observed (random) infected set , the function is the likelihood of the model. Writing for this likelihood, we observe that for all not contained in . So, we can restrict to all subsets of . When dealing with the single-source setup, we restrict the parameter space to and with some abuse of notation write for , and for the likelihood.
We can further consider a Bayesian setup by putting a uniform prior on the source (i.e., uniform over ). The Bayesian setup allows us to consider various notions of optimality by changing the loss function. Letting be the random initial source, we have a joint distribution on . Then the posterior probability that the source is , given that we observed infected nodes is
[TABLE]
Therefore, the maximum a posteriori (MAP) estimate of the source is which minimizes the probability of error. That is, minimizes for any estimator . In some applications, the graph geodesic distance () to the source determines the error of estimation. In that case, the Bayes optimal estimator is It is not hard to see that minimizes among all possible estimators .
A third choice is to output a ranking instead of a single source. In this case an estimator is formally a permutation on , suppressing the dependence on for simplicity. We can then consider the rank loss , and we call the associated risk the expected (source) rank . The corresponding optimal Bayes estimator is obtained by minimizing the posterior risk:
[TABLE]
Noting that , the optimal estimator in this case is the ranking that sorts into descending order, i.e., where .
Remark 1**.**
The distance loss might be suitable in some applications, but in general it is a poor measure if the goal is to reveal the actual source. This is especially true in small world networks, including most social networks, where the expected distance between any pair of nodes is small. On the other extreme, in terms of the precision in recovering the source, is the zero-one loss which is too stringent. The rank loss can be considered a more robust version of the zero-one loss, and we will be our main evaluation measure.
3 Exact likelihood computation
The Bayesian estimators introduced in Section 2.2 require us to evaluate the posterior probabilities , or equivalently the likelihood values for all . The main difficulty of the source identification problem is that computing the likelihood is itself challenging. We now develop exact equations that allow us to recursively compute the likelihood values for all subsets .
Dynamic programming.
To begin, note that for any . In addition, whenever corresponds to a connected component of . We develop two dynamic programming expressions for for general :
Proposition 1**.**
For , the probabilities defined in (1) satisfy the forward program
[TABLE]
and the backward program
[TABLE]
In the forward programming (2), effectively iterates over the boundary of in , as if is outside that boundary. Therefore, the running time of the forward programming benefits from the sparsity of the network. Unlike the forward programming, the iteration over in (3) cannot be restricted to a smaller set. A corollary of Proposition 1 is that the transition probabilities are not affected by the rate and the duration of the infection.
Let us now observe some connection with the path-based analysis. A permitted permutation or an infection path starting at a node , refers to a permutation of nodes with , and such that is connected to at least one node in , for all . Notice that the probability of observing a given infection path is
[TABLE]
where . As noted in [32], one can obtain the transition probability by summing (4) over all infection paths such that and . Our recursive representation is novel, avoids these explicit summations, and will be key in deriving approximation schemes for in Section 4.
Path-based approaches such as Jordan center [26] forgo computing the complete likelihood (i.e., avoid summing the odds of all infection paths) and instead find the most probable path, that is, one that maximizes (4). In contrast, equations (2) and (3) compute the complete likelihood of the infection set, which has the following advantages over the path-based likelihood: It fully exploits the structure of the graph inside the infection set, not just a spanning tree or a permitted permutation of nodes in the infected subgraph. Moreover, it takes into account the boundary of the infected subgraph via .
NP-hardness. The recursions we derived are a dynamic programming (DP) solution that computes the likelihood more efficiently, by avoiding the direct summation over all possible paths. More precisely, one can obtain the optimal likelihood values by solving recursion (2) backwards: Starting from , we can determine for all proper subsets of maximal size (i.e., ) and then all the proper subsets of those at the previous stage and so on. However, this DP procedure may still take time in the worst case. It is an interesting open question whether polynomial-time solutions for computing the likelihood or its maximum exist.
Although it has been shown in [23] that for infinite regular trees, computing the likelihood reduces to enumerating all the infection paths from to , and therefore a polynomial-time algorithm exists in that case, the behavior of the exact likelihood under more general network topologies is much more difficult to investigate. Researchers have implicitly assumed the problem to be hard beyond trees and have resorted to approximations that are often based on heuristics. In the next section, we propose disciplined approximation schemes that yield much more accurate results under realistic classes of networks.
4 Approximations
We now provide two approximations to the likelihood function based on the exact dynamic programming developed in Proposition 1.
Greedy Elimination (GE).
We can obtain a singleton source set that maximizes with greedy elimination of elements in . The algorithm we propose is based on the backward recursion (3) and is detailed in Algorithm 1. We start with and consider all maximal proper subsets of that induce a connected subgraph of . Among those, we choose the one that maximizes the transition probability to , i.e. . Suppose that is the maximizer. Next, we iterate the same procedure for and so forth, until we reach a singleton set . The procedure has an runtime where and is the number of edges in the infected subgraph, .
GE has a Bayesian justification. Let be the random infected set after steps. Suppose that we want to find the MAP for given . The Bayesian posterior probability is
[TABLE]
Whenever is connected, the prior is positive. GE finds a proxy for MAP through maximizing the evidence and ensuring the prior is positive.
Algorithm 1 has similarities with finding the most likely path from a source to the observed snapshot. Chang et. al. [32] propose a similar path-based search called GSBA. They start from each node in and approximate the most likely path and use it as a proxy to the most likely source. Algorithm 1, however, does this greedy search in a backward fashion.
Mean-field Approximation (MFA).
We now approximate by the mean-field technique. The idea is to treat the set function as if it was a distribution (or measure) on and approximate it by the product of its marginals. Fix a subset . For any , let be the binary representation of , i.e. for any . We find and such that
[TABLE]
is a good approximation to for all , in the sense of minimizing the quadratic deviation from the solution of the recursion (2). First note that since . Next, we plug-in into the forward recursion, to get
[TABLE]
Dividing both sides by gives These equations in general cannot be satisfied exactly for all . Instead, letting , we solve the following least-squares problem:
[TABLE]
where and are defined as follows:
[TABLE]
The solution of (6) satisfies the normal equations . The following proposition shows that and can be computed efficiently. Let be the adjacency matrix of the network.
Proposition 2**.**
The solution of (6) satisfies the linear system with and given by
[TABLE]
*where and . Here is the element-wise matrix product, is a matrix operator that returns the same matrix with double the diagonal entries, and is the vector of all ones. *
See Appendix B.2 for the proof. Proposition 2 shows that the mean-field approach reduces to solving a linear system of equations in variables, a task with much better computational complexity than solving the original recursion. Both and can be computed in at most time. In the cases where is sparse (which often the case for real networks), will be a rank-one perturbation of a sparse matrix (both and will be sparse), hence solving the resulting system is often much faster than the worst-case, i.e., faster than ).
5 Simulations
The methods proposed in this paper, the Greedy Elimination (GE) and the Mean Field Approximation (MFA), have reasonable runtimes compared to popular source identification procedures and show superior performance in source identification. In this section, we make a comparison based on these two measures on real and synthetic networks. As discussed in Section 2.2, we consider ranking estimators (i.e., those outputting a permutation of the nodes according to their likelihood of being the source) and focus on the rank loss. If the method does not return a ranking, we tweak it to do so. We evaluate the methods based on expected rank , the expectation of the rank of the actual source among the list of candidates (cf. Section 2.2). We normalize to get a number in [0,1], with zero corresponding to perfect recovery, i.e., we use .
We consider a variety of real and simulated networks. Our selection includes an Internet Autonomous System [35, 36], US west-coast power grid [37], two Facebook-100 networks [38, 39], called UC64 and UCSC68, and a Wikipedia voting network [40]. In addition, we present our results on a number of synthetic networks that are well studied in the literature, including regular trees, random trees, and degree-correlated stochastic block models (DC-SBM) [41].
Table 1 summarizes the statistics on the largest connected component of these networks. The regular tree is of degree 3 and depth 10. The random tree has 500 nodes. For the DC-SBM network, we generate from a -community planted partition version, i.e., where if nodes and are in the same community and if they are in different communities. The degree parameters are generated from a rescaled Pareto distribution with and threshold .
The results are illustrated in Figures 1, 2. The methods we consider besides the optimal Bayes solution (BO), the MFA, and the GE are the Rumor Centrality (RC), the Degree Centrality (DC) and the Jordan Center (JC). We have also run the Dynamical Age (DA), but due to its overall poor performance and its computational complexity, we have omitted it from the plots. Our selection of the methods loosely follows the methods surveyed in [30]. Each curve shows the performance of one method for different values of the infection size, . Each point is an average over 500 infection paths rooted at random sources. To avoid an unreasonable computation time, we skip the optimal Bayes for the infected sets of size greater than 10. The BO curve serves as the benchmark for the best achievable performance. Note that even the optimal solution needs to output a large set to catch the source, signifying the inherent difficulty of the problem.
Rumor and Jordan centralities perform optimally on regular trees in Figure 1(a), as expected by the theory [23, 26], although the network is not exactly an infinite tree. Notice that RC, JC, and BO overlap for infection sizes not exceeding the depth of the tree. Degree centrality also appears as a close competitor in this figure. Moving to other networks, however, these popular methods do not perform better than random guessing. For all three, the expected relative rank is close to 0.5, even in a random tree. The plots in this section show that, despite their popularity, the RC and JC are quite unreliable for source recovery.
Among our proposed methods, MFA outperforms RC, JC, and DC in Figures 2(a), 2(b), 2(c), 2(d). MFA finds the true source, on average, in its top 30% guesses. The networks with suprior MFA performance have highest transitivity (aka clustering coefficient) in Table 1, that is, many triangles among triples of nodes. Transitivity has been studied extensively and it distinguishes human social networks from random trees and less cyclical networks, such as water distribution systems and traffic network. In this sense, MFA is suitable for rumor source detection in social networks.
GE is the global winner, except in regular trees (Figure 1(a)). We were surprised that a greedy algorithm had such a widespread success. GE not only performs well in highly transitive networks, but it also outperforms RC, JC, and DC on random trees (Figure 1(b)) and less transitive networks (Figures 1(c), 1(d)).
Figure 3 illustrates the runtimes (on the log scale) for a single run on the UC64 network, when the infection size is 10 (the maximum size for which the Bayesian results are available). Degree centrality is the fastest, followed by RC, JC, MFA and GE, all four having comparable speed, with RC and JC having a slight edge. BO is about 10 times slower and its runtime grows exponentially with infection size.
Based on these results, we advocate for the use of GE as the main tool for identifying sources of epidemics, regardless of the network topology or the nature of the epidemics (rumor propagation, disease contagion, etc.). MFA should be applied with caution. It is superior in social (transitive) networks, and attractive for its simplicity and the potential for parallelism.
Appendix A Multi-source Extension
The inference problem discussed in Section 2.2 immediately extends to the multi-source situations. Consider the case were more than one independent source, denoted by , initiate the infection dynamics. Due to the Markovian nature of the dynamics, the infection path that leads to some set does not influence the value of . Hence, Proposition 1 also describes the likelihood of the transition from the source set to a snapshot .
If we know that there are original sources, e.g. , with a uniform prior on the patient zeros, the Bayesian solution would be characterized by the optimization
[TABLE]
To compute this MAP estimate, we can still use the DP solution in Proposition 1, but we do not need to compute for . Thus, the multi-source problem is in a sense “easier”, especially when , since one can terminate the recursion earlier (i.e., the case is the hardest).
Appendix B Proofs
B.1 Proof of Proposition 1
Let us first recall a known fact about the exponential distribution:
Fact 1**.**
Let be a collection of independent exponential variables. Then,
[TABLE]
The forward programming (2) is an application of the law of total probability in the following sense: The event that nodes in are infected before any other node in splits into sub-events that each node in is infected before those in and we have
[TABLE]
where we have also used the Markov property of SI dynamics to split the probabilities on the RHS into the products. The ratio in (2) corresponds to the transition probability from to , that is . Indeed, given that is infected, we run exponential clocks and the first to expire determines the next infected node. By Fact 1, this happens for any node with probability probability . Thus,
[TABLE]
This proves the forward programming. The backward programming, on the other hand, connects to and is proved similarly. Basically, the event of visiting can be divided into sub-events based on the last node in that is infected.
B.2 Proof of Proposition 2
We prove the following alternative expressions for and ,
[TABLE]
Here, is the degree of node in , and is the number of paths of length 2 between nodes and that pass through . It is not hard to verify that these expressions are equivalent to the matrix form presented in (2).
Recall that and similarity . Here, the indices, , and run over all nodes in the network, i.e. . We have
[TABLE]
where the last equality follows by interchanging the order of summations and defining
[TABLE]
If or do not belong to , or , then . Thus, it what follows assume that and . Then,
[TABLE]
To see the second equality, note that we are counting subsets of the set (of cardinality ) that contain or exclude certain elements. For example, when are pairwise distinct, and , looking at the binary representation of , we have two ones in the positions and and a zero in position , and the rest of positions are free to be zero or one.
Let be the degree of node in . We drop when . In what follows, and range over (otherwise ). Also, condition can be replaced with , since the is implicitly enforced by if (no self-loops). We have
[TABLE]
where in the second term, we used the fact that if then we automatically have since ranges over . We have
[TABLE]
where is the number of paths of length two between and in . Note that and similarly since . Thus,
[TABLE]
where and the third equality follows since we have
[TABLE]
which was used with . Similarly, we have
[TABLE]
It follows that
[TABLE]
Calculating
Let us first take . Then, similar to the previous argument,
[TABLE]
where we have defined
[TABLE]
assuming , otherwise . Thus, restricting summations over indices
[TABLE]
Now consider the case . Then,
[TABLE]
assuming . It follows that
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Andrew Cliff and Peter Haggett. Time, travel and infection. British medical bulletin , 69(1):87–99, 2004.
- 2[2] Mitchell L Cohen. Changing patterns of infectious disease. Nature , 406(6797):762, 2000.
- 3[3] Vittoria Colizza, Alain Barrat, Marc Barthélemy, and Alessandro Vespignani. The role of the airline transportation network in the prediction and predictability of global epidemics. Proceedings of the National Academy of Sciences , 103(7):2015–2020, 2006.
- 4[4] Laurence Slutsker, Sean F Altekruse, and David L Swerdlow. Foodborne diseases: emerging pathogens and trends. Infectious Disease Clinics , 12(1):199–216, 1998.
- 5[5] Matthew Elliott, Benjamin Golub, and Matthew O Jackson. Financial networks and contagion. American Economic Review , 104(10):3115–53, 2014.
- 6[6] Daron Acemoglu, Asuman Ozdaglar, and Alireza Tahbaz-Salehi. Systemic risk and stability in financial networks. American Economic Review , 105(2):564–608, 2015.
- 7[7] Suleyman Kondakci. Epidemic state analysis of computers under malware attacks. Simulation Modelling Practice and Theory , 16(5):571–584, 2008.
- 8[8] Chris Fleizach, Michael Liljenstam, Per Johansson, Geoffrey M Voelker, and Andras Mehes. Can you infect me now?: malware propagation in mobile phone networks. In Proceedings of the 2007 ACM workshop on Recurring malcode , pages 61–68. ACM, 2007.
