A Polya Contagion Model for Networks
Mikhail Hayhoe, Fady Alajaji, Bahman Gharesifard

TL;DR
This paper introduces a network contagion model based on a modified Polya urn scheme, analyzing its stochastic properties, asymptotic behavior, and comparing it with traditional epidemic models.
Contribution
It develops a novel Polya-based network contagion model that accounts for spatial infection and provides analytical and empirical insights into its behavior.
Findings
Model captures non-stationary contagion dynamics.
Analytical approximations fit well across parameters.
Model compares favorably with SIS epidemic models.
Abstract
A network epidemics model based on the classical Polya urn scheme is investigated. Temporal contagion processes are generated on the network nodes using a modified Polya sampling scheme that accounts for spatial infection among neighbouring nodes. The stochastic properties and the asymptotic behaviour of the resulting network contagion process are analyzed. Unlike the classical Polya process, the network process is noted to be non-stationary in general, although it is shown to be time-invariant in its first and some of its second-order statistics and to satisfy martingale convergence properties under certain conditions. Three classical Polya processes, one computational and two analytical, are proposed to statistically approximate the contagion process of each node, showing a good fit for a range of system parameters. Finally, empirical results compare and contrast our model with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A Polya Contagion Model for Networks
Mikhail Hayhoe, Fady Alajaji, Senior Member, IEEE, and Bahman Gharesifard, Member, IEEE
Mikhail Hayhoe is with the Department of Electrical and Systems Engineering at the University of Pennsylvania, Philadelphia, PA, USA [email protected]. Fady Alajaji and Bahman Gharesifard are with the Department of Mathematics and Statistics at Queen’s University, Kingston, ON, Canada, {fa, ghareb}@queensu.ca. This work was partially supported by the Natural Sciences and Engineering Research Council of Canada. Parts of this work were presented at the 2017 American Control Conference [1].
Abstract
A network epidemics model based on the classical Polya urn scheme is investigated. Temporal contagion processes are generated on the network nodes using a modified Polya sampling scheme that accounts for spatial infection among neighbouring nodes. The stochastic properties and the asymptotic behaviour of the resulting network contagion process are analyzed. Unlike the classical Polya process, the network process is noted to be non-stationary in general, although it is shown to be time-invariant in its first and some of its second-order statistics and to satisfy martingale convergence properties under certain conditions. Three classical Polya processes, one computational and two analytical, are proposed to statistically approximate the contagion process of each node, showing a good fit for a range of system parameters. Finally, empirical results compare and contrast our model with the well-known discrete time SIS model.
Index terms—Polya contagion networks, epidemics on networks, non-stationary stochastic processes, martingales.
I Introduction
In this paper we examine the dynamics and properties of a contagion process, or epidemic, on a network. Here an epidemic can represent a disease [2], a computer virus [3], the spread of an innovation, rumour or idea [4], or the dynamics of competing opinions in a social network [5].
Many different models for the study of infection propagation and curing exist in the literature. Our model, the network Polya contagion process, bears similarities to the well-known susceptible-infected-susceptible (SIS) infection model [6]. In this model, all nodes may initially be healthy or infected. As the epidemic spreads, nodes that are infected can be cured to become healthy, but any healthy node may become infected at any time, regardless of whether they have been cured previously. Epidemics on networks have been intensively studied in recent years; see [7] and references therein and thereafter. The model that we present is an adaptation of the classical Polya contagion process [8, 9, 10] to a network setting by accounting for spatial infection between nodes. The classical Polya model has been used to study a variety of epidemics such as the bubonic plague in Peru [11] and the spread of chlamydia in a closed population [12], as well as a wide range of other applications; see [13] for a survey. In this work we will examine the stochastic evolution of the network Polya contagion process.
Our model is motivated by the classical Polya contagion process, which evolves by sampling from an urn containing a finite number of red and black balls [8, 9, 10]. In the network Polya contagion model, each node of the underlying network is equipped with an individual urn; however, instead of sampling from these urns when generating its contagion process, each node has a “super urn”, created by combining the contents of its own urn with those of its neighbours’ urns. This adaptation captures the concept of spatial infection, since having infected neighbours increases the chance that an individual is infected in the future. This concept of the super urn sampling mechanism for incorporating spatial interactions was originally introduced in [14] in the context of the image segmentation and labeling problem. We herein adapt the image model of [14] for a network setting and analyze the resulting contagion process affecting each node of the network.
More specifically, we study the time evolution and stochastic properties of the proposed network contagion process. We derive an expression for the temporal -fold joint probability distribution of the process. We show that this process, unlike the classical Polya urn process, is in general non-stationary, and hence not exchangeable. For the special case of complete networks, we analytically find the -dimensional and -dimensional -step marginal distributions of the contagion process. These results show that, even though it is not stationary, the process in this case is nevertheless identically distributed with its later two marginal distributions being invariant to time shifts. We also establish several martingale properties regarding the network urn compositions, proving that the proportions of red balls in each node’s urn as well as the network average urn proportion converge almost surely to a limit as time grows without bound. We next provide three approximations to the network contagion process by modelling each node’s contagion process via a classical stationary Polya process [10]. In the first one, we approximate each node’s process with the classical Polya process whose correlation parameter is empirically selected so that the Kullback-Leibler divergence measure between its -fold joint distribution and that of the original node process is minimized. In the second approximation, we propose an analytical model whose parameters are chosen by matching its first and -step second-order statistics with those of the original node process, which fits well for large networks. The last approximation uses a classical Polya model with parameters chosen analytically that we show fits well for small networks. Finally, simulation results are presented to support the validity of these approximations and to compare our model with the traditional discrete time SIS model, which suggests that the network Polya contagion process captures certain properties of the SIS model, while offering new insights in the case of widespread infection.
The rest of the paper is organized as follows. Section II outlines some preliminary knowledge that will be used throughout the paper. Section III introduces the network contagion process, and Section IV presents its stochastic properties and asymptotic behaviour. Section V proposes three approximations for the individual node contagion processes in the network, along with numerical modelling results. Lastly, Section VI concludes the paper.
II Preliminaries
For a sequence , we use the notation with to denote the vector . Our technical results rely on notions from stochastic processes, some of which we recall here. Throughout, we assume that the reader is familiar with basic notions of probability theory.
Let be a probability space, and consider the stochastic process , where each is a random variable on . We often refer to the indices of the process as “time” indices. We recall that the process is stationary if for any , its -fold joint probability distribution (i.e., the distribution of ) is invariant to time shifts. Further, is exchangeable if for any , its -fold joint distribution is invariant to permutations of the indices . It directly follows from the definitions that an exchangeable process is stationary. Lastly, the process is called a martingale (resp. supermartingale, submartingale) with respect to the process if and almost surely (resp. less than or equal to, greater than or equal to), for all . Precise definitions of all notions, including that of ergodicity, can be found in standard texts (e.g., [15, 16]).
We now recall the classical version of the Polya contagion process [8, 10]. Consider an urn with red balls and black balls. We denote the total number of balls by , i.e., . At each time step, a ball is drawn from the urn. The ball is then returned along with balls of the same colour. We use an indicator to denote the colour of ball in the th draw:
[TABLE]
Let denote the proportion of red balls in the urn after the th draw. Then
[TABLE]
where is the initial proportion of red balls in the urn and is a correlation parameter. Since we draw balls from this urn at each time step, the conditional probability of drawing a red ball at time , given , is given by
[TABLE]
It can be easily shown that is a martingale [17]. The process , whose -fold joint distribution is denoted by , is also exchangeable (hence stationary) and non-ergodic with both and the process sample average converging almost surely as to a random variable governed by the Beta distribution with parameters and ; we denote this probability density function (pdf) by [17, 18]. Lastly, the -dimensional distribution of the Polya process is , for all and . The above classical Polya process is fully described by its parameters and , and thus we denote it by .
III Network Polya Contagion Process
In this section, we introduce a generalization of the Polya contagion process to networks, where each individual node in the underlying graph that describes the network topology is still equipped with an urn; however, the node’s neighbouring structure affects the evolution of its process. This model hence captures spatial contagion, since infected neighbours increase the chance of a node being infected in the future.
Consider an undirected graph , where is the set of nodes and is the set of edges. We assume that is connected, i.e., there is a path between any two nodes in . We use to denote the set of nodes that are neighbours to node , that is , and . If for all , the network is called complete; if for all , we call it regular. Each node is equipped with an urn, initially with red balls and black balls (we do not let or to avoid any degenerate cases). We let be the total number of balls in the th urn, . We use as an indicator for the ball drawn for node at time :
[TABLE]
However, instead of drawing solely from its own urn, each node draws simultaneously from a “super urn” created by combining all the balls in its own urn with the balls in its neighbours’ urns; see Figure 1. This allows the spatial relationships between nodes to influence their state. This means that is the indicator for a ball drawn from node ’s super urn, and not its individual urn. Hence, the super urn of node initially has red balls, black balls, and balls in total.
We further consider a time-varying version of the classical Polya contagion process, following [19], where at time for node , after a red ball is drawn it is returned along with red balls to node ’s urn, and black balls along with the drawn ball are added to node ’s urn when a black ball is drawn. When for all , we write instead; if the ’s are not node-dependent, we omit the node index. We assume throughout that , for all and that there exists and such that ; otherwise we are simply sampling with replacement.
In the context of epidemics, the red and black balls in an urn, respectively, represent units of “infection” and “healthiness”; for example, bacteria and white blood cells. In a super urn, the bacteria can infect others in the area and the white blood cells contribute to the overall health in the neighbourhood of an individual. Drawing red at time means the bacteria in the neighbourhood were successful in reproduction and so the individual was more infected, otherwise they were healthier since the white blood cells reproduced. Thus when , we declare that node is infected at time , and if , then it is healthy. We add more units of bacteria once they reproduce, but commonly assume this number, , is the same across all individuals and time because the bacteria does not evolve or become altered. The amount of white blood cells created, , may change since we can give more medicine to certain people to increase their immune response, or vaccinate them so they are better able to fight the disease.
To express the proportion of red balls in the individual urns of the nodes, we define the random vector , where is the proportion of red balls in node ’s urn after the th draw, . For node ,
[TABLE]
where the numerator represents the total number of red balls in node ’s urn after the th draw, while the denominator is the total number of balls in the same urn. Note that is the initial proportion of balls in node ’s urn. For ease of notation, let
[TABLE]
Furthermore, we define the random vector as the proportion of red balls in the super urns of the nodes after the th draw, so that is the proportion of red balls in node ’s super urn after draws. Hence, for node ,
[TABLE]
Note that . is in fact a function of the random draw variables of the network, and in particular of , but for ease of notation, when the arguments are clear, we write S_{i,n}{\color[rgb]{0,0,0}(Z_{1}^{n},\cdots,Z_{N}^{n})}=S_{i,n}. Then the conditional probability of drawing a red ball from the super urn of node at time given the complete network history, i.e. given all the past draw variables for each node in the network , satisfies
[TABLE]
That is, the conditional probability of drawing a red ball for node at time given the entire past is the proportion of red balls in its super urn, . This is however analogous to the original Polya case, but instead of relying on the individual proportion of red balls to describe the conditional probability of drawing red balls, we use the super urn proportion of red balls since we now draw from there.
Remark III.1
(Non-Markovity):* While (4) may appear to suggest some sort of Markovity property, the process is non-Markovian in general. This can easily be seen due to the fact that a draw at time requires knowledge of all previous draws for the entire neighbourhood.*
A main objective throughout the rest of this paper is to study the evolution and stochastic properties of the process defined above. Using the above conditional probability, we can determine the -fold joint probability of the entire network : for , , we have that
[TABLE]
where is defined in (4). With the above explicit joint distribution, it is possible to determine the distributions of each node’s process. More specifically, using (9), the -fold distribution of node ’s process at time is
[TABLE]
In order to measure the spread of contagion in the network at any given time, we wish to see how likely it is, on average, for a node to be infected at that instant. We hence define the average infection rate in the network at time as the average marginal probability of drawing a red ball,
[TABLE]
Note that is a function of the network topology , the initial placement of balls and , the draw processes , and number of balls added and for each node . Unfortunately for an arbitrary network, the above quantity does not yield an exact analytical formula (except in the simple case of complete networks). As such, in general it is hard to mathematically analyze the asymptotic behaviour of , which we wish to minimize when attempting to cure an epidemic. Instead we examine the asymptotic stochastic behavior of a closely related variable given by the average individual proportion of red balls at time , namely
[TABLE]
which we call the network susceptibility. This quantity is related to the conditional probability of drawing a red ball, as seen in (2). Since the individual urn of node is in every super urn in the neighbourhood, if increases then increases for every , and hence given the past history those nodes are more likely to exhibit infected behaviour as seen from (4). Note that similarly to , is a function of the network variables.
Remark III.2
(Finite Memory):* It is worth pointing out that a practical adaptation to our model can be considered, where urns have “finite memory” in the sense that the balls added after each draw are only kept in each node’s urn for a finite number of future draws. This model is developed in [18] for the classical Polya process in the context of modelling communication channels, where it is shown that the resulting finite memory contagion process is stationary, Markovian and ergodic. We present the following result that states that in this case the entire state is Markovian and hence it is a limited reinforcement model, but leave an in-depth investigation to a future work. *
Proposition III.3
(Finite Memory Markovity):* The entire state of the network Polya contagion process with finite memory is Markovian with memory .*
Proof:
By (1) and (4) and the fact that added balls are removed after steps, we have for that
[TABLE]
Using the above result along with conditional independence, for we have for that
[TABLE]
and hence the entire network process is Markovian with memory . ∎
IV Stochastic Properties
We next examine the stochastic properties of the network contagion process. We assume throughout the beginning of this section that , for all and times ; that is the net number of red and black balls added are equal and constant in time for all nodes. In the case of a complete network, the composition of every nodes’ super urn is identical, since there is only one super urn that is being drawn from. Thus for a complete network the super urn model is analogous to one urn where multiple draws occur with replacement, which has been recently studied in detail [20]. However, the analysis in [20] is carried out in an aggregate sense, i.e., only for the entire urn and not individual processes. Unfortunately, this aggregate approach does not work in a network setting, whereas the super urn model proposed here is applicable.
IV-A Complete Network Marginal Distributions
We first focus on the special case of complete networks to derive some useful probability distributions; later on, we will obtain other stochastic properties that apply to more general networks.
Given that the network is complete, we focus on one of the nodes, say . For ease of notation, we define , and similarly, , for all . Defining the events and with , and parameters and , we can write using (4) under the above assumptions, that
[TABLE]
By examining an arbitrary term in the final sum above, for fixed , we can sum out all the other draw variables:
[TABLE]
Further, by the law of total probability,
[TABLE]
So using (17) and (20), (10) becomes
[TABLE]
Thus, using the law of total probability, we have
[TABLE]
An interesting corollary of this derivation is as follows.
Lemma IV.1
(Complete Network Marginal Distribution):* The -dimensional marginal distribution of node ’s contagion draw process for the -node complete network is given by*
[TABLE]
where , , and .
Proof:
We proceed using strong induction on , showing that , for all nodes and all . The base case readily holds, since at time ,
[TABLE]
Now, assuming that for all and and using (21), we have
[TABLE]
which completes the induction argument. The result now follows using the fact that
[TABLE]
for all and all . ∎
We next show that each node’s draw process is not stationary in general, and hence is different from the classical process.
Remark IV.2
(Non-Stationarity of the Network Contagion Process):* Consider a -node complete network. Then, using (9), one can obtain (after some simplifications) that*
[TABLE]
and hence the network process is not stationary.
Since every exchangeable process is necessarily stationary, Remark IV.2 implies that the network Polya process is not exchangeable in general. However, some notions of stationarity remain; in our next result, we will see that there is a consistent relationship between the draws at the st and th time steps.
Lemma IV.3
(Complete Network -step Marginal Probability):* For the complete network, the -dimensional marginal probability that node ’s draw variables at times and are both one is given by*
[TABLE]
for , . Furthermore, for any other node ,
[TABLE]
Proof:
By Lemma IV.1 we have that for all , so it is enough to show that
[TABLE]
for all and nodes and . Using the law of total probability, (4), and after some simplifications, with defining (note that is a function of , but for simplicity we omit this), we have that
[TABLE]
Then, after arranging terms and using the law of total probability for
[TABLE]
we have
[TABLE]
It can be similarly shown by symmetry of the complete network that (26) holds for if .
In order to show (25), we proceed using strong induction on . For the base case, setting in (26), we have for any ,
[TABLE]
as desired. Assume now that is given by (25), for and any . Then by (26),
[TABLE]
which completes the induction argument. ∎
Although the draw process is not stationary in general, simulated results suggest that it satisfies some asymptotic stationarity properties, in the sense that given sufficient time the process settles and deviations become very small in magnitude. A representative example is shown in Figure 2 for the -dimensional distribution at times and in the 5-node network shown in Figure 3(d).
IV-B Martingale Theorems
We now turn our attention to the martingale properties of the network contagion process, where we do not assume that the network is necessarily complete. Recall that by the martingale convergence theorem [15, 16], if a process is a martingale (or supermartingale, or submartingale), there exists a random variable such that converges almost surely to as .
Theorem IV.4
(Individual Urn Proportion Martingale):* For a network , and , for all and all , the individual proportion of red balls is a martingale with respect to the draws for the whole network if and only if, almost surely,*
[TABLE]
Proof:
Using the expression for , (2), and (4), we have almost surely
[TABLE]
This implies that is a martingale with respect to if and only if
[TABLE]
almost surely. ∎
If the condition in Theorem IV.4 holds, then for any both and converge almost surely to a limit as . However, the condition of Theorem IV.4, barring the trivial single node scenario, is not verifiable. To resolve this issue, we instead examine the evolution of the average proportion of red balls (i.e., the susceptibility) in a regular network.
Theorem IV.5
(Regular Network Susceptibility Martingale):* For a regular network with and for all nodes and times , the network susceptibility , where , is a martingale with respect to .*
Proof:
We have, similar to Theorem IV.4, that
[TABLE]
Let us examine the second term of the last equality. If this term is zero, is a martingale with respect to . We now define the adjacency matrix of our network, where the th entry is 1 if , and 0 otherwise. Since we assumed that our network was undirected, is symmetric, i.e., for all . So,
[TABLE]
Now, we examine the sum of the and components of the double sum, where (otherwise both terms are zero). Recall that , . We have
[TABLE]
From above, it is clear that this term is zero for all and by setting , i.e. in any regular network, and so is a martingale with respect to . ∎
We next allow the net number of black balls to evolve stochastically in time as a function of the past draw history in the network in order to steer to a limit for every node .
Theorem IV.6
(Individual Urn Proportion Categories):* In a general network with for all and , if we choose so that*
[TABLE]
almost surely for all and (resp. equal to, less than or equal to) then is a supermartingale (resp. martingale, submartingale) with respect to .
Proof:
We will start with the case of a supermartingale. That is, we wish to show that almost surely for all ,
[TABLE]
Define , and take as in (1). Then, we have almost surely
[TABLE]
since for all almost surely, we can ignore it. Now, since is almost surely constant given ,
[TABLE]
That is we wish to check if, almost surely,
[TABLE]
Now if
[TABLE]
almost surely, we have
[TABLE]
where the second to last equality comes from the fact that almost surely by (4), and that is almost surely constant given . Thus as long as obeys this bound almost surely for all , is a supermartingale with respect to . Similarly, if is almost surely equal (resp. less than or equal) to this bound, is a martingale (resp. submartingale) with respect to . ∎
Theorem IV.6 tells us what bounds for must be obeyed almost surely to guarantee that admits an asymptotic limit for all in any general network. For instance, this tells us that by choosing almost surely for all and , will be a submartingale and will converge to some limiting random variable. While this result is interesting for modelling contagion, it is especially useful in the context of curing.
V Model Approximations
As previously noted, the dynamics of the network contagion process are complicated, especially when considered on general networks. For this reason, in this section we develop two useful approximations to this process on a general network that allow us to shed some light on its asymptotic behaviour. Throughout this section, unless stated otherwise, we consider general network topologies with {\color[rgb]{0,0,0}\Delta_{r,i}(t)}=\Delta_{b,i}(t)=\Delta for all and . However, to match the -step and -step distributions, we make the simplifying assumption that the neighbourhood of each node can be represented as a complete network, i.e., all of its neighbours are connected to one another, in order to apply Lemmas IV.1 and IV.3.
V-A Approximation: Computational Model
We now introduce our first approximation technique, where we approximate the contagion process of each node in the network with a classical Polya urn process.
Model I
(Computational Model):* We approximate the dynamics of any node ’s contagion process using a classical Polya process , with*
[TABLE]
with
[TABLE]
where is the Gamma function, , and .
Here is chosen to be the proportion of red balls in the node’s super urn, so that the -dimensional distributions of the classical Polya process and the node process coincide, while is set by performing a minimization to find the value that best fits to the distribution of of node . We use a divergence measure, denoted by , to observe the quality of the fit.
The explicit derivation of the distribution can be found in [17, 21]. This method ensures that the fit of is as close as possible under the given divergence measure. Since we are measuring the error in using an approximating distribution, we use the Kullback-Leibler divergence [22]; we thus have that
[TABLE]
since is independent of . The approximating process is stationary and exchangeable, as it is a classical Polya process. We also know (from Section II) that it is non-ergodic with its sample average converging almost surely to the distribution. Calculating an analytic expression for the minimizing is not feasible in general, and hence should be performed computationally. However, due to the above minimization, the value of is, by definition, the best way to fit a Polya process to the process for a given .
V-B Approximation: Analytical Models
An alternative to Model I is to attempt to find approximations whose parameters can be determined analytically.
Model II(a)
(Large-Network Analytical Model):* For any given node , we approximate the dynamics of its process by using a classical Polya process , with*
[TABLE]
where .
Here the parameters of the classical Polya process are chosen by directly matching its first and -step second-order statistics with those of . This method avoids the computational burden of the previous model by yielding an analytical expression for the correlation parameter of the classical Polya process.
We next prove that under some stationarity and symmetry assumptions, the contagion process running on each node in the network is statistically identical to the classical Polya process of Model II(a).
Lemma V.1
(Exact Representation):* Suppose that*
- (i)
, and 2. (ii)
,
for all , , . Then for any node in a complete network, is given exactly by the process.
Proof:
For any node , we wish to show that for all , the -dimensional distributions of and the process are identical. It is enough to show that the conditional probability of one event given the whole past is the same, since any joint probability can be written as a product of conditional probabilities. Let us define the events and . Then,
[TABLE]
Then using assumption (i), we have
[TABLE]
Now using assumption (ii), we get
[TABLE]
Thus, we have that
[TABLE]
which is the conditional probability for a process. A similar calculation can be performed for . ∎
Unfortunately in a general network setting assumptions (i) and (ii) above do not hold true. However, this result motivates the fact that this analytical approximation is reasonable to use for situations where these assumptions hold within tolerable margins of error; empirical evidence indicates that this occurs for large values of , since as increases the quality of the fit improves. This approximation, nevertheless, drastically reduces the complexity in analyzing the individual contagion draw processes, as closed-form expressions for the process parameters are available.
Model II(b)
(Small-Network Analytic Model):* Given any node in the network with a small to moderate number of nodes, we approximate the dynamics of its contagion process using a process, where*
[TABLE]
where .
The idea behind this model is that we want to remove the dependence on the number of nodes from the parameter , and so we divide each instance of in by . The idea is that as grows, it eventually becomes significantly larger than the relatively small number of nodes , and so for all . Hence, we may consider that for a sufficiently large time, we have added balls to the super urn. Effectively, this means we are using a correlation parameter of instead of . Simulation results confirm that this approximation captures the limit distribution of the original process better than Model II(a) when the number of nodes is small. Figure 3 displays this relationship. A summary of all models presented in this section, and the scenarios under which they are most suitable, is provided in Table I.
We close this section with numerical demonstrations on the fitness of all models. Figure 3 shows a representative comparison between the pdf and the simulated histogram of , where , for an arbitrary node in the given networks. Recall that the , and pdfs are the distributions of the limit random variables to which the sample average of the draw processes of Models I, II(a) and II(b) (respectively) converge almost surely, as (see Section II). We use complete networks since they satisfy the assumption that all neighbourhoods are complete, as well as Barabasi-Albert networks which have been shown to be a good model for real-world social networks [23] and do not satisfy this assumption; however, our results show that the approximations still fit quite well. As expected, Model I provides the best approximation in all scenarios, albeit without an analytic expression for its parameters which can provide insight into the behaviour of the underlying process. Model II(a) fits quite well when the number of nodes in the network is large, as seen in Figures 3(b) and 3(e), but fits poorly for a small number of nodes, which is evident in Figures 3(a) and 3(c). Model II(b) is the complement of Model II(a) in the sense that it fits very well for a small number of nodes but poorly for a large network. Hence if analytic expressions for parameters are desired, Models II(a) and II(b) can be used depending on the number of nodes to provide approximations that are marginally worse than the computational exactness of Model I.
V-C Comparison with SIS model
We now provide a number of empirical results in which we compare our model, with both finite and infinite memory, to the traditional discrete time SIS model [24]. In the SIS model, the parameter denotes the probability that a node will recover from infection, and is the probability that a node will become infected through contact with a single infected neighbour. The dynamics are described through the probability that any node will be infected at time , , which evolves according to the equation
[TABLE]
Note in particular that this model exhibits Markovian behaviour, since the evolution of the process depends only on the probability of infection from the previous time step. We make the simplifying assumption that and remain the same for all nodes and throughout time, and hence we will compare it with the network Polya contagion process when and are similarly fixed in time and throughout the network.
The concept of an epidemic threshold for the SIS model gives a value through which one may determine whether the epidemic dies, a priori using only the system parameters [24]. The threshold condition is directly related to the largest-magnitude eigenvalue of the adjacency matrix of the underlying graph of the network, and states that if then the epidemic will be eliminated after some time , i.e., eventually for all and all . Furthermore, it has been shown that this threshold is tight, and indeed if then some non-zero convergence point exists, called an endemic state, and the epidemic will never be eliminated [25].
Figure 4 compares the behaviour of the SIS model and the network Polya contagion process for different selections of these parameters. The initial probabilities of infection for the SIS model were set to coincide with the initial individual proportions of red balls for the nodes . Further, we relate in Figures 4(a)–4(c) the parameters and to and , respectively, using ratios of the largest-magnitude eigenvalue of the adjacency matrix of the graph shown in Figure 4(d).
Figure 4(a) shows a comparison when the SIS model is displaying endemic behaviour. We see here that after a very short time, the SIS model settles and shortly thereafter the finite memory process settles (albeit to a different value), while for the infinite memory process the individual rates of infection and hence the average continue to increase in time. Since both the SIS model and the finite memory process have limited reinforcement, while the infinite memory process does not, these results are to be expected. Figure 4(b) displays a comparison where the epidemic threshold is met and the epidemic dies out for the SIS model. Here we note that for both the infinite and finite memory processes decreases and approaches zero, albeit not as quickly as the SIS model. Hence we observe that when the curing parameter is much larger (in fact, more than five folds larger) than the infection parameter the epidemic is eliminated, as we expect, and this behaviour of the SIS model is captured by the network Polya contagion process. However, the finite memory process does not fully approach zero, since the initial conditions and have a much larger influence relative to the infinite memory process. Finally, Figure 4(c) shows the case where the epidemic does not vanish and the parameters in both models are set to be equal ( and ). We observe a similar trend between all models, with the finite and infinite memory processes exhibiting near-identical behaviour.
Through these observations, we may conclude that both versions of the network Polya contagion process may apply to the modelling of epidemics, albeit in different applications. The finite memory process exhibits behaviour that is more closely related to the SIS model since they are both limited reinforcement processes, and hence it may be best suited to traditional biological diseases. The infinite memory process obeys similar trends, but in the endemic state there are some interesting differences since the effects of the infection continue to spread throughout the population. On the other hand, the SIS model quickly settles and does not change in time. Thus with infinite memory our process is better suited to modelling opinion dynamics, the spread of ideas, and advertising schemes.
VI Conclusion
We introduced a network epidemics model based on the classical Polya urn scheme, and we investigated its stochastic properties and asymptotic behaviour in detail. We showed that under certain conditions the proportion of red balls in individual urns and the network susceptibility, which are processes used to measure infection, admit limits. Three classical Polya processes were proposed, one computational and two analytical, to statistically approximate the contagion process of each node. Empirical results were presented which show that the approximations are a good fit for a range of system parameters. Our process was also compared empirically with the discrete-time SIS model, showing a similar behaviour, particularly in the finite memory mode, while providing different degrees of reinforcement in the endemic state, with the largest reinforcement occurring under the infinite memory mode. Future directions of research include investigations into the curing of these processes, and the further study of the network contagion process with finite memory.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Hayhoe, F. Alajaji, and B. Gharesifard, “A Polya urn-based model for epidemics on networks,” Proc. 2017 American Cont. Conf. , 2017.
- 2[2] L. Kim, M. Abramson, K. Drakopoulos, S. Kolitz, and A. Ozdaglar, “Estimating social metwork structure and propagation dynamics for an infectious disease,” in Proc. Int. Conf. Social Computing, Behavioral-Cultural Modeling, and Prediction , pp. 85–93, Springer, 2014.
- 3[3] M. Garetto, W. Gong, and D. Towsley, “Modeling malware spreading dynamics,” in Proc. IEEE Int. Conf. Comp. Commun. , vol. 3, pp. 1869–1879, 2003.
- 4[4] E. M. Rogers, Diffusion of Innovations . 5 ed., 2003.
- 5[5] E. Adar and L. A. Adamic, “Tracking information epidemics in blogspace,” in Proc. IEEE/WIC/ACM Int. Conf. Web Intelligence , pp. 207–214, 2005.
- 6[6] D. Easley and J. Kleinberg, Networks, Crowds and Markets: Reasoning about a Highly Connected World . Cambridge Univ. Press, 2010.
- 7[7] P. V. Mieghem, J. Omic, and R. Kooij, “Virus spread in networks,” IEEE/ACM Trans. Netw. , vol. 17, no. 1, pp. 1–14, 2009.
- 8[8] F. Eggenberger and G. Polya, “Über die statistik verketteter vorgänge,” Z. Angew. Math. Mech. , vol. 3, no. 4, pp. 279–289, 1923.
