A Polya Contagion Model for Networks

Mikhail Hayhoe; Fady Alajaji; Bahman Gharesifard

arXiv:1705.02239·cs.SI·December 12, 2017

TL;DR

This paper introduces a network contagion model based on a modified Polya urn scheme, analyzing its stochastic properties, asymptotic behavior, and comparing it with traditional epidemic models.

Contribution

It develops a novel Polya-based network contagion model that accounts for spatial infection and provides analytical and empirical insights into its behavior.

Findings

01

Model captures non-stationary contagion dynamics.

02

Analytical approximations fit well across parameters.

03

Model compares favorably with SIS epidemic models.

Tables

Table I: Approximation Usage Scenarios

Model	Usage Scenario
I	Exactness valued over analytic simplicity
II(a)	Larger values of $N$, i.e., large network
II(b)	Small to moderate values of $N$, i.e., small network

Equations

$$Z_{n}=\begin{cases}1&\text{if the $n$th draw is red}\\0&\text{if the $n$th draw is black.}\end{cases}$$

$$U_{n}=\frac{R+\Delta\sum_{t=1}^{n}Z_{t}}{T+n\Delta}=\frac{\rho_{c}+\delta_{c}\sum_{t=1}^{n}Z_{t}}{1+n\delta_{c}}$$

$$P(Z_{n}=1\mid Z^{n-1})=\frac{\rho_{c}+\delta_{c}\sum_{t=1}^{n-1}Z_{t}}{1+(n-1)\delta_{c}}=U_{n-1}$$

$$Z_{i,n}=\begin{cases}1&\text{if the $n$th draw for node $i$ is red}\\0&\text{if the $n$th draw for node $i$ is black.}\end{cases}$$

$$U_{i,n}=\frac{R_{i}+\sum_{t=1}^{n}Z_{i,t}\Delta_{r,i}(t)}{T_{i}+\sum_{t=1}^{n}\left[Z_{i,t}\Delta_{r,i}(t)+(1-Z_{i,t})\Delta_{b,i}(t)\right]}$$

$$X_{j,n}=T_{j}+\sum_{t=1}^{n}\left[Z_{j,t}\Delta_{r,j}(t)+(1-Z_{j,t})\Delta_{b,j}(t)\right].$$

$$S_{i,n}=\frac{\bar{R}_{i}+\sum_{j\in\mathcal{N}_{i}^{\prime}}\sum_{t=1}^{n}Z_{j,t}\Delta_{r,j}(t)}{\sum_{j\in\mathcal{N}_{i}^{\prime}}X_{j,n}}=\frac{\sum_{j\in\mathcal{N}_{i}^{\prime}}U_{j,n}X_{j,n}}{\sum_{j\in\mathcal{N}_{i}^{\prime}}X_{j,n}}.$$

$$P\big(Z_{i,n}=1\mid\{Z_{j}^{n-1}\}_{j=1}^{N}\big)=\frac{\bar{R}_{i}+\sum_{j\in\mathcal{N}_{i}^{\prime}}\sum_{t=1}^{n-1}Z_{j,t}\Delta_{r,j}(t)}{\sum_{j\in\mathcal{N}_{i}^{\prime}}X_{j,n-1}}=S_{i,n-1}.$$

$$\begin{aligned}P_{\mathcal{G}}^{(n)}(a_{1}^{n},\dots,a_{N}^{n})&:=P\big(\{Z_{i}^{n}=a_{i}^{n}\}_{i=1}^{N}\big)\\&=\prod_{t=1}^{n}P\big(\{Z_{i,t}=a_{i,t}\}_{i=1}^{N}\mid\{Z_{i}^{t-1}=a_{i}^{t-1}\}_{i=1}^{N}\big)\\&=\prod_{t=1}^{n}\prod_{i=1}^{N}\big(S_{i,t-1}\big)^{a_{i,t}}\big(1-S_{i,t-1}\big)^{1-a_{i,t}},\end{aligned}$$

$$P_{i,t}^{(n)}$$

$$\tilde{I}_{n}:=\frac{1}{N}\sum_{i=1}^{N}P(Z_{i,n}=1)=\frac{1}{N}\sum_{i=1}^{N}P_{i,n}^{(1)}(1).$$

$$\tilde{U}_{n}:=\frac{1}{N}\sum_{i=1}^{N}U_{i,n},$$

$$\begin{aligned}P\big(Z_{i,n}=1\mid\{Z_{j}^{n-1}\}_{j=1}^{N}\big)&=\frac{\bar{R}_{i}+\sum_{j\in\mathcal{N}_{i}^{\prime}}\sum_{t=n-M}^{n-1}Z_{j,t}\Delta_{r,j}(t)}{\bar{T}_{i}+\sum_{j\in\mathcal{N}_{i}^{\prime}}\sum_{t=n-M}^{n-1}\left[Z_{j,t}\Delta_{r,j}(t)+(1-Z_{j,t})\Delta_{b,j}(t)\right]}\\&=P\big(Z_{i,n}=1\mid\{Z_{j,n-M}^{n-1}\}_{j=1}^{N}\big).\end{aligned}$$

$$\begin{aligned}P\big(Z_{1,n}=a_{1},\dots,Z_{N,n}=a_{N}\mid\{Z_{j}^{n-1}\}_{j=1}^{N}\big)&=\prod_{i=1}^{N}P\big(Z_{i,n}=a_{i}\mid\{Z_{j}^{n-1}\}_{j=1}^{N}\big)\\&=\prod_{i=1}^{N}P\big(Z_{i,n}=a_{i}\mid\{Z_{j,n-M}^{n-1}\}_{j=1}^{N}\big)\\&=P\big(Z_{1,n}=a_{1},\dots,Z_{N,n}=a_{N}\mid\{Z_{j,n-M}^{n-1}\}_{j=1}^{N}\big),\end{aligned}$$

$$\begin{aligned}P(Z_{i,n}=1,A_{n-1})&=\sum_{\substack{b_{j}^{n-1}\in\{0,1\}^{n-1}:\\ j\neq i}}P(Z_{i,n}=1\mid W_{n-1})\,P(W_{n-1})\\&=\sum_{\substack{b_{j}^{n-1}\in\{0,1\}^{n-1}:\\ j\neq i}}\Bigg[\frac{\rho P(A_{n-1},\{Z_{j}^{n-1}=b_{j}^{n-1}\}_{j\neq i})}{1+(n-1)\delta}\\&\qquad+\frac{\delta}{N}\sum_{t=1}^{n-1}\Bigg(a_{t}\frac{P(A_{n-1},\{Z_{j}^{n-1}=b_{j}^{n-1}\}_{j\neq i})}{1+(n-1)\delta}+\sum_{j\neq i}\frac{b_{j,t}P(A_{n-1},\{Z_{j}^{n-1}=b_{j}^{n-1}\}_{j\neq i})}{1+(n-1)\delta}\Bigg)\Bigg].\end{aligned}$$

$$\begin{aligned}\sum_{\substack{b_{j}^{n-1}\in\{0,1\}^{n-1}:\\ j\neq i}}b_{k,t}P(A_{n-1},\{Z_{j}^{n-1}=b_{j}^{n-1}\}_{j\neq i})&=\sum_{b_{k}^{n-1}\in\{0,1\}^{n-1}}b_{k,t}P(A_{n-1},Z_{k}^{n-1}=b_{k}^{n-1})\\&=\sum_{b_{k,t}\in\{0,1\}}b_{k,t}P(A_{n-1},Z_{k,t}=b_{k,t})\\&=P(A_{n-1},Z_{k,t}=1).\end{aligned}$$

$$\sum_{\substack{b_{j}^{n-1}\in\{0,1\}^{n-1}:\\ j\neq i}}P(A_{n-1},\{Z_{j}^{n-1}=b_{j}^{n-1}\}_{j\neq i})=P(A_{n-1}).$$

$$P(Z_{i,n}=1,A_{n-1})=\frac{\rho P(A_{n-1})+\frac{\delta}{N}\sum_{t=1}^{n-1}\left[a_{t}P(A_{n-1})+\sum_{j\neq i}P(A_{n-1},Z_{j,t}=1)\right]}{1+(n-1)\delta}$$

$$P(Z_{i,n}=1)=\sum_{a^{n-1}}\left[\frac{\rho P(A_{n-1})+\frac{\delta}{N}\sum_{t=1}^{n-1}a_{t}P(A_{n-1})}{1+(n-1)\delta}+\frac{\frac{\delta}{N}\sum_{t=1}^{n-1}\sum_{j\neq i}P(A_{n-1},Z_{j,t}=1)}{1+(n-1)\delta}\right]$$

Full text

A Polya Contagion Model for Networks

Mikhail Hayhoe, Fady Alajaji, Senior Member, IEEE, and Bahman Gharesifard, Member, IEEE

Mikhail Hayhoe is with the Department of Electrical and Systems Engineering at the University of Pennsylvania, Philadelphia, PA, USA. Fady Alajaji and Bahman Gharesifard are with the Department of Mathematics and Statistics at Queen’s University, Kingston, ON, Canada, {fa, ghareb}@queensu.ca. This work was partially supported by the Natural Sciences and Engineering Research Council of Canada. Parts of this work were presented at the 2017 American Control Conference [1].

Abstract

A network epidemics model based on the classical Polya urn scheme is investigated. Temporal contagion processes are generated on the network nodes using a modified Polya sampling scheme that accounts for spatial infection among neighbouring nodes. The stochastic properties and the asymptotic behaviour of the resulting network contagion process are analyzed. Unlike the classical Polya process, the network process is noted to be non-stationary in general, although it is shown to be time-invariant in its first and some of its second-order statistics and to satisfy martingale convergence properties under certain conditions. Three classical Polya processes, one computational and two analytical, are proposed to statistically approximate the contagion process of each node, showing a good fit for a range of system parameters. Finally, empirical results compare and contrast our model with the well-known discrete time SIS model.

Index terms—Polya contagion networks, epidemics on networks, non-stationary stochastic processes, martingales.

I Introduction

In this paper we examine the dynamics and properties of a contagion process, or epidemic, on a network. Here an epidemic can represent a disease [2], a computer virus [3], the spread of an innovation, rumour or idea [4], or the dynamics of competing opinions in a social network [5].

Many different models for the study of infection propagation and curing exist in the literature. Our model, the network Polya contagion process, bears similarities to the well-known susceptible-infected-susceptible (SIS) infection model [6]. In the SIS model, all nodes may initially be healthy or infected. As the epidemic spreads, infected nodes can be cured to become healthy, but any healthy node may become infected at any time, regardless of whether it has been cured previously. Epidemics on networks have been intensively studied in recent years; see [7] and references therein and thereafter. The model that we present is an adaptation of the classical Polya contagion process [8, 9, 10] to a network setting by accounting for spatial infection between nodes. The classical Polya model has been used to study a variety of epidemics such as the bubonic plague in Peru [11] and the spread of chlamydia in a closed population [12], as well as a wide range of other applications; see [13] for a survey. In this work we will examine the stochastic evolution of the network Polya contagion process.

Our model is motivated by the classical Polya contagion process, which evolves by sampling from an urn containing a finite number of red and black balls [8, 9, 10]. In the network Polya contagion model, each node of the underlying network is equipped with an individual urn; however, instead of sampling from these urns when generating its contagion process, each node has a “super urn”, created by combining the contents of its own urn with those of its neighbours’ urns. This adaptation captures the concept of spatial infection, since having infected neighbours increases the chance that an individual is infected in the future. This concept of the super urn sampling mechanism for incorporating spatial interactions was originally introduced in [14] in the context of the image segmentation and labeling problem. We herein adapt the image model of [14] for a network setting and analyze the resulting contagion process affecting each node of the network.

More specifically, we study the time evolution and stochastic properties of the proposed network contagion process. We derive an expression for the temporal $n$-fold joint probability distribution of the process. We show that this process, unlike the classical Polya urn process, is in general non-stationary, and hence not exchangeable. For the special case of complete networks, we analytically find the $1$-dimensional and $2$-dimensional $(n,1)$-step marginal distributions of the contagion process. These results show that, even though it is not stationary, the process in this case is nevertheless identically distributed, with these two marginal distributions being invariant to time shifts. We also establish several martingale properties regarding the network urn compositions, proving that the proportions of red balls in each node’s urn as well as the network average urn proportion converge almost surely to a limit as time grows without bound. We next provide three approximations to the network contagion process by modelling each node’s contagion process via a classical stationary Polya process [10]. In the first one, we approximate each node’s process with the classical Polya process whose correlation parameter is empirically selected so that the Kullback-Leibler divergence measure between its $n$-fold joint distribution and that of the original node process is minimized. In the second approximation, we propose an analytical model whose parameters are chosen by matching its first and $(n,1)$-step second-order statistics with those of the original node process, which fits well for large networks. The last approximation uses a classical Polya model with parameters chosen analytically that we show fits well for small networks. Finally, simulation results are presented to support the validity of these approximations and to compare our model with the traditional discrete time SIS model; the comparison suggests that the network Polya contagion process captures certain properties of the SIS model, while offering new insights in the case of widespread infection.

The rest of the paper is organized as follows. Section II outlines some preliminary knowledge that will be used throughout the paper. Section III introduces the network contagion process, and Section IV presents its stochastic properties and asymptotic behaviour. Section V proposes three approximations for the individual node contagion processes in the network, along with numerical modelling results. Lastly, Section VI concludes the paper.

II Preliminaries

For a sequence $v_{i}=(v_{i,1},...,v_{i,n})$ , we use the notation $v_{i,s}^{t}$ with $1\leq s<t\leq n$ to denote the vector $(v_{i,s},v_{i,s+1},...,v_{i,t})$ . Our technical results rely on notions from stochastic processes, some of which we recall here. Throughout, we assume that the reader is familiar with basic notions of probability theory.

Let $(\Omega,\mathcal{F},P)$ be a probability space, and consider the stochastic process $\{Z_{n}\}_{n=1}^{\infty}$ , where each $Z_{n}$ is a random variable on $\Omega$ . We often refer to the indices of the process as “time” indices. We recall that the process $\{Z_{n}\}_{n=1}^{\infty}$ is stationary if for any $n\in\mathbb{Z}_{\geq 1}$ , its $n$ -fold joint probability distribution (i.e., the distribution of $(Z_{1},...,Z_{n})$ ) is invariant to time shifts. Further, $\{Z_{n}\}_{n=1}^{\infty}$ is exchangeable if for any $n\in\mathbb{Z}_{\geq 1}$ , its $n$ -fold joint distribution is invariant to permutations of the indices $1,...,n$ . It directly follows from the definitions that an exchangeable process is stationary. Lastly, the process $\{Z_{n}\}_{n=1}^{\infty}$ is called a martingale (resp. supermartingale, submartingale) with respect to the process $\{Y_{n}\}_{n=1}^{\infty}$ if $E[|Z_{n}|]<\infty$ and $E[Z_{n+1}|Y_{n}]=Z_{n}$ almost surely (resp. less than or equal to, greater than or equal to), for all $n$ . Precise definitions of all notions, including that of ergodicity, can be found in standard texts (e.g., [15, 16]).

We now recall the classical version of the Polya contagion process [8, 10]. Consider an urn with $R\in\mathbb{Z}_{>0}$ red balls and $B\in\mathbb{Z}_{>0}$ black balls. We denote the total number of balls by $T$, i.e., $T=R+B$. At each time step, a ball is drawn from the urn. The ball is then returned along with $\Delta>0$ balls of the same colour. We use an indicator $Z_{n}$ to denote the colour of the ball in the $n$th draw:

$$Z_{n}=\begin{cases}1&\text{if the $n$th draw is red}\\0&\text{if the $n$th draw is black.}\end{cases}$$

Let $U_{n}$ denote the proportion of red balls in the urn after the $n$ th draw. Then

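$$U_{n}=\frac{R+\Delta\sum_{t=1}^{n}Z_{t}}{T+n\Delta}=\frac{\rho_{c}+\delta_{c}\sum_{t=1}^{n}Z_{t}}{1+n\delta_{c}},$$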

where $\rho_{c}=\frac{R}{T}$ is the initial proportion of red balls in the urn and $\delta_{c}=\frac{\Delta}{T}$ is a correlation parameter. Since we draw balls from this urn at each time step, the conditional probability of drawing a red ball at time $n$ , given $Z^{n-1}=(Z_{1},\cdots,Z_{n-1})$ , is given by

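$$P(Z_{n}=1\mid Z^{n-1})=\frac{R+\Delta\sum_{t=1}^{n-1}Z_{t}}{T+(n-1)\Delta}=\frac{\rho_{c}+\delta_{c}\sum_{t=1}^{n-1}Z_{t}}{1+(n-1)\delta_{c}}=U_{n-1}.$$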

It can be easily shown that $\{U_{n}\}_{n=1}^{\infty}$ is a martingale [17]. The process $\{Z_{n}\}_{n=1}^{\infty}$ , whose $n$ -fold joint distribution is denoted by $Q_{\rho_{c},\delta_{c}}^{(n)}$ , is also exchangeable (hence stationary) and non-ergodic with both $U_{n}$ and the process sample average $\frac{1}{n}\sum_{i=1}^{n}Z_{i}$ converging almost surely as $n\rightarrow\infty$ to a random variable governed by the Beta distribution with parameters $\frac{\rho_{c}}{\delta_{c}}$ and $\frac{1-\rho_{c}}{\delta_{c}}$ ; we denote this probability density function (pdf) by $\mathsf{Beta}(\frac{\rho_{c}}{\delta_{c}},\frac{1-\rho_{c}}{\delta_{c}})$ [17, 18]. Lastly, the $1$ -dimensional distribution of the Polya process is $Q_{\rho_{c},\delta_{c}}^{(1)}(a)=P(Z_{n}=a)=(\rho_{c})^{a}(1-\rho_{c})^{1-a}$ , for all $n\in\mathbb{Z}_{\geq 1}$ and $a\in\{0,1\}$ . The above classical Polya process $\{Z_{n}\}_{n=1}^{\infty}$ is fully described by its parameters $\rho_{c}$ and $\delta_{c}$ , and thus we denote it by $\mathsf{Polya}(\rho_{c},\delta_{c})$ .
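As a small numerical illustration of the exchangeability and of the $1$-dimensional distribution recalled above, the following sketch computes exact sequence probabilities for a classical Polya urn with rational arithmetic. The function names and the values of `R`, `B` and `delta` are illustrative assumptions, not taken from the paper.

```python
from fractions import Fraction
from itertools import permutations, product

def sequence_probability(seq, R, B, delta):
    """Exact probability that the first len(seq) draws of a classical Polya urn equal seq.

    seq holds 1 for a red draw and 0 for a black draw; the drawn ball is returned
    together with delta extra balls of the same colour.
    """
    red, total = Fraction(R), Fraction(R + B)
    prob = Fraction(1)
    for z in seq:
        p_red = red / total              # conditional probability of red, i.e. U_{n-1}
        prob *= p_red if z else 1 - p_red
        red += delta * z                 # reinforce the drawn colour
        total += delta
    return prob

R, B, delta = 2, 3, 1                    # illustrative values (assumption)
rho_c = Fraction(R, R + B)

# Exchangeability: every ordering of the same draws has the same probability.
orderings = {sequence_probability(p, R, B, delta) for p in permutations((1, 1, 0, 0, 0))}

def marginal_red(n):
    """P(Z_n = 1), obtained by summing over all 2^(n-1) earlier histories."""
    return sum(sequence_probability(past + (1,), R, B, delta)
               for past in product((0, 1), repeat=n - 1))
```

Under these assumptions the set `orderings` contains a single value, and `marginal_red(n)` equals `rho_c` for every `n` tried, in line with $P(Z_{n}=1)=\rho_{c}$.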

III Network Polya Contagion Process

In this section, we introduce a generalization of the Polya contagion process to networks, where each individual node in the underlying graph that describes the network topology is still equipped with an urn; however, the node’s neighbouring structure affects the evolution of its process. This model hence captures spatial contagion, since infected neighbours increase the chance of a node being infected in the future.

Consider an undirected graph ${\mathcal{G}}=(V,{\mathcal{E}})$ , where $V=\{1,\ldots,N\}$ is the set of $N\in\mathbb{Z}_{\geq 1}$ nodes and ${\mathcal{E}}\subset V\times V$ is the set of edges. We assume that ${\mathcal{G}}$ is connected, i.e., there is a path between any two nodes in ${\mathcal{G}}$ . We use $\mathcal{N}_{i}$ to denote the set of nodes that are neighbours to node $i$ , that is $\mathcal{N}_{i}=\{v\in V:(i,v)\in{\mathcal{E}}\}$ , and $\mathcal{N}_{i}^{\prime}=\{i\}\cup\mathcal{N}_{i}$ . If $\mathcal{N}_{i}^{\prime}=V$ for all $i\in V$ , the network is called complete; if $|\mathcal{N}_{i}|=|\mathcal{N}_{j}|$ for all $i,j\in V$ , we call it regular. Each node $i\in V$ is equipped with an urn, initially with $R_{i}\in\mathbb{Z}_{>0}$ red balls and $B_{i}\in\mathbb{Z}_{>0}$ black balls (we do not let $R_{i}=0$ or $B_{i}=0$ to avoid any degenerate cases). We let $T_{i}=R_{i}+B_{i}$ be the total number of balls in the $i$ th urn, $i\in\{1,\cdots,N\}$ . We use $Z_{i,n}$ as an indicator for the ball drawn for node $i$ at time $n$ :

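$$Z_{i,n}=\begin{cases}1&\text{if the $n$th draw for node $i$ is red}\\0&\text{if the $n$th draw for node $i$ is black.}\end{cases}$$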

However, instead of drawing solely from its own urn, each node draws simultaneously from a “super urn” created by combining all the balls in its own urn with the balls in its neighbours’ urns; see Figure 1. This allows the spatial relationships between nodes to influence their state. This means that $Z_{i,n}$ is the indicator for a ball drawn from node $i$ ’s super urn, and not its individual urn. Hence, the super urn of node $i$ initially has $\bar{R}_{i}=\sum_{j\in\mathcal{N}_{i}^{\prime}}R_{j}$ red balls, $\bar{B}_{i}=\sum_{j\in\mathcal{N}_{i}^{\prime}}B_{j}$ black balls, and $\bar{T}_{i}=\sum_{j\in\mathcal{N}_{i}^{\prime}}T_{j}$ balls in total.

We further consider a time-varying version of the classical Polya contagion process, following [19], where at time $t$ for node $i\in V$ , after a red ball is drawn it is returned along with $\Delta_{r,i}(t)$ red balls to node $i$ ’s urn, and $\Delta_{b,i}(t)$ black balls along with the drawn ball are added to node $i$ ’s urn when a black ball is drawn. When $\Delta_{r,i}(t)=\Delta_{b,i}(t)$ for all $t\in\mathbb{Z}_{\geq 1}$ , we write $\Delta_{i}(t)$ instead; if the $\Delta$ ’s are not node-dependent, we omit the node index. We assume throughout that $\Delta_{r,i}(t)\geq 0,\Delta_{b,i}(t)\geq 0$ , for all $t\in\mathbb{Z}_{\geq 1}$ and that there exists $i\in V$ and $t$ such that $\Delta_{r,i}(t)+\Delta_{b,i}(t)\neq 0$ ; otherwise we are simply sampling with replacement.

In the context of epidemics, the red and black balls in an urn, respectively, represent units of “infection” and “healthiness”; for example, bacteria and white blood cells. In a super urn, the bacteria can infect others in the area and the white blood cells contribute to the overall health in the neighbourhood of an individual. Drawing red at time $t$ means the bacteria in the neighbourhood reproduced successfully and so the individual became more infected; otherwise the individual became healthier since the white blood cells reproduced. Thus when $Z_{i,n}=1$, we declare that node $i$ is infected at time $n$, and if $Z_{i,n}=0$, then it is healthy. We add more units of bacteria once they reproduce, but commonly assume this number, $\Delta_{r,i}(t)$, is the same across all individuals and time because the bacteria do not evolve or become altered. The number of white blood cells created, $\Delta_{b,i}(t)$, may change since we can give more medicine to certain people to increase their immune response, or vaccinate them so they are better able to fight the disease.

To express the proportion of red balls in the individual urns of the nodes, we define the random vector $U_{n}=(U_{1,n},\ldots,U_{N,n})$ , where $U_{i,n}$ is the proportion of red balls in node $i$ ’s urn after the $n$ th draw, $i\in V$ . For node $i$ ,

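$$U_{i,n}=\frac{R_{i}+\sum_{t=1}^{n}Z_{i,t}\Delta_{r,i}(t)}{T_{i}+\sum_{t=1}^{n}\left[Z_{i,t}\Delta_{r,i}(t)+(1-Z_{i,t})\Delta_{b,i}(t)\right]},$$

where the numerator represents the total number of red balls in node $i$’s urn after the $n$th draw, while the denominator is the total number of balls in the same urn. Note that $U_{i,0}=\frac{R_{i}}{T_{i}}$ is the initial proportion of red balls in node $i$’s urn. For ease of notation, let

$$X_{j,n}=T_{j}+\sum_{t=1}^{n}\left[Z_{j,t}\Delta_{r,j}(t)+(1-Z_{j,t})\Delta_{b,j}(t)\right].$$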

Furthermore, we define the random vector $S_{n}=(S_{1,n},...,S_{N,n})$ as the proportion of red balls in the super urns of the nodes after the $n$ th draw, so that $S_{i,n}$ is the proportion of red balls in node $i$ ’s super urn after $n$ draws. Hence, for node $i$ ,

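$$S_{i,n}=\frac{\bar{R}_{i}+\sum_{j\in\mathcal{N}_{i}^{\prime}}\sum_{t=1}^{n}Z_{j,t}\Delta_{r,j}(t)}{\sum_{j\in\mathcal{N}_{i}^{\prime}}X_{j,n}}=\frac{\sum_{j\in\mathcal{N}_{i}^{\prime}}U_{j,n}X_{j,n}}{\sum_{j\in\mathcal{N}_{i}^{\prime}}X_{j,n}}.$$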

Note that $S_{i,0}=\frac{\bar{R}_{i}}{\bar{T}_{i}}$. $S_{i,n}$ is in fact a function of the random draw variables of the network, and in particular of $\{Z_{j}^{n}\}_{j\in\mathcal{N}_{i}^{\prime}}$, but for ease of notation, when the arguments are clear, we write $S_{i,n}(Z_{1}^{n},\cdots,Z_{N}^{n})=S_{i,n}$. Then the conditional probability of drawing a red ball from the super urn of node $i$ at time $n$ given the complete network history, i.e. given all the past $n-1$ draw variables for each node in the network $\{Z_{j}^{n-1}\}_{j=1}^{N}=\{(Z_{1,1},\cdots,Z_{1,n-1}),\cdots,(Z_{N,1},\cdots,Z_{N,n-1})\}$, satisfies

$$P\big(Z_{i,n}=1\mid\{Z_{j}^{n-1}\}_{j=1}^{N}\big)=\frac{\bar{R}_{i}+\sum_{j\in\mathcal{N}_{i}^{\prime}}\sum_{t=1}^{n-1}Z_{j,t}\Delta_{r,j}(t)}{\sum_{j\in\mathcal{N}_{i}^{\prime}}X_{j,n-1}}=S_{i,n-1}.\tag{4}$$

That is, the conditional probability of drawing a red ball for node $i$ at time $n$ given the entire past $\{Z_{j}^{n-1}\}_{j=1}^{N}$ is the proportion of red balls in its super urn, $S_{i,n-1}$. This is analogous to the classical Polya case, but instead of relying on the individual proportion of red balls $U_{n}$ to describe the conditional probability of drawing red balls, we use the super urn proportion of red balls since we now draw from there.
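The sampling mechanism described above can be sketched in a few lines: every node draws red with probability equal to its super urn proportion, and the added balls go into the node's own urn. This is a minimal simulation under assumed conventions (adjacency lists, constant `delta_r` and `delta_b`, a hypothetical 3-node path graph), not the authors' code.

```python
import random

def super_urn_proportions(red, total, neighbours):
    """Return S_i for each node: the red fraction of its own urn pooled with its neighbours' urns."""
    proportions = []
    for i, nbrs in enumerate(neighbours):
        closed = [i, *nbrs]                      # closed neighbourhood: node i and its neighbours
        proportions.append(sum(red[j] for j in closed) / sum(total[j] for j in closed))
    return proportions

def simulate_network(R, B, neighbours, delta_r, delta_b, steps, rng):
    """Simulate the draws Z_{i,n}; balls are added to each node's own urn only."""
    red = list(R)
    total = [r + b for r, b in zip(R, B)]
    history = []
    for _ in range(steps):
        S = super_urn_proportions(red, total, neighbours)    # S_{i,n-1}
        draws = [1 if rng.random() < s else 0 for s in S]    # all nodes draw simultaneously
        for i, z in enumerate(draws):
            added = delta_r if z else delta_b
            red[i] += added * z
            total[i] += added
        history.append(draws)
    return history, red, total

# Hypothetical 3-node path graph 0 - 1 - 2 (assumption for illustration).
neighbours = [[1], [0, 2], [1]]
R, B = [1, 2, 3], [3, 2, 1]
initial_S = super_urn_proportions(R, [4, 4, 4], neighbours)
history, red, total = simulate_network(R, B, neighbours, 2, 2, 50, random.Random(1))
```

For this example the initial super urn proportions are 3/8, 1/2 and 5/8, so the middle node starts at the average of its two neighbours even though every individual urn has a different composition.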

Remark III.1

(Non-Markovity): While (4) may appear to suggest some sort of Markovity property, the process is non-Markovian in general. This can easily be seen due to the fact that a draw at time $n$ requires knowledge of all previous draws for the entire neighbourhood.

A main objective throughout the rest of this paper is to study the evolution and stochastic properties of the process defined above. Using the above conditional probability, we can determine the $n$ -fold joint probability of the entire network ${\mathcal{G}}$ : for $a_{i}^{n}\in\{0,1\}^{n}$ , $i\in\{1,...,N\}$ , we have that

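$$\begin{aligned}P_{\mathcal{G}}^{(n)}(a_{1}^{n},\dots,a_{N}^{n})&:=P\big(\{Z_{i}^{n}=a_{i}^{n}\}_{i=1}^{N}\big)\\&=\prod_{t=1}^{n}P\big(\{Z_{i,t}=a_{i,t}\}_{i=1}^{N}\mid\{Z_{i}^{t-1}=a_{i}^{t-1}\}_{i=1}^{N}\big)\\&=\prod_{t=1}^{n}\prod_{i=1}^{N}\big(S_{i,t-1}\big)^{a_{i,t}}\big(1-S_{i,t-1}\big)^{1-a_{i,t}},\end{aligned}\tag{9}$$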

where $S_{i,t}=S_{i,t}(a_{1}^{t},\cdots,a_{N}^{t})$ is defined in (4). With the above explicit joint distribution, it is possible to determine the distributions of each node’s process. More specifically, using (9), the $n$ -fold distribution of node $i$ ’s process at time $t\geq n$ is

[TABLE]
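For small networks, the joint distribution of the whole network can be evaluated exactly by multiplying, over time steps and nodes, the super urn proportion $S_{i,t-1}$ for a red draw and $1-S_{i,t-1}$ for a black draw. The sketch below does this with rational arithmetic; it assumes $\Delta_{r,i}(t)=\Delta_{b,i}(t)=\Delta$, and the graph and urn contents are hypothetical.

```python
from fractions import Fraction
from itertools import product

def joint_probability(a, R, B, neighbours, delta):
    """Exact probability that node i's draws are a[i][0..n-1], for every node i.

    Assumes delta_r = delta_b = delta for all nodes and times (illustrative simplification).
    """
    N, n = len(a), len(a[0])
    red = [Fraction(r) for r in R]
    total = [Fraction(r + b) for r, b in zip(R, B)]
    prob = Fraction(1)
    for t in range(n):
        # Super urn proportions computed from the history before this draw.
        S = []
        for i in range(N):
            closed = [i, *neighbours[i]]
            S.append(sum(red[j] for j in closed) / sum(total[j] for j in closed))
        for i in range(N):
            prob *= S[i] if a[i][t] else 1 - S[i]
        for i in range(N):
            red[i] += delta * a[i][t]
            total[i] += delta
    return prob

# Hypothetical 3-node path graph 0 - 1 - 2, two time steps.
neighbours = [[1], [0, 2], [1]]
R, B, delta, n = [1, 2, 3], [3, 2, 1], 1, 2
outcomes = list(product(product((0, 1), repeat=n), repeat=3))
total_mass = sum(joint_probability(a, R, B, neighbours, delta) for a in outcomes)
```

Summing `joint_probability` over the histories of all other nodes gives a single node's distribution; the cost grows as $2^{Nn}$, so this brute-force approach is only practical for very small $N$ and $n$.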

In order to measure the spread of contagion in the network at any given time, we wish to see how likely it is, on average, for a node to be infected at that instant. We hence define the average infection rate in the network at time $n$ as the average marginal probability of drawing a red ball,

$$\tilde{I}_{n}:=\frac{1}{N}\sum_{i=1}^{N}P(Z_{i,n}=1)=\frac{1}{N}\sum_{i=1}^{N}P_{i,n}^{(1)}(1).$$

Note that $\tilde{I}_{n}$ is a function of the network topology $(V,{\mathcal{E}})$ , the initial placement of balls $R_{i}$ and $B_{i}$ , the draw processes $\{Z_{i,t}\}_{t=1}^{n}$ , and number of balls added $\{\Delta_{r,i}(t)\}_{t=1}^{n}$ and $\{\Delta_{b,i}(t)\}_{t=1}^{n}$ for each node $i\in V$ . Unfortunately for an arbitrary network, the above quantity does not yield an exact analytical formula (except in the simple case of complete networks). As such, in general it is hard to mathematically analyze the asymptotic behaviour of $\tilde{I}_{n}$ , which we wish to minimize when attempting to cure an epidemic. Instead we examine the asymptotic stochastic behavior of a closely related variable given by the average individual proportion of red balls at time $n$ , namely

$$\tilde{U}_{n}:=\frac{1}{N}\sum_{i=1}^{N}U_{i,n},$$

which we call the network susceptibility. This quantity is related to the conditional probability of drawing a red ball, as seen in (2). Since the individual urn of node $i$ is in every super urn in the neighbourhood, if $U_{i,n}$ increases then $S_{j,n}$ increases for every $j\in\mathcal{N}_{i}^{\prime}$ , and hence given the past history those nodes are more likely to exhibit infected behaviour as seen from (4). Note that similarly to $\tilde{I}_{n}$ , $\tilde{U}_{n}$ is a function of the network variables.

Remark III.2

(Finite Memory): It is worth pointing out that a practical adaptation to our model can be considered, where urns have “finite memory” in the sense that the balls added after each draw are only kept in each node’s urn for a finite number of future draws. This model is developed in [18] for the classical Polya process in the context of modelling communication channels, where it is shown that the resulting finite memory contagion process is stationary, Markovian and ergodic. We present the following result that states that in this case the entire state is Markovian and hence it is a limited reinforcement model, but leave an in-depth investigation to a future work. $\bullet$

Proposition III.3

(Finite Memory Markovity): The entire state of the network Polya contagion process $\{Z_{n}\}_{n=1}^{\infty}$ with finite memory $M$ is Markovian with memory $M$.

Proof:

By (1) and (4) and the fact that added balls are removed after $M$ steps, we have for $n>M$ that

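$$\begin{aligned}P\big(Z_{i,n}=1\mid\{Z_{j}^{n-1}\}_{j=1}^{N}\big)&=\frac{\bar{R}_{i}+\sum_{j\in\mathcal{N}_{i}^{\prime}}\sum_{t=n-M}^{n-1}Z_{j,t}\Delta_{r,j}(t)}{\bar{T}_{i}+\sum_{j\in\mathcal{N}_{i}^{\prime}}\sum_{t=n-M}^{n-1}\left[Z_{j,t}\Delta_{r,j}(t)+(1-Z_{j,t})\Delta_{b,j}(t)\right]}\\&=P\big(Z_{i,n}=1\mid\{Z_{j,n-M}^{n-1}\}_{j=1}^{N}\big).\end{aligned}$$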

Using the above result along with conditional independence, for $(a_{1},\ldots,a_{N})\in\{0,1\}^{N}$ we have for $n>M$ that

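$$\begin{aligned}P\big(Z_{1,n}=a_{1},\dots,Z_{N,n}=a_{N}\mid\{Z_{j}^{n-1}\}_{j=1}^{N}\big)&=\prod_{i=1}^{N}P\big(Z_{i,n}=a_{i}\mid\{Z_{j}^{n-1}\}_{j=1}^{N}\big)\\&=\prod_{i=1}^{N}P\big(Z_{i,n}=a_{i}\mid\{Z_{j,n-M}^{n-1}\}_{j=1}^{N}\big)\\&=P\big(Z_{1,n}=a_{1},\dots,Z_{N,n}=a_{N}\mid\{Z_{j,n-M}^{n-1}\}_{j=1}^{N}\big),\end{aligned}$$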

and hence the entire network process $\{Z_{n}\}_{n=1}^{\infty}$ is Markovian with memory $M$ . ∎
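To make the finite memory mechanism concrete, the sketch below keeps only the last $M$ draws of each node, so the urn contents at any time depend on nothing older than $M$ steps. It is an illustrative implementation under the assumption $\Delta_{r,i}(t)=\Delta_{b,i}(t)=\Delta$; the function name and the example graph are hypothetical.

```python
import random
from collections import deque

def simulate_finite_memory(R, B, neighbours, delta, M, steps, rng):
    """Network Polya draws where balls added at a draw are removed again after M further draws."""
    N = len(R)
    total0 = [r + b for r, b in zip(R, B)]
    window = [deque(maxlen=M) for _ in range(N)]     # last M draws of each node
    history, totals = [], []
    for _ in range(steps):
        # Urn contents depend only on the initial balls and the draws still in the window.
        red = [R[i] + delta * sum(window[i]) for i in range(N)]
        total = [total0[i] + delta * len(window[i]) for i in range(N)]
        draws = []
        for i in range(N):
            closed = [i, *neighbours[i]]
            s = sum(red[j] for j in closed) / sum(total[j] for j in closed)
            draws.append(1 if rng.random() < s else 0)
        for i, z in enumerate(draws):
            window[i].append(z)                      # the oldest draw drops out automatically
        history.append(draws)
        totals.append([total0[i] + delta * len(window[i]) for i in range(N)])
    return history, totals

# Hypothetical 3-node path graph with memory M = 3 and delta = 2.
neighbours = [[1], [0, 2], [1]]
history, totals = simulate_finite_memory([1, 2, 3], [3, 2, 1], neighbours, 2, 3, 20, random.Random(7))
```

Because each urn never holds more than $T_{i}+M\Delta$ balls here, the reinforcement is bounded, which is one way to read the "limited reinforcement" description in Remark III.2.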

IV Stochastic Properties

We next examine the stochastic properties of the network contagion process. We assume throughout the beginning of this section that $\Delta_{r,i}(t)=\Delta_{b,i}(t)=\Delta>0$, for all $i\in V$ and times $t$; that is, the net numbers of red and black balls added are equal and constant in time for all nodes. In the case of a complete network, the composition of every node’s super urn is identical, since there is only one super urn that is being drawn from. Thus for a complete network the super urn model is analogous to one urn where multiple draws occur with replacement, which has been recently studied in detail [20]. However, the analysis in [20] is carried out in an aggregate sense, i.e., only for the entire urn and not individual processes. Unfortunately, this aggregate approach does not work in a network setting, whereas the super urn model proposed here is applicable.

IV-A Complete Network Marginal Distributions

We first focus on the special case of complete networks to derive some useful probability distributions; later on, we will obtain other stochastic properties that apply to more general networks.

Given that the network is complete, we focus on one of the nodes, say $i\in V$ . For ease of notation, we define $\bar{T}_{j}=\sum_{k=1}^{N}T_{k}=:\bar{T}$ , and similarly, $\bar{R}_{j}=:\bar{R},\bar{B}_{j}=:\bar{B}$ , for all $j\in V$ . Defining the events $A_{n-1}=\{Z_{i,n-1}=a_{n-1},...,Z_{i,1}=a_{1}\}$ and $W_{n-1}=\{A_{n-1},\{Z_{j,1}^{n-1}=b_{j}^{n-1}\}_{j\neq i}\}$ with $b_{j}^{n-1}\in\{0,1\}^{n-1}$ , and parameters $\rho=\frac{\bar{R}}{\bar{T}}$ and $\delta=\frac{N\Delta}{\bar{T}}$ , we can write using (4) under the above assumptions, that

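$$\begin{aligned}P(Z_{i,n}=1,A_{n-1})&=\sum_{\substack{b_{j}^{n-1}\in\{0,1\}^{n-1}:\\ j\neq i}}P(Z_{i,n}=1\mid W_{n-1})\,P(W_{n-1})\\&=\sum_{\substack{b_{j}^{n-1}\in\{0,1\}^{n-1}:\\ j\neq i}}\frac{\rho+\frac{\delta}{N}\sum_{t=1}^{n-1}\big(a_{t}+\sum_{j\neq i}b_{j,t}\big)}{1+(n-1)\delta}\,P(W_{n-1})\\&=\sum_{\substack{b_{j}^{n-1}\in\{0,1\}^{n-1}:\\ j\neq i}}\Bigg[\frac{\rho P(A_{n-1},\{Z_{j}^{n-1}=b_{j}^{n-1}\}_{j\neq i})}{1+(n-1)\delta}\\&\qquad+\frac{\delta}{N}\sum_{t=1}^{n-1}\Bigg(a_{t}\frac{P(A_{n-1},\{Z_{j}^{n-1}=b_{j}^{n-1}\}_{j\neq i})}{1+(n-1)\delta}+\sum_{j\neq i}\frac{b_{j,t}P(A_{n-1},\{Z_{j}^{n-1}=b_{j}^{n-1}\}_{j\neq i})}{1+(n-1)\delta}\Bigg)\Bigg].\end{aligned}$$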

By examining an arbitrary term $k\neq i$ in the final sum above, for fixed $t\in\{1,...,n-1\}$ , we can sum out all the other draw variables:

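$$\begin{aligned}\sum_{\substack{b_{j}^{n-1}\in\{0,1\}^{n-1}:\\ j\neq i}}b_{k,t}P(A_{n-1},\{Z_{j}^{n-1}=b_{j}^{n-1}\}_{j\neq i})&=\sum_{b_{k}^{n-1}\in\{0,1\}^{n-1}}b_{k,t}P(A_{n-1},Z_{k}^{n-1}=b_{k}^{n-1})\\&=\sum_{b_{k,t}\in\{0,1\}}b_{k,t}P(A_{n-1},Z_{k,t}=b_{k,t})\\&=P(A_{n-1},Z_{k,t}=1).\end{aligned}$$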

Further, by the law of total probability,

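$$\sum_{\substack{b_{j}^{n-1}\in\{0,1\}^{n-1}:\\ j\neq i}}P(A_{n-1},\{Z_{j}^{n-1}=b_{j}^{n-1}\}_{j\neq i})=P(A_{n-1}).$$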

So using (17) and (20), (10) becomes

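$$P(Z_{i,n}=1,A_{n-1})=\frac{\rho P(A_{n-1})+\frac{\delta}{N}\sum_{t=1}^{n-1}\left[a_{t}P(A_{n-1})+\sum_{j\neq i}P(A_{n-1},Z_{j,t}=1)\right]}{1+(n-1)\delta}.$$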

Thus, using the law of total probability, we have

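$$P(Z_{i,n}=1)=\sum_{a^{n-1}}\left[\frac{\rho P(A_{n-1})+\frac{\delta}{N}\sum_{t=1}^{n-1}a_{t}P(A_{n-1})}{1+(n-1)\delta}+\frac{\frac{\delta}{N}\sum_{t=1}^{n-1}\sum_{j\neq i}P(A_{n-1},Z_{j,t}=1)}{1+(n-1)\delta}\right].$$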

An interesting corollary of this derivation is as follows.

Lemma IV.1

(Complete Network Marginal Distribution): The $1$-dimensional marginal distribution of node $i$’s contagion draw process $\{Z_{i,n}\}_{n=1}^{\infty}$ for the $N$-node complete network is given by

$$P_{i,n}^{(1)}(a)=P(Z_{i,n}=a)=\rho^{a}(1-\rho)^{1-a},$$

where $i\in V$ , $n\geq 1$ , and $a\in\{0,1\}$ .

Proof:

We proceed using strong induction on $n\geq 1$ , showing that $P(Z_{i,n}=1)=\rho$ , for all nodes $i\in V$ and all $n$ . The base case readily holds, since at time $n=1$ ,

[TABLE]

Now, assuming that $P(Z_{j,t}=1)=\rho$ for all $j\in V$ and $t\leq n$ and using (21), we have

[TABLE]

which completes the induction argument. The result now follows using the fact that

[TABLE]

for all $j\in V$ and all $n$ . ∎
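Lemma IV.1 can be checked numerically for small cases. In a complete network all nodes share one super urn, so it suffices to track the total number of red draws so far; the sketch below propagates that distribution exactly and returns $P(Z_{i,n}=1)$ for each $n$. The function name and parameter values are illustrative assumptions.

```python
from fractions import Fraction
from math import comb

def marginal_red_probabilities(N, R_bar, T_bar, delta_balls, n_max):
    """Return [P(Z_{i,1}=1), ..., P(Z_{i,n_max}=1)] for an N-node complete network.

    All nodes share one super urn, so the state is k, the total number of red draws so far.
    """
    dist = {0: Fraction(1)}                          # distribution of k before the first draw
    marginals = []
    for step in range(n_max):
        total = T_bar + N * step * delta_balls       # balls in the shared super urn
        new_dist, p_red = {}, Fraction(0)
        for k, pk in dist.items():
            s = Fraction(R_bar + delta_balls * k, total)   # super urn proportion
            p_red += pk * s
            for m in range(N + 1):                   # m of the N nodes draw red
                w = pk * comb(N, m) * s**m * (1 - s)**(N - m)
                new_dist[k + m] = new_dist.get(k + m, Fraction(0)) + w
        marginals.append(p_red)
        dist = new_dist
    return marginals

N, R_bar, T_bar, delta_balls = 4, 3, 10, 2           # illustrative values (assumption)
rho = Fraction(R_bar, T_bar)
marginals = marginal_red_probabilities(N, R_bar, T_bar, delta_balls, 8)
```

With these values every entry of `marginals` equals `rho` exactly, which matches the time-invariant marginal stated in the lemma.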

We next show that each node’s draw process is not stationary in general, and hence is different from the classical $\mathsf{Polya}(\rho_{c},\delta_{c})$ process.

Remark IV.2

(Non-Stationarity of the Network Contagion Process): Consider a $2$-node complete network. Then, using (9), one can obtain (after some simplifications) that

[TABLE]

and hence the network process is not stationary. $\bullet$
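One way to see the non-stationarity concretely is to compare the probability of two consecutive red draws at times $(1,2)$ with the same probability at times $(2,3)$ in a $2$-node complete network. The sketch below enumerates all joint histories exactly; the urn contents (one red and one black ball per node, $\Delta=1$) are an assumed example and may differ from the case worked out in the paper.

```python
from fractions import Fraction
from itertools import product

def pair_probability(start, R_bar, T_bar, delta_balls):
    """P(Z_{1,start} = 1, Z_{1,start+1} = 1) for node 1 of a 2-node complete network."""
    steps = start + 1
    answer = Fraction(0)
    # Enumerate every joint history (z1, z2) of both nodes over the first `steps` draws.
    for hist in product(product((0, 1), repeat=2), repeat=steps):
        if hist[start - 1][0] != 1 or hist[start][0] != 1:
            continue
        prob, reds = Fraction(1), 0
        for t, (z1, z2) in enumerate(hist):
            # Shared super urn proportion before draw t + 1.
            s = Fraction(R_bar + delta_balls * reds, T_bar + 2 * t * delta_balls)
            prob *= (s if z1 else 1 - s) * (s if z2 else 1 - s)
            reds += z1 + z2
        answer += prob
    return answer

first_pair = pair_probability(1, 2, 4, 1)     # draws at times (1, 2)
second_pair = pair_probability(2, 2, 4, 1)    # draws at times (2, 3)
```

For this example the two values are 7/24 and 169/576, so the $2$-dimensional distribution of consecutive draws is not invariant to a time shift, even though each single draw is red with probability 1/2.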

Since every exchangeable process is necessarily stationary, Remark IV.2 implies that the network Polya process is not exchangeable in general. However, some notions of stationarity remain; in our next result, we will see that there is a consistent relationship between the draws at the $1$ st and $n$ th time steps.

Lemma IV.3

(Complete Network $(n,1)$-step Marginal Probability): For the complete network, the $2$-dimensional marginal probability that node $i$’s draw variables at times $n$ and $1$ are both one is given by

[TABLE]

for $i\in V$ , $n\geq 2$ . Furthermore, for any other node $k$ ,

[TABLE]

Proof:

By Lemma IV.1 we have that $P(Z_{k,1}=1)=\rho$ for all $k\in V$ , so it is enough to show that

[TABLE]

for all $n$ and nodes $i$ and $k$. Using the law of total probability and (4), defining $W_{n-1}=\{Z_{i,2}^{n-1}=a_{2}^{n-1},\{Z_{j,1}^{n-1}=b_{j,1}^{n-1}\}_{j\neq i}\}$ (note that $W_{n-1}$ is a function of $\{b_{j,1}^{n-1}\}_{j\neq i}$, but for simplicity we omit this), and after some simplifications, we have that

[TABLE]

Then, after arranging terms and using the law of total probability for

[TABLE]

we have

[TABLE]

It can be similarly shown by symmetry of the complete network that (26) holds for $P(Z_{k,n}=1\ |\ Z_{i,1}=1)$ if $k\neq i$ .

In order to show (25), we proceed using strong induction on $n\geq 2$ . For the base case, setting $n=2$ in (26), we have for any $i,k\in V$ ,

[TABLE]

as desired. Assume now that $P(Z_{k,t}=1\ |\ Z_{i,1}=1)$ is given by (25), for $2\leq t\leq n-1$ and any $i,k\in V$ . Then by (26),

[TABLE]

which completes the induction argument. ∎

Although the draw process is not stationary in general, simulated results suggest that it satisfies some asymptotic stationarity properties, in the sense that given sufficient time the process settles and deviations become very small in magnitude. A representative example is shown in Figure 2 for the $2$ -dimensional distribution at times $n$ and $n-1$ in the 5-node network shown in Figure 3(d).

IV-B Martingale Theorems

We now turn our attention to the martingale properties of the network contagion process, where we do not assume that the network is necessarily complete. Recall that by the martingale convergence theorem [15, 16], if a process $\{Z_{n}\}_{n=1}^{\infty}$ is a martingale (or supermartingale, or submartingale), there exists a random variable $Z$ such that $\{Z_{n}\}_{n=1}^{\infty}$ converges almost surely to $Z$ as $n\rightarrow\infty$ .

Theorem IV.4

(Individual Urn Proportion Martingale): For a network ${\mathcal{G}}=(V,{\mathcal{E}})$, $\Delta_{r,i}(n)=\Delta_{b,i}(n)=\Delta$ and $T_{i}=T$, for all $i\in V$ and all $n$, the individual proportion of red balls $\{U_{i,n}\}_{n=1}^{\infty}$ is a martingale with respect to the draws for the whole network $\{Z_{n}\}_{n=1}^{\infty}=\{(Z_{1,n},...,Z_{N,n})\}_{n=1}^{\infty}$ if and only if, almost surely,

[TABLE]

Proof:

Using the expression for $U_{i,n}$ , (2), and (4), we have almost surely

[TABLE]

This implies that $\{U_{i,n}\}_{n=1}^{\infty}$ is a martingale with respect to $\{Z_{n}\}_{n=1}^{\infty}$ if and only if

[TABLE]

almost surely. ∎

If the condition in Theorem IV.4 holds, then for any $i$ both $U_{i,n}$ and $\frac{1}{n}\sum_{t=1}^{n}Z_{i,t}$ converge almost surely to a limit as $n\rightarrow\infty$ . However, the condition of Theorem IV.4, barring the trivial single node scenario, is not verifiable. To resolve this issue, we instead examine the evolution of the average proportion of red balls (i.e., the susceptibility) in a regular network.

Theorem IV.5

(Regular Network Susceptibility Martingale): For a regular network ${\mathcal{G}}=(V,{\mathcal{E}})$ with $\Delta_{r,i}(n)=\Delta_{b,i}(n)=\Delta$ and $T_{i}=T$ for all nodes $i\in V$ and times $n$, the network susceptibility $\{\tilde{U}_{n}\}_{n=1}^{\infty}$, where $\tilde{U}_{n}=\frac{1}{N}\sum_{i=1}^{N}U_{i,n}$, is a martingale with respect to $\{Z_{n}\}_{n=1}^{\infty}$.

Proof:

We have, similar to Theorem IV.4, that

[TABLE]

Let us examine the second term of the last equality. If this term is zero, $\{\tilde{U}_{n}\}_{n=1}^{\infty}$ is a martingale with respect to $\{Z_{n}\}_{n=1}^{\infty}$ . We now define the adjacency matrix $[a_{ij}]$ of our network, where the $(i,j)$ th entry $a_{ij}$ is 1 if $(i,j)\in{\mathcal{E}}$ , and 0 otherwise. Since we assumed that our network was undirected, $[a_{ij}]$ is symmetric, i.e., $a_{ij}=a_{ji}$ for all $i,j\in V$ . So,

[TABLE]

Now, we examine the sum of the $(i,j)$ and $(j,i)$ components of the double sum, where $(i,j)\in{\mathcal{E}}$ (otherwise both terms are zero). Recall that $(i,i)\not\in{\mathcal{E}}$ , $\forall i$ . We have

[TABLE]

From above, it is clear that this term is zero for all $i$ and $j$ by setting $|\mathcal{N}_{j}|=|\mathcal{N}_{i}|$ , i.e. in any regular network, and so $\{\tilde{U}_{n}\}_{n=1}^{\infty}$ is a martingale with respect to $\{Z_{n}\}_{n=1}^{\infty}$ . ∎
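The one-step martingale identity of Theorem IV.5 can be checked numerically with exact rational arithmetic. The following is a minimal sketch, not the authors' code: it assumes that each node draws from its super urn (its own urn together with its neighbours' urns) and then adds $\Delta$ balls of the drawn colour to its own urn, and the function names, the 5-node ring, the star, and the urn contents are all hypothetical choices made for illustration.

```python
from fractions import Fraction

def closed_neighbourhoods(adj):
    """Return the closed neighbourhood (node plus its neighbours) of every node."""
    return {i: sorted(set(nbrs) | {i}) for i, nbrs in adj.items()}

def susceptibility(red, total):
    """Average proportion of red balls over all urns."""
    return sum(Fraction(red[i], total[i]) for i in red) / len(red)

def expected_next_susceptibility(red, total, delta, adj):
    """Expected susceptibility after one draw per node, given the current urns.

    Assumption (illustrative): Delta_r = Delta_b = delta, and node i draws red
    with probability equal to the red proportion of its super urn.
    """
    nbhd = closed_neighbourhoods(adj)
    exp_sum = Fraction(0)
    for i in adj:
        s_i = Fraction(sum(red[j] for j in nbhd[i]),
                       sum(total[j] for j in nbhd[i]))
        # Urn i gains delta red balls with probability s_i, delta black otherwise.
        exp_sum += (red[i] + delta * s_i) / (total[i] + delta)
    return exp_sum / len(adj)

# Hypothetical example: a 2-regular ring and a (non-regular) star on 5 nodes.
ring = {0: [1, 4], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3, 0]}
star = {0: [1, 2, 3, 4], 1: [0], 2: [0], 3: [0], 4: [0]}
red = {0: 1, 1: 7, 2: 3, 3: 9, 4: 2}
total = {i: 10 for i in red}

print(susceptibility(red, total))
print(expected_next_susceptibility(red, total, 2, ring))
print(expected_next_susceptibility(red, total, 2, star))
```

On the ring the conditional expectation equals the current susceptibility, as the theorem states for regular networks. For this particular urn configuration on the star the two values differ, which illustrates why the cancellation in the proof relies on $|\mathcal{N}_{j}|=|\mathcal{N}_{i}|$.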

We next allow the net number of black balls $\Delta_{b,i}(\cdot)$ to evolve stochastically in time as a function of the past draw history in the network in order to steer $\{U_{i,n}\}_{n=1}^{\infty}$ to a limit for every node $i$ .

Theorem IV.6

(Individual Urn Proportion Categories): In a general network ${\mathcal{G}}=(V,{\mathcal{E}})$ with $\Delta_{r,i}(n)=\Delta_{r}$ for all $n\in\mathbb{Z}_{\geq 1}$ and $i\in V$, if we choose $\{\Delta_{b,i}(n)\}_{n=1}^{\infty}$ so that

[TABLE]

almost surely for all $n\in\mathbb{Z}_{\geq 1}$ and $i\in V$ (resp. equal to, less than or equal to) then $\{U_{i,n}\}_{n=1}^{\infty}$ is a supermartingale (resp. martingale, submartingale) with respect to $\{Z_{n}\}_{n=1}^{\infty}$ .

Proof:

We will start with the case of a supermartingale. That is, we wish to show that almost surely for all $n\in\mathbb{Z}_{\geq 1}$ ,

[TABLE]

Define $\bar{Z}_{i,n}=\sum_{t=1}^{n}Z_{i,t}$ , and take $X_{i,n}$ as in (1). Then, we have almost surely

[TABLE]

since $X_{i,n}>0$ for all $n\in\mathbb{Z}_{\geq 1}$ almost surely, we can ignore it. Now, since $U_{i,n-1}$ is almost surely constant given $Z_{n-1}$ ,

[TABLE]

That is, we wish to check whether, almost surely,

[TABLE]

Now if

[TABLE]

almost surely, we have

[TABLE]

where the second to last equality comes from the fact that $E[Z_{i,n}|Z_{n-1}]=P(Z_{i,n}=1|Z_{n-1})=S_{i,n-1}$ almost surely by (4), and that $S_{i,n-1}$ is almost surely constant given $Z_{n-1}$ . Thus as long as $\Delta_{b,i}(n)$ obeys this bound almost surely for all $n\in\mathbb{Z}_{\geq 1}$ , $\{U_{i,n}\}_{n=1}^{\infty}$ is a supermartingale with respect to $\{Z_{n}\}_{n=1}^{\infty}$ . Similarly, if $\Delta_{b,i}(n)$ is almost surely equal (resp. less than or equal) to this bound, $\{U_{i,n}\}_{n=1}^{\infty}$ is a martingale (resp. submartingale) with respect to $\{Z_{n}\}_{n=1}^{\infty}$ . ∎

Theorem IV.6 tells us what bounds for $\{\Delta_{b,i}(n)\}_{n=1}^{\infty}$ must be obeyed almost surely to guarantee that $\{U_{i,n}\}_{n=1}^{\infty}$ admits an asymptotic limit for all $i\in V$ in any general network. For instance, this tells us that by choosing $\Delta_{b,i}(t)=0$ almost surely for all $i\in V$ and $t\in\mathbb{Z}_{\geq 1}$ , $\{U_{i,n}\}_{n=1}^{\infty}$ will be a submartingale and will converge to some limiting random variable. While this result is interesting for modelling contagion, it is especially useful in the context of curing.
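The $\Delta_{b,i}(t)=0$ example can be illustrated with a one-step calculation. The sketch below assumes the same update rule as the model description (a red draw adds $\Delta_{r}$ red balls to the node's urn, a black draw adds $\Delta_{b}$ black balls); the path network, urn contents, and function name are hypothetical. It is a spot check on one configuration, not a proof.

```python
from fractions import Fraction

def expected_next_proportion(i, red, total, delta_r, delta_b, adj):
    """One-step conditional expectation of node i's red-ball proportion.

    Assumption (illustrative): node i draws red with probability equal to the
    red proportion of its super urn (own urn plus neighbours' urns).
    """
    nbhd = set(adj[i]) | {i}
    s_i = Fraction(sum(red[j] for j in nbhd), sum(total[j] for j in nbhd))
    after_red = Fraction(red[i] + delta_r, total[i] + delta_r)
    after_black = Fraction(red[i], total[i] + delta_b)
    return s_i * after_red + (1 - s_i) * after_black

# Hypothetical 3-node path with unequal urn sizes.
path = {0: [1], 1: [0, 2], 2: [1]}
red = {0: 9, 1: 1, 2: 5}
total = {0: 10, 1: 10, 2: 12}

for i in path:
    current = Fraction(red[i], total[i])
    no_curing = expected_next_proportion(i, red, total, 1, 0, path)
    print(i, current, no_curing, no_curing >= current)
```

With $\Delta_{b}=0$ the expected proportion never decreases, since a black draw leaves the urn unchanged and a red draw can only raise the proportion. Choosing a large $\Delta_{b}$ instead (for example 50 at node 0 in this configuration) pushes the expected proportion below its current value, which is the regime relevant to curing.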

V Model Approximations

As previously noted, the dynamics of the network contagion process are complicated, especially when considered on general networks. For this reason, in this section we develop two useful approximations to this process on a general network that allow us to shed some light on its asymptotic behaviour. Throughout this section, unless stated otherwise, we consider general network topologies with $\Delta_{r,i}(t)=\Delta_{b,i}(t)=\Delta$ for all $t\in\mathbb{Z}_{\geq 1}$ and $i\in V$. However, to match the $1$-step and $(n,1)$-step distributions, we make the simplifying assumption that the neighbourhood of each node $i$ can be represented as a complete network, i.e., all of its neighbours are connected to one another, in order to apply Lemmas IV.1 and IV.3.

V-A Approximation: Computational Model

We now introduce our first approximation technique, where we approximate the contagion process of each node in the network with a classical Polya urn process.

Model I

(Computational Model): We approximate the dynamics of any node $i$ ’s contagion process using a classical Polya process $\mathsf{Polya}(\rho_{c}=\rho_{i},\delta_{c}=\hat{\delta}_{i})$, with

[TABLE]

with

[TABLE]

where $\Gamma(\cdot)$ is the Gamma function, $a^{n}=(a_{1},...,a_{n})\in\{0,1\}^{n}$ , and $\bar{a}^{n}=a_{1}+\cdots+a_{n}$ . $\bullet$

Here $\rho_{c}$ is chosen to be the proportion of red balls $\rho_{i}$ in the node’s super urn, so that the $1$ -dimensional distributions of the classical Polya process and the node process $\{Z_{i,n}\}$ coincide, while $\hat{\delta}_{i}$ is set by performing a minimization to find the value that best fits $Q_{\rho_{i},\hat{\delta}_{i}}^{(n)}$ to the distribution of $\{Z_{i,n}\}_{n=1}^{\infty}$ of node $i\in V$ . We use a divergence measure, denoted by $D(\cdot||\cdot)$ , to observe the quality of the fit.

The explicit derivation of the distribution $Q_{\rho_{i},\hat{\delta}_{i}}^{(n)}$ can be found in [17, 21]. This method ensures that the fit of $Q_{\rho_{i},\hat{\delta}_{i}}^{(n)}$ is as close as possible under the given divergence measure. Since we are measuring the error in using an approximating distribution, we use the Kullback-Leibler divergence [22]; we thus have that

[TABLE]

since $P_{i,n}^{(n)}(a^{n})\log{P_{i,n}^{(n)}(a^{n})}$ is independent of $\tilde{\delta}$. The approximating process is stationary and exchangeable, as it is a classical Polya process. We also know (from Section II) that it is non-ergodic, with its sample average converging almost surely to a random variable with the $\mathsf{Beta}(\frac{\rho_{i}}{\hat{\delta}_{i}},\frac{1-\rho_{i}}{\hat{\delta}_{i}})$ distribution. Calculating an analytic expression for the minimizing $\hat{\delta}_{i}$ is not feasible in general, and hence the minimization should be performed computationally. However, due to the above minimization, the value of $\hat{\delta}_{i}$ gives, by definition, the best fit of a Polya process to the process $\{Z_{i,n}\}_{n=1}^{\infty}$ for a given $n$ under the chosen divergence measure.
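The minimization behind Model I can be sketched as follows. This is an illustrative sketch under stated assumptions rather than the authors' procedure: the closed form used for $Q_{\rho,\delta}^{(n)}$ is the standard $n$-dimensional distribution of a classical Polya process written with Gamma functions, which is assumed to coincide with the expression above; the grid search is a stand-in for whatever optimizer is actually used; and the target distribution is synthetic (itself a Polya distribution) instead of an estimate of $P_{i,n}^{(n)}$ obtained from the network process. All function names are hypothetical.

```python
import math
from itertools import product

def polya_log_prob(seq, rho, delta):
    """Log-probability of a binary sequence under a classical Polya(rho, delta) process."""
    n, k = len(seq), sum(seq)
    a, b = rho / delta, (1.0 - rho) / delta   # note a + b = 1 / delta
    return (math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
            + math.lgamma(a + k) + math.lgamma(b + n - k)
            - math.lgamma(a + b + n))

def cross_entropy(p, rho, delta):
    """-sum_a P(a) log Q(a): the KL divergence up to a term that does not depend on delta."""
    return -sum(prob * polya_log_prob(seq, rho, delta)
                for seq, prob in p.items() if prob > 0)

def fit_delta(p, rho, grid):
    """Grid search (an assumed stand-in optimizer) for the best-fitting delta."""
    return min(grid, key=lambda d: cross_entropy(p, rho, d))

# Synthetic target: the exact n-dimensional distribution of Polya(0.3, 0.5).
n, rho, true_delta = 4, 0.3, 0.5
target = {seq: math.exp(polya_log_prob(seq, rho, true_delta))
          for seq in product((0, 1), repeat=n)}

grid = [0.05 * k for k in range(1, 41)]
best = fit_delta(target, rho, grid)
print(best)
```

Because the target here is itself a Polya distribution, the search recovers the generating value of $\delta$. With an empirical distribution of a node's draws in place of `target`, the same routine would return the grid point closest to $\hat{\delta}_{i}$ in the Kullback-Leibler sense. Note that the sum runs over all $2^{n}$ binary sequences, so this direct approach is only practical for small $n$.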

V-B Approximation: Analytical Models

An alternative to Model I is to attempt to find approximations whose parameters can be determined analytically.

Model II(a)

(Large-Network Analytical Model): For any given node $i$, we approximate the dynamics of its process $\{Z_{i,n}\}_{n=1}^{\infty}$ by using a classical Polya process $\mathsf{Polya}(\rho_{c}=\rho_{i},\delta_{c}=\delta^{\prime}_{i})$, with

[TABLE]

where $\delta_{i}=\frac{N\Delta}{\sum_{j\in\mathcal{N}_{i}^{{}^{\prime}}}T_{j}}$ . $\bullet$

Here the parameters of the classical Polya process are chosen by directly matching its first and $(n,1)$ -step second-order statistics with those of $\{Z_{i,n}\}_{n=1}^{\infty}$ . This method avoids the computational burden of the previous model by yielding an analytical expression for the correlation parameter of the classical Polya process.

We next prove that under some stationarity and symmetry assumptions, the contagion process running on each node in the network is statistically identical to the classical Polya process of Model II(a).

Lemma V.1

(Exact Representation): Suppose that

(i) $P(Z_{i,1}=1\ |\ Z_{j,1}^{n-1}=a^{n-1})=\rho_{i}$, and

(ii) $P(Z_{i,t}=1|Z_{j,1}^{n-1}=a^{n-1})=P(Z_{k,n}=1|Z_{j,1}^{n-1}=a^{n-1})$,

for all $n\geq 1,2\leq t<n$ , $i,j,k\in V$ , $a^{n-1}\in\{0,1\}^{n-1}$ . Then for any node $i$ in a complete network, $\{Z_{i,n}\}_{n=1}^{\infty}$ is given exactly by the $\mathsf{Polya}(\rho_{i},\delta^{\prime}_{i})$ process.

Proof:

For any node $i$ , we wish to show that for all $n$ , the $n$ -dimensional distributions of $\{Z_{i,n}\}_{n=1}^{\infty}$ and the $\mathsf{Polya}(\rho_{i},\delta^{\prime}_{i})$ process are identical. It is enough to show that the conditional probability of one event given the whole past is the same, since any joint probability can be written as a product of conditional probabilities. Let us define the events $A_{n-1}=\{Z_{i,1}^{n-1}=a^{n-1}\}$ and $B_{n-1}=\{Z_{j,1}^{n-1}=b_{j,1}^{n-1}\}_{j\neq i}$ . Then,

[TABLE]

Then using assumption (i), we have

[TABLE]

Now using assumption (ii), we get

[TABLE]

Thus, we have that

[TABLE]

which is the conditional probability $P(Z_{n}=1|Z_{1}^{n-1}=a^{n-1})$ for a $\mathsf{Polya}(\rho_{i},\delta^{\prime}_{i})$ process. A similar calculation can be performed for $P(Z_{i,n}=0\ |\ Z_{i,1}^{n-1}=a^{n-1})$ . ∎

Unfortunately, in a general network setting, assumptions (i) and (ii) above do not hold. However, this result suggests that the analytical approximation is reasonable whenever these assumptions hold within tolerable margins of error; empirical evidence indicates that this occurs for large values of $N$, since the quality of the fit improves as $N$ increases. Moreover, this approximation drastically reduces the complexity of analyzing the individual contagion draw processes, as closed-form expressions for the process parameters are available.

Model II(b)

(Small-Network Analytical Model): Given any node $i$ in a network with a small to moderate number of nodes, we approximate the dynamics of its contagion process $\{Z_{i,n}\}_{n=1}^{\infty}$ using a $\mathsf{Polya}(\rho_{i},\delta^{\star}_{i})$ process, where

[TABLE]

where $\delta_{i}=\frac{N\Delta}{\sum_{j\in\mathcal{N}_{i}^{{}^{\prime}}}T_{j}}$ . $\bullet$

The idea behind this model is to remove the dependence on the number of nodes $N$ from the parameter $\delta_{i}=\frac{N\Delta}{\bar{T}_{i}}$, and so we divide each instance of $\delta_{i}$ in $\delta^{\star}_{i}$ by $N$. The rationale is that as $n$ grows, it eventually becomes significantly larger than the relatively small number of nodes $N$, and so $n|\mathcal{N}_{i}|\approx n$ for all $i\in V$. Hence, we may consider that for a sufficiently large time, we have added $n\Delta$ balls to the super urn. Effectively, this means we are using a correlation parameter of $\frac{\Delta}{\bar{T}_{i}}$ instead of $\delta_{i}=\frac{N\Delta}{\bar{T}_{i}}$. Simulation results confirm that this approximation captures the limit distribution of the original process better than Model II(a) when the number of nodes is small; Figure 3 displays this relationship. A summary of all models presented in this section, and the scenarios under which they are most suitable, is provided in Table I.

We close this section with numerical demonstrations of the fit of all models. Figure 3 shows a representative comparison between the $\mathsf{Beta}(\frac{\rho_{i}}{\delta^{\prime}_{i}},\frac{1-\rho_{i}}{\delta^{\prime}_{i}})$ pdf and the simulated histogram of $\frac{1}{n}\sum_{t=1}^{n}Z_{i,t}$, where $n=1000$, for an arbitrary node $i$ in the given networks. Recall that the $\mathsf{Beta}(\frac{\rho_{i}}{\hat{\delta}_{i}},\frac{1-\rho_{i}}{\hat{\delta}_{i}})$, $\mathsf{Beta}(\frac{\rho_{i}}{\delta^{\prime}_{i}},\frac{1-\rho_{i}}{\delta^{\prime}_{i}})$ and $\mathsf{Beta}(\frac{\rho_{i}}{\delta^{\star}_{i}},\frac{1-\rho_{i}}{\delta^{\star}_{i}})$ pdfs are the distributions of the limit random variables to which the sample averages of the draw processes of Models I, II(a) and II(b) (respectively) converge almost surely as $n\rightarrow\infty$ (see Section II). We use complete networks, since they satisfy the assumption that all neighbourhoods are complete, as well as Barabasi-Albert networks, which have been shown to be a good model for real-world social networks [23] and do not satisfy this assumption; however, our results show that the approximations still fit quite well. As expected, Model I provides the best approximation in all scenarios, albeit without an analytic expression for its parameters that could provide insight into the behaviour of the underlying process. Model II(a) fits quite well when the number of nodes in the network is large, as seen in Figures 3(b) and 3(e), but fits poorly for a small number of nodes, which is evident in Figures 3(a) and 3(c). Model II(b) is the complement of Model II(a) in the sense that it fits very well for a small number of nodes but poorly for a large network. Hence, if analytic expressions for the parameters are desired, Models II(a) and II(b) can be used, depending on the number of nodes, to provide approximations that are marginally worse than the computational exactness of Model I.
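The Beta limit invoked above can be illustrated by simulating a classical Polya process directly. The sketch below is illustrative only: it simulates the approximating classical process, not the network process, and the parameter values, trial count, seed, and function names are arbitrary assumptions. It compares the empirical mean and variance of the sample average with the moments of $\mathsf{Beta}(\frac{\rho}{\delta},\frac{1-\rho}{\delta})$.

```python
import random

def polya_sample_average(rho, delta, n, rng):
    """Sample average of n draws from a classical Polya(rho, delta) process.

    After t draws containing `reds` red ones, the next draw is red with
    probability (rho + delta * reds) / (1 + delta * t).
    """
    reds = 0
    for t in range(n):
        p_red = (rho + delta * reds) / (1.0 + delta * t)
        if rng.random() < p_red:
            reds += 1
    return reds / n

def beta_moments(rho, delta):
    """Mean and variance of the Beta(rho / delta, (1 - rho) / delta) distribution."""
    a, b = rho / delta, (1.0 - rho) / delta
    mean = a / (a + b)
    var = a * b / ((a + b) ** 2 * (a + b + 1.0))
    return mean, var

rng = random.Random(0)                      # fixed seed for reproducibility
rho, delta, n, trials = 0.5, 0.5, 500, 2000
samples = [polya_sample_average(rho, delta, n, rng) for _ in range(trials)]

emp_mean = sum(samples) / trials
emp_var = sum((s - emp_mean) ** 2 for s in samples) / (trials - 1)
print(emp_mean, emp_var, beta_moments(rho, delta))
```

For $\rho=\delta=0.5$ the limit is $\mathsf{Beta}(1,1)$, i.e. the uniform distribution on $[0,1]$, with mean $1/2$ and variance $1/12$. The spread of the sample averages does not shrink as $n$ grows, which reflects the non-ergodicity of the process.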

V-C Comparison with SIS model

We now provide a number of empirical results in which we compare our model, with both finite and infinite memory, to the traditional discrete time SIS model [24]. In the SIS model, the parameter $\delta_{SIS}$ denotes the probability that a node will recover from infection, and $\beta_{SIS}$ is the probability that a node will become infected through contact with a single infected neighbour. The dynamics are described through the probability that any node $i$ will be infected at time $t$ , $P_{i}(t)$ , which evolves according to the equation

[TABLE]

Note in particular that this model exhibits Markovian behaviour, since the evolution of the process depends only on the probability of infection from the previous time step. We make the simplifying assumption that $\delta_{SIS}$ and $\beta_{SIS}$ remain the same for all nodes and throughout time, and hence we will compare it with the network Polya contagion process when $\Delta_{r}$ and $\Delta_{b}$ are similarly fixed in time and throughout the network.

The concept of an epidemic threshold for the SIS model gives a value through which one may determine a priori, using only the system parameters, whether the epidemic dies out [24]. The threshold condition is directly related to the largest-magnitude eigenvalue $\lambda_{max}$ of the adjacency matrix of the underlying graph of the network, and states that if $\delta_{SIS}>\beta_{SIS}\lambda_{max}$ then the epidemic will be eliminated after some time $n$, i.e., eventually $P_{i}(t)=0$ for all $i$ and all $t>n$. Furthermore, it has been shown that this threshold is tight: if $\delta_{SIS}<\beta_{SIS}\lambda_{max}$ then some non-zero convergence point exists, called an endemic state, and the epidemic will never be eliminated [25].
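The threshold condition can be evaluated directly from the adjacency matrix. The following sketch is illustrative and makes several assumptions: the graph is connected and undirected, so that the largest-magnitude eigenvalue of the (non-negative, symmetric) adjacency matrix is its largest eigenvalue; power iteration on $A+I$ is one convenient way to compute it and is not taken from the paper; and the function names and example graphs are hypothetical.

```python
import math

def largest_eigenvalue(adj, iters=1000, tol=1e-12):
    """Largest eigenvalue of the adjacency matrix of a connected undirected graph.

    Power iteration is applied to A + I rather than A, which avoids the
    oscillation that plain power iteration shows on bipartite graphs.
    """
    n = len(adj)
    v = [1.0 / math.sqrt(n)] * n
    lam = None
    for _ in range(iters):
        w = [v[i] + sum(adj[i][j] * v[j] for j in range(n)) for i in range(n)]
        rayleigh = sum(v[i] * w[i] for i in range(n))   # v^T (A + I) v with |v| = 1
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
        if lam is not None and abs(rayleigh - 1.0 - lam) < tol:
            return rayleigh - 1.0
        lam = rayleigh - 1.0
    return lam

def epidemic_dies_out(delta_sis, beta_sis, adj):
    """Check the SIS threshold condition delta_SIS > beta_SIS * lambda_max."""
    return delta_sis > beta_sis * largest_eigenvalue(adj)

# Hypothetical examples: the complete graph on 5 nodes and a star with 4 leaves.
complete5 = [[0 if i == j else 1 for j in range(5)] for i in range(5)]
star5 = [[0, 1, 1, 1, 1]] + [[1, 0, 0, 0, 0] for _ in range(4)]

print(largest_eigenvalue(complete5), largest_eigenvalue(star5))
print(epidemic_dies_out(0.5, 0.1, complete5), epidemic_dies_out(0.3, 0.1, complete5))
```

For the complete graph on 5 nodes $\lambda_{max}=4$, so with $\beta_{SIS}=0.1$ the condition holds for $\delta_{SIS}=0.5$ but not for $\delta_{SIS}=0.3$. For the star with 4 leaves $\lambda_{max}=2$.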

Figure 4 compares the behaviour of the SIS model and the network Polya contagion process for different selections of these parameters. The initial probabilities of infection $P_{i}(0)$ for the SIS model were set to coincide with the initial individual proportions of red balls for the nodes $\frac{R_{i}}{T_{i}}$ . Further, we relate in Figures 4(a)–4(c) the parameters $\beta_{SIS}$ and $\delta_{SIS}$ to $\Delta_{r}$ and $\Delta_{b}$ , respectively, using ratios of the largest-magnitude eigenvalue $\lambda_{max}$ of the adjacency matrix of the graph shown in Figure 4(d).

Figure 4(a) shows a comparison when the SIS model is displaying endemic behaviour. We see here that after a very short time, the SIS model settles and shortly thereafter the finite memory process settles (albeit to a different value), while for the infinite memory process the individual rates of infection and hence the average $\tilde{I}_{n}$ continue to increase in time. Since both the SIS model and the finite memory process have limited reinforcement, while the infinite memory process does not, these results are to be expected. Figure 4(b) displays a comparison where the epidemic threshold is met and the epidemic dies out for the SIS model. Here we note that $\tilde{I}_{n}$ for both the infinite and finite memory processes decreases and approaches zero, albeit not as quickly as the SIS model. Hence we observe that when the curing parameter $\Delta_{b}$ is much larger (in fact, more than five times larger) than the infection parameter $\Delta_{r}$, the epidemic is eliminated, as we expect, and this behaviour of the SIS model is captured by the network Polya contagion process. However, the finite memory process does not fully approach zero, since the initial conditions $R_{i}$ and $B_{i}$ have a much larger influence relative to the infinite memory process. Finally, Figure 4(c) shows the case where the epidemic does not vanish and the parameters in both models are set to be equal ($\delta_{SIS}=\beta_{SIS}$ and $\Delta_{b}=\Delta_{r}$). We observe a similar trend between all models, with the finite and infinite memory processes exhibiting near-identical behaviour.

Through these observations, we may conclude that both versions of the network Polya contagion process may apply to the modelling of epidemics, albeit in different applications. The finite memory process exhibits behaviour that is more closely related to the SIS model since they are both limited reinforcement processes, and hence it may be best suited to traditional biological diseases. The infinite memory process obeys similar trends, but in the endemic state there are some interesting differences since the effects of the infection continue to spread throughout the population. On the other hand, the SIS model quickly settles and does not change in time. Thus with infinite memory our process is better suited to modelling opinion dynamics, the spread of ideas, and advertising schemes.

VI Conclusion

We introduced a network epidemics model based on the classical Polya urn scheme, and we investigated its stochastic properties and asymptotic behaviour in detail. We showed that under certain conditions the proportion of red balls in individual urns and the network susceptibility, which are processes used to measure infection, admit limits. Three classical Polya processes were proposed, one computational and two analytical, to statistically approximate the contagion process of each node. Empirical results were presented which show that the approximations are a good fit for a range of system parameters. Our process was also compared empirically with the discrete-time SIS model, showing a similar behaviour, particularly in the finite memory mode, while providing different degrees of reinforcement in the endemic state, with the largest reinforcement occurring under the infinite memory mode. Future directions of research include investigations into the curing of these processes, and the further study of the network contagion process with finite memory.

Bibliography


[1] M. Hayhoe, F. Alajaji, and B. Gharesifard, “A Polya urn-based model for epidemics on networks,” Proc. 2017 American Cont. Conf., 2017.
[2] L. Kim, M. Abramson, K. Drakopoulos, S. Kolitz, and A. Ozdaglar, “Estimating social network structure and propagation dynamics for an infectious disease,” in Proc. Int. Conf. Social Computing, Behavioral-Cultural Modeling, and Prediction, pp. 85–93, Springer, 2014.
[3] M. Garetto, W. Gong, and D. Towsley, “Modeling malware spreading dynamics,” in Proc. IEEE Int. Conf. Comp. Commun., vol. 3, pp. 1869–1879, 2003.
[4] E. M. Rogers, Diffusion of Innovations. 5th ed., 2003.
[5] E. Adar and L. A. Adamic, “Tracking information epidemics in blogspace,” in Proc. IEEE/WIC/ACM Int. Conf. Web Intelligence, pp. 207–214, 2005.
[6] D. Easley and J. Kleinberg, Networks, Crowds and Markets: Reasoning about a Highly Connected World. Cambridge Univ. Press, 2010.
[7] P. V. Mieghem, J. Omic, and R. Kooij, “Virus spread in networks,” IEEE/ACM Trans. Netw., vol. 17, no. 1, pp. 1–14, 2009.
[8] F. Eggenberger and G. Polya, “Über die statistik verketteter vorgänge,” Z. Angew. Math. Mech., vol. 3, no. 4, pp. 279–289, 1923.