Information transmission and criticality in the contact process

Marzio Cassandro; Antonio Galves; Eva L\"ocherbach

arXiv:1705.11150·math.PR·November 1, 2017

Information transmission and criticality in the contact process

Marzio Cassandro, Antonio Galves, Eva L\"ocherbach

PDF

TL;DR

This paper investigates how information transmission in the one-dimensional contact process varies with the infection parameter, revealing that maximum transmission occurs not at criticality but at other values, challenging common beliefs.

Contribution

It demonstrates that information transmission, measured by sensitivity, continues to increase beyond the critical point, providing a counterexample to the idea that maximal information occurs at criticality.

Findings

01

Sensitivity increases for λ < λ_c

02

Sensitivity continues increasing after λ_c

03

Maximum information transmission occurs away from criticality

Abstract

In the present paper, we study the relation between criticality and information transmission in the one-dimensional contact process with infection parameter $λ .$ To do this we define the {\it sensitivity} of the process to its initial condition. This sensitivity increases for values of $λ < λ_{c},$ the value of the critical parameter. The main point of the present paper is that we show that actually it continues increasing even after $λ_{c}$ and only starts decreasing for sufficiently large values of $λ .$ This provides a counterexample to the common belief that associates maximal information transmission to criticality.

Equations97

L f (ξ) = i \in Z \sum c (i, ξ) [f (ξ^{i}) - f (ξ)],

L f (ξ) = i \in Z \sum c (i, ξ) [f (ξ^{i}) - f (ξ)],

ξ^{i} (j)

ξ^{i} (j)

ξ^{i} (i)

c(i,\xi)=\left\{\begin{array}[]{ll}1&\mbox{ if }\xi(i)=1\\ \lambda\sum_{j=i\pm 1}\xi(j)&\mbox{ if }\xi(i)=0\end{array}\right\}.

c(i,\xi)=\left\{\begin{array}[]{ll}1&\mbox{ if }\xi(i)=1\\ \lambda\sum_{j=i\pm 1}\xi(j)&\mbox{ if }\xi(i)=0\end{array}\right\}.

S_{p, q, Λ} (λ, t) := P \in Π in f E (∣ η_{t}^{λ, p, Λ} (0)) - η_{t}^{λ, q, Λ} (0)) ∣),

S_{p, q, Λ} (λ, t) := P \in Π in f E (∣ η_{t}^{λ, p, Λ} (0)) - η_{t}^{λ, q, Λ} (0)) ∣),

Δ_{p, q} (λ_{1}, λ_{2}, r, t) = S_{p, q, Λ_{r}} (λ_{2}, t) - S_{p, q, Λ_{r}} (λ_{1}, t) .

Δ_{p, q} (λ_{1}, λ_{2}, r, t) = S_{p, q, Λ_{r}} (λ_{2}, t) - S_{p, q, Λ_{r}} (λ_{1}, t) .

Δ_{p, q} (λ_{1}, λ_{2}, r, t) > 0.

Δ_{p, q} (λ_{1}, λ_{2}, r, t) > 0.

Δ_{p, q} (λ_{1}, λ_{2}, r, t) < 0.

Δ_{p, q} (λ_{1}, λ_{2}, r, t) < 0.

P [η_{t}^{λ, ξ} \cap A \neq = \emptyset] = P [η_{t}^{λ, A} \cap ξ \neq = \emptyset] .

P [η_{t}^{λ, ξ} \cap A \neq = \emptyset] = P [η_{t}^{λ, A} \cap ξ \neq = \emptyset] .

P \in Π in f E (∣ η_{t}^{λ, ξ} (0) - η_{t}^{λ, ξ^{'}} (0) ∣) = E [(1 - p)^{∣ η_{t}^{λ, 0} \cap Λ∣} - (1 - q)^{∣ η_{t}^{λ, 0} \cap Λ∣}],

P \in Π in f E (∣ η_{t}^{λ, ξ} (0) - η_{t}^{λ, ξ^{'}} (0) ∣) = E [(1 - p)^{∣ η_{t}^{λ, 0} \cap Λ∣} - (1 - q)^{∣ η_{t}^{λ, 0} \cap Λ∣}],

∣ η_{t}^{λ, ξ} (0) - η_{t}^{λ, ξ^{'}} (0) ∣ = 1_{{η_{t}^{λ, ξ} (0) = 0, η_{t}^{λ, ξ^{'}} (0) = 1}} .

∣ η_{t}^{λ, ξ} (0) - η_{t}^{λ, ξ^{'}} (0) ∣ = 1_{{η_{t}^{λ, ξ} (0) = 0, η_{t}^{λ, ξ^{'}} (0) = 1}} .

P (η_{t}^{λ, ξ} (0) = 0, η_{t}^{λ, ξ^{'}} (0) = 1) = P (ξ \cap η_{t}^{λ, 0} = \emptyset, ξ^{'} \cap η_{t}^{λ, 0} \neq = \emptyset) = E [(1 - p)^{∣ η_{t}^{λ, 0} \cap Λ∣} - (1 - q)^{∣ η_{t}^{λ, 0} \cap Λ∣}] .

P (η_{t}^{λ, ξ} (0) = 0, η_{t}^{λ, ξ^{'}} (0) = 1) = P (ξ \cap η_{t}^{λ, 0} = \emptyset, ξ^{'} \cap η_{t}^{λ, 0} \neq = \emptyset) = E [(1 - p)^{∣ η_{t}^{λ, 0} \cap Λ∣} - (1 - q)^{∣ η_{t}^{λ, 0} \cap Λ∣}] .

P \in Π in f E (∣ η_{t}^{λ, ξ} (0) - η_{t}^{λ, ξ^{'}} (0) ∣) \leq E [(1 - p)^{∣ η_{t}^{λ, 0} \cap Λ∣} - (1 - q)^{∣ η_{t}^{λ, 0} \cap Λ∣}] .

P \in Π in f E (∣ η_{t}^{λ, ξ} (0) - η_{t}^{λ, ξ^{'}} (0) ∣) \leq E [(1 - p)^{∣ η_{t}^{λ, 0} \cap Λ∣} - (1 - q)^{∣ η_{t}^{λ, 0} \cap Λ∣}] .

E ∣ η_{t} (0) - η_{t}^{'} (0) ∣ = P (η_{t} (0) \neq = η_{t}^{'} (0)) = P (η_{t} (0) = 0, η_{t}^{'} (0) = 1) + P (η_{t} (0) = 1, η_{t}^{'} (0) = 0) .

E ∣ η_{t} (0) - η_{t}^{'} (0) ∣ = P (η_{t} (0) \neq = η_{t}^{'} (0)) = P (η_{t} (0) = 0, η_{t}^{'} (0) = 1) + P (η_{t} (0) = 1, η_{t}^{'} (0) = 0) .

P (η_{t}^{'} (0) = a, η_{t} (0) = a) = P (η_{t}^{'} (0) = a) \land P (η_{t} (0) = a)

P (η_{t}^{'} (0) = a, η_{t} (0) = a) = P (η_{t}^{'} (0) = a) \land P (η_{t} (0) = a)

P (η_{t} (0) = 0, η_{t}^{'} (0) = 1) = P (η_{t}^{'} (0) = 1) - P (η_{t} (0) = 1, η_{t}^{'} (0) = 1)

P (η_{t} (0) = 0, η_{t}^{'} (0) = 1) = P (η_{t}^{'} (0) = 1) - P (η_{t} (0) = 1, η_{t}^{'} (0) = 1)

P (η_{t} (0) = 1, η_{t}^{'} (0) = 0) = P (η_{t} (0) = 1) - P (η_{t} (0) = 1, η_{t}^{'} (0) = 1) .

P (η_{t} (0) = 1, η_{t}^{'} (0) = 0) = P (η_{t} (0) = 1) - P (η_{t} (0) = 1, η_{t}^{'} (0) = 1) .

P (η_{t} (0) \neq = η_{t}^{'} (0)) \geq ∣ P (η_{t} (0) = 1) - P (η_{t}^{'} (0) = 1) ∣,

P (η_{t} (0) \neq = η_{t}^{'} (0)) \geq ∣ P (η_{t} (0) = 1) - P (η_{t}^{'} (0) = 1) ∣,

∣ P (η_{t} (0) = 1) - P (η_{t}^{'} (0) = 1) ∣ = ∣ E [1 - (1 - p)^{∣ η_{t}^{λ, 0} \cap Λ∣}] - E [1 - (1 - q)^{∣ η_{t}^{λ, 0} \cap Λ∣}] ∣,

∣ P (η_{t} (0) = 1) - P (η_{t}^{'} (0) = 1) ∣ = ∣ E [1 - (1 - p)^{∣ η_{t}^{λ, 0} \cap Λ∣}] - E [1 - (1 - q)^{∣ η_{t}^{λ, 0} \cap Λ∣}] ∣,

ϱ (λ) = P (η_{t}^{λ, 0} \mbox s u r v i v es f or e v er) .

ϱ (λ) = P (η_{t}^{λ, 0} \mbox s u r v i v es f or e v er) .

t \to \infty lim E (f (η_{t}^{λ, 0})) = (1 - ϱ (λ)) f (0) + ϱ (λ) \int f d ν_{λ},

t \to \infty lim E (f (η_{t}^{λ, 0})) = (1 - ϱ (λ)) f (0) + ϱ (λ) \int f d ν_{λ},

∣ ν_{λ} (f g) - (ν_{λ} f) (ν_{λ} g) ∣ \leq C e^{- ε d i s t (R_{1}, R_{2})} .

∣ ν_{λ} (f g) - (ν_{λ} f) (ν_{λ} g) ∣ \leq C e^{- ε d i s t (R_{1}, R_{2})} .

Δ_{p, q} (λ_{1}, λ_{2}, r, t) = E_{λ_{1}, λ_{2}} (f (∣ η_{t}^{λ_{2}, 0} \cap Λ_{r} ∣) - f (∣ η_{t}^{λ_{1}, 0} \cap Λ_{r} ∣)),

Δ_{p, q} (λ_{1}, λ_{2}, r, t) = E_{λ_{1}, λ_{2}} (f (∣ η_{t}^{λ_{2}, 0} \cap Λ_{r} ∣) - f (∣ η_{t}^{λ_{1}, 0} \cap Λ_{r} ∣)),

Δ_{p, q} (λ_{1}, λ_{2}, r, t) = [f (2) - f (1)] P_{λ_{1}, λ_{2}} [∣ η_{t}^{λ_{2}, 0} \cap Λ_{r} ∣ = 2, ∣ η_{t}^{λ_{1}, 0} \cap Λ_{r} ∣ = 1] + f (2) P_{λ_{1}, λ_{2}} [∣ η_{t}^{λ_{2}, 0} \cap Λ_{r} ∣ = 2, ∣ η_{t}^{λ_{1}, 0} \cap Λ_{r} ∣ = 0] + f (1) P_{λ_{1}, λ_{2}} [∣ η_{t}^{λ_{2}, 0} \cap Λ_{r} ∣ = 1, ∣ η_{t}^{λ_{1}, 0} \cap Λ_{r} ∣ = 0] .

Δ_{p, q} (λ_{1}, λ_{2}, r, t) = [f (2) - f (1)] P_{λ_{1}, λ_{2}} [∣ η_{t}^{λ_{2}, 0} \cap Λ_{r} ∣ = 2, ∣ η_{t}^{λ_{1}, 0} \cap Λ_{r} ∣ = 1] + f (2) P_{λ_{1}, λ_{2}} [∣ η_{t}^{λ_{2}, 0} \cap Λ_{r} ∣ = 2, ∣ η_{t}^{λ_{1}, 0} \cap Λ_{r} ∣ = 0] + f (1) P_{λ_{1}, λ_{2}} [∣ η_{t}^{λ_{2}, 0} \cap Λ_{r} ∣ = 1, ∣ η_{t}^{λ_{1}, 0} \cap Λ_{r} ∣ = 0] .

Δ_{p, q} (λ_{1}, λ_{2}, r, t) \geq [f (2) - f (1)] P_{λ_{1}, λ_{2}} [∣ η_{t}^{λ_{2}, 0} \cap Λ_{r} ∣ = 2, ∣ η_{t}^{λ_{1}, 0} \cap Λ_{r} ∣ = 1] + f (2) P_{λ_{1}, λ_{2}} [∣ η_{t}^{λ_{2}, 0} \cap Λ_{r} ∣ \neq = 0, ∣ η_{t}^{λ_{1}, 0} \cap Λ_{r} ∣ = 0] .

Δ_{p, q} (λ_{1}, λ_{2}, r, t) \geq [f (2) - f (1)] P_{λ_{1}, λ_{2}} [∣ η_{t}^{λ_{2}, 0} \cap Λ_{r} ∣ = 2, ∣ η_{t}^{λ_{1}, 0} \cap Λ_{r} ∣ = 1] + f (2) P_{λ_{1}, λ_{2}} [∣ η_{t}^{λ_{2}, 0} \cap Λ_{r} ∣ \neq = 0, ∣ η_{t}^{λ_{1}, 0} \cap Λ_{r} ∣ = 0] .

P_{λ_{1}, λ_{2}} [∣ η_{t}^{λ_{2}, 0} \cap Λ_{r} ∣ = 2, ∣ η_{t}^{λ_{1}, 0} \cap Λ_{r} ∣ = 1] = 2 P_{λ_{1}, λ_{2}} [η_{t}^{λ_{2}, 0} (\pm r) = 1, η_{t}^{λ_{1}, 0} (- r) = 0, η_{t}^{λ_{1}, 0} (r) = 1],

P_{λ_{1}, λ_{2}} [∣ η_{t}^{λ_{2}, 0} \cap Λ_{r} ∣ = 2, ∣ η_{t}^{λ_{1}, 0} \cap Λ_{r} ∣ = 1] = 2 P_{λ_{1}, λ_{2}} [η_{t}^{λ_{2}, 0} (\pm r) = 1, η_{t}^{λ_{1}, 0} (- r) = 0, η_{t}^{λ_{1}, 0} (r) = 1],

P_{λ_{1}, λ_{2}} [∣ η_{t}^{λ_{2}, 0} \cap Λ_{r} ∣ \neq = 0, ∣ η_{t}^{λ_{1}, 0} \cap Λ_{r} ∣ = 0] \leq 2 P_{λ_{1}, λ_{2}} [η_{t}^{λ_{2}, 0} (- r) = 1, η_{t}^{λ_{1}, 0} (\pm r) = 0] .

P_{λ_{1}, λ_{2}} [∣ η_{t}^{λ_{2}, 0} \cap Λ_{r} ∣ \neq = 0, ∣ η_{t}^{λ_{1}, 0} \cap Λ_{r} ∣ = 0] \leq 2 P_{λ_{1}, λ_{2}} [η_{t}^{λ_{2}, 0} (- r) = 1, η_{t}^{λ_{1}, 0} (\pm r) = 0] .

\Delta_{p,q}(\lambda_{1},\lambda_{2},r,t)\geq 2{\mathbb{P}}_{\lambda_{1},\lambda_{2}}(A_{t})\Big{(}[f(2)-f(1)]{\mathbb{P}}_{\lambda_{1},\lambda_{2}}(\eta^{\lambda_{1},0}_{t}(r)=1|A_{t})\\ +f(2){\mathbb{P}}_{\lambda_{1},\lambda_{2}}(\eta^{\lambda_{1},0}_{t}(r)=0|A_{t})\Big{)}.

\Delta_{p,q}(\lambda_{1},\lambda_{2},r,t)\geq 2{\mathbb{P}}_{\lambda_{1},\lambda_{2}}(A_{t})\Big{(}[f(2)-f(1)]{\mathbb{P}}_{\lambda_{1},\lambda_{2}}(\eta^{\lambda_{1},0}_{t}(r)=1|A_{t})\\ +f(2){\mathbb{P}}_{\lambda_{1},\lambda_{2}}(\eta^{\lambda_{1},0}_{t}(r)=0|A_{t})\Big{)}.

P_{λ_{1}, λ_{2}} (η_{t}^{λ_{1}, 0} (r) = 1∣ A_{t}) = P (η_{t}^{λ_{1}, 0} (r) = 1∣ η_{t}^{λ_{1}, 0} (- r) = 0) .

P_{λ_{1}, λ_{2}} (η_{t}^{λ_{1}, 0} (r) = 1∣ A_{t}) = P (η_{t}^{λ_{1}, 0} (r) = 1∣ η_{t}^{λ_{1}, 0} (- r) = 0) .

P_{λ_{1}, λ_{2}} (η_{t}^{λ_{1}, 0} (r) = 0∣ A_{t}) = P (η_{t}^{λ_{1}, 0} (r) = 0∣ η_{t}^{λ_{1}, 0} (- r) = 0),

P_{λ_{1}, λ_{2}} (η_{t}^{λ_{1}, 0} (r) = 0∣ A_{t}) = P (η_{t}^{λ_{1}, 0} (r) = 0∣ η_{t}^{λ_{1}, 0} (- r) = 0),

\Delta_{p,q}(\lambda_{1},\lambda_{2},r,t)\geq 2{\mathbb{P}}_{\lambda_{1},\lambda_{2}}(A_{t})\Big{(}[f(2)-f(1)]{\mathbb{P}}(\eta^{\lambda_{1},0}_{t}(r)=1|\eta^{\lambda_{1},0}_{t}(-r)=0)\\ +f(2){\mathbb{P}}(\eta^{\lambda_{1},0}_{t}(r)=0|\eta^{\lambda_{1},0}_{t}(-r)=0)\Big{)}.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Information transmission and criticality in the contact process.

M. Cassandro, A. Galves, E. Löcherbach

M. Cassandro : Gran Sasso Science Institute, L’Aquila, Italy.

[email protected]

A. Galves: Universidade de São Paulo, Instituto de Matemática e Estatística, São Paulo, Brazil

[email protected]

E. Löcherbach: Université de Cergy-Pontoise, AGM, CNRS-UMR 8088, 95000 Cergy-Pontoise, France.

[email protected]

(Date: October 30, 2017)

Abstract.

In the present paper, we study the relation between criticality and information transmission in the one-dimensional contact process with infection parameter $\lambda.$ We introduce a notion of sensitivity of the process to its initial condition and prove that it increases not only for values of $\lambda<\lambda_{c},$ the value of the critical parameter, but keeps increasing even after $\lambda_{c},$ before finally starting to decrease for values of $\lambda$ sufficiently above $\lambda_{c}.$ This provides a counterexample to the common belief that associates maximal information transmission to criticality.

Key words and phrases:

Contact process. Criticality. Information transmission. Duality and coupling.

2010 Mathematics Subject Classification:

60K35; 82B27

1. Introduction

From swarms of birds to neuronal activity, an increasing number of recent papers claims that the ability of a complex system to transmit information is maximized at the critical point. Just to cite a few examples, [15] advertise that in swarms of cooperative units such as bird flocks “the information transfer is made possible by the nonlocal nature of the criticality condition.” See also [6] who claim that “flocks behave as critical systems, poised to respond maximally to environmental perturbations.” Similar ideas have emerged in neurobiology.

It is reasonable to conjecture that these ideas originate under the influential work of Per Bak, see for instance [1] where he devotes a whole chapter to the question “Why Should the Brain Be Critical? ” [3] suggest the following answer. “The fact that the critical state […] maximized information transmission in these networks is consistent with an intuitive understanding of how a branching process would work in the context of a highly parallel network. If the network were subcritical, an input signal would attenuate, causing most output units to be inactive, thus leaving little evidence of the input. If the network were supercritical, any input signal would eventually lead to most output units being active, again leaving little information as to what the input was.” This interpretation echoes Per Bak’s own answer given in [1]: “The brain must operate at the critical case where the information is just able to propagate”.

A preliminary step in this direction is to clarify what we mean by information transmission in a complex system and how to measure it. From a neuroscientific point of view, [12] suggested the following answer to this question. They propose to use the notion of dynamical range borrowed from acoustics to measure the sensitivity of the system to external stimuli. More precisely, they consider a stochastic system of interacting neurons exposed to an external stimulus modeled by a Poisson point process. In their model, the graph of interactions is given by an undirected Erdös-Rényi random graph. For this model, the authors are able to precisely define the notion of criticality, using as relevant parameter the average branching ratio. In fact they show, by numerical simulations, that a critical parameter value exists such that the dynamical range increases monotonically below this parameter and decreases monotonically above.

In the present paper, we formulate the problem of information transmission in the following way. We study to which extent a system is able to discriminate between two different initial stimuli. We do this for the one-dimensional contact process which is probably the simplest non-trivial complex system one might think of. More precisely, we compare two coupled time evolutions starting from Bernoulli product measures on a finite set of points having different densities. We then define the sensitivity of the model with respect to the initial signal as the total variation distance between the two associated processes at a given single site, at some fixed time. We prove that the sensitivity of the system to the initial condition, as a function of $\lambda,$ keeps increasing even after the critical point, before finally starting to decrease. This contradicts the common belief that information transmission is maximized at the critical point.

In Section 4, we discuss our results in comparison with those, already present in the literature, where similar features are pointed out by numerical simulations.

This paper is organized as follows. In Section 2, we recall the definition of the one-dimensional contact process, we give the definition of our measure of sensitivity in (2.3) and state Theorem 1 which is our main result. The proofs are collected in Section 3.

2. Definitions and main results

In the following, we briefly recall the definition of the one-dimensional contact process introduced in [9]. Let $X=\{0,1\}^{\mathbb{Z}}$ and write $\xi=(\xi(i))_{i\in{\mathbb{Z}}}$ for the elements of $X.$ The contact process on ${\mathbb{Z}}$ is the continuous time Markov process $(\eta^{\lambda}_{t}(i),i\in{\mathbb{Z}},t\geq 0),$ taking values in $X$ having generator

[TABLE]

for any cylinder function $f.$ In the above formula, $\xi^{i}$ the configuration defined by

[TABLE]

and

[TABLE]

Here, $\lambda>0$ is a fixed constant. We shall write $(\eta^{\lambda,\xi}_{t},t\geq 0)$ for a version of the above process starting from $\eta^{\lambda,\xi}_{0}=\xi,$ for any fixed initial configuration $\xi\in X.$

Recall that [9] proved that there exists a critical value $\lambda_{c}$ with $0<\lambda_{c}<\infty$ such that for all $\lambda<\lambda_{c},$ there is only one invariant measure, the Dirac-measure supported by the $0-$ configuration, while for $\lambda>\lambda_{c},$ a second and non-trivial extremal invariant measure appears. This result was completed by [5] who prove that at the critical point only one invariant measure, the trivial one, exists. In particular, in the subcritical case, starting from any initial configuration, the process converges to the zero-configuration, while in the supercritical case it converges to a convex combination of the two extremal invariant measures. In the supercritical case, the influence of the initial configuration appears in the weighting factor defining this mixture.

Here, to measure the sensitivity to the initial condition we must adopt a non-asymptotic point of view and measure, at a given time $t,$ how well the system discriminates between two different initial states. The precise definition is as follows. We suppose that the system starts with a initial configuration with a given density $p$ within a finite subset $\Lambda$ of ${\mathbb{Z}}.$ This process will be denoted $(\eta_{t}^{\lambda,p,\Lambda},t\geq 0),$ where $\lambda$ is the intensity of the infection rate appearing in (2.1). We suppose that $\eta^{\lambda,p,\Lambda}_{0}(i)=\eta(i)$ for all $i\in\Lambda,$ where $\eta(i),i\in\Lambda,$ are i.i.d. Bernoulli random variables with parameter $p.$ We also suppose that $\eta^{\lambda,p,\Lambda}_{0}(i)=0$ for all $i\in\Lambda^{c}.$

We consider two intensities $p<q$ and the associated processes $(\eta_{t}^{\lambda,p,\Lambda},t\geq 0)$ and $(\eta_{t}^{\lambda,q,\Lambda},t\geq 0).$ We then define the sensitivity of the process with parameter $\lambda$ with respect to the initial condition by

[TABLE]

where $t>0$ and where the infimum is taken over all possible couplings ${\mathbb{P}}$ of $\eta_{t}^{\lambda,p,\Lambda}(0)$ and $\eta_{t}^{\lambda,q,\Lambda}(0).$ The quantity ${\mathcal{S}}_{p,q,\Lambda}(\lambda,t)$ measures the minimal distance of the two processes in a given position (here, the position [math]) at time $t,$ under two initial Bernoulli configurations of density $p$ and $q.$

In the following we choose $\Lambda:=\Lambda_{r}=\{-r,r\},$ for a fixed position $r\in{\mathbb{Z}},$ and we pose for any $0<\lambda_{1}<\lambda_{2}<\infty,$

[TABLE]

The quantity $\Delta_{p,q}(\lambda_{1},\lambda_{2},r,t)$ measures the sensitivity variation with respect to increasing values of the intensity $\lambda,$ at time $t.$ We show that this sensitivity variation is non-decreasing for all $\lambda<\lambda_{c}$ and continues increasing even after $\lambda_{c}$ before finally being decreasing. This is the content of our main theorem that we present now.

Theorem 1.

For any fixed $q>p>\frac{2}{3}$ there exist $\lambda_{c}<\lambda_{1}(p)\leq\lambda_{2}(p)$ such that the following holds.

For $\lambda_{1}<\lambda_{2}<\lambda_{1}(p),$ there exist $r^{*}=r^{*}(\lambda_{1},p,q)$ and $t^{*}=t^{*}(\lambda_{1},p,q,r^{*}),$ such that for all $t\geq t^{*}$ and $r\geq r^{*},$

[TABLE]

For $\lambda_{2}>\lambda_{1}>\lambda_{2}(p),$ there exist $r^{*}=r^{*}(\lambda_{1},p,q)$ and $t^{*}=t^{*}(\lambda_{1},p,q,r^{*}),$ such that for all $t\geq t^{*}$ and $r\geq r^{*},$

[TABLE]

3. Proof of Theorem 1

The proof of Theorem 1 uses the self-duality of the contact process. For the sake of completeness we recall here this property. We start by introducing some notation. First of all, in the following we will not distinguish between the configuration $\eta^{\lambda}_{t}$ of the contact process at time $t$ and the associated subset of ${\mathbb{Z}}$ given by $S(\eta^{\lambda}_{t})=\{i\in{\mathbb{Z}}:\eta^{\lambda}_{t}(i)=1\};$ that is, depending on the context, we will interpret $\eta^{\lambda}_{t}$ as element of $X$ or as element of ${\mathcal{P}}({\mathbb{Z}}),$ the set of all subsets of ${\mathbb{Z}}.$ Moreover, for any subset $A\subset{\mathbb{Z}},$ we write $(\eta_{t}^{\lambda,A},t\geq 0)$ for the contact process starting from the initial configuration $\eta^{\lambda,A}_{0}(i)=1$ if and only if $i\in A.$ We observe that if $A$ is finite, then $(\eta^{\lambda,A}_{t},t\geq 0)$ is just a pure jump Markov process, taking values in the set of finite subsets of ${\mathbb{Z}}.$ If $A=\{i\},$ then we write simply $\eta_{t}^{i}$ for $\eta_{t}^{\{i\}}.$

The duality property of the contact process can be stated as follows. For any finite subset $A\in{\mathcal{P}}({\mathbb{Z}})$ and any initial configuration $\xi\in X,$

[TABLE]

For more on duality see [4] and [10].

Our proof of Theorem 1 relies on the following result.

Proposition 1.

Let $\Lambda\in{\mathcal{P}}({\mathbb{Z}})$ be a finite subset of ${\mathbb{Z}}$ and assume that $\xi(i),\xi^{\prime}(i),i\in\Lambda,$ are i.i.d. Bernoulli random variables with parameter $p,$ $q,$ respectively, for $0<p<q<1,$ and that $\xi(i)=\xi^{\prime}(i)=0$ for all $i\in\Lambda^{c}.$ Then for all $i\in{\mathbb{Z}},$

[TABLE]

where $\Pi$ denotes all possible couplings ${\mathbb{P}}$ of $\eta_{t}^{\lambda,\xi}(0)$ and $\eta_{t}^{\lambda,\xi^{\prime}}(0).$

Remark 1.

Notice that in our definition of sensitivity in (2.2) above, to make explicit the relationship with $p$ and $\Lambda,$ we wrote $\eta_{t}^{\lambda,p,\Lambda}$ for $\eta_{t}^{\lambda,\xi}$ and $\eta_{t}^{\lambda,q,\Lambda}$ for $\eta_{t}^{\lambda,\xi^{\prime}}.$

Proof.

We take the maximal coupling of $\xi$ and $\xi^{\prime},$ that is, $\xi(i)\leq\xi^{\prime}(i)$ for all $i\in\Lambda.$ Moreover, we use the canonical monotone coupling of $\eta^{\lambda,\xi}$ and $\eta^{\lambda,\xi^{\prime}};$ that is, $\eta_{t}^{\lambda,\xi}\leq\eta_{t}^{\lambda,\xi^{\prime}}$ (in the sense of $\eta_{t}^{\lambda,\xi}(i)\leq\eta_{t}^{\lambda,\xi^{\prime}}(i)$ for all $i$ ) for all $t\geq 0.$ Then

[TABLE]

We obtain by (3.4) and since $\xi\leq\xi^{\prime},$

[TABLE]

Therefore,

[TABLE]

We now give a lower bound, following Lemma 6.1 of [8]. In the following, write for short $\eta_{t}=\eta_{t}^{\lambda,\xi}$ and $\eta_{t}^{\prime}=\eta_{t}^{\lambda,\xi^{\prime}}.$ We have

[TABLE]

This expression is minimized by the optimal coupling of $\eta_{t}(0)$ and $\eta^{\prime}_{t}(0)$ given by

[TABLE]

for $a=0,1,$

[TABLE]

and

[TABLE]

In this way, for any possible coupling,

[TABLE]

which, due to (3.4), equals

[TABLE]

implying the assertion. ∎

A second important ingredient for the proof of Theorem 1 is the following monotone coupling construction of $(\eta_{t}^{\lambda_{1},A},t\geq 0)$ and $(\eta_{t}^{\lambda_{2},A},t\geq 0)$ for $\lambda_{1}<\lambda_{2},$ where $A$ is some finite subset of ${\mathbb{Z}}.$ We associate to each site $i\in{\mathbb{Z}}$ five independent Poisson processes having jump times $(T_{n}^{i,{\dagger}})_{n}$ with rate $1,$ $(T_{n}^{i\to i+1})_{n}$ with rate $\lambda_{1},$ $(T_{n}^{i\to i-1})_{n}$ with rate $\lambda_{1},$ $(S_{n}^{i\to i+1})_{n}$ with rate $\lambda_{2}-\lambda_{1}$ and finally $(S_{n}^{i\to i-1})_{n}$ with rate $\lambda_{2}-\lambda_{1}.$ We assume that the processes attached to different sites are all independent. We then construct $\eta^{\lambda_{1},A}$ and $\eta^{\lambda_{2},0}$ in the following way. Firstly, both processes start from the same initial configuration $A$ at time $0.$ Then we update the configurations according to the following rules.

•

Every time that $T_{n}^{i,{\dagger}}$ rings, both processes simultaneously upgrade the value at site $i$ to $0.$

•

Every time that $T_{n}^{i\to i+1}$ rings, both processes simultaneously try to upgrade the position at site $i+1$ to $1,$ provided that at site $i$ or at site $i+1$ there is a symbol $1.$

•

Every time that $T_{n}^{i\to i-1}$ rings, both processes simultaneously try to upgrade the position at site $i-1$ to $1,$ provided that at site $i$ or at site $i-1$ there is a symbol $1.$

•

Every time that $S_{n}^{i\to i+1}$ rings, only the process $\eta^{\lambda_{2},A}$ tries to upgrade the position at site $i+1$ to $1,$ provided that at site $i$ or at site $i+1$ there is a symbol $1.$

•

Every time that $S_{n}^{i\to i-1}$ rings, only the process $\eta^{\lambda_{2},A}$ tries to upgrade the position at site $i-1$ to $1,$ provided that at site $i$ or at site $i-1$ there is a symbol $1.$

With this construction, we obtain the following proposition.

Proposition 2.

For the above coupled construction of $(\eta^{\lambda_{1},A}_{t},t\geq 0)$ and $(\eta^{\lambda_{2},A}_{t},t\geq 0),$ the following holds.

•

$\eta^{\lambda_{1},A}_{t}\leq\eta^{\lambda_{2},A}_{t}$ * for all $t\geq 0.$ *

•

For any fixed site $r\in{\mathbb{Z}},$ $\{\eta^{\lambda_{1},A}_{t}(r)=1\}$ is conditionally independent of $\{\eta^{\lambda_{2},A}_{t}(-r)=1\},$ conditionally on $\{\eta^{\lambda_{1},A}_{t}(-r)=0\}.$

Finally, we will rely on the following well-known result. We define

[TABLE]

Theorem 2 (Theorems 1.6 and 2.28 in Chapter VI of [13]).

The following properties hold.

•

$\varrho(\lambda)=0$ * for $\lambda\leq\lambda_{c},$ and $\varrho(\lambda)>0$ for all $\lambda>\lambda_{c}.$ *

•

The function $\varrho(\lambda)$ is continuous and non-decreasing in $\lambda,$ and $\varrho(\lambda)\uparrow 1$ as $\lambda\uparrow\infty.$

•

For all $\lambda>\lambda_{c},$ there exists a unique probability measure $\nu_{\lambda}$ on ${\mathcal{P}}({\mathbb{Z}})$ such that for any cylinder function $f,$

[TABLE]

where [math] denotes the configuration $\xi\equiv 0$ “all-zero”.

•

The measure $\nu_{\lambda}$ has exponentially decaying correlations, that is, there exist $C,\varepsilon>0$ with the following property. For all cylinder functions $f_{1}$ and $f_{2}$ such that $f_{1}(B)$ depends only on $B\cap R_{1}$ and $f_{2}(B)$ only on $B\cap R_{2},$ for some fixed $R_{1},R_{2}$ which are finite subsets of ${\mathbb{Z}},$

[TABLE]

The monotonicity of $\varrho(\lambda)$ follows from the construction presented in Proposition 2 above. For the remaining results, we refer the interested reader to [13] for a proof and references.

We are now able to prove Theorem 1.

Proof of Theorem 1.

Step 1. We start with the case $\lambda_{1}<\lambda_{2}<\lambda_{c}.$

Define $f(x)=(1-p)^{x}-(1-q)^{x},$ for any $x\in{\mathbb{N}}.$ We rely on the coupled construction of $\eta^{\lambda_{1},0}_{t}$ and $\eta^{\lambda_{2},0}_{t}$ of Proposition 2 above. By Proposition 1, we may therefore write

[TABLE]

where ${\mathbb{E}}_{\lambda_{1},\lambda_{2}}$ denotes the expectation with respect to this monotone coupling of $\eta^{\lambda_{1},0}_{t}$ and $\eta^{\lambda_{2},0}_{t}.$ Then, by the monotonicity and since $f(0)=0,$

[TABLE]

Notice that $f(2)-f(1)=(q-p)(1-p-q)<0,$ since by assumption, $1/2<p\leq q.$ Therefore,

[TABLE]

By symmetry,

[TABLE]

and

[TABLE]

We put $A_{t}:=\{\eta^{\lambda_{2},0}_{t}(-r)=1,\eta^{\lambda_{1},0}_{t}(-r)=0\}.$ By monotonicity we obtain

[TABLE]

We now use the fact that the event $\{\eta^{\lambda_{1},0}_{t}(r)=1\}$ is conditionally independent of $\{\eta^{\lambda_{2},0}_{t}(-r)=1\},$ conditionally on $\{\eta^{\lambda_{1},0}_{t}(-r)=0\}.$ Thus

[TABLE]

Analogously,

[TABLE]

implying that

[TABLE]

Now, if $\lambda_{1}<\lambda_{c},$ we have that

[TABLE]

and

[TABLE]

as $t\to\infty.$ Therefore, there exists $t_{*}$ depending on $f(2),f(1)$ and on $r,$ such that for all $t\geq t^{*},$

[TABLE]

Since ${\mathbb{P}}_{\lambda_{1},\lambda_{2}}(A_{t})>0$ for all finite $t,$ this implies the first assertion in the subcritical case $\lambda_{1}<\lambda_{2}<\lambda_{c}.$

Let us now consider values of $\lambda_{1}$ which are slightly above $\lambda_{c}.$ Relying on Theorem 2, we have

[TABLE]

and

[TABLE]

By Theorem 2, $\varrho(\lambda)\downarrow 0$ as $\lambda\downarrow\lambda_{c}.$ Observe that

[TABLE]

Using (3.5) we deduce that

[TABLE]

with analogous formulas for the other terms appearing in (3.6) and (3.7) above.

Therefore, fix some $\varepsilon>0$ and choose $\lambda_{1}(p)>\lambda_{c}$ sufficiently close to $\lambda_{c}$ such that

[TABLE]

for all $\lambda_{c}\leq\lambda_{1}\leq\lambda_{1}(p).$

Now, fix any $\lambda_{1}\in[\lambda_{c},\lambda_{1}(p)].$ Thanks to the above convergence results (3.6)–(3.8), it is possible to choose first $r^{*}$ and then $t^{*}$ such that for all $r\geq r^{*}$ and $t\geq t^{*},$

[TABLE]

implying that

[TABLE]

This concludes the proof of the first assertion.

Step 2. We finally consider the case where $\lambda_{1}$ is sufficiently larger than $\lambda_{c}.$ Let $Q$ be the monotone coupling between $\nu_{\lambda_{1}}$ and $\nu_{\lambda_{2}}$ induced by the construction of Proposition 2. Using this coupling and the fact that $f(0)=0,$ we obtain thanks to Theorem 2 that

[TABLE]

We want to show that this expression is negative for sufficiently large values of $\lambda_{1}$ and $r.$ We put $\varepsilon:=2-p-q.$ Since by assumption $q>p\geq\frac{2}{3},$ we have $2(1-\varepsilon)>\varepsilon$ (this will be important in (3.11) below).

Then $f(2)-f(1)=-(1-\varepsilon)f(1)$ and $f(2)=\varepsilon f(1).$ Writing for short

[TABLE]

it is clear that

[TABLE]

Applying the last item of Theorem 2, we have that

[TABLE]

Moreover,

[TABLE]

Putting these results together, we conclude that

[TABLE]

Since $2(1-\varepsilon)>\varepsilon,$ it is possible to choose $\delta^{*}$ such that for all $\delta\leq\delta^{*},$

[TABLE]

for some (sufficiently small) $\kappa>0.$ Recall that $\varepsilon=\varepsilon(p).$ Since $\lim_{\lambda\uparrow\infty}\varrho(\lambda)=1,$ we may choose $\lambda_{2}(p)$ sufficiently large such that $\varrho(\lambda_{1})\geq 1-\delta^{*}$ for all $\lambda_{1}\geq\lambda_{2}(p).$ As a consequence, for all $\lambda_{2}\geq\lambda_{1}\geq\lambda_{2}(p),$

[TABLE]

which implies the assertion. ∎

4. Final discussion

In the present article, we have proved that for the contact process the information transmission – as defined in (2.2) and (2.3) – is maximized at a value of the control parameter $\lambda$ which is strictly larger than the critical value $\lambda_{c}.$ Similar issues have been discussed by many other authors, using different measures of information transmission and considering different models. In the present section, we give an overview of these results and compare our findings to the ones already established in the literature.

A commonly used measure to quantify information transfer is a recent information theoretic measure introduced by [14], the so-called transfer entropy. This transfer entropy quantifies “the statistical coherence between systems evolving in time” (cf. [14]). It is “able to distinguish driving and responding elements and to detect asymmetry in the coupling of subsystems” (cf. [14]). An important point is that this quantity measures “to which extent the individual components contribute to information production and at what rate they exchange information among each other”, when an external perturbation is absent (cf. [14]).

In the case of ferromagnetic Ising models, [2] show numerically that this transfer entropy is maximized in the disordered phase, that is, in the region where only one invariant measure exists and which would correspond to the subcritical regime for the contact process. This result is confirmed by the findings of [16]. [2] argue that their result could be related to a subtle interplay between sites within and out the boundaries of same spin domains whose probability distributions are a function of the temperature. On the other hand, [7] consider a Susceptible-Infected-Susceptible (SIS) epidemic model on a homogeneous network and provide simulations showing that the transfer entropy is maximized in the supercritical regime, confirming our result. They argue that “once the disease dynamics reach criticality, we observe strong effects of one individual on a connected neighbor (measured by the transfer entropy). However, as the dynamics become supercritical, the target neighbor becomes more strongly bound to all of its neighbors collectively, and it becomes more difficult to predict its dynamics based on a single source neighbor alone; as such, the transfer entropy begins to decrease” (see [7]). These results are very close to the ones we have found in the present paper for the contact process.

Our paper presents two main differences with respect to the above cited ones. First of all, to the best of our knowledge, our result is the first one available using analytical methods instead of numerical simulations. The second difference is that instead of relying on the transfer entropy, we measure how much a system discriminates between different external inputs to which the system is initially exposed. To do so, we have introduced the notion of sensitivity of the model with respect to the initial signal, given by the total variation distance between the two associated processes at a given single site, at some fixed time.

Let us briefly comment on this choice. Two points of view are commonly adopted to describe the influence of external stimuli in neuronal systems. On the one hand, one might think of external stimuli which are permanently influencing the system, acting as external field. This is the point of view adopted by [12]. The second approach is to think of an initial configuration, coming from another region of the brain, which is exposed as in initial stimulus to the region one is interested in, and to see how this initial stimulus is propagated by the system. This is the point of view we have adopted in the present paper, leading to our definition of sensitivity.

Although our measure of information transfer is different from those used by [2], [7] and also [12], the fact that it is maximized in the supercritical region, that is, in the ordered phase where several invariant measures coexist, is due to the very nature of the system we consider (in the very same way as what was observed in [7]). This is due to the fact that both the SIS epidemics model as well as the contact process describe the evolution and the spread of an epidemics. More precisely, our result can be related to the specific features of the stationary states of our model where $\varrho(\lambda)$ is strictly larger than zero only for $\lambda>\lambda_{c}$ (cf. Theorem 2). In other words, to convey, at large times, a non trivial amount of information, $\lambda$ has to be larger than $\lambda_{c}.$ On the contrary to these results, in the case of the 2d Ising model, [2] observe a peak on the disordered site, that is, in the region, where only one invariant measure exists. This result is characteristic of the very nature of the Ising model and shows that in terms of its information theoretic structure, the Ising model displays different features than the contact process or any other model of epidemics spread.

Acknowledgements

Many thanks to Errico Presutti and Antonio Carlos Roque da Silva Filho for stimulating discussions about this subject. We also thank two anonymous referees for helpful comments and suggestions. We thank the Gran Sasso Science Institute (GSSI) for hospitality and support. This research has been conducted as part of the project Labex MME-DII (ANR11-LBX-0023-01), USP project Mathematics, computation, language and the brain and FAPESP project Research, Innovation and Dissemination Center for Neuromathematics (grant 2013/07699-0). AG is partially supported by CNPq fellowship (grant 311 719/2016-3.)

Bibliography16

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Per Bak. How nature works . Springer New York, 1996.
2[2] Lionel Barnett, Joseph T. Lizier, Michael Harré, Anil K. Seth, and Terry Bossomaier. Information flow in a kinetic ising model peaks in the disordered phase. Phys. Rev. Lett. , 111:177203, Oct 2013.
3[3] John M. Beggs and Dietmar Plenz. Neuronal avalanches in neocortical circuits. Journal of Neuroscience , 23(35):11167–11177, 2003.
4[4] Françoise Bertein and Antonio Galves. Une classe de systèmes de particules stable par association. Z. Wahr. Verw. Gebiete , 41:73–85, 1977.
5[5] C. Bezuidenhout and G. R. Grimmett. The critical contact process dies out. Ann. Probab. , 18:1462–1482, 1990.
6[6] Andrea Cavagna, Alessio Cimarelli, Irene Giardina, Giorgio Parisi, Raffaele Santagati, Fabio Stefanini, and Massimiliano Viale. Scale-free correlations in starling flocks. Proceedings of the National Academy of Sciences , 107(26):11865–11870, 2010.
7[7] E. Yagmur Erten, Joseph T. Lizier, Mahendra Piraveenan, and Mikhail Prokopenko. Criticality and information dynamics in epidemiological models. Entropy , 19(5), 2017.
8[8] Antonio Galves, Nancy L. Garcia, and Clémentine Prieur. Perfect simulation of a coupling achieving the d ¯ ¯ 𝑑 \bar{d} -distance between ordered pairs of binary chains of infinite order. Journal of Statistical Physics , 141(4):669–682, 2010.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Information transmission and criticality in the contact process.

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

1. Introduction

2. Definitions and main results

Theorem 1**.**

3. Proof of Theorem 1

Proposition 1**.**

Remark 1**.**

Proof.

Proposition 2**.**

Theorem 2** (Theorems 1.6 and 2.28 in Chapter VI of [13]).**

Proof of Theorem 1.

4. Final discussion

Acknowledgements

Theorem 1.

Proposition 1.

Remark 1.

Proposition 2.

Theorem 2 (Theorems 1.6 and 2.28 in Chapter VI of [13]).