Universal Protocols for Information Dissemination Using Emergent Signals

Bartlomiej Dudek; Adrian Kosowski (GANG)

arXiv:1705.09798·cs.DS·November 30, 2017

Universal Protocols for Information Dissemination Using Emergent Signals

Bartlomiej Dudek, Adrian Kosowski (GANG)

PDF

Open Access

TL;DR

This paper introduces universal, rapid, and practical protocols for information dissemination in decentralized populations, capable of broadcasting and source detection with convergence in logarithmic squared time, leveraging oscillatory dynamics.

Contribution

It presents the first protocols that are universal, fast, and simple for broadcasting and source detection, utilizing self-organizing oscillatory behavior.

Findings

01

Protocols achieve $O( ext{log}^2 n)$ convergence time with high probability.

02

Broadcasting protocol is exact, ensuring all agents learn the source state.

03

Source detection protocol has one-sided error on a small fraction of the population.

Abstract

We consider a population of $n$ agents which communicate with each other in a decentralized manner, through random pairwise interactions. One or more agents in the population may act as authoritative sources of information, and the objective of the remaining agents is to obtain information from or about these source agents. We study two basic tasks: broadcasting, in which the agents are to learn the bit-state of an authoritative source which is present in the population, and source detection, in which the agents are required to decide if at least one source agent is present in the population or not.We focus on designing protocols which meet two natural conditions: (1) universality, i.e., independence of population size, and (2) rapid convergence to a correct global state after a reconfiguration, such as a change in the state of a source agent. Our main positive result is to show that…

Tables1

Problem:	BitBroadcast	Detection
Non-stationarity property: (Applies to all fast $O (1)$ -state protocols)	no fixed points while source transmits random bits	no fixed points while source is present
New protocols with emergent signal:	universal, 74 states	universal, 55 states
Convergence time:
– No error (exact output)	$O (\log^{2} n)$	impossible
– One-sided $ε$ -error		$O (\log^{2} n)$
Other protocols with $ω (1)$ -states:	Clock-Sync (in synchronized round model) [11]	Time-to-Live [5]

Equations339

A_{i}^{^{?}}

A_{i}^{^{?}}

A_{i}^{^{?}}

A_{i}^{+}

A_{i}^{++}

X

A_{? [1]}^{+}

(A_{? [1]}^{+}, A_{? [2]}^{++}, Y_{?})

(A_{? [1]}^{++}, A_{? [2]}^{+}, Y_{?})

X_{[1]}

X_{[1]}

X_{[1]}

X_{[2]}

X_{[2]}

((X or A_{j}^{?}), M_{?})

((X or A_{j}^{?}), M_{?})

(7) :

(8) :

(9) :

(10) :

L_{- 1}

(A_{?}^{?}, M_{- 1}, L_{?})

(A_{?}^{?}, M_{+ 1}, L_{?})

(A_{?}^{?}, M_{?}, L_{?})

Z ∋ z = (z^{(1)}, \dots, z^{(k)}) \mapsto (ln^{\circ} z^{(1)}, \dots, ln^{\circ} z^{(k)} \equiv ln^{\circ} z \in {ln^{\circ} 0, ln^{\circ} 1, \dots, ln^{\circ} n}^{k}),

Z ∋ z = (z^{(1)}, \dots, z^{(k)}) \mapsto (ln^{\circ} z^{(1)}, \dots, ln^{\circ} z^{(k)} \equiv ln^{\circ} z \in {ln^{\circ} 0, ln^{\circ} 1, \dots, ln^{\circ} n}^{k}),

\overset{u}{˙} \equiv \frac{d u}{d t} := n E (Δ u)

\overset{u}{˙} \equiv \frac{d u}{d t} := n E (Δ u)

A_{i} : A_{i - 1} \mapsto A_{i} with probability p,

A_{i} : A_{i - 1} \mapsto A_{i} with probability p,

Δ a_{i} = \frac{1}{n} \cdot Δ^{#} A_{i} = ⎩ ⎨ ⎧ + 1/ n, - 1/ n, 0, with probability p a_{i - 1} a_{i}, with probability p a_{i} a_{i + 1}, otherwise,

Δ a_{i} = \frac{1}{n} \cdot Δ^{#} A_{i} = ⎩ ⎨ ⎧ + 1/ n, - 1/ n, 0, with probability p a_{i - 1} a_{i}, with probability p a_{i} a_{i + 1}, otherwise,

\overset{a}{˙}_{i} = n E (Δ a_{i}) = p a_{i - 1} a_{i} - p a_{i} a_{i + 1},

\overset{a}{˙}_{i} = n E (Δ a_{i}) = p a_{i - 1} a_{i} - p a_{i} a_{i + 1},

ϕ = ln (a_{1} a_{2} a_{3})

ϕ = ln (a_{1} a_{2} a_{3})

Δ a_{i} = ⎩ ⎨ ⎧ + 1/ n, - 1/ n, 0, with probability \frac{1}{3} x (s - a_{i}) + p a_{i}^{+} a_{i - 1} + 2 p a_{i}^{++} a_{i - 1}, with probability \frac{2}{3} x a_{i} + p a_{i} a_{i + 1}^{+} + 2 p a_{i + 1}^{++} a_{i}, otherwise.

Δ a_{i} = ⎩ ⎨ ⎧ + 1/ n, - 1/ n, 0, with probability \frac{1}{3} x (s - a_{i}) + p a_{i}^{+} a_{i - 1} + 2 p a_{i}^{++} a_{i - 1}, with probability \frac{2}{3} x a_{i} + p a_{i} a_{i + 1}^{+} + 2 p a_{i + 1}^{++} a_{i}, otherwise.

Δ a_{i}^{++} = ⎩ ⎨ ⎧ + 1/ n, - 1/ n, 0, with probability a_{i} (a_{i} - a_{i}^{++}), with probability x a_{i}^{++} + (s - a_{i}) a_{i}^{++}, otherwise.

Δ a_{i}^{++} = ⎩ ⎨ ⎧ + 1/ n, - 1/ n, 0, with probability a_{i} (a_{i} - a_{i}^{++}), with probability x a_{i}^{++} + (s - a_{i}) a_{i}^{++}, otherwise.

\overset{a}{˙}_{i}

\overset{a}{˙}_{i}

= x (s /3 - a_{i}) + p a_{i - 1} (a_{i} + a_{i}^{++}) - p a_{i} (a_{i + 1} + a_{i + 1}^{++})

\overset{a}{˙}_{i}^{++}

\dot{ϕ}

κ_{i} = s \frac{a _{i}^{++}}{a _{i}} - a_{i} thus a_{i}^{++} = \frac{a _{i}}{s} (a_{i} + κ_{i}) .

κ_{i} = s \frac{a _{i}^{++}}{a _{i}} - a_{i} thus a_{i}^{++} = \frac{a _{i}}{s} (a_{i} + κ_{i}) .

δ_{i} = a_{i} - a_{i - 1}

δ_{i} = a_{i} - a_{i - 1}

δ = δ_{1}^{2} + δ_{2}^{2} + δ_{3}^{2}

δ = δ_{1}^{2} + δ_{2}^{2} + δ_{3}^{2}

κ = κ_{1}^{2} + κ_{2}^{2} + κ_{3}^{2}

κ = κ_{1}^{2} + κ_{2}^{2} + κ_{3}^{2}

\dot{ϕ}

\dot{ϕ}

\displaystyle=\frac{p}{s}\bigg{(}\sum(a_{i}+\kappa_{i})(a_{i-1}-a_{i})\bigg{)}

\displaystyle=\frac{p}{s}\bigg{(}-\frac{1}{2}\sum(a_{i}-a_{i-1})^{2}+\sum\kappa_{i}(a_{i-1}-a_{i})\bigg{)}

\displaystyle\leq\frac{p}{s}\bigg{(}-\frac{1}{2}\delta^{2}+\kappa\delta\bigg{)}

\dot{a_{i}^{++}} = a_{i}^{2} - s a_{i}^{++} = - a_{i} κ_{i} .

\dot{a_{i}^{++}} = a_{i}^{2} - s a_{i}^{++} = - a_{i} κ_{i} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDNA and Biological Computing · Modular Robots and Swarm Intelligence · Distributed systems and fault tolerance

Full text

Universal Protocols for Information Dissemination

Using Emergent Signals

Bartłomiej Dudek

University of Wrocław, Poland

Adrian Kosowski111Corresponding Author. Email: [email protected]

Inria Paris, France

Abstract

We consider a population of $n$ agents which communicate with each other in a decentralized manner, through random pairwise interactions. One or more agents in the population may act as authoritative sources of information, and the objective of the remaining agents is to obtain information from or about these source agents. We study two basic tasks: broadcasting, in which the agents are to learn the bit-state of an authoritative source which is present in the population, and source detection, in which the agents are required to decide if at least one source agent is present in the population or not.

We focus on designing protocols which meet two natural conditions: (1) universality, i.e., independence of population size, and (2) rapid convergence to a correct global state after a reconfiguration, such as a change in the state of a source agent. Our main positive result is to show that both of these constraints can be met. For both the broadcasting problem and the source detection problem, we obtain solutions with a convergence time of $O(\log^{2}n)$ rounds, w.h.p., from any starting configuration. The solution to broadcasting is exact, which means that all agents reach the state broadcast by the source, while the solution to source detection admits one-sided error on a $\varepsilon$ -fraction of the population (which is unavoidable for this problem). Both protocols are easy to implement in practice and have a compact formulation.

Our protocols exploit the properties of self-organizing oscillatory dynamics. On the hardness side, our main structural insight is to prove that any protocol which meets the constraints of universality and of rapid convergence after reconfiguration must display a form of non-stationary behavior (of which oscillatory dynamics are an example). We also observe that the periodicity of the oscillatory behavior of the protocol, when present, must necessarily depend on the number $\text{$ {}^{#} $}\!\!X$ of source agents present in the population. For instance, our protocols inherently rely on the emergence of a signal passing through the population, whose period is $\Theta(\log\frac{n}{\text{$ {}^{#} $}\!\!X})$ rounds for most starting configurations. The design of clocks with tunable frequency may be of independent interest, notably in modeling biological networks.

Key words: Gossiping, Epidemic processes, Oscillatory dynamics, Emergent phenomena,

Population protocols, Broadcasting, Distributed clock synchronization.

1 Introduction

Information-spreading protocols, and more broadly epidemic processes, appear in nature, social interactions between humans, as well as in man-made technology, such as computer networks. For some protocols we have a reasonable understanding of the extent to which the information has already spread, i.e., we can identify where the information is located at a given step of the process: we can intuitively say which nodes (or agents) are “informed” and which nodes are “uninformed”. This is the case for usual protocols in which uninformed agents become informed upon meeting a previously informed agent, cf. e.g. mechanisms of rumor spreading and opinion spreading models studied in the theory community [29, 26]). Arguably, most man-made networking protocols for information dissemination also belong to this category.

By contrast, there exists a broad category of complex systems for which it is impossible to locate which agents have acquired some knowledge, and which are as yet devoid of it. In fact, the question of “where the information learned by the system is located” becomes somewhat fuzzy, as in the case of both biological and synthetic neural networks. In such a perspective, information (or knowledge) becomes a global property of the entire system, whereas the state of an individual agent represents in principle its activation, rather than whether it is informed or not. As such, knowledge has to be treated as an emergent property of the system, i.e., a global property not resulting directly from the local states of its agents. The convergence from an uninformed population to an informed population over time is far from monotonous. Even so, once some form of “signal” representing global knowledge has emerged, agents may try to read and copy this signal into their local state, thus each of them eventually also becomes informed. At a very informal conceptual level, we refer to this category of information-dissemination protocols as protocols with emergent behavior. At a more technical level, emergent protocols essentially need to rely on non-linear dynamical effects, which typically include oscillatory behavior, chaotic effects, or a combination of both. (This can be contrasted with simple epidemic protocols for information-spreading, in which nodes do not become deactivated.)

This work exhibits a simple yet fundamental information-spreading scenario which can only be addressed efficiently using emergent protocols. Both the efficient operation of the designed protocols, and the need for non-stationary dynamical effects in any efficient protocol for the considered problems, can be formalized through rigorous theoretical analysis. Our goal in doing this is twofold: to better understand the need for emergent behavior in real-world information spreading, and to display the applicability of such protocols in man-made information spreading designs. For the latter, we describe an interpretation of information as a (quasi-)periodic signal, which can be both decoded from states of individual nodes, and encoded into them.

1.1 Problems and Model

We consider a population of $n$ identical agents, each of which may be in a constant number of possible states. Interactions between agents are pairwise and random. A fair scheduler picks a pair of interacting agents independently and uniformly at random in each step. The protocol definition is provided through a finite sequence of state transition rules, following the precise conventions of the randomized Population Protocol model [6, 8] or (equivalently) of Chemical Reaction Networks [21].222The activation model is thus asynchronous. The same protocols may be deployed in a synchronous setting, with scheduler activations following, e.g., the independent random matching model (with only minor changes to the analysis) or the PULL model [29] (at the cost of significantly complicating details of the protocol formulation).

The input to the problem is given by fixing the state of some subset of agents, to some state of the protocol, which is not available to any of the other agents. Intuitively, the agents whose state has been fixed are to be interpreted as authoritative sources of information, which is to be detected and disseminated through the network (i.e., as the rumor source node, broadcasting station, etc.). For example, the problem of spreading a bit of information through the system is formally defined below.

Problem BitBroadcast

Input States:

$X_{1},X_{2}$ .

Promise:

The population contains a non-zero number of agents in exactly one of the two input states $\{X_{1},X_{2}\}$ .

Question:

Decide if the input state present in the population is $X_{1}$ or $X_{2}$ .

We can, e.g., consider that the transmitting station (or stations) choose whether to be in state $X_{1}$ or $X_{2}$ in a way external to the protocol, and thus transmit the “bit” value $1$ or $2$ , respectively, through the network. Broadcasting a bit is one of the most fundamental networking primitives.

The definition of the population protocol includes a partition of the set of states of the protocol into those corresponding to the possible answers to the problem. When the protocol is executed on the population, the output of each agent may be read at every step by checking, for each agent, whether its state belongs to the subset of an output state with a given answer (in this case, the answer of the agent will be the “bit” it has learned, i.e., 1 or 2). We will call a protocol exact if it eventually converges to a configuration, such that starting from this configuration all agents always provide the correct answer. We will say it operates with $\varepsilon$ -error, for a given constant $\varepsilon>0$ , if starting from some step, at any given step of the protocol, at most an $\varepsilon$ -fraction of the population holds the incorrect answer, with probability $1-O(1/n)$ .

Time is measured in steps of the scheduler, with $n$ time steps called a round, with the expected number of activations of each agent per round being a constant. Our objective is to design protocols which converge to the desired outcome rapidly. Specifically, a protocol is expected to converge in $O(\mathrm{poly}\log n)$ rounds (i.e., in $O(n\mathrm{poly}\log n)$ steps), with probability $1-O(1/n)$ , starting from any possible starting configuration of states in the population, conforming to the promise of the problem.333We adhere to this strong requirement for self-stabilizing (or self-organizing) behavior from any initial configuration in the design of our protocols. The presented impossibility results still hold under significantly weaker assumptions.

Motivated by both applications and also a need for a better understanding of the broadcasting problem, we also consider a variant of the broadcasting problem in which no promise on the presence of the source is given. This problem, called Detection, is formally defined below.

Problem Detection

Input State:

$X$ .

Question:

Decide if at least one agent in state $X$ is present in the population.

Detection of the presence of a source is a task which is not easier than broadcasting a bit. Indeed, any detection protocol is readily converted into a broadcasting protocol for states $\{X_{1},X_{2}\}$ by identifying $X=X_{1}$ and treating $X_{2}$ as a dummy state which does not enter into any interactions (i.e., is effectively not visible in the network). Intuitively, the detection task in the considered setting is much harder: a source $X$ may disappear from the network at any time, forcing other agents to spontaneously “unlearn” the outdated information about the presence of the source. This property is inherently linked to the application of the Detection problem in suppressing false rumors or outdated information in social interactions. Specifically, it may happen that a certain part of a population find themselves in an informed state before the original rumor source is identified as a source of false information, a false rumor may be propagated accidentally because of an agent which previously changed state from “uninformed” to “informed” due to a fault or miscommunication, or the rumor may contain information which is no longer true. Similar challenges with outdated information and/or false-positive activations are faced in Chemical Reaction Networks, e.g. in DNA strand displacement models [5]. In that context, the detection problem has the intuitive interpretation of detecting if a given type of chemical or biological agent (e.g., a contaminant, cancer cell, or hormonal signal) is present in the population, and spreading this information among all agents.

1.2 Our Results

In Section 3, we show that both the BitBroadcast, and the Detection problem can be solved with protocols which converge in $O(\log^{2}n)$ rounds to an outcome, with probability $1-O(1/n)$ , starting from any configuration of the system. The solution to BitBroadcast guarantees a correct output. The solution to Detection admits one-sided $\varepsilon$ -error: in the absence of a source, all agents correctly identify it as absent, whereas when the source is present, at any moment of time after convergence the probability that at least $(1-\varepsilon)n$ agents correctly identify the source as present is at least $1-O(1/n)$ .444The existence of one-sided error is inherent to the Detection problem in the asynchronous setting: indeed, if no agent of the population has not made any communication with the source over an extended period of time, it is impossible to tell for sure if the source has completely disappeared from the network, or if it is not being selected for interaction by the random scheduler. Here, $\varepsilon>0$ is a constant influencing the protocol design, which can be made arbitrarily small.

The designed protocols rely on the same basic building block, namely, a protocol realizing oscillatory dynamics at a rate controlled by the number of present source states in the population. Thus, these protocols display non-stationary behavior. In Section 4, we show that such behavior is a necessary property in the following sense. We prove that in any protocol which solves Detection in sub-polynomial time in $n$ and which uses a constant number of states, the number of agents occupying some state has to undergo large changes: by a polynomially large factor in $n$ during a time window of length proportional to the convergence time of the protocol. For the BitBroadcast problem, we show that similar volatile behavior must appear in a synthetic setting in which a unique source is transmitting its bit as random noise (i.e., selecting its input state $\{X_{1},X_{2}\}$ uniformly at random in subsequent activations).

We note that, informally speaking, our protocols rely on the emergence of a “signal” passing through the population, whose period is $\Theta(\log\frac{n}{\text{$ {}^{#} $}\!\!X})$ rounds when the number of source agents in state $X$ is $\text{$ {}^{#} $}\!\!X$ . In Section 5, we then discuss how the behavior of any oscillatory-type protocol controlled by the existence of $\text{$ {}^{#} $}\!\!X$ has to depend on both $n$ and $\text{$ {}^{#} $}\!\!X$ . We prove that for any such protocol with rapid convergence, the cases of subpolynomial $\text{$ {}^{#} $}\!\!X$ and $\text{$ {}^{#} $}\!\!X=\Theta(n)$ can be separated by looking at the portion of the configuration space regularly visited by the protocol. This, in particular, suggests the nature of the dependence of the oscillation period on the precise value of $\text{$ {}^{#} $}\!\!X$ , and that the protocols we design with period $\Theta(\log\frac{n}{\text{$ {}^{#} $}\!\!X})$ are among the most natural solutions to the considered problems.

The proofs of all theorems are deferred to the closing sections of the paper.

1.3 Comparison to the State-of-the-Art

Our work fits into lines of research on rumor spreading, opinion spreading, population protocols and other interaction models, and emergent systems. We provide a more comprehensive literature overview of some of these topics in Subsection 1.4.

Originality of methods.

The oscillatory dynamics we apply rely on an input-parameter-controlled oscillator. The uncontrolled version of the oscillator which we consider here is the length- $3$ cyclic oscillator of the cyclic type, known in population dynamics under the name of rock-paper-scissors (or RPS). This has been studied intensively in the physics and evolutionary dynamics literature (cf.e.g. [46] for a survey), while algorithmic studies are relatively scarce [15]. We remark that the uncontrolled cyclic oscillator with a longer (but $O(1)$ length cycle) has been applied for clock/phase synchronization in self-stabilizing settings and very recently in the population protocol setting when resolving the leader election problem [25]. (The connection to oscillatory dynamics is not made explicit, and the longer cycle provides for a neater analysis, although it does not seem to be applicable to our parameter-controllable setting.) Whereas we are not aware of any studies of parameter-controlled oscillators in a protocol design setting (nor for that matter, of rigorous studies in other fields), we should note that such oscillators have frequently appeared in models of biological systems, most notably in biological networks and neuroendocrinology ([27] for a survey). Indeed, some hormone release and control mechanisms (e.g., for controlling GnRH surges in vertebrates) appear to be following a similar pattern. To the best of our knowledge, no computational (i.e., interaction-protocol-based) explanation for these mechanisms has yet been proposed, and we hope that our work may provide, specifically on the Detect problem, may provide some insights in this direction.

In terms of lower bounds, we rely on rather tedious coupling techniques for protocols allowing randomization, and many of the details are significantly different from lower-bound techniques found in the population protocols literature. We remark that a recent line of work in this area [21, 3] provides a powerful set of tools for proving lower bounds on the number of states (typically $\Omega(\log\log n)$ states) for fast (typically polylogarithmic) population protocols for different problems, especially for the case of deterministic protocols. We were unable to leverage these results to prove our lower bound for the randomized scenario studied here, and believe our coupling analysis is complementary to their results.

1.4 Other Related Work

Our work fits into the line of research on rumor spreading, population protocols, and related interaction models. Our work also touches on the issue of how distributed systems may spontaneously achieve some form of coordination with minimum agent capabilities. The basic work in this direction, starting with the seminal paper [32], focuses on synchronizing timers through asynchronous interprocess communication to allow processes to construct a total ordering of events. A separate interesting question concerns local clocks which, on their own, have some drift, and which need to synchronize in a network environment (cf. e.g. [37, 35], or [34] for a survey of open problems).

Rumor spreading.

Rumor spreading protocols are frequently studied in a synchronous setting. In a synchronous protocol, in each parallel round, each vertex independently at random activates a local rule, which allows it either to spread the rumor (if it is already informed), or possibly also to receive it (if it has not yet been informed, as is the case in the push-pull model). The standard push rumor spreading model assumes that each informed neighbor calls exactly one uninformed neighbor. In the basic scenario, corresponding to the complete interaction network, the number of parallel rounds for a single rumor source to inform all other nodes is given as $\log_{2}n+\ln n+o(\log n)$ , with high probability [42, 24]. More general graph scenarios have been studied in [22] in the context of applications in broadcasting information in a network. Graph classes studied for the graph model include hypercubes [22], expanders [45], and other models of random graphs [23]. The push-pull model of rumor spreading is an important variation: whereas for complete networks the speedup due to the pull process is in the order of a multiplicative constant [29], the speed up turns out to be asymptotic, e.g., on preferential attachment graphs, where the rumor spreading time is reduced from $\Theta(\log n)$ rounds in the push model to $\Theta(\log n/\log\log n)$ rounds in the push-pull model [18], as well as on other graphs with a non-uniform degree distribution. The push-pull model often also proves more amenable to theoretical analysis. We note that asynchronous rumor spreading on graphs, in models closer to our random scheduler, has also been considered in recent work [40, 26], with [26] pointing out the tight connections between the synchronous (particularly push-pull) and asynchronous models in general networks.

Population protocols.

Population protocols are a model which captures the way in which the complex behavior of systems (biological, sensor nets, etc.) emerges from the underlying local interactions of agents. The original model of Angluin et al. [6, 7] was motivated by applications in sensor mobility. Despite the limited computational capabilities of individual sensors, such protocols permit at least (depending on available extensions to the model) the computation of two important classes of functions: threshold predicates, which decide if the weighted average of types appearing in the population exceeds a certain value, and modulo remainders of similar weighted averages. The majority function, which belongs to the class of threshold functions, was shown to be stably computable for the complete interaction graph [6]; further results in the area of majority computation can be found in [7, 9, 38, 10]. A survey of applications and models of population protocols is provided in [9, 39]. An interesting line of research is related to studies of the algorithmic properties of dynamics of chemical reaction networks [21]. These are as powerful as population protocols, though some extensions of the chemical reaction model also allow the population size to change in time. Two very recent results in the population protocol model are worthy of special attention. Alistarh, Aspnes, and Gelashvili [4] have resolved the question of the number of states required to solve the Majority problem on a complete network in polylogarithmic time as $\Theta(\log n)$ . For the equally notable task of Leader Election, the papers of Gasieniec and Stachowiak [25] (for the upper bound) together with the work of Alistarh, Aspnes, Eisenstat, Gelashvili, and Rivest [3] (for the lower bound) put the number of states required to resolve this question in polylogarithmic time as $\Theta(\log\log n)$ . Both of these results rely on a notion of a self-organizing phase clock.

Nonlinearity in interaction protocols.

Linear dynamical systems, as well as many nonlinear protocols subjected to rigorous analytical study, have a relatively simple structure of point attractors and repellers in the phase space. The underlying continuous dynamics (in the limit of $n\to+\infty$ ) of many interaction protocols defined for complete graphs would fit into this category: basic models of randomized rumor spreading [42]; models of opinion propagation (e.g. [14, 1]); population protocols for problems such as majority and thresholds [6, 7]; all reducible Markov chain processes, such as random walks and randomized iterative load balancing schemes.

Nonlinear dynamics with non-trivial limit orbits are fundamental to many areas of systems science, including the study of physical, chemical and biological systems, and to applications in control science. In general, population dynamics with interactions between pairs of agents are non-linear (representable as a set of quadratic difference equations) and have potentially complicated structure if the number of states is $3$ or more. For example, the simple continuous Lotka-Volterra dynamics [36] gives rise to a number of discrete models, for example one representing interactions of the form $A+B\to A+A$ , over some pairs $A,B$ of states in a population (cf. [46] for further generalizations of the framework or [15] for a rigorous analysis in the random scheduler model). The model describes transient stability in a setting in which several species are in a cyclic predator-prey relation. Cyclic protocols of the type have been consequently identified as a potential mechanism for describing and maintaining biodiversity, e.g., in bacterial colonies [30, 31]. Cycles of length 3, in which type $A_{2}$ attacks type $A_{1}$ , type $A_{3}$ attacks type $A_{2}$ , and type $A_{1}$ attacks type $A_{3}$ , form the basis of the basic oscillator, also used as the starting point for protocols in this work, which is referred to as the RPS (rock-paper-scissors) oscillator or simply the 3-cycle oscillator, which we discuss further in Section 6.1. This protocol has been given a lot of attention in the statistical physics literature. The original analytical estimation method applied to RPS was based on approximation with the Fokker-Planck equation [44]. A subsequent analysis of cyclic $3$ - and $4$ -species models using Khasminskii stochastic averaging can be found in [16], and a mean field approximation-based analysis of RPS is performed in [41]. In [15], we have performed a study of some algorithmic implications of RPS, showing that the protocol may be used to perform randomized choice in a population, promoting minority opinions, in $\tilde{O}(n^{2})$ steps. All of these results provide a good qualitative understanding of the behavior of the basic cyclic protocols. We remark that the protocol used in this paper is directly inspired by the properties of RPS, as we discuss further on, but has a more complicated interaction structure (see Fig. 1).

For protocols with convergence to a single point in the configuration space in the limit of large population size, a discussion of the limit behavior is provided in [12], who provide examples of protocols converging to limit points at coordinates corresponding to any algebraic numbers.

We also remark that local interaction dynamics on arbitrary graphs (as opposed to the complete interaction graph) exhibit a much more complex structure of their limit behavior, even if the graph has periodic structure, e.g., that of a grid. Oscillatory behavior may be overlaid with spatial effects [46], or the system may have an attractor at a critical point, leading to simple dynamic processes displaying self-organized criticality (SOC, [43]).

2 Preliminaries: Building Blocks for Population Protocols

2.1 Protocol Definition

A randomized population protocol for a population of $n$ agents is defined as a pair $P=(K_{n},R_{n})$ , where $K_{n}$ is the set of states and $R_{n}$ is the set of interaction rules. The interaction graph is complete. We will simply write $P=(K,R)$ , when considering a protocol which is universal (i.e., defined in the same way for each value of $n$ ) or if the value of $n$ is clear from the context. All the protocols we design are universal; our lower bounds also apply to some non-universal protocols. The set of rules $R\subseteq{K^{4}\times[0,1]}$ is given so that each rule $j\in R$ is of the form $j=(i_{1}(j),i_{2}(j),o_{1}(j),o_{2}(j),q_{j})$ , describing an interaction read as: $``(i_{1}(j),i_{2}(j))\to(o_{1}(j),o_{2}(j))\text{with probability$ q_{j} $}$ ”. For all $i_{1},i_{2}\in K$ , we define $R_{i_{1},i_{2}}=\{j\in R:(i_{1}(j),i_{2}(j))=(i_{1},i_{2})\}$ as the set of rules acting on the pair of states $i_{1},i_{2}$ , and impose that $\sum_{j\in R_{i_{1},i_{2}}}q_{j}\leq 1$ .

For a state $A\in K$ , we denote the number of agents in state $A$ as $\text{$ {}^{#} $}\!\!A$ , and the concentration of state $A$ as $a=\text{$ {}^{#} $}\!\!A/n$ , and likewise for a set of states $\mathcal{A}$ , we write $\text{$ {}^{#} $}\!\!\mathcal{A}=\sum_{A\in\mathcal{A}}\text{$ {}^{#} $}\!\!A$ .

In any configuration of the system, each of the $n$ agents from the population is in one of states from $K_{n}$ . The protocol is executed by an asynchronous scheduler, which runs in steps. In every step the scheduler uniformly at random chooses from the population a pair of distinct agents to interact: the initiator and the receiver. If the initiator and receiver are in states $i_{1}$ and $i_{2}$ , respectively, then the protocol executes at most one rule from set protocol $R_{i_{1},i_{2}}$ , selecting rule $j\in R_{i_{1},i_{2}}$ with probability $q_{j}$ . If rule $j$ is executed, the initiator then changes its state to $o_{1}(j)$ and the receiver to $o_{2}(j)$ . The source has a special state, denoted $X$ in the Detect problem, or one of two special states, denoted $\{X_{1},X_{2}\}$ in the BitBroadcast problem, which is never modified by any rule.

All protocols are presented in the randomized framework, however, the universal protocols considered here are amenable to a form of conversion into deterministic rules discussed in [3], which simulates randomness of rules by exploiting the inherent randomness of the scheduler in choosing interacting node pairs to distribute weakly dependent random bits around the system.

All protocols designed in this work are initiator-preserving, which means that for any rule $j\in R$ , we have $o_{1}(j)=i_{1}(j)$ (i.e., have all rules of the form $A+B\to A+C$ , also more compactly written as $A\mathbf{\colon}\quad\!\!B\to C$ ), which makes them relevant in a larger number of application. As an illustrative example, we remark that the basic rumor spreading (epidemic) model is initiator-preserving and given simply as $1\mathbf{\colon}\quad\!\!0\to 1$ . All protocols can also obviously be rewritten to act on unordered pairs of agents picked by the scheduler, rather than ordered pairs.

2.2 Protocol Composition Technique

Our protocols will be built from simpler elements. Our basic building block is the input-controlled oscillatory protocol $P_{o}$ (see Fig. 1). We then use protocol $P_{o}$ as a component in the construction of other, more complex protocols, without disrupting the operation of the original protocol.

Formally, we consider a protocol $P_{B}$ using state set $B=\{B_{i}:1\leq i\leq k_{b}\}$ and rule set $R_{B}$ , and a protocol extension $P_{BC}$ using a state set $B\times C=B\times\{C_{i}:1\leq i\leq k_{c}\}$ , where $C$ is disjoint from $B$ , and rule extension set $R_{BC}$ . Each rule extension defines for each pair of states from $B\times C$ (i.e., to each element of $(B\times C)\times(B\times C)$ ) a probability distribution over elements of $C\times C$ .

The composed protocol $P_{B}\circ P_{BC}$ is a population protocol with set of states $B\times C$ . Its rules are defined so that, for a selected pair of agents in states $(B_{i},C_{j})$ and $(B_{i^{\prime}},C_{j^{\prime}})$ , we obtain a pair of agents in states $(B_{i^{*}},C_{j^{*}})$ and $(B_{i^{\prime*}},C_{j^{\prime*}})$ according to a probability distribution defined so that:

•

Each pair $B_{i^{\prime*}},B_{i^{\prime*}}$ appears in the output states of the two agents with the same probability as it would in an execution of protocol $P_{B}$ on a pair of agents in states $B_{i}$ and $B_{i^{\prime}}$ .

•

Each pair $C_{i^{\prime*}},C_{i^{\prime*}}$ appears in the output states of the two agents with the probability given by the definition of $P_{BC}$ .

In the above, the pairs of agents activated by $P_{B}$ and $P_{BC}$ are not independent of each other. This is a crucial property in the composition of protocol $P_{o}$ when composing it with further blocks to solve the Detect problem.

We denote by $\mathbf{1}_{B}$ the identity protocol which preserves agent states on set of states $B$ . For a protocol $P$ , we denote by $P/2$ a lazy version of a protocol $P$ in which the rule activation of $P$ occurs with probability $1/2$ , and with probability $1/2$ the corresponding rule of the identity protocol is activated. Note that all asymptotic bounds on expected and w.h.p. convergence time obtained for any protocol $P$ also apply to protocol $P/2$ , in the regime of at least a logarithmic number of time steps. We also sometimes treat a protocol $P_{BC}$ extension as a protocol in itself, applied to the identity protocol $\mathbf{1}_{B}$ .

The independently composed protocol $P_{B}+P_{BC}$ is defined as an implementation of the composed protocol $(P_{B})\circ(P_{C}/2)$ , realized with the additional constraint that in each step, either the rule of $P_{B}$ is performed with an identity rule extension, or the rule extension of $P_{BC}$ is performed on top of the identity protocol $\mathbf{1}_{B}$ . Such a definition is readily verified to be correct by a simple coupling argument, and allows us to analyze protocols $P_{B}$ and $P_{BC}$ , observing that the pairs (identities) of agents activated by the scheduler in the respective protocols are independent.

All the composed protocols (and protocol extensions) we design are also initiator-preserving, i.e., $C_{i^{*}}=C_{i}$ and $B_{i^{*}}=B_{i}$ , with probability $1$ . In notation, rules omitted from the description of protocol extensions are implicit, occurring with probability [math] (where $C_{j^{*}}\neq C_{j}$ ) or with the probability necessary for the normalization of the distribution to $1$ , where the state is preserved (where $C_{j^{*}}=C_{j}$ ).

As a matter of naming convention, we name the states in the separate state sets of the composed protocols with distinct letters of the alphabet, together with their designated subscripts and superscripts. The rumor source $X$ is treated specially and uses a separate letter (and may be seen as a one state protocol without any rules, on top of which all other protocols are composed; in particular, its state is never modified). The six remaining states of protocol $P_{o}$ are named with the letters $A_{{}_{?}}^{{}^{?}}$ , as usual in its definition. Subsequent protocols will use different letters, e.g., $M_{?}$ and $L_{?}$ .

3 Overview of Protocol Designs

3.1 Main Routine: Input-Controlled Oscillator Protocol $P_{o}$

We first describe the main routine which allows us to convert local input parameters (the existence of source into a form of global periodic signal on the population. This main building block is the construction of a $7$ -state protocol $P_{o}$ following oscillator dynamics, whose design we believe to be of independent interest.

The complete design of protocol $P_{o}$ is shown in Fig. 1. The source state is denoted by $X$ . Additionally, there are six states, called $A_{i}^{+}$ and $A_{i}^{++}$ , for $i\in\{1,2,3\}$ . The naming of states in the protocol is intended to maintain a direct connection with the RPS oscillator dynamics, which is defined by the simple rule “ $A_{i}\mathbf{\colon}\quad A_{i-1}\mapsto A_{i}$ , for $i=1,2,3$ ”. In fact, we will retain the convention $A_{i}=\{A_{i}^{+},A_{i}^{++}\}$ and $a_{i}=a_{i}^{+}+a_{i}^{++}$ , and consider the two states $A_{i}^{+}$ and $A_{i}^{++}$ to be different flavors of the same species $A_{i}$ , referring to the respective superscripts as either lazy (+) or aggressive (++).

The protocol has the property that in the absence of $X$ , it stops in a corner state of the phase space, in which only one of three possible states appears in the population, and otherwise regularly (every $O(\log n)$ steps) moves sufficiently far away from all corner states. An intuitive formalization of the basic properties of the protocol is given by the theorem below.

Theorem 1.

There exists a universal protocol $P_{o}$ with $|K|=7$ states, including a distinguished source state $X$ , which has the following properties.

For any starting configuration, in the absence of the source $(\text{$ {}^{#} $}\!\!X=0)$ , the protocol always reaches a configuration such that:

•

all agents are in the same state: either $A_{1}^{++}$ , or $A_{2}^{++}$ , or $A_{3}^{++}$ ;

•

no further state transitions occur after this time.

Such a configuration is reached in $O(\log n)$ rounds, with constant probability (and in $O(\log^{2}n)$ rounds with probability $1-O(1/n)$ ). 2. 2.

For any starting configuration, in the presence of the source $(\text{$ {}^{#} $}\!\!X\geq 1)$ , we have with probability $1-O(1/n)$ :

•

for each state $i\in K$ , there exists a time step in the next $O(\log\frac{n}{\text{$ {}^{#} $}\!\!X})$ rounds when at least a constant fraction of all agents are in state $i$ ;

•

during the next $O(\log\frac{n}{\text{$ {}^{#} $}\!\!X})$ rounds, at least a constant fraction of all agents change their state at least once.

The proof of the Theorem is provided in Section 6.

The RPS dynamics provides the basic oscillator mechanism which is still largely retained in our scenario. Most of the difficulty lies in controlling its operation as a function of the presence or absence of the rumor source. We do this by applying two separate mechanisms. The presence of rumor source $X$ shifts the oscillator towards an orbit closer to the central orbit $(A_{1},A_{2},A_{3})=(1/3,1/3,1/3)$ through rule $(5)$ , which increases the value of potential $\phi:=\ln(a_{1}a_{2}a_{3})$ , where $a_{i}=\text{$ {}^{#} $}\!\!A_{i}/n$ . Conversely, independent of the existence of rumor source $X$ , a second mechanism is intended to reduce the value of potential $\phi$ . This mechanism exploits the difference between the aggressive and lazy flavors of the species. Following rule $(1)$ , an agent belonging to a species becomes more aggressive if it meets another from the same species, and subsequently attacks agents from its prey species with doubled probability following rule $(4)$ . This behavior somehow favors larger species, since they are expected to have (proportionally) more aggressive agents than the smaller species (in which pairwise interactions between agents of the same species are less frequent) — the fraction of agents in $A_{i}$ which are aggressive would, in an idealized static scenario, be proportional to $a_{i}$ . (This is, in fact, often far from true due to the interactions between the different aspects of the dynamics). As a very loose intuition, the destabilizing behavior of the considered rule on the oscillator is resemblant of the effect an eccentrically fitted weight has on a rotating wheel, pulling the oscillator towards more external orbits (with smaller values of $\phi$ ).

The intuition for which the proposed dynamics works, and which we will formalize and prove rigorously in Section 6, can now be stated as follows: in the presence of rumor source $X$ , the dynamics will converge to a form of orbit on which the two effects, the stabilizing and destabilizing one, eventually compensate each other (in a time-averaged sense). The period of a single rotation of the oscillator around such an orbit is between $O(1)$ and $O(\log n)$ , depending on the concentration of $X$ . In the absence of $X$ , the destabilizing rule will prevail, and the oscillator will quickly crash into a side of the triangle.

For small values of $\text{$ {}^{#} $}\!\!X>0$ , the protocol can be very roughly (and non-rigorously) viewed as cyclic composition of three dominant rumor spreading processes over three sets of states $A_{1}$ , $A_{2}$ , $A_{3}$ , one converting states $A_{1}$ to $A_{3}$ , the next from $A_{3}$ to $A_{2}$ , and the last from $A_{2}$ to $A_{1}$ , which spontaneously take over at moments of time separated by $O(\log n)$ parallel rounds. For other starting configurations, and especially for the case of $\text{$ {}^{#} $}\!\!X=0$ , the dynamics of the protocol, which has $5$ free dimensions, is more involved to describe and analyze (see Section 6.4). We provide some further insights into the operation of the protocol in Section 7.1, notably formalizing the notion that an intuitively understood oscillation (going from a small number of agents in some state $A_{i}$ , to a large number of agents in state $A_{i}$ , and back again to a small number of agents in state $A_{i}$ ) takes $\Theta(\log\frac{n}{\text{$ {}^{#} $}\!\!X})$ steps, with probability $1-O(1/n)$ . As such, protocol $P_{o}$ can be seen as converting local input $\text{$ {}^{#} $}\!\!X$ into a global periodic signal with period $\Theta(\log\frac{n}{\text{$ {}^{#} $}\!\!X})$ . What remains is allowing nodes to extract information from this periodic signal.

Simulation timelines shown in Fig.9 in the Appendix illustrate the idea of operation of protocol $P_{o}$ and its composition with other protocols.

3.2 Protocols for BitBroadcast

A solution to BitBroadcast is obtained starting with an independent composition of two copies of oscillator $P_{o}$ , called $P_{o[1]}$ and $P_{o[2]}$ , with states in one protocol denoted by subscript $[1]$ and in the other by subscript $[2]$ . The respective sources are thus written as $X_{[1]}$ and $X_{[2]}$ . In view of Theorem 1, in this composition $P_{o[1]}+P_{o[2]}$ , under the promise of the BitBroadcast problem, one of the oscillators will be running and the other will stop in a corner of its state space. Which of the oscillators is running can be identified by the presence of states $A_{i}^{+}[z]$ , which will only appear for $z\in\{1,2\}$ corresponding to the operating oscillator. Moreover, by the same Theorem, every $O(\log n)$ rounds a constant number of agents of this oscillator will be in such a state $A_{i}^{+}[z]$ , for any choice of $i\in\{1,2,3\}$ . We can thus design the protocol extension $P_{b}$ to detect this. This is given by the pair of additional output states $\{Y_{1},Y_{2}\}$ and the rule extension consisting of the two rules shown in Fig. 2.

Theorem 2 (Protocol for BitBroadcast).

Protocol $(P_{o[1]}+P_{o[2]})+P_{b}$ , having $|K|=74$ states, including distinguished source states $X_{[1]}$ , $X_{[2]}$ converges to an exact solution of BitBroadcast. This occurs in $O(\log^{2}n)$ parallel rounds, with probability $1-O(1/n)$ . In the output encoding, agent states of the form $(\cdot,\cdot,Y_{z})$ represent answer “ $z$ ”, for $z\in\{1,2\}$ .

The protocol $(P_{o[1]}+P_{o[2]})+P_{b}$ is not “silent”, i.e., it undergoes perpetual transitions of state, even once the output has been decided. As a side remark, we note that for the single-source broadcasting problem, or more generally for the case when the number of sources is small, $\max\{\text{$ {}^{#} $}\!\!X_{[1]},\text{$ {}^{#} $}\!\!X_{[2]}\}=O(1)$ , we can propose the following simpler silent protocol. We define protocol $P_{o}^{\prime}$ , by modifying protocol $P_{o}$ as follows. We remove from it Rule (5), and replace it by to the four rules shown in Fig. 3. The analysis of the modified protocol follows from the same arguments as those used to prove Theorem 1(1). In the regime of $\max\{\text{$ {}^{#} $}\!\!X_{[1]},\text{$ {}^{#} $}\!\!X_{[2]}\}=O(1)$ , the effect of the source does not influence the convergence of the process and each of the three possible corner configurations, with exclusively species $\{A_{1},A_{2},A_{3}\}$ , is reached in $O(\log n)$ steps with constant probability. However, rules $(5a)-(5d)$ enforce that the only stable configuration which will persist is the one in a corner corresponding to the identity of the source, i.e., $A_{1}$ for source $X_{[1]}$ and $A_{2}$ for source $X_{[2]}$ ; the source will restart the oscillator in all other cases. We thus obtain the following side result, for which we leave out the details of the proof.

Observation 1.

Protocol $P_{o}^{\prime}$ , having $|K|=6+2=8$ states, including distinguished source states $X_{[1]}$ , $X_{[2]}$ converges to an exact solution of BitBroadcast, eventually stopping with all agents in state $A_{1}^{++}$ if source $X_{[1]}$ is present and stopping with all agents in state $A_{2}^{++}$ if source $X_{[2]}$ is present, with no subsequent state transition. The stabilization occurs within $O(\log^{2}n)$ parallel rounds, with probability $1-O(1/n)$ , if $\max\{\text{$ {}^{#} $}\!\!X_{[1]},\text{$ {}^{#} $}\!\!X_{[2]}\}=O(1)$ , i.e., the broadcast originates from a constant number of sources.

3.3 Protocol for Detect

The solution to problem Detect is more involved. It relies on two auxiliary extensions added on top of a single oscillator $P_{o}$ . The first, $P_{m}$ , runs an instance of the 3-state majority protocol of Angluin et al. [7] within each species $A_{i}$ of the oscillator. For this reason, the composition between $P_{o}$ and $P_{m}$ has to be of the form $P_{o}\circ P_{m}$ (i.e., it cannot be independent). The operation of this extension is shown in Fig. 5 and analyzed in Section 7.2. It relies crucially on an interplay of two parameters: the time $\Theta(\log\frac{n}{\text{$ {}^{#} $}\!\!X})$ taken by the oscillator to perform an orbit, and the time $\Omega(\log\frac{n}{\text{$ {}^{#} $}\!\!X})$ it takes for the majority protocol (which is reset by the oscillator in its every oscillation) to converge to a solution. When parameters are tuned so that the second time length is larger a constant number of times than the first, a constant proportion of the agents of the population are involved in the majority computation, i.e., both of the clashing states in the fight for dominance still include $\Omega(n)$ agents. In the absence of a source, shortly after the oscillator stops, one of these states takes over, and the other disappears.

The above-described difference can be detected by the second, much simpler, extension $P_{l}$ , designed in Fig. 5 and analyzed in Section 7.3. The number of “lights” switched on during the operation of the protocol will almost always be more than $(1-\varepsilon)n$ , where $\varepsilon>0$ is a parameter controlled by the probability of lights spontaneously disengaging, and may be set to and arbitrarily small constant.

Theorem 3 (Protocol for Detect).

For any $\varepsilon>0$ , protocol $(P_{o}\circ P_{m})+P_{l}$ , having $|K|=6\cdot 3\cdot 3+1=55$ states, including a distinguished source state $X$ , which solves the problem of spreading confirmed rumors as follows:

For any starting configuration, in the presence of the source $(\text{$ {}^{#} $}\!\!X\geq 1)$ , after an initialization period of $O(\log n)$ rounds, at an arbitrary time step the number of agents in an output state corresponding to a “yes” answer is $(1-\varepsilon)n$ , with probability $1-O(1/n)$ . 2. 2.

For any starting configuration, in the absence of the source $(\text{$ {}^{#} $}\!\!X=0)$ , the system always reaches a configuration such that all agents are in output states corresponding to a “no” answer for all subsequent time steps. Such a configuration is reached in $O(\log^{2}n)$ rounds, with probability $1-O(1/n)$ .

4 Impossibility Results for Protocols without Non-Stationary Effects

For convenience of notation, we identify a configuration of the population with a vector $z=(z^{(1)},\ldots,z^{(k)})\in\{0,1,\ldots,n\}^{k}=Z$ , where $z^{(i)}$ , for $1\leq i\leq k$ , denotes the number of agents in the population having state $i$ , and $\|z\|_{1}=n$ . Our main lower bound may now be stated as follows.

Theorem 4 (Fixed points preclude fast stabilization).

Let $\varepsilon_{1}>0$ be arbitrarily chosen, let $P$ be any $k$ -state protocol, and let $z_{0}$ be a configuration of the system with at most $n^{\varepsilon_{0}}$ agents in state $X$ , where $\varepsilon_{0}\in(0,\varepsilon_{1}]$ is a constant depending only on $k$ and $\varepsilon_{1}$ . Let $B$ be a subset of the state space around $z_{0}$ such that the population of each state within $B$ is within a factor of at most $n^{\varepsilon_{0}}$ from that in $z_{0}$ (for any $z\in B$ , for all states $i\in\{1,\ldots,k\}$ , we have $z_{0}^{(i)}/n^{\varepsilon_{0}}<z^{(i)}\leq n^{\varepsilon_{0}}\max\{1,z_{0}^{(i)}\}$ ).

Suppose that in an execution of $P$ starting from configuration $z_{0}$ , with probability $1-o(1)$ , the configurations of the system in the next $n^{2\varepsilon_{1}}$ parallel rounds are confined to $B$ .

Then, an execution of $P$ for $n^{2\varepsilon_{0}}$ parallel rounds, starting from a configuration in which state $X$ has been removed from $z_{0}$ , reaches a configuration in a $O(n^{6\varepsilon_{1}})$ -neighborhood of $B$ , with probability $1-o(1)$ .

In the statement of the Theorem, for the sake of maintaining the size of the population, we interpret “removing state $X$ from $z_{0}$ ” as replacing the state of all agents in state $X$ by some other state, chosen adversarially (in fact, this may be any state which has sufficiently many representatives in configuration $z_{0}$ ). The $O(n^{6\varepsilon_{1}})$ -neighborhood of $B$ is understood in the sense of the $1$ -norm or, asymptotically equivalently, the total variation distance, reflecting configurations which can be converted into a configuration from $B$ by flipping the states of $O(n^{6\varepsilon_{1}})$ agents.

The proof of Theorem 4 is provided in Section 8. It proceeds by a coupling argument between a process starting from $z_{0}$ and a perturbed process in which state $X$ has been removed. The analysis differently treats rules and states which are seldom encountered during the execution of the protocol from those that are encountered with polynomially higher probability (such a clear separation is only possible when $k=O(\mathrm{poly}\log\log n)$ ). Eventually, the probability of success of the coupling reduces to a two-dimensional biased random walk scenario, in which the coordinates represent differences between the number of times particular rules have been executed in the two coupled processes.

We have the following direct corollaries for the problems we are considering. For Detect, if $B$ represents the set of configurations of the considered protocol, which are understood as the protocol giving the answer “ $\text{$ {}^{#} $}\!\!X>0$ ”, then our theorem says that, with probability $1-o(1)$ , the vast majority of agents will not “notice” that $\text{$ {}^{#} $}\!\!X$ had been set to [math], even a polynomial number of steps after this has occurred, and thus cannot yield a correct solution. An essential element of the analysis is that it works only when state $X$ is removed in the perturbed process. Thus, there is nothing to prevent the dynamics from stabilizing even to a single point in the case of $X=0$ , which is indeed the case for our protocol $P_{r}$ . The argument for BitBroadcast only applies to situations where the source agent is sending out white noise (independently random bits in successive interactions). Such a source can be interpreted as a pair of sources in states $X_{1}$ and $X_{2}$ in the population, each disclosing itself with probability $1/2$ upon activation and staying silent otherwise. In the cases covered by the lower bound, the scenario in which the source $X_{1}$ is completely suppressed cannot be distinguished from the scenario in which both $X_{1}$ and $X_{2}$ appear; likewise, the scenario in which the source $X_{2}$ is completely suppressed cannot be distinguished from the scenario in which both $X_{1}$ and $X_{2}$ appear. By coupling all three processes, this would imply the indistinguishability of the all these configurations, including those with only source $X_{1}$ and only source $X_{2}$ , which would imply incorrect operation of the protocol.

Whereas we use the language of discrete dynamics for precise statements, we informally remark that the protocols covered by the lower bound of Theorem 4 include those whose dynamics $z_{t}/n$ , described in the continuous limit $(n\to+\infty)$ , has only point attractors, repellers, and fixed points. In this sense, the use of oscillatory dynamics in our protocol seems inevitable.

The impossibility result is stated in reference to protocols with a constant number of states, however, it may be extended to protocols with a non-constant number of states $k$ , showing that such protocols require $n^{\exp[-O(\mathrm{poly}(k))]}$ time to reach a desirable output. (This time is larger than polylogarithmic up to some threshold value $k=O(\mathrm{poly}\log\log n)$ .) The lower bound covers randomized protocols, including those in which rule probabilities depend on $n$ (i.e., non-universal ones).

5 Input-Controlled Behavior of Protocols for Detect

In this Section, we consider the periodicity of protocols for self-organizing oscillatory dynamics, in order to understand how the period of a phase clock must depend on the input parameters. We focus on the setting of the Detect problem, considering the value $\text{$ {}^{#} $}\!\!X$ of the input parameter. In Section 3.1, we noted informally that the designed oscillatory protocol performs a complete rotation around the triangle in $\Theta(\log n/\text{$ {}^{#} $}\!\!X)$ rounds. Here, we provide partial evidence that the periodicity of any oscillatory protocol depends both on the value of $\text{$ {}^{#} $}\!\!X$ and $n$ . We do this by bounding the portions of the configuration space in which a protocol solving Detect finds itself in most time steps, separating the cases of sub-polynomial $\text{$ {}^{#} $}\!\!X$ (i.e., $\text{$ {}^{#} $}\!\!X<n^{\varepsilon_{0}}$ , where $\varepsilon_{0}>0$ is a constant dependent on the specific protocol), and the case of $\text{$ {}^{#} $}\!\!X=\Theta(n)$ .

Any protocol on $k$ states (not necessarily of oscillatory nature) can be viewed as a Markov chain in its $k$ -dimensional configuration space $[0,n]^{k}$ , and as in Section 4 we identify a configuration with a vector $z\in\{0,1,\ldots,n\}^{k}=Z$ . The configuration at time step $t$ is denoted $z(t)$ . In what follows, we will look at the equivalent space of log-configurations, given by the bijection:

[TABLE]

where $\ln^{\circ}a=\ln a$ for $a>0$ and $\ln^{\circ}a=-1$ for $a=0$ .

For $z_{0}\in Z$ , we will refer to the $d$ -log-neighborhood of $z_{0}$ as the set of points $\{z\in Z:|\ln^{\circ}z-\ln^{\circ}z_{0}|<d\}$ .

Notice first that the notion of a box in the statement of Theorem 4 is closely related to the set of points in the $(\varepsilon_{0}\ln n)$ -log-neighborhood of configuration $z_{0}$ . It follows from the Theorem that any protocol for solving Detect within a polylogarithmic number of rounds $T$ with probability $1-o(1)$ must, in the case of $0<\text{$ {}^{#} $}\!\!X<n^{\varepsilon_{0}}$ , starting from $z_{0}$ at some time $t_{0}$ , leave the $(\varepsilon_{0}\ln n)$ -log-neighborhood of $z_{0}$ within $T$ rounds with probability $1-o(1)$ . We obtain the following corollary.

Proposition 1.

Fix a universal protocol $P$ which solves the Detect problem with $\varepsilon$ -error in $T=O(\mathrm{poly}\log n)$ rounds with probability $1-o(1)$ . Set $0<\text{$ {}^{#} $}\!\!X<n^{\varepsilon_{0}}$ , where $\varepsilon_{0}>0$ is a constant which depends only on the definition of protocol $\text{$ {}^{#} $}\!\!X$ . Let $t_{0}$ be an arbitrarily chosen moment of time after at least $T$ rounds from the initialization of the protocol in any initial state. Then, within $T$ rounds after time $t_{0}$ , there is a moment of time $t$ such that $z(t)$ is not in the $(\varepsilon_{0}\log n)$ -neighborhood of $z(t_{0})$ , with probability $1-o(1)$ . ∎

The above Proposition suggests that oscillatory or quasi-oscillatory behavior at low concentrations of state $X$ must be of length $\Omega(\log n)$ . By contrast, the following Proposition shows that in the case $\text{$ {}^{#} $}\!\!X=\Theta(n)$ , the protocol remains tied to a constant-size log-neighborhood of its configuration space.

Proposition 2.

Fix a universal protocol $P$ with set of states $K$ which solves the Detect problem with $\varepsilon$ -error in $T=O(\mathrm{poly}\log n)$ rounds with probability $1-o(1)$ . Then, there exists a constant $\delta_{0}>0$ , depending only on the design of protocol $P$ , with the following property. Fix $\text{$ {}^{#} $}\!\!X\in[cn,n/2]$ , where $0<c<1/2$ is an arbitrarily chosen constant. Let $t$ be an arbitrarily chosen moment of time, after at least $T$ rounds from the initialization of the protocol at an adversarially chosen initial configuration $z(0)$ , such that each coordinate $z^{(i)}(0)$ satisfies $z^{(i)}(0)=0$ or $z^{(i)}(0)>1/(2|K|)$ , for all $i\in\{1,\ldots,|K|\}$ . Then, with probability $1-e^{-n^{\Omega(1)}}$ , $z(t)$ is in the $\delta_{0}$ -neighborhood of $z(0)$ .

The proof of the Proposition is deferred to Section 9.

Note that, in the regime of a constant-size log-neighborhood of configuration $z(0)$ , the discrete dynamics of the protocol adheres closely to the continuous-time version of its dynamics in the limit $n\to+\infty$ . (See Section 6, and in particular Lemma 1, for a further discussion of this property). Since the latter is independent of $n$ , any oscillatory behavior “inherited” from the continuous dynamic would have a period of $O(1)$ rounds. We leave as open the question whether some form of behavior of a protocol with polylogarithmic (i.e., or more broadly, non-constant and subpolynomial) periodicity for Detect can be designed in the regime of $\text{$ {}^{#} $}\!\!X=\Theta(n)$ despite this obstacle. In particular, the authors believe that the existence of an input-controlled phase clock with a period of $\Theta(\log n)$ for any $\text{$ {}^{#} $}\!\!X>0$ , and the absence of operation for $\text{$ {}^{#} $}\!\!X=0$ , is unlikely in the class of discrete dynamical systems given by the rules of population protocols.

The remaining sections of the paper provide proofs of the Theorems from Sections 3, 4, and 5.

6 Analysis of Oscillator Dynamics $P_{o}$

This section is devoted to the proof of Theorem 1.

6.1 Preliminaries: Discrete vs. Continuous Dynamics

Notation.

For a configuration of a population protocol, we write $z=(z^{(1)},\ldots,z^{(k)})$ to describe the number of agents in the $k$ states of the protocol, and likewise use vector $u=(u^{(1)},\ldots,u^{(k)})=z/n$ to describe their concentrations. The concentration of a state called $A$ which is the $i_{A}$ -th state in vector $u$ is equivalently written as $a\equiv a(u)\equiv u^{(i_{A})}$ , depending on which notation is the easiest to use in a given transformation.

If vector $u$ represents the current configuration of the protocol and $u^{\prime}:=u^{\prime}|u$ is the random variable describing the next configuration of the protocol after the execution of a single rule, we write $\Delta u:=u^{\prime}-u$ . We also use the notation $\Delta$ to functions of state $u$ .

Next, we define the continuous dynamics associated with the protocol by the following vector differential equation:

[TABLE]

and likewise, for each coordinate, $\dot{a}=n\mathbb{E}(\Delta a)$ (we use the dot-notation and $d/dt$ interchangeably for time differentials). This continuous description serves for the analysis only, and reflects the behavior of the protocol in the limit $n\to\infty$ .555We note that some of our results rely on the stochasticity of the random scheduler model, and do not immediately generalize to the continuous case.

Warmup: the RPS oscillator.

Our oscillatory dynamics may be seen as an extension of the rock-paper-scissors (RPS) protocol (see Related work). This is a protocol with three states $A_{1}$ , $A_{2}$ , $A_{3}$ and three rules:

[TABLE]

where $p>0$ is an arbitrarily fixed constant, and the indices of states $A_{i}$ are always $1,2,$ or $3$ , and any other values should be treated as $\,\mathrm{mod}\,3$ in the given range. For $i\in\{1,2,3\}$ , the change of concentration of agents of state $A_{i}$ in the population in the given step can be expressed for the RPS protocol as:

[TABLE]

Thus, the corresponding continuous dynamics for RPS is given as:

[TABLE]

for $i=1,2,3$ . The orbit of motion for this dynamics in $\mathbb{R}^{3}$ is given by two constants of motion. First, $a_{1}+a_{2}+a_{3}=1$ by normalization. Secondly, for any starting configuration with a strictly positive number of agents in each of the three states, the following function $\phi$ of the configuration:

[TABLE]

is easily verified to be constant over time $\dot{\phi}=0$ , hence $\phi=\ln(a_{1}a_{2}a_{3})=\mathrm{const}<0$ (or more simply, $a_{1}a_{2}a_{3}=\mathrm{const}$ ). Thus, for the continuous dynamics, the initial product of concentrations completely determines its perpetual orbit, which is obtained by intersecting the appropriate curve $a_{1}a_{2}a_{3}=\mathrm{const}$ with the plane $a_{1}+a_{2}+a_{3}=1$ . As a matter of convention, the plane $a_{1}+a_{2}+a_{3}=1$ with conditions $a_{i}\geq 0$ is drawn as an equilateral triangle (we adopt this convention throughout the paper, for subsequent protocols). All of the orbits are concentric around the point $(1/3,1/3,1/3)$ , which is in itself a point orbit maximizing the value of $\phi=-\ln 27$ . The discrete dynamics follows a path of motion which typically resembles random-walk-type perturbations around the path of motion, until eventually, after $\tilde{O}(n^{2})$ steps it crashes into one of the sides of the triangle. Subsequently, if $a_{i}=0$ , for some $i=1,2,3$ , then no rule can make $a_{i}$ increase. (If $a_{i-1}>0$ , in the next $O(\log n)$ steps, all remaining agents of $A_{i+1}$ will convert to $A_{i-1}$ , and there will be only agents from $A_{i-1}$ left.) Thus, the protocol will terminate in a corner of the state space.

A further discussion of the RPS dynamics can be found in [28, 15].

6.2 Proof Outline of Theorem 1

The rest of the section is devoted to the proof of Theorem 1. We start by noting some basic properties in Subsection 6.3, then prove the properties of the protocol for the case of $X=0$ (Subsection 6.4, and finally analyze (the somewhat less involved) case of $X>0$ (Subsection 6.5). For the case of $X=0$ , the proof is based on a repeated application of concentration inequalities for several potential functions (applicable in different portions of the $6$ -dimensional phase space). In two specific regions, in the $O(1/\sqrt{n})$ -neighborhood of the center of the $(A_{1},A_{2},A_{3})$ -triangle and very close to its sides, we rely on stochastic noise to “push” the trajectory away from the center of the triangle, and also to push it onto one of its sides. Fortunately, each of these stages takes $O(\log n)$ parallel rounds, with strictly positive probability. Overall, the $O(\log n)$ parallel rounds bound for the case of $X=0$ is provided with constant probability; this translates into $O(\log n)$ parallel rounds in expectation, since subsequent executions of the process for $O(\log n)$ rounds have independently constant success probability, and the process has a geometrically decreasing tail over intervals of length $O(\log n)$ .

6.3 Properties of the Oscillator

In the following, we define $s=a_{1}+a_{2}+a_{3}\in[0,1]$ . Handling the case of $s<1$ allows us not only to take care of the fact that a fraction of the population may be taken up by rumor source $X$ , but also allows for easier composition of $P_{o}$ with other protocols (sharing the same population). We set $p$ as a constant value independent of $n$ , which is sufficiently small (e.g., $p=s^{2}/10^{12}$ is a valid choice; we make no efforts in the proofs to optimize constants, but the protocol appears in simulations to work well with much larger values of $p$ ).

We will occasionally omit an explanation of index $i$ , which will then implicitly mean “for all $i=1,2,3$ ”. We define $a_{\min}:=\min_{i=1,2,3}a_{i}$ and $a_{\max}:=\max_{i=1,2,3}a_{i}$ .

From the definition of the protocol one obtains the distribution of changes of the sizes of states in a step:

[TABLE]

Taking the expectations of the above random variables, and recalling that $a_{i}=a_{i}^{+}+a_{i}^{++}$ , we obtain:

[TABLE]

6.4 Stopping in $O(n\log n)$ Sequential Steps in the Absence of a Source

Throughout this subsection we assume that $x=0$ . We consider first the case where $a_{i}\neq 0$ , for $i=1,2,3$ (noting that as soon as $a_{i}=0$ , we can easily predict the subsequent behavior of the oscillator, as was the case for the RPS dynamics).

The dynamics of $P_{o}$ is defined in such a way that that when $x=0$ and in the absence of the rules of the RPS oscillator, the value of $a_{i}^{++}$ would be close to $\frac{a_{i}^{2}}{s}$ . Consequently, we define $\kappa_{i}$ , $i=1,2,3$ as the appropriate normalized corrective factor:

[TABLE]

Note that as $0\leq a_{i}^{++}\leq a_{i}\leq 1$ , thus $-1\leq\kappa_{i}\leq 1$ . Next, we introduce the following definitions:

[TABLE]

We also reuse potential $\phi$ from the original RPS oscillator. This time, it is no longer a constant of motion. By (5) and the definition of $\kappa_{i}$ , for $x=0$ we upper-bound $\dot{\phi}$ as:

[TABLE]

The above change $\dot{\phi}$ of the potential is indeed negative when $\kappa\approx 0$ (which is in accordance with our intention in designing the destabilizing rules for the oscillator).

The functions $\delta$ , $\phi$ and $\kappa$ are intricately dependent on each other. In general, we will try to show that $\delta$ and $\phi$ increase over time, while $\kappa$ stays close to [math]. This requires that we first introduce a number of auxiliary potentials based on these two functions.

First, for $x=0$ , we can rewrite (4) as:

[TABLE]

Next, introducing the definition of $\kappa_{i}$ to (3), we obtain for $x=0$ :

[TABLE]

From the above, an upper bound on $|\dot{a_{i}}|$ follows directly using elementary transformations:

[TABLE]

We are now ready to estimate $\dot{\kappa}_{i}$ for $x=0$ , using the definition of $\kappa_{i}$ and the previously obtained formula for $\dot{a}_{i}^{++}$ :

[TABLE]

Next from the bound on $|\dot{a_{i}}|$ :

[TABLE]

where in the final transformation we took into account that $p\leq s/18$ .

Now, we define the potential $\eta$ for any configuration with all $a_{i}>0$ as:

[TABLE]

We remark that $\eta$ is always well-defined when $a_{\min}>0$ , and that $\eta\geq 0$ .

Overview of the proof.

The proof for the case of $X=0$ proceeds by following the trajectory of the discrete dynamics of $P_{o}$ , divided into a number of stages. We define a series of time steps $t_{0},t_{1},\ldots,t_{7}$ by conditions on the configuration met at time $t_{i}$ , and show that subject to these conditions holding, we have $t_{j+1}\leq t_{j}+O(n\log n)$ (we recall that here time is measured in steps), with at least constant probability. Overall, it follows that the configuration at time $t_{7}$ , which corresponds to having reached a corner state, is reached from $t_{0}$ , which is any initial configuration with $X=0$ , in $O(n\log n)$ time steps, with constant probability.

The intermediate time steps may be schematically described as follows (see Fig. 6). For configurations which start close to the center of the triangle ( $\delta\leq s/12$ ), we define a pair of potentials $\psi^{(1)}$ , $\psi^{(2)}$ , based on a linear combination of modified versions of $\eta$ and $\kappa$ . The dynamics will eventually escape from the area $\delta\leq s/12$ ; however, first it may potentially reach a very small area of radius $O(1/\sqrt{n})$ around the center of the triangle with $\kappa\approx 0$ (Lemma 3, time $t_{1}$ , reached in $O(n\log n)$ steps by a multiplicative drift analysis on potential $\psi^{(2)}<0$ ), pass through the vicinity of center of the triangle, escaping it with $\kappa\approx 0$ (Lemma 4, time $t_{2}$ , reached in $O(n\log n)$ steps with constant probability by a protocol-specific analysis of the scheduler noise, which with constant probability increases $\eta$ without increasing $\kappa$ too much), and eventually escapes completely to the area of $\delta>s/12$ (Lemma 5, exponentially increasing value of potential $\psi^{(1)}>0$ ).

In the area of $\delta>s/12$ , we define a new potential $\psi$ based on $\phi$ and $\kappa$ . This increases (Lemma 8, additive drift analysis on $\psi$ with bounded variance) until a configuration at time $t_{4}$ with a constant number of agents of some species $A_{i}$ is reached. This configuration then evolves towards a configuration at time $t_{5}$ at which some species has $O(1)$ agents, and additionally its predator species is a constant part of the population (Lemma 9, direct analysis of the process combined with analysis of potential $\psi$ and a geometric drift argument). Then, the species with $O(1)$ agents is eliminated in $O(n)$ steps with constant probability ( $t_{6}$ , Lemma 10), and finally one more species is eliminated in another $O(n\log n)$ steps (at time $t_{7}$ , Lemma 11, straightforward analysis of the dynamics). At this point, the dynamics has reached a corner.

Throughout the proof, we make sure to define boundary conditions on the analyzed cases to make sure that the process does not fall back to a previously considered case with probability $1-o(1)$ .

Phase with $\delta\leq s/12$ .

We then have $a_{i}\in[3s/12,5s/12]$ and $\frac{a_{i}}{s/3}\in[3/4,5/4]$ , for $i=1,2,3$ . In this range, we have:

[TABLE]

Summing the above inequalities for $i=1,2,3$ and noting that $\sum_{i=1}^{3}(\frac{a_{i}}{s/3}-1)=0$ , we obtain:

[TABLE]

Next, we have:

[TABLE]

Combining the two above expressions gives the sought bound between $\eta$ and $\delta$ as:

[TABLE]

and equivalently

[TABLE]

We have directly from (6) and from the relations between $\eta$ and $\delta$ :

[TABLE]

and from (7):

[TABLE]

Moving to the discrete-time model, it is advantageous to eliminate the discontinuity of partial derivatives of $\eta$ and $\kappa$ at points with $\eta=0$ and $\kappa=0$ respectively, which is a side-effect of the applied square root transformation in the respective definitions of $\eta$ and $\kappa$ . We define the auxiliary functions $\eta^{*}$ and $\kappa^{*}$ by adding an appropriate corrective factor:

[TABLE]

and derive accordingly from (8) and (9):

[TABLE]

Let $u$ be the $5$ -dimensional vector representing the current configuration of the system: $u:=(a_{1}^{+},a_{1}^{++},a_{2}^{+},\allowbreak a_{2}^{++},a_{3}^{+})\equiv(u^{(1)},\ldots,u^{(5)})$ ; note that the last element $a_{3}^{++}$ is determined as $a_{3}^{++}=s-\sum_{i=1}^{5}u^{(i)}$ .666In principle it is also correct to represent $u$ as a vector of dimension $6$ , i.e., including $a_{3}^{++}$ in $u$ as a free dimension. However, such a representation would lead to second-order partial derivatives $\frac{\partial^{2}}{\partial u^{(i)}\partial u^{(j)}}\eta^{*}(u)$ which are too large for our purposes. The following lemma is obtained by a folklore application of Taylor’s theorem.

Lemma 1.

Let $f:\mathbb{R}^{5}\to\mathbb{R}$ be a $C^{2}$ function in a sufficiently large neighborhood of $u$ , with $\min_{1\leq i\leq 5}u^{(i)}\geq 2/n$ . Then, $|\mathbb{E}\Delta f(u)-\frac{\dot{f}}{n}|\leq\frac{2}{n^{2}}\max_{\|u^{*}-u\|_{\infty}\leq 1/n}D_{f}(u^{*})$ , where $D_{f}(u^{*}):=\max_{1\leq i,j\leq 5}\left|\frac{\partial f^{2}(u^{*})}{\partial u^{(i)}\partial u^{(j)}}\right|$ .

Proof.

Let $u^{\prime}$ be the random variable representing the configuration of the system after its next transition from configuration $u$ . Observe that in every non-idle step of execution of the protocol, exactly one agent changes its state, so $\|u^{\prime}-u\|_{\infty}\leq 1/n$ .

Applying Taylor approximation we have:

[TABLE]

where $\nabla f(u)$ is the gradient of $f$ at $u$ , $R_{2}(u,u^{\prime})\in\mathbb{R}$ denotes the second-order Taylor remainder for function $f$ expanded at point $u$ along the vector towards point $u^{\prime}$ , and $R_{2}(u)\in\mathbb{R}$ is subsequently an appropriately chosen value, satisfying:

[TABLE]

∎

The following lemma is obtained directly by computing and bounding all second order partial derivatives of functions $\eta^{*}$ and $\kappa^{*}$ with respect to variables $(u^{(1)},\ldots,u^{(5)})$ .

Lemma 2.

There exists a constant $c_{1}>1$ depending only on $s,p$ , such that, for any configuration $u$ with $\delta(u)\leq s/12$ :

•

$\max_{\|u^{*}-u\|_{\infty}\leq 1/n}D_{\eta^{*}}(u^{*})<c_{1}\sqrt{n}$ ,

•

$\max_{\|u^{*}-u\|_{\infty}\leq 1/n}D_{\kappa^{*}}(u^{*})<c_{1}\sqrt{n}$ .

∎

In view of the above lemmas, we obtain from (8) and (9), for an appropriately chosen constant $c_{2}=2c_{1}+s$ :

[TABLE]

For $j=1,2$ , we now define two linear combinations of functions $\eta^{*}$ and $\kappa^{*}$ :

[TABLE]

When $\delta\leq s/12$ , we have:

[TABLE]

where we denoted $c_{3}:=\frac{48c_{2}}{ps}$ and used the fact that $p<\frac{s}{72\cdot 54\cdot 2}$ .

We subsequently perform an analysis of $\psi^{(j)}_{t}=\psi^{(j)}(u_{t})$ , $j=1,2$ , treating them as stochastic processes. We remark that $\psi^{(2)}_{t}\leq\psi^{(1)}_{t}$ , since $\psi^{(1)}_{t}-\psi^{(2)}_{t}=\frac{3p}{s}\kappa^{*}\geq 0$ .

Lemma 3.

Let $u_{t_{0}}$ be an arbitrary starting configuration of the system. Then, with constant probability, for some $t_{1}=t_{0}+O(n\log n)$ , a configuration $u_{t_{1}}$ is reached such that $\psi^{(1)}_{t_{1}}\geq\psi^{(2)}_{t_{1}}\geq-\frac{2c_{3}}{\sqrt{n}}$ .

Proof.

W.l.o.g. assume $t_{0}=0$ . We subsequently only analyze process $\psi^{(2)}_{t}$ . Let $t_{1}$ be the first time step such that $\psi^{(2)}_{t_{1}}>-\frac{2c_{3}}{\sqrt{n}}$ . If $t_{1}\neq 0$ , then $\psi^{(2)}_{0}<0$ . Note that then $\psi^{(2)}_{t}<0$ for all $t\leq t_{1}$ , from which it follows by a straightforward calculation from the definition of $\psi$ , $\kappa$ , and $\eta$ , that $\delta_{t}<\frac{s}{12}$ for all $t\leq t_{1}$ .

We now define the filtered stochastic process $\psi^{*(2)}_{t}$ as $\psi^{*(2)}_{t}:=|\psi^{(2)}_{t}|$ for $t<t_{1}$ , and put $\Delta\psi^{*(2)}_{t}:=0$ for $t\geq t_{1}$ . For all $t\geq 0$ , we then have:

[TABLE]

Since $0\leq\psi^{*(2)}_{t}<9$ for all time steps, a direct application of multiplicative drift analysis (cf. [19]) gives:

[TABLE]

and the claim follows by Markov’s inequality. ∎

Lemma 4.

Let $u_{t_{1}}$ be an arbitrary starting configuration of the system such that $\psi^{(j)}_{t_{1}}\in[-\frac{2c_{3}}{\sqrt{n}},\frac{4c_{3}}{\sqrt{n}}]$ , for $j=1,2$ . Then, with constant probability, for some $t_{2}=t_{1}+O(n)$ , a configuration $u_{t_{2}}$ is reached such that $\psi^{(1)}_{t_{2}}\geq\frac{4c_{3}}{\sqrt{n}}$ .

Proof.

W.l.o.g. assume that $t_{1}=0$ and suppose that initially $\psi^{(2)}_{0}\leq\psi^{(1)}_{0}<\frac{4c_{3}}{\sqrt{n}}$ (i.e., that $t_{2}\neq t_{1}$ ). Then, from the lower and upper bounds on $\psi^{(1)}_{0}$ and $\psi^{(2)}_{0}$ we obtain the following bounds on $\kappa_{0}$ and $\delta_{0}$ :

[TABLE]

It follows that, for $i=1,2,3$ , $a_{i,0}\in[\frac{s}{3}-\frac{10c_{3}s}{\sqrt{n}},\frac{s}{3}+\frac{10c_{3}s}{\sqrt{n}}]$ and $a_{i,0}^{++}=\frac{a_{i,0}}{s}(a_{i,0}+\kappa_{i,0})\in[(\frac{1}{3}-\frac{10c_{3}}{\sqrt{n}})(\frac{s}{3}-\frac{10c_{3}s}{\sqrt{n}}-\frac{2c_{3}s}{p\sqrt{n}}),(\frac{1}{3}+\frac{10c_{3}}{\sqrt{n}})(\frac{s}{3}+\frac{10c_{3}s}{\sqrt{n}}+\frac{2c_{3}s}{p\sqrt{n}})]$ . For the sake of clarity of notation, we will simply write $a_{i,0}=s/3\pm O(1/\sqrt{n})$ and $a_{i,0}^{++}=s/9\pm O(1/\sqrt{n})$ , hence also $a_{i,0}^{+}=2s/9\pm O(1/\sqrt{n})$ .

We will consider now the sequence of exactly $n$ transitions of the protocol, between time steps $t=0,1,\ldots,n$ .

For all $t$ we have $\mathbb{E}\Delta\psi^{(2)}_{t}\geq-\frac{c_{3}ps}{24n^{3/2}}$ . Consider the Doob submartingale $Y_{t}=\sum_{\tau=0}^{t-1}X_{t}$ with increments $(X_{t})$ given as:

[TABLE]

Noting that $|X_{t}|\leq\frac{9}{n}$ , an application of the Azuma inequality for submartingales to $(Y_{n})$ gives: $\Pr[Y_{n}\leq-\frac{c_{3}}{\sqrt{n}}]\leq\exp{[-c_{3}^{2}/162]}$ (cf. e.g. [13][Thm. 16]). From here it follows directly that:

[TABLE]

Noting that $\psi^{(2)}_{0}\geq-\frac{2c_{3}}{\sqrt{n}}$ , we have:

[TABLE]

We now describe the execution of transitions in the protocol for times $t=0,1,\ldots,n-1$ through the following coupling. First, we select the sequence of pairs of agents chosen by the scheduler. Let $V_{2}^{+}$ (respectively, $V_{1}^{+}$ ) denote the subsets of the set of $n$ agents, having initial states $A_{2}^{+}$ (resp., $A_{1}^{+}$ ) at time [math], respectively, which are involved in exactly one transition in the considered time interval, acting in it as the initiator (resp., receiver). Let $S\subseteq\{0,1,\ldots,n-1\}$ denotes the subset of time steps at which the scheduler activates a transition involving an element of $V_{2}^{+}$ as the initiator and an element of $V_{1}^{+}$ as the receiver. The execution of the protocol is now given by:

•

Phase $P_{A}$ : Selecting the sequence of pairs of elements activated by the scheduler in time steps $(0,1,\ldots,n-1)$ . This also defines set $S$ . Executing the rules of the protocol in their usual order for time steps from set $\{0,1,\ldots,n-1\}\setminus S$ .

•

Phase $P_{B}$ : Executing the rules of the protocol for time steps from set $S$ .

Observe that since elements of pairs activated in time steps from $S$ are activated only once throughout the $n$ steps of the protocol, the above probabilistic coupling does not affect the distribution of outcomes.

Directly from (13), we obtain through a standard bound on conditional probabilities that at least a constant fraction of choices made in phase $P_{A}$ leads to an outcome “ $\psi^{(2)}_{n}\geq-\frac{2c_{3}}{\sqrt{n}}$ ” with at least constant probability during phase $P_{B}$ :

[TABLE]

We now remark on the size of set $S$ . The distribution of $|S|$ depends only on $a_{2,0}^{+}$ , $a_{1,0}^{+}$ , and the choices made by the random scheduler. We recall that $a_{2,0}^{+}=2s/9\pm O(1/\sqrt{n})$ . Since the expected number of isolated edges in a random multigraph on $n$ nodes (representing the set of agents) and $n$ edges (representing the set of time steps) is $(1\pm o(1))e^{-4}n$ , the number of such edges having the first endpoint in an agent in state $A_{2}^{+}$ and the second endpoint in an agent in state $A_{1}^{+}$ is $(1\pm o(1))\frac{4e^{-4}s^{2}}{81}n$ . A straightforward concentration analysis (using, e.g., the asymptotic correspondence between $G(n,m)$ and $G(n,p)$ random graph models and an application of Azuma’s inequality for functions of independent random variables) shows that the bound $|S|=(1\pm o(1))\frac{4e^{-4}s^{2}}{81}n$ holds with very high probability. In particular, we have:

[TABLE]

for some choice of constant $c_{4}$ which depends only on $s$ .

Relations (14) and (15) provide all the necessary information about phase $P_{A}$ that we need. Subsequently, we will only analyse phase $P_{B}$ , conditioning on a fixed execution of phase $P_{A}$ such that the following event $F_{A}$ holds:

[TABLE]

We remark that, by a union bound over (14) and (15), $\Pr[F_{A}]\geq 1/3-e^{\Omega(-n)}>1/4$ .

In the remainder of our proof, our objective will be to show that:

[TABLE]

for some constant $c_{5}>0$ depending only on $s,p$ , for any choice of $P_{A}$ for which event $F_{A}$ holds. When this is shown, the claim of the lemma will follow directly, with a probability value given as at least $c_{5}\Pr[F_{A}]>c_{5}/4$ by the law of total probability.

We now proceed to analyze the random choices made during phase $P_{B}$ . Each of the considered $|S|$ interactions involves a pair of agents of the form $(A_{2}^{+},A_{1}^{+})$ , and describes the following transition:

[TABLE]

independently at random for each transition. The only state changes observed during this phase are from $A_{1}^{+}$ to $A_{2}^{+}$ , and we denote by $B$ the number of such state changes. The value of random variable $B$ completely describes the outcome of phase $P_{B}$ .

We have $\mathbb{E}B=p|S|$ , and by a standard additive Chernoff bound:

[TABLE]

Let $\mathcal{B}\subseteq[p|S|-2\sqrt{n},p|S|+2\sqrt{n}]$ be the subset of the considered interval containing values of $B$ such that $\left(\psi^{(1)}_{n}\big{|}P_{A},B\in\mathcal{B}\right)\geq\frac{4c_{3}}{\sqrt{n}}$ . If $\Pr[B\in\mathcal{B}|P_{A}]\geq 1/8$ , then the claim follows directly.

Otherwise, it follows from (16) and (17) that there must exist a value $b\in[p|S|-2\sqrt{n},p|S|+2\sqrt{n}]\setminus\mathcal{B}$ , such that:

[TABLE]

Given that:

[TABLE]

and recalling that $\psi^{(2)}_{n}\leq\psi^{(1)}_{n}$ , we obtain the following bound on $\eta_{n}$ :

[TABLE]

We now consider lower bounds on the value of $\psi^{(2)}_{n}$ , conditioned on $P_{A},B=b^{+}$ (respectively, $P_{A},B=b^{-}$ ), where $b^{+}$ (resp., $b^{-}$ ) is a value arbitrarily fixed in the range $b^{+}\in[b+\frac{20c_{3}s}{\sqrt{n}},b+\frac{21c_{3}s}{\sqrt{n}}]$ (resp., $b^{-}\in[b-\frac{21c_{3}s}{\sqrt{n}},b-\frac{20c_{3}s}{\sqrt{n}}]$ ). The executions of the protocol with $B=b^{+}$ and $B=b^{-}$ differ with respect to the execution with $B=b$ in the number of executed transitions from $a_{1}^{+}$ to $a_{2}^{+}$ by at least $\frac{20c_{3}s}{\sqrt{n}}$ . Recalling that $\delta_{2}=a_{2}-a_{1}$ , it follows that for some $b^{\prime}\in\{b^{+},b^{-}\}$ we have after $n$ steps:

[TABLE]

Subsequently, we will assume that $b^{\prime}=b^{+}$ ; the case of $b^{\prime}=b^{-}$ is handled analogously. From the relation $\eta>\delta/s$ and (18) we have:

[TABLE]

When comparing the value of $\kappa^{*}_{n}$ in the two cases, $B=b^{+}$ and $B=b$ , it is convenient to consider $\kappa^{*}$ as the length of the vector $(\kappa_{1},\kappa_{2},\kappa_{3},1/\sqrt{n})$ in Euclidean space. For each of the coordinates $\kappa_{i}$ , $i=1,2,3$ , we have:

[TABLE]

hence:

[TABLE]

Introducing (19) and (20) into the definition of $\psi^{(1)}_{n}$ , we obtain directly:

[TABLE]

where we again used the fact that $p$ is a sufficiently small constant w.r.t. $s$ . We thus obtain:

[TABLE]

where by the definition of random variable $B$ as a sum of i.i.d. binary random variables and the choice of value $b$ in the direct vicinity of the expectation of $B$ , the event $B\in[b+\frac{20c_{3}s}{\sqrt{n}},b+\frac{21c_{3}s}{\sqrt{n}}]$ holds with constant probability. The case of $b^{\prime}=b^{-}$ is handled analogously. ∎

Lemma 5.

Let $u_{t_{2}}$ be an arbitrary starting configuration of the system such that $\max\{\psi^{(1)}_{t_{2}},\psi^{(2)}_{t_{2}}\}=\psi^{(1)}_{t_{2}}\geq\frac{4c_{3}}{\sqrt{n}}$ . Then, with constant probability, for some $t_{3}=t_{2}+O(n\log n)$ , a configuration $u_{t_{3}}$ is reached such that $\delta_{t_{3}}>s/12$ .

Proof.

We subsequently consider only the process $\psi^{(1)}_{t}$ . We start by showing the following claim.

Claim. Suppose $\psi^{(1)}_{0}=A\geq\frac{4c_{3}}{\sqrt{n}}$ . Then, with probability at least $1-\exp{[-A^{2}psn/46656]}$ , for some time step $t\leq\frac{72n}{ps}$ the process reaches a value $\psi^{(1)}_{t}\geq 2A$ , or $\delta_{t}>s/12$ .

Proof (of claim). Consider the Doob submartingale $Y_{t}=\sum_{\tau=0}^{t-1}X_{t}$ with increments $(X_{t})$ given as:

[TABLE]

Noting that $|X_{t}|\leq\frac{9}{n}$ , an application of the Azuma inequality for submartingales (cf. e.g. [13][Thm. 16]) to $(Y_{T})$ with $T=\frac{72n}{ps}$ gives:

[TABLE]

Moreover, assuming the barrier $\delta_{t}>s/12$ was not reached, we have:

[TABLE]

which completes the proof of the claim.

We now prove the lemma by iteratively applying the claim over successive intervals of time $(\tau_{0},\tau_{1},\ldots)$ , such that $\tau_{0}=t_{2}$ and $\tau_{i+1}$ is the first time step not before $\tau_{i}$ such that $\psi^{(1)}_{\tau_{i+1}}\geq 2\psi^{(1)}_{\tau_{i}}$ or $\delta_{\tau_{i+1}}\geq s/12$ . By the claim, we have:

[TABLE]

Noting that $c_{3}>48/(ps)$ by definition, and that before the barrier $\delta>s/12$ is reached, we have $\psi^{(1)}_{\tau_{i}}\geq\frac{4c_{3}}{\sqrt{n}}2^{i}\geq\frac{192}{ps\sqrt{n}}2^{i}$ , we obtain:

[TABLE]

and further:

[TABLE]

In particular, putting $i=\log_{2}n$ , $\Pr\left[\tau_{i}\leq\frac{72n\log_{2}n}{ps}\right]>0.98$ . Since for this value of $i$ , we must have $\delta_{\tau_{i}}\geq s/12$ (since otherwise we would have $\psi^{(1)}_{\tau_{i}}=\omega(1)$ , which is impossible), the claim of the lemma follows. ∎

Phase with $\delta>s/12$ .

The second phase of convergence corresponds to configurations of the system which are sufficiently far from the center point $(a_{1},a_{2},a_{3})=(s/3,s/3,s/3)$ . Formally, we analyze a variant of potential $\phi$ (with an additive corrective factor proportional to $\kappa^{2}$ ) to show that, starting from a configuration with $\delta>s/12$ , we will eliminate one of the three populations $a_{1},a_{2},a_{3}$ in $O(n\log n)$ sequential steps with constant probability, without approaching the center point too closely (a value of $\delta=\Omega(1)$ will be maintained throughout).

For this part of the analysis, we define the considered potential as:

[TABLE]

for any configuration $u$ with $a_{\min}>0$ .

We have directly from (6) and (7):

[TABLE]

where in the last transformation we took into account that $p\leq s/144$ .

For the sake of technical precision in formulating the subsequent lemmas, we also consider the stochastic process $\psi^{*}_{t}$ , given as $\psi^{*}_{t}=\psi(u_{t})$ for any $t<t_{d}$ , where $t_{d}$ is defined as the first time in the evolution of the system such that a configuration with $a_{\min,t_{d}}<c_{6}/n$ is reached, where $c_{6}=313600/s$ is a constant depending only on $s$ . For all $t\geq t_{d}$ , we define $\psi^{*}_{t}:=\psi^{*}_{t-1}+\frac{1}{n}$ .

Lemma 6.

*In any configuration $u_{t}$ with $\delta\geq s/20$ we have: $\mathbb{E}\Delta\psi_{t}^{*}\geq\frac{1}{8}\frac{p\delta^{2}}{sn}>\frac{ps}{3600n}$ . *

Proof.

We have:

[TABLE]

Following the definition of $\phi$ in Eq. (2), we have by linearity of expectation:

[TABLE]

Next, using the bound $\ln(1+b)\leq b$ which holds for $b>-1$ , we have:

[TABLE]

from which it follows that:

[TABLE]

To analyze $\mathbb{E}\Delta(\kappa^{2})$ , we apply a variant of Lemma 1. A direct application of the lemma is not sufficient due to the singularity related to the $a_{i}^{-1}$ term in the definition of $\kappa_{i}$ ; however, this effect is compensated when we take into account that any change of the value of $\kappa_{i}^{2}$ occurs in the considered protocol with probability at most proportional to $a_{i}$ . For the specific case of $\kappa_{i}^{2}$ , for fixed $i=1,2,3$ , we consider $\kappa_{i}^{2}:\mathbb{R}^{2}\to\mathbb{R}$ as a function of the restricted configuration $\bar{u}=(a_{i}^{+},a_{i}^{++})$ , and we rewrite expression (12) as:

[TABLE]

A straightforward computation from the definition of function $\kappa_{i}$ shows that:

[TABLE]

It follows that

[TABLE]

and so:

[TABLE]

Introducing (24) and (25) into (23), we obtain:

[TABLE]

where in the second-to-last transformation we used (22), and in the last transformation we used the relation $\frac{392s}{a_{\min}n}\leq\frac{\delta^{2}}{2}$ which holds when $\delta\geq s/20$ and $a_{\min}\geq c_{6}/n$ .

The claim thus follows when $\psi^{*}_{t}=\psi_{t}$ and $\psi^{*}_{t+1}=\psi_{t+1}$ , i.e., for $t<t_{d}$ . For larger values of $t$ , the claim follows trivially from the definition of $\psi^{*}_{t}$ . ∎

The above Lemma is used to show that, starting from any configuration with $\delta>s/12$ , we quickly reach a configuration in which some species has a constant number of agents.

Lemma 7.

If $\delta_{t}\geq s/20$ , we have:

(i)

$|\Delta\psi^{*}_{t}|\leq c_{7}$ ,

(ii)

$\mathrm{Var}(\Delta\psi^{*}_{t})\leq\frac{c_{8}}{n}$ .

where $c_{7}>0$ and $c_{8}>0$ are constants depending only on $s$ . Moreover, in any configuration $u$ with $a_{\min}\geq 2/n$ , we have:

(iii)

$|\Delta\psi(u)|\leq\frac{c_{7}}{na_{\min}}$ .

(iv)

$\mathrm{Var}(\Delta\psi(u))\leq\frac{c_{8}}{n^{2}a_{\min}}$ .

Proof.

We first consider the case of a configuration with $a_{\min}\geq 2/n$ . Using the definition of $\psi$ (and within it, of $\phi$ and $\kappa$ ). Consider any transition from a configuration $u$ to a subsequent configuration $u^{\prime}$ and let $S\subseteq\{1,2,3\}$ be defined as the set of indices of configurations changing between $u$ and $u^{\prime}$ ( $S=\{i:a_{i}^{+}(u)\neq a_{i}^{+}(u^{\prime})\vee a_{i}^{++}(u)\neq a_{i}^{++}(u^{\prime})\}$ ). We verify that there exists an absolute constant $c_{7}>0$ such that:

[TABLE]

Moreover, by the definition of the protocol a transition from $u$ to $u^{\prime}$ occurs with probability $\Pr(u^{\prime}|u)\leq\min_{i\in S}a_{i}$ . Since there is only a constant number of possible successor configurations $u_{t+1}$ for $u_{t}$ (loosely bounding, not more than $3^{6}$ ), it follows that:

[TABLE]

The bounds on the variance of $\mathrm{Var}(\Delta\psi(u))$ and that of $\Delta\psi^{*}_{t}=\Delta\psi(u)$ (for $t<t_{d}$ ) with $a_{\min,t}\geq(c+6+1)/n$ follow directly. The analysis of $\Delta\psi^{*}_{t}$ when $a_{\min,t}=c_{6}/n$ and $t<t_{d}$ is performed analogously, noting that if the succeeding configuration $u^{\prime}=u_{t+1}$ is such that $a_{\min}(u^{\prime})<c_{6}/n$ , then $\Delta\psi^{*}_{t}=\frac{1}{n}$ . Finally, for $t\geq t_{d}$ , the result holds trivially by the definition of $\psi^{*}_{t}$ . ∎

Lemma 8.

Let $u_{t_{3}}$ be an arbitrary starting configuration of the system such that $\delta_{t_{3}}>s/12$ . Then, with probability $1-O(1/n)$ , for some $t_{4}=t_{3}+O(n\log n)$ , a configuration $u_{t_{4}}$ is reached such that $a_{\min,t_{4}}=c_{6}$ .

Proof.

W.l.o.g. assume that $t_{3}=0$ . First we remark that, by the relation between $\eta$ and $\delta$ for $\delta\leq s/12$ , a process starting with $\delta_{0}>s/12$ satisfies:

[TABLE]

Moreover, for any configuration $u^{\prime}$ with $\delta(u^{\prime})\leq s/20$ we have:

[TABLE]

Thus, initially $\psi_{0}>\frac{1}{150}$ and as long as for all time steps $t$ we have $\psi_{t}\geq\frac{1}{170}$ , the barrier condition $\delta_{t}\geq s/20$ has not been violated. Moreover, for $\psi_{t}\in[\frac{1}{170},\frac{1}{150}]$ , we have by Lemma 6 that $\mathbb{E}\Delta\psi_{t}\geq 0$ . Moreover, by Lemma 7 (iii) and the fact that $\delta_{t}<\frac{1}{144}$ which implies $a_{\min,t}>s/4$ , we have that $|\Delta\psi_{t}|\leq\frac{4c_{7}}{sn}$ .

It follows from a standard application of Azuma’s inequality for martingales (resembling the analysis of the hitting time of the random walk with step size $O(1/n)$ , from one endpoint of a path of length $\Theta(1)$ to the other) that:

[TABLE]

hence also throughout the first $n^{2}/\ln n$ steps of the process we have $\delta>s/18$ , with probability $1-O(1/n)$ .

We are now ready to analyze the subsequent stages of the process, designing a Doob submartingale $Y_{t}=\sum_{\tau=0}^{t-1}X_{t}$ with time increments $(X_{t})$ defined as:

[TABLE]

Using Lemma 7 (i) and (ii) and applying the Azuma-McDiarmid inequality777If our objective in the proof of the lemma were to show a bound on $t_{4}$ which holds with constant probability (which would be sufficient for our purposes later on), rather than a w.h.p. bound, then this specific step of the proof can also be performed using Markov’s inequality. In any case, we would need to make use of the bounded variance of $\psi^{*}_{t}$ in the proof of the next Lemma. in the bounded variance version (cf. e.g. [13][Thm. 18]) to $Y_{t}$ for $t_{c}=c^{3}n\ln n$ , for some sufficiently large constant $c>0$ depending only on $s$ , we obtain:

[TABLE]

If the event $X_{t}=\Delta\psi^{*}_{t}-\frac{ps}{3600}$ were to hold for all $t<t_{c}$ with $c=\frac{2\cdot 3600}{ps}$ and if $Y_{t_{c}}>-c^{2}\ln n$ , then we would have $\psi^{*}_{t_{c}}=\psi^{*}_{0}+Y_{t_{c}}+\frac{ps}{3600}t_{c}\geq 0-c^{2}\ln n+3c^{2}\ln n=2c^{2}\ln n$ , which would mean that $\psi^{*}_{t_{c}}\neq\psi_{t_{c}}$ , since $\psi\leq 3\ln n+O(1)$ by definition. If $\psi^{*}_{t_{c}}\neq\psi_{t_{c}}$ , then $t_{4}<t_{c}$ , and the proof is complete. (Indeed, to reach a configuration with $a_{\min}<c_{6}/n$ , the protocol has to pass through a configuration with $a_{\min}=c_{6}/n$ , since the size of each population changes by at most $1$ in each transition.) Otherwise, we must have that at least one of the following events holds: $Y_{t_{c}}\leq-c^{2}\ln n$ , or $\psi_{\tau}\leq 1/170$ for some $\tau<t_{c}$ , or $a_{\min,\tau}<c_{6}$ for some $\tau<t_{c}$ . We have established that each of the first two of these events holds with probability $O(1/n)$ , whereas if the latter event holds, then $t_{4}<t_{c}$ . Thus, $t_{4}<t_{c}$ holds with probability $1-O(1/n)$ by a union bound. ∎

Lemma 9.

Let $u_{t_{4}}$ be a starting configuration of the system such that $a_{\min,t_{4}}=c_{6}/n$ . Then, with constant probability, for some $t_{5}=t_{4}+O(n\log n)$ , a configuration $u_{t_{5}}$ is reached such that $a_{j,t_{5}}\leq c_{6}/n$ and $a_{j+1,t_{5}}>s/40$ , for some $j\in\{1,2,3\}$ .

Proof.

W.l.o.g. assume that $\arg\min_{i=1,2,3}a_{i,t_{4}}=2$ . If $a_{3,t_{4}}>s/40$ , then the claim follows immediately, putting $t_{5}=t_{4}$ and $j=2$ . Otherwise, we will show that with constant probability, the system will evolve so that $a_{2}$ will increase over time until within $O(n\log n)$ steps we will have a time step $t_{5}$ with $j=3$ (i.e., $a_{3,t_{5}}\leq c_{6}/n$ and $a_{1,t_{5}}>s/40$ ).

In the considered case, w.l.o.g. assume $t_{4}=0$ . Next, let $T=cn\ln n$ for a sufficiently large constant $c$ ; we choose as $c:=2\log_{2}\frac{1}{0.005ps}$ for convenience in later analysis. Intuitively, in view of Lemmas 6 and 7, the potential $\psi^{*}_{T}$ will be further increased in the next steps: the random variable $(\psi^{*}_{T}-\psi^{*}_{0}|u_{0})$ has an expected value of $\Theta(T/n)=\Theta(\log n)$ , with a standard deviation of $\Theta(\sqrt{T/n})=\Theta(\sqrt{\log n})$ .

By an application of the Azuma-McDiarmid inequality for martingales with bounded variance similar to that in the proof of Lemma 8, we obtain the following result:888Such an analysis can also be performed using Chebyshev’s inequality, obtaining a slightly weaker expression in the probability bound.

[TABLE]

Observe that since $a_{2,0}=c_{6}/n=O(1/n)$ , we have $\psi^{*}_{0}\geq\ln n-O(1)$ . Taking this into account, for our purposes, a slightly weaker and simpler form of expression (26) will be more convenient:

[TABLE]

The proof of the lemma is completed by a more fine-grained analysis of the considered protocol. In the initial configuration $t_{4}=0$ , we have $a_{2,0}=c_{6}/n$ (there are exactly $c_{6}$ agents in state $A_{2}$ ), and since $t_{4}\neq t_{5}$ , we have $a_{3,0}<s/40$ . Consequently, $a_{1,0}=s-s/40-O(1/n)>0.9s$ . Informally, since the prey of $A_{2}$ (i.e., $A_{1}$ ) is more than twice more numerous than its predator (i.e., $A_{3}$ ), we should observe the increase in the size of population of $A_{2}$ , regardless of the activities ( $A_{i}^{+}$ or $A_{i}^{++}$ ) of the agents in the population. We consider the evolution of the system, finishing at the earliest time $t_{e}$ when $a_{2}(t_{e})>s/100$ . The following relations are readily shown (apply e.g. Lemma 14 with $i=2$ and $u=0$ ):

[TABLE]

From (29), taking into account that $|\Delta a_{i,t}|\leq\frac{1}{n}$ and $a_{3,0}<s/40$ , an application of Azuma’s inequality for martingales shows that:

[TABLE]

Taking into account the above, by a straightforward geometric growth analysis (compare e.g. proof of Lemma 5), we obtain from (28):

[TABLE]

Moreover, since the speed of increase of $a_{2}$ is bounded (even in the absence of predators) by that of a standard push rumor spreading process (formally, $\mathbb{E}(\Delta a_{2,t})\leq a_{2,t}$ ), we have (compare e.g. [20]):

[TABLE]

Now, we observe that with constant probability, the size of population $A_{2}$ does not decrease in the time interval $[0,t_{e}]$ below the value $a_{2,0}=c_{6}/n$ , attained at the beginning of this interval:

[TABLE]

Indeed, with constant probability the value $a_{2,t}$ is initially non-decreasing: with constant probability, in the first $O(n)$ rounds each of the $c_{6}=O(1)$ agents from $A_{2}$ will be triggered by the scheduler $O(1)$ times in total, and each interaction involving an agent from $A_{2}$ will have this agent as the initiator, and an agent from the largest of the three populations, $A_{1}$ , as the receiver (the prey). Thus, with constant probability, the number of agents in population $A_{2}$ is increased to an arbitrary large constant (e.g., $1000c_{6}$ ). After this, we use the geometric growth property (28) to show that $a_{2,t}$ reaches the barrier $a_{2,t}>s/100$ (at time $t_{e}$ ) before the event $a_{2,t}<c_{6}/n$ occurs (cf. e.g. proof of Lemma 5, or standard analysis of variants of rumor-spreading processes in their initial phase [29]).

When the event from bound (33) holds, at least one of the following events must also hold:

(A)

$a_{\min,t}\geq c_{6}/n$ , for all $t\leq t_{e}$ ,

(B)

or there exists a time step $t<t_{e}$ such that $a_{1,t}\leq c_{6}/n$ ,

(C)

or there exists a time step $t<t_{e}$ such that $a_{3,t}\leq c_{6}/n$ .

To complete the proof, we will show that each of the events (A) and (B) holds with probability $o(1)$ . Indeed, then in view of (33), event (C) will necessarily hold with probability $\Omega(1)$ . This means that, with probability $\Omega(1)$ , there exists a time step $t<t_{e}$ such that $a_{3,t}<c_{6}/n$ and $a_{2,t}<s/100$ (since $t<t_{e}$ ), and so also $a_{1,t}>s-s/100-c_{6}/n>0.98s>s/40$ ; thus, the claim of the lemma will hold with $t_{5}=t$ and $j=3$ .

To show that event (B) holds with probability $o(1)$ , notice that $a_{2,t}<s/100$ by definition of $t_{e}$ , and moreover $a_{3,t}<0.05s$ with probability $1-o(1)$ , hence the event $a_{1,t}<s-s/100-0.05s=0.94s$ holds with probability $o(1)$ .

To show that event (A) holds with probability $o(1)$ , notice that, substituting in (27) $t=t_{e}$ , by a union bound over (27), (31) and (32) we obtain:

[TABLE]

This means that, with probability $1-o(1)$ , we have $\psi^{*}_{t_{e}}\neq\psi_{t_{e}}$ or $\psi_{t_{e}}\geq(1+10^{-4}ps)\ln n$ . In the first case, event (A) cannot hold. In the second case, observe that $a_{2,t_{e}}=s/100+O(1/n)$ by definition of $t_{e}$ , so $a_{1,t_{e}}>s-s/100-O(1/n)-c_{6}/n>0.98s$ , and it follows that $\psi_{t_{e}}=\sum_{i=1}^{3}\ln\frac{1}{a_{i,t_{e}}}+O(1)=\ln n+O(1)$ . Since the condition $\psi_{t_{e}}\geq(1+10^{-4}ps)\ln n$ is not fulfilled, event (A) can only hold with probability $o(1)$ . ∎

Lemma 10.

Let $u_{t_{5}}$ be a starting configuration of the system such that $a_{j,t_{5}}\leq c_{6}/n$ and $a_{j+1,t_{5}}>s/40$ , for some $j\in\{1,2,3\}$ . Then, with constant probability, for some $t_{6}=t_{5}+O(n)$ , a configuration $u_{t_{6}}$ is reached such that $a_{\min,t_{6}}=0$ .

Proof.

We consider the pairs of interacting agents chosen by the scheduler in precisely the next $n$ rounds after time $t_{5}$ . Given that set $A_{j,t_{5}}$ has constant size, and set $A_{j+1,t_{5}}$ has linear size in $n$ , it is straightforward to verify that with constant probability, the set of randomly chosen $n$ pairs of agents has all of the following properties:

•

Each agent from $A_{j,t_{5}}$ belongs to exactly one pair picked by the scheduler, and is the receiver in this pair.

•

Each agent interacting in a pair with an agent from $A_{j,t_{5}}$ belongs to exactly one pair.

•

Each agent interacting in a pair with an agent from $A_{j,t_{5}}$ belongs to set $A_{j+1,t_{5}}$ .

Conditioned on such a choice of interacting pairs by the scheduler, the protocol changes the state of all agents from set $A_{j,t_{5}}$ to state $j+1$ with probability at least $p^{|A_{j,t_{5}}|}\geq p^{c_{6}}=\Omega(1)$ . State $j$ is then effectively eliminated. ∎

In the absence of species $j$ , the interaction between species $j-1$ and $j+1$ collapses to a lazy predator-prey process, with transitions of the form $(A_{j-1},A_{j+1})\to(A_{j-1},A_{j-1})$ associated with a constant transition probability. A w.h.p. bound on the time of elimination of species $j+1$ follows immediately from the analysis of the push rumor spreading model, and we have the following Lemma.

Lemma 11.

Let $u_{t_{6}}$ be a starting configuration of the system such that $a_{j,t_{6}}=0$ , for some $j\in\{1,2,3\}$ . Then, with probability $1-O(1/n)$ , for some $t_{7}=t_{6}+O(n\log n)$ , a configuration $u_{t_{7}}$ is reached such that for all $t\geq t_{7}$ , $a_{j,t}=a_{j+1,t}=0$ and $a_{j-1,t}=s$ . ∎

After a further $O(n\log n)$ steps after time $t_{7}$ , the final configuration of all agents in the oscillator’s population will be $a_{j-1,t}^{++}=s$ .

6.5 Operation of the Oscillator in the Presence of a Source

In this section we prove properties of the oscillatory dynamics for the case $\text{$ {}^{#} $}\!\!X>0$ . It is possible to provide a detailed analysis of the limit trajectories of the dynamics in this case, as a function of the concentration of $x$ . Here, for the sake of compactness we only show the minimal number of properties of the oscillator required for the proof of Theorem 1. When the given configuration is such that $a_{\min}$ is sufficiently large, say $a_{\min}>0.02s$ , then both the subclaims of Theorem 1(2) hold for the considered configuration. (The first subclaim hold directly; the second subclaim follows by a straightforward concentration analysis of the number of agents changing state in protocol $P_{o}$ over the next $0.01sn$ steps, since we will always have $a_{\min}\geq 0.01s$ during the considered time interval.) Otherwise, the considered configuration is close to one of the sides of the triangle. We will show that in the next $O(n\log n)$ steps, with high probability, the protocol will either reach a configuration with $a_{\min}>0.02s$ , or will visit successive areas around the triangle, as illustrated in Fig. 7. The following Lemmas show that within each area, an exponential growth process occurs, which propagates the agent towards the next area.

Lemma 12.

If $a_{i-1}<0.8s$ and $a_{i+1}<0.05s$ , then $\dot{a}_{i-1}\leq xs/3-0.05psa_{i-1}$ .

Proof.

From the assumptions we have that $a_{i}>0.15s$ . Starting from (3) we obtain:

[TABLE]

∎

From the above bound on expectation, the following Lemma follows directly by a standard concentration analysis. In what follows, we consider an execution in which the concentration $x$ is strictly positive and bounded by a sufficiently small absolute constant (i.e., $\text{$ {}^{#} $}\!\!X$ is at most a given constant fraction of the entire population), with the required upper bounds on $x$ used in the proofs of lemmas given in their statements. This is a technical assumption, which allows us to simplify the proof structure. In particular, the assumption $\text{$ {}^{#} $}\!\!X\leq c_{12}n$ can be omitted in the statement of Theorem 1, and the claim of the theorem can even be proved for executions in which $\text{$ {}^{#} $}\!\!X$ changes during the execution of the protocol, as long as the invariant $\text{$ {}^{#} $}\!\!X>0$ is preserved over the considered interval of time.

Lemma 13.

Let $u_{t_{a}}$ be a starting configuration of the system such that $a_{i-1,t_{a}}<0.75s$ and $a_{i+1,t_{a}}<0.05s$ . Suppose $x<10^{-3}ps$ , starting from time $t_{a}$ . Then, for some $t_{b}\in[t_{a},t_{a}+c_{9}n]$ , where $c_{9}$ is a constant depending only on $p$ and $s$ , with probability $1-e^{-n^{\Omega(1)})}$ , the system reaches a configuration $u_{t_{b}}$ such that exactly one of the following two conditions is fulfilled:

•

either $a_{\min,t_{b}}\geq 0.02s$ ,

•

or $a_{i+1,t_{b}}<0.05s$ and $a_{i-1,t_{b}}<0.02s$ .

Proof.

In the considered range of values of $a_{i-1}$ , we have $a_{i-1,t_{a}}<0.75s$ and $a_{i-1,t}\geq 0.02s$ , for all $t$ until we leave the considered area at time $t_{b}$ . Taking into account that $x<10^{-3}ps$ , it follows from Lemma 12 that:

[TABLE]

Taking into account that $|\Delta{a}_{i-1}\leq\frac{1}{n}|$ , it follows from a straightforward concentration analysis (cf. e.g. proof of Lemma 5 for a typical analysis of this type of exponential growth process) that a boundary of the considered area (either $a_{i-1,t}<0.02s$ or $a_{\min,t}>0.02s$ ) must be reached within $O(n)$ steps with very high probability, as stated in the claim of the lemma. ∎

A similar analysis is performed for the next area.

Lemma 14.

If $a_{i+1}<0.25s$ and $a_{i-1}<0.05s$ then $\dot{a}_{i+1}\geq xs/12+0.6psa_{i+1}$ and $\dot{a}_{i-1}\leq xs/3-0.2psa_{i-1}$ .

Proof.

From assumptions we have that $a_{i}>0.7s$ . Starting from (3) we obtain:

[TABLE]

∎

Again, a concentration result follows directly.

Lemma 15.

Let $u_{t_{b}}$ be a starting configuration of the system such that $a_{i-1,t_{b}}<0.02s$ and $a_{i+1,t_{b}}<0.02s$ . Suppose $x<0.02ps$ , starting from time $t_{b}$ . Then, for some $t_{a^{\prime}}\in[t_{b},t_{b}+c_{10}n\ln\frac{1}{\max\{1/n,a_{i+1,t_{b}}\}}]\subseteq[t_{b},t_{b}+c_{10}n\ln n]$ , where $c_{10}$ is a constant depending only on $p$ and $s$ , with probability $1-O(1/n^{3})$ , the system reaches a configuration $u_{t_{a^{\prime}}}$ such that exactly one of the following two conditions is fulfilled:

•

either $a_{\min,t_{a^{\prime}}}\geq 0.02s$ ,

•

or $a_{i-1,t_{a^{\prime}}}<0.05s$ , $a_{i+1,t_{a^{\prime}}}>0.25s$ , and (consequently) $a_{i,t_{a^{\prime}}}<0.75s$ .

Proof.

We first show that, starting from time $t_{b}$ onward, the process $a_{i-1,t}$ satisfies $a_{i-1,t}<0.05s$ for all $t\in[t_{b},t_{*}]$ with probability $1-e^{-n^{\Omega(1)}}$ , where $t_{*}$ is defined as the minimum of time $t_{a}+c_{10}n\ln n$ and the last time moment such that $a_{i+1,t}\leq 0.25s$ holds for all $t\in[t_{b},t_{*}]$ . By Lemma 14, we have for all $t\in[t_{b},t_{*}]$ such that $a_{i-1,t}>0.01s$ :

[TABLE]

where we took into account the assumption $x<0.02ps$ . The claim on $a_{i-1,t}<0.05s$ follows from a standard concentration analysis, noting that $|\Delta a_{i-1,t}|\leq\frac{1}{n}$ .

In order to analyze the process $a_{i+1,t}$ , we apply a filter and consider the process $a^{\prime}_{i+1,t}$ , starting at time $t_{b}$ , defined as follows. For as long as $a_{i-1,t}<0.05s$ , we put $a^{\prime}_{i+1,t}:=a_{i+1,t}$ , and starting from the first time $t_{**}$ when $a_{i-1,t}>0.05s$ , we compute $a^{\prime}_{i+1,t+1}$ as the subsequent value of $a_{i+1}$ after a simulation of a single step of the process for some state $u$ with concentrations of types: $x(u)=x$ , $a_{i+1}(u)=a^{\prime}_{i+1,t}$ , $a_{i-1}(u)=0.05s$ , and $a_{i}(u)=s-a_{i-1}(u)-a_{i+1}(u)$ .

For a given time step $t$ , let $R_{t}$ denote the event that $\Delta a^{\prime}_{i+1,t}\neq 0$ . By the construction of protocol $P_{o}$ , which always requires at least one agent of type $X$ or type $A_{i+1}$ to be involved in an interaction which creates or destroys an agent of type $A_{i+1}$ , we have:

[TABLE]

Moreover, from Lemma 14 it follows that for $a^{\prime}_{i+1,t}<0.25s$ :

[TABLE]

Since $\Delta a^{\prime}_{i+1,t}|\neg R_{t}=0$ , we have:

[TABLE]

and moreover $\Delta a^{\prime}_{i+1,t}|R_{t}\in\{-\frac{1}{n},0,\frac{1}{n}\}$ . Analysis of this type of process is folklore (in the context of epidemic models with infection and recovery) but somewhat tedious; we sketch the argument for the sake of completeness. When considering only those steps for which event $R_{t}$ holds, the considered process can be dominated by a lazy random walk on the line $\{0,\frac{1}{n},\frac{2,n}{,}\ldots\}$ , with a constant bias towards its right endpoint. To facilitate analysis, we define points $Q_{c}={c\lfloor\alpha\ln n\rfloor}{n}$ , for $c=0,1,\ldots$ , where constant $\alpha>0$ is subsequently suitably chosen, and for any point $Q_{c}$ to the right of the starting point of the walk (i.e, $c>c_{\min}$ where $c_{\min}$ is the smallest integer such that $Q_{c_{\min}+1}>a_{i+1,t_{b}}$ ) define $s_{c}$ as the number of steps of the walk until its first visit to $s_{c}$ . For a suitable choice of constants $\alpha$ and $\beta>0$ sufficiently large, we have that for any $c$ , with probability at least $1-O(1/n^{2})$ , $s_{c+1}-s_{c}\leq\beta\ln n$ , and moreover between its step $s_{c}$ and its step $s_{c+1}$ , the walk is confined to the subpath $(Q_{c-1},Q_{c+1})$ of the considered path. Considering the original time $t$ of our process $a^{\prime}_{i+1,t}$ (including moments with $\neg R_{t}$ ), let $t_{c}$ be the moment of time corresponding to the $s_{c}$ -th step of the walk. Conditioning on events which hold with probability $1-O(1/n^{2})$ , the value $t_{c+1}-t_{c}$ can be stochastically dominated by the sum of $\beta\ln n$ independent geometrically distributed random variables, each with expected value $O(\frac{n}{\max\{1,(c-1)\ln n\}})$ . Let $c_{\max}$ be the largest positive integer such that $Q_{c_{\max}}<0.25s$ . Applying a union bound on the conditioning of all intervals $t_{c+1}-t_{c}$ , for $c\geq c_{min}$ and a concentration bound on the considered geometric random variables, we eventually obtain that with probability $1-O(1/n^{3})$ the condition $a^{\prime}_{i+1,t}$ is achieved for time:

[TABLE]

Recalling that $a^{\prime}_{i+1,t}=a_{i+1,t}$ holds throughout the considered time interval with very high probability, the claim follows. ∎

An iterated application of Lemmas 13 and Lemmas 15 moves the process along time moments $t_{a}$ , $t_{b}$ , $t_{a}^{\prime},\ldots$ , where time moment $t_{a}^{\prime}$ is again be fed to Lemma 13, considering the succeeding value of $i$ . After a threefold application of both Lemmas, the process has w.h.p. in $O(n\log n)$ steps either performed a complete rotation, passing through three moments of time designated as “ $t_{a}$ ”, rotated by one third of a full circle, or has reached at some time $t^{\prime}$ a point with $a_{\min,t^{\prime}}\geq 0.02s$ . In either case, the claim of Theorem 1(2) follows directly.

7 Analysis of Protocol for Detect

7.1 Further Properties of the Oscillator

We start by stating a slight generalization of Lemma 6, capturing the expected change of potential $\psi_{t}^{*}$ (given by (21)) for the case $\text{$ {}^{#} $}\!\!X>0$ , for configurations which are sufficiently far from both the center and the sides of the triangle.

Lemma 16.

In any configuration $u_{t}$ with $10^{-6}s^{2}\leq a_{\min}\leq 0.02s$ and $x<c_{12}$ we have: $\mathbb{E}\Delta\psi_{t}\geq\frac{ps}{7200n}$ , where $c_{12}>0$ is an absolute constant which depends only on $s$ and $p$ .

Proof.

We can condition the expectation of $\mathbb{E}\Delta\psi_{t}$ on the event $E_{t}$ , which holds if an agent in state $X$ participates in the current interaction. Conditioned on $\neg E_{t}$ , the analysis corresponds directly to the computations performed for the case $x=0$ , where we remark that the assumptions of Lemma 6 are satisfied due to the assumed upper bound on $a_{\min}$ . Thus:

[TABLE]

Next, taking into account the lower bound on $a_{\min}$ , by exactly the same argument as in Lemma 7( $iii$ ), we have $|\Delta\psi_{t}|<c^{\prime}_{12}$ /n, for some choice of constant $c^{\prime}_{12}>0$ which depends only on $s$ and $p$ . Obviously,

[TABLE]

and since $\Pr[E_{t}]<2x$ , by the law of total expectation:

[TABLE]

where the last inequality holds for any $x<c_{12}$ , where $c_{12}:=\frac{1}{2(2c^{\prime}_{12}+1)}$ . ∎

Lemma 17.

Suppose $a_{\min,t_{0}}<10^{-6}s^{4}$ at some time $t_{0}$ . Then, there exists an absolute constant $c_{13}>0$ , such that the following event holds with probability $1-e^{-n^{\Omega(1)})}$ : for all $t\in[t_{0},t_{0}+e^{n^{c_{13}}}]$ , we have $a_{\min,t}<0.01s^{2}$ .

Proof.

Let $\hat{\psi}_{t}\equiv\ln\frac{s^{3}}{27}-\psi_{t}$ . Consider any $t\geq t_{0}$ such that $a_{\min,t}<10^{-6}s^{4}$ . Then, $\phi_{t}<\ln a_{\min,t}<\ln(10^{-6}s^{4})$ and consequently:

[TABLE]

where we recall that $\kappa_{t}^{2}\leq 1$ and the last inequality follows for $p$ chosen to be sufficiently small ( $4p/s^{4}<\ln 2$ ).

Further, note that if for some time $t$ we have $\hat{\psi}_{t}<\ln(8\cdot 10^{-6}s^{4})$ , then:

[TABLE]

thus $a_{1,t}a_{2,t}a_{3,t}<8\cdot 10^{-6}s^{4}$ , from which it follows that $a_{\min,t}^{2}<16\cdot 10^{-6}s^{4}$ , and so $a_{\min,t}<0.01s^{2}$ .

Thus, for $\hat{\psi}_{t}<\ln(8\cdot 10^{-6}s^{4})$ , at least one of the following holds:

•

Either $\hat{\psi}_{t}<\ln(2\cdot 10^{-6}s^{4})$ ,

•

Or $\hat{\psi}_{t}\geq\ln(2\cdot 10^{-6}s^{4})$ , thus $a_{\min,t}\geq 10^{-6}s^{4}$ . Then, taking into account that $a_{\min,t}<0.01s^{2}$ , we have by Lemma 16: $\mathbb{E}\Delta\hat{\psi}_{t}<-\frac{ps}{7200n}$ .

Taking into account the known properties of function $\psi_{t}$ (Lemma 7), we have that starting from $\hat{\psi}_{t_{0}}<\ln(2\cdot 10^{-6}s^{4})$ , it takes time exponential in a polynomial of $n$ ( $e^{\Omega(n^{c_{13}})}$ , for some absolute constant $c_{13}>0$ ) to break the potential barrier for $\hat{\psi}$ , i.e., to reach the first moment of time $t_{1}$ such that $\hat{\psi}_{t_{1}}\geq\ln(8\cdot 10^{-6}s^{4})$ , with probability $1-e^{-n^{\Omega(1)})}$ , for some absolute constant $c_{14}>0$ . To complete the proof, recall that for any $t<t_{1}$ , we have $\hat{\psi}_{t}<\ln(8\cdot 10^{-6}s^{4})$ , and so as previously established, $a_{\min,t}<0.01s^{2}$ . ∎

For any execution of the oscillator protocol $P_{o}$ , we can now divide the axis of time into maximal time intervals of two types, which we call oscillatory and central. A central time interval continues for as long as the condition $a_{\min,t}\geq 10^{-6}s^{4}$ is fulfilled, and turns into an oscillatory interval as soon as this condition no longer holds. An oscillatory time interval continues for as long as the condition $a_{\min,t}<0.01s^{2}$ is fulfilled, and turns into an oscillatory interval as soon as this condition no longer holds. Lemma 17 implies that an oscillatory interval is of exponential length w.v.h.p.

Lemma 18.

Suppose $0<x<c_{12}$ . Let $t_{0}$ be an arbitrary moment of time such that $a_{\min,t_{0}}>0$ . Let $T=Cn\ln\frac{1}{a_{\min,t_{0}}}$ , for an arbitrarily fixed constant positive integer $C=O(1)$ . With probability $1-e^{-n^{\Omega(1)}}$ , we have for all subsequent moments of time $t\in[t_{0},t_{0}+T]$ :

[TABLE]

Proof.

Without loss of generality assume $t_{0}=0$ . We can assume in the proof that $a_{\min,0}>n^{-0.01/C}$ , otherwise the claim trivially holds. Thus, initially we have $\phi_{0}\geq 3\ln a_{\min,0}\geq-\frac{0.03}{C}\ln n\geq-0.03\ln n$ .

We proceed to show that potential $\phi$ does not decrease much during the considered motion. We have at any time $\dot{\phi}\geq-p$ , which follows directly from (5).

Suppose at some time $t$ we have $\phi_{t}\geq-0.2\ln n$ . We note that $a_{\min,t}\geq e^{\phi_{t}}\geq n^{-0.2}$ . Applying Lemma 1, we have under these assumptions:

[TABLE]

and moreover by the properties of the natural logarithm (cf. e.g. [15]):

[TABLE]

As usual, we apply a Doob martingale, with $\phi^{\prime}_{t}:=\phi_{t}$ until the first moment of time $t$ such that $\phi_{t}<-0.2\ln n$ , and subsequently $\phi^{\prime}_{t+1}:=\phi^{\prime}_{t}$ for larger $t$ . We have $\mathbb{E}\Delta\phi_{t}\geq-n^{-1}$ and $|\Delta\phi_{t}\|<4n^{-0.8}$ .

Considering $T=Cn\ln\frac{1}{a_{\min,0}}<Cn\ln n$ steps of the process starting from time [math], by a standard application of Azuma’s inequality, we obtain that with probability $1-e^{-n^{\Omega(1)}}$ , for all $t\in[0,T]$ we have:

[TABLE]

From the last inequality it follows that $\phi^{\prime}_{t}\geq\phi^{\prime}_{0}-0.1\ln n\geq-0.2\ln n$ for all $t\in T$ , with probability $1-e^{-n^{\Omega(1)}}$ , and so $\phi^{\prime}_{t}=\phi_{t}$ . We now rewrite the same bound for $\phi_{t}$ , using the relation $\phi\geq 3\ln a_{\min}$ :

[TABLE]

Taking into account that $-\ln\frac{1}{a_{\min,t}}\geq\phi_{t}$ for $a_{\min,t}\neq 0$ , we obtain the claim. ∎

Lemma 19.

Suppose $0<x<c_{12}$ . Fix type $i\in\{1,2,3\}$ . Let $t_{0}$ be any time such that $a_{\min,t_{0}}<10^{-6}s^{4}$ . Let $t^{*}>t$ be the first moment after $t_{0}$ such that $i$ is the most represented type, $a_{i,t^{*}}=a_{\max,t^{*}}$ . Let $t^{**}>t^{*}$ be the first moment after $t^{*}$ such that $i$ is the least represented type, $a_{i,t^{**}}=a_{\min,t^{**}}$ .

Then, with probability $1-O(1/n^{3})$ , $t^{**}\leq t_{0}+c_{11}n\ln\frac{1}{\max\{1/n,a_{i+1,t_{0}}\}}$ , where $c_{11}>0$ is an absolute constant depending only on $s$ and $p$ .

Proof.

From Lemma 17, we have that w.v.h.p. the protocol is in an oscillatory interval which will last super polynomial time, i.e., with probability $1-e^{-n^{\Omega(1)})}$ : for all $t\in[t_{0},t_{0}+e^{n^{c_{13}}}]$ , we have $a_{\min,t}<0.02s$ . Acting as in the previous subsection, we iteratively apply Lemma 13 and Lemma 15. After at most 6 applications of both Lemmas, the process has performed two complete rotations around the triangle, w.h.p., passing in particular through time moments $t^{*}$ where the designated type $i$ was a maximal type and $t^{**}$ where type $i$ was a minimal type. It remains to bound the time required to perform these iterations. We consider as an example a single application of Lemma 15 starting at a time $t_{b}$ and ending at a time $t_{a^{\prime}}$ , where $t_{a^{\prime}}<t_{b}+c_{10}n\ln\frac{1}{\max\{1/n,a_{i+1,t_{b}}\}}$ with probability $1-O(1/n^{3})$ . Applying Lemma 18 at time $t_{b}$ , we obtain $\ln\frac{1}{a_{\min,t_{a^{\prime}}}}\leq 100C\ln\frac{1}{a_{\min,t_{b}}}$ , w.v.h.p. We use this bound for the next application of Lemma 13, and so on. After a total of at most 12 applications, we eventually obtain a bound of the form $c_{11}\ln\frac{1}{a_{\min,t_{0}}}$ on the length of the considered time interval, where the value of $c_{11}$ is computed as a function of $C$ . ∎

7.2 Protocol Extension $P_{m}$ : Majority

The composition of the extension is specified in Fig. 4. In what follows, we denote $M^{(s)}_{i}:=\text{$ {}^{#} $}\!\!(A_{i}^{?},M_{s})$ and $m^{(s)}_{i}:=M^{(s)}_{i}/n$ , for $s\in\{-1,0,+1\}$ .

Lemma 20.

Suppose $0<x<c_{12}$ . Let $t_{0}$ be an arbitrarily chosen moment of time and let $T>0$ . For fixed $i\in\{1,2,3\}$ , we have for all $t\in[t_{0},t_{0}+T]$ :

[TABLE]

with probability $1-O(e^{-n^{1/6}})$ .

Proof.

W.l.o.g. assume $t_{0}=0$ and $i=1$ . Denote $A_{0}:=\text{$ {}^{#} $}\!\!A_{1,0}$ , $D_{t}:=M^{(+1)}_{1,t}-M^{(-1)}_{1,t}$ , and $G_{t}:=D_{t}^{2}$ . As usual, we denote $\Delta D_{t}=D_{t+1}-D_{t}$ and $\Delta G_{t}=G_{t+1}-G_{t}$ . For the subsequent analysis, we choose to use the “squared potential” $G_{t}$ to simplify considerations; this would be the usual potential of choice to analyze an unbiased random walk with a fair coin toss.

First we remark that $|D_{0}|\leq A_{0}$ , so $|G_{0}|\leq A_{0}^{2}$ . Next, observe that since at most one agent changes its state in a single time step, we have $|\Delta D_{t}|\leq 2$ , and so:

[TABLE]

We now upper bound the expectation $\mathbb{E}\Delta G_{t}$ . We condition this expectation on the disjoint set of events $R_{6},R_{7},R_{8},R_{9},R_{10},R_{0}$ , where $R_{r}$ , for $6\leq r\leq 10$ , corresponds to Rule $(r)$ being executed in the current step, and $R_{0}$ is the event that none of these rules is executed. We have the following:

•

If event $R_{0}$ or $R_{6}$ holds, then at least one of the following three situations occurs: (1) the values of $M^{(+1)}_{1}$ and $M^{(-1)}_{1}$ both remain unchanged at time $t$ , (2) an agent changes state from type $A_{1}$ to another type, or (3) an agent turns from another type into type $A_{1}$ . In case (1), we have $D_{t+1}=D_{t}$ . In case (2), the probability that $|D_{t+1}|=|D_{t}|+1$ is not more than the probability that $|D_{t+1}|=|D_{t}|-1$ , since by the construction of the protocol, the choice of the agent leaving the population is completely independent of its value $M_{s}$ . In case (3), we have $\Pr[|D_{t+1}|=|D_{t}|-1]=\Pr[|D_{t+1}|=|D_{t}|+1]=1/2$ by construction. In all cases, $\Pr[|D_{t+1}|=|D_{t}|+1]\leq\Pr[|D_{t+1}|=|D_{t}|-1]$ . We therefore have:

[TABLE]

•

For events $R_{7}$ and $R_{8}$ , we have $D_{t+1}|R_{7}=D_{t}-1$ and $D_{t+1}|R_{8}=D_{t}+1$ . Since events $R_{7}$ and $R_{8}$ hold with equal probability, it follows that:

[TABLE]

•

Finally, for events $R_{9}$ and $R_{10}$ , we have $D_{t+1}|R_{9}=D_{t}+1$ and $D_{t+1}|R_{10}=D_{t}-1$ . Since $\Pr[R_{9}]\cdot M^{(-1)}_{1}=\Pr[R_{10}]\cdot M^{(+1)}_{1}$

[TABLE]

where we assume in notation that $A_{1}>0$ .

Applying the law of total expectation for $\Delta G_{t}$ over the set of events $R_{0}\vee R_{6},R_{7}\vee R_{8},R_{9}\vee R_{10}$ and noting that $\Pr[R_{9}\wedge A_{10}]\leq r\frac{A_{1}}{n}$ , we eventually obtain:

[TABLE]

Inequalities (34) and (35) are sufficient to lower-bound the evolution of random variable $G_{t}$ , which undergoes multiplicative drift with rate parameter $1+4r/n$ (up to lower order terms). Since known multiplicative drift lower bounds (cf.e.g. [17, 33]) do not appear to cover this case explicitly, we sketch the corresponding submartingale analysis (with slightly weaker parameters) for the sake of completeness.

Consider any moment of time $t$ such that $G_{t}\geq n^{1.6}$ . Define target value $G_{\max}=(1+8r)G_{t}\geq n^{1.6}$ . Consider the following filter, defining $G^{\prime}_{\tau}$ , $\tau\geq 0$ as the submartingale with $G^{\prime}_{\tau}=G_{t+\tau}$ until the first moment of time at which $G_{t+\tau}<G_{\max}$ , and $G^{\prime}_{\tau}=G^{\prime}_{\tau+1}+\frac{5r}{n}G_{\max}$ for all subsequent moments of time. Note that $\mathbb{E}\Delta G^{\prime}_{\tau}\leq\frac{5r}{n}G_{\max}$ by (35) and $|\Delta G^{\prime}_{\tau}|\leq 4n+4\leq 5n$ by (34) (where we conduct the entire analysis for $n$ sufficiently large with respect to absolute constants of the algorithm). By Azuma’s inequality, we have for any $\tau>0$ and $z>0$ :

[TABLE]

Next, choosing any $\tau\leq n$ and $z=rG_{\max}\geq rn^{1.6}$ and noting that:

[TABLE]

we rewrite the concentration equality as:

[TABLE]

Applying to the above a union bound over all $\tau\in[0,n]$ , we obtain by another crude estimate:

[TABLE]

from which it follows directly by the definition of $G^{\prime}_{\tau}$ that:

[TABLE]

Thus, given that $G_{\max}=(1+8r)G_{t}$ , the value of $G_{t}$ increases by a factor of at most $(1+8r)$ over $n$ steps, with very high probability. Iterating the argument at most a logarithmic number of times and applying a union bound gives for arbitrary $t$ :

[TABLE]

with probability at least $1-e^{-n^{1/6}}$ , from which the claim of the lemma follows directly after taking the square root and normalizing by a factor of $n$ . ∎

By considering the sizes of populations $m^{(-1)}_{i,t}$ , $m^{(0)}_{i,t}$ , and $m^{(+1)}_{i,t}$ (whose sum is $a_{i,t}$ ), we obtain the following corollary of the above Lemma, applied for a suitably chosen value $T=\frac{0.001n}{r}\ln\frac{1}{a_{i,t_{0}}}$ .

Lemma 21.

Suppose $0<x<c_{12}$ . Let $t_{0}$ be an arbitrarily chosen moment of time with $a_{i,t_{0}}\leq 0.02s^{2}$ . For fixed $i\in\{1,2,3\}$ , we have for all $t\in[t_{0},t_{0}+\frac{0.001n}{r}\ln\frac{1}{\max\{\frac{1}{n},a_{i,t_{0}}\}}]$ :

[TABLE]

with probability $1-O(e^{-n^{1/6}})$ . ∎

The above Lemma provides a crucial lower bound on the size of population $m^{(0)}_{i}$ .

Lemma 22.

Suppose $0<x<c_{12}$ . Let $t_{0}$ be an arbitrarily chosen moment of time with $a_{i,t_{0}}\leq 0.02s^{2}$ . For fixed $i\in\{1,2,3\}$ , we have for all $t\in[t_{0},t_{0}+\frac{0.005n}{r}\ln\frac{1}{\max\{\frac{1}{n},a_{i,t_{0}}\}}]$ such that $a_{i,t}>0.25s$ :

[TABLE]

with probability $1-e^{-n^{\Omega(1)}}$ .

Proof.

Note first that by the conditions $a_{i,t_{0}}\leq 0.02s^{2}<0.1s<0.25s<a_{i,t}$ . Let $t_{1}>t_{0}$ denote the last moment of time before $t_{2}$ such that $a_{i,t_{1}-1}<0.1s$ and let $t_{2}\geq t_{1}+0.15sn$ denote the first moment of time after $t_{1}$ such that $a_{i,t_{2}}>0.25s$ . We have $t\geq t_{2}+0.05sn$ .

We now consider the process $m^{(+1)}_{i,\tau}$ (exactly the same arguments may be applied for process $m^{(-1)}_{i,\tau}$ ). We have $\Delta m^{(+1)}_{i,\tau}\leq 1/n$ . The analysis is divided into two phases:

•

Phase 1: $\tau\in[t_{1},t_{2})$ (thus $a_{i,\tau}\in[0.1s,0.25s]$ ). Initially, $m^{(+1)}_{i,t_{1}}\geq 0$ . Suppose for some step $\tau$ we have $m^{(+1)}_{i,\tau}<0.02s$ . Rule (6) is executed with probability $(a_{i,\tau})(1-a_{i,\tau})\geq 0.1s(1-0.25s)\geq 0.075s^{2}$ , whereas rule (8) or (9) which reduces $m^{(+1)}_{i,\tau}$ is executed with probability at most $2rm^{(+1)}_{i,\tau}<0.04rs<0.01s^{2}$ . A computation of the expected value provides:

[TABLE]

An application of Azuma’s inequality yields that $m^{(+1)}_{i,t_{2}}>\frac{1}{2}\frac{0.06s^{2}}{n}(t_{2}-t_{1})>0.004s^{3}$ , with probability $1-e^{-n^{\Omega(1)}}$ . In case of failure, we consider the process no further.

•

Phase 2: $\tau\in[t_{2},t)$ (thus $a_{i,\tau}\geq 0.1s$ ). From Phase 1, we have that initially $m^{(+1)}_{i,t_{2}}>0.004s^{3}$ . Suppose for some step $\tau$ we have $0.001s^{3}<m^{(+1)}_{i,\tau}<0.01s$ . We consider two cases:

–

If $a_{i,\tau}\geq 0.25s$ , then by Lemma 21 we have:

[TABLE]

with probability $1-e^{-n^{\Omega(1)}}$ . In case of failure we interrupt the analysis (this is an implicit application of union bounds over successive steps $\tau$ ). Under the assumption $m^{(+1)}_{i,\tau}<0.01sr$ , we conclude:

[TABLE]

and hence:

[TABLE]

Now, to compute the expected value $\Delta m^{(+1)}_{i,\tau}$ , we remark that rule (6) does not decrease this expected value since $m^{(0)}_{i,\tau}$ . Moreover, in view of (36), the probability of executing rule (9) (which increases $m^{(-1)}_{i,t}$ by $1/n$ ) exceeds the probability of executing one of the rules (7) or (8) (which decrease $m^{(-1)}_{i,t}$ by $1/n$ ) by $0.005sm^{(+1)}_{i,\tau}r>5\cdot 10^{-}6s^{4}r$ by the assumption $0.001s^{3}<m^{(+1)}_{i,\tau}$ . We eventually obtain in this case:

[TABLE]

–

If $a_{i,\tau}\geq 0.25s$ , then assuming $m^{(+1)}_{i,\tau}<0.01s$ , we can perform an analogous analysis as in the first phase to obtain:

[TABLE]

which, in particular, also implies (37).

We have thus shown that the expected change to $m^{(+1)}_{i,\tau}$ satisfies (37). Noting that initially $m^{(+1)}_{i,t_{2}}>0.004s^{3}=0.001s^{3}+0.003s^{3}$ , an application of Azuma’s inequality to an appropriate Doob martingale with (37) shows that the event $m^{(+1)}_{i,\tau}>0.001s^{3}$ will hold for all remaining steps of the process $\tau$ , with probability $1-e^{-n^{\Omega(1)}}$ .

∎

Lemma 23.

Let $t$ be any moment of time with $a_{\min,t}>4r^{1/2}$ . For all $i\in\{1,2,3\}$ , we have:

[TABLE]

with probability $1-O(e^{-n^{\Omega(1)}})$ .

Proof.

Denote $c=4r^{1/2}$ . To show the claim, observe that necessarily for all $\tau\in[t-cn/2,t]$ we have $a_{\min,\tau}>c/2$ . We consider the change of value $m^{(+1)}_{\tau}$ over time (the argument for $m^{(-1)}_{\tau}$ follows symmetrically). Initially, we have $m^{(+1)}_{t-cn/2}\geq 0$ , and at every step $|\Delta m^{(+1)}_{\tau}|\leq 1/n$ . At any time $\tau$ such that $m^{(+1)}_{\tau}<s/4$ we have the following cases:

•

Rule (6) is executed, which happens with probability at least $a^{2}_{\min,\tau}>c^{2}/4$ . Since $m^{(+1)}_{\tau}<s/4$ , conditioned on this event, the expected value of $\Delta m^{(+1)}_{\tau}$ is at least $1/4$ .

•

One of the rules (7)-(10) is executed, which occurs with probability at most $r$ .

•

In all other cases, we have $\Delta m^{(+1)}_{\tau}=0$ .

Noting that $r=c^{2}/16$ , the claim follows from a standard application of Azuma’s inequality. ∎

Lemma 24.

Suppose $0<x<c_{12}$ . Let $t\geq 2c_{11}n\log n$ be an arbitrary moment of time. Then, $\min\{m_{+1,t},m_{-1,t}\}>c_{15}$ , with probability $1-O(1/n)$ , for some absolute constant $c_{15}>0$ depending only on $s$ , $p$ , and $r$ .

Proof.

Assume w.l.o.g. $t=2c_{11}n\ln n$ . Instead of analyzing the evolution of the real system, we consider an execution of a system which is coupled with it over the first $t$ steps as follows. First, starting from time [math], we perform $t$ steps of protocol $P_{o}$ (i.e., considering only rules $(1)-(5)$ of its definition, and without setting values of the second component $M_{?}$ ). Next, we once again activate the pairs of elements which were activated in the first part of the coupling, in the same order, applying rules $(6)-(10)$ of the protocol with the same outcome which they would have received in the original execution. Clearly, at time $t$ the same configuration $u(t)$ is reached by both the original and coupled execution.

Consider first the execution of $P_{o}$ from time [math]. If $a_{\min,0}<10^{-6}s^{4}$ , then the execution is in an oscillatory interval at time [math], and will remain in it ( $a_{\min}<0.02s^{2}$ ) until time $t=c\ln n$ with probability $1-e^{-n^{\Omega(1)}}$ . Then, for all $\tau\in[0,t]$ we assume that the claim of Lemma 19 holds with $t_{0}=\tau$ for all $i\in\{1,2,3\}$ . By a crude union bound, this event holds with probability $1-O(1/n)$ ; from now on we assume this is true. (Formally, to allow us to proceed, in the analysis we can say with implicitly couple the system with a different set of random choices to which the system switches in the low-probability event that the claim of Lemma 19 were not to hold for some $t_{0}=\tau$ for the original system.) Given that in the claim of Lemma 19 for $t_{0}=0$ we have $t^{**}<t$ , and for $t_{0}=t$ we have $t^{*}\geq t$ , by the properties of the time intervals $[t^{*},t^{**}]$ we observe that there must exist a time $\tau\in[0,t]$ and a type $i\in\{1,2,3\}$ such that $A_{i}$ is the least represented type at time $\tau$ and the most represented type at time $t$ ( $a_{\max,t}=a_{i,t}$ and $a_{\min,\tau}=a_{i,\tau}$ ), and moreover $t\in[t^{*},t^{**}]$ in the claim of Lemma 19 with choice of $t_{0}=\tau$ . Since $a_{\max,t}\geq s/3$ , we now apply Lemma 22 with $t_{0}=\tau$ to obtain the claim, noting that $a_{\min,\tau}<0.02s^{2}$ and moreover that $t<\tau+\frac{0.005n}{r}\ln\frac{1}{\max\{\frac{1}{n},a_{i,t_{0}}\}}$ , given that we have $t<\tau+c_{11}n\ln\frac{1}{\max\{1/n,a_{i+1,t_{0}}\}}$ , where $c_{11}$ is a constant depending only on $s$ and $p$ , and noting that we can choose $r<\frac{c_{1}1}{0.005}$ .

It remains to consider the cases when the execution starts at time [math] with $a_{\min,0}\geq 10^{-6}s^{4}$ . Then, if $a_{\min,t}\geq 10^{-6}s^{4}$ holds, the claim follows from Lemma 23 given that $r$ is chosen so that $10^{-6}s^{4}>4r^{1/2}$ . Otherwise, there must exist some last time $t^{\prime}\leq t$ such that $a_{\min,t^{\prime}}\geq 10^{-6}s^{4}$ . We apply Lemma 19 with $t_{0}=t^{\prime}$ . If the obtained value $t^{**}$ satisfies $t^{**}<t$ , then we can apply an analogous analysis as in the case $a_{\min,0}<10^{-6}s^{4}$ to obtain the claim. Otherwise, we have that $t\leq t^{**}\leq t+cn$ , where the value of constant $c$ , depending only on $s$ and $p$ follows from Lemma 19. By an iterated application of Lemma 18, we obtain that $a_{\min,t}\geq c^{\prime}$ , where the value of constant $c^{\prime}>0$ , depending only on $s$ and $p$ , follows from the application of Lemma 18. Choosing $r$ sufficiently small so that $c^{\prime}>4r^{1/2}$ , we complete the proof using Lemma 23. ∎

Finally, for the sake of completeness we state how the majority protocol stops in the case of $\text{$ {}^{#} $}\!\!X=0$ .

Lemma 25.

Suppose $x=0$ . Then, there exists a moment of time $t_{s}$ such that either $m_{+1,t}=0$ or $m_{-1,t}=0$ holds for all $t>t_{s}$ . Moreover, $t_{s}<c_{16}n\log^{2}n$ with probability $1-O(1/n)$ , for some absolute constant $c_{16}>0$ depending only on $s$ , $p$ , and $r$ .

Proof.

By Theorem 1(1), there exists a moment of time $t_{0}=O(n\log^{2}n)$ such that the system reaches a corner configuration (cf. Lemma 11). W.l.o.g., assume that $a_{1}=s$ and $a_{2}=a_{3}=0$ . At this point, in the majority protocol Rule (7) will never again be activated, whereas the execution of rules (8)-(11) follows precisely the classical majority scenario of Angluin et al. [7]. By a standard concentration analysis (see also [7]), one of the two species $M_{+1},M_{-1}$ will become extinct in $O(n\log^{2}n)$ steps with probability $1-O(1/n)$ . ∎

As a side remark on Lemma 25 that it is possible to initialize the system entirely with a state $(A_{i},M_{0})$ so that $m_{+1,t}=m_{-1,t}=0$ holds throughout the process (even if the designed protocol will never enter such a configuration from most initial configurations).

7.3 Protocol Extension $P_{l}$ : Detection with Lights

To complete the proof of Theorem 3, we design a protocol extension $P_{d}$ , such that Detection is solved by the composition $((P_{o}\circ P_{m})+P_{l})$ . Extension $P_{l}$ , uses three states, $\{L_{-1},L_{+1},L_{\mathit{on}}\}$ . We informally refer to states $L$ as lights. The composition is given in Fig. 5. Informally, state $L_{-1}$ means that the agent is “waiting for meeting $M_{-1}$ ”, then after meeting $M_{-1}$ it becomes $L_{+1}$ , “waiting for $M_{+1}$ ” and finally it becomes $L_{\mathit{on}}$ .

To analyze the operation of the protocol, consider first the case of $x=0$ . By Lemma 25, after $O(n\log^{2}n)$ steps, agents in at least one of the states $\{M_{+1},M_{-1}\}$ are permanently eliminated from the system. Thus, either rule (11) or rule (12) will never again be executed in the future. An agent which is in state $L_{\mathit{on}}$ will spontaneously move to another state following rule (13) within $O(\frac{1}{q(\varepsilon)}n\log n)$ steps, with probability $1-O(1/n^{2})$ , and will never reenter such a state, since this would require the activation of both rule (11) and rule (12). By applying a union bound over all agents, we obtain that state $L_{\mathit{on}}$ never again appears in the population after $O(n\log n)$ steps from the termination of the majority protocol, with probability $1-O(1/n)$ . Overall, all nodes reach a state having a different state than $L_{\mathit{on}}$ after $O(n\log^{2}n)$ steps from the start of the process, with probability $1-O(1/n)$ , and all leave such a state eventually with certainty.

In the presence of the source $X$ , the analysis of the process can be coupled with a Markov chain on three states $L_{-1}$ , $L_{+1}$ , and $L_{\mathit{o}n}$ . In view of Lemma 24, transitions from state $L_{-1}$ to $L_{+1}$ and from state $L_{+1}$ to to $L_{-1}$ occur with at least constant probability (except for an $O(1/n)$ -fraction of all time steps), this 3-state chain is readily shown to be rapidly mixing. For a choice of $q(\varepsilon)>0$ depending only on $s,p,r,\varepsilon$ sufficiently small, we can lower-bound the number of agents occupying state $L_{on}$ by $(1-\varepsilon)n$ , with high probability.

Under the natural decoding of states as “informed” (having component $L_{\mathit{on}}$ ) or “uninformed” (having component $L_{-1}$ or $L_{+1}$ ), the proof of Theorem 3 is complete. We remark that it is also possible to design a related protocol in which exactly one state is recognized as “informed” and exactly one state is recognized as “uninformed”; we omit the details of the construction.

8 Proof of Impossibility Result

This Section is devoted to the proof of Theorem 4. First, we restate some notation. We recall that the vector $z=(z^{(1)},\ldots,z^{(k)})\in\{0,1,\ldots,n\}^{k}=Z$ describes the number of agents having particular states, and $\|z\|_{1}=n$ . In this section we will identify the set of states with $\{1,\ldots,k\}=[1,k]$ . It is now also more convenient for us to work with a scheduler which selects unordered (rather than ordered) pairs of interacting agents; we note that both models are completely equivalent in terms of computing power under a fair random scheduler, since selecting an ordered pair of agents can be seen as selecting an unordered pair, and then setting their orientation through a coin toss. Indexing with integers $\{1,2,\ldots,r\}$ the set of all distinct rules of the protocol, where $r\leq k^{4}$ , for a rule $j\equiv``\{i_{1}(j),i_{2}(j)\}\to\{o_{1}(j),o_{2}(j)\}^{\prime\prime}$ , $1\leq j\leq r$ , $i_{1}(j),i_{2}(j),o_{1}(j),o_{2}(j)\in\{1,\ldots,k\}$ , we will denote by $q_{j}$ , the probability (selected by the protocol designer) that rule $j$ is executed as the next interaction rule once the scheduler has selected $(i_{1},i_{2})$ as the interacting pair, and by $p_{j}(z)$ the probability that $j$ is the next rule chosen in configuration $z$ (we have $p_{j}(z)=q_{j}\frac{z^{(i_{1}(j))}z^{(i_{2}(j))}}{n^{2}}(1-O(1/n))$ , where the $O(1/n)$ factor compensates the property of a scheduler which always selects a distinct pair of elements).

For any configuration $z_{0}\in Z$ , we define the $d$ -box $B_{d}(z_{0})$ around $z_{0}$ as the set of all states $z\in Z$ such that $z_{0}^{(i)}/d\leq z^{(i)}\leq d\max\{1,z_{0}^{(i)}\}$ , for all $1\leq i\leq k$ . We start the proof with the following property of boxes.

Lemma 26.

Fix $k\in\mathbb{Z}^{+}$ and let $0<\varepsilon_{1}<0.001$ be arbitrarily fixed. There exists $\varepsilon_{0}=\varepsilon_{0}(k,\varepsilon_{1})$ , $0<\varepsilon_{0}<\varepsilon_{1}$ , such that, for any interaction protocol $P$ with $k$ states and any configuration $z_{0}\in Z$ , there exists a value $\varepsilon=\varepsilon(P,z_{0})\in[\varepsilon_{0},\varepsilon_{1}]$ such that, for any rule $j$ of the protocol, $1\leq j\leq r$ , exactly one of the following bounds holds:

$(i)$ * for all $z\in B_{n^{\varepsilon_{0}}}(z_{0})$ , $p_{j}(z)\leq n^{\varepsilon-1}$ ,* 2. 2.

$(ii)$ * for all $z\in B_{n^{\varepsilon_{0}}}(z_{0})$ , $p_{j}(z)\geq n^{24\varepsilon-1}$ .*

and for any state $i$ , $1\leq i\leq k$ , exactly one of the following bounds holds:

$(iii)$ * for all $z\in B_{n^{\varepsilon_{0}}}(z_{0})$ , $z^{(i)}\leq n^{\varepsilon}$ ,* 2. 2.

$(iv)$ * for all $z\in B_{n^{\varepsilon_{0}}}(z_{0})$ , $z^{(i)}\geq n^{24\varepsilon}$ .*

Proof.

Let $k$ be fixed and let $\varepsilon_{0}=96^{-(k+k^{4}+f+1)}\leq 96^{-(k+r+f+1)}$ , where $f=\log_{2}(1/\varepsilon_{1})$ . Consider the (multi)set $M$ of real values $M:=\{\log_{n}\max\{n^{\varepsilon_{0}},z_{0}^{(i)}\}:i\in\{1,\ldots,r\}\}\cup\{\log_{n}\max\{n^{\varepsilon_{0}},np_{j}(z_{0})\}:j\in\{1,\ldots,r\}\}\subseteq[0,1]$ . Since $|M|=k+r$ , by the pigeonhole principle, there must exist an interval $I_{l}=[96^{-l},96^{-l+1})$ , for some $l\in\{f,\ldots,k+r+f\}$ , such that $I_{l}\cap M=\emptyset$ . Now, we set $\varepsilon=2\cdot 96^{-l}>96\varepsilon_{0}$ , we also have $\varepsilon<2\cdot 96^{-f}<\varepsilon_{1}$ . We immediately obtain that for any state $i$ , $1\leq i\leq k$ , we either have $z_{0}^{(i)}\leq n^{\varepsilon/2}$ or $z_{0}^{(i)}\geq n^{48\varepsilon}$ . Recalling that for any $z\in B_{n^{\varepsilon_{0}}}(z_{0})$ , $z_{0}^{(i)}/n^{\varepsilon_{0}}\leq z^{(i)}\leq n^{\varepsilon_{0}}\max\{1,z_{0}^{(i)}\}$ , claims $(iii)$ and $(iv)$ follow.

To show claims $(i)$ and $(ii)$ , notice that if rule $j$ , $j\in\{1,\ldots,r\}$ is such that $\min\{z_{0}^{(i_{1}(j))},z_{0}^{(i_{2}(j))}\}\leq n^{\varepsilon}$ , then for all $z\in B_{n^{\varepsilon_{0}}}(z_{0})$ we have $\min\{z^{(i_{1}(j))},z^{(i_{2}(j))}\}\leq n^{\varepsilon}$ (by (iii) and (iv)), and so $p_{j}(z)\leq n^{\varepsilon-1}$ by the properties of the random scheduler. Otherwise, we have $\min\{z_{0}^{(i_{1}(j))},z_{0}^{(i_{2}(j))}\}\geq n^{24\varepsilon}$ , and so $\frac{1}{2n^{2\varepsilon_{0}}}\leq p_{j}(z_{0})/p_{j}(z)\leq 2n^{2\varepsilon_{0}}$ , where we recall that $\varepsilon>96\varepsilon_{0}$ . Since we have $p_{j}(z_{0})\leq n^{\varepsilon/2}$ or $p_{j}(z_{0})\geq n^{48\varepsilon}$ , claims (i) and (ii) follow. ∎

Given any $k$ -state protocol $P$ , we will arbitrarily choose a value of $\varepsilon$ for which the claim of the above Lemma holds (e.g., the smallest possible such value of $\varepsilon$ ). Note that a similar analysis is also possible for protocols using a super-constant number of states in $n$ , however, then the value of $\varepsilon_{0}$ is dependent on $n$ ; retracing the arguments in the proof, we can choose appropriately $\varepsilon_{0}\geq n^{\exp[-O(k^{4})]}$ . (We make no effort to optimize the polynomial in $k$ in the exponent.)

In what follows, let $z_{0}$ be a fixed configuration of the protocol (admitting a certain property which we will define later). We will then consider a rule $j$ to be a low probability (LP) rule (writing $j\in LP$ ) in box $B_{n^{\varepsilon_{0}}}(z_{0})$ if it satisfies condition $(i)$ of the Lemma, and a high probability (HP) rule in this box (writing $j\in HP$ ) if it satisfies condition $(ii)$ . Note that $LP\cup HP=\{1,\ldots,r\}$ .

Likewise, for $1\leq i\leq k$ , we will classify $i$ as a low-representation (LR) state (writing $i\in LR$ ) in box $B_{n^{\varepsilon_{0}}}(z_{0})$ if $i$ satisfies condition $(iii)$ of the Lemma, and a high representation (HR) state (writing $i\in HR$ ) in this box if $i$ satisfies condition $(iv)$ . Note that $LR\cup HR=\{1,\ldots,k\}$ . Moreover, we define a set of very high representation (VHR) states, $VHR\subseteq HR$ , as the set of all $i$ such that for all $z^{\prime}\in B_{n^{\varepsilon_{0}}}(z_{0})$ , $z^{\prime}_{i}\geq n^{1-8\varepsilon}$ . Denoting $HR^{\prime}=HR\setminus VHR$ , we have by the definition of a box that for all $i^{\prime}\in HR^{\prime}$ , for all $z^{\prime}\in B_{n^{\varepsilon_{0}}}(z_{0})$ : $z^{\prime}_{i}\leq n^{1-8\varepsilon}/O(n^{2\varepsilon_{0}})<n^{1-6\varepsilon}$ .

From now on, we assume that configuration $z_{0}$ admits the following property: for $T=n^{1+2\varepsilon}$ , an execution of the protocol starting from configuration $z_{0}$ passes through a sequence of configurations $z_{t}$ , $t=1,2,\ldots,T$ , such that the configuration does not leave the box around $z_{0}$ in any step with sufficiently large probability, lower-bounded by some absolute constant $\Pi\in(0,1]$ :

[TABLE]

where $B$ is an arbitrarily fixed subset of $B_{n^{\varepsilon_{0}}}(z_{0})$ .

We now show that the above property has the following crucial implication: for an interacting pair involving selected high and very high representation states, a rule creating a low representation state can only be triggered with sufficiently small probability. Informally, it seldom happens that in the protocol a low representation state is created out of any high representation state.

Lemma 27.

For a protocol having the property given by Eq. (38), for $i_{1}\in HR$ and $i_{2}\in VHR$ , let $R_{i_{1},i_{2}}$ be the set of rules of the form $\{i_{1},i_{2}\}\to\{o_{1},o_{2}\}$ , taken over all $o_{1}\in LR,o_{2}\in[1,r]$ . Then, $\sum_{j\in R_{i_{1},i_{2}}}q_{j}=O(n^{-14\varepsilon})$ .

Proof.

Suppose, by contradiction, that $\sum_{j\in R_{i_{1},i_{2}}}q_{j}>3n^{-14\varepsilon}$ .

Associate with process $z_{t}$ a random variable $J_{t}\in\{0,1\}$ , defined as follows. For all $t<t_{e}$ , where $t_{e}$ is the first moment of time such that $z_{t_{e}}\notin B_{n^{\varepsilon_{0}}}(z_{0})$ , we put $J_{t}=1$ if a rule from $R_{i_{1},i_{2}}$ is used for the interaction made by the protocol in process $z_{t}$ at time $t$ , and set $J_{t}=0$ otherwise. For all $t\geq t_{e}$ , we set $J_{t}$ to $1$ . We have $\mathbb{E}(J_{t}|z_{1},\ldots,z_{t})\geq 2n^{2\varepsilon-1}$ ; indeed, for $t<t_{e}$ , it holds that:

[TABLE]

By a simple stochastic domination argument, $(J_{t})$ can be lower-bounded by a sequence of independent binomial trials with success probability $n^{2\varepsilon-1}$ , hence by an application of a multiplicative Chernoff bound for $T=n^{1+2\varepsilon}$ :

[TABLE]

where the $o(1)$ factor is exponentially small in $n$ .

We now show the following claim.

Claim. With probability $\Pi-o(1)$ , the following event holds: $z_{t}\in B$ for all $t\in[0,T)$ and the total number of rule activations in the time interval $[0,t)$ during which an agent changes state from a state in $LR$ to a different state is at most $O(kn^{3\varepsilon}$ ).

Proof (of claim). Acting similarly as before, we associate with process $z_{t}$ a random variable $L_{t}\in\{0,1\}$ , defined as follows. For all $t<t_{e}$ , we put $L_{t}=1$ if a rule acting on at least one agent in a state from $LR$ is made by the protocol in process $z_{t}$ at time $t$ , and set $L_{t}=0$ otherwise. For all $t\geq t_{e}$ , we set $L_{t}$ to a dummy variable set always to [math], i.a.r. We observe that:

[TABLE]

since $|LR|\leq k$ , and for $t<t_{e}$ , $z_{i}<n^{\varepsilon}$ for any $i\in LR$ , hence the scheduler selects an agent from a $LR$ state into an interacting pair with probability at most $2k\frac{n^{\varepsilon}}{n}$ . Applying an analogous argument as in the case of random variable $L_{t}$ , this time for the upper tail, we obtain:

[TABLE]

The claim follows directly.

Now, by a union bound we obtain:

[TABLE]

Taking into account that $t_{e}>T$ holds with probability $\Pi=\Omega(1)$ by (38), we have by a union bound that with probability at least $\Pi-o(1)=\Omega(1)$ , the following event holds: $z_{t}\in B$ for all $t\in[0,T]$ , $\sum_{t=1}^{T}J_{t}>n^{4\varepsilon}$ , and $L_{t}<4kn^{3\varepsilon}$ . However then $\sum_{t=1}^{T}J_{t}-\sum_{t=1}^{T}L_{t}>n^{4\varepsilon}-kn^{3\varepsilon}>kn^{\varepsilon}$ , so there must exist at time $T$ a state $i\in LR$ such that $z^{(i)}_{T}>n^{\varepsilon}$ . This is a contradiction with $z_{T}\in B\subseteq B_{n^{\varepsilon_{0}}}(z_{0})$ by Lemma 26(iii). ∎

In the rest of the proof, we consider the evolution of a protocol starting from configuration $z_{0}$ and having property (38). We compare this evolution to the evolution of the same protocol, starting from a perturbed configuration $z^{*}_{0}$ , such that:

(C1)

$\|z_{0}-z^{*}_{0}\|_{1}\leq n^{\varepsilon}$ . 2. (C2)

for all low representation states $i\in LR$ , we have $z^{*(i)}_{0}\leq z^{(i)}_{0}$ .

Intuitively, the perturbed state $z^{*}_{0}$ may correspond to removing a small number of agents from $z_{0}$ (and replacing them by high representation states for the sake of normalization), e.g., as in the case of the disappearance of a rumor source from a system which has already performed a rumor-spreading process.

Our objective will be to show that, with probability at least $\Pi-o(1)$ , after $T=n^{1+2\varepsilon}$ the process $z^{*}_{T}$ is still not far from $z_{0}$ , being constrained to a box in a similar way as process $z_{t}$ . To achieve this, we define a coupling between processes $z_{t}$ and $z^{*}_{t}$ (knowing that process $z_{t}$ is constrained to a box around $z_{0}$ with probability $\Pi$ ). Informally, the analysis proceeds as follows. We run the processes together for $T=n^{1+2\varepsilon}$ steps. In most steps, the 1-norm distance $\|z_{t}-z^{*}_{t}\|_{1}$ between the two processes remains unchanged, without exceeding $O(n^{3\varepsilon})$ . Otherwise, exactly one of the two processes executes a rule (and the other pauses). With a frequency of roughly $n^{\varepsilon}/n$ steps (i.e., roughly $n^{3\varepsilon}$ times in total during the process), an LP rule is executed which increases the distance between these two states. We think of this type of “error” as unfixable, contributing to the $O(n^{3\varepsilon})$ distance of the processes; however, such errors are relatively uncommon. With a higher frequency of roughly $n^{3\varepsilon}/n$ steps (i.e., roughly once every $n^{1-3\varepsilon}$ steps), a less serious “error” occurs, when some HP rule $\iota$ increases the distance between the two states. The rate of such errors is too high to leave them unfixed, and we have a time window of about $n^{1-3\varepsilon}$ steps to fix such an error (before the next such error occurs). We observe that since $\iota$ is an HP rule, which is activated with probability at least $n^{24\varepsilon-1}$ , rule $\iota$ will still be activated frequently during this time window. The coupling of transitions of states $z_{t}$ and $z^{*}_{t}$ is in this case performed so as to force the two processes to execute rule $\iota$ lazily, never at the same time. The number of executions of rule $\iota$ in the ensuing time window by each of the two processes follows the standard coupling pattern of a pair of lazy random walks on a line, initially located at distance $1$ , until their next meeting (cf. e.g. [2]). During this part of the coupling, we allow the distance $\|z_{t}-z^{*}_{t}\|_{1}$ to increase even up to $n^{6\varepsilon}$ (as a result of executions of rule $\iota$ ), but the entire contribution to the distance related to rule $\iota$ is reduced to [math] before the next HP rule “error” occurs, with sufficiently high probability (in this case, with probability $1-O(n^{-6\varepsilon})$ . Overall, the coupling is successful with probability $\Pi-O(n^{-\varepsilon})$ .

We remark that we use the bound on the number of states $k$ to enforce a sufficiently large polynomial separation between the frequencies of LR states and HR states, and likewise for LP rules and HP rules. We also implicitly assume that $k=n^{o(1)}$ , throughout the process. The analysis also works for a choice of $k=O(\log\log n)$ , with a sufficiently small hidden constant. The separation between LR/HR states and LP/HP rules is used in at least two places in the proof. First, it enforces that rules creating LR states from VHR states may appear in the definition of the protocol only with polynomially small probability (Lemma 27), which helps to maintain over time the invariant $z^{*(i)}_{t}\leq z^{(i)}_{t}$ , for all LR states. Secondly, we use the separation of LP/HP rules in the analysis of the coupling to show that a fixable “error” caused by a HP rule can be sufficiently quickly repaired, before new errors occur.

In the formalization of the coupling, we make both processes $z_{t}$ and $z^{*}_{t}$ lazy, i.e., add to each process an additional independent coin-toss at each step, and enforce that with probability $1/2$ no rule is executed in a given step (i.e., the step is skipped by the protocol). We assume a random scheduler which picks uniformly a random pair of nodes at each step. Thus, if the scheduler picks a pair of agents in states $\{i_{1},i_{2}\}$ , and $j$ is a rule acting on this pair of states, the probability that the interaction corresponding to rule $j$ will be $q_{j}/2$ . (The laziness of the process here is a purely technical assumption for the analysis, and corresponds to using a measure of time which is scaled by a factor of $2\pm o(1)$ w.h.p.; this does not affect the asymptotic statement of the theorem.)

We will also find it convenient to apply an auxiliary notation for representing the evolution of a state. For process $z_{t}$ (resp., $z^{*}_{t}$ , we define $\rho_{t}(j)$ (resp. $\rho^{*}_{t}(j)$ ), for all $j\in[1,r]$ , as the number of times rule $j$ has been executed since time [math]. Observe that the pair $(z_{0},(\rho_{t}(j):j\in[1,t])$ completely describes the evolution of a state (i.e., the order in which the rules were executed is irrelevant). Moreover, since each execution of a rule changes the states of at most $4$ agents, we have:

[TABLE]

Definition of the coupling.

At each step $t$ , we order the agents of configurations $z_{t}$ and $z^{*}_{t}$ , so that $a_{l}(t)$ denotes the type of the $l$ -th agent in $z_{t}$ and $a^{*}_{l}(t)$ is the type of the $l$ -th agent in $z^{*}_{t}$ . The orderings are such that $|\{l:a_{l}(t)=a^{*}_{l}(t)\}|$ is maximized; in particular, for any state $i$ such that $z^{(i)}(t)\leq z^{*(i)}(t)$ (respectively, $z^{*(i)}(t)\leq z^{(i)}(t)$ ) we have that if for some $l$ , $a_{l}(t)=i$ (resp., $a_{l}^{*}(t)=i$ ), then $a_{l}^{*}(t)=i$ (resp., $a_{l}(t)=i$ ). 2. 2.

The scheduler then picks a pair of distinct indices $l_{1},l_{2}\in\{1,\ldots,n\}$ as the pair of interacting agents.

2.1.

If $a_{l_{1}}{(t)}=a^{*}_{l_{1}}{(t)}$ and $a_{l_{2}}{(t)}=a^{*}_{l_{2}}{(t)}$ , then the same rule $j=j^{*}$ acting on the pair of states $(a_{l_{1}}{(t)},a_{l_{2}}{(t)})$ is chosen as the current interaction rule, with probability $q_{j}$ . 2. 2.2.

Otherwise, a pair of (clearly distinct) rules $j$ and $j^{*}$ are picked independently at random for $z_{t}$ and $z^{*}_{t}$ from among the rules available for state pairs $(a_{l_{1}}{(t)},a_{l_{2}}{(t)})$ and $(a_{l_{1}}^{*}{(t)},a_{l_{2}}^{*}{(t)})$ , with probabilities $q_{j}$ and $q_{j^{*}}$ , respectively. 3. 3.

The processes finally perform their coin tosses to decide which of the selected rules ( $j$ for $z_{t}$ and $j^{*}$ for $z^{*}_{t}$ ) will be applied in the current step.

3.1.

If $j=j^{*}$ and rule $j$ has been executed exactly the same number of times in the history of the two processes ( $\rho_{t}(j)=\rho^{*}_{t}(j)$ ), then with probability $1/2$ both of the processes execute rule $j$ , and with probability $1/2$ neither execute their rule. 2. 3.2.

If $j\neq j^{*}$ , or if $j=j^{*}$ and rule $j$ has been executed a different number of times in the history of the two processes ( $\rho_{t}(j)\neq\rho^{*}_{t}(j)$ ), then exactly one of the two processes performs its chosen rule and the other process waits, with the process performing the rule being chosen as $z_{t}$ or $z^{*}_{t}$ , with probability $1/2$ each.

The correctness of the coupling (i.e., that the marginals $z_{t}$ and $z^{*}_{t}$ each correspond to a valid execution of the given protocol under a random scheduler) is immediate to verify.

Lemma 28.

Let $z_{t}$ be a process satisfying property (38), and let $z^{*}_{0}$ satisfy conditions (C1) and (C2). Then, for $T=n^{1+2\varepsilon}$ , with probability $\Pi-O(n^{-\varepsilon})$ we have $\|z^{*}_{T}-z\|_{1}=O(n^{6\varepsilon})$ , for some $z\in B$ .

Proof.

To prove the claim, it suffices to show that with probability $\Pi-O(n^{-\varepsilon})$ the provided coupling succeeds, i.e., it maintains a sufficiently small difference $z_{T}(i)-z^{*}_{T}(i)$ for all states $i$ , with $z_{T}\in B$ .

In the analysis of the provided coupling, we will assume that the box condition $z_{t}\in B$ holds always throughout the process (otherwise, we assume the coupling does not succeed). To state this formally, we work with auxiliary processes $\bar{z}_{t}$ and $\bar{z}_{t}^{*}$ , given as $\bar{z}_{t}=z_{t}$ and $\bar{z}_{t}^{*}=z_{t}^{*}$ for all $t<t_{e}$ , where $t_{e}$ is the first moment of time such that $z_{t}\notin B_{n^{\varepsilon_{0}}}(z_{0})$ , and set to the dummy value $\bar{z}_{t}=\bar{z}_{t}^{*}=z_{0}$ for all $t\geq t_{e}$ . At the end of the process, we will thus have $\bar{z}_{T}=z_{T}$ and $\bar{z}_{T}^{*}=z_{T}^{*}$ with probability at least $\Pi$ . In the following, we silently assume that $t<t_{e}-1$ (in particular, that $z_{t}\in B$ and $z_{t+1}\in B$ ), and we will simply show that the coupling of $\bar{z}_{t}$ and $\bar{z}^{*}_{t}$ is successful with probability $1-n^{-\varepsilon}$ . The condition of $t\geq t_{e}-1$ is trivially handled.

In addition to the box condition (which is now enforced) we try to maintain, with sufficiently high probability, throughout the first $T$ steps of the process several invariants (all at a time), corresponding to the following events holding:

•

$F_{D}(t)$ : for all states $i\in LR$ , $\bar{z}_{t}^{*(i)}\leq\bar{z}_{t}^{(i)}$ . (LR domination condition)

•

$F_{LR}(t)$ : for all states $i\in LR$ , $\bar{z}_{t}^{*(i)}\leq\bar{z}_{t}^{(i)}\leq n^{\varepsilon}$ . (LR state condition)

•

$F_{LP}(t)$ : for all rules $j\in LP$ , $\max\{p_{j}(\bar{z}_{t}),p_{j}(\bar{z}^{*}_{t})\}\leq 2n^{\varepsilon-1}$ . (LP rule condition)

•

$F_{HR}(t)$ : for all states $i\in HR$ , $\min\{\bar{z}_{t}^{(i)},\bar{z}_{t}^{*(i)}\}\geq n^{24\varepsilon}/2$ . (HR state condition)

•

$F_{HP}(t)$ : for all rules $j\in HP$ , $\min\{p_{j}(\bar{z}_{t}),p_{j}(\bar{z}^{*}_{t})\}\geq n^{24\varepsilon-1}/2$ . (HP rule condition)

•

$F_{HR^{\prime}}(t)$ : for all states $i\in HR^{\prime}$ , $\max\{\bar{z}_{t}^{(i)},\bar{z}_{t}^{*(i)}\}\leq 2n^{1-6\varepsilon}$ . (HR’ state condition)

•

a family of possible events $S_{w,d}(t)$ , for some $d\in\{0,\ldots,4n^{3\varepsilon}\}$ and $w\in\{0,\ldots,n^{6\varepsilon}\}$ , with specific events defined as follows:

–

$S_{0,d}(t)$ holds if for all rules $j\in HP$ we have $\rho_{t}(j)=\rho^{*}_{t}(j)$ , and $\sum_{j\in LP}|\rho_{t}(j)-\rho^{*}_{t}(j)|=d$ . This implies, in particular, $\|\bar{z}_{t}^{*}-\bar{z}_{t}\|_{1}\leq 4d+n^{\varepsilon}\leq 5n^{3\varepsilon}$ . (identical rate of HP execution)

–

$S_{w,d}(t)$ for $w>0$ holds if there exists a rule $\iota\in HP$ such that for all rules $j\in HP\setminus\{\iota\}$ we have $\rho_{t}(j)=\rho^{*}_{t}(j)$ , $|\rho_{t}(\iota)-\rho^{*}_{t}(\iota)|=w$ , and moreover $\sum_{i\in LP}|\rho_{t}(i)-\rho^{*}_{t}(i)|=d$ . This implies, in particular, $\|\bar{z}_{t}^{*}-\bar{z}_{t}\|_{1}\leq 4d+4w+n^{\varepsilon}\leq 5n^{6\varepsilon}$ . (single HP execution difference)

We will call the coupling successful if for all $t\leq T$ , all events $F_{\cdot}(t)$ and some event $S_{w,d}(t)$ holds, and we will say it is a failure otherwise. (We remark that condition $F_{D}(t)$ is implied by condition $F_{LR}(t)$ , but we retain both for convenience in discussion.)

The analysis of the coupled process is now the following. First, we remark that all of the given events $F_{\cdot}(t)$ and event $S_{0,0}(t)$ hold for $t=0$ .

If the process meets condition $S_{0,d}$ at time $t$ and all conditions $F_{\cdot}(t)$ , then we have the following:

•

With probability at least $1-O(n^{3\varepsilon-1})$ , the coupling will follow clauses 2.1 and 3.1 of its definition, and the two processes $\bar{z}$ and $\bar{z}^{*}$ will execute the same rule $j$ (or both pause). Hence, we continue to step $t+1$ satisfying condition $S_{0,d}$ and all of the conditions $F_{\cdot}(t+1)$ , making use of the box condition for process $\bar{z}_{t}$ . (We note that, to show $F_{LP}(t)$ , when considering the special case of a rule involving a state from $LR$ , we can make use of $F_{LR}(t)$ and note that the activation probability of such a rule is bounded by $2n^{\varepsilon-1}$ due to the $n^{\varepsilon}$ bound on the population of a LR state).

•

With probability at most $O(n^{3\varepsilon-1})$ , the coupling will, however, select distinct rules, $j$ for $\bar{z}_{t}$ and $j^{*}$ for $\bar{z}_{t}^{*}$ , and will select exactly one of them to execute, say $j^{\prime}\in\{j,j^{*}\}$ .

–

If $j^{\prime}\in LP$ , which happens in the current step of the process with probability at most $2n^{\varepsilon-1}$ by $F_{LP}$ , then the event $S_{0,d+1}(t+1)$ will hold in the next step (provided $d+1\leq 4n^{3\varepsilon}$ ; otherwise, if $d+1>4n^{3\varepsilon}$ , we will say that the coupling has failed).

–

If $j^{\prime}\in HP$ , which happens in the coupling with probability $O(n^{3\varepsilon-1})$ (as bounded due to clause 2.2), then the event $S_{1,d}(t+1)$ will hold in the next step. The condition $F_{D}(t+1)$ requires more careful consideration. Taking into account that $F_{D}(t)$ holds, we need to consider two cases: either $j^{\prime}=j$ and the rule applied to $\bar{z}_{t}$ changed at least one of the two interacting states $\{i_{1}(j),i_{2}(j)\}$ , say $i_{1}(j)\in LR$ , so that $\bar{z}^{i_{1}(j)}(t)=\bar{z}^{*i_{1}(j)}(t)$ and $\bar{z}^{i_{1}(j)}(t+1)\leq\bar{z}^{*i_{1}(j)}(t+1)-1$ , or $j^{\prime}=j^{*}$ and the rule applied to $\bar{z}^{*}_{t}$ created a pair of states $\{o_{1}(j^{*}),o_{2}(j^{*})\}$ , say $o_{1}(j^{*})\in LR$ . In the first case, by the description of the ordering given in clause 1 of the definition of the coupling, the problem occurs only if one of the agents picked by the scheduler belongs to an $LR$ state, and the other agent is at a position in which the states of $\bar{z}$ and $\bar{z}^{*}$ differ in the ordering of the agents; hence, the probability that the coupling fails at this step is at most $O(\frac{kn^{\varepsilon}\cdot n^{3\varepsilon}}{n^{2}})\leq O(n^{5\varepsilon-2})$ . In the second case, we likewise analyze the ordering of the agents considered by the scheduler, and note that the interacting agent, which belongs to the part of the ordering in which $\bar{z}_{t}$ and $\bar{z}^{*}_{t}$ differ, must be in a HR state, since the agents in a LR state in $\bar{z}^{*}$ are matched by their counterparts in $\bar{z}$ (as noted in clause 1 of the discussion of the coupling). If the other interacting agent is in a state from $LR\cup HR^{\prime}$ , then such an event occurs with probability $O(\frac{n^{3\varepsilon}\cdot kn^{1-6\varepsilon}}{n^{2}})\leq O(n^{-2.9\varepsilon-1})$ , and we say that with this probability the coupling has failed. Finally, if the other interacting agent is in a state from $VHR$ , then by Lemma 27, we have that the probability of picking a rule under which the coupling fails is at most $O(n^{-14\varepsilon})$ , conditioned on the event $j\neq j^{*}$ holding, hence overall the probability of failure is $O(n^{-14\varepsilon}n^{3\varepsilon-1})=O(n^{-11\varepsilon-1})$ . Overall, we obtain that $F_{D}(t+1)$ holds with probability $O(n^{-2.9\varepsilon-1})$ . Given $F_{D}(t+1)$ , $S_{1,d(t+1)}$ , and the box condition, the remaining conditions $F_{\cdot}(t+1)$ follow directly.

Overall, we obtain that following a time $t$ satisfying $S_{0,d}(t)$ and all conditions $F_{\cdot}(t)$ , we reach the following successor state (see Fig. 8):

[TABLE]

At this point, before proceeding further, we can provide some intuition on the meaning of the respective events $S$ . The coupling process can be seen as a walk along the path $(S_{0,d}:d\leq 4n^{3\varepsilon}$ ), starting from state $S_{0,0}$ , and at each step, either staying in the current state $S_{0,d}$ , moving on to the next state $S_{0,d+1}$ , branching to a side branch $S_{1,d}$ (which we will analyze later), or failing. The process also fails if it reaches the endpoint of its path ( $d=4n^{3\varepsilon}$ ). Since the process is run for $T=n^{1+2\varepsilon}$ steps, the probability that failure will occur before the end of the path is reached is $O(n^{-0.9\varepsilon})$ , and the probability of reaching the end of the path and failing is exponentially small in $n^{\varepsilon}$ by a Chernoff bound (in expectation, the process will progress halfway along the path). Hence, we have that the process succeeds with probability $1-O(n^{-0.9\varepsilon})$ , or otherwise may fail in a side branch $S_{\cdot,d}$ .

A side branch is entered with probability $O(n^{\varepsilon-1})$ . To show that the coupling succeeds with the required probability, it suffices to show that we return from any state $S_{1,d}$ to state $S_{0,d}$ with probability at least $1-O(n^{-6\varepsilon})$ ; then, all (i.e., w.h.p. at most $O(n^{1+2\varepsilon}n^{3\varepsilon-1})=O(n^{5\varepsilon})$ ) excursions into side branches during the process will succeed with probability $1-O(n^{-\varepsilon})$ .

Consider now an excursion into a side branch $S_{w,d}$ ( $w\geq 1$ ) associated with a rule $\iota\in HP$ , which has been executed a different number of times in $\bar{z}_{t}$ and $\bar{z}^{*}_{t}$ . Now, if the process meets condition $S_{w,d}$ at time $t$ and all conditions $F_{\cdot}(t)$ , then we have the following:

•

With probability at least $1-O(n^{6\varepsilon-1})$ , the coupling will follow clause 2.1 of its definition, selecting a single rule $j$ .

–

If $j\neq\iota$ , then clause 3.1 will follow, and the two processes $\bar{z}$ and $\bar{z}^{*}$ will execute the same rule $j$ (or both pause). Hence, at time $t+1$ , all of the conditions $F_{\cdot}(t+1)$ and condition $S_{w,d}(t+1)$ is satisfied.

–

Else, the event $j=\iota$ occurs. The probability of such an event is denoted $\pi_{t}\in[p_{\iota}(\bar{z}_{t})-O(n^{6\varepsilon-1}),p_{\iota}(\bar{z}_{t})]$ (due to the conditioning performed in the first clause of the coupling); since $p_{\iota}(\bar{z}_{t})\geq n^{24\varepsilon-1}$ by the box condition for HP rules, it follows that $2\pi_{t}\geq n^{24\varepsilon-1}-O(n^{6\varepsilon-1})\geq n^{24\varepsilon-1}/2$ . Now, following clause 3.2 of the coupling, depending on which of the two processes $\bar{z}_{t}$ , $\bar{z}^{*}_{t}$ is chosen to execute the rule, with probability $\pi_{t}/2=:\pi^{\prime}_{t}$ the system moves to $S_{w-1,d}(t+1)$ , and with probability $\pi^{\prime}_{t}$ the system moves to $S_{w+1,d}(t+1)$ (unless $w+1>n^{6\varepsilon}$ , in which case the coupling has failed). As before, given there was no failure, all conditions $F_{\cdot}(t+1)$ are readily verified to be satisfied in the new time step.

•

With probability at most $O(n^{6\varepsilon-1})$ , for simplicity of analysis we assume the coupling has failed.

This time, for a time $t$ satisfying $S_{w,d}(t)$ for $w\geq 1$ and all conditions $F_{\cdot}(t)$ , we obtain the following distribution of successor states:

[TABLE]

The picture here corresponds to a lazy random walk along the side line $S_{w,d}$ for $w\in[0,n^{6\varepsilon}]$ , with an additional failure probability at each step. The walk starts at $w=1$ and ends with a return to the primary line $S_{0,d}$ if the endpoint $w=0$ is reached, or ends with failure if the other endpoint $w\geq n^{6\varepsilon}=:w_{\max}$ is reached. At each step, the walk is lazy (with probability of transition depending on the current step), but unbiased with respect to transitions to the left or to the right. Assuming that failure does not occur sooner, with probability $1-O(\frac{1}{w_{\max}})=1-O(n^{-6\varepsilon})$ the walk will reach point $w=0$ in $O(w_{\max}^{2})=O(n^{12\varepsilon})$ moves (transitions along the line), without reaching the other endpoint of the line sooner. Since a move is made in each step $t$ with probability $\pi^{\prime}_{t}\geq n^{24\varepsilon-1}/4$ , by a straightforward Chernoff bound, the number of steps spent on this line is given w.h.p. as at most $O(n^{12\varepsilon}/n^{24\varepsilon-1})=O(n^{1-12\varepsilon})$ . As the probability of failure in each of these steps is $O(n^{6\varepsilon-1})$ , the probability that the process fails during these steps is $O(n^{-6\varepsilon})$ . Overall, by a union bound, we obtain that the process successfully returns to $S_{0,d}$ with probability $1-O(n^{-6\varepsilon})$ (and within $O(n^{1-12\varepsilon})$ steps). In view of the previous observations, we have that with probability $1-O(n^{-\varepsilon})$ , all conditions $F_{\cdot}$ and some condition $S_{w,d}$ hold at time $T$ . Thus, with probability $\Pi-n^{-\varepsilon}$ , process $z^{*}_{T}$ is sufficiently close to $B$ , i.e., there exists a point $z\in B$ such that $\|z^{*}_{T}-z\|_{1}=O(n^{6\varepsilon})$ . ∎

9 Proof of Proposition 2

Proof.

Fix protocol $P$ with set of states $K$ , in which the minimum positive probability of executing some rule is $p$ . Let $K^{\prime}\subseteq K$ , $K^{\prime}\ni X$ be any minimal subset of the set of states such that no evolution of protocol $P$ starting in a configuration containing only states from set $K^{\prime}$ will ever contain an agent in a state outside $K^{\prime}$ . Denote $\kappa=|K^{\prime}|-1$ . Consider an initialization of protocol $P$ at time $t_{0}=0$ , at a configuration $z(0)$ with $x\in[c,1/2]$ and with all other states from $K^{\prime}$ represented by the same number of agents, i.e., for each $Q\in K^{\prime}$ , we have $q(0)=(1-x)/\kappa$ .

Let $t\geq kn$ be an arbitrarily chosen time step. Let $t_{1}=t-(\kappa-1)n$ . Fix $Q_{1}\in K^{\prime}\setminus\{X\}$ as any state such that $q_{1}(t_{1})\geq 1/2\kappa$ (we can, e.g., fix $q_{1}$ as the state from $K^{\prime}\setminus\{X\}$ having the most agents at time $t_{1}$ ). Observe that from the minimality of $K^{\prime}$ it follows that there must exist a sequence of states $(Q_{1},\ldots,Q_{\kappa})$ , with $\{Q_{1},\ldots,Q_{\kappa}=K^{\prime}\setminus\{X\}$ , such that in the definition of protocol $P$ , for all $i\in\{1,\ldots,\kappa-1\}$ , some rule of protocol $P$ creates at least one agent (i.e., either 1 or 2 agents) in state $Q_{i+1}$ from an interaction of either the pair of agents in states $(Q_{j},Q_{i})$ or the pair of agents in states $(Q_{i},Q_{j})$ , for some $j\in\{1,\ldots,i\}$ . (Indeed, if for some $i$ there was no possibility of choosing $Q_{i+1}$ in any way, then $K^{\prime\prime}=\{X,Q_{1},\ldots Q_{i}\}\subseteq K^{\prime}$ would be closed under agent creation, contradicting the minimality of the choice of $K^{\prime}$ .) Now, we consider intervals of time steps $[t_{s},t_{s+1}]$ , with $t_{s}=t_{1}+(s-1)n$ for $s>1$ . We make the following claims:

(1)

Fix $i\in\{1,2,\ldots\kappa\}$ . If $q_{i}(t_{s})=\Omega(1)$ , then $q_{i}(t)\geq 0.11q_{i}(t_{s})$ for all $t\in[t_{s},t_{s+1}$ , with probability $1-e^{-n^{\Omega}(1)}$ . Indeed, in a sequence of $n$ steps, the expected number of agents which do not participate in any interaction in the time interval $[t_{s},t_{s+1}]$ following the asynchronous scheduler is $(\frac{n-2}{n})^{n}n>0.13n$ , and thus the number of non-interacting agents is at least $0.12n$ , with probability $1-e^{-n^{\Omega}(1)}$ following standard concentration bounds for the number of isolated vertices in a random graph on $n$ nodes with $n$ edges. Since the choice of agents by the scheduler is independent of their state, and the probability for a uniformly random agent to be in state $Q_{i}$ at time $t_{s}$ is $q_{i}(t_{s})$ , a simple concentration bound shows that $0.11q_{i}(t_{s})n$ having state $Q_{i}$ at time $t_{s}$ do not participate in any interaction in the interval $[t_{s},t_{s+1}]$ .

(2)

Fix $i\in\{1,2,\ldots\kappa-1\}$ . Denote $m_{i}=\min_{j\leq i}q_{j}(t_{i})$ . If $m_{i}=\Omega(1)$ , then $q_{i+1}(t_{i+1})\geq 0.01pm_{i}^{2}$ , with probability $1-e^{-n^{\Omega}(1)}$ . Indeed, consider the value $j\leq i$ such that the interaction $(Q_{j},Q_{i})$ or $(Q_{i},Q_{j})$ creates an agent in state $Q_{i+1}$ . At any time $t$ within the interval $[t_{i},t_{i+1}]$ , we have by Claim (1) that $q_{j}(t)\geq 0.11m_{i}$ and $q_{i}(t)\geq 0.11m_{i}$ , with probability $1-e^{-n^{\Omega}(1)}$ . It follows that an interaction creating a new agent in state $Q_{i+1}$ is triggered with probability at least $p(0.11m_{i})^{2}$ at each step. The number of agents in state $Q_{i+1}$ at time step $t_{i+1}$ may thus be dominated from below by the number of successes in a sequence of $n$ Bernoulli trials with success probability $p(0.11m_{i})^{2}$ , and the claim follows.

By applying Claim (2) iteratively for $i=\{1,2,\ldots\kappa-1\}$ , where we note that $m_{1}\geq 1/2\kappa$ , we have $q_{i+1}(t_{i+1})\geq(0.01p/2\kappa)^{2^{i}}$ , with probability $1-e^{-n^{\Omega}(1)}$ (through successive union bounds). Then, applying Claim (1) up to time $t=t_{\kappa-1}$ , we have $q_{i+1}(t_{\kappa-1})\geq 0.11^{\kappa-1-i-1}(0.01p/2\kappa)^{2^{i}}\geq(0.01p/2\kappa)^{2^{\kappa}}$ , with probability $1-e^{-n^{\Omega}(1)}$ . Applying once again a union bound, we have shown that for all $q\in K^{\prime}$ , we have $q(t)\geq(0.01p/2\kappa)^{2^{\kappa}}\equiv C_{0}$ , with probability $1-e^{-n^{\Omega}(1)}$ . The claim of the lemma follows for a suitable choice of $\delta_{0}>0$ .

∎

Acknowledgment.

We sincerely thank Dan Alistarh and Przemek Uznański for inspiring discussions, and Lucas Boczkowski for many detailed comments which helped to improve this manuscript.

Bibliography46

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] M. A. Abdullah and M. Draief. Majority consensus on random graphs of a given degree sequence. Co RR , abs/1209.5025, 2012.
2[2] D. Aldous and J. A. Fill. Reversible markov chains and random walks on graphs, 2002. Unfinished monograph, recompiled 2014, available at http://www.stat.berkeley.edu/~aldous/RWG/book.html .
3[3] D. Alistarh, J. Aspnes, D. Eisenstat, R. Gelashvili, and R. L. Rivest. Time-space trade-offs in population protocols. In Proc. Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2017, Barcelona, Spain , pages 2560–2579, 2017.
4[4] D. Alistarh, J. Aspnes, and R. Gelashvili. Space-optimal majority in population protocols. (To appear, SODA 2018.) Co RR , abs/1704.04947, 2017.
5[5] D. Alistarh, B. Dudek, A. Kosowski, D. Soloveichik, and P. Uznanski. Robust detection in leak-prone population protocols. In DNA , volume 10467 of Lecture Notes in Computer Science , pages 155–171. Springer, 2017.
6[6] D. Angluin, J. Aspnes, Z. Diamadi, M. J. Fischer, and R. Peralta. Computation in networks of passively mobile finite-state sensors. Distributed Computing , 18(4):235–253, 2006.
7[7] D. Angluin, J. Aspnes, and D. Eisenstat. A simple population protocol for fast robust approximate majority. Distributed Computing , 21(2):87–102, 2008.
8[8] D. Angluin, J. Aspnes, D. Eisenstat, and E. Ruppert. The computational power of population protocols. Distributed Computing , 20(4):279–304, 2007.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Universal Protocols for Information Dissemination

Abstract

1 Introduction

1.1 Problems and Model

1.2 Our Results

1.3 Comparison to the State-of-the-Art

Other work on the problems.

Originality of methods.

1.4 Other Related Work

Rumor spreading.

Population protocols.

Nonlinearity in interaction protocols.

2 Preliminaries: Building Blocks for Population Protocols

2.1 Protocol Definition

2.2 Protocol Composition Technique

3 Overview of Protocol Designs

3.1 Main Routine: Input-Controlled Oscillator Protocol PoP_{o}Po​

Theorem 1**.**

3.2 Protocols for BitBroadcast

Theorem 2** (Protocol for BitBroadcast).**

Observation 1**.**

3.3 Protocol for Detect

Theorem 3** (Protocol for Detect).**

4 Impossibility Results for Protocols without Non-Stationary Effects

Theorem 4** (Fixed points preclude fast stabilization).**

5 Input-Controlled Behavior of Protocols for Detect

Proposition 1**.**

Proposition 2**.**

6 Analysis of Oscillator Dynamics PoP_{o}Po​

6.1 Preliminaries: Discrete vs. Continuous Dynamics

Notation.

Warmup: the RPS oscillator.

6.2 Proof Outline of Theorem 1

6.3 Properties of the Oscillator

6.4 Stopping in O(nlog⁡n)O(n\log n)O(nlogn) Sequential Steps in the Absence of a Source

Overview of the proof.

Phase with δ≤s/12\delta\leq s/12δ≤s/12.

Lemma 1**.**

Proof.

Lemma 2**.**

Lemma 3**.**

Proof.

Lemma 4**.**

Proof.

Lemma 5**.**

Proof.

Phase with δ>s/12\delta>s/12δ>s/12.

Lemma 6**.**

Proof.

Lemma 7**.**

Proof.

Lemma 8**.**

Proof.

Lemma 9**.**

Proof.

Lemma 10**.**

Proof.

Lemma 11**.**

6.5 Operation of the Oscillator in the Presence of a Source

Lemma 12**.**

Proof.

Lemma 13**.**

Proof.

Lemma 14**.**

Proof.

Lemma 15**.**

Proof.

7 Analysis of Protocol for Detect

7.1 Further Properties of the Oscillator

Lemma 16**.**

Proof.

Lemma 17**.**

Proof.

3.1 Main Routine: Input-Controlled Oscillator Protocol $P_{o}$

Theorem 1.

Theorem 2 (Protocol for BitBroadcast).

Observation 1.

Theorem 3 (Protocol for Detect).

Theorem 4 (Fixed points preclude fast stabilization).

Proposition 1.

Proposition 2.

6 Analysis of Oscillator Dynamics $P_{o}$

6.4 Stopping in $O(n\log n)$ Sequential Steps in the Absence of a Source

Phase with $\delta\leq s/12$ .

Lemma 1.

Lemma 2.

Lemma 3.

Lemma 4.

Lemma 5.

Phase with $\delta>s/12$ .

Lemma 6.

Lemma 7.

Lemma 8.

Lemma 9.

Lemma 10.

Lemma 11.

Lemma 12.

Lemma 13.

Lemma 14.

Lemma 15.

Lemma 16.

Lemma 17.

Lemma 18.

Lemma 19.

7.2 Protocol Extension $P_{m}$ : Majority

Lemma 20.

Lemma 21.

Lemma 22.

Lemma 23.

Lemma 24.

Lemma 25.

7.3 Protocol Extension $P_{l}$ : Detection with Lights

Lemma 26.

Lemma 27.

Lemma 28.