Universal Protocols for Information Dissemination Using Emergent Signals
Bartlomiej Dudek, Adrian Kosowski (GANG)

TL;DR
This paper introduces universal, rapid, and practical protocols for information dissemination in decentralized populations, capable of broadcasting and source detection with convergence in logarithmic squared time, leveraging oscillatory dynamics.
Contribution
It presents the first protocols that are universal, fast, and simple for broadcasting and source detection, utilizing self-organizing oscillatory behavior.
Findings
Protocols achieve $O( ext{log}^2 n)$ convergence time with high probability.
Broadcasting protocol is exact, ensuring all agents learn the source state.
Source detection protocol has one-sided error on a small fraction of the population.
Abstract
We consider a population of agents which communicate with each other in a decentralized manner, through random pairwise interactions. One or more agents in the population may act as authoritative sources of information, and the objective of the remaining agents is to obtain information from or about these source agents. We study two basic tasks: broadcasting, in which the agents are to learn the bit-state of an authoritative source which is present in the population, and source detection, in which the agents are required to decide if at least one source agent is present in the population or not.We focus on designing protocols which meet two natural conditions: (1) universality, i.e., independence of population size, and (2) rapid convergence to a correct global state after a reconfiguration, such as a change in the state of a source agent. Our main positive result is to show thatâŠ
| Problem: | BitBroadcast | Detection |
|---|---|---|
|
Non-stationarity property:
(Applies to all fast -state protocols) |
no fixed points while source transmits random bits | no fixed points while source is present |
| New protocols with emergent signal: | universal, 74 states | universal, 55 states |
| Convergence time: | ||
| â No error (exact output) | impossible | |
| â One-sided -error | ||
| Other protocols with -states: | Clock-Sync (in synchronized round model) [11] | Time-to-Live [5] |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDNA and Biological Computing · Modular Robots and Swarm Intelligence · Distributed systems and fault tolerance
Universal Protocols for Information Dissemination
Using Emergent Signals
BartĆomiej Dudek
University of WrocĆaw, Poland
Adrian Kosowski111Corresponding Author. Email: [email protected]
Inria Paris, France
Abstract
We consider a population of agents which communicate with each other in a decentralized manner, through random pairwise interactions. One or more agents in the population may act as authoritative sources of information, and the objective of the remaining agents is to obtain information from or about these source agents. We study two basic tasks: broadcasting, in which the agents are to learn the bit-state of an authoritative source which is present in the population, and source detection, in which the agents are required to decide if at least one source agent is present in the population or not.
We focus on designing protocols which meet two natural conditions: (1) universality, i.e., independence of population size, and (2) rapid convergence to a correct global state after a reconfiguration, such as a change in the state of a source agent. Our main positive result is to show that both of these constraints can be met. For both the broadcasting problem and the source detection problem, we obtain solutions with a convergence time of rounds, w.h.p., from any starting configuration. The solution to broadcasting is exact, which means that all agents reach the state broadcast by the source, while the solution to source detection admits one-sided error on a -fraction of the population (which is unavoidable for this problem). Both protocols are easy to implement in practice and have a compact formulation.
Our protocols exploit the properties of self-organizing oscillatory dynamics. On the hardness side, our main structural insight is to prove that any protocol which meets the constraints of universality and of rapid convergence after reconfiguration must display a form of non-stationary behavior (of which oscillatory dynamics are an example). We also observe that the periodicity of the oscillatory behavior of the protocol, when present, must necessarily depend on the number \text{{}^{#}}\!\!X of source agents present in the population. For instance, our protocols inherently rely on the emergence of a signal passing through the population, whose period is \Theta(\log\frac{n}{\text{{}^{#}}\!\!X}) rounds for most starting configurations. The design of clocks with tunable frequency may be of independent interest, notably in modeling biological networks.
Key words: Gossiping, Epidemic processes, Oscillatory dynamics, Emergent phenomena,
Population protocols, Broadcasting, Distributed clock synchronization.
1 Introduction
Information-spreading protocols, and more broadly epidemic processes, appear in nature, social interactions between humans, as well as in man-made technology, such as computer networks. For some protocols we have a reasonable understanding of the extent to which the information has already spread, i.e., we can identify where the information is located at a given step of the process: we can intuitively say which nodes (or agents) are âinformedâ and which nodes are âuninformedâ. This is the case for usual protocols in which uninformed agents become informed upon meeting a previously informed agent, cf. e.g. mechanisms of rumor spreading and opinion spreading models studied in the theory community [29, 26]). Arguably, most man-made networking protocols for information dissemination also belong to this category.
By contrast, there exists a broad category of complex systems for which it is impossible to locate which agents have acquired some knowledge, and which are as yet devoid of it. In fact, the question of âwhere the information learned by the system is locatedâ becomes somewhat fuzzy, as in the case of both biological and synthetic neural networks. In such a perspective, information (or knowledge) becomes a global property of the entire system, whereas the state of an individual agent represents in principle its activation, rather than whether it is informed or not. As such, knowledge has to be treated as an emergent property of the system, i.e., a global property not resulting directly from the local states of its agents. The convergence from an uninformed population to an informed population over time is far from monotonous. Even so, once some form of âsignalâ representing global knowledge has emerged, agents may try to read and copy this signal into their local state, thus each of them eventually also becomes informed. At a very informal conceptual level, we refer to this category of information-dissemination protocols as protocols with emergent behavior. At a more technical level, emergent protocols essentially need to rely on non-linear dynamical effects, which typically include oscillatory behavior, chaotic effects, or a combination of both. (This can be contrasted with simple epidemic protocols for information-spreading, in which nodes do not become deactivated.)
This work exhibits a simple yet fundamental information-spreading scenario which can only be addressed efficiently using emergent protocols. Both the efficient operation of the designed protocols, and the need for non-stationary dynamical effects in any efficient protocol for the considered problems, can be formalized through rigorous theoretical analysis. Our goal in doing this is twofold: to better understand the need for emergent behavior in real-world information spreading, and to display the applicability of such protocols in man-made information spreading designs. For the latter, we describe an interpretation of information as a (quasi-)periodic signal, which can be both decoded from states of individual nodes, and encoded into them.
1.1 Problems and Model
We consider a population of identical agents, each of which may be in a constant number of possible states. Interactions between agents are pairwise and random. A fair scheduler picks a pair of interacting agents independently and uniformly at random in each step. The protocol definition is provided through a finite sequence of state transition rules, following the precise conventions of the randomized Population Protocol model [6, 8] or (equivalently) of Chemical Reaction Networks [21].222The activation model is thus asynchronous. The same protocols may be deployed in a synchronous setting, with scheduler activations following, e.g., the independent random matching model (with only minor changes to the analysis) or the PULL model [29] (at the cost of significantly complicating details of the protocol formulation).
The input to the problem is given by fixing the state of some subset of agents, to some state of the protocol, which is not available to any of the other agents. Intuitively, the agents whose state has been fixed are to be interpreted as authoritative sources of information, which is to be detected and disseminated through the network (i.e., as the rumor source node, broadcasting station, etc.). For example, the problem of spreading a bit of information through the system is formally defined below.
Problem BitBroadcast
Input States:
.
Promise:
The population contains a non-zero number of agents in exactly one of the two input states .
Question:
Decide if the input state present in the population is or .
We can, e.g., consider that the transmitting station (or stations) choose whether to be in state or in a way external to the protocol, and thus transmit the âbitâ value or , respectively, through the network. Broadcasting a bit is one of the most fundamental networking primitives.
The definition of the population protocol includes a partition of the set of states of the protocol into those corresponding to the possible answers to the problem. When the protocol is executed on the population, the output of each agent may be read at every step by checking, for each agent, whether its state belongs to the subset of an output state with a given answer (in this case, the answer of the agent will be the âbitâ it has learned, i.e., 1 or 2). We will call a protocol exact if it eventually converges to a configuration, such that starting from this configuration all agents always provide the correct answer. We will say it operates with -error, for a given constant , if starting from some step, at any given step of the protocol, at most an -fraction of the population holds the incorrect answer, with probability .
Time is measured in steps of the scheduler, with time steps called a round, with the expected number of activations of each agent per round being a constant. Our objective is to design protocols which converge to the desired outcome rapidly. Specifically, a protocol is expected to converge in rounds (i.e., in steps), with probability , starting from any possible starting configuration of states in the population, conforming to the promise of the problem.333We adhere to this strong requirement for self-stabilizing (or self-organizing) behavior from any initial configuration in the design of our protocols. The presented impossibility results still hold under significantly weaker assumptions.
Motivated by both applications and also a need for a better understanding of the broadcasting problem, we also consider a variant of the broadcasting problem in which no promise on the presence of the source is given. This problem, called Detection, is formally defined below.
Problem Detection
Input State:
.
Question:
Decide if at least one agent in state is present in the population.
Detection of the presence of a source is a task which is not easier than broadcasting a bit. Indeed, any detection protocol is readily converted into a broadcasting protocol for states by identifying and treating as a dummy state which does not enter into any interactions (i.e., is effectively not visible in the network). Intuitively, the detection task in the considered setting is much harder: a source may disappear from the network at any time, forcing other agents to spontaneously âunlearnâ the outdated information about the presence of the source. This property is inherently linked to the application of the Detection problem in suppressing false rumors or outdated information in social interactions. Specifically, it may happen that a certain part of a population find themselves in an informed state before the original rumor source is identified as a source of false information, a false rumor may be propagated accidentally because of an agent which previously changed state from âuninformedâ to âinformedâ due to a fault or miscommunication, or the rumor may contain information which is no longer true. Similar challenges with outdated information and/or false-positive activations are faced in Chemical Reaction Networks, e.g. in DNA strand displacement models [5]. In that context, the detection problem has the intuitive interpretation of detecting if a given type of chemical or biological agent (e.g., a contaminant, cancer cell, or hormonal signal) is present in the population, and spreading this information among all agents.
1.2 Our Results
In Section 3, we show that both the BitBroadcast, and the Detection problem can be solved with protocols which converge in rounds to an outcome, with probability , starting from any configuration of the system. The solution to BitBroadcast guarantees a correct output. The solution to Detection admits one-sided -error: in the absence of a source, all agents correctly identify it as absent, whereas when the source is present, at any moment of time after convergence the probability that at least agents correctly identify the source as present is at least .444The existence of one-sided error is inherent to the Detection problem in the asynchronous setting: indeed, if no agent of the population has not made any communication with the source over an extended period of time, it is impossible to tell for sure if the source has completely disappeared from the network, or if it is not being selected for interaction by the random scheduler. Here, is a constant influencing the protocol design, which can be made arbitrarily small.
The designed protocols rely on the same basic building block, namely, a protocol realizing oscillatory dynamics at a rate controlled by the number of present source states in the population. Thus, these protocols display non-stationary behavior. In Section 4, we show that such behavior is a necessary property in the following sense. We prove that in any protocol which solves Detection in sub-polynomial time in and which uses a constant number of states, the number of agents occupying some state has to undergo large changes: by a polynomially large factor in during a time window of length proportional to the convergence time of the protocol. For the BitBroadcast problem, we show that similar volatile behavior must appear in a synthetic setting in which a unique source is transmitting its bit as random noise (i.e., selecting its input state uniformly at random in subsequent activations).
We note that, informally speaking, our protocols rely on the emergence of a âsignalâ passing through the population, whose period is \Theta(\log\frac{n}{\text{{}^{#}}\!\!X}) rounds when the number of source agents in state is \text{{}^{#}}\!\!X. In Section 5, we then discuss how the behavior of any oscillatory-type protocol controlled by the existence of \text{{}^{#}}\!\!X has to depend on both and \text{{}^{#}}\!\!X. We prove that for any such protocol with rapid convergence, the cases of subpolynomial \text{{}^{#}}\!\!X and \text{{}^{#}}\!\!X=\Theta(n) can be separated by looking at the portion of the configuration space regularly visited by the protocol. This, in particular, suggests the nature of the dependence of the oscillation period on the precise value of \text{{}^{#}}\!\!X, and that the protocols we design with period \Theta(\log\frac{n}{\text{{}^{#}}\!\!X}) are among the most natural solutions to the considered problems.
The proofs of all theorems are deferred to the closing sections of the paper.
1.3 Comparison to the State-of-the-Art
Our work fits into lines of research on rumor spreading, opinion spreading, population protocols and other interaction models, and emergent systems. We provide a more comprehensive literature overview of some of these topics in Subsection 1.4.
Other work on the problems.
The BitBroadcast problem has been previously considered by Boczkowski, Korman, and Natale in [11], in a self-stabilizing but synchronous (round-based) setting. A protocol solving their problem was presented, giving a stabilized solution in time, using a number of states of agents which depends on population size, but exchanging messages of bit size (this assumption can be modeled in the population protocol framework as a restriction on the permitted rule set). In this sense, our result can be seen as providing improved results with respect to their approach, since it is applicable in an asynchronous setting and reduces the number of states to constant (the latter question was open [11]). We remark that their protocol has a more general application to the problem of deciding which of the sources or is represented by a larger number of agents, provided these two numbers are separated by a multiplicative constants. (Our approach could also be used in such a setting, but the required separation of agent numbers to ensure a correct output would have to be much larger: we can compare values if their logarithms are separated by a multiplicative constant.) The protocol of [11] involves a routine which allows the population to create a synchronized modulo clock, working in a synchronous setting. The period of this clock is independent of the input states of the protocol, which should be contrasted with the oscillators we work on in this paper, which encode the input into a signal (with a period depending on the number of agents in a given input state).
The Detect problem was introduced in a work complementary to this paper [5]. Therein we look at applications of the confirmed rumor spreading problem in DNA computing, focusing on performance on protocols based on a time-to-live principle and on issues of fault tolerance in a real-world model with leaks. The protocols designed there require states, and while self-stabilizing, do not display emergent behavior (in particular, agents can be categorized as âinformedâ and âuninformedâ, the number of correctly informed agents tends to increase over time, while the corresponding continuous dynamical system stabilizes to a fixed point attractor).
Originality of methods.
The oscillatory dynamics we apply rely on an input-parameter-controlled oscillator. The uncontrolled version of the oscillator which we consider here is the length- cyclic oscillator of the cyclic type, known in population dynamics under the name of rock-paper-scissors (or RPS). This has been studied intensively in the physics and evolutionary dynamics literature (cf.e.g. [46] for a survey), while algorithmic studies are relatively scarce [15]. We remark that the uncontrolled cyclic oscillator with a longer (but length cycle) has been applied for clock/phase synchronization in self-stabilizing settings and very recently in the population protocol setting when resolving the leader election problem [25]. (The connection to oscillatory dynamics is not made explicit, and the longer cycle provides for a neater analysis, although it does not seem to be applicable to our parameter-controllable setting.) Whereas we are not aware of any studies of parameter-controlled oscillators in a protocol design setting (nor for that matter, of rigorous studies in other fields), we should note that such oscillators have frequently appeared in models of biological systems, most notably in biological networks and neuroendocrinology ([27] for a survey). Indeed, some hormone release and control mechanisms (e.g., for controlling GnRH surges in vertebrates) appear to be following a similar pattern. To the best of our knowledge, no computational (i.e., interaction-protocol-based) explanation for these mechanisms has yet been proposed, and we hope that our work may provide, specifically on the Detect problem, may provide some insights in this direction.
In terms of lower bounds, we rely on rather tedious coupling techniques for protocols allowing randomization, and many of the details are significantly different from lower-bound techniques found in the population protocols literature. We remark that a recent line of work in this area [21, 3] provides a powerful set of tools for proving lower bounds on the number of states (typically states) for fast (typically polylogarithmic) population protocols for different problems, especially for the case of deterministic protocols. We were unable to leverage these results to prove our lower bound for the randomized scenario studied here, and believe our coupling analysis is complementary to their results.
1.4 Other Related Work
Our work fits into the line of research on rumor spreading, population protocols, and related interaction models. Our work also touches on the issue of how distributed systems may spontaneously achieve some form of coordination with minimum agent capabilities. The basic work in this direction, starting with the seminal paper [32], focuses on synchronizing timers through asynchronous interprocess communication to allow processes to construct a total ordering of events. A separate interesting question concerns local clocks which, on their own, have some drift, and which need to synchronize in a network environment (cf. e.g. [37, 35], or [34] for a survey of open problems).
Rumor spreading.
Rumor spreading protocols are frequently studied in a synchronous setting. In a synchronous protocol, in each parallel round, each vertex independently at random activates a local rule, which allows it either to spread the rumor (if it is already informed), or possibly also to receive it (if it has not yet been informed, as is the case in the push-pull model). The standard push rumor spreading model assumes that each informed neighbor calls exactly one uninformed neighbor. In the basic scenario, corresponding to the complete interaction network, the number of parallel rounds for a single rumor source to inform all other nodes is given as , with high probability [42, 24]. More general graph scenarios have been studied in [22] in the context of applications in broadcasting information in a network. Graph classes studied for the graph model include hypercubes [22], expanders [45], and other models of random graphs [23]. The push-pull model of rumor spreading is an important variation: whereas for complete networks the speedup due to the pull process is in the order of a multiplicative constant [29], the speed up turns out to be asymptotic, e.g., on preferential attachment graphs, where the rumor spreading time is reduced from rounds in the push model to rounds in the push-pull model [18], as well as on other graphs with a non-uniform degree distribution. The push-pull model often also proves more amenable to theoretical analysis. We note that asynchronous rumor spreading on graphs, in models closer to our random scheduler, has also been considered in recent work [40, 26], with [26] pointing out the tight connections between the synchronous (particularly push-pull) and asynchronous models in general networks.
Population protocols.
Population protocols are a model which captures the way in which the complex behavior of systems (biological, sensor nets, etc.) emerges from the underlying local interactions of agents. The original model of Angluin et al. [6, 7] was motivated by applications in sensor mobility. Despite the limited computational capabilities of individual sensors, such protocols permit at least (depending on available extensions to the model) the computation of two important classes of functions: threshold predicates, which decide if the weighted average of types appearing in the population exceeds a certain value, and modulo remainders of similar weighted averages. The majority function, which belongs to the class of threshold functions, was shown to be stably computable for the complete interaction graph [6]; further results in the area of majority computation can be found in [7, 9, 38, 10]. A survey of applications and models of population protocols is provided in [9, 39]. An interesting line of research is related to studies of the algorithmic properties of dynamics of chemical reaction networks [21]. These are as powerful as population protocols, though some extensions of the chemical reaction model also allow the population size to change in time. Two very recent results in the population protocol model are worthy of special attention. Alistarh, Aspnes, and Gelashvili [4] have resolved the question of the number of states required to solve the Majority problem on a complete network in polylogarithmic time as . For the equally notable task of Leader Election, the papers of Gasieniec and Stachowiak [25] (for the upper bound) together with the work of Alistarh, Aspnes, Eisenstat, Gelashvili, and Rivest [3] (for the lower bound) put the number of states required to resolve this question in polylogarithmic time as . Both of these results rely on a notion of a self-organizing phase clock.
Nonlinearity in interaction protocols.
Linear dynamical systems, as well as many nonlinear protocols subjected to rigorous analytical study, have a relatively simple structure of point attractors and repellers in the phase space. The underlying continuous dynamics (in the limit of ) of many interaction protocols defined for complete graphs would fit into this category: basic models of randomized rumor spreading [42]; models of opinion propagation (e.g. [14, 1]); population protocols for problems such as majority and thresholds [6, 7]; all reducible Markov chain processes, such as random walks and randomized iterative load balancing schemes.
Nonlinear dynamics with non-trivial limit orbits are fundamental to many areas of systems science, including the study of physical, chemical and biological systems, and to applications in control science. In general, population dynamics with interactions between pairs of agents are non-linear (representable as a set of quadratic difference equations) and have potentially complicated structure if the number of states is or more. For example, the simple continuous Lotka-Volterra dynamics [36] gives rise to a number of discrete models, for example one representing interactions of the form , over some pairs of states in a population (cf. [46] for further generalizations of the framework or [15] for a rigorous analysis in the random scheduler model). The model describes transient stability in a setting in which several species are in a cyclic predator-prey relation. Cyclic protocols of the type have been consequently identified as a potential mechanism for describing and maintaining biodiversity, e.g., in bacterial colonies [30, 31]. Cycles of length 3, in which type attacks type , type attacks type , and type attacks type , form the basis of the basic oscillator, also used as the starting point for protocols in this work, which is referred to as the RPS (rock-paper-scissors) oscillator or simply the 3-cycle oscillator, which we discuss further in Section 6.1. This protocol has been given a lot of attention in the statistical physics literature. The original analytical estimation method applied to RPS was based on approximation with the Fokker-Planck equation [44]. A subsequent analysis of cyclic - and -species models using Khasminskii stochastic averaging can be found in [16], and a mean field approximation-based analysis of RPS is performed in [41]. In [15], we have performed a study of some algorithmic implications of RPS, showing that the protocol may be used to perform randomized choice in a population, promoting minority opinions, in steps. All of these results provide a good qualitative understanding of the behavior of the basic cyclic protocols. We remark that the protocol used in this paper is directly inspired by the properties of RPS, as we discuss further on, but has a more complicated interaction structure (see Fig. 1).
For protocols with convergence to a single point in the configuration space in the limit of large population size, a discussion of the limit behavior is provided in [12], who provide examples of protocols converging to limit points at coordinates corresponding to any algebraic numbers.
We also remark that local interaction dynamics on arbitrary graphs (as opposed to the complete interaction graph) exhibit a much more complex structure of their limit behavior, even if the graph has periodic structure, e.g., that of a grid. Oscillatory behavior may be overlaid with spatial effects [46], or the system may have an attractor at a critical point, leading to simple dynamic processes displaying self-organized criticality (SOC, [43]).
2 Preliminaries: Building Blocks for Population Protocols
2.1 Protocol Definition
A randomized population protocol for a population of agents is defined as a pair , where is the set of states and is the set of interaction rules. The interaction graph is complete. We will simply write , when considering a protocol which is universal (i.e., defined in the same way for each value of ) or if the value of is clear from the context. All the protocols we design are universal; our lower bounds also apply to some non-universal protocols. The set of rules is given so that each rule is of the form , describing an interaction read as: ``(i_{1}(j),i_{2}(j))\to(o_{1}(j),o_{2}(j))\text{with probability q_{j}}â. For all , we define as the set of rules acting on the pair of states , and impose that .
For a state , we denote the number of agents in state as \text{{}^{#}}\!\!A, and the concentration of state as a=\text{{}^{#}}\!\!A/n, and likewise for a set of states , we write \text{{}^{#}}\!\!\mathcal{A}=\sum_{A\in\mathcal{A}}\text{{}^{#}}\!\!A.
In any configuration of the system, each of the agents from the population is in one of states from . The protocol is executed by an asynchronous scheduler, which runs in steps. In every step the scheduler uniformly at random chooses from the population a pair of distinct agents to interact: the initiator and the receiver. If the initiator and receiver are in states and , respectively, then the protocol executes at most one rule from set protocol , selecting rule with probability . If rule is executed, the initiator then changes its state to and the receiver to . The source has a special state, denoted in the Detect problem, or one of two special states, denoted in the BitBroadcast problem, which is never modified by any rule.
All protocols are presented in the randomized framework, however, the universal protocols considered here are amenable to a form of conversion into deterministic rules discussed in [3], which simulates randomness of rules by exploiting the inherent randomness of the scheduler in choosing interacting node pairs to distribute weakly dependent random bits around the system.
All protocols designed in this work are initiator-preserving, which means that for any rule , we have (i.e., have all rules of the form , also more compactly written as ), which makes them relevant in a larger number of application. As an illustrative example, we remark that the basic rumor spreading (epidemic) model is initiator-preserving and given simply as . All protocols can also obviously be rewritten to act on unordered pairs of agents picked by the scheduler, rather than ordered pairs.
2.2 Protocol Composition Technique
Our protocols will be built from simpler elements. Our basic building block is the input-controlled oscillatory protocol (see Fig. 1). We then use protocol as a component in the construction of other, more complex protocols, without disrupting the operation of the original protocol.
Formally, we consider a protocol using state set and rule set , and a protocol extension using a state set , where is disjoint from , and rule extension set . Each rule extension defines for each pair of states from (i.e., to each element of ) a probability distribution over elements of .
The composed protocol is a population protocol with set of states . Its rules are defined so that, for a selected pair of agents in states and , we obtain a pair of agents in states and according to a probability distribution defined so that:
- âą
Each pair appears in the output states of the two agents with the same probability as it would in an execution of protocol on a pair of agents in states and .
- âą
Each pair appears in the output states of the two agents with the probability given by the definition of .
In the above, the pairs of agents activated by and are not independent of each other. This is a crucial property in the composition of protocol when composing it with further blocks to solve the Detect problem.
We denote by the identity protocol which preserves agent states on set of states . For a protocol , we denote by a lazy version of a protocol in which the rule activation of occurs with probability , and with probability the corresponding rule of the identity protocol is activated. Note that all asymptotic bounds on expected and w.h.p. convergence time obtained for any protocol also apply to protocol , in the regime of at least a logarithmic number of time steps. We also sometimes treat a protocol extension as a protocol in itself, applied to the identity protocol .
The independently composed protocol is defined as an implementation of the composed protocol , realized with the additional constraint that in each step, either the rule of is performed with an identity rule extension, or the rule extension of is performed on top of the identity protocol . Such a definition is readily verified to be correct by a simple coupling argument, and allows us to analyze protocols and , observing that the pairs (identities) of agents activated by the scheduler in the respective protocols are independent.
All the composed protocols (and protocol extensions) we design are also initiator-preserving, i.e., and , with probability . In notation, rules omitted from the description of protocol extensions are implicit, occurring with probability [math] (where ) or with the probability necessary for the normalization of the distribution to , where the state is preserved (where ).
As a matter of naming convention, we name the states in the separate state sets of the composed protocols with distinct letters of the alphabet, together with their designated subscripts and superscripts. The rumor source is treated specially and uses a separate letter (and may be seen as a one state protocol without any rules, on top of which all other protocols are composed; in particular, its state is never modified). The six remaining states of protocol are named with the letters , as usual in its definition. Subsequent protocols will use different letters, e.g., and .
3 Overview of Protocol Designs
3.1 Main Routine: Input-Controlled Oscillator Protocol
We first describe the main routine which allows us to convert local input parameters (the existence of source into a form of global periodic signal on the population. This main building block is the construction of a -state protocol following oscillator dynamics, whose design we believe to be of independent interest.
The complete design of protocol is shown in Fig. 1. The source state is denoted by . Additionally, there are six states, called and , for . The naming of states in the protocol is intended to maintain a direct connection with the RPS oscillator dynamics, which is defined by the simple rule â, for â. In fact, we will retain the convention and , and consider the two states and to be different flavors of the same species , referring to the respective superscripts as either lazy (+) or aggressive (++).
The protocol has the property that in the absence of , it stops in a corner state of the phase space, in which only one of three possible states appears in the population, and otherwise regularly (every steps) moves sufficiently far away from all corner states. An intuitive formalization of the basic properties of the protocol is given by the theorem below.
Theorem 1**.**
There exists a universal protocol with states, including a distinguished source state , which has the following properties.
For any starting configuration, in the absence of the source (\text{{}^{#}}\!\!X=0), the protocol always reaches a configuration such that:
- âą
all agents are in the same state: either , or , or ;
- âą
no further state transitions occur after this time.
Such a configuration is reached in rounds, with constant probability (and in rounds with probability ). 2. 2.
For any starting configuration, in the presence of the source (\text{{}^{#}}\!\!X\geq 1), we have with probability :
- âą
for each state , there exists a time step in the next O(\log\frac{n}{\text{{}^{#}}\!\!X}) rounds when at least a constant fraction of all agents are in state ;
- âą
during the next O(\log\frac{n}{\text{{}^{#}}\!\!X}) rounds, at least a constant fraction of all agents change their state at least once.
The proof of the Theorem is provided in Section 6.
The RPS dynamics provides the basic oscillator mechanism which is still largely retained in our scenario. Most of the difficulty lies in controlling its operation as a function of the presence or absence of the rumor source. We do this by applying two separate mechanisms. The presence of rumor source shifts the oscillator towards an orbit closer to the central orbit through rule , which increases the value of potential , where a_{i}=\text{{}^{#}}\!\!A_{i}/n. Conversely, independent of the existence of rumor source , a second mechanism is intended to reduce the value of potential . This mechanism exploits the difference between the aggressive and lazy flavors of the species. Following rule , an agent belonging to a species becomes more aggressive if it meets another from the same species, and subsequently attacks agents from its prey species with doubled probability following rule . This behavior somehow favors larger species, since they are expected to have (proportionally) more aggressive agents than the smaller species (in which pairwise interactions between agents of the same species are less frequent) â the fraction of agents in which are aggressive would, in an idealized static scenario, be proportional to . (This is, in fact, often far from true due to the interactions between the different aspects of the dynamics). As a very loose intuition, the destabilizing behavior of the considered rule on the oscillator is resemblant of the effect an eccentrically fitted weight has on a rotating wheel, pulling the oscillator towards more external orbits (with smaller values of ).
The intuition for which the proposed dynamics works, and which we will formalize and prove rigorously in Section 6, can now be stated as follows: in the presence of rumor source , the dynamics will converge to a form of orbit on which the two effects, the stabilizing and destabilizing one, eventually compensate each other (in a time-averaged sense). The period of a single rotation of the oscillator around such an orbit is between and , depending on the concentration of . In the absence of , the destabilizing rule will prevail, and the oscillator will quickly crash into a side of the triangle.
For small values of \text{{}^{#}}\!\!X>0, the protocol can be very roughly (and non-rigorously) viewed as cyclic composition of three dominant rumor spreading processes over three sets of states , , , one converting states to , the next from to , and the last from to , which spontaneously take over at moments of time separated by parallel rounds. For other starting configurations, and especially for the case of \text{{}^{#}}\!\!X=0, the dynamics of the protocol, which has free dimensions, is more involved to describe and analyze (see Section 6.4). We provide some further insights into the operation of the protocol in Section 7.1, notably formalizing the notion that an intuitively understood oscillation (going from a small number of agents in some state , to a large number of agents in state , and back again to a small number of agents in state ) takes \Theta(\log\frac{n}{\text{{}^{#}}\!\!X}) steps, with probability . As such, protocol can be seen as converting local input \text{{}^{#}}\!\!X into a global periodic signal with period \Theta(\log\frac{n}{\text{{}^{#}}\!\!X}). What remains is allowing nodes to extract information from this periodic signal.
Simulation timelines shown in Fig.9 in the Appendix illustrate the idea of operation of protocol and its composition with other protocols.
3.2 Protocols for BitBroadcast
A solution to BitBroadcast is obtained starting with an independent composition of two copies of oscillator , called and , with states in one protocol denoted by subscript and in the other by subscript . The respective sources are thus written as and . In view of Theorem 1, in this composition , under the promise of the BitBroadcast problem, one of the oscillators will be running and the other will stop in a corner of its state space. Which of the oscillators is running can be identified by the presence of states , which will only appear for corresponding to the operating oscillator. Moreover, by the same Theorem, every rounds a constant number of agents of this oscillator will be in such a state , for any choice of . We can thus design the protocol extension to detect this. This is given by the pair of additional output states and the rule extension consisting of the two rules shown in Fig. 2.
Theorem 2** (Protocol for BitBroadcast).**
Protocol , having states, including distinguished source states , converges to an exact solution of BitBroadcast. This occurs in parallel rounds, with probability . In the output encoding, agent states of the form represent answer ââ, for .
The protocol is not âsilentâ, i.e., it undergoes perpetual transitions of state, even once the output has been decided. As a side remark, we note that for the single-source broadcasting problem, or more generally for the case when the number of sources is small, \max\{\text{{}^{#}}\!\!X_{[1]},\text{{}^{#}}\!\!X_{[2]}\}=O(1), we can propose the following simpler silent protocol. We define protocol , by modifying protocol as follows. We remove from it Rule (5), and replace it by to the four rules shown in Fig. 3. The analysis of the modified protocol follows from the same arguments as those used to prove Theorem 1(1). In the regime of \max\{\text{{}^{#}}\!\!X_{[1]},\text{{}^{#}}\!\!X_{[2]}\}=O(1), the effect of the source does not influence the convergence of the process and each of the three possible corner configurations, with exclusively species , is reached in steps with constant probability. However, rules enforce that the only stable configuration which will persist is the one in a corner corresponding to the identity of the source, i.e., for source and for source ; the source will restart the oscillator in all other cases. We thus obtain the following side result, for which we leave out the details of the proof.
Observation 1**.**
Protocol , having states, including distinguished source states , converges to an exact solution of BitBroadcast, eventually stopping with all agents in state if source is present and stopping with all agents in state if source is present, with no subsequent state transition. The stabilization occurs within parallel rounds, with probability , if \max\{\text{{}^{#}}\!\!X_{[1]},\text{{}^{#}}\!\!X_{[2]}\}=O(1), i.e., the broadcast originates from a constant number of sources.
3.3 Protocol for Detect
The solution to problem Detect is more involved. It relies on two auxiliary extensions added on top of a single oscillator . The first, , runs an instance of the 3-state majority protocol of Angluin et al. [7] within each species of the oscillator. For this reason, the composition between and has to be of the form (i.e., it cannot be independent). The operation of this extension is shown in Fig. 5 and analyzed in Section 7.2. It relies crucially on an interplay of two parameters: the time \Theta(\log\frac{n}{\text{{}^{#}}\!\!X}) taken by the oscillator to perform an orbit, and the time \Omega(\log\frac{n}{\text{{}^{#}}\!\!X}) it takes for the majority protocol (which is reset by the oscillator in its every oscillation) to converge to a solution. When parameters are tuned so that the second time length is larger a constant number of times than the first, a constant proportion of the agents of the population are involved in the majority computation, i.e., both of the clashing states in the fight for dominance still include agents. In the absence of a source, shortly after the oscillator stops, one of these states takes over, and the other disappears.
The above-described difference can be detected by the second, much simpler, extension , designed in Fig. 5 and analyzed in Section 7.3. The number of âlightsâ switched on during the operation of the protocol will almost always be more than , where is a parameter controlled by the probability of lights spontaneously disengaging, and may be set to and arbitrarily small constant.
Theorem 3** (Protocol for Detect).**
For any , protocol , having states, including a distinguished source state , which solves the problem of spreading confirmed rumors as follows:
For any starting configuration, in the presence of the source (\text{{}^{#}}\!\!X\geq 1), after an initialization period of rounds, at an arbitrary time step the number of agents in an output state corresponding to a âyesâ answer is , with probability . 2. 2.
For any starting configuration, in the absence of the source (\text{{}^{#}}\!\!X=0), the system always reaches a configuration such that all agents are in output states corresponding to a ânoâ answer for all subsequent time steps. Such a configuration is reached in rounds, with probability .
4 Impossibility Results for Protocols without Non-Stationary Effects
For convenience of notation, we identify a configuration of the population with a vector , where , for , denotes the number of agents in the population having state , and . Our main lower bound may now be stated as follows.
Theorem 4** (Fixed points preclude fast stabilization).**
Let be arbitrarily chosen, let be any -state protocol, and let be a configuration of the system with at most agents in state , where is a constant depending only on and . Let be a subset of the state space around such that the population of each state within is within a factor of at most from that in (for any , for all states , we have ).
Suppose that in an execution of starting from configuration , with probability , the configurations of the system in the next parallel rounds are confined to .
Then, an execution of for parallel rounds, starting from a configuration in which state has been removed from , reaches a configuration in a -neighborhood of , with probability .
In the statement of the Theorem, for the sake of maintaining the size of the population, we interpret âremoving state from â as replacing the state of all agents in state by some other state, chosen adversarially (in fact, this may be any state which has sufficiently many representatives in configuration ). The -neighborhood of is understood in the sense of the -norm or, asymptotically equivalently, the total variation distance, reflecting configurations which can be converted into a configuration from by flipping the states of agents.
The proof of Theorem 4 is provided in Section 8. It proceeds by a coupling argument between a process starting from and a perturbed process in which state has been removed. The analysis differently treats rules and states which are seldom encountered during the execution of the protocol from those that are encountered with polynomially higher probability (such a clear separation is only possible when ). Eventually, the probability of success of the coupling reduces to a two-dimensional biased random walk scenario, in which the coordinates represent differences between the number of times particular rules have been executed in the two coupled processes.
We have the following direct corollaries for the problems we are considering. For Detect, if represents the set of configurations of the considered protocol, which are understood as the protocol giving the answer â\text{{}^{#}}\!\!X>0â, then our theorem says that, with probability , the vast majority of agents will not ânoticeâ that \text{{}^{#}}\!\!X had been set to [math], even a polynomial number of steps after this has occurred, and thus cannot yield a correct solution. An essential element of the analysis is that it works only when state is removed in the perturbed process. Thus, there is nothing to prevent the dynamics from stabilizing even to a single point in the case of , which is indeed the case for our protocol . The argument for BitBroadcast only applies to situations where the source agent is sending out white noise (independently random bits in successive interactions). Such a source can be interpreted as a pair of sources in states and in the population, each disclosing itself with probability upon activation and staying silent otherwise. In the cases covered by the lower bound, the scenario in which the source is completely suppressed cannot be distinguished from the scenario in which both and appear; likewise, the scenario in which the source is completely suppressed cannot be distinguished from the scenario in which both and appear. By coupling all three processes, this would imply the indistinguishability of the all these configurations, including those with only source and only source , which would imply incorrect operation of the protocol.
Whereas we use the language of discrete dynamics for precise statements, we informally remark that the protocols covered by the lower bound of Theorem 4 include those whose dynamics , described in the continuous limit , has only point attractors, repellers, and fixed points. In this sense, the use of oscillatory dynamics in our protocol seems inevitable.
The impossibility result is stated in reference to protocols with a constant number of states, however, it may be extended to protocols with a non-constant number of states , showing that such protocols require time to reach a desirable output. (This time is larger than polylogarithmic up to some threshold value .) The lower bound covers randomized protocols, including those in which rule probabilities depend on (i.e., non-universal ones).
5 Input-Controlled Behavior of Protocols for Detect
In this Section, we consider the periodicity of protocols for self-organizing oscillatory dynamics, in order to understand how the period of a phase clock must depend on the input parameters. We focus on the setting of the Detect problem, considering the value \text{{}^{#}}\!\!X of the input parameter. In Section 3.1, we noted informally that the designed oscillatory protocol performs a complete rotation around the triangle in \Theta(\log n/\text{{}^{#}}\!\!X) rounds. Here, we provide partial evidence that the periodicity of any oscillatory protocol depends both on the value of \text{{}^{#}}\!\!X and . We do this by bounding the portions of the configuration space in which a protocol solving Detect finds itself in most time steps, separating the cases of sub-polynomial \text{{}^{#}}\!\!X (i.e., \text{{}^{#}}\!\!X<n^{\varepsilon_{0}}, where is a constant dependent on the specific protocol), and the case of \text{{}^{#}}\!\!X=\Theta(n).
Any protocol on states (not necessarily of oscillatory nature) can be viewed as a Markov chain in its -dimensional configuration space , and as in Section 4 we identify a configuration with a vector . The configuration at time step is denoted . In what follows, we will look at the equivalent space of log-configurations, given by the bijection:
[TABLE]
where for and for .
For , we will refer to the -log-neighborhood of as the set of points .
Notice first that the notion of a box in the statement of Theorem 4 is closely related to the set of points in the -log-neighborhood of configuration . It follows from the Theorem that any protocol for solving Detect within a polylogarithmic number of rounds with probability must, in the case of 0<\text{{}^{#}}\!\!X<n^{\varepsilon_{0}}, starting from at some time , leave the -log-neighborhood of within rounds with probability . We obtain the following corollary.
Proposition 1**.**
Fix a universal protocol which solves the Detect problem with -error in rounds with probability . Set 0<\text{{}^{#}}\!\!X<n^{\varepsilon_{0}}, where is a constant which depends only on the definition of protocol \text{{}^{#}}\!\!X. Let be an arbitrarily chosen moment of time after at least rounds from the initialization of the protocol in any initial state. Then, within rounds after time , there is a moment of time such that is not in the -neighborhood of , with probability . â
The above Proposition suggests that oscillatory or quasi-oscillatory behavior at low concentrations of state must be of length . By contrast, the following Proposition shows that in the case \text{{}^{#}}\!\!X=\Theta(n), the protocol remains tied to a constant-size log-neighborhood of its configuration space.
Proposition 2**.**
Fix a universal protocol with set of states which solves the Detect problem with -error in rounds with probability . Then, there exists a constant , depending only on the design of protocol , with the following property. Fix \text{{}^{#}}\!\!X\in[cn,n/2], where is an arbitrarily chosen constant. Let be an arbitrarily chosen moment of time, after at least rounds from the initialization of the protocol at an adversarially chosen initial configuration , such that each coordinate satisfies or , for all . Then, with probability , is in the -neighborhood of .
The proof of the Proposition is deferred to Section 9.
Note that, in the regime of a constant-size log-neighborhood of configuration , the discrete dynamics of the protocol adheres closely to the continuous-time version of its dynamics in the limit . (See Section 6, and in particular Lemma 1, for a further discussion of this property). Since the latter is independent of , any oscillatory behavior âinheritedâ from the continuous dynamic would have a period of rounds. We leave as open the question whether some form of behavior of a protocol with polylogarithmic (i.e., or more broadly, non-constant and subpolynomial) periodicity for Detect can be designed in the regime of \text{{}^{#}}\!\!X=\Theta(n) despite this obstacle. In particular, the authors believe that the existence of an input-controlled phase clock with a period of for any \text{{}^{#}}\!\!X>0, and the absence of operation for \text{{}^{#}}\!\!X=0, is unlikely in the class of discrete dynamical systems given by the rules of population protocols.
The remaining sections of the paper provide proofs of the Theorems from Sections 3, 4, and 5.
6 Analysis of Oscillator Dynamics
This section is devoted to the proof of Theorem 1.
6.1 Preliminaries: Discrete vs. Continuous Dynamics
Notation.
For a configuration of a population protocol, we write to describe the number of agents in the states of the protocol, and likewise use vector to describe their concentrations. The concentration of a state called which is the -th state in vector is equivalently written as , depending on which notation is the easiest to use in a given transformation.
If vector represents the current configuration of the protocol and is the random variable describing the next configuration of the protocol after the execution of a single rule, we write . We also use the notation to functions of state .
Next, we define the continuous dynamics associated with the protocol by the following vector differential equation:
[TABLE]
and likewise, for each coordinate, (we use the dot-notation and interchangeably for time differentials). This continuous description serves for the analysis only, and reflects the behavior of the protocol in the limit .555We note that some of our results rely on the stochasticity of the random scheduler model, and do not immediately generalize to the continuous case.
Warmup: the RPS oscillator.
Our oscillatory dynamics may be seen as an extension of the rock-paper-scissors (RPS) protocol (see Related work). This is a protocol with three states , , and three rules:
[TABLE]
where is an arbitrarily fixed constant, and the indices of states are always or , and any other values should be treated as in the given range. For , the change of concentration of agents of state in the population in the given step can be expressed for the RPS protocol as:
[TABLE]
Thus, the corresponding continuous dynamics for RPS is given as:
[TABLE]
for . The orbit of motion for this dynamics in is given by two constants of motion. First, by normalization. Secondly, for any starting configuration with a strictly positive number of agents in each of the three states, the following function of the configuration:
[TABLE]
is easily verified to be constant over time , hence (or more simply, ). Thus, for the continuous dynamics, the initial product of concentrations completely determines its perpetual orbit, which is obtained by intersecting the appropriate curve with the plane . As a matter of convention, the plane with conditions is drawn as an equilateral triangle (we adopt this convention throughout the paper, for subsequent protocols). All of the orbits are concentric around the point , which is in itself a point orbit maximizing the value of . The discrete dynamics follows a path of motion which typically resembles random-walk-type perturbations around the path of motion, until eventually, after steps it crashes into one of the sides of the triangle. Subsequently, if , for some , then no rule can make increase. (If , in the next steps, all remaining agents of will convert to , and there will be only agents from left.) Thus, the protocol will terminate in a corner of the state space.
A further discussion of the RPS dynamics can be found in [28, 15].
6.2 Proof Outline of Theorem 1
The rest of the section is devoted to the proof of Theorem 1. We start by noting some basic properties in Subsection 6.3, then prove the properties of the protocol for the case of (Subsection 6.4, and finally analyze (the somewhat less involved) case of (Subsection 6.5). For the case of , the proof is based on a repeated application of concentration inequalities for several potential functions (applicable in different portions of the -dimensional phase space). In two specific regions, in the -neighborhood of the center of the -triangle and very close to its sides, we rely on stochastic noise to âpushâ the trajectory away from the center of the triangle, and also to push it onto one of its sides. Fortunately, each of these stages takes parallel rounds, with strictly positive probability. Overall, the parallel rounds bound for the case of is provided with constant probability; this translates into parallel rounds in expectation, since subsequent executions of the process for rounds have independently constant success probability, and the process has a geometrically decreasing tail over intervals of length .
6.3 Properties of the Oscillator
In the following, we define . Handling the case of allows us not only to take care of the fact that a fraction of the population may be taken up by rumor source , but also allows for easier composition of with other protocols (sharing the same population). We set as a constant value independent of , which is sufficiently small (e.g., is a valid choice; we make no efforts in the proofs to optimize constants, but the protocol appears in simulations to work well with much larger values of ).
We will occasionally omit an explanation of index , which will then implicitly mean âfor all â. We define and .
From the definition of the protocol one obtains the distribution of changes of the sizes of states in a step:
[TABLE]
[TABLE]
Taking the expectations of the above random variables, and recalling that , we obtain:
[TABLE]
6.4 Stopping in Sequential Steps in the Absence of a Source
Throughout this subsection we assume that . We consider first the case where , for (noting that as soon as , we can easily predict the subsequent behavior of the oscillator, as was the case for the RPS dynamics).
The dynamics of is defined in such a way that that when and in the absence of the rules of the RPS oscillator, the value of would be close to . Consequently, we define , as the appropriate normalized corrective factor:
[TABLE]
Note that as , thus . Next, we introduce the following definitions:
[TABLE]
[TABLE]
[TABLE]
We also reuse potential from the original RPS oscillator. This time, it is no longer a constant of motion. By (5) and the definition of , for we upper-bound as:
[TABLE]
The above change of the potential is indeed negative when (which is in accordance with our intention in designing the destabilizing rules for the oscillator).
The functions , and are intricately dependent on each other. In general, we will try to show that and increase over time, while stays close to [math]. This requires that we first introduce a number of auxiliary potentials based on these two functions.
First, for , we can rewrite (4) as:
[TABLE]
Next, introducing the definition of to (3), we obtain for :
[TABLE]
From the above, an upper bound on follows directly using elementary transformations:
[TABLE]
We are now ready to estimate for , using the definition of and the previously obtained formula for :
[TABLE]
Next from the bound on :
[TABLE]
Next:
[TABLE]
where in the final transformation we took into account that .
Now, we define the potential for any configuration with all as:
[TABLE]
We remark that is always well-defined when , and that .
Overview of the proof.
The proof for the case of proceeds by following the trajectory of the discrete dynamics of , divided into a number of stages. We define a series of time steps by conditions on the configuration met at time , and show that subject to these conditions holding, we have (we recall that here time is measured in steps), with at least constant probability. Overall, it follows that the configuration at time , which corresponds to having reached a corner state, is reached from , which is any initial configuration with , in time steps, with constant probability.
The intermediate time steps may be schematically described as follows (see Fig. 6). For configurations which start close to the center of the triangle (), we define a pair of potentials , , based on a linear combination of modified versions of and . The dynamics will eventually escape from the area ; however, first it may potentially reach a very small area of radius around the center of the triangle with (Lemma 3, time , reached in steps by a multiplicative drift analysis on potential ), pass through the vicinity of center of the triangle, escaping it with (Lemma 4, time , reached in steps with constant probability by a protocol-specific analysis of the scheduler noise, which with constant probability increases without increasing too much), and eventually escapes completely to the area of (Lemma 5, exponentially increasing value of potential ).
In the area of , we define a new potential based on and . This increases (Lemma 8, additive drift analysis on with bounded variance) until a configuration at time with a constant number of agents of some species is reached. This configuration then evolves towards a configuration at time at which some species has agents, and additionally its predator species is a constant part of the population (Lemma 9, direct analysis of the process combined with analysis of potential and a geometric drift argument). Then, the species with agents is eliminated in steps with constant probability (, Lemma 10), and finally one more species is eliminated in another steps (at time , Lemma 11, straightforward analysis of the dynamics). At this point, the dynamics has reached a corner.
Throughout the proof, we make sure to define boundary conditions on the analyzed cases to make sure that the process does not fall back to a previously considered case with probability .
Phase with .
We then have and , for . In this range, we have:
[TABLE]
Summing the above inequalities for and noting that , we obtain:
[TABLE]
Next, we have:
[TABLE]
Combining the two above expressions gives the sought bound between and as:
[TABLE]
and equivalently
[TABLE]
We have directly from (6) and from the relations between and :
[TABLE]
and from (7):
[TABLE]
Moving to the discrete-time model, it is advantageous to eliminate the discontinuity of partial derivatives of and at points with and respectively, which is a side-effect of the applied square root transformation in the respective definitions of and . We define the auxiliary functions and by adding an appropriate corrective factor:
[TABLE]
and derive accordingly from (8) and (9):
[TABLE]
Let be the -dimensional vector representing the current configuration of the system: ; note that the last element is determined as .666In principle it is also correct to represent as a vector of dimension , i.e., including in as a free dimension. However, such a representation would lead to second-order partial derivatives which are too large for our purposes. The following lemma is obtained by a folklore application of Taylorâs theorem.
Lemma 1**.**
Let be a function in a sufficiently large neighborhood of , with . Then, , where .
Proof.
Let be the random variable representing the configuration of the system after its next transition from configuration . Observe that in every non-idle step of execution of the protocol, exactly one agent changes its state, so .
Applying Taylor approximation we have:
[TABLE]
where is the gradient of at , denotes the second-order Taylor remainder for function expanded at point along the vector towards point , and is subsequently an appropriately chosen value, satisfying:
[TABLE]
â
The following lemma is obtained directly by computing and bounding all second order partial derivatives of functions and with respect to variables .
Lemma 2**.**
There exists a constant depending only on , such that, for any configuration with :
- âą
,
- âą
.
â
In view of the above lemmas, we obtain from (8) and (9), for an appropriately chosen constant :
[TABLE]
For , we now define two linear combinations of functions and :
[TABLE]
When , we have:
[TABLE]
where we denoted and used the fact that .
We subsequently perform an analysis of , , treating them as stochastic processes. We remark that , since .
Lemma 3**.**
Let be an arbitrary starting configuration of the system. Then, with constant probability, for some , a configuration is reached such that .
Proof.
W.l.o.g. assume . We subsequently only analyze process . Let be the first time step such that . If , then . Note that then for all , from which it follows by a straightforward calculation from the definition of , , and , that for all .
We now define the filtered stochastic process as for , and put for . For all , we then have:
[TABLE]
Since for all time steps, a direct application of multiplicative drift analysis (cf. [19]) gives:
[TABLE]
and the claim follows by Markovâs inequality. â
Lemma 4**.**
Let be an arbitrary starting configuration of the system such that , for . Then, with constant probability, for some , a configuration is reached such that .
Proof.
W.l.o.g. assume that and suppose that initially (i.e., that ). Then, from the lower and upper bounds on and we obtain the following bounds on and :
[TABLE]
It follows that, for , and . For the sake of clarity of notation, we will simply write and , hence also .
We will consider now the sequence of exactly transitions of the protocol, between time steps .
For all we have . Consider the Doob submartingale with increments given as:
[TABLE]
Noting that , an application of the Azuma inequality for submartingales to gives: (cf. e.g. [13][Thm. 16]). From here it follows directly that:
[TABLE]
Noting that , we have:
[TABLE]
We now describe the execution of transitions in the protocol for times through the following coupling. First, we select the sequence of pairs of agents chosen by the scheduler. Let (respectively, ) denote the subsets of the set of agents, having initial states (resp., ) at time [math], respectively, which are involved in exactly one transition in the considered time interval, acting in it as the initiator (resp., receiver). Let denotes the subset of time steps at which the scheduler activates a transition involving an element of as the initiator and an element of as the receiver. The execution of the protocol is now given by:
- âą
Phase : Selecting the sequence of pairs of elements activated by the scheduler in time steps . This also defines set . Executing the rules of the protocol in their usual order for time steps from set .
- âą
Phase : Executing the rules of the protocol for time steps from set .
Observe that since elements of pairs activated in time steps from are activated only once throughout the steps of the protocol, the above probabilistic coupling does not affect the distribution of outcomes.
Directly from (13), we obtain through a standard bound on conditional probabilities that at least a constant fraction of choices made in phase leads to an outcome ââ with at least constant probability during phase :
[TABLE]
We now remark on the size of set . The distribution of depends only on , , and the choices made by the random scheduler. We recall that . Since the expected number of isolated edges in a random multigraph on nodes (representing the set of agents) and edges (representing the set of time steps) is , the number of such edges having the first endpoint in an agent in state and the second endpoint in an agent in state is . A straightforward concentration analysis (using, e.g., the asymptotic correspondence between and random graph models and an application of Azumaâs inequality for functions of independent random variables) shows that the bound holds with very high probability. In particular, we have:
[TABLE]
for some choice of constant which depends only on .
Relations (14) and (15) provide all the necessary information about phase that we need. Subsequently, we will only analyse phase , conditioning on a fixed execution of phase such that the following event holds:
[TABLE]
We remark that, by a union bound over (14) and (15), .
In the remainder of our proof, our objective will be to show that:
[TABLE]
for some constant depending only on , for any choice of for which event holds. When this is shown, the claim of the lemma will follow directly, with a probability value given as at least by the law of total probability.
We now proceed to analyze the random choices made during phase . Each of the considered interactions involves a pair of agents of the form , and describes the following transition:
[TABLE]
independently at random for each transition. The only state changes observed during this phase are from to , and we denote by the number of such state changes. The value of random variable completely describes the outcome of phase .
We have , and by a standard additive Chernoff bound:
[TABLE]
Let be the subset of the considered interval containing values of such that \left(\psi^{(1)}_{n}\big{|}P_{A},B\in\mathcal{B}\right)\geq\frac{4c_{3}}{\sqrt{n}}. If , then the claim follows directly.
Otherwise, it follows from (16) and (17) that there must exist a value , such that:
[TABLE]
Given that:
[TABLE]
and recalling that , we obtain the following bound on :
[TABLE]
We now consider lower bounds on the value of , conditioned on (respectively, ), where (resp., ) is a value arbitrarily fixed in the range (resp., ). The executions of the protocol with and differ with respect to the execution with in the number of executed transitions from to by at least . Recalling that , it follows that for some we have after steps:
[TABLE]
Subsequently, we will assume that ; the case of is handled analogously. From the relation and (18) we have:
[TABLE]
When comparing the value of in the two cases, and , it is convenient to consider as the length of the vector in Euclidean space. For each of the coordinates , , we have:
[TABLE]
hence:
[TABLE]
Introducing (19) and (20) into the definition of , we obtain directly:
[TABLE]
where we again used the fact that is a sufficiently small constant w.r.t. . We thus obtain:
[TABLE]
where by the definition of random variable as a sum of i.i.d. binary random variables and the choice of value in the direct vicinity of the expectation of , the event holds with constant probability. The case of is handled analogously. â
Lemma 5**.**
Let be an arbitrary starting configuration of the system such that . Then, with constant probability, for some , a configuration is reached such that .
Proof.
We subsequently consider only the process . We start by showing the following claim.
Claim. Suppose . Then, with probability at least , for some time step the process reaches a value , or .
Proof (of claim). Consider the Doob submartingale with increments given as:
[TABLE]
Noting that , an application of the Azuma inequality for submartingales (cf. e.g. [13][Thm. 16]) to with gives:
[TABLE]
Moreover, assuming the barrier was not reached, we have:
[TABLE]
which completes the proof of the claim.
We now prove the lemma by iteratively applying the claim over successive intervals of time , such that and is the first time step not before such that or . By the claim, we have:
[TABLE]
Noting that by definition, and that before the barrier is reached, we have , we obtain:
[TABLE]
and further:
[TABLE]
In particular, putting , . Since for this value of , we must have (since otherwise we would have , which is impossible), the claim of the lemma follows. â
Phase with .
The second phase of convergence corresponds to configurations of the system which are sufficiently far from the center point . Formally, we analyze a variant of potential (with an additive corrective factor proportional to ) to show that, starting from a configuration with , we will eliminate one of the three populations in sequential steps with constant probability, without approaching the center point too closely (a value of will be maintained throughout).
For this part of the analysis, we define the considered potential as:
[TABLE]
for any configuration with .
We have directly from (6) and (7):
[TABLE]
where in the last transformation we took into account that .
For the sake of technical precision in formulating the subsequent lemmas, we also consider the stochastic process , given as for any , where is defined as the first time in the evolution of the system such that a configuration with is reached, where is a constant depending only on . For all , we define .
Lemma 6**.**
*In any configuration with we have: . *
Proof.
We have:
[TABLE]
Following the definition of in Eq. (2), we have by linearity of expectation:
[TABLE]
Next, using the bound which holds for , we have:
[TABLE]
from which it follows that:
[TABLE]
To analyze , we apply a variant of Lemma 1. A direct application of the lemma is not sufficient due to the singularity related to the term in the definition of ; however, this effect is compensated when we take into account that any change of the value of occurs in the considered protocol with probability at most proportional to . For the specific case of , for fixed , we consider as a function of the restricted configuration , and we rewrite expression (12) as:
[TABLE]
A straightforward computation from the definition of function shows that:
[TABLE]
It follows that
[TABLE]
and so:
[TABLE]
Introducing (24) and (25) into (23), we obtain:
[TABLE]
where in the second-to-last transformation we used (22), and in the last transformation we used the relation which holds when and .
The claim thus follows when and , i.e., for . For larger values of , the claim follows trivially from the definition of . â
The above Lemma is used to show that, starting from any configuration with , we quickly reach a configuration in which some species has a constant number of agents.
Lemma 7**.**
If , we have:
- (i)
,
- (ii)
.
where and are constants depending only on . Moreover, in any configuration with , we have:
- (iii)
.
- (iv)
.
Proof.
We first consider the case of a configuration with . Using the definition of (and within it, of and ). Consider any transition from a configuration to a subsequent configuration and let be defined as the set of indices of configurations changing between and (). We verify that there exists an absolute constant such that:
[TABLE]
Moreover, by the definition of the protocol a transition from to occurs with probability . Since there is only a constant number of possible successor configurations for (loosely bounding, not more than ), it follows that:
[TABLE]
The bounds on the variance of and that of (for ) with follow directly. The analysis of when and is performed analogously, noting that if the succeeding configuration is such that , then . Finally, for , the result holds trivially by the definition of . â
Lemma 8**.**
Let be an arbitrary starting configuration of the system such that . Then, with probability , for some , a configuration is reached such that .
Proof.
W.l.o.g. assume that . First we remark that, by the relation between and for , a process starting with satisfies:
[TABLE]
Moreover, for any configuration with we have:
[TABLE]
Thus, initially and as long as for all time steps we have , the barrier condition has not been violated. Moreover, for , we have by Lemma 6 that . Moreover, by Lemma 7 (iii) and the fact that which implies , we have that .
It follows from a standard application of Azumaâs inequality for martingales (resembling the analysis of the hitting time of the random walk with step size , from one endpoint of a path of length to the other) that:
[TABLE]
hence also throughout the first steps of the process we have , with probability .
We are now ready to analyze the subsequent stages of the process, designing a Doob submartingale with time increments defined as:
[TABLE]
Using Lemma 7 (i) and (ii) and applying the Azuma-McDiarmid inequality777If our objective in the proof of the lemma were to show a bound on which holds with constant probability (which would be sufficient for our purposes later on), rather than a w.h.p. bound, then this specific step of the proof can also be performed using Markovâs inequality. In any case, we would need to make use of the bounded variance of in the proof of the next Lemma. in the bounded variance version (cf. e.g. [13][Thm. 18]) to for , for some sufficiently large constant depending only on , we obtain:
[TABLE]
If the event were to hold for all with and if , then we would have , which would mean that , since by definition. If , then , and the proof is complete. (Indeed, to reach a configuration with , the protocol has to pass through a configuration with , since the size of each population changes by at most in each transition.) Otherwise, we must have that at least one of the following events holds: , or for some , or for some . We have established that each of the first two of these events holds with probability , whereas if the latter event holds, then . Thus, holds with probability by a union bound. â
Lemma 9**.**
Let be a starting configuration of the system such that . Then, with constant probability, for some , a configuration is reached such that and , for some .
Proof.
W.l.o.g. assume that . If , then the claim follows immediately, putting and . Otherwise, we will show that with constant probability, the system will evolve so that will increase over time until within steps we will have a time step with (i.e., and ).
In the considered case, w.l.o.g. assume . Next, let for a sufficiently large constant ; we choose as for convenience in later analysis. Intuitively, in view of Lemmas 6 and 7, the potential will be further increased in the next steps: the random variable has an expected value of , with a standard deviation of .
By an application of the Azuma-McDiarmid inequality for martingales with bounded variance similar to that in the proof of Lemma 8, we obtain the following result:888Such an analysis can also be performed using Chebyshevâs inequality, obtaining a slightly weaker expression in the probability bound.
[TABLE]
Observe that since , we have . Taking this into account, for our purposes, a slightly weaker and simpler form of expression (26) will be more convenient:
[TABLE]
The proof of the lemma is completed by a more fine-grained analysis of the considered protocol. In the initial configuration , we have (there are exactly agents in state ), and since , we have . Consequently, . Informally, since the prey of (i.e., ) is more than twice more numerous than its predator (i.e., ), we should observe the increase in the size of population of , regardless of the activities ( or ) of the agents in the population. We consider the evolution of the system, finishing at the earliest time when . The following relations are readily shown (apply e.g. Lemma 14 with and ):
[TABLE]
From (29), taking into account that and , an application of Azumaâs inequality for martingales shows that:
[TABLE]
Taking into account the above, by a straightforward geometric growth analysis (compare e.g. proof of Lemma 5), we obtain from (28):
[TABLE]
Moreover, since the speed of increase of is bounded (even in the absence of predators) by that of a standard push rumor spreading process (formally, ), we have (compare e.g. [20]):
[TABLE]
Now, we observe that with constant probability, the size of population does not decrease in the time interval below the value , attained at the beginning of this interval:
[TABLE]
Indeed, with constant probability the value is initially non-decreasing: with constant probability, in the first rounds each of the agents from will be triggered by the scheduler times in total, and each interaction involving an agent from will have this agent as the initiator, and an agent from the largest of the three populations, , as the receiver (the prey). Thus, with constant probability, the number of agents in population is increased to an arbitrary large constant (e.g., ). After this, we use the geometric growth property (28) to show that reaches the barrier (at time ) before the event occurs (cf. e.g. proof of Lemma 5, or standard analysis of variants of rumor-spreading processes in their initial phase [29]).
When the event from bound (33) holds, at least one of the following events must also hold:
- (A)
, for all ,
- (B)
or there exists a time step such that ,
- (C)
or there exists a time step such that .
To complete the proof, we will show that each of the events (A) and (B) holds with probability . Indeed, then in view of (33), event (C) will necessarily hold with probability . This means that, with probability , there exists a time step such that and (since ), and so also ; thus, the claim of the lemma will hold with and .
To show that event (B) holds with probability , notice that by definition of , and moreover with probability , hence the event holds with probability .
To show that event (A) holds with probability , notice that, substituting in (27) , by a union bound over (27), (31) and (32) we obtain:
[TABLE]
This means that, with probability , we have or . In the first case, event (A) cannot hold. In the second case, observe that by definition of , so , and it follows that . Since the condition is not fulfilled, event (A) can only hold with probability . â
Lemma 10**.**
Let be a starting configuration of the system such that and , for some . Then, with constant probability, for some , a configuration is reached such that .
Proof.
We consider the pairs of interacting agents chosen by the scheduler in precisely the next rounds after time . Given that set has constant size, and set has linear size in , it is straightforward to verify that with constant probability, the set of randomly chosen pairs of agents has all of the following properties:
- âą
Each agent from belongs to exactly one pair picked by the scheduler, and is the receiver in this pair.
- âą
Each agent interacting in a pair with an agent from belongs to exactly one pair.
- âą
Each agent interacting in a pair with an agent from belongs to set .
Conditioned on such a choice of interacting pairs by the scheduler, the protocol changes the state of all agents from set to state with probability at least . State is then effectively eliminated. â
In the absence of species , the interaction between species and collapses to a lazy predator-prey process, with transitions of the form associated with a constant transition probability. A w.h.p. bound on the time of elimination of species follows immediately from the analysis of the push rumor spreading model, and we have the following Lemma.
Lemma 11**.**
Let be a starting configuration of the system such that , for some . Then, with probability , for some , a configuration is reached such that for all , and . â
After a further steps after time , the final configuration of all agents in the oscillatorâs population will be .
6.5 Operation of the Oscillator in the Presence of a Source
In this section we prove properties of the oscillatory dynamics for the case \text{{}^{#}}\!\!X>0. It is possible to provide a detailed analysis of the limit trajectories of the dynamics in this case, as a function of the concentration of . Here, for the sake of compactness we only show the minimal number of properties of the oscillator required for the proof of Theorem 1. When the given configuration is such that is sufficiently large, say , then both the subclaims of Theorem 1(2) hold for the considered configuration. (The first subclaim hold directly; the second subclaim follows by a straightforward concentration analysis of the number of agents changing state in protocol over the next steps, since we will always have during the considered time interval.) Otherwise, the considered configuration is close to one of the sides of the triangle. We will show that in the next steps, with high probability, the protocol will either reach a configuration with , or will visit successive areas around the triangle, as illustrated in Fig. 7. The following Lemmas show that within each area, an exponential growth process occurs, which propagates the agent towards the next area.
Lemma 12**.**
If and , then .
Proof.
From the assumptions we have that . Starting from (3) we obtain:
[TABLE]
â
From the above bound on expectation, the following Lemma follows directly by a standard concentration analysis. In what follows, we consider an execution in which the concentration is strictly positive and bounded by a sufficiently small absolute constant (i.e., \text{{}^{#}}\!\!X is at most a given constant fraction of the entire population), with the required upper bounds on used in the proofs of lemmas given in their statements. This is a technical assumption, which allows us to simplify the proof structure. In particular, the assumption \text{{}^{#}}\!\!X\leq c_{12}n can be omitted in the statement of Theorem 1, and the claim of the theorem can even be proved for executions in which \text{{}^{#}}\!\!X changes during the execution of the protocol, as long as the invariant \text{{}^{#}}\!\!X>0 is preserved over the considered interval of time.
Lemma 13**.**
Let be a starting configuration of the system such that and . Suppose , starting from time . Then, for some , where is a constant depending only on and , with probability , the system reaches a configuration such that exactly one of the following two conditions is fulfilled:
- âą
either ,
- âą
or and .
Proof.
In the considered range of values of , we have and , for all until we leave the considered area at time . Taking into account that , it follows from Lemma 12 that:
[TABLE]
Taking into account that , it follows from a straightforward concentration analysis (cf. e.g. proof of Lemma 5 for a typical analysis of this type of exponential growth process) that a boundary of the considered area (either or ) must be reached within steps with very high probability, as stated in the claim of the lemma. â
A similar analysis is performed for the next area.
Lemma 14**.**
If and then and .
Proof.
From assumptions we have that . Starting from (3) we obtain:
[TABLE]
â
Again, a concentration result follows directly.
Lemma 15**.**
Let be a starting configuration of the system such that and . Suppose , starting from time . Then, for some , where is a constant depending only on and , with probability , the system reaches a configuration such that exactly one of the following two conditions is fulfilled:
- âą
either ,
- âą
or , , and (consequently) .
Proof.
We first show that, starting from time onward, the process satisfies for all with probability , where is defined as the minimum of time and the last time moment such that holds for all . By Lemma 14, we have for all such that :
[TABLE]
where we took into account the assumption . The claim on follows from a standard concentration analysis, noting that .
In order to analyze the process , we apply a filter and consider the process , starting at time , defined as follows. For as long as , we put , and starting from the first time when , we compute as the subsequent value of after a simulation of a single step of the process for some state with concentrations of types: , , , and .
For a given time step , let denote the event that . By the construction of protocol , which always requires at least one agent of type or type to be involved in an interaction which creates or destroys an agent of type , we have:
[TABLE]
Moreover, from Lemma 14 it follows that for :
[TABLE]
Since , we have:
[TABLE]
and moreover . Analysis of this type of process is folklore (in the context of epidemic models with infection and recovery) but somewhat tedious; we sketch the argument for the sake of completeness. When considering only those steps for which event holds, the considered process can be dominated by a lazy random walk on the line , with a constant bias towards its right endpoint. To facilitate analysis, we define points , for , where constant is subsequently suitably chosen, and for any point to the right of the starting point of the walk (i.e, where is the smallest integer such that ) define as the number of steps of the walk until its first visit to . For a suitable choice of constants and sufficiently large, we have that for any , with probability at least , , and moreover between its step and its step , the walk is confined to the subpath of the considered path. Considering the original time of our process (including moments with ), let be the moment of time corresponding to the -th step of the walk. Conditioning on events which hold with probability , the value can be stochastically dominated by the sum of independent geometrically distributed random variables, each with expected value . Let be the largest positive integer such that . Applying a union bound on the conditioning of all intervals , for and a concentration bound on the considered geometric random variables, we eventually obtain that with probability the condition is achieved for time:
[TABLE]
Recalling that holds throughout the considered time interval with very high probability, the claim follows. â
An iterated application of Lemmas 13 and Lemmas 15 moves the process along time moments , , , where time moment is again be fed to Lemma 13, considering the succeeding value of . After a threefold application of both Lemmas, the process has w.h.p. in steps either performed a complete rotation, passing through three moments of time designated as ââ, rotated by one third of a full circle, or has reached at some time a point with . In either case, the claim of Theorem 1(2) follows directly.
7 Analysis of Protocol for Detect
7.1 Further Properties of the Oscillator
We start by stating a slight generalization of Lemma 6, capturing the expected change of potential (given by (21)) for the case \text{{}^{#}}\!\!X>0, for configurations which are sufficiently far from both the center and the sides of the triangle.
Lemma 16**.**
In any configuration with and we have: , where is an absolute constant which depends only on and .
Proof.
We can condition the expectation of on the event , which holds if an agent in state participates in the current interaction. Conditioned on , the analysis corresponds directly to the computations performed for the case , where we remark that the assumptions of Lemma 6 are satisfied due to the assumed upper bound on . Thus:
[TABLE]
Next, taking into account the lower bound on , by exactly the same argument as in Lemma 7(), we have /n, for some choice of constant which depends only on and . Obviously,
[TABLE]
and since , by the law of total expectation:
[TABLE]
where the last inequality holds for any , where . â
Lemma 17**.**
Suppose at some time . Then, there exists an absolute constant , such that the following event holds with probability : for all , we have .
Proof.
Let . Consider any such that . Then, and consequently:
[TABLE]
where we recall that and the last inequality follows for chosen to be sufficiently small ().
Further, note that if for some time we have , then:
[TABLE]
thus , from which it follows that , and so .
Thus, for , at least one of the following holds:
- âą
Either ,
- âą
Or , thus . Then, taking into account that , we have by Lemma 16: .
Taking into account the known properties of function (Lemma 7), we have that starting from , it takes time exponential in a polynomial of (, for some absolute constant ) to break the potential barrier for , i.e., to reach the first moment of time such that , with probability , for some absolute constant . To complete the proof, recall that for any , we have , and so as previously established, . â
For any execution of the oscillator protocol , we can now divide the axis of time into maximal time intervals of two types, which we call oscillatory and central. A central time interval continues for as long as the condition is fulfilled, and turns into an oscillatory interval as soon as this condition no longer holds. An oscillatory time interval continues for as long as the condition is fulfilled, and turns into an oscillatory interval as soon as this condition no longer holds. Lemma 17 implies that an oscillatory interval is of exponential length w.v.h.p.
Lemma 18**.**
Suppose . Let be an arbitrary moment of time such that . Let , for an arbitrarily fixed constant positive integer . With probability , we have for all subsequent moments of time :
[TABLE]
Proof.
Without loss of generality assume . We can assume in the proof that , otherwise the claim trivially holds. Thus, initially we have .
We proceed to show that potential does not decrease much during the considered motion. We have at any time , which follows directly from (5).
Suppose at some time we have . We note that . Applying Lemma 1, we have under these assumptions:
[TABLE]
and moreover by the properties of the natural logarithm (cf. e.g. [15]):
[TABLE]
As usual, we apply a Doob martingale, with until the first moment of time such that , and subsequently for larger . We have and .
Considering steps of the process starting from time [math], by a standard application of Azumaâs inequality, we obtain that with probability , for all we have:
[TABLE]
From the last inequality it follows that for all , with probability , and so . We now rewrite the same bound for , using the relation :
[TABLE]
Taking into account that for , we obtain the claim. â
Lemma 19**.**
Suppose . Fix type . Let be any time such that . Let be the first moment after such that is the most represented type, . Let be the first moment after such that is the least represented type, .
Then, with probability , , where is an absolute constant depending only on and .
Proof.
From Lemma 17, we have that w.v.h.p.â the protocol is in an oscillatory interval which will last super polynomial time, i.e., with probability : for all , we have . Acting as in the previous subsection, we iteratively apply Lemma 13 and Lemma 15. After at most 6 applications of both Lemmas, the process has performed two complete rotations around the triangle, w.h.p., passing in particular through time moments where the designated type was a maximal type and where type was a minimal type. It remains to bound the time required to perform these iterations. We consider as an example a single application of Lemma 15 starting at a time and ending at a time , where with probability . Applying Lemma 18 at time , we obtain , w.v.h.p. We use this bound for the next application of Lemma 13, and so on. After a total of at most 12 applications, we eventually obtain a bound of the form on the length of the considered time interval, where the value of is computed as a function of . â
7.2 Protocol Extension : Majority
The composition of the extension is specified in Fig. 4. In what follows, we denote M^{(s)}_{i}:=\text{{}^{#}}\!\!(A_{i}^{?},M_{s}) and , for .
Lemma 20**.**
Suppose . Let be an arbitrarily chosen moment of time and let . For fixed , we have for all :
[TABLE]
with probability .
Proof.
W.l.o.g. assume and . Denote A_{0}:=\text{{}^{#}}\!\!A_{1,0}, , and . As usual, we denote and . For the subsequent analysis, we choose to use the âsquared potentialâ to simplify considerations; this would be the usual potential of choice to analyze an unbiased random walk with a fair coin toss.
First we remark that , so . Next, observe that since at most one agent changes its state in a single time step, we have , and so:
[TABLE]
We now upper bound the expectation . We condition this expectation on the disjoint set of events , where , for , corresponds to Rule being executed in the current step, and is the event that none of these rules is executed. We have the following:
- âą
If event or holds, then at least one of the following three situations occurs: (1) the values of and both remain unchanged at time , (2) an agent changes state from type to another type, or (3) an agent turns from another type into type . In case (1), we have . In case (2), the probability that is not more than the probability that , since by the construction of the protocol, the choice of the agent leaving the population is completely independent of its value . In case (3), we have by construction. In all cases, . We therefore have:
[TABLE]
- âą
For events and , we have and . Since events and hold with equal probability, it follows that:
[TABLE]
- âą
Finally, for events and , we have and . Since
[TABLE]
where we assume in notation that .
Applying the law of total expectation for over the set of events and noting that , we eventually obtain:
[TABLE]
Inequalities (34) and (35) are sufficient to lower-bound the evolution of random variable , which undergoes multiplicative drift with rate parameter (up to lower order terms). Since known multiplicative drift lower bounds (cf.e.g. [17, 33]) do not appear to cover this case explicitly, we sketch the corresponding submartingale analysis (with slightly weaker parameters) for the sake of completeness.
Consider any moment of time such that . Define target value . Consider the following filter, defining , as the submartingale with until the first moment of time at which , and for all subsequent moments of time. Note that by (35) and by (34) (where we conduct the entire analysis for sufficiently large with respect to absolute constants of the algorithm). By Azumaâs inequality, we have for any and :
[TABLE]
Next, choosing any and and noting that:
[TABLE]
we rewrite the concentration equality as:
[TABLE]
Applying to the above a union bound over all , we obtain by another crude estimate:
[TABLE]
from which it follows directly by the definition of that:
[TABLE]
Thus, given that , the value of increases by a factor of at most over steps, with very high probability. Iterating the argument at most a logarithmic number of times and applying a union bound gives for arbitrary :
[TABLE]
with probability at least , from which the claim of the lemma follows directly after taking the square root and normalizing by a factor of . â
By considering the sizes of populations , , and (whose sum is ), we obtain the following corollary of the above Lemma, applied for a suitably chosen value .
Lemma 21**.**
Suppose . Let be an arbitrarily chosen moment of time with . For fixed , we have for all :
[TABLE]
with probability . â
The above Lemma provides a crucial lower bound on the size of population .
Lemma 22**.**
Suppose . Let be an arbitrarily chosen moment of time with . For fixed , we have for all such that :
[TABLE]
with probability .
Proof.
Note first that by the conditions . Let denote the last moment of time before such that and let denote the first moment of time after such that . We have .
We now consider the process (exactly the same arguments may be applied for process ). We have . The analysis is divided into two phases:
- âą
Phase 1: (thus ). Initially, . Suppose for some step we have . Rule (6) is executed with probability , whereas rule (8) or (9) which reduces is executed with probability at most . A computation of the expected value provides:
[TABLE]
An application of Azumaâs inequality yields that , with probability . In case of failure, we consider the process no further.
- âą
Phase 2: (thus ). From Phase 1, we have that initially . Suppose for some step we have . We consider two cases:
- â
If , then by Lemma 21 we have:
[TABLE]
with probability . In case of failure we interrupt the analysis (this is an implicit application of union bounds over successive steps ). Under the assumption , we conclude:
[TABLE]
and hence:
[TABLE]
Now, to compute the expected value , we remark that rule (6) does not decrease this expected value since . Moreover, in view of (36), the probability of executing rule (9) (which increases by ) exceeds the probability of executing one of the rules (7) or (8) (which decrease by ) by by the assumption . We eventually obtain in this case:
[TABLE]
- â
If , then assuming , we can perform an analogous analysis as in the first phase to obtain:
[TABLE]
which, in particular, also implies (37).
We have thus shown that the expected change to satisfies (37). Noting that initially , an application of Azumaâs inequality to an appropriate Doob martingale with (37) shows that the event will hold for all remaining steps of the process , with probability .
â
Lemma 23**.**
Let be any moment of time with . For all , we have:
[TABLE]
with probability .
Proof.
Denote . To show the claim, observe that necessarily for all we have . We consider the change of value over time (the argument for follows symmetrically). Initially, we have , and at every step . At any time such that we have the following cases:
- âą
Rule (6) is executed, which happens with probability at least . Since , conditioned on this event, the expected value of is at least .
- âą
One of the rules (7)-(10) is executed, which occurs with probability at most .
- âą
In all other cases, we have .
Noting that , the claim follows from a standard application of Azumaâs inequality. â
Lemma 24**.**
Suppose . Let be an arbitrary moment of time. Then, , with probability , for some absolute constant depending only on , , and .
Proof.
Assume w.l.o.g. . Instead of analyzing the evolution of the real system, we consider an execution of a system which is coupled with it over the first steps as follows. First, starting from time [math], we perform steps of protocol (i.e., considering only rules of its definition, and without setting values of the second component ). Next, we once again activate the pairs of elements which were activated in the first part of the coupling, in the same order, applying rules of the protocol with the same outcome which they would have received in the original execution. Clearly, at time the same configuration is reached by both the original and coupled execution.
Consider first the execution of from time [math]. If , then the execution is in an oscillatory interval at time [math], and will remain in it () until time with probability . Then, for all we assume that the claim of Lemma 19 holds with for all . By a crude union bound, this event holds with probability ; from now on we assume this is true. (Formally, to allow us to proceed, in the analysis we can say with implicitly couple the system with a different set of random choices to which the system switches in the low-probability event that the claim of Lemma 19 were not to hold for some for the original system.) Given that in the claim of Lemma 19 for we have , and for we have , by the properties of the time intervals we observe that there must exist a time and a type such that is the least represented type at time and the most represented type at time ( and ), and moreover in the claim of Lemma 19 with choice of . Since , we now apply Lemma 22 with to obtain the claim, noting that and moreover that , given that we have , where is a constant depending only on and , and noting that we can choose .
It remains to consider the cases when the execution starts at time [math] with . Then, if holds, the claim follows from Lemma 23 given that is chosen so that . Otherwise, there must exist some last time such that . We apply Lemma 19 with . If the obtained value satisfies , then we can apply an analogous analysis as in the case to obtain the claim. Otherwise, we have that , where the value of constant , depending only on and follows from Lemma 19. By an iterated application of Lemma 18, we obtain that , where the value of constant , depending only on and , follows from the application of Lemma 18. Choosing sufficiently small so that , we complete the proof using Lemma 23. â
Finally, for the sake of completeness we state how the majority protocol stops in the case of \text{{}^{#}}\!\!X=0.
Lemma 25**.**
Suppose . Then, there exists a moment of time such that either or holds for all . Moreover, with probability , for some absolute constant depending only on , , and .
Proof.
By Theorem 1(1), there exists a moment of time such that the system reaches a corner configuration (cf. Lemma 11). W.l.o.g., assume that and . At this point, in the majority protocol Rule (7) will never again be activated, whereas the execution of rules (8)-(11) follows precisely the classical majority scenario of Angluin et al. [7]. By a standard concentration analysis (see also [7]), one of the two species will become extinct in steps with probability . â
As a side remark on Lemma 25 that it is possible to initialize the system entirely with a state so that holds throughout the process (even if the designed protocol will never enter such a configuration from most initial configurations).
7.3 Protocol Extension : Detection with Lights
To complete the proof of Theorem 3, we design a protocol extension , such that Detection is solved by the composition . Extension , uses three states, . We informally refer to states as lights. The composition is given in Fig. 5. Informally, state means that the agent is âwaiting for meeting â, then after meeting it becomes , âwaiting for â and finally it becomes .
To analyze the operation of the protocol, consider first the case of . By Lemma 25, after steps, agents in at least one of the states are permanently eliminated from the system. Thus, either rule (11) or rule (12) will never again be executed in the future. An agent which is in state will spontaneously move to another state following rule (13) within steps, with probability , and will never reenter such a state, since this would require the activation of both rule (11) and rule (12). By applying a union bound over all agents, we obtain that state never again appears in the population after steps from the termination of the majority protocol, with probability . Overall, all nodes reach a state having a different state than after steps from the start of the process, with probability , and all leave such a state eventually with certainty.
In the presence of the source , the analysis of the process can be coupled with a Markov chain on three states , , and . In view of Lemma 24, transitions from state to and from state to to occur with at least constant probability (except for an -fraction of all time steps), this 3-state chain is readily shown to be rapidly mixing. For a choice of depending only on sufficiently small, we can lower-bound the number of agents occupying state by , with high probability.
Under the natural decoding of states as âinformedâ (having component ) or âuninformedâ (having component or ), the proof of Theorem 3 is complete. We remark that it is also possible to design a related protocol in which exactly one state is recognized as âinformedâ and exactly one state is recognized as âuninformedâ; we omit the details of the construction.
8 Proof of Impossibility Result
This Section is devoted to the proof of Theorem 4. First, we restate some notation. We recall that the vector describes the number of agents having particular states, and . In this section we will identify the set of states with . It is now also more convenient for us to work with a scheduler which selects unordered (rather than ordered) pairs of interacting agents; we note that both models are completely equivalent in terms of computing power under a fair random scheduler, since selecting an ordered pair of agents can be seen as selecting an unordered pair, and then setting their orientation through a coin toss. Indexing with integers the set of all distinct rules of the protocol, where , for a rule , , , we will denote by , the probability (selected by the protocol designer) that rule is executed as the next interaction rule once the scheduler has selected as the interacting pair, and by the probability that is the next rule chosen in configuration (we have , where the factor compensates the property of a scheduler which always selects a distinct pair of elements).
For any configuration , we define the -box around as the set of all states such that , for all . We start the proof with the following property of boxes.
Lemma 26**.**
Fix and let be arbitrarily fixed. There exists , , such that, for any interaction protocol with states and any configuration , there exists a value such that, for any rule of the protocol, , exactly one of the following bounds holds:
* for all , ,* 2. 2.
* for all , .*
and for any state , , exactly one of the following bounds holds:
* for all , ,* 2. 2.
* for all , .*
Proof.
Let be fixed and let , where . Consider the (multi)set of real values . Since , by the pigeonhole principle, there must exist an interval , for some , such that . Now, we set , we also have . We immediately obtain that for any state , , we either have or . Recalling that for any , , claims and follow.
To show claims and , notice that if rule , is such that , then for all we have (by (iii) and (iv)), and so by the properties of the random scheduler. Otherwise, we have , and so , where we recall that . Since we have or , claims (i) and (ii) follow. â
Given any -state protocol , we will arbitrarily choose a value of for which the claim of the above Lemma holds (e.g., the smallest possible such value of ). Note that a similar analysis is also possible for protocols using a super-constant number of states in , however, then the value of is dependent on ; retracing the arguments in the proof, we can choose appropriately . (We make no effort to optimize the polynomial in in the exponent.)
In what follows, let be a fixed configuration of the protocol (admitting a certain property which we will define later). We will then consider a rule to be a low probability (LP) rule (writing ) in box if it satisfies condition of the Lemma, and a high probability (HP) rule in this box (writing ) if it satisfies condition . Note that .
Likewise, for , we will classify as a low-representation (LR) state (writing ) in box if satisfies condition of the Lemma, and a high representation (HR) state (writing ) in this box if satisfies condition . Note that . Moreover, we define a set of very high representation (VHR) states, , as the set of all such that for all , . Denoting , we have by the definition of a box that for all , for all : .
From now on, we assume that configuration admits the following property: for , an execution of the protocol starting from configuration passes through a sequence of configurations , , such that the configuration does not leave the box around in any step with sufficiently large probability, lower-bounded by some absolute constant :
[TABLE]
where is an arbitrarily fixed subset of .
We now show that the above property has the following crucial implication: for an interacting pair involving selected high and very high representation states, a rule creating a low representation state can only be triggered with sufficiently small probability. Informally, it seldom happens that in the protocol a low representation state is created out of any high representation state.
Lemma 27**.**
For a protocol having the property given by Eq. (38), for and , let be the set of rules of the form , taken over all . Then, .
Proof.
Suppose, by contradiction, that .
Associate with process a random variable , defined as follows. For all , where is the first moment of time such that , we put if a rule from is used for the interaction made by the protocol in process at time , and set otherwise. For all , we set to . We have ; indeed, for , it holds that:
[TABLE]
By a simple stochastic domination argument, can be lower-bounded by a sequence of independent binomial trials with success probability , hence by an application of a multiplicative Chernoff bound for :
[TABLE]
where the factor is exponentially small in .
We now show the following claim.
Claim. With probability , the following event holds: for all and the total number of rule activations in the time interval during which an agent changes state from a state in to a different state is at most ).
Proof (of claim). Acting similarly as before, we associate with process a random variable , defined as follows. For all , we put if a rule acting on at least one agent in a state from is made by the protocol in process at time , and set otherwise. For all , we set to a dummy variable set always to [math], i.a.r. We observe that:
[TABLE]
since , and for , for any , hence the scheduler selects an agent from a state into an interacting pair with probability at most . Applying an analogous argument as in the case of random variable , this time for the upper tail, we obtain:
[TABLE]
The claim follows directly.
Now, by a union bound we obtain:
[TABLE]
Taking into account that holds with probability by (38), we have by a union bound that with probability at least , the following event holds: for all , , and . However then , so there must exist at time a state such that . This is a contradiction with by Lemma 26(iii). â
In the rest of the proof, we consider the evolution of a protocol starting from configuration and having property (38). We compare this evolution to the evolution of the same protocol, starting from a perturbed configuration , such that:
- (C1)
. 2. (C2)
for all low representation states , we have .
Intuitively, the perturbed state may correspond to removing a small number of agents from (and replacing them by high representation states for the sake of normalization), e.g., as in the case of the disappearance of a rumor source from a system which has already performed a rumor-spreading process.
Our objective will be to show that, with probability at least , after the process is still not far from , being constrained to a box in a similar way as process . To achieve this, we define a coupling between processes and (knowing that process is constrained to a box around with probability ). Informally, the analysis proceeds as follows. We run the processes together for steps. In most steps, the 1-norm distance between the two processes remains unchanged, without exceeding . Otherwise, exactly one of the two processes executes a rule (and the other pauses). With a frequency of roughly steps (i.e., roughly times in total during the process), an LP rule is executed which increases the distance between these two states. We think of this type of âerrorâ as unfixable, contributing to the distance of the processes; however, such errors are relatively uncommon. With a higher frequency of roughly steps (i.e., roughly once every steps), a less serious âerrorâ occurs, when some HP rule increases the distance between the two states. The rate of such errors is too high to leave them unfixed, and we have a time window of about steps to fix such an error (before the next such error occurs). We observe that since is an HP rule, which is activated with probability at least , rule will still be activated frequently during this time window. The coupling of transitions of states and is in this case performed so as to force the two processes to execute rule lazily, never at the same time. The number of executions of rule in the ensuing time window by each of the two processes follows the standard coupling pattern of a pair of lazy random walks on a line, initially located at distance , until their next meeting (cf. e.g. [2]). During this part of the coupling, we allow the distance to increase even up to (as a result of executions of rule ), but the entire contribution to the distance related to rule is reduced to [math] before the next HP rule âerrorâ occurs, with sufficiently high probability (in this case, with probability . Overall, the coupling is successful with probability .
We remark that we use the bound on the number of states to enforce a sufficiently large polynomial separation between the frequencies of LR states and HR states, and likewise for LP rules and HP rules. We also implicitly assume that , throughout the process. The analysis also works for a choice of , with a sufficiently small hidden constant. The separation between LR/HR states and LP/HP rules is used in at least two places in the proof. First, it enforces that rules creating LR states from VHR states may appear in the definition of the protocol only with polynomially small probability (Lemma 27), which helps to maintain over time the invariant , for all LR states. Secondly, we use the separation of LP/HP rules in the analysis of the coupling to show that a fixable âerrorâ caused by a HP rule can be sufficiently quickly repaired, before new errors occur.
In the formalization of the coupling, we make both processes and lazy, i.e., add to each process an additional independent coin-toss at each step, and enforce that with probability no rule is executed in a given step (i.e., the step is skipped by the protocol). We assume a random scheduler which picks uniformly a random pair of nodes at each step. Thus, if the scheduler picks a pair of agents in states , and is a rule acting on this pair of states, the probability that the interaction corresponding to rule will be . (The laziness of the process here is a purely technical assumption for the analysis, and corresponds to using a measure of time which is scaled by a factor of w.h.p.; this does not affect the asymptotic statement of the theorem.)
We will also find it convenient to apply an auxiliary notation for representing the evolution of a state. For process (resp., , we define (resp. ), for all , as the number of times rule has been executed since time [math]. Observe that the pair completely describes the evolution of a state (i.e., the order in which the rules were executed is irrelevant). Moreover, since each execution of a rule changes the states of at most agents, we have:
[TABLE]
Definition of the coupling.
At each step , we order the agents of configurations and , so that denotes the type of the -th agent in and is the type of the -th agent in . The orderings are such that is maximized; in particular, for any state such that (respectively, ) we have that if for some , (resp., ), then (resp., ). 2. 2.
The scheduler then picks a pair of distinct indices as the pair of interacting agents.
- 2.1.
If and , then the same rule acting on the pair of states is chosen as the current interaction rule, with probability . 2. 2.2.
Otherwise, a pair of (clearly distinct) rules and are picked independently at random for and from among the rules available for state pairs and , with probabilities and , respectively. 3. 3.
The processes finally perform their coin tosses to decide which of the selected rules ( for and for ) will be applied in the current step.
- 3.1.
If and rule has been executed exactly the same number of times in the history of the two processes (), then with probability both of the processes execute rule , and with probability neither execute their rule. 2. 3.2.
If , or if and rule has been executed a different number of times in the history of the two processes (), then exactly one of the two processes performs its chosen rule and the other process waits, with the process performing the rule being chosen as or , with probability each.
The correctness of the coupling (i.e., that the marginals and each correspond to a valid execution of the given protocol under a random scheduler) is immediate to verify.
Lemma 28**.**
Let be a process satisfying property (38), and let satisfy conditions (C1) and (C2). Then, for , with probability we have , for some .
Proof.
To prove the claim, it suffices to show that with probability the provided coupling succeeds, i.e., it maintains a sufficiently small difference for all states , with .
In the analysis of the provided coupling, we will assume that the box condition holds always throughout the process (otherwise, we assume the coupling does not succeed). To state this formally, we work with auxiliary processes and , given as and for all , where is the first moment of time such that , and set to the dummy value for all . At the end of the process, we will thus have and with probability at least . In the following, we silently assume that (in particular, that and ), and we will simply show that the coupling of and is successful with probability . The condition of is trivially handled.
In addition to the box condition (which is now enforced) we try to maintain, with sufficiently high probability, throughout the first steps of the process several invariants (all at a time), corresponding to the following events holding:
- âą
: for all states , . (LR domination condition)
- âą
: for all states , . (LR state condition)
- âą
: for all rules , . (LP rule condition)
- âą
: for all states , . (HR state condition)
- âą
: for all rules , . (HP rule condition)
- âą
: for all states , . (HRâ state condition)
- âą
a family of possible events , for some and , with specific events defined as follows:
- â
holds if for all rules we have , and . This implies, in particular, . (identical rate of HP execution)
- â
for holds if there exists a rule such that for all rules we have , , and moreover . This implies, in particular, . (single HP execution difference)
We will call the coupling successful if for all , all events and some event holds, and we will say it is a failure otherwise. (We remark that condition is implied by condition , but we retain both for convenience in discussion.)
The analysis of the coupled process is now the following. First, we remark that all of the given events and event hold for .
If the process meets condition at time and all conditions , then we have the following:
- âą
With probability at least , the coupling will follow clauses 2.1 and 3.1 of its definition, and the two processes and will execute the same rule (or both pause). Hence, we continue to step satisfying condition and all of the conditions , making use of the box condition for process . (We note that, to show , when considering the special case of a rule involving a state from , we can make use of and note that the activation probability of such a rule is bounded by due to the bound on the population of a LR state).
- âą
With probability at most , the coupling will, however, select distinct rules, for and for , and will select exactly one of them to execute, say .
- â
If , which happens in the current step of the process with probability at most by , then the event will hold in the next step (provided ; otherwise, if , we will say that the coupling has failed).
- â
If , which happens in the coupling with probability (as bounded due to clause 2.2), then the event will hold in the next step. The condition requires more careful consideration. Taking into account that holds, we need to consider two cases: either and the rule applied to changed at least one of the two interacting states , say , so that and , or and the rule applied to created a pair of states , say . In the first case, by the description of the ordering given in clause 1 of the definition of the coupling, the problem occurs only if one of the agents picked by the scheduler belongs to an state, and the other agent is at a position in which the states of and differ in the ordering of the agents; hence, the probability that the coupling fails at this step is at most . In the second case, we likewise analyze the ordering of the agents considered by the scheduler, and note that the interacting agent, which belongs to the part of the ordering in which and differ, must be in a HR state, since the agents in a LR state in are matched by their counterparts in (as noted in clause 1 of the discussion of the coupling). If the other interacting agent is in a state from , then such an event occurs with probability , and we say that with this probability the coupling has failed. Finally, if the other interacting agent is in a state from , then by Lemma 27, we have that the probability of picking a rule under which the coupling fails is at most , conditioned on the event holding, hence overall the probability of failure is . Overall, we obtain that holds with probability . Given , , and the box condition, the remaining conditions follow directly.
Overall, we obtain that following a time satisfying and all conditions , we reach the following successor state (see Fig. 8):
[TABLE]
At this point, before proceeding further, we can provide some intuition on the meaning of the respective events . The coupling process can be seen as a walk along the path ), starting from state , and at each step, either staying in the current state , moving on to the next state , branching to a side branch (which we will analyze later), or failing. The process also fails if it reaches the endpoint of its path (). Since the process is run for steps, the probability that failure will occur before the end of the path is reached is , and the probability of reaching the end of the path and failing is exponentially small in by a Chernoff bound (in expectation, the process will progress halfway along the path). Hence, we have that the process succeeds with probability , or otherwise may fail in a side branch .
A side branch is entered with probability . To show that the coupling succeeds with the required probability, it suffices to show that we return from any state to state with probability at least ; then, all (i.e., w.h.p. at most ) excursions into side branches during the process will succeed with probability .
Consider now an excursion into a side branch () associated with a rule , which has been executed a different number of times in and . Now, if the process meets condition at time and all conditions , then we have the following:
- âą
With probability at least , the coupling will follow clause 2.1 of its definition, selecting a single rule .
- â
If , then clause 3.1 will follow, and the two processes and will execute the same rule (or both pause). Hence, at time , all of the conditions and condition is satisfied.
- â
Else, the event occurs. The probability of such an event is denoted (due to the conditioning performed in the first clause of the coupling); since by the box condition for HP rules, it follows that . Now, following clause 3.2 of the coupling, depending on which of the two processes , is chosen to execute the rule, with probability the system moves to , and with probability the system moves to (unless , in which case the coupling has failed). As before, given there was no failure, all conditions are readily verified to be satisfied in the new time step.
- âą
With probability at most , for simplicity of analysis we assume the coupling has failed.
This time, for a time satisfying for and all conditions , we obtain the following distribution of successor states:
[TABLE]
The picture here corresponds to a lazy random walk along the side line for , with an additional failure probability at each step. The walk starts at and ends with a return to the primary line if the endpoint is reached, or ends with failure if the other endpoint is reached. At each step, the walk is lazy (with probability of transition depending on the current step), but unbiased with respect to transitions to the left or to the right. Assuming that failure does not occur sooner, with probability the walk will reach point in moves (transitions along the line), without reaching the other endpoint of the line sooner. Since a move is made in each step with probability , by a straightforward Chernoff bound, the number of steps spent on this line is given w.h.p. as at most . As the probability of failure in each of these steps is , the probability that the process fails during these steps is . Overall, by a union bound, we obtain that the process successfully returns to with probability (and within steps). In view of the previous observations, we have that with probability , all conditions and some condition hold at time . Thus, with probability , process is sufficiently close to , i.e., there exists a point such that . â
9 Proof of Proposition 2
Proof.
Fix protocol with set of states , in which the minimum positive probability of executing some rule is . Let , be any minimal subset of the set of states such that no evolution of protocol starting in a configuration containing only states from set will ever contain an agent in a state outside . Denote . Consider an initialization of protocol at time , at a configuration with and with all other states from represented by the same number of agents, i.e., for each , we have .
Let be an arbitrarily chosen time step. Let . Fix as any state such that (we can, e.g., fix as the state from having the most agents at time ). Observe that from the minimality of it follows that there must exist a sequence of states , with , such that in the definition of protocol , for all , some rule of protocol creates at least one agent (i.e., either 1 or 2 agents) in state from an interaction of either the pair of agents in states or the pair of agents in states , for some . (Indeed, if for some there was no possibility of choosing in any way, then would be closed under agent creation, contradicting the minimality of the choice of .) Now, we consider intervals of time steps , with for . We make the following claims:
- (1)
Fix . If , then for all , with probability . Indeed, in a sequence of steps, the expected number of agents which do not participate in any interaction in the time interval following the asynchronous scheduler is , and thus the number of non-interacting agents is at least , with probability following standard concentration bounds for the number of isolated vertices in a random graph on nodes with edges. Since the choice of agents by the scheduler is independent of their state, and the probability for a uniformly random agent to be in state at time is , a simple concentration bound shows that having state at time do not participate in any interaction in the interval .
- (2)
Fix . Denote . If , then , with probability . Indeed, consider the value such that the interaction or creates an agent in state . At any time within the interval , we have by Claim (1) that and , with probability . It follows that an interaction creating a new agent in state is triggered with probability at least at each step. The number of agents in state at time step may thus be dominated from below by the number of successes in a sequence of Bernoulli trials with success probability , and the claim follows.
By applying Claim (2) iteratively for , where we note that , we have , with probability (through successive union bounds). Then, applying Claim (1) up to time , we have , with probability . Applying once again a union bound, we have shown that for all , we have , with probability . The claim of the lemma follows for a suitable choice of .
â
Acknowledgment.
We sincerely thank Dan Alistarh and Przemek UznaĆski for inspiring discussions, and Lucas Boczkowski for many detailed comments which helped to improve this manuscript.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. A. Abdullah and M. Draief. Majority consensus on random graphs of a given degree sequence. Co RR , abs/1209.5025, 2012.
- 2[2] D. Aldous and J. A. Fill. Reversible markov chains and random walks on graphs, 2002. Unfinished monograph, recompiled 2014, available at http://www.stat.berkeley.edu/~aldous/RWG/book.html .
- 3[3] D. Alistarh, J. Aspnes, D. Eisenstat, R. Gelashvili, and R. L. Rivest. Time-space trade-offs in population protocols. In Proc. Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2017, Barcelona, Spain , pages 2560â2579, 2017.
- 4[4] D. Alistarh, J. Aspnes, and R. Gelashvili. Space-optimal majority in population protocols. (To appear, SODA 2018.) Co RR , abs/1704.04947, 2017.
- 5[5] D. Alistarh, B. Dudek, A. Kosowski, D. Soloveichik, and P. Uznanski. Robust detection in leak-prone population protocols. In DNA , volume 10467 of Lecture Notes in Computer Science , pages 155â171. Springer, 2017.
- 6[6] D. Angluin, J. Aspnes, Z. Diamadi, M. J. Fischer, and R. Peralta. Computation in networks of passively mobile finite-state sensors. Distributed Computing , 18(4):235â253, 2006.
- 7[7] D. Angluin, J. Aspnes, and D. Eisenstat. A simple population protocol for fast robust approximate majority. Distributed Computing , 21(2):87â102, 2008.
- 8[8] D. Angluin, J. Aspnes, D. Eisenstat, and E. Ruppert. The computational power of population protocols. Distributed Computing , 20(4):279â304, 2007.
