Stochastic versus deterministic consensus dynamics on graphs
Dylan Weber, Ryan Theisen, Sebastien Motsch

TL;DR
This paper compares stochastic and deterministic opinion formation models on graphs, analyzing how graph structure influences long-term consensus emergence and convergence rates through theoretical and numerical methods.
Contribution
It provides a comprehensive analysis of both models' convergence conditions and highlights key differences in their dynamics based on graph topology.
Findings
Necessary and sufficient conditions for consensus in both models
Distinct convergence rates between stochastic and deterministic models
Graph structure critically influences long-term opinion dynamics
Abstract
We study two agent based models of opinion formation - one stochastic in nature and one deterministic. Both models are defined in terms of an underlying graph; we study how the structure of the graph affects the long time behavior of the models in all possible cases of graph topology. We are especially interested in the emergence of a consensus among the agents and provide a condition on the graph that is necessary and sufficient for convergence to a consensus in both models. This investigation reveals several contrasts between the models - notably the convergence rates - which are explored through analytical arguments and several numerical experiments.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOpinion Dynamics and Social Influence · Complex Network Analysis Techniques · Evolutionary Game Theory and Cooperation
Deterministic versus stochastic consensus dynamics on graphs
Dylan Weber Electronic address: [email protected] School of Mathematical and Statistical Sciences, Arizona State University, Tempe
Ryan Theisen Electronic address: [email protected] Department of Statistics, University of California, Berkeley
Sebastien Motsch Electronic address: [email protected] School of Mathematical and Statistical Sciences, Arizona State University, Tempe
(Dated: March 1, 2024)
Deterministic versus stochastic consensus dynamics on graphs
Dylan Weber Electronic address: [email protected] School of Mathematical and Statistical Sciences, Arizona State University, Tempe
Ryan Theisen Electronic address: [email protected] Department of Statistics, University of California, Berkeley
Sebastien Motsch Electronic address: [email protected] School of Mathematical and Statistical Sciences, Arizona State University, Tempe
(Dated: March 1, 2024)
Abstract
We study two agent based models of opinion formation - one stochastic in nature and one deterministic. Both models are defined in terms of an underlying graph; we study how the structure of the graph affects the long time behavior of the models in all possible cases of graph topology. We are especially interested in the emergence of a consensus among the agents and provide a condition on the graph that is necessary and sufficient for convergence to a consensus in both models. This investigation reveals several contrasts between the models - notably the convergence rates - which are explored through analytical arguments and several numerical experiments.
Contents
1 Introduction
Mathematical models involving dynamics defined on network structures have long been objects of theoretical interest. The rise of social networks as the ubiquitous forum for the exchange of information in our society and the meta data associated with these networks clearly demonstrates the need to analyze dynamics on networks. Such models are often posed in an agent based framework where the potential for agents to interact is encoded in a graph; models of opinion formation are especially amenable to such a framework and a considerable amount of research has been done in this context [29, 36, 27, 11, 19]. Many of the dynamics studied are deterministic in nature [34, 6, 5, 19], however the random nature of information exchange among humans strongly motivates the study of similar models in a stochastic setting [14, 1, 26]. In this paper we investigate two models of opinion formation, one stochastic and one deterministic with special attention paid to the emergence of consensus - when the opinions of all agents agree.
A hallmark of the agent-based approach is the investigation of how locally defined interactions affect global behavior observed among the entire population of agents. In the context of opinion formation, local interactions are determined by the underlying network structure; therefore the investigation of the long time behavior of the dynamics translates to understanding how the topology of the underlying graph affects the distribution of opinions among the agents. Of particular interest is how the interplay between the local “rules of engagement” and the structure of the graph affect the emergence of a consensus among the agents [28, 33]. Deterministic models often carry the generic assumption that the opinion of a given agent is continuously influenced by those to whom it is connected in the graph. These connections can be considered to be static or changing often depending on the state of agents. The manner in which influence is exerted is usually globally defined - most models carry the assumption of local consensus i.e. if agents interact they agree in some sense. As the analysis of these dynamics can often be quite delicate, much of the analytical work done in the past uses assumptions on the graph such as symmetry of connections or connectedness of the graph [33, 34, 37, 29]. However, in many cases the dynamics are too complex for analytical proof and main characteristics of the dynamics are identified through numerical simulation [10, 5, 28, 13]. A main takeaway in deterministic models is that a sufficient condition for consensus is the persistence of a suitable amount of connectivity in the underlying graph throughout the evolution of the dynamics [37, 32, 21].
To incorporate stochastic behavior into opinion dynamics, several strategies are available. For instance, given a deterministic model, opinions can be perturbed with white noise [30, 10] to model uncertainty in the agent. Additionally the edges of the interaction graph might change randomly (possibly in a state dependent manner) to model uncertainty in agent interaction [35, 14, 12, 5]. The case where the interaction graph remains fixed, and interactions among neighbors are randomized is especially well studied as the classical voter model falls into this class. In this case assumptions on the structure of the graph are usually made - often that it is a regular lattice. The stochastic nature of these models causes their analysis to require a different set of mathematical tools, most frequently from the theory of Markov chains and martingales. Similar to the deterministic case, a sufficient degree of interaction among agents is often necessary for a consensus to emerge [25, 26, 2].
In both cases we are interested in how the structure of the graph affects the long-time behavior of the model. We are particularly interested in conditions on the graph that cause the opinion of every agent to converge to a consensus - i.e. every agent converges on the same opinion in the long-time limit. Moreover we analyze if the addition of stochasticity helps or hinders the emergence of a consensus; if consensus occurs, we are interested in how the connectivity of the graph and stochasticity affect the speed of convergence. We examine all possible graph topologies: i) undirected and connected (referred to as simple graph), ii) directed and strongly connected, and iii) directed and weakly connected or disconnected.
In the deterministic case we find that the model converges regardless of graph structure. In the case that the underlying graph is simple, we find that a consensus is reached that is the initial average of opinions. Similarly, if the graph is strongly connected, but not necessarily undirected, we again find that a consensus emerges, however it is not in general the initial average. The most challenging case occurs when the underlying network is only weakly connected. In general, a consensus does not form. However, we are able to find necessary and sufficient conditions on the structure of the graph for convergence to a consensus regardless of the initial condition: the graph needs to have only one isolated strongly connected component.
In the stochastic case we similarly find consensus in the case of a strongly connected network using markedly different techniques in the arguments. However in contrast to the deterministic model, the stochastic model does not converge in general in the case of a weakly connected graph. Despite this, we find that the condition that the graph have exactly one isolated strongly connected component prevails and is equivalent to convergence to a consensus in the stochastic case as well. We explore this link between the models further as we find that the deterministic dynamics can be recovered in expectation from the stochastic dynamics.
We discuss speed of convergence for both the deterministic and stochastic models and quantify it in terms of the algebraic connectivity of the graph, also known as the Fielder number[17, 23, 24], which also happens to be the spectral gap of the operator associated with consensus dynamics. The algebraic connectivity measures in a sense, how well connected the graph is and we find generally that better connection in the graph results in faster convergence of the models. However, we find a contrast between the models as the convergence rate depends on the number of agents in the stochastic case whereas the rate in the deterministic case depends only on the connectivity of the graph. In short, the spectral gap reduces drastically in the stochastic model.
This study could be extended through the inclusion of more subtle assumptions about agent-agent interactions via non-linear dynamics. Adding nonlinearities significantly complicates the investigation of consensus as it becomes more difficult to find sufficient conditions for the graph to remain suitably connected [28]. We are also interested in examining the dynamics from a control perspective - we include a brief comment discussing how given control over the network structure, the deterministic dynamics we study can be used as an algorithm to induce consensus to any opinion in the convex hull of initial agent opinions [3]. Finally, we aim to study limiting behavior of the model as the number of agents approaches infinity to develop a global description of the dynamics through a partial differential equation derived possibly through a kinetic description of the model [15, 20].
2 Opinion dynamics: a deterministic model
2.1 Model definition and preliminary properties
We first study the following deterministic consensus model.
Definition 1**.**
Given a collection of N agents, let represent the opinion of the th agent. The consensus model (CM) is defined by the dynamics:
[TABLE]
In vector notation the dynamics can be written as:
[TABLE]
where
[TABLE]
and
[TABLE]
For the sake of clarity of presentation we restrict opinions to being one dimensional. We note that the consensus model can be easily extended for opinions in and that all results discussed will still hold with analogous proofs.
If we interpret as measuring the amount of influence agent exerts on agent , (in particular, in this interpretation if then agent does not influence agent ) then we can interpret the matrix as encoding the structure of a network on which the agents are interacting. In this interpretation, each vertex of the graph represents an agent and an edge from vertex to vertex represents that agent is exerting influence on agent directly, i.e. that . When one agent influences another directly, the influenced agent adjusts its opinion towards the influencer’s at a rate proportional to how much influence is being exerted.
Remark 1*.*
We note that is the transpose of the Laplacian matrix of the directed graph (with the convention that the diagonal entries of measure the indegree of the corresponding vertex in .)
In this paper we will study how different conditions on the matrix affect the behavior of the model. We will mostly be concerned with whether a consensus is formed among the agents, i.e. if the opinions of every agent converge to the same value.
Definition 2**.**
We say that the consensus model converges to a consensus if there exists such that:
[TABLE]
where .
We will also be interested in convergence to a consensus that is unconditional.
Definition 3**.**
The consensus model converges to a consensus unconditionally if a consensus is reached for any choice of the initial state .
Since encodes the structure of a network, the conditions we place on can be viewed as conditions on the structure of , the network on which the agents are interacting. We will show that in the case where no extra assumptions are made about that the model always converges, although not necessarily to a consensus. We find that in the case where is irreducible that the model converges unconditionally to a consensus; if we add the assumption that is symmetric we find that the opinion the agents convene on is the mean of their initial opinions. Assuming that is irreducible is equivalent to assuming that is a directed graph which is strongly connected. Adding the assumption that is symmetric is equivalent to assuming that is a connected undirected graph (often called a simple graph) where every interaction between agents is mutual. However, we find that these assumptions - while sufficient for unconditional convergence to a consensus - aren’t necessary and can be weakened. We finally present conditions on that are equivalent to unconditional convergence to a consensus and are weaker then irreducibility. In terms of these conditions are equivalent to demanding that be weakly connected with only one isolated strongly connected component. In Fig. 1, we give an illustration of these cases of study for and two examples of the time evolution of opinions are plotted in Fig. 2. Similar work including nonlinearities in the case of undirected connected graphs can be found in [33, 34].
Before investigating the cases discussed above we present some general properties of . has a special structure without making any assumptions on its coefficients.
Proposition 1**.**
The matrix is a diagonal dominant matrix, its eigenvalues satisfy and no eigenvalue is purely imaginary except zero.
Proof.
The diagonal entries of satisfy where , thus summing over each row will give zero and we deduce that is diagonal dominant. By the Gershgorin disc theorem the eigenvalues are contained in the closed discs (see Fig. 3). Thus, the eigenvalues are either [math] or have a real part strictly positive. Moreover, [math] is always an eigenvalue of as the constant vector is always an eigenvector of associated with the eigenvalue . ∎
2.2 Strongly connected networks ( irreducible)
We first study the case where is assumed to be irreducible (see Fig. 1 A and B). In terms of the network on which the agents interact, this means that is strongly connected. Intuitively we are assuming that given any two agents and , that agent exerts influence over agent (either directly or indirectly) and vice versa. Formally, in terms of , we introduce the following definitions.
Definition 4**.**
Let the set of agents be given by . We say that agent is influencing agent if there exists a path such that:
[TABLE]
and we write .
In other words, is irreducible if and only if for any we have both and .
Definition 5**.**
A graph is said to be strongly connected if for any vertices and :
[TABLE]
A graph is said to be weakly connected if for any vertices and :
[TABLE]
The information about the eigenvalues of given in Proposition 1 is not sufficient to characterize the long-term behavior of the model. We need to investigate the algebraic multiplicity of the zero eigenvalue of . Since we are assuming that is irreducible we may leverage the Perron-Frobenius theorem to deduce that this eigenvalue is in fact simple.
Lemma 1**.**
If is irreducible then the eigenvalue [math] is simple.
Proof.
By Proposition 1 we have that the real parts of the eigenvalues of are all nonnegative and there are no purely imaginary eigenvalues. We also note that 0 is an eigenvalue of L of at least multiplicity one because it is associated to . Consider the matrix where and notice that is an eigenvalue of if and only if is an eigenvalue of . Therefore, since 0 is an eigenvalue of we must have that is an eigenvalue of . If we choose such that:
[TABLE]
we have that:
[TABLE]
Therefore is the eigenvalue of largest modulus of and since is irreducible we have by the Perron-Frobenius theorem that must be simple and since the eigenvalues of are in a one to one correspondence with the eigenvalues of we must have that 0 is a simple eigenvalue of as desired. ∎
Using the properties of found above we can now exploit the following general fact from the theory of ordinary differential equations to deduce that the model converges to a consensus unconditionally in the case where is irreducible. This result will prove useful for showing the convergence of the model and studying consensus in the case of an arbitrary choice for as well.
Lemma 2**.**
Given a linear system defined by
[TABLE]
Assume has a zero eigenvalue of multiplicity with linearly independent associated eigenvectors and every non-zero eigenvalue of satisfies . Then
[TABLE]
*where is in the center subspace of . *
Proof.
See Appendix A.1. ∎
Since we have shown that in the case where is irreducible that its zero eigenvalue is simple, we obtain immediately the convergence of the consensus model to a consensus in this case.
Theorem 1**.**
If is irreducible then the consensus model converges to a consensus unconditionally.
Proof.
By Proposition 1 we know that the eigenvalues of of satisfy Re. By Lemma 1 we know that [math] is a simple eigenvalue of associated to and therefore by Lemma 2 we can deduce that the model converges to a consensus as the center subspace of is spanned by . ∎
2.2.1 Connected undirected networks ( symmetric)
We now study a special case of when is irreducible by adding the assumption that is also symmetric. Here we are implicitly assuming that every direct interaction between individuals is mutual. We know from the last section that a consensus will be reached unconditionally, however in this case we will show that the opinion the agents convene on is the initial average of opinions. The average opinion is defined as:
[TABLE]
We first show that the average opinion is conserved by the consensus model in this case.
Lemma 3**.**
If is symmetric then the average opinion (9) is preserved by the consensus model.
Proof.
Using the symmetry , we find:
[TABLE]
∎
We can now show that the opinion the agents convene on is the average initial opinion and that the convergence is exponential with rate at least , the second smallest eigenvalue of , also known as the Fieldler number or algebraic connectivity of the graph .
Remark 2*.*
We will point out that we can impose a weaker condition than symmetry on and still maintain the conservation of the average opinion (and therefore convergence to a consensus that is the average opinion). Recall that if is not symmetric then the graph associated to is a directed graph. If we assume that the graph is balanced, i.e. that satisfies:
[TABLE]
then must be a left eigenvector of associated to zero as well (equivalently a right eigenvector of ). In this case the average opinion is conserved as:
[TABLE]
using (10).
However, in the following corollary we will maintain the assumption that is symmetric in order to guarantee that it can be diagonalized.
Corollary 1**.**
Suppose is irreducible and symmetric. Then the solution of the consensus model, , satisfies:
[TABLE]
with the average opinion (9). Moreover, where C depends only on the initial condition and is the second largest eigenvalue of .
Proof.
The consensus model is a linear system, therefore its solution is given by:
[TABLE]
where . Note that since is an eigenvector of corresponding to the eigenvalue [math], it is also an eigenvector of corresponding to the eigenvalue of . Therefore we may write:
[TABLE]
Also note that since is diagonal dominant and symmetric that there exists such that
[TABLE]
where and is the matrix composed of the eigenvectors of . Since is symmetric these eigenvalues are real. By Proposition 1 and Lemma 1 there is exactly one zero eigenvalue and is strictly positive for . We now define:
[TABLE]
this is the difference between the opinion at time t and the average opinion represented in the diagonal coordinate system. Notice that:
[TABLE]
Therefore, since this is an uncoupled system of linear equations we must have that:
[TABLE]
So for we must have that as exponentially with rate at least . To conclude, it remains to show that . Using that the eigenvectors of form an orthogonal basis, the entry is given by:
[TABLE]
since the mean value is preserved over time. ∎
2.3 Weakly connected and disconnected networks (arbitrary )
In this section we remove the assumption that is irreducible. In terms of the network associated to , this translates to assuming that the network on which the agents interact is merely weakly connected or disconnected. The case of weakly connected or disconnected graphs is more delicate. In this case we are implicitly assuming that individuals, or communities of individuals, may be isolated from influence from the network. Under these assumptions on a consensus might not always be reached. See for example the right plot in Figure 2. However we will show that the dynamics always converge and provide a condition on weaker then irreducibility that is equivalent to unconditional convergence to a consensus. Our main tool for studying the dynamics in this case is Lemma 2 and the decomposition of according to its strongly connected components (similar to the so-called Frobenius normal form).
Given a weakly connected or disconnected graph we may partition its set of vertices into strongly connected components. Two vertices and are in the same strongly connected component if there exists a directed path from to and vice versa (i.e. and ). If we treat each strongly connected component as a vertex of a new graph , and notice that this graph is necessarily directed acyclic, then by relabeling the vertices of in a way that agrees with the topological ordering of the vertices in we can represent in the following form (see Appendix A.2 and Figure 4).
[TABLE]
Each block, , on the diagonal is irreducible as they correspond to strongly connected components of . Notice that denotes the number of strongly connected components, thus when is irreducible we have . We give an example of such a relabeling in Figure 4.
Notice that by the definition of each block is diagonal dominant:
[TABLE]
where are the coefficients of the matrix . We can further distinguish between blocks.
Definition 6**.**
Assume is given in the Frobenius normal form (14). A block is called isolated if it satisfies:
[TABLE]
In other words, vertices from the block are only influenced by vertices in block :
[TABLE]
Note however that might influence (i.e. ). A non-isolated block is called absorbing if the vertices from block can only influence vertices in :
[TABLE]
Note again that might influence (i.e. ).
Notice that a block can be neither absorbing nor isolated. In the example from Figure 4, the matrix is composed of two isolated blocks and one absorbing block and has no blocks that are neither. Isolated blocks of correspond to isolated strongly connected components of the graph on which the agents interact. We now establish a correspondence between isolated blocks and zero eigenvectors of .
Proposition 2**.**
If has isolated blocks there exist at least linearly independent eigenvectors of associated to the zero eigenvalue.
Proof.
Assume that is an isolated block of and that is . Then, the vector of length is a right eigenvector of associated to the zero eigenvalue. Since is irreducible and is isolated we have by the Perron-Frobenius theorem that is the only zero right eigenvector of (see the proof of Lemma 1). Since for any matrix the collection of left and right eigenvalues as well as the dimensions of the corresponding eigenspaces are equal, there must exist a unique left eigenvector of associated to zero denoted . We then deduce a left eigenvector of in the form:
[TABLE]
where the first entry of is the th entry of . We could clearly repeat this argument for all isolated blocks to produce a collection of left eigenvectors of associated to zero which are necessarily linearly independent as they are all nonzero in disjoint coordinates. Therefore, since the dimensions of the left and right eigenspaces of are equal, if there are isolated blocks of there must be linearly independent right eigenvectors of associated to zero. ∎
So, if has isolated blocks then it has at least linearly independent zero eigenvectors. We will eventually show that these are the only zero eigenvectors of and deduce convergence by Lemma 2; as a first step we show that non isolated blocks of do not have a zero eigenvalue.
Proposition 3**.**
Suppose that is a non-isolated block. Then does not have a zero eigenvalue.
Proof.
Denote by the components of . Since is a non-isolated block, there exists such that:
[TABLE]
In particular, the constant vector cannot be a zero-eigenvector of . Suppose for the sake of contradiction that had a zero eigenvector, , and let be the largest entry of i.e. (we assume without loss of generality that ). Then we have:
[TABLE]
Since (for ) and , we must have if . In other words, the coefficients of the eigenvector are constant on the indices connected to :
[TABLE]
Iterating the argument, we deduce that if there exists a path joining to . Since is irreducible we deduce that for all . Therefore, . This is a contradiction as is not a zero eigenvector of .
∎
Combining propositions 2 and 3 leads to the following theorem.
Theorem 2**.**
The consensus model (1) converges (but not necessarily to a consensus).
Proof.
Using the decomposition of into strongly connected components (see eq. (14) and appendix A.2), we can write the spectrum of as:
[TABLE]
where corresponds to a strongly connected component. Only isolated blocks have zero eigenvalues and moreover has a zero eigenvalue of multiplicity equal to the number of isolated blocks (denoted ) by (19) and Propositions 3 and 2. Moreover, Proposition 2 states that there exist linearly independent eigenvectors of associated with the zero eigenvalue and therefore we deduce the convergence by Lemma 2.
∎
Remark 3*.*
In the proof of Lemma 2 (see appendix A.1) we find that we can bound the convergence of in terms of the spectral gap of , denoted , some and some :
[TABLE]
If is symmetric, is known as the algebraic connectivity of the graph and is a measure of how well connected is. In this context the bound (20) can be interpreted as stating that the consensus model converges faster if the network on which agents interact is better connected.
We can observe from Theorem 2 that the number of zero eigenvectors of is in correspondence with the number of isolated blocks of . This observation supplies us with a condition on that is equivalent to unconditional convergence to a consensus.
Corollary 2**.**
The consensus model converges to a consensus unconditionally if and only if has exactly one isolated block.
Proof.
The vector is always a zero eigenvector of . Since has only one isolated block, the eigenvalue [math] must be simple by Propositions 2 and 3. Therefore, is the only zero eigenvector of and therefore Lemma 2 implies that the dynamics converge to a consensus as the center subspace of is spanned by . ∎
Remark 4*.*
Recall that isolated blocks of correspond to isolated blocks of the graph on which the agents interact. We can interpret an isolated block as "leading" the rest of the network. Given a network with only one isolated block the consensus model implies that nodes in an isolated block receive no influence from nodes outside of the isolated block and since we know by Corollary 2 that the entire network reaches a consensus, we deduce that the consensus converged to by the whole network is the consensus converged to by agents in the isolated block. The network can fail to converge to a consensus if there are multiple isolated blocks that converge on different opinions (See the right plot in Figure 2) - this clearly depends on the initial opinions of the agents in the network. Thus, if a network has multiple isolated blocks the consensus model does not converge to a consensus unconditionally (although a consensus might still be reached if all isolated blocks happen to converge on the same opinion).
2.4 Consensus as a control problem
We now slightly shift our perspective to consider a question inspired by control theory. Given agents with initial opinions and a desired opinion that one would like every agent to converge on, we investigate if there is a way to connect the agents in a network so that under the consensus model we have:
[TABLE]
We can use our observations about how isolated blocks affect the behavior of the consensus model to answer this question. For ease of notation we denote:
[TABLE]
We also note that there is no way to connect the agents to achieve the desired result if (the convex hull of initial opinions) as the consensus model forces the convex hull of to decrease in diameter as time evolves.
Proposition 4**.**
Given and there exists such that:
[TABLE]
Proof.
For ease of notation, assume
[TABLE]
We first consider the two-agent network who’s evolution is given by
[TABLE]
We find that if we choose and such that
[TABLE]
we have
[TABLE]
So, given agents, if we connect agents 1 and 2 with the weights found above and the rest of the agents in a “chain”, i.e. we choose an of the form:
[TABLE]
then the network on which the agents interact will only have one isolated block consisting of agent 1 and agent 2. Our choice of weights guarantees that the opinions of agents 1 and 2 converge to and since they comprise the only isolated block, Corollary 2 guarantees that we must have that:
[TABLE]
as desired. ∎
3 Opinion dynamics: stochastic approach
3.1 Introduction
We now adopt another point of view to model opinion dynamics. Rather than changing their opinions gradually according to their neighbors, agents now “jump” their opinion to one of their neighbors. The weights now encode the probability of agent switching its opinion to agent . Mathematically, we will use a Poisson process of rate to model this event; opinion “jumps” to opinion at rate .
[TABLE]
The dynamics are now a continuous time Markov chain instead of a system of differential equations. We begin by formally defining the dynamics in terms of the stochastic process that describes the evolution of the opinion of agent .
Definition 7**.**
Consider agents with opinion , . Let be the entries of the adjacency matrix of the graph on which the agents interact. For each tuple with , we associate an independent Poisson process with rate . The stochastic consensus model (SCM) is defined as:
[TABLE]
Again, for the sake of clarity of presentation we restrict opinions to being one dimensional and we note that the stochastic consensus model can be easily extended for opinions in and that all results discussed will still hold analogous proofs. Notice that one can write the Poisson process as where is a Poisson process of rate . Each time a Poisson process increases by one unit, the opinion of agent , , “jumps” to . We illustrate these dynamics in Figure 5 on a graph with only three nodes and two links.
The stochastic consensus model can also be written in terms of its generator. For any pair , denote by the function:
[TABLE]
Let . For any smooth test function , we have:
[TABLE]
The stochastic model can also be more simply described through its associated embedded Markov chain. Instead of describing the continuous evolution of each opinion , we only record the evolution of each jump:
[TABLE]
where is the time of the jump. Notice that is an exponential random variable with frequency - the rate of any jump occurring:
[TABLE]
by the superposition property of independent Poisson processes. The evolution of is then given by a discrete Markov process described by the probability transition matrix:
[TABLE]
where is the rate of agent switching to the opinion of agent and given by (26). The generator of this Markov chain can be described as:
[TABLE]
We will now investigate the long-time behavior of these dynamics and analyze how the structure of the graph affects the convergence.
Remark 5*.*
Taking the expectation of the stochastic dynamics (23), we recover the deterministic model (1). Indeed, denote
[TABLE]
using that [31] and that is independent of and , we deduce:
[TABLE]
which corresponds to the dynamics (1). Thus, we can deduce the behavior of the deterministic model from the stochastic model using a simple Monte-Carlo method. Conversely, the deterministic model gives information about the average behavior of the stochastic model. We also notice that in the case that all agents are connected ( for all ), we recover a special case of the so-called “choose-the-leader” dynamics [8, 9].
3.2 Convergence (in probability)
Analogous to our discussion of the deterministic consensus model, we are going to prove that the stochastic model converges to a consensus unconditionally if the graph, , has only one isolated strongly connected component. However, the convergence will be much slower than the deterministic model. The key tool is to use the notions of absorbing states and absorbing Markov chains.
Definition 8** (Absorbing states and absorbing Markov chain).**
Let be a Markov chain on a state space . A state is called absorbing if – that is, the probability of staying in given that the chain has arrived in is 1. The Markov chain is called an absorbing Markov chain if for any starting state , the chain can reach an absorbing state in finitely many transitions with positive probability. A Markov chain is absorbed if it reaches an absorbing state.
For instance, in our case, is an absorbing state. We can find sufficient conditions on the network for the embedded Markov chain of the stochastic consensus model to be absorbing.
Theorem 3**.**
Assume that the adjacency matrix has only one isolated strongly connected component. Then the stochastic consensus model (SCM) converges with probability to a consensus.
Proof.
We are going to show the convergence of the embedded Markov chain from which we can easily deduce the convergence of the continuous process .
Notice first that for a given initial condition , the embedded Markov chain evolves in a finite space of size . Indeed, each component has to be one of the initial opinion components \big{(}{\bf S}_{i}^{0}\big{)}_{1\leq i\leq N}. Thus, if we can show that is an absorbing Markov chain, then will be absorbed with probability [Theorem 11.3, [18]].
Consider now any opinion in the isolated strongly connected component of the graph. By assumption on the adjacency matrix , there exists for any a path joining to (6). Thus, there is a strictly positive probability (i.e. where path joining to ) such that after steps. Iterating the argument for all , we find a non-zero probability that for all with finite (with a rough upper-bound of ). Thus, we conclude that is an absorbing Markov chain. ∎
Remark 6*.*
From Theorem 3, we can deduce a simple proof for corollary 2 concerning the convergence of the deterministic model. Indeed, for any and initial condition , there exists a time such that for any we have \mathbb{P}\Big{(}{\bf S}(t)\text{ is a consensus}\Big{)}\geq 1-\varepsilon. Taking the expectation of the stochastic process, , we deduce that converges to the expected consensus. However notice that we lose the rate of convergence as we did not analyze the eigenvalues of the dynamics.
3.3 Decay rate for undirected graphs
Theorem 3 gives a necessary and sufficient conditions for the unconditional convergence to consensus of the stochastic model. However, we do not have any information about the rate of convergence of the process. In the deterministic case, we have seen that the convergence is exponential with a explicit rate given in terms of the second largest eigenvalue of the matrix , . We would like to investigate if one could obtain a similar explicit convergence rate for the stochastic model. With this aim, we are going to investigate the empirical variance of the process:
[TABLE]
where is the empirical mean:
[TABLE]
We will need to assume that the interactions are symmetric, i.e. that is symmetric.
Lemma 4**.**
Suppose that the adjacency matrix is symmetric, then the empirical variance (31) decays according to
[TABLE]
Proof.
Using the generator of the stochastic process (25), we find:
[TABLE]
Moreover, notice the following equality:
[TABLE]
Indeed, when jumps to opinion (i.e. ), all the ’’ index become ’’ but the difference between opinion and is zero. We then deduce:
[TABLE]
Furthermore, exploiting the symmetry we get that
[TABLE]
Thus
[TABLE]
∎
We can now deduce a lower bound for the previous equality (33) in terms of , the second smallest eigenvalue of the Laplacian matrix and the algebraic connectivity of the graph .
Lemma 5**.**
Suppose that the Laplacian matrix of the network is symmetric and let be its second smallest eigenvalue. Then:
[TABLE]
with the variance (31).
Proof.
From the above, we note that
[TABLE]
where we use the symmetry of . Therefore we can write
[TABLE]
Notice also that the empirical variance can be written in two ways:
[TABLE]
where is the average opinion (32). Since , we also deduce:
[TABLE]
Decomposing the symmetric matrix as with , we deduce:
[TABLE]
with . Thus,
[TABLE]
Notice that the first component has to be zero as the first row of is for some .
[TABLE]
Since the matrix preserves the norm, we have:
[TABLE]
Thus
[TABLE]
∎
Using these tools we can now characterize the convergence rate of the stochastic consensus model on a symmetric network.
Theorem 4**.**
Assume is symmetric and connected. Denote its second eigenvalue. Then
[TABLE]
Proof.
From the previous lemmas we observe that
[TABLE]
Applying Gronwall’s lemma gives the result. ∎
Remark 7*.*
Thus we see that the expected value of the variance of the opinions decays at rate ; the dependence on the number of agents is naturally contrasted with the deterministic case where the decay rate is only , in other words, the deterministic dynamics converge much faster than the stochastic dynamics.
Recall that in the deterministic case studied earlier we found that in the case of a balanced graph that the consensus converged to is the initial average of opinions. We can make a related observation concerning the stochastic model by exploiting the fact that the average, , is a martingale. The following corollary also gives a succinct proof of the convergence of the stochastic dynamics in this case.
Corollary 3**.**
Suppose is a balanced directed graph (10) and strongly connected. Then with probability 1, the dynamics converge to a consensus opinion at time and the consensus satisfies .
Proof.
Let the average. Using eq. (25), we deduce:
[TABLE]
since the graph is balanced (see remark 2). Therefore is a martingale. Moreover, is bounded with at most states (finite) and therefore there exists a stopping time that is finite with probability : almost surely. Then, by the Optional Stopping Theorem,
[TABLE]
Since the graph is also strongly connected, we know by Theorem 3 that must be a consensus.
∎
4 Numerical simulations
We will now present some numerical experiments to further analyze more subtle features of both of the consensus models studied. We first examine explicitly how the algebraic connectivity of the interaction network affects the speed of convergence in the case of a symmetric - recall that in both the deterministic and stochastic case that we were able to bound the speed of convergence in terms of the algebraic connectivity. Then, we focus on the case where there are multiple isolated blocks as we have shown in this case that a consensus does not form unconditionally. We are interested in how the opinions of isolated blocks affect the distribution of opinions in absorbing components. To this end we will examine the case of a “battle” between two isolated blocks with opinions at the extreme ends of the possible spectrum of opinions. Each of these isolated blocks will have the same number of connections to a central absorbing component whose internal connections are all symmetric.
4.1 Algebraic connectivity and speed of convergence
Earlier in the discussion of the deterministic model we showed that we could characterize the convergence rate of the model in terms of , the eigenvalue of with the second smallest real part. In the case that was symmetric (an undirected network) we could say even more - that the rate of convergence of the dynamics was at least of of order . The second eigenvalue is known as the algebraic connectivity of the graph , it measures in a sense how well the graph is connected. In this interpretation, since bounds the speed of convergence, the dynamics converge faster if the graph is better connected. We illustrate with several examples. We give two examples of graphs with the same number of vertices () but different adjacency matrices. In Fig. 6–left, we present a graph where each vertex is connected with its neighbors. In Fig. 6-right, the graph is composed of two sub-graphs connected by only one link resulting in a weak connectivity. Indeed, computing the algebraic connectivity , we observe that the graph on the left has a higher value even though it has fewer edges.
To illustrate how this affects the speed of convergence, we perform two simulations of the dynamics starting from the same initial configuration but using the two graphs presented in figure 6. The results are given in Fig. 7. We observe that the right figure converges quickly to two “clusters” but then these clusters are very slow to move together. This is due to the weak connectivity of the graph. In Fig. 8, we estimate the variance (denoted in the stochastic dynamics) of the opinions over time :
[TABLE]
The variances are decaying to zero (indicating that a consensus is forming). As expected, the variance is decaying faster for the graph with the larger (left-figure). We also include the evolution of the stochastic dynamics on the graph in Fig. 6–right, from the same initial condition, in Fig. 8–left. Notice that in Fig. 8–right that the variance of the stochastic dynamics is the slowest to decay - this is an illustration of how the stochastic dynamics slow down the speed of convergence.
4.2 Outside of consensus - multiple isolated blocks
Here we examine numerically the effects of more then one isolated block on the long-term behavior of both models. We set up the initial opinions so that each isolated block is at each extreme of the possible spectrum of opinions and that opinions in the central component are randomly distributed uniformly among the possible opinions.
After running a simulation of the dynamics we find that the middle component now exhibits a “gradient” of the possible opinions. Nodes close to the blue isolated block in the sense of graph distance exhibit an opinion close to that of the isolated block, likewise for nodes near the red isolated block. Nodes in the middle of the component exhibit an opinion close to neutral. This shows that in the case of multiple isolated blocks that the graph distance of a node in an absorbing component from isolated blocks plays a role in determining the final opinion of that node.
We now examine the analogous situation for the stochastic model using the same initial condition. We have already seen in the theory of the model that the agents in the central component cannot converge to a limiting opinion distribution as there are multiple isolated blocks in the network. However, to get an idea of how the network evolves in the stochastic case we show the state at .
Notice that similar to the deterministic case that it appears that nodes closer to an isolated block in the graph sense are likely to have the same opinion as that isolated block. However, this could very well be an artifact of the realization shown. To see that this phenomenon does indeed occur in the stochastic case we recall that we have shown that while the dynamics do not converge in this case, they are the same as the deterministic dynamics in expected value.
If we take the mean of 10 realizations at we recover a gradient of opinions in the central component similar to what we found in the deterministic case. This indicates that indeed if a node is closer to an isolated block in the graph sense then it is more likely to have the opinion of that isolated block and serves to illustrate that the dynamics of the stochastic model coincide with the deterministic model in expected value.
5 Conclusion and future work
We have discussed consensus models in both a deterministic and stochastic setting. In both cases we have completely characterized the convergence of the model. In the deterministic case we find that the model always converges although not necessarily to a consensus; this contrasts with the stochastic case where we find that the model only converges almost surely if and only if the network has only one isolated block. We have also given conditions that guarantee unconditional convergence to a consensus - isolated blocks play a critical and identical role in both cases. We find that for both models unconditional convergence to a consensus is equivalent to the network on which the model is defined having only one isolated block. This similarity is explained by the fact that the deterministic dynamics can be recovered from the stochastic dynamics in expectation. Finally, we show in the deterministic case that the rate of convergence depends only on the algebraic connectivity of the network - the stronger the connectivity the faster the model converges. We find a similar result in the stochastic model in the case of symmetric networks however in this case we see that the rate of convergence is much slower as it also depends on the number of agents.
There are many ways in which this study may be extended. We hope to make our assumptions about agent-agent interactions more subtle through adding non-linearities to the model. First, in this paper we assume that the network is static; the connections between individuals do not change with respect to time. To further generalize the consensus model we could study its behavior on graphs in which the connections between individuals are evolving with time and depend on the current opinions of individuals. A consensus model assumes that if two individuals interact for a sufficient amount of time they will reach a consensus, however everyday experience tells us that this is not always (or even often) the case. These considerations motivate the study of the following class of models:
[TABLE]
is known as an influence function and dictates how the opinion of individual changes based on its current value and the opinion of those who that individual is connected with. We also note that the behavior exhibited by the consensus model is reminiscent to that of a transport-diffusion model in the case of “grid-network” (for example those studied in Section 4.2). We hope to characterize this behavior through passing to a continuous description of the graph to obtain in a certain limit a partial differential equation describing the dynamics. One possible avenue to this description would be through a kinetic description of the model combined with a coarsening approach [15, 20]. However, as it has been observed [8], propagation of chaos is not always applicable to these dynamics which could hinder this approach.
Empirical studies of models of collective behavior outside of physics have traditionally been difficult due to lack of data and complexity of the interacting agents. However, progress has been made in biological models [4, 7] and we believe that the advent of social media and the simplicity of the consensus model make an empirical study of consensus dynamics feasible. Ideally the graph of interaction will be constructed from a source of real-world networks; for example, the meta data associated with social media [16, 22]. A possible goal of such a study would be to confirm the phenomenon found in this study; connectivity in the graph is a main factor in determining how effectively influence spreads through a network.
Appendix A Appendix
A.1 Convergence of linear systems
Lemma**.**
Given a linear system defined by:
[TABLE]
Assume has distinct eigenvalues and a zero eigenvalue of multiplicity with linearly independent associated eigenvectors. If for all with we have that then
[TABLE]
*where is in the center subspace of , . *
Proof.
For ease of notation we will write
[TABLE]
where is the algebraic multiplicity of . We will also write that . We know that given that we may find linearly independent generalized eigenvectors of , . The generalized eigenspace corresponding to is given by and by the generalized eigenspace decomposition theorem we can find a basis of consisting of generalized eigenvectors of . Therefore, we may write:
[TABLE]
We know by the fundamental theorem of linear systems that the solution to the system is given by:
[TABLE]
By (37) we may write:
[TABLE]
where for each we have . Notice that since we are given that there are distinct eigenvectors associated with that for each we have that . Consequently we have that which implies that for each . Taking the matrix exponential of both sides yields
[TABLE]
Therefore we must have that
[TABLE]
We now claim that
[TABLE]
for each . Since we are writing with respect to the basis of generalized eigenvectors of A we have that
[TABLE]
where N is nilpotent. Consequently
[TABLE]
with . Therefore, since and every coordinate of is polynomial in we must have that there exist and such that
[TABLE]
So we have that
[TABLE]
which implies
[TABLE]
As by definition we have that as desired. ∎
A.2 Decomposition into strongly connected components
Lemma**.**
If L is the Laplacian of a directed graph then by relabeling vertices L can be represented:
[TABLE]
Proof.
We may partition the vertex set of G, , into its strongly connected components , so that:
[TABLE]
If we consider the set as the vertex set of a new graph with edge set given by:
[TABLE]
Then the graph is a directed acyclic graph as has been partitioned into strongly connected components; if a cycle existed all vertices included in the cycle would represent the same strongly connected component of by (39) contradicting the partition of into strongly connected components. Therefore there exists a topological ordering on the vertex set of . This ordering is given by:
[TABLE]
We only need to label the vertices of in such a way that they respect the topological ordering. Then, this labeling of produces in the desired form. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] D. Aldous. Interacting particle systems as stochastic social dynamics. Bernoulli , 19(4):1122–1149, 2013.
- 2[2] D. Aldous and J. Fill. Reversible Markov chains and random walks on graphs . Berkeley, 2002.
- 3[3] A. Aydoğdu, M. Caponigro, S. Mc Quade, B. Piccoli, N. Pouradier Duteil, F. Rossi, and E. Trélat. Interaction Network, State Space, and Control in Social Dynamics. In N. Bellomo, P. Degond, and E. Tadmor, editors, Active Particles, Volume 1 : Advances in Theory, Models, and Applications , Modeling and Simulation in Science, Engineering and Technology, pages 99–140. Springer International Publishing, Cham, 2017.
- 4[4] M. Ballerini, N. Cabibbo, R. Candelier, A. Cavagna, E. Cisbani, I. Giardina, A. Orlandi, G. Parisi, A. Procaccini, M. Viale, and V. Zdravkovic. Empirical investigation of starling flocks: a benchmark study in collective animal behaviour. Animal Behaviour , 76(1):201–215, July 2008.
- 5[5] E. Ben-Naim, P. L. Krapivsky, F. Vazquez, and S. Redner. Unity and discord in opinion dynamics. Physica A: Statistical Mechanics and its Applications , 330(1):99–106, 2003.
- 6[6] V. Blondel, J. Hendrickx, and J. Tsitsiklis. On Krause’s multi-agent consensus model with state-dependent connectivity. IEEE transactions on Automatic Control , 54(11):2586–2597, 2009.
- 7[7] J. Buhl, D. J. T. Sumpter, I. D. Couzin, J. J. Hale, E. Despland, E. R. Miller, and S. J. Simpson. From Disorder to Order in Marching Locusts. Science , 312(5778):1402–1406, June 2006.
- 8[8] E. Carlen, R. Chatelin, P. Degond, and B. Wennberg. Kinetic hierarchy and propagation of chaos in biological swarm models. Physica D: Nonlinear Phenomena , 260:90–111, 2013.
