Quantifying the impact of network structure on speed and accuracy in   collective decision-making

Bryan C. Daniels; Pawel Romanczuk

arXiv:1903.09710·q-bio.NC·March 26, 2019

Quantifying the impact of network structure on speed and accuracy in collective decision-making

Bryan C. Daniels, Pawel Romanczuk

PDF

TL;DR

This paper investigates how network structure influences the speed and accuracy of binary decision-making in collective systems, revealing key spectral properties that predict performance and exploring effects of hierarchical topology.

Contribution

It introduces spectral measures like eigenvalues and participation ratios as predictors of decision accuracy and analyzes hierarchical network effects on collective computation.

Findings

01

Decision accuracy is mainly influenced by spectral properties of the network.

02

Eigenvalues and participation ratios predict performance scaling in large networks.

03

Hierarchical structures like rich clubs affect localization and decision dynamics.

Abstract

Found in varied contexts from neurons to ants to fish, binary decision-making is one of the simplest forms of collective computation. In this process, information collected by individuals about an uncertain environment is accumulated to guide behavior at the aggregate scale. We study binary decision-making dynamics in networks responding to inputs with small signal-to-noise ratios, looking for quantitative measures of collectivity that control decision-making performance. We find that decision accuracy is controlled largely by three factors: the leading eigenvalue of the network adjacency matrix, the corresponding eigenvector's participation ratio, and distance from the corresponding symmetry-breaking bifurcation. This allows us to predict how decision-making performance scales in large networks based on their spectral properties. Specifically, we explore the effects of localization…

Equations51

\frac{d s _{i}}{d t} = - \frac{s _{i}}{τ} + \frac{μ}{τ} j \sum A_{ij} tanh (s_{j}) + \frac{I}{τ} + ξ,

\frac{d s _{i}}{d t} = - \frac{s _{i}}{τ} + \frac{μ}{τ} j \sum A_{ij} tanh (s_{j}) + \frac{I}{τ} + ξ,

τ \frac{d δ s}{d t} = - δ s + μ A δ s = (μ A - I) δ s,

τ \frac{d δ s}{d t} = - δ s + μ A δ s = (μ A - I) δ s,

τ \frac{d δ e ^ _{λ}}{d t} = (μ λ - 1) δ \overset{e}{^}_{λ} .

τ \frac{d δ e ^ _{λ}}{d t} = (μ λ - 1) δ \overset{e}{^}_{λ} .

μ_{c} = λ_{c}^{- 1} .

μ_{c} = λ_{c}^{- 1} .

∣∣ s^{*} - s_{0} ∣∣ \approx 3Δ μ λ_{c} p = 3 \overset{μ}{ˉ} p,

∣∣ s^{*} - s_{0} ∣∣ \approx 3Δ μ λ_{c} p = 3 \overset{μ}{ˉ} p,

t_{D}^{max} \propto p / σ;

t_{D}^{max} \propto p / σ;

\frac{d ν}{d t} = a ν + b \frac{ν ^{3}}{6} .

\frac{d ν}{d t} = a ν + b \frac{ν ^{3}}{6} .

ν^{*} = \pm 6 a /∣ b ∣ .

ν^{*} = \pm 6 a /∣ b ∣ .

a =

a =

b =

ν^{*} = \pm 3 p \frac{μ - μ _{c}}{μ} = \pm 3 \overset{μ}{ˉ} p + O (Δ μ^{3/2}),

ν^{*} = \pm 3 p \frac{μ - μ _{c}}{μ} = \pm 3 \overset{μ}{ˉ} p + O (Δ μ^{3/2}),

ν^{*} \approx 3 \overset{μ}{ˉ} \frac{λ _{c}}{( A ^{T} \cdot e ^ _{c} ) \cdot ( e ^ _{c} ) ^{3}},

ν^{*} \approx 3 \overset{μ}{ˉ} \frac{λ _{c}}{( A ^{T} \cdot e ^ _{c} ) \cdot ( e ^ _{c} ) ^{3}},

d ⟨ ν_{1} (t)⟩ / d t ∣_{t = t_{cross}} = d ⟨ ν_{2} (t)⟩ / d t ∣_{t = t_{cross}} .

d ⟨ ν_{1} (t)⟩ / d t ∣_{t = t_{cross}} = d ⟨ ν_{2} (t)⟩ / d t ∣_{t = t_{cross}} .

t_{D} (\overset{μ}{ˉ}) = {\frac{3 p μ ˉ}{4 σ ^{2}} τ \frac{τ}{2 μ ˉ} (2 y + lo g \frac{3 p μ ˉ ^{2}}{4 y σ ^{2}}) \overset{μ}{ˉ} < \overset{μ}{ˉ}_{cross}, \overset{μ}{ˉ} \geq \overset{μ}{ˉ}_{cross},

t_{D} (\overset{μ}{ˉ}) = {\frac{3 p μ ˉ}{4 σ ^{2}} τ \frac{τ}{2 μ ˉ} (2 y + lo g \frac{3 p μ ˉ ^{2}}{4 y σ ^{2}}) \overset{μ}{ˉ} < \overset{μ}{ˉ}_{cross}, \overset{μ}{ˉ} \geq \overset{μ}{ˉ}_{cross},

t_{D}^{max} = \frac{τ z 3 p}{2 σ},

t_{D}^{max} = \frac{τ z 3 p}{2 σ},

\frac{d s _{i}}{d t} = - \frac{s}{τ} + \frac{μ ~}{τ} ⟨ tanh s ⟩ + ξ

\frac{d s _{i}}{d t} = - \frac{s}{τ} + \frac{μ ~}{τ} ⟨ tanh s ⟩ + ξ

\frac{d s _{i}}{d t} = - \frac{s}{τ} + \frac{μ ~}{τ} (⟨ s ⟩ - \frac{1}{3} ⟨ s^{3} ⟩) + ξ

\frac{d s _{i}}{d t} = - \frac{s}{τ} + \frac{μ ~}{τ} (⟨ s ⟩ - \frac{1}{3} ⟨ s^{3} ⟩) + ξ

τ \partial_{t} p (s, t) = - \partial_{s} {[- s + \tilde{μ} (⟨ s ⟩ - \frac{1}{3} ⟨ s^{3} ⟩)] p (s, t)} + \frac{σ ^{2}}{2} \partial_{s}^{2} p (s, t)

τ \partial_{t} p (s, t) = - \partial_{s} {[- s + \tilde{μ} (⟨ s ⟩ - \frac{1}{3} ⟨ s^{3} ⟩)] p (s, t)} + \frac{σ ^{2}}{2} \partial_{s}^{2} p (s, t)

\partial_{t} ⟨ s^{n} ⟩ = \partial_{t} \int_{- \infty}^{\infty} s^{n} p (s, t) d s = \int_{- \infty}^{\infty} s^{n} \partial_{t} p (s, t) d s

\partial_{t} ⟨ s^{n} ⟩ = \partial_{t} \int_{- \infty}^{\infty} s^{n} p (s, t) d s = \int_{- \infty}^{\infty} s^{n} \partial_{t} p (s, t) d s

⟨ s ⟩

⟨ s ⟩

⟨ s^{2} ⟩

⟨ s^{3} ⟩

\partial_{t} u

\partial_{t} u

\partial_{t} T

T^{*} = \frac{σ ^{2}}{2}

T^{*} = \frac{σ ^{2}}{2}

u_{1}^{*}

u_{1}^{*}

u_{2, 3}^{*}

\tilde{μ}_{c} = \frac{1}{1 - \frac{σ ^{2}}{2}} \approx 1 + \frac{σ ^{2}}{2}

\tilde{μ}_{c} = \frac{1}{1 - \frac{σ ^{2}}{2}} \approx 1 + \frac{σ ^{2}}{2}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Quantifying the impact of network structure on speed and accuracy in collective decision-making

Bryan C. Daniels

ASU–SFI Center for Biosocial Complex Systems, Arizona State University, Tempe, Arizona, USA

Pawel Romanczuk

Institute for Theoretical Biology, Department of Biology, Humboldt Universität zu Berlin, Germany

Bernstein Center for Computational Neuroscience, Berlin, Germany

Abstract

Found in varied contexts from neurons to ants to fish, binary decision-making is one of the simplest forms of collective computation. In this process, information collected by individuals about an uncertain environment is accumulated to guide behavior at the aggregate scale. We study binary decision-making dynamics in networks responding to inputs with small signal-to-noise ratios, looking for quantitative measures of collectivity that control decision-making performance. We find that decision accuracy is controlled largely by three factors: the leading eigenvalue of the network adjacency matrix, the corresponding eigenvector’s participation ratio, and distance from the corresponding symmetry-breaking bifurcation. This allows us to predict how decision-making performance scales in large networks based on their spectral properties. Specifically, we explore the effects of localization caused by the hierarchical assortative structure of a “rich club” topology. This gives insight into the tradeoffs involved in the higher-order structure found in living networks performing collective computations.

Keywords: collective computation, neural networks, symmetry breaking transition, stochastic dynamical systems, rich club

Introduction

Collective intelligence refers to the ability of groups of individual components to process environmental information and successfully perform adaptive functions at a larger collective scale. Building a coherent framework for understanding distributed functionality is challenging in that the internal structure of natural and engineered collectives varies strongly, from quasi-homogeneous systems like swarms of identical robots, to fish-schools consisting of similarly behaving individuals but with persistent behavioral differences (“personalities”) [1, 2] or different prior information [3, 4], to strongly heterogeneous and hierarchical systems like primate societies [5, 6] or neurons in a brain [7, 8]. Facing this diversity, a key challenge for building a better abstract understanding of collective intelligence is to determine which details of such systems are most important to collective function and which are incidental and can be ignored. In this way, we are searching for measures that usefully quantify “collectivity” across a broad continuum of complex systems.

In addition to diversity in heterogeneity and communication structure, myriad types of functions may be implemented in a collective system, ranging in complexity from simple majority consensus to high-level abstract information processing. Here we focus on a particularly simple function—making a correct binary decision about the sign of a noisy distributed input—and look for network statistics that delineate the full range of strategies that can be used to successfully perform this collective function.

Past experimental investigations of collective decision-making have mostly not addressed network structure, instead assuming all-to-all coupling and focusing on optimal rules for aggregating decisions made by individuals [9, 10, 11, 12, 13]. However, an increasing number of studies are beginning to investigate non-trivial network structures [14, 15, 16, 17]. For example, Kearns et al [14] look at the effect of varying network structure in consensus formation in human groups. They find, e.g., that “preferential attachment” networks lead to faster consensus than Erdos–Renyi.

Theoretically, many examples of collective decision-making can be effectively described using networks of coupled dynamical components. Structural properties of such networks and how they affect self-organization and collective behavior have long been a focus of complex systems research (see e.g. [18, 19, 20, 21]). Particularly well studied are effects of network structure on emergent dynamics in the context of synchronization and consensus formation [18, 19]. Theoretical research often aims to map the phase diagram of system dynamics as a function of underlying network structure parameters, for example to identify regions of synchronized versus random dynamics (see e.g. [19] and references therein). This language of phase diagrams, originating in statistical physics, has also been used to hypothesize that in order to ensure optimal information processing, collective systems should operate near phase transitions (critical manifolds) [22, 23]. 111In some large $N$ limits, hierarchical modular networks can have infinitely many localized modes corresponding to critical coupling strength values over a continuous range—this produces a so-called “Griffiths phase” [24]. We do not focus on this here because we anticipate our methods will be most useful applied to known finite networks.

Corresponding theoretical insights have driven the systematic analysis of structural properties of artificial and real-world networks, including node and degree heterogeneity [25] and structural hierarchies [26, 27]. In particular, many real-world collective systems exhibit a “rich club” (core-periphery) structure, with examples coming from neuroscience [28, 7, 29, 30, 8], social science, and biochemistry [12, 31]. The rich club refers to a subset of nodes that a) have a larger (in-)degree and b) are more likely to be connected to other rich club nodes than in an otherwise random wiring. It has been argued that such a core-periphery topology may play an important role for the function of complex information processing systems (see e.g. [7, 32, 30]).

The dynamical effects of network structure have been explored largely in the context of synchronization or consensus, the problem of collective agreement. Extending to the problem of decision-making also requires a notion of correctness: we want a system that not only produces collective agreement on any consensus state, but on the correct state, given a source of input information. Binary collective decision-making in this sense maps naturally onto “noisy integrator” models, such as leaky integration to bound (Ornstein–Ullenbeck [33]) and related models with stable attractors representing decision states [34, 35]. A general constraint for any such decision-making system is the tradeoff between speed and accuracy [36, 37, 33]. Recently, it has been shown that the speed–accuracy tradeoff in simple collective decision models can be quantified in terms of distance from a bifurcation [35, 38].

Motivated by the above findings, we focus in this work on the question of how a rich-club structure affects the speed and accuracy of collective decision-making. In particular, we look for network statistics that capture the most important properties controlling collective performance in decision dynamics.

Results

Collective decision-making model

A simple minimal model of distributed decision-making defines dynamics for the internal noisy states of individual components, each of which receives the same input signal $I$ , recovers to a null state on a timescale $\tau$ , and is affected by its neighbors through a saturating function of its neighbors’ states [35, 38]:

[TABLE]

where $I$ is an input signal given uniformly to every node and $\xi$ is uncorrelated Gaussian noise with $\langle\xi(t)\xi(t+\Delta t)\rangle=\sigma^{2}\tau^{-1}\delta(\Delta t)$ . We explicitly write the differential equations in terms of an overall timescale $\tau$ ; in describing neural dynamics, for example, we expect $\tau$ to be on the order of tens of milliseconds.

We initialize the system in a state $\vec{s}_{0}$ that, in the case of zero noise, corresponds to a fixed point undergoing a pitchfork bifurcation as a function of the coupling strength $\mu$ . This bifurcation separates the case of a single stable fixed point at $\vec{s}_{0}$ and the case of two distinct stable fixed points at $\vec{s}_{0}\pm\epsilon\hat{e}_{c}$ , which we treat as decision states (where $\hat{e}_{c}$ is the unit vector pointing in the direction in which the decision states emerge from $\vec{s}_{0}$ at the bifurcation). We focus here on the simplest such bifurcation,222 Tuning a second control parameter can locate more general pitchfork bifurcations; see [38].

which occurs at $\vec{s}_{0}=\vec{0}$ .

To test how the existence of higher-order structure changes the decision-making performance, we vary the adjacency matrix $A$ to test symmetric networks with fixed size $N$ and total number of edges, changing only the degree distribution and higher-order structure (Figure 1). First, a “rich club” network is created through a random generation of a fixed number of edges from the $N(N-1)$ possible edges, where edges between $N_{\mathrm{rich}}$ core nodes are biased to be more likely to appear by a factor $b_{\mathrm{rich}}$ . A corresponding network that has exactly the same degree distribution but no rich club is then created by randomly swapping existing edges. Finally, we test an Erdos–Renyi variant in which all possible edges are equally likely. We use the same three specific example networks shown in Figure 1 in simulations throughout the paper; results are qualitatively similar for other networks sampled from each ensemble.

Speed–accuracy tradeoff

We test each network in its ability to integrate information about a signal that is small compared to the noise and then retain that information after the signal is removed. We define accuracy in terms of whether the system ends the simulation near the correct decision state, the one that lies in the direction of the input $+I$ (instead of $-I$ ). Specifically, we test whether the sign of the final state along the unstable dimension, $(\vec{s}_{\mathrm{final}}-\vec{s}_{0})\cdot\hat{e}_{c}$ , is the same as the sign of the input along that dimension, $I\vec{1}\cdot\hat{e}_{c}$ .

As in previous studies using a similar model [35, 38] (and across a wide variety of systems in general [39, 36, 37]), we expect to find a speed–accuracy tradeoff: Slow dynamics should produce better accuracy, as the system is able to integrate the input over a longer time before fixating within a single decision state, whereas fast dynamics produce a decision based primarily on noise.

To quantify the speed of the decision, we measure a characteristic time $t_{D}$ over which the system approaches the final decision fixed point. We first define the two decision states $\vec{s}_{\pm}^{*}$ as the stable fixed points of the dynamics in Eq. (1) with zero input and zero noise.333 In the case here with $\vec{s}_{0}=\vec{0}$ , the two decision states are related simply by an inversion symmetry: $\vec{s}_{+}^{*}=-\vec{s}_{-}^{*}$ . Sufficiently close to the bifurcation, we expect analogous results for the speed–accuracy tradeoff even in more complicated cases where this symmetry does not hold [38]. We then define the decision timescale $t_{D}$ as the first time the state $\vec{s}$ reaches halfway to the decision state $\vec{s}^{*}$ along the dimension $\hat{s}^{*}=(\vec{s}^{*}-\vec{s}_{0})/|\vec{s}^{*}-\vec{s}_{0}|$ .

As expected, our simulations show a speed–accuracy tradeoff as we vary the overall connection strength $\mu$ , shown in Figure 2. When tuned to a given decision timescale, the accuracy is largely unaffected by network structure. The highest accuracy is observed when the system supports long timescale dynamics.

Given that performance is largely controlled by the decision timescale $t_{D}$ , we would like to understand how the network structure, defined by the adjacency matrix $A$ , controls $t_{D}$ . We expect that the largest timescales will occur near the symmetry-breaking transition that creates the two decision states.

Locating the transition and decision states

First, we must locate the relevant pitchfork bifurcation, which controls the transition between dynamics in which node states are not correlated over long times (when $\mu$ is small and interactions between nodes are weak) into dynamics in which a nodes can collectively store a long-term memory (when $\mu$ is large enough that interactions support a self-reinforcing consensus state). With zero input and zero noise, it is straightforward to find this transition, by analyzing how a small perturbation $\delta\vec{s}$ to the initial state $\vec{s}_{0}$ changes under the dynamics:

[TABLE]

where $\mathcal{I}$ is the identity matrix. Then the behavior is most easily analyzed in the basis of eigenvectors of $A$ : the dynamics along eigenvector $\hat{e}_{\lambda}$ are given by

[TABLE]

Thus the initial state $\vec{s}_{0}$ will be stable until $\mu$ becomes large enough to make $(\mu\lambda-1)$ positive. In other words, as we expect from basic linear stability analysis [40], the critical value of $\mu$ at which $\vec{s}_{0}$ first becomes unstable is controlled by the largest eigenvalue $\lambda_{c}$ of $A$ :

[TABLE]

The symmetry between positive and negative values of $s$ means that this is a pitchfork bifurcation, and two stable fixed points emerge from the unstable fixed point along the dimension of the eigenvector corresponding to $\lambda_{c}$ .

The distance between each stable fixed point (decision state) and the unstable starting point $\vec{s}_{0}$ grows as a function of $\mu$ : Near the transition [see Appendix Eq. (8)],

[TABLE]

where $\Delta\mu=\mu-\mu_{c}$ , $\bar{\mu}=\Delta\mu/\mu_{c}$ is the reduced distance from the transition, and $p=1/|(\hat{e}_{c})^{4}|$ characterizes the distributedness of the eigenvector $\hat{e}_{c}$ corresponding to the leading eigenvalue $\lambda_{c}$ .

The value $p$ sets the scale of the distance between the collective decision states, and it corresponds roughly to the number of individual nodes contributing to the mode (see the bottom row of Figure 1). In general, $p$ varies between 1 for a completely localized mode and $N$ for a completely delocalized mode (produced, for example, by homogeneous all-to-all coupling). The inverse of $p$ appears in studies of localization in random matrix theory, where it has been called the “inverse participation ratio” [41, 42]; we therefore call $p$ the participation ratio.

Figure 3 compares this zero-noise local approximation to the zero-noise numerical solution for the fixed point $\vec{s}^{*}$ and to the final state of the simulation including noise.444 Noise also affects the location of the transition. In our simulations here we use a small noise parameter $\sigma$ (with input signal $I$ even smaller to achieve a small signal-to-noise ratio); this means the effect of noise on the transition location is minimal on the scales we test. We calculate the lowest-order correction to $\mu_{c}$ in the Appendix and find that it is on the order of $10^{-3}$ for the plotted cases. For each network, the transition occurs at the expected $\mu_{c}$ and with the expected local dependence on $\Delta\mu$ . Because the largest timescales also occur near the transition, this local analysis will allow us to approximate the maximal decision timescale in the next section.555 Note that, in the rich club network, increasing the coupling beyond the scale shown in Figure 3 can also create bistability in the peripheral nodes. These cases have four stable fixed points, two of which correspond to the core and periphery nodes coming to consensus on conflicting decisions, and two in which core and periphery disagree. In our current setup, these cases do not change our analysis because the core always decides first, biasing the remainder of the system.

Predicting the timescale of the decision

In the absence of noise, the timescale of the decision is expected to diverge at the transition, because the fixed point at the origin becomes marginally stable. With noise, this is smoothed out in a predictable way, leading to a simple equation for the timescale derived in the Appendix, Eq. (13). Roughly, the timescale is determined by a combination of the distance between the two decision states (proportional to $\sqrt{\Delta\mu}$ ), the characteristic timescale of exponential growth away from the unstable fixed point (proportional to $\Delta\mu^{-1}$ ), and the characteristic speed of motion due solely to noise (proportional to $\sigma$ ).

In Figure 4, we demonstrate that the decision timescale is well-approximated by Eq. (13). As we saw before in Figure 2, longer timescales correspond to better accuracy. Further, this analysis allows us to predict the maximal decision timescale supported by a given network under a given level of noise $\sigma$ ; we find [see Appendix Eq. (14)]

[TABLE]

that is, the timescale supported by the collective mode scales with the square root of the participation ratio. Consequently, due to the fundamental speed–accuracy tradeoff, $p$ becomes a useful quantification of higher order network structure that sets a limit on collective decision accuracy.

Discussion

We study here a collective decision process that relies on the phenomenon of critical slowing down, a mechanism for creating long-timescale dynamics. In the case of small signal-to-noise ratio, decision accuracy is limited by the timescale of collective dynamics, and the system must be tuned near a symmetry-breaking bifurcation to successfully integrate information into an accurate decision. Varying the distance from the bifurcation $\Delta\mu$ traces out a speed–accuracy tradeoff (Figure 2).

In the spirit of quantifying collectivity, our aim is to characterize the aspects of network connectivity that control this timescale and therefore place limits on collective decision accuracy. We find that the most important factors characterize the normal mode of the network that is least stable. This is the mode that first becomes unstable as interaction strengths are increased, thereby leading to bistability that encodes a binary decision. First, the leading eigenvalue $\lambda_{c}$ , effectively a measure of the connectivity of individuals participating in the mode, sets the scale of the critical coupling $\mu_{c}$ required for reinforcing the decision state. Second, the participation ratio $p$ of the corresponding eigenvector is a measure of the number of individuals participating in the mode.

Our main result is to identify $\lambda_{c}$ and $p$ as important measures for quantifying collective behavior in heterogeneous networks. Given any detailed network structure, these two simple statistics encapsulate the network’s ability to create long-timescale dynamics. This allows us to predict how collective timescales behave across a variety of network structures. For instance, the maximal decision timescale that can be produced by the critical slowing mechanism increases in a predictable way as more individual nodes are allowed to participate in the unstable mode, scaling as $\sqrt{p}$ . This fits with our rough intuition, as we expect that the effects of noise will shrink in a group of $N$ individuals as $1/\sqrt{N}$ .

Generally, our results resonate with recent studies that focus on low-dimensional collective modes controlling the most important aspects of distributed computation in biological networks [43, 44]. The importance of the principal eigenvalue and corresponding (inverse) participation ratio hints at connections between our model of decision-making and related characterizations of disease spreading [45], correlations in financial data [41], and Anderson localization in condensed matter physics [46].

Our motivation began with understanding the functional consequences of rich-club structure and criticality in the brain. These results allow us to speculate about fundamental tradeoffs: What are potential advantages and disadvantages to hierarchical rich-club structure? On the one hand, more distributed connectivity may be advantageous in that it leads to more distributed collective modes, longer timescales, and therefore better averages over the noisy knowledge of individuals. On the other hand, the localized modes created by a rich club structure could be advantageous for modularized function and localized control. In this way, the rich club could be a way to bring only a subset of the system supercritical, with consequently reduced noise-reduction benefits of collectivity.

We expect this framework and intuition to be useful in systems in which the interaction structure remains fixed over the timescale of a single decision process, but may vary over longer adaptive timescales. Besides neural dynamics, such a framework may be useful for describing genetic regulatory networks producing cell fate decisions during development [47], social networks producing consensus about dominance hierarchies [48], and networks of influence underlying decisions by political bodies [49, 50]. In such systems, the computation of decisions happens on relatively fast timescales, while on longer adaptive timescales, there may be tuning of the network that could change the relevant parameters $\lambda_{c}$ and $p$ .

To guide our intuition, our analysis focused on the simplest symmetry-breaking bifurcation, where the initial state of each individual is the same ( $\vec{s}_{0}=\vec{0}$ ). It will be useful in future work to focus on more complicated transitions (as explored in [38]), where we expect that differing states and therefore saturations across individuals will modify the calculation, perhaps leading to a generalized form of the participation ratio that weights individuals by their contributions.

Acknowledgments

PR acknowledges funding by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy – EXC 2002/1 "Science of Intelligence" – project number 390523135, as well as through the Emmy Noether program, project number RO4766/2-1.

Appendix

Derivation of distance between stable fixed points

The normal form of a system undergoing a pitchfork bifurcation is

[TABLE]

In a one-dimensional system with state $x$ and dynamics $dx/dt=F(x)$ that has a pitchfork bifurcation at $x=x_{0}$ , the system is described by Eq. (7) near $x_{0}$ , with $\nu=x-x_{0}$ , $a=dF(x)/dx|_{x=x_{0}}$ , and $b=d^{3}F(x)/dx^{3}|_{x=x_{0}}$ . This is the Taylor series of $F(x)$ at $x=x_{0}$ up to third order, where the second-order term disappears due to the symmetry that is required for a pitchfork bifurcation: $F(x_{0}+\delta)=-F(x_{0}-\delta)$ near $\delta=0$ . The bifurcation happens when $a$ changes sign. We focus here on the case that creates two stable fixed points (decision states), which coincides with $b<0$ .

Solving Eq. (7) for $d\nu/dt=0$ , we find one fixed point at $\nu^{*}=\nu_{0}$ that changes from stable when $a<0$ to unstable when $a>0$ , and two stable fixed points when $a>0$ at

[TABLE]

For example, in a simple one-dimensional case where $F(x)=-x+\mu\tanh{x}$ , we have $\nu=x$ , $x_{0}=0$ , $a=\mu-1$ , and $b=-2\mu$ . Inserting into Eq. (8), we find, for small $\Delta\mu\equiv\mu-1$ , $\nu^{*}\approx\pm\sqrt{3\Delta\mu}$ .

In the higher-dimensional context of Eq. (1), $\nu$ becomes the linear combination of state $\vec{s}$ along the dimension of the least-stable dimension $\hat{e}_{c}$ : $\nu=\vec{s}\cdot\hat{e}_{c}$ . Then, to produce the Taylor series corresponding to Eq. (7), we take the relevant directional derivatives of the right-hand side of Eq. (1). Calling the zero-noise, zero-input part of the dynamics $\vec{F}$ [that is, $F_{i}(\vec{s})=-s_{i}+\mu\sum_{j}A_{ij}\tanh(s_{j})$ ], we have

[TABLE]

Inserting this into Eq. (8) produces

[TABLE]

where $\mu_{c}=1/\lambda_{c}$ , $p=1/\sum_{i}(\hat{e}_{c})_{i}^{4}$ , $\Delta\mu=\mu-\mu_{c}$ , and $\bar{\mu}=\Delta\mu/\mu_{c}$ .

We note that the above result assumes that the adjacency matrix $A$ is symmetric. In the asymmetric case,

[TABLE]

where $\hat{e}_{c}$ is the normalized eigenvector of $A$ corresponding to $\lambda_{c}$ .

Derivation of approximate decision timescale $t_{D}$

We define the decision timescale $t_{D}$ as the time for $\nu=|\vec{s}\cdot\hat{s}^{*}|$ to reach halfway to the fixed point $\nu^{*}$ . To approximate $t_{D}$ in the presence of noise, we patch together two types of behavior. First, for sufficiently small times $t$ , we expect the average behavior along $\hat{e}_{c}$ to be dominated by noise [the $\xi$ term dominates in Eq. (1)]. Noise dominates here because the system is still close to the fixed point at the origin, where, at the bifurcation, the first two terms cancel up to second order in $\nu$ . Neglecting all terms other than the noise term, the average behavior is given by $\langle\nu_{1}(t)\rangle=\sigma\sqrt{t/\tau}$ . Then, after a crossing time $t_{\mathrm{cross}}$ , the first two terms dominate and noise becomes unimportant. Now neglecting the noise term, given an initial condition of $\nu_{0}=\langle\nu_{1}(t_{\mathrm{cross}})\rangle$ , and considering for simplicity only the lowest-order approximation of $F$ near the unstable fixed point, the state simply grows exponentially: $\langle\nu_{2}(t)\rangle=\nu_{0}\exp(t-t_{\mathrm{cross}})\bar{\mu}/\tau$ .

We patch these two solutions together by defining $t_{\mathrm{cross}}$ as the time when their derivative matches:

[TABLE]

Solving this produces $t_{\mathrm{cross}}=y\tau/\bar{\mu}$ , where $y\approx 0.3517$ is the solution to $2\exp y=y^{-1}$ . Finally, we solve for $t_{D}$ , the time to reach $\nu^{*}/2$ , as a function of the reduced distance from the transition $\bar{\mu}$ :

[TABLE]

where $\bar{\mu}_{\mathrm{cross}}=2\sigma\sqrt{y/3p}$ . This approximation of the decision timescale is plotted in Figure 4 as dashed and solid lines (dashed for $\bar{\mu}<\bar{\mu}_{\mathrm{cross}}$ and solid for $\bar{\mu}>\bar{\mu}_{\mathrm{cross}}$ ). The function $t_{D}(\bar{\mu})$ has a maximum at $\bar{\mu}_{\mathrm{max}}=\bar{\mu}_{\mathrm{cross}}\exp(1-y)$ , producing the maximal decision timescale for a given transition as

[TABLE]

where $z=e^{y-1}/\sqrt{y}\approx 0.882$ .

Impact of noise on the critical point

In order to asses the general impact of noise on the critical coupling strength $\mu_{c}$ , we consider a mean-field approach (fully connected graph) in the absence of an external signal ( $I=0$ ). The general approach employed here is analogous to the one used in [51, 52], where a more detailed account can be found. The mean field stochastic differential equation corresponding to Eq. 1 reads:

[TABLE]

Here $\langle\tanh s\rangle$ represents the expectation value of the interaction term, and $\tilde{\mu}=N\mu$ is the mean field coupling strength scaled by the number of nodes $N$ . Assuming $s\ll 1$ , we use the Taylor expansion $\tanh x\approx x-x^{3}/3$ to rewrite the interaction of the individual node with the mean field in terms of the first and third moment of $s$ :

[TABLE]

From the above stochastic differential equation we can derive the following nonlinear Fokker-Planck equation for the probability density function (PDF) $p(s,t)=\langle\frac{1}{N}\sum_{j}\delta(s-s_{j})\rangle$ :

[TABLE]

Here, the crucial simplifying assumption in the derivation is that the $N$ -particle distribution function factorizes, i.e. that the correlations between nodes can be neglected (mean-field ansatz).

Inserting Eq. 17 into

[TABLE]

produces a hierarchy of coupled evolution equations for the different moments $\langle s^{n}\rangle$ of the PDF.

We rewrite the state variable as $s=u+\delta s$ , where $u$ is the average state of the system and $\delta s$ is the fluctuation around the mean, and assume that $\langle\delta s^{k}\rangle=0$ for all odd $k$ ( $k=1,3,5,\dots$ ). This allows us to express the first three moments of $p(s,t)$ as:

[TABLE]

In the following, we will use the notation $T=\langle\delta s^{2}\rangle$ for the variance of the fluctuations. It can be viewed as an effective “temperature” quantifying the intensity of the fluctuations around the mean.

Eventually combining equations 17, 18 and 19 we arrive at the following evolution equations for the mean $u$ and the temperature $T$ :

[TABLE]

The corresponding stationary solutions can be easily obtained by solving the above equations for $\partial_{t}u=\partial_{t}T=0$ . The stationary temperature $T^{*}$ is

[TABLE]

The cubic equation for the mean yields three stationary solutions $u^{*}$ , which correspond directly to the fixed points $\nu^{*}$ discussed in the context of the general pitch-fork bifurcation above:

[TABLE]

Here, the $u_{1}^{*}$ corresponds to the disordered solution below a critical coupling strength. If the coupling strength becomes too large, then this disordered solution becomes unstable. In the absence of an external signal (bias), we observe a spontaneous symmetry breaking, where $u_{2,3}^{*}$ correspond to the two possible solutions, identical with the two different branches of the pitchfork. These two stationary solutions exist only if the argument in the square root is positive (as $u^{*}\in\mathbb{R}$ ).

For vanishing noise, $\sigma^{2}=0$ , the critical mean-field coupling strength is $\tilde{\mu}_{c}=1$ , which is consistent with our previous results if we consider a fully connected graph. For small noise $\sigma\ll 1$ the critical point is modified according to:

[TABLE]

Thus, introducing noise leads effectively only to a shift of the critical coupling strength to larger values, without any qualitative change regarding general result obtained for the zero noise case. Furthermore, the shift is small for the regime we test in this study: with $\sigma=0.05$ we expect corrections to $\mu_{c}$ on the order of $10^{-3}$ .

Bibliography52

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Jolle W Jolles, Neeltje J Boogert, Vivek H Sridhar, Iain D Couzin, and Andrea Manica. Consistent individual differences drive collective behavior and group functioning of schooling fish. Current Biology , 27(18):2862–2868, 2017.
2[2] David Bierbach, Tim Landgraf, Pawel Romanczuk, Juliane Lukas, Hai Nguyen, Max Wolf, and Jens Krause. Using a robotic fish to investigate individual differences in social responsiveness in the guppy. Royal Society Open Science , 5, 2018.
3[3] Iain D Couzin, Christos C Ioannou, Güven Demirel, Thilo Gross, Colin J Torney, Andrew Hartnett, Larissa Conradt, Simon A Levin, and Naomi E Leonard. Uninformed individuals promote democratic consensus in animal groups. science , 334(6062):1578–1580, 2011.
4[4] Itai Pinkoviezky, Iain D Couzin, and Nir S Gov. Collective conflict resolution in groups on the move. Physical Review E , 97(3):032304, 2018.
5[5] Bryan C Daniels, David C Krakauer, and Jessica C Flack. Sparse code of conflict in a primate society. Proceedings of the National Academy of Sciences , 109(35):14259–14264, 2012.
6[6] Bryan C Daniels, David C Krakauer, and Jessica C Flack. Control of finite critical behaviour in a small-scale social system. Nature communications , 8:14301, 2017.
7[7] Logan Harriger, Martijn P. van den Heuvel, and Olaf Sporns. Rich Club Organization of Macaque Cerebral Cortex and Its Role in Network Communication. P Lo S ONE , 7(9), 2012.
8[8] S. Nigam, M. Shimono, S. Ito, F.-C. Yeh, N. Timme, M. Myroshnychenko, C. C. Lapish, Z. Tosi, P. Hottowy, W. C. Smith, S. C. Masmanidis, A. M. Litke, O. Sporns, and J. M. Beggs. Rich-Club Organization in Effective Connectivity among Cortical Neurons. Journal of Neuroscience , 36(3):670–684, 2016.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Quantifying the impact of network structure on speed and accuracy in collective decision-making

Abstract

Introduction

Results

Collective decision-making model

Speed–accuracy tradeoff

Locating the transition and decision states

Predicting the timescale of the decision

Discussion

Acknowledgments

Appendix

Derivation of distance between stable fixed points

Derivation of approximate decision timescale tDt_{D}tD​

Impact of noise on the critical point

Derivation of approximate decision timescale $t_{D}$