Social learning in a simple task allocation game

Rui Chen; Garcia Julian; Meyer Bernd

arXiv:1702.05739·q-bio.PE·February 21, 2017

Social learning in a simple task allocation game

Rui Chen, Garcia Julian, Meyer Bernd

PDF

Open Access

TL;DR

This paper explores how social interactions and learning mechanisms influence task specialization in colonies, using evolutionary game theory and simulations to reveal conditions favoring specialists or generalists.

Contribution

It introduces a simple task-allocation game and demonstrates how different social learning processes lead to diverse colony structures under ecological variations.

Findings

01

Social learning can produce specialized or generalized colonies.

02

Introspective learning favors specialization.

03

Task recruitment promotes generalization.

Abstract

We investigate the effects of social interactions in task al- location using Evolutionary Game Theory (EGT). We propose a simple task-allocation game and study how different learning mechanisms can give rise to specialised and non- specialised colonies under different ecological conditions. By combining agent-based simulations and adaptive dynamics we show that social learning can result in colonies of generalists or specialists, depending on ecological parameters. Agent-based simulations further show that learning dynamics play a crucial role in task allocation. In particular, introspective individual learning readily favours the emergence of specialists, while a process resembling task recruitment favours the emergence of generalists.

Figures4

Click any figure to enlarge with its caption.

Equations8

B^{A} (x_{i}) = - \frac{4}{n ^{2}} \cdot (j = 1 \sum n x_{j})^{2} + \frac{4}{n} \cdot j = 1 \sum n x_{j}

B^{A} (x_{i}) = - \frac{4}{n ^{2}} \cdot (j = 1 \sum n x_{j})^{2} + \frac{4}{n} \cdot j = 1 \sum n x_{j}

B^{B}(x_{i})=\frac{1}{n}\cdot b\bigg{(}\sum_{j=1}^{n}(1-x_{j})\bigg{)}.

B^{B}(x_{i})=\frac{1}{n}\cdot b\bigg{(}\sum_{j=1}^{n}(1-x_{j})\bigg{)}.

\frac{8 b}{n ^{2}} (3 x^{*} - 2) + 2 > 0

\frac{8 b}{n ^{2}} (3 x^{*} - 2) + 2 > 0

\frac{8 b}{n ^{2}} (3 x^{*} - 2) + 2 < 0.

\frac{8 b}{n ^{2}} (3 x^{*} - 2) + 2 < 0.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvolutionary Game Theory and Cooperation · Evolution and Genetic Dynamics · Opinion Dynamics and Social Influence

Full text

Social learning in a simple task allocation game

Rui Chen1, Julian García1

Bernd Meyer1

1Faculty of Information Technology

Monash University

Australia

{rui.chen, julian.garcia, bernd.meyer}@monash.edu

Background

The ability of social insect colonies to tackle different tasks simultaneously without central control is a key factor in their ecological success. Individual choices by colony members give rise to collective behaviours that are robust to dynamic and uncertain environments (Camazine et al.,, 2001). The mechanisms of this process, known as self-organised task allocation, have inspired a variety of applications in Computer Science, including scheduling (Campos et al.,, 2000) and control (Dressler,, 2007).

Individual choices in task allocation are determined by three different interacting factors: individual personalities, the environment, and interactions amongst colony members (Charbonneau and Dornhaus,, 2015). The interplay between individual traits and the environment has been studied extensively using models (Bonabeau et al.,, 1996; Duarte et al.,, 2012; Lichocki et al.,, 2012); but the effect of social interactions is less well-understood (Jeanson and Weidenmüller,, 2014; Kao et al.,, 2014).

We investigate the effects of social interactions in task allocation using Evolutionary Game Theory (EGT). We propose a simple task-allocation game and study how different learning mechanisms can give rise to specialised and non-specialised colonies under different ecological conditions. By combining agent-based simulations and adaptive dynamics we show that social learning can result in colonies of generalists or specialists, depending on ecological parameters. Agent-based simulations further show that learning dynamics play a crucial role in task allocation. In particular, introspective individual learning readily favours the emergence of specialists, while a process resembling task recruitment favours the emergence of generalists.

A Simple Task-Allocation Game

Our model assumes that individuals live in groups of size $n$ , and choose to allocate their effort into two different tasks: Task A is a regulatory task, such as fanning to cool down nest temperature; Task B is a foraging task, required to support the energy costs of the colony. The behaviour of individual $i$ is determined by a continuous trait $x_{i}\in\left[0,1\right]$ , the probability that individual $i$ will perform Task A. Thus, $1-x_{i}$ is the probability of individual $i$ engaging in Task B. In the most general sense, the payoff of individual $i$ is given by: $\Pi(x_{i})=B(x_{i})-C(x_{i})$ , where $B(x_{i})$ and $C(x_{i})$ are benefits and cost respectively. Total payoff for one individual depends not only on her own trait, but also on the traits of all others in the population.

We define the benefit function as $B(x_{i})=B^{A}(x_{i})\cdot B^{B}(x_{i})$ . This implies that the benefits of foraging are discounted if the colony is not well-regulated. For the regulatory task, we assume that benefits are maximised when an intermediate number of individuals are engaged in regulation, thus $B^{A}(x_{i})$ is a concave function in $\sum_{j=0}^{n}x_{j}$ . For the foraging task we assume that individual benefits increase linearly with collective foraging effort.

To fix ideas $B^{A}(x_{i})$ is assumed to be quadratic with maximum value 1 when half of the workers in the group are engaged in Task A. Likewise, $b$ is the individual reward from performing Task B in one time period. Thus,

[TABLE]

and

[TABLE]

For this payoff structure the rewards of foraging are maintained in full, only when regulation is optimal.

Costs are defined as $C(x_{i})=C^{A}(x_{i})+C^{B}(x_{i})$ . The cost of regulation is assumed to be linear, with $C^{A}(x_{i})=r\cdot x_{i}$ . Here, $r$ represents the cost of performing Task A for an individual in one time period. The nature of foraging implies decreasing marginal costs, thus $C^{B}(x_{i})=-(1-x_{i})^{2}+2(1-x_{i})$ for $x_{i}\in\left[0,1\right]$ .

Social learning

How do individuals in the colony learn to coordinate their efforts to successfully perform both tasks? We obtain a first approximation by using the mathematical framework of adaptive dynamics (Brännström et al.,, 2013). The underlying process resembles social learning, whereby individuals tend to copy those that are most successful. The assumptions of adaptive dynamics imply a very large population where mutations are rare and small (Waxman and Gavrilets,, 2005).

The analysis predicts a unique singular strategy $x^{*}$ that is convergent stable. We derive conditions for evolutionary branching; a situation in which a monomorphous population splits in two morphs (Doebeli et al.,, 2004). In the context of task allocation, this implies that the colony successfully tends to both tasks, but each individual specializes by putting all their effort into one single task. Alternatively, the colony can converge to a state in which generalists share responsibilities and each individual splits efforts in both tasks. We refer to the latter as weak specialisation, as opposed to strong specialisation in the former case. Given $x^{*}$ , the colony exhibits strong specialisation if

[TABLE]

or weak specialisation (evolutionary stable strategy) if

[TABLE]

The system can also converge to a state in which the colony fails to coordinate its efforts in both tasks. In this case we speak of an inviable colony.

Figure 1 shows how these different outcomes depend on ecological conditions as given by the values of $b$ and $r$ .

To check the robustness of our mathematical prediction we use an agent-based simulation. Social learning follows a Wright-Fisher process where the population is not necessarily monomorphous (Imhof and Nowak,, 2006). This is similar to roulette wheel selection in a standard Genetic Algorithm (Fogel,, 2006).

Figure 2 shows that our simulation results are in line with the predictions from adaptive dynamics. Our results indicate that when foraging is expensive, agents fail to coordinate and the colony is not viable. For intermediate costs of foraging individuals strongly specialize, while cheap foraging leads to a population of generalists.

Other Learning Mechanisms

The mathematical framework and the simulations so far assume that agents learn socially. This implies that individuals can observe the traits of all others in the colony, while at the same time inferring how successful a particular trait is. We perform simulations in which this assumption is relaxed in two different ways: i) A model of individual learning assumes that agents learn exclusively from their own experience. ii) A model of task recruitment assumes that agents respond to recruitment signals for either task.

Individual Learning

For individual learning we assume that agents remember their most recent strategy and its associated payoff. Innovations arise via exploration with a small probability. If a new strategy is not better than the most recent one, individuals roll back to their previous strategy. This class of introspective learning is thought to be less cognitively demanding than social learning.

Figure 3 shows the simulation results for this model. Individual learning readily favours strong specialisation across a large range of parameters.

Task Recruitment

We also investigate a process of task recruitment, inspired by the ecological literature on task allocation (Dukas,, 2008). In this version of the model, agents respond to recruitment signals for each task. At each time-step, individuals choose a single task to perform based on their trait, i.e., individual $i$ chooses Task A with probability $x_{i}$ . In a recruitment phase, individuals modify their trait according recruitment signals send by successful individuals in the population. In particular, we assume that the intensity of the signal is proportional to the fitness of the recruiter.

The simulation results for this model are shown in Figure 4. Task recruitment favours weak specialisation across a large range of parameters.

Discussion

This paper shows that EGT is a promising avenue to study the effect of social interactions in task allocation models. In particular, we show that a simple model of social learning can give insights into when to expect different levels of specialization. We show that the mechanism by which individuals learn can dramatically change predictions: while social learning is conducive to societies of specialists and generalists, individual learning readily leads to strong specialization.

More research is needed to understand how different species may rely on different kinds of learning. Our models suggest that the ecology of the tasks interacts in a non-trivial way with the cognitive capacities of the species.

Extensions of this model can introduce different types of tasks as well as the option of performing no tasks, which has been widely observed in social insect colonies. Further research is also needed to provide analytic predictions for individual learning, and to understand how different time-scales of learning and ecology may interact with each other in task allocation games.

Bibliography15

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Bonabeau et al., (1996) Bonabeau, E., Theraulaz, G., and Deneubourg, J.-L. (1996). Quantitative study of the fixed threshold model for the regulation of division of labour in insect societies. Proc. R. Soc. Lond. B .
2Brännström et al., (2013) Brännström, Å., Johansson, J., and von Festenberg, N. (2013). The Hitchhiker’s Guide to Adaptive Dynamics. Games 2013, Vol. 4, Pages 304-328 .
3Camazine et al., (2001) Camazine, S., Deneubourg, J.-L., Franks, N. R., Sneyd, J., Theraulaz, G., and Bonabeau, E. (2001). Self-organization in biological systems . Princeton University Press.
4Campos et al., (2000) Campos, M., Bonabeau, E., Theraulaz, G., and Deneubourg, J.-L. (2000). Dynamic scheduling and division of labor in social insects. Adpt. Behav.
5Charbonneau and Dornhaus, (2015) Charbonneau, D. and Dornhaus, A. (2015). When doing nothing is something. How task allocation strategies compromise between flexibility, efficiency, and inactive agents. J Bioecon .
6Doebeli et al., (2004) Doebeli, M., Hauert, C., and Killingback, T. (2004). The evolutionary origin of cooperators and defectors. Science , 306(5697):859–862.
7Dressler, (2007) Dressler, F. (2007). Self-organization in sensor and actor networks . Wiley, Chichester, England.
8Duarte et al., (2012) Duarte, A., Pen, I., Keller, L., and Weissing, F. J. (2012). Evolution of self-organized division of labor in a response threshold model. Behav. Ecol. Sociobiol.