Stability of Dining Clubs in the Kolkata Paise Problem with and without Cheating

Akshat Harlalka; Andrew Belmonte; Christopher Griffin

arXiv:2302.14142·physics.soc-ph·May 17, 2023

TL;DR

This paper studies the stability of dining clubs in the Kolkata Paise Restaurant problem, analyzing how cooperation, taxation, and cheating influence agents' strategies and system dynamics through theoretical and numerical methods.

Contribution

It introduces dining clubs and an evolutionary game framework, analyzes stability with and without cheating, and explores bifurcations and dynamics of cheater populations.

Findings

1. Joining the dining club is an evolutionarily stable strategy.

2. Cheating introduces an unstable fixed point and a bifurcation.

3. Numerical simulations reveal complex dynamics with multiple clubs.

Equations

$$p_{g}(n,g)=\sum_{k=0}^{n}\binom{n}{k}\left(\frac{n+g-1}{n+g}\right)^{n-k}\left(\frac{1}{n+g}\right)^{k}\frac{1}{k+1},$$

$$p_{n}(n,g)=\sum_{k=0}^{n-1}\binom{n-1}{k}\left(\frac{n}{n+g}\frac{1}{k+1}+\frac{g}{n+g}\frac{1}{k+2}\right)\left(\frac{n+g-1}{n+g}\right)^{n-k-1}\left(\frac{1}{n+g}\right)^{k}.$$

$$p_{g}(n,\alpha)=\frac{\left(1-\frac{1}{\alpha n+n}\right)^{n}\left((\alpha+1)n\left(\left(\frac{1}{\alpha n+n-1}+1\right)^{n}-1\right)+1\right)}{n+1}.$$

$$p_{n}(n,\alpha)=\frac{\left(1-\frac{1}{\alpha n+n}\right)^{n}}{n+1}\left\{\alpha^{2}n+\alpha n-\alpha-n-1-\left[(\alpha+1)\left((\alpha-1)n-1\right)\left(\frac{1}{\alpha n+n-1}+1\right)^{n}\right]\right\}.$$

$$p_{g}(\alpha)=\lim_{n\to\infty}p_{g}(n,\alpha)=\left(1-e^{-\frac{1}{\alpha+1}}\right)(\alpha+1),$$

$$p_{n}(\alpha)=\lim_{n\to\infty}p_{n}(n,\alpha)=-\alpha^{2}+e^{-\frac{1}{\alpha+1}}\left(\alpha^{2}+\alpha-1\right)+1.$$

$$\beta=\frac{g}{n+g}=\frac{\alpha}{1+\alpha}.$$

$$\alpha=\frac{\beta}{1-\beta}.$$

$$\dot{g}=g\left\langle S_{g}\right\rangle=g\,p_{g}(\beta).$$

$$\dot{\beta}=\beta\left[p_{g}(\beta)-\bar{p}(\beta)\right]=\beta\left(\left\langle S_{g}\right\rangle-\left\langle S\right\rangle\right).$$

$$\bar{p}(\beta)=\left\langle S\right\rangle=\frac{\alpha p_{g}(\alpha)+p_{n}(\alpha)}{1+\alpha},$$

$$\left\langle S\right\rangle=\bar{p}(\beta)=e^{\beta-1}(\beta-1)+1.$$

$$r(\beta)=p_{g}(\beta)-\bar{p}(\beta)=\left\langle S_{g}\right\rangle-\left\langle S\right\rangle=\frac{1-e^{\beta-1}}{1-\beta}-\left(e^{\beta-1}(\beta-1)+1\right),$$

$$\tilde{p}_{g}(\beta)=\frac{g\,p_{g}(\beta)\kappa}{g-g\,p_{g}(\beta)}=\frac{p_{g}(\beta)\kappa}{1-p_{g}(\beta)}.$$

$$\left\langle S_{g}\right\rangle=(1-\kappa)p_{g}(\beta)+\left[1-p_{g}(\beta)\right]\frac{p_{g}(\beta)\kappa}{1-p_{g}(\beta)}=p_{g}(\beta).$$

$$\kappa^{*}=1-p_{g}(\beta).$$

$$\tilde{p}_{g}(\beta)=\frac{\kappa g\,p_{g}(\beta)}{n\left[1-p_{n}(\beta)\right]\phi+\left[1-p_{g}(\beta)\right]g}=\frac{\alpha\kappa p_{g}(\beta)}{\phi\left[1-p_{n}(\beta)\right]+\alpha\left[1-p_{g}(\beta)\right]},$$

$$\frac{\kappa g\,p_{g}(\beta)}{g\left[1-p_{g}(\beta)\right]+\left[1-p_{n}(\beta)\right]\chi(n+\alpha n)}=\frac{\kappa\alpha p_{g}(\beta)}{\alpha\left[1-p_{g}(\beta)\right]+\left[1-p_{n}(\beta)\right]\chi(1+\alpha)}=\frac{\kappa\alpha p_{g}(\beta)}{\alpha\left[1-p_{g}(\beta)\right]+\left[1-p_{n}(\beta)\right]\chi(1-\beta)^{-1}}.$$

$$\left\langle S_{g}\right\rangle=(1-\kappa)p_{g}(\beta)+\left[1-p_{g}(\beta)\right]\frac{\alpha\kappa p_{g}(\beta)}{\alpha\left[1-p_{g}(\beta)\right]+\left[1-p_{n}(\beta)\right]\chi(1-\beta)^{-1}},$$

$$\left\langle S_{f}\right\rangle=p_{n}(\beta)+\left[1-p_{n}(\beta)\right]\frac{\alpha\kappa p_{g}(\beta)}{\alpha\left[1-p_{g}(\beta)\right]+\left[1-p_{n}(\beta)\right]\chi(1-\beta)^{-1}},\quad\text{and}$$

$$\left\langle S_{h}\right\rangle=p_{n}(\beta).$$

$$\left\langle S\right\rangle=\chi\left\langle S_{f}\right\rangle+\beta\left\langle S_{g}\right\rangle+\eta\left\langle S_{h}\right\rangle.$$

$$\dot{\beta}=\beta\left(\left\langle S_{g}\right\rangle-\left\langle S\right\rangle\right),\qquad\dot{\chi}=\chi\left(\left\langle S_{f}\right\rangle-\left\langle S\right\rangle\right),$$

$$\Delta_{2}=\left\{(\beta,\chi)\in\mathbb{R}^{2}:\beta+\chi\leq 1,\ \beta\geq 0,\ \chi\geq 0\right\}.$$

Taxonomy

Topics: Economic theories and models · Evolutionary Game Theory and Cooperation · Game Theory and Applications

Full text

Stability of Dining Clubs in the Kolkata Paise Problem with and without Cheating

Akshat Harlalka

Department of Computer Science, Penn State University, University Park, PA 16802

Andrew Belmonte

Department of Mathematics & Huck Institute of Life Sciences, Penn State University, University Park, PA 16802

Christopher Griffin

Applied Research Laboratory, Penn State University, University Park, PA 16802

Abstract

We introduce the idea of a dining club to the Kolkata Paise Restaurant Problem. In this problem, $N$ agents choose (randomly) among $N$ restaurants, but if multiple agents choose the same restaurant, only one will eat. Agents in the dining club will coordinate their restaurant choice to avoid choice collision and increase their probability of eating. We model the problem of deciding whether to join the dining club as an evolutionary game and show that the strategy of joining the dining club is evolutionarily stable. We then introduce an optimized member tax to those individuals in the dining club, which is used to provide a safety net for those group members who don’t eat because of collision with a non-dining club member. When non-dining club members are allowed to cheat and share communal food within the dining club, we show that a new unstable fixed point emerges in the dynamics. A bifurcation analysis is performed in this case. To conclude our theoretical study, we then introduce evolutionary dynamics for the cheater population and study these dynamics. Numerical experiments illustrate the behaviour of the system with more than one dining club and show several potential areas for future research.

I Introduction

The Kolkata Paise Restaurant Problem (KPRP) was first introduced in 2007 [1] during work on the Kolkata Paise Hotel Problem. Since then, it has been studied extensively [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 1, 14, 15, 16, 17] in the econophysics literature. In its simplest form, we assume $N\gg 1$ agents will choose among $N$ restaurants. Choice is governed by a distribution determined by an implicit ranking of the restaurants. The ranking represents the payoff of eating at a given restaurant. If two or more agents select the same restaurant, then the restaurant randomly chooses which agent to serve.

A broad overview of KPRP can be found in [3, 7, 11]. When all restaurants are ranked equally (i.e., have payoff $1$ ) and agents choose a restaurant at random, the expected payoff to each agent is easily seen to approach $1-1/e$ as $N\to\infty$ . Using stochastic strategies and resource utilization models, the mean payoff can be increased to $\sim 0.8$ [18]. Identifying strategies to improve on the uncoordinated outcome is a central problem in KPRP.
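The $1-1/e$ limit can be checked directly: a given restaurant is visited by at least one of the $N$ agents with probability $1-(1-1/N)^{N}$, and each visited restaurant serves exactly one meal. The following is a minimal sketch of that calculation; the function name `uncoordinated_payoff` is ours and does not appear in the source.

```python
import math


def uncoordinated_payoff(num_agents: int) -> float:
    """Expected payoff per agent when N agents pick uniformly among N
    equally ranked restaurants.

    A restaurant is occupied with probability 1 - (1 - 1/N)**N and every
    occupied restaurant feeds exactly one agent, so the expected number of
    meals per agent equals that occupancy probability.
    """
    n = num_agents
    return 1.0 - (1.0 - 1.0 / n) ** n


for n in (10, 100, 10_000):
    print(n, round(uncoordinated_payoff(n), 4))
print("limit 1 - 1/e:", round(1.0 - 1.0 / math.e, 4))
```

The finite-$N$ values sit slightly above the limit and decrease towards it as $N$ grows.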

KPRP is an example of an anti-coordination game (such as Hawk-Dove) [19]. Other examples of this class of game are minority games [20, 21] and the El Farol bar problem [22, 23, 24, 25]. These types of games also emerge in models of channel sharing in communications systems [26, 27, 28].

Learning in KPRP is considered in [12, 18, 29], with both classical and quantum learning considered in [12]. Quantum versions of the problem are considered in [12, 15, 16], and its relevance to other areas of physical modelling is considered in [8, 17, 14, 10], with phase transitions considered recently in [2, 9]. Distributed and coordinated solutions to optimizing agent payoff are discussed in [4, 5, 6, 13].

In this paper, we use evolutionary game theory to study a group formation problem within the context of KPRP. We assume that some subset of the population of $N$ individuals forms a dining club. Individuals in the dining club coordinate their actions and will choose distinct restaurants from each other, thus increasing the odds that any individual within the dining club will eat. In this context, we show the following results:

1. When all restaurants are ranked equally, membership in the dining club is globally stable. That is, asymptotically all players join the dining club (in the limit as $N\to\infty$ ).

2. When the dining club taxes its members by collecting food for redistribution to those members who did not eat, there is an optimal tax rate that ensures all members are equally well-fed.

3. When non-club members can choose to deceptively share in the communal food (freeload) of the dining club, a new unstable fixed point emerges. The fixed point corresponding to a population where all members join the dining club remains stable, but is no longer globally stable. We characterize the basin of attraction in this case. This effectively introduces a public goods game into the KPRP.

4. We then use numerical analysis to study the case where two dining clubs are active. We numerically illustrate the existence of equilibrium surfaces where multiple dining clubs can exist simultaneously along with non-group members as a result of group taxation (food sharing), cheating (freeloading), and cheating detection.

The remainder of this paper is organized as follows: In Section II, we analyse an evolutionary model of KPRP with a dining club. We study resource distribution through taxation and cheating in Section III. Cheating is modelled in an evolutionary context in Section IV. KPRP with multiple dynamic clubs is studied numerically in Section V. Finally, in Section VI we present conclusions and future directions.

II Mathematical Analysis

We first study KPRP with a single dining club. Let $g$ be the size of the dining club and let $n$ be the size of the free population with total population given by $N=g+n$ . The probability that an individual in the dining club eats is given by

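$$p_{g}(n,g)=\sum_{k=0}^{n}\binom{n}{k}\left(\frac{n+g-1}{n+g}\right)^{n-k}\left(\frac{1}{n+g}\right)^{k}\frac{1}{k+1},$$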

while the probability that a free individual eats is given by

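$$p_{n}(n,g)=\sum_{k=0}^{n-1}\binom{n-1}{k}\left(\frac{n}{n+g}\frac{1}{k+1}+\frac{g}{n+g}\frac{1}{k+2}\right)\left(\frac{n+g-1}{n+g}\right)^{n-k-1}\left(\frac{1}{n+g}\right)^{k}.$$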

If we assume $g=\alpha n$ and sum over $k$ , then we can rewrite $p_{g}(n,g)$ in closed form as

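$$p_{g}(n,\alpha)=\frac{\left(1-\frac{1}{\alpha n+n}\right)^{n}\left((\alpha+1)n\left(\left(\frac{1}{\alpha n+n-1}+1\right)^{n}-1\right)+1\right)}{n+1}.$$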

Likewise, $p_{n}(n,g)$ can be written as

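$$p_{n}(n,\alpha)=\frac{\left(1-\frac{1}{\alpha n+n}\right)^{n}}{n+1}\left\{\alpha^{2}n+\alpha n-\alpha-n-1-\left[(\alpha+1)\left((\alpha-1)n-1\right)\left(\frac{1}{\alpha n+n-1}+1\right)^{n}\right]\right\}.$$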

If we compute the limit as $n\to\infty$ , this yields the asymptotic probabilities

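$$p_{g}(\alpha)=\lim_{n\to\infty}p_{g}(n,\alpha)=\left(1-e^{-\frac{1}{\alpha+1}}\right)(\alpha+1),\tag{1}$$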

and

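$$p_{n}(\alpha)=\lim_{n\to\infty}p_{n}(n,\alpha)=-\alpha^{2}+e^{-\frac{1}{\alpha+1}}\left(\alpha^{2}+\alpha-1\right)+1.\tag{2}$$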
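As a numerical sanity check, the finite-population sums can be evaluated directly and compared with the asymptotic probabilities $(1-e^{-1/(\alpha+1)})(\alpha+1)$ and $-\alpha^{2}+e^{-1/(\alpha+1)}(\alpha^{2}+\alpha-1)+1$. The sketch below is illustrative only; the function names and the choice $n=200$, $g=100$ (so $\alpha=0.5$) are our assumptions.

```python
import math


def p_club_finite(n: int, g: int) -> float:
    """Sum over k free agents colliding with one club member's restaurant."""
    total = n + g
    hit, miss = 1.0 / total, (total - 1.0) / total
    return sum(
        math.comb(n, k) * miss ** (n - k) * hit ** k / (k + 1)
        for k in range(n + 1)
    )


def p_free_finite(n: int, g: int) -> float:
    """Sum over k other free agents; a club member is also present w.p. g/(n+g)."""
    total = n + g
    hit, miss = 1.0 / total, (total - 1.0) / total
    no_club, club_here = n / total, g / total
    return sum(
        math.comb(n - 1, k)
        * (no_club / (k + 1) + club_here / (k + 2))
        * miss ** (n - k - 1)
        * hit ** k
        for k in range(n)
    )


def p_club_limit(alpha: float) -> float:
    return (1.0 - math.exp(-1.0 / (alpha + 1.0))) * (alpha + 1.0)


def p_free_limit(alpha: float) -> float:
    decay = math.exp(-1.0 / (alpha + 1.0))
    return -alpha ** 2 + decay * (alpha ** 2 + alpha - 1.0) + 1.0


n, g = 200, 100  # illustrative sizes, alpha = g / n = 0.5
print("club:", p_club_finite(n, g), "limit:", p_club_limit(g / n))
print("free:", p_free_finite(n, g), "limit:", p_free_limit(g / n))
```

For these sizes the finite and asymptotic values should agree to within a few parts in a thousand, with the gap shrinking as $n$ grows at fixed $\alpha$.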

For the remainder of this section and the next, we assume an infinite population. While it was easier to work with $g=\alpha n$ for the previous computation, for further analysis it is simpler to express $g$ as a fraction of the total population. Let

$$\beta=\frac{g}{n+g}=\frac{\alpha}{1+\alpha}.\tag{3}$$

Substituting

$$\alpha=\frac{\beta}{1-\beta}$$

into Eqs. 1 and 2 yields the simplified forms,

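$$p_{g}(\beta)=\frac{1-e^{\beta-1}}{1-\beta},\qquad p_{n}(\beta)=\frac{1-2\beta+e^{\beta-1}\left(3\beta-1-\beta^{2}\right)}{(1-\beta)^{2}}.$$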

A simple plot shows that $p_{g}(\beta)\geq p_{n}(\beta)$ for all $\beta\in[0,1]$ .
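The inequality can also be checked on a grid rather than from a plot. The sketch below uses the forms obtained by substituting $\alpha=\beta/(1-\beta)$ into Eqs. 1 and 2, namely $p_{g}(\beta)=(1-e^{\beta-1})/(1-\beta)$ and $p_{n}(\beta)=\left[1-2\beta+e^{\beta-1}(3\beta-1-\beta^{2})\right]/(1-\beta)^{2}$; the second expression is our own algebra, and the function names are assumptions.

```python
import math


def p_club(beta: float) -> float:
    """Asymptotic probability that a club member eats, p_g(beta)."""
    return (1.0 - math.exp(beta - 1.0)) / (1.0 - beta)


def p_free(beta: float) -> float:
    """Asymptotic probability that a free agent eats, p_n(beta)."""
    e = math.exp(beta - 1.0)
    numerator = 1.0 - 2.0 * beta + e * (3.0 * beta - 1.0 - beta * beta)
    return numerator / (1.0 - beta) ** 2


# beta = 0.00, 0.01, ..., 0.99; beta = 1 is a 0/0 limit and is left out.
grid = [i / 100 for i in range(100)]
gaps = [p_club(b) - p_free(b) for b in grid]
print("smallest gap p_g - p_n on the grid:", min(gaps))
```

The two probabilities coincide at $\beta=0$, where both equal $1-1/e$, and the gap is positive elsewhere on the grid.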

Let $S_{g}$ be a random variable denoting the meal size for an individual in the dining club, and let $S$ be a random variable denoting the meal size for a randomly chosen member of the population. Then the probability of eating $p_{g}(\beta)$ is now easily seen as the expected meal size $\left\langle S_{g}\right\rangle$ , with a meal size of $1$ corresponding to eating and a meal size of $0$ corresponding to not eating. Using this interpretation, and equating meal size with fitness, we assume the rate of growth of the dining club is given by

$$\dot{g}=g\left\langle S_{g}\right\rangle=g\,p_{g}(\beta).$$

From [30], it follows that the proportion $\beta$ must follow the replicator dynamic

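$$\dot{\beta}=\beta\left[p_{g}(\beta)-\bar{p}(\beta)\right]=\beta\left(\left\langle S_{g}\right\rangle-\left\langle S\right\rangle\right).\tag{4}$$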

The population mean $\bar{p}(\beta)=\left\langle S\right\rangle$ can be computed as

$$\bar{p}(\beta)=\left\langle S\right\rangle=\frac{\alpha p_{g}(\alpha)+p_{n}(\alpha)}{1+\alpha},$$

and converted to an expression in $\beta$ using Eqs. 1, 2 and 3 as,

$$\left\langle S\right\rangle=\bar{p}(\beta)=e^{\beta-1}(\beta-1)+1.$$

Let

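$$r(\beta)=p_{g}(\beta)-\bar{p}(\beta)=\left\langle S_{g}\right\rangle-\left\langle S\right\rangle=\frac{1-e^{\beta-1}}{1-\beta}-\left(e^{\beta-1}(\beta-1)+1\right)$$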

be the growth rate of $\beta$ . Then $r(0)=0$ and we see that $\lim_{\beta\to 1}r(\beta)=0$ . That is, Eq. 4 has two fixed points. From Fig. 1, we must have $r(\beta)>0$ for $0<\beta<1$ . This is illustrated in Fig. 2 (left). It follows that $\beta(t)$ is described by a non-logistic sigmoid, as shown in Fig. 2 (right).

We conclude that the decision to join the dining club is an evolutionarily stable strategy and the fixed point $\beta=1$ is globally asymptotically stable while the fixed point $\beta=0$ is asymptotically unstable.
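The sigmoidal approach to $\beta=1$ can be reproduced with a simple forward-Euler integration of $\dot{\beta}=\beta\,r(\beta)$, using $r(\beta)=\frac{1-e^{\beta-1}}{1-\beta}-\left(e^{\beta-1}(\beta-1)+1\right)$. This is a rough sketch, not the integrator used for the figures; the step size, time horizon, initial condition and function names are our assumptions.

```python
import math


def growth_rate(beta: float) -> float:
    """r(beta) = p_g(beta) - mean payoff, valid for 0 <= beta < 1."""
    # expm1 keeps (1 - e^(beta-1)) accurate when beta is close to 1.
    club = -math.expm1(beta - 1.0) / (1.0 - beta)
    mean = math.exp(beta - 1.0) * (beta - 1.0) + 1.0
    return club - mean


def integrate(beta0: float, t_end: float = 400.0, dt: float = 0.01) -> list[float]:
    """Forward-Euler trajectory of d(beta)/dt = beta * r(beta)."""
    beta = beta0
    trajectory = [beta]
    for _ in range(int(t_end / dt)):
        if beta >= 1.0 - 1e-9:  # stop before the removable 0/0 at beta = 1
            break
        beta += dt * beta * growth_rate(beta)
        trajectory.append(beta)
    return trajectory


path = integrate(0.05)
print("start:", path[0], "end:", path[-1], "steps:", len(path) - 1)
```

Because $r(\beta)$ vanishes at both ends of the interval, growth is slow near $\beta=0$ and near $\beta=1$ and fastest in between.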

III Social Safety Nets and Deceptive Free Loading

Suppose the dining club imposes a food tax on its members at the rate $\kappa\in[0,1]$ so that if a diner is successful in obtaining food, then he reserves $\kappa\times 100\%$ of his meal to be shared with club members who choose a restaurant that is occupied by an independent individual. If we assume these resources are pooled and then shared equally, the expected meal size (normalized to the interval $[0,1]$ ) available for a club member who cannot obtain food on his own is given by

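$$\tilde{p}_{g}(\beta)=\frac{g\,p_{g}(\beta)\kappa}{g-g\,p_{g}(\beta)}=\frac{p_{g}(\beta)\kappa}{1-p_{g}(\beta)}.$$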

Note that sharing (for any value of $\kappa$ ) does not affect the expected meal size obtained by a group member, since we have the expected meal size

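$$\left\langle S_{g}\right\rangle=(1-\kappa)p_{g}(\beta)+\left[1-p_{g}(\beta)\right]\frac{p_{g}(\beta)\kappa}{1-p_{g}(\beta)}=p_{g}(\beta).\tag{6}$$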

We can construct a tax-rate that depends on $\beta$ and ensures all participants in the dining club receive the same meal size. Setting $\tilde{p}_{g}(\beta)=1-\kappa$ and solving, we obtain:

$$\kappa^{*}=1-p_{g}(\beta).\tag{7}$$

Thus, as $\beta$ increases, the tax decreases. As a result of Eq. 6, the right-hand-side of Eq. 4 remains unchanged and the decision to join the dining club is still evolutionarily stable, even in the presence of sharing. That is $\beta=1$ is still globally asymptotically stable.
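For completeness, the algebra behind the equalizing tax rate can be written out. The worked steps below only restate the condition $\tilde{p}_{g}(\beta)=1-\kappa$ with $\tilde{p}_{g}(\beta)=p_{g}(\beta)\kappa/[1-p_{g}(\beta)]$; the end-point values use $p_{g}(0)=1-1/e$ and $p_{g}(\beta)\to 1$ as $\beta\to 1$.

```latex
\begin{align*}
\frac{p_{g}(\beta)\,\kappa}{1-p_{g}(\beta)} = 1-\kappa
  &\iff p_{g}(\beta)\,\kappa = (1-\kappa)\bigl(1-p_{g}(\beta)\bigr) \\
  &\iff p_{g}(\beta)\,\kappa = 1-p_{g}(\beta)-\kappa+\kappa\,p_{g}(\beta) \\
  &\iff \kappa = 1-p_{g}(\beta), \\[4pt]
\kappa^{*}(0) = \frac{1}{e} \approx 0.368,
  &\qquad \lim_{\beta\to 1}\kappa^{*}(\beta) = 0 .
\end{align*}
```

With this rate, a member who eats keeps $1-\kappa^{*}=p_{g}(\beta)$ and a member who does not eat receives $\tilde{p}_{g}(\beta)=p_{g}(\beta)$, so both end up with the same meal size.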

Suppose a proportion $\phi\in[0,1]$ of the independent population that does not eat can deceptively pose as club members, thereby sharing in the communally available food. In the presence of a food tax, the resulting decision to join the dining club now becomes a public goods problem. Then the expected meal size to anyone receiving shared food is given by

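$$\tilde{p}_{g}(\beta)=\frac{\kappa g\,p_{g}(\beta)}{n\left[1-p_{n}(\beta)\right]\phi+\left[1-p_{g}(\beta)\right]g}=\frac{\alpha\kappa p_{g}(\beta)}{\phi\left[1-p_{n}(\beta)\right]+\alpha\left[1-p_{g}(\beta)\right]},\tag{8}$$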

where $\alpha$ is defined in terms of $\beta$ in Eq. 3. Let $S_{n}$ be the random variable denoting the expected meal size for an independent member of the population. Then as a function of $\kappa$ and $\phi$ ,

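$$\begin{aligned}\left\langle S_{g}\right\rangle&=(1-\kappa)p_{g}(\beta)+\left[1-p_{g}(\beta)\right]\tilde{p}_{g}(\beta),\\ \left\langle S_{n}\right\rangle&=p_{n}(\beta)+\phi\left[1-p_{n}(\beta)\right]\tilde{p}_{g}(\beta).\end{aligned}$$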

It is possible but unwieldy to compute $r(\beta,\phi)=\left\langle S_{g}\right\rangle-\left\langle S\right\rangle$ using the expected meal size with deception rate $\phi$ and group size $\beta$ . Plotting sample curves for $r(\beta,\phi)$ shows that the growth rate now changes sign at some value $\beta(\phi)$ ; see Fig. 3 (left).

As a consequence of this, the replicator equation for $\beta$ is given by

$$\dot{\beta}=\beta\left(\left\langle S_{g}\right\rangle-\left\langle S\right\rangle\right).$$

These dynamics exhibit a new unstable equilibrium point, illustrating a bifurcation in parameter $\phi$ with numerically computed bifurcation diagram shown in Fig. 3 (right). An example solution flow (for various initial conditions) is shown in Fig. 4.

We can compute $\beta^{*}\approx 0.577$ for $\phi=1$ . This is particularly interesting because we have essentially constructed a public goods problem in which joining the dining club enforces a taxation rate of $\kappa=1-p_{g}(\beta)$ on the members, who are then guaranteed (the public good of) a meal each day. The presence of freeloaders destabilizes the group formation process, but does not guarantee that a group cannot form. Since $\beta^{*}(\phi)$ is monotonically increasing, it follows that if $\phi$ grows slowly enough so that at any time $\beta(t)>\beta^{*}[\phi(t)]$ , then the dining club will grow to include the entire population. If $\beta(t)<\beta^{*}[\phi(t)]$ , then the dining club collapses. We impose an evolutionary dynamic on the freeloaders in the next section to study this effect.
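The threshold $\beta^{*}$ for $\phi=1$ can be recovered by bisection on the growth rate $\left\langle S_{g}\right\rangle-\left\langle S\right\rangle$. The sketch below is built on assumptions that should be read as ours: the tax is set to $\kappa=1-p_{g}(\beta)$, a club member who eats keeps $1-\kappa$, every non-eater in the club and a fraction $\phi$ of non-eating independents receive the shared portion $\tilde{p}_{g}(\beta)$, and $p_{n}(\beta)$ is the form obtained by substituting $\alpha=\beta/(1-\beta)$ into Eq. 2. All function names are hypothetical.

```python
import math


def p_club(beta: float) -> float:
    return -math.expm1(beta - 1.0) / (1.0 - beta)


def p_free(beta: float) -> float:
    e = math.exp(beta - 1.0)
    return (1.0 - 2.0 * beta + e * (3.0 * beta - 1.0 - beta * beta)) / (1.0 - beta) ** 2


def growth_rate(beta: float, phi: float) -> float:
    """<S_g> - <S> with freeloading rate phi and tax kappa = 1 - p_g(beta)."""
    pg, pn = p_club(beta), p_free(beta)
    alpha = beta / (1.0 - beta)
    kappa = 1.0 - pg
    # Shared meal size available to anyone drawing on the communal pool.
    shared = alpha * kappa * pg / (phi * (1.0 - pn) + alpha * (1.0 - pg))
    s_club = (1.0 - kappa) * pg + (1.0 - pg) * shared
    s_free = pn + phi * (1.0 - pn) * shared
    s_mean = beta * s_club + (1.0 - beta) * s_free
    return s_club - s_mean


def freeloading_threshold(phi: float, lo: float = 0.05, hi: float = 0.95,
                          tol: float = 1e-10) -> float:
    """Bisection for the interior zero of the growth rate on [lo, hi]."""
    f_lo, f_hi = growth_rate(lo, phi), growth_rate(hi, phi)
    if f_lo * f_hi > 0.0:
        raise ValueError("growth rate does not change sign on [lo, hi]")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = growth_rate(mid, phi)
        if f_lo * f_mid <= 0.0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)


beta_star = freeloading_threshold(1.0)
print("beta* for phi = 1:", round(beta_star, 3))
```

Under these assumptions the root lands close to the value $\beta^{*}\approx 0.577$ quoted in the text. For small $\phi$ the growth rate may not change sign on the bracket, in which case the sketch raises `ValueError` rather than returning a threshold.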

IV Evolving Freeloaders

If we divide the population into three groups, dining club members ( $g$ ), non-dining club freeloaders ( $f$ ) and non-dining club non-freeloaders ( $h$ ), we can construct an evolutionary dynamic for the freeloaders. Let $\chi$ be the proportion of the population that is not in the dining club and will freeload (cheating), and let $\eta=1-\beta-\chi$ be the proportion of the population that is not in the dining club and not freeloading (honest). Then the population of freeloaders is $\chi(n+\alpha n)$ . The expected meal size to any agent accepting communal food is then

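$$\frac{\kappa g\,p_{g}(\beta)}{g\left[1-p_{g}(\beta)\right]+\left[1-p_{n}(\beta)\right]\chi(n+\alpha n)}=\frac{\kappa\alpha p_{g}(\beta)}{\alpha\left[1-p_{g}(\beta)\right]+\left[1-p_{n}(\beta)\right]\chi(1+\alpha)}=\frac{\kappa\alpha p_{g}(\beta)}{\alpha\left[1-p_{g}(\beta)\right]+\left[1-p_{n}(\beta)\right]\chi(1-\beta)^{-1}}.$$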

Let $S_{g}$ be as before, and let $S_{f}$ be the random variable denoting the meal size for an individual in the freeloading group and $S_{h}$ be the random variable denoting meal size for an individual from the non-freeloading non-dining club group. It follows from Eqs. 8, 9 and 10 that

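$$\begin{aligned}\left\langle S_{g}\right\rangle&=(1-\kappa)p_{g}(\beta)+\left[1-p_{g}(\beta)\right]\frac{\alpha\kappa p_{g}(\beta)}{\alpha\left[1-p_{g}(\beta)\right]+\left[1-p_{n}(\beta)\right]\chi(1-\beta)^{-1}},\\ \left\langle S_{f}\right\rangle&=p_{n}(\beta)+\left[1-p_{n}(\beta)\right]\frac{\alpha\kappa p_{g}(\beta)}{\alpha\left[1-p_{g}(\beta)\right]+\left[1-p_{n}(\beta)\right]\chi(1-\beta)^{-1}},\text{ and}\\ \left\langle S_{h}\right\rangle&=p_{n}(\beta).\end{aligned}$$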

Here, we have replaced $\phi$ with its definition in terms of $\chi$ and $\beta$ . Employing the same reasoning we used to obtain Eq. 4, we can construct replicator equations for proportions $\beta$ , $\chi$ and $\eta$ .

The population mean meal size is

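$$\left\langle S\right\rangle=\chi\left\langle S_{f}\right\rangle+\beta\left\langle S_{g}\right\rangle+\eta\left\langle S_{h}\right\rangle.$$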

The dynamics of $\eta$ (the non-freeloading, non-dining club group) are extraneous, and we can focus on the two-dimensional system

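$$\begin{aligned}\dot{\beta}&=\beta\left(\left\langle S_{g}\right\rangle-\left\langle S\right\rangle\right)\\ \dot{\chi}&=\chi\left(\left\langle S_{f}\right\rangle-\left\langle S\right\rangle\right),\end{aligned}$$

which does not depend on the value of $\eta$ .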

Fig. 5 shows the dynamics of this evolutionary system. It is straightforward to compute that when $\beta=0$ , then $\left\langle S_{g}\right\rangle-\left\langle S\right\rangle=\left\langle S_{f}\right\rangle-\left\langle S\right\rangle=0$ for all values of $\chi\in[0,1]$ . Thus, the dynamics freeze on the left boundary of the simplex

$$\Delta_{2}=\left\{(\beta,\chi)\in\mathbb{R}^{2}:\beta+\chi\leq 1,\ \beta\geq 0,\ \chi\geq 0\right\}.$$

There is a single hyperbolic saddle on the boundary of $\Delta_{2}$ that can be numerically computed as $(\beta,\chi)\approx(0.578,0.422)$ . The two boundary equilibria $(\beta,\chi)=(1,0)$ and $(\beta,\chi)=(0,1)$ are both locally asymptotically stable. Thus, the long-run population behaviour is dependent on the initial conditions. We can numerically construct a curve of initial conditions showing this dichotomous behaviour. This is shown in Fig. 6 and as the red curve in Fig. 5.

As $\beta_{0}$ approaches $\beta^{*}\approx 0.578$ , corresponding to the equilibrium point for $\phi=1$ , the curve stops because $\chi_{0}$ would need to lie outside the simplex to cause the dining club to collapse. It is interesting to note that the phase portrait illustrates trajectories in which both $\beta$ and $\chi$ are increasing up to a point, followed by either the collapse of the dining club (while $\chi$ continues to increase) or the collapse of the freeloading group, as all population members join the dining club (and $\beta$ continues to increase).
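The dependence on initial conditions can be illustrated by integrating the two-dimensional replicator system from two starting points. This is a sketch under assumptions that are ours rather than the source's: the tax is taken to be $\kappa=1-p_{g}(\beta)$, $p_{n}(\beta)$ is the form obtained by substituting $\alpha=\beta/(1-\beta)$ into Eq. 2, integration is forward Euler, and the run stops once $\beta$ leaves $[10^{-3},1-10^{-3}]$ to avoid the removable singularity at $\beta=1$. Function names are hypothetical.

```python
import math


def p_club(beta: float) -> float:
    return -math.expm1(beta - 1.0) / (1.0 - beta)


def p_free(beta: float) -> float:
    e = math.exp(beta - 1.0)
    return (1.0 - 2.0 * beta + e * (3.0 * beta - 1.0 - beta * beta)) / (1.0 - beta) ** 2


def vector_field(beta: float, chi: float) -> tuple[float, float]:
    """Right-hand sides (d beta/dt, d chi/dt) of the replicator system."""
    pg, pn = p_club(beta), p_free(beta)
    alpha = beta / (1.0 - beta)
    kappa = 1.0 - pg  # assumed equalizing tax rate
    shared = alpha * kappa * pg / (
        alpha * (1.0 - pg) + (1.0 - pn) * chi / (1.0 - beta)
    )
    s_g = (1.0 - kappa) * pg + (1.0 - pg) * shared  # club members
    s_f = pn + (1.0 - pn) * shared                  # freeloaders
    s_h = pn                                        # honest independents
    eta = 1.0 - beta - chi
    mean = chi * s_f + beta * s_g + eta * s_h
    return beta * (s_g - mean), chi * (s_f - mean)


def simulate(beta0: float, chi0: float, t_end: float = 200.0,
             dt: float = 0.01) -> tuple[float, float]:
    """Forward-Euler integration, stopped near the beta = 0 or beta = 1 edge."""
    beta, chi = beta0, chi0
    for _ in range(int(t_end / dt)):
        if beta < 1e-3 or beta > 1.0 - 1e-3:
            break
        d_beta, d_chi = vector_field(beta, chi)
        beta += dt * d_beta
        chi += dt * d_chi
    return beta, chi


print("large club, few freeloaders:", simulate(0.9, 0.05))
print("small club, many freeloaders:", simulate(0.1, 0.5))
```

In this sketch the first initial condition is expected to run towards $\beta=1$ and the second towards $\beta=0$, matching the dichotomy described above.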

V Numerical Results on Multiple Dining Clubs

We now consider KPRP with two dining clubs. We model three groups of agents $\mathcal{F}$ , $\mathcal{G}_{1}$ and $\mathcal{G}_{2}$ for free agents, dining club one and dining club two respectively. We estimate $\left\langle S_{g_{1}}\right\rangle$ , $\left\langle S_{g_{2}}\right\rangle$ and $\left\langle S_{f}\right\rangle$ using Monte Carlo simulation. This Monte Carlo simulation is then embedded into a larger dynamic process for updating the groups.

In the Monte Carlo simulation, the free agent group acts normally, choosing a restaurant randomly. The members of the dining clubs also choose restaurants randomly, but with the constraint that no two agents in a dining club may choose the same restaurant. Since we are studying this system numerically, we introduce two kinds of taxation policies.

1. Policy I: We assume a given tax rate $\kappa$ with no redistribution; i.e., the tax goes to maintain the dining club in some form.

2. Policy II: Agents within the dining club are taxed at a rate $\kappa$ given by Eq. 7 and food is redistributed to club members who do not eat (and possibly freeloaders).

Agents in the free market who do not get food on a given day will, with probability $1$ , randomly choose a dining club to eat in. That is, we assume $\phi=1$ . We also introduce a probability $\rho$ that cheaters will be caught. If a cheater gets caught, their food is not distributed and becomes waste.

In the dynamic model that follows, we refer to the process of simulating groups eating over several days by the function $\texttt{MonteCarlo}(\mathcal{F},\mathcal{G}_{1},\mathcal{G}_{2},\kappa,\rho)$ . The system dynamics of our simulation are then described by the following steps:

1:Input: $\mathcal{F}$ , $\mathcal{G}_{1}$ , $\mathcal{G}_{2}$ .

2:while There is at least one agent in each group do

3: Compute $(\left\langle S_{g_{1}}\right\rangle,\left\langle S_{g_{2}}\right\rangle,\left\langle S_{f}\right\rangle)=\texttt{MonteCarlo}(\mathcal{F},\mathcal{G}_{1},\mathcal{G}_{2},\kappa,\rho)$ .

4: Set $\mathcal{P}=\mathcal{F}\cup\mathcal{G}_{1}\cup\mathcal{G}_{2}$ .

5: Choose two agents $i$ and $j$ at random from $\mathcal{P}$ .

6: Let $\texttt{Group}(i)$ (resp. $\texttt{Group}(j)$ ) be the group to which $i$ (resp. $j$ ) belongs.

7: Let $p_{i}$ (resp. $p_{j}$ ) be the probability that $i$ (resp. $j$ ) eats.

8: if $p_{i}>p_{j}$ then

9: Move $j$ to $\texttt{Group}(i)$

10: else if $p_{j}>p_{i}$ then

11: Move $i$ to $\texttt{Group}(j)$

12: end if

13: Remove $i$ and $j$ from $\mathcal{P}$ .

14: if $|\mathcal{P}|>1$ then

15: goto 5

16: else

17: goto 3

18: end if

19:end while

It is clear that in the dynamics simulated by this model there are three equilibria, corresponding to the cases when all agents are in $\mathcal{F}$ or $\mathcal{G}_{1}$ or $\mathcal{G}_{2}$ .
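The steps above can be sketched in Python as follows. This is a deliberately simplified illustration, not the authors' simulation: it covers only a Policy I style tax (a club member's payoff is taken to be $(1-\kappa)$ times the estimated eating probability, which is our assumption), it omits freeloading and detection (so $\rho$ plays no role), and the names `monte_carlo` and `imitation_sweep`, the number of simulated days and the number of sweeps are all hypothetical.

```python
import random


def monte_carlo(sizes, kappa=0.0, days=200, rng=None):
    """Estimate expected meal size per group over several simulated days."""
    rng = rng or random.Random(0)
    total = sum(sizes.values())
    labels = [name for name in ("F", "G1", "G2") for _ in range(sizes[name])]
    meals = {"F": 0, "G1": 0, "G2": 0}
    for _ in range(days):
        # Free agents choose independently; each club picks distinct restaurants.
        choices = [rng.randrange(total) for _ in range(sizes["F"])]
        choices += rng.sample(range(total), sizes["G1"])
        choices += rng.sample(range(total), sizes["G2"])
        visitors = {}
        for agent, restaurant in enumerate(choices):
            visitors.setdefault(restaurant, []).append(agent)
        for agents in visitors.values():
            meals[labels[rng.choice(agents)]] += 1  # one visitor is served
    payoff = {}
    for name, size in sizes.items():
        eats = meals[name] / (days * size) if size else 0.0
        payoff[name] = eats if name == "F" else (1.0 - kappa) * eats
    return payoff


def imitation_sweep(sizes, payoff, rng):
    """Steps 4 to 18: pair agents at random; the worse-off one switches group."""
    pool = [name for name, size in sizes.items() for _ in range(size)]
    rng.shuffle(pool)
    updated = dict(sizes)
    for first, second in zip(pool[::2], pool[1::2]):
        if payoff[first] > payoff[second]:
            updated[second] -= 1
            updated[first] += 1
        elif payoff[second] > payoff[first]:
            updated[first] -= 1
            updated[second] += 1
    return updated


rng = random.Random(1)
sizes = {"F": 40, "G1": 30, "G2": 30}
for sweep in range(50):
    if min(sizes.values()) == 0:  # loop condition: every group is non-empty
        break
    payoff = monte_carlo(sizes, kappa=0.05, rng=rng)
    sizes = imitation_sweep(sizes, payoff, rng)
print("group sizes after the run:", sizes)
```

Pairing agents from a shuffled list reproduces the sampling without replacement in steps 5 and 13, and the total population is conserved by construction.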

V.1 Simulation Results

For each simulation, we divide 100 agents into $\mathcal{F}$ , $\mathcal{G}_{1}$ and $\mathcal{G}_{2}$ . To construct an approximation for the basins of attraction for the three equilibrium populations, we ran 1000 replications of the simulation for every possible (discrete) starting condition on $|\mathcal{F}|$ , $|\mathcal{G}_{1}|$ and $|\mathcal{G}_{2}|$ .

Tax Policy I:

We explore the effect of varying $\kappa$ from $0.05$ to $0.15$ . To manage simulation time, we executed the while loop at most 10000 times. If all players had not joined a single community by then, we declared this a failed run, suggesting slow convergence from this initial condition. The outcome of almost all experiments resulted in a dominant group (either free agents or dining clubs) being formed. This is illustrated in Fig. 7.

Let $\beta_{1}$ and $\beta_{2}$ be the proportion of the population in dining clubs one and two, respectively, and let $\nu=1-\beta_{1}-\beta_{2}$ be the free group proportion. Then the dynamics can be projected to the two-dimensional unit simplex $\Delta_{2}$ embedded in $\mathbb{R}^{3}$ with coordinates $(\beta_{1},\beta_{2},\nu)$ . When the simulation converges, we can determine the $\omega$ -limit set of trajectories leaving (near) an initial condition $(\beta_{1}^{0},\beta_{2}^{0},\nu^{0})$ . Fig. 7 shows that the size of the tax rate $\kappa$ is correlated with the size of the basin of attraction for the free agent group. The dynamics roughly partition the simplex into three basins of attraction, with the basins of attraction for the two dining clubs exhibiting symmetry as expected. On the boundaries of these regions, we expect unstable coexistence of multiple groups would be possible. This is qualitatively similar to the unstable fixed point identified in Fig. 4.

Tax Policy II:

In a second set of experiments, we let $\rho$ vary between $0$ and $1$ and used Eq. 7 to set the tax policy. The cheating probability was fixed at $\phi=1$ . As before, we executed the while loop at most 10000 times. If all players had not joined a single community by then, we declared this a failed run, suggesting slow convergence from this initial condition. Basins of attraction for various fixed points are shown in Fig. 8.

It is interesting to notice that there are a substantial number of failed cases between the clubs. This suggests an area of slow dynamics and possibly the existence of a slow manifold. Constructing a mathematical model of this scenario is an area reserved for future work, since it is unclear exactly how the dynamics are changing in this region.

VI Conclusions and Future Directions

In this paper, we studied the Kolkata Paise Restaurant Problem (KPRP) with dining clubs. Agents in a dining club mutually agree to visit separate restaurants, thereby increasing the probability that they eat (obtain a resource). An evolutionary game model was formulated describing the choice to join the dining club. We showed that joining the dining club is an evolutionarily stable strategy, even when members are taxed (in food) and resources are distributed. When cheating was introduced to the non-dining club members, i.e. the non-dining club members could deceptively benefit from the communal food collected by the dining club, a new unstable fixed point appeared. We analysed this bifurcation as well as the decision to cheat using the resulting replicator dynamic. Numeric experiments on two dining clubs show that the behaviour in this case is similar to the case with one dining club, but may exhibit richer dynamics.

There are several directions for future research. Studying the theoretical properties of two (or more) dining clubs is clearly of interest. Adding many groups (i.e., so that the number of groups is a proportion of the number of players) might lead to unexpected phenomena. Also, allowing groups to compete for membership (by varying tax rates) might create interesting dynamics. As part of this research, investigation of the dynamics on the boundary both in theory and through numeric simulation would be of interest. A final area of future research would be to investigate the effect of taxing cheaters who are caught, thus allowing them to eat, but discouraging them from cheating. Determining the impact on the basins of attraction in this case would be the primary research objective.

Acknowledgements

A.H., A.B., and C.G. were supported in part by the National Science Foundation under grant DMS-1814876.

Bibliography

[1] B. K. Chakrabarti, M. Mitra, A.-S. Chakrabarti, The Kolkata paise hotel problem, arXiv preprint arXiv:0711.1639 (2007).
[2] S. Biswas, A. Ghosh, A. Chatterjee, T. Naskar, B. K. Chakrabarti, Continuous transition of social efficiencies in the stochastic-strategy minority game, Physical Review E 85 (3) (2012) 031104.
[3] B. K. Chakrabarti, A. Chatterjee, A. Ghosh, S. Mukherjee, B. Tamir, et al., Econophysics of the Kolkata Restaurant problem and related games, Springer, 2017.
[4] A. S. Chakrabarti, D. Ghosh, Emergence of anti-coordination through reinforcement learning in generalized minority games, Journal of Economic Interaction and Coordination 14 (2019) 225–245.
[5] D. Ghosh, A. S. Chakrabarti, Emergence of distributed coordination in the Kolkata paise restaurant problem with finite information, Physica A: Statistical Mechanics and its Applications 483 (2017) 16–24.
[6] D. Dhar, V. Sasidevan, B. K. Chakrabarti, Emergent cooperation amongst competing agents in minority games, Physica A: Statistical Mechanics and its Applications 390 (20) (2011) 3477–3485.
[7] P. Banerjee, M. Mitra, C. Mukherjee, Kolkata paise restaurant problem and the cyclically fair norm, Econophysics of Systemic Risk and Network Dynamics (2013) 201–216.
[8] S. Biswas, A. K. Mandal, Parallel minority game and it’s application in movement optimization during an epidemic, Physica A: Statistical Mechanics and its Applications 561 (2021) 125271.