Demographic Prisoner's Dilemma : a probabilistic framework
Sylvain Gibaud

TL;DR
This paper introduces a probabilistic framework for the Demographic Prisoner's Dilemma, analyzing conditions for cooperation survival or extinction, and establishes mean field models with eternal coexistence under certain payoff conditions.
Contribution
It develops a novel probabilistic model for the Demographic Prisoner's Dilemma and proves conditions for cooperation extinction or eternal survival, including mean field approximations.
Findings
Cooperators die out if Sucker payoff exceeds Reward significantly.
Cooperators and defectors can coexist indefinitely under certain payoff conditions.
Mean Field model shows potential for eternal cooperation under weaker assumptions.
Abstract
We put a probabilistic framework on the Demographic Prisoner's Dilemma. In this model, cooperating and defecting individuals are placed on a torus to move and play prisoner's dilemma game, if they are on the same site. Each individual accumulates its payoff into a quantity called wealth. If an individual becomes wealthy enough, it can have an offspring. If its wealth becomes negative, it disappears. In this framework we prove that if if the Sucker payoff is far greater than the Reward then for all initial state almost surely all cooperators will die. Moreover if the Temptation payoff (resp. Reward) are far greater than the Punition (resp. Sucker payoff) then for all initial state with positive probability cooperators and defectors live \emph{ad vitam eternam}. We also set a Mean Field model on the demographic prisoner's dilemma and prove on a linearized version of the Mean Field model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 epidemiological studies
Dynamical Prisoner’s Dilemma : A probabilistic framework
Sylvain Gibaud
Abstract
We put a probabilistic framework on the Demographic Prisoner’s Dilemma. In this model, cooperating and defecting individuals are placed on a torus to move and play prisoner’s dilemma game, if they are on the same site. Each individual accumulates its payoff into a quantity called wealth. If an individual becomes wealthy enough, it can have an offspring. If its wealth becomes negative, it disappears.
In this framework we prove that if if the Sucker payoff is far greater than the Reward then for all initial state almost surely all cooperators will die. Moreover if the Temptation payoff (resp. Reward) are far greater than the Punition (resp. Sucker payoff) then for all initial state with positive probability cooperators and defectors live ad vitam eternam. We also set a Mean Field model on the demographic prisoner’s dilemma and prove on a linearized version of the Mean Field model that with weaker assumptions with positive probability Cooperators live ad vitam eternam.
Contents
1 Introduction
Playing particle systems as a modeling tool in the evolution literature have been introduced by [Smith and Price, 1973]. Since, many ecological models have used playing particle system’s approach (see a list of examples in the book of [Hartl and Clark, 1997]). Yet a lot of the literature on the evolutionary games have been focused on random matching over finites games models [Boylan, 1992, Ellison, 1994, Weibull, 1997]. Then motivated by biological applications, spatiality have been introduced in models [Nowak and May, 1993, Szabó and Tőke, 1998, Hauert, 2002]. In those previous models, individuals cannot move and evolution is managed by a Wright-Fisher generation system (i.e. at each step of the evolution one whole generation gives birth to another generation and dies).
[Epstein, 1998] introduced a model where individuals can move and are not synchronized (they die at different times and gives birth at different times). This model is called Demographic Prisoner’s dilemma. Before explaining the model and a bit of its probabilistic framework, let us give an intuition inspired by the article of [Turner and Chao, 1999].
You put viruses on a torus. Each virus has embedded in his RNA one of the two following behaviors: either she manufactures diffusible (shared) intracellular product, either she sequesters it and take advantage of the virus producing the product. Each virus can move on a fixed torus. They also have energy such that when they have intracellular product they get energy and when they manufacture it or sequester nothing they lose energy. A virus without energy dies. A virus with a lot of energy can split and bring a child (with its RNA) in the torus.
Notations**.**
text
We denote by the set of non negative integers and the set of positive integers.
text
The model is:
Let be a fixed torus with . It is the space where individuals move. 2. 2.
We make the assumption that the torus cannot bear an infinite number of individuals. Let us call the maximum number of individuals in the torus. 3. 3.
The individuals move according to continuous time independent symmetric simple random walks. At random times (of rate ) an individual moves from his position to one of its nearest neighbors with equal probability, as shown in the following example of evolution.
Each individual has a wealth. If the wealth becomes non positive, the individual dies (and then stop playing with the other individuals). 5. 5.
The wealth changes through games. The game is the prisoner’s dilemma:
- •
The players have two actions to Cooperate or to Defect.
- •
If both Cooperate they get a Reward . But if one of the two defects, the Defector gets a Temptation payoff , and this payoff is bigger than the Reward. If a Cooperator is being Defected instead of getting a reward he gets a Sucker payoff . If both players Defect (let us call them Player 1 and Player 2) the nature flips a coin, if it is head Player 1 gets a Punishment payoff of and Player 2 gets a payoff of 0, if it is tail Player 2 gets a Punishment payoff of and Player 1 gets a payoff of 0.
- •
Then the payoff satisfies and . This is summarized in the following payoff matrix (action Top and action Left are Cooperate, action Bottom and action Right are Defect).
[TABLE]
where is a random variable with distribution
- •
Playing Cooperate is strictly dominated by playing Defect.
- •
To determine the action played by the individuals we place ourself in the framework where: "Each individual plays only one action, either he will Defect every time either he will Cooperate every time". Then a particle has a fixed action (Cooperate or Defect). There are two kinds of particles: the ones who always Cooperate and the ones who always Defect.
- •
At the end of each game the wealth are updated, adding the respective payoff of the game played.
- •
To make the games happen, each couple of individuals is given a Poisson process independent of everything of parameter . When this Poisson process realizes, if the individuals are on the same site and if their wealths are positive the individuals play together. Otherwise nothing happens. 6. 6.
Each individual is given a Poisson process of parameter . When this birth Poisson process realizes if the individual’s wealth is more than a given threshold and if there is less than individuals on the torus then the individual gives birth to an offspring. The offspring has the same strategy as its unique parent (it cooperates if its parent cooperates, defects if its parent defects). Moreover the fixed birth wealth of the offspring is given by its parent. That is after the birth (then the parent lose wealth, and the child begins its life with a wealth equal to ).
[Epstein, 1998] introduced the first Demographic Prisoner’s dilemma model, and explore it doing simulations. The demographic prisoner’s dilemma has been a popular model [Axelrod, 2000, Ifti et al., 2004, Ohtsuki and Nowak, 2006] in theoretical biology. Also since he used multi-agent as a way of modeling evolutionary game theory, Epstein made an useful link between computer sciences and biology as presented by [Tumer and Wolpert, 2004].
Yet in conclusion of his article Epstein said: "The only claims that can be advanced definitely are that this specific complex of assumptions is sufficient to generate cooperative persistence on the timescale explored in the research. […] Obviously it would be worthwhile to […] if possible, assess their generality mathematically."
After this article, [Dorofeenko and Shorish, 2002], assuming statistical independence, proved the convergence of the Epstein model (when the payoff and the step of the grid tend to 0) to a reaction-diffusion process. They also studied via numerical results this reaction diffusion process. [Namekata and Namekata, 2011] add reluctant players (Tit for Tat players where the first action is Defect) in the model. They studied the extended model using simulations.
The main contributions of this article is generalizing Epstein model mathematically that is:
- •
Putting a probabilistic framework on the Epstein model without doing too strong assumptions (slightly modifying the payoff matrix, making individuals moving following independent simple random walks instead of exclusion process).
- •
Proving asymptotic (on time) results on the survival of the cooperators.
- •
Introducing a Mean Field model (or well mixed)
- •
Proving that we can insure a certain level of cooperation without too strong assumption on the payoff matrix.
Let us introducing the first results.
Notations**.**
- •
We call configuration the data of the positions, the behaviors, and the wealth of every particles. Then a configuration is an element of where indicates the behavior of a particle.
- •
Since Markov processes are defined by a collection of probability distribution on the space of right continuous, left limited functions from to (this space is denoted ) indexed by . We denote the distributions of the previous model ( represents the initial state).
Theorem**.**
There exists a constant depending only on and such that if:
[TABLE]
*then for each initial configuration with at least one defector with positive wealth:
[TABLE]
text
Theorem**.**
There exists a constant depending only on and such that if:
[TABLE]
*then for each initial configuration with at least two cooperators and one defector with positive wealth:
[TABLE]
Those are qualitative results. The first theorem says that when the Sucker payoff is far bigger than the Reward payoff, then with any initial configuration (except the trivial ones) the cooperators die almost surely. The second theorem says that if the Reward is far bigger than the sum of the Sucker payoff and the birth wealth then from every non trivial configuration with positive probability the cooperators will never die.
The two theorems talk of the red and blue area (or the areas pointed by the arrows) on the following graph. In order to understand better the behavior of this system, we draw some simulations with the following data:
- •
The size of the torus is .
- •
There are initially 10 cooperators and 10 defectors.
- •
Initially all individuals have a wealth of .
- •
The rate of the game Poisson processes is the same than the rate of the moving Poisson processes .
- •
The Temptation payoff is and the Punishment payoff is .
- •
The stopping time of the simulation is 10,000 moves, games or birth Poisson process realizations. (Usually in the previous simulations this density becomes nearly constant after 3,000 realizations of Poisson processes).
- •
The critical wealth (wealth that allows to give birth) is: .
- •
The birth wealth for a child is: .
- •
The maximum number of particles .
*Method
*We make a batch of 100 simulations of the system for each couple of payoff (). At the end of each simulation we measure the number of cooperators with a positive wealth. After drawing these simulations we take the average of this measure over all these 100 simulations. Finally we make the figure saying that the color red corresponds to a small number of cooperators with a positive wealth at the end of the simulation, and the color blue corresponds to a high number of cooperators with a positive wealth at the end of the simulation (dark red correspond to 0 cooperator with a positive wealth at the end of the simulation).
In order to understand better the area in green and orange (intermediate area between the two pointed areas) we will consider the following Mean Field model:
- •
particles stop giving birth.
- •
there is no spatial condition.
- •
instead of considering a linear interaction with many particles, we consider one defector and one cooperator with non linear interaction (the evolution depends also on the distribution of the process).
- •
all particles begin with the same distribution of wealth.
The Mean Field system is the Markov process . We will look at the distribution of . The intuition of is: at time , of the population of cooperators has a wealth . We define by: (with fixed) and an initial measure
[TABLE]
where:
and are sequences of Poisson times of intensity (*i.e. * for example with a Poisson process of parameter for all is defined by ).
- -
for , all and and are random variables independent from everything such that:
[TABLE]
and
[TABLE]
The intuition behind these formulas is that we update (for example ) following a Poisson process of intensity . If we update it doing:
[TABLE]
An important remark is that the law of the wealth of any cooperator, resp any defector, (as a stochastic process) in the spatial model converges in finite dimensional distribution to , resp when and then when .
This distribution has three principles:
- •
A drift (positive or negative)
- •
A diffusion
- •
An absorbing bound on 0 causing non linearity.
Hence this process is non linear. We consider a simpler process instead : a linearized process .
[TABLE]
avec et two independent sequences of i.i.d. random variables such that
[TABLE]
et
[TABLE]
On this linearized system we have the following results:
Theorem 1.1**.**
Let us suppose that:
[TABLE]
Then we have: for any fixed
[TABLE]
[TABLE]
We also have the following proposition about the concentration of wealth.
Proposition**.**
For all
We denote :
[TABLE]
Then we have :
[TABLE]
Moreover for all such that:
[TABLE]
then
[TABLE]
This article is in two parts. In the first one, we introduce the spatial model with his probabilistic framework. We also state and give the sketch of the proofs of the two qualitative theorems. In the second part, we give some reminders on Continuous Time Markov processes (in particular the theory of infinitesimal generators). We do that in order to describe and justify the Mean Field model. Then we state Mean field theorem. The proofs of the theorems are in the appendix.
2 Spatial Model
Remark*.*
We call particle, player or individual, the agent defined in the following.
2.1 Model
Let us give a probabilistic framework for the Demographic prisoner’s dilemma of [Epstein, 1998]. Let be a probability space.
Let . Let . 2. 2.
For a particle , let us denote its position at time . Then is a continuous simple symmetric random walk on of rate . 3. 3.
We denote the wealth of particle at time by . 4. 4.
The action played by is coded in a process . Particle plays Cooperate if and plays Defect if . If the particle does not play. 5. 5.
The wealth changes through games.
- •
To make the games happen, each couple of individuals is given a Poisson process independent of everything of parameter . When this Poisson process realizes (for example at a time ), if the individuals are on the same site (i.e. if ), if their wealths are positive (i.e. if and ) and if are both not equal to -1 the individuals play together. Otherwise nothing happens. Then if an individual has a negative wealth, he can’t play with the other individuals.
- •
A point to notice is that two players cannot lose wealth simultaneously. As a consequence, two individuals cannot kill each other.
- •
At the end of each game (between for example player and ) and are updated, adding the payoff of the game played by and . 6. 6.
For when , and when the birth Poisson process of realizes, he gives birth to an offspring. The index of the offspring is drawn uniformly randomly from , and then , , are updated doing , and .
We have that is a Continuous Time Markov Chain, we denote by its filtration such that is adapted to .
Definition 2.1**.**
We call set of configuration . A configuration is an element usually denoted of . We denote the non trivial configurations by:
- •
the set of configurations with at least 2 cooperators and 1 defector with positive wealths.
- •
the set of configurations with at least 1 cooperator and 1 defector with positive wealths.
A particle system is a Continuous Time Markov Chain with state space: the space of configuration. It is usually denoted .
Before stating the theorems, let us remind the data of the model.
- •
the size of the torus, state space of the position.
- •
is the maximum number of players on the torus.
- •
We fix the payoff initially and . The payoff matrix is
[TABLE]
where is a random variable with distribution
- •
The position is denoted by , the wealth by and the strategy by . If the particle is a cooperator, if the particle is defector. If the particle cannot play (because she is not born yet).
- •
If , the particle stop playing. If the particle can give birth to a particle with initial wealth .
- •
is the rate of the Poisson process making a particle move. is rate of the Poisson process making a couple of particles plays together. is the rate of the Poisson process making the player trying to have an offspring.
2.2 Almost sure extinction
In this subsection we will show a first qualitative theoretic result: when the sucker payoff is far greater than the reward payoff then almost surely for any initial configuration (with at least one defector) cooperators will die.
Theorem 2.2**.**
There exists a constant depending only on and such that if:
[TABLE]
*then for each initial configuration with at least one defector :
[TABLE]
{id'ee}
The aim of the proof is showing that the total wealth of cooperators goes to 0 almost surely. The sketch of the proof is the following:
The only way the total cooperator wealth decreases is via games with defectors (giving birth does not change the total wealth of cooperators). Since is finite, going from a configuration , the first game between a defector and a cooperator arrives at a finite random time . At this time the variation of total wealth is less than where is the Poisson process counting the games. 2. 2.
Then to apply a Law of Large Number argument we want a condition such that:
. 3. 3.
To find this condition we have to upper bound uniformly in the configurations the first time for a defector to play with a cooperator. This will be one of the main difficulty of the proof.
- (a)
We remark that there exists a minimum number of realizations of Poisson processes such that for each non trivial configuration there is a Cooperator-Defector game in less than realizations. 2. (b)
Since the evolution is managed by Poisson processes, for every configuration there is a Cooperator-Defector game in less than realizations of Poisson processes with probability . 3. (c)
Since is the probability of a particle moving and a game happening, does not depend on the wealth of the individuals. Since if we don’t look at the wealth of the individuals there is a finite number of configurations, we take the minimum over these configurations. This part is really detailed in the proof. 4. (d)
The probability that a Cooperator-Defector game happens after realizations of Poisson processes is bugger than . 4. 4.
We finish the proof using Borel Cantelli’s Lemma.
2.3 Coexistence ad vitam eternam
In this subsection we will show a second qualitative result that is: when the reward is far greater than the sucker payoff and the initial offspring wealth then with positive probability the cooperators live until the end of times.
Theorem 2.3**.**
There exists a constant depending only on ,,, and such that if:
[TABLE]
*then for each configuration with at least two cooperators and one defector:
[TABLE]
{id'ee}
The main ideas are to consider a Ghost system and then to extend the result of the Ghost system using Burkholder-Davis-Gundy inequalities and Borel-Cantelli Lemma. We call the system introduced in the first section True system.
In the Ghost system, everybody plays even with negative wealth. Also whenever something in the system happens all individual’s wealth decrease of (except if an individual gives birth, this particular individual does not lose additional wealth). If in the Ghost system we can prove that every individual always have positive wealth then this Ghost system is actually the True system. 2. 2.
Using a Burkholder-Davis-Gundy argument with Borel Cantelli’s Lemma, we prove on the Ghost system that the minimum wealth over all individuals a.s. when . 3. 3.
We just have to consider an event (of positive probability) such that no individual has non positive wealth until they accumulated enough wealth to make the coupling between the Ghost system and the True system.
Theorem 2.2 and Thm 2.3 are qualitative results. In order to have quantitative results we consider in the following a Mean Field model. In simulation we notice that birth are really useful for maintaining the cooperation (see for example Run 3 and Run 4 of Eptein [Epstein, 1998]), in order to guarantee some result on the survival of cooperators in the Mean Field model we won’t consider births.
3 Mean Field model
3.1 Reminders on Markov process and infinitesimal generators
Let us firstly give some reminders about Markov processes.
One good way to study Markov processes is via their distribution. For a complete separable state space , a process has value in the canonical space.
Definition 3.1** (Canonical space).**
The canonical space is the space of right continuous functions from to with left limits, endowed with the Skohorod topology associated to its usual metric. With this metric, is complete and separable (see the book of Ethier Kurtz [Ethier and Kurtz, 1986] for more details).
We denote by the natural filtration associated to . We define the canonical process by:
[TABLE]
To define a Markov process in a discrete countable state space (which will be the case here), we need an initial measure on and a matrix called rate matrix with . For gives the rates of transition of the future process from to . With this matrix we define the generator (which is one of the main tool in the study of Markov processes) of the following Markov process such that: for each bounded and measurable (for the Borel sets of ) from to (this set is denoted ), and for all :
[TABLE]
The generator is a bounded linear operator on the bounded functions from to . We note in the same way a rate matrix and the associated Markov generator (with the space of measurable functions of ). We will always denoting the matrix with two entries (for example ) and the generator with no entry (for example ).
Conversely giving a generator we can construct the rate matrix looking at the rates (terms that will be before the ) in the expression of the generator.
Definition 3.2**.**
Let be a rate matrix and an initial measure we can construct a Markov process as follow:
Let be a Markov chain in with initial distribution and with transition matrix . We allow by taking for one and for all . We put only to have well defined, doesn’t have any interest to as we will see in the following. 2. 2.
Let be independent and exponentially distributed with parameter 1 (and independent of ) random variables. 3. 3.
We define the Markov process in with initial distribution and generator by:
[TABLE]
Note that we allow taking .
Since it will be constant in the evolution, we denote the (initial) number of particles. Let us firstly introduce the generator of the spatial system (but without birth). Let us begin by some notations
**Notations
**Let be a probability space. Let be the canonical basis of (with ). Let be the canonical basis of .
We denote . We identify to , then we can write with .
Let us denote the set of continuous bounded functions from to .
We denote by all the individuals couples which can play together.
The infinitesimal generator with domain of is: for all and for all
[TABLE]
The part is the generator representing the motion of the individuals. The individuals move following independent random walks, then is defined by: for all :
[TABLE]
The generator is the generator representing the evolution of the wealth of individuals through games. Let us give firstly then the explication of how it works. We have for each function :
[TABLE]
Let us explain a bit this generator:
- •
represents the spatial structure meaning that a game makes the wealth change only if the two individuals are on the same site.
- •
checks if both individuals have a positive wealth (i.e. there wealth are positive). To have the generator of the ghost system, the only change is: replacing this indicator function by 1.
- •
the big bracket represents the change of wealth through the game of the two individuals. If the individual is a cooperator, if the individual is a defector. The term looks at the strategies of the players and choose the right payoff to give to the individuals. For example if individual 1 and 2 are playing (both have a positive wealth and on the same site) if individual 1 cooperates and 2 defects then individual 1 gets a payoff of and individual 2 gets , the term in the generator representing this type of transition is:
[TABLE]
Notations**.**
The distribution of the previous defined by the generator (7) is: .
3.2 Mean Field Model
3.2.1 Definition and intuition
The Mean Field process is a non linear Markov process. That means that the temporal marginals of the distribution of the process is described by a generator also depending on these time marginals . For that purpose we call this kind of generator non-linear generator. The change between a non linear Markov process and a homogeneous Markov process is: when determining the transitions in an homogeneous Markov process you have to look where the process is, while in the non linear case to determine the transition you look both where the process is and where it could be.
The Mean Field system is the non linear Markov process : for fixed (representing the initial density of cooperators and defectors) defined by: an initial probability measure of , representing the initial distribution of wealth of all individuals and an increment relation:
[TABLE]
where:
and are sequences of Poisson times of intensity .
- -
for , all and and are random variables independent from everything such that:
[TABLE]
and
[TABLE]
represents the wealth of a typical cooperator and a typical defector in an infinite population of individuals moving at a high speed.
3.2.2 Justification of the mean field model
To arrive to the Mean Field model from the spatial model we have to assume that:
- •
There is no birth anymore. As a consequence is constant over time. We also set the number of particles in the system.
- •
The collection are i.i.d with distribution (with previously fixed).
- •
The moving rate belongs to instead of . This is done in order to have an homogenization result.
- •
The size of the torus depends on the initial number of particles. We set: . (We can also not link the size of the torus and the initial number of particles but in that case we have to slow the time of evolution replacing by ).
- •
Initially the distribution of wealth of all individuals is probability measure of .
The generator of the spatial system is with domain : for all
[TABLE]
with defined in (8) and in (15).
Using Theorem 4.2 of [Gibaud, 2016], when goes to , we have that there exists probability measure of such that the particle system described by the generator (7) is -chaotic. The analytic way of describing is done in the following. The useful consequence of that is the following convergence in finite dimensional distribution: with random variable taking value in and with distribution
[TABLE]
Let us describe analytically . In order to make the notations lighter we use the following:
- •
,
- •
.
To have a good description of the distribution of we use evolution equations given in Corollary 4.3 of [Gibaud, 2016]. The distribution of verifies the following evolution equations: with and :
- •
for all
[TABLE]
- •
for all
[TABLE]
We want to prove that is a good description of . Firstly let us denote and . Let’s verify that and verifies the ordinary differential equation (22),(27). Because for all , we have:
[TABLE]
is equal to:
[TABLE]
Dividing by and making we get the evolution equations (22) and (27) by saying that for all and .
Since this process is non linear, we analyze a linearized version . The linearized system is a system where cooperators and defectors can play even if their wealth are negative. It is a Mean field ghost version. The wealth of cooperators in the linearized is defined by an initial measure and an increment relation:
[TABLE]
where:
are defined in (16).
- -
for all and are random variables independent from everything such that:
[TABLE]
The consequence of this linearization is the following theorem that gives us a simple condition to have "survival" of cooperators ad vitam eternam with positive probability.
Theorem 3.3**.**
Let us suppose that:
[TABLE]
Then we have: for a fixed when almost surely
[TABLE]
and
[TABLE]
We also have a result about the concentration of wealth of cooperators.
Proposition 3.4**.**
For all
We denote :
[TABLE]
Then we have :
[TABLE]
Moreover for all such that:
[TABLE]
then
[TABLE]
4 Proofs on the Spatial Model
Before beginning the proofs let us introduce some useful tools. In Section 2.1, every movement, game, birth is managed by independent Poisson processes with rates , or . Instead of considering this collection of Poisson processes we consider:
A unique Poisson process with rate i.e. the sum of the rates of the previous Poisson processes. This Poisson process gives a sequence of Poisson times 2. 2.
A sequence (independent of everything) of independent random variables selects if a movement, game or birth happens. An element of this sequence choose a movement with probability , a game with probability and a birth with probability . 3. 3.
3 sequences (independent of everything) of i.i.d random variables uniformly choosing the agents. and are made of uniform on random variables. is made of uniform on random variables. 4. 4.
At the -th realization of the unique Poisson process, chooses the type of evolution (movement, game, or birth). If movement (resp birth) then (resp ) gives the particle that will move (resp will give birth (if she is wealthy enough and if there are less than particles on the torus)). If the particle moves, we draw an other uniform on random variable to decide where the particle moves. If game then gives the couple of particles that will play (if their , if they are on the same site and if their wealths are positive).
In this section, instead of considering continuous time Markov process we consider the induced Markov chain where .
Definition 4.1**.**
We call the photograph of a configuration , the data of the positions of individuals and their strategies. The photograph of can be seen as the equivalent class of for the following equivalence relation: if and only if
[TABLE]
In words, two configurations are equivalent if and only if on each position there is the same number of cooperators with positive wealth and defectors with positive wealth.
We denote by the canonical projection on the space of photographs.
Remark*.*
There are less than photographs. We call the number of photographs. Those photographs are denoted .
Notations**.**
For simplicity, we will denote the wealth of cooperator (with ), and the wealth of defector (with ), .
For a configuration we denote (resp. ) the set of the indexes of the cooperators (resp. defectors) with positive wealth.
4.1 Almost sure extinction
In this section we prove Theorem Theorem.
Proof.
text
Firstly let us define the first cooperator-defector time. For that let us denote for all , the sum of wealth of cooperators.
Definition 4.2**.**
We define the first time when a cooperator-defector game happens that is:
[TABLE]
Let us denote the sequence of these cumulated stopping time with . If there is no cooperator left after we define for all a.s. . That is the time of the first Cooperator-Defector game is , the time of the second one is et so on.
Firstly let us prove the uniform (on the configurations) upper bounding lemma.
Lemma 4.3**.**
There is and such that for each configuration and all we have:
[TABLE]
Proof.
Firstly let us notice, if there is at least one defector initially, defectors cannot be extinct. Indeed the only way that a defector dies is via Defector-Defector game yet since the payoff of such a game is a random variable with distribution then, when they play together two defectors cannot die at the same time. As a consequence there will be at least one defector left at any time. Moreover since is finite we have for each a.s.
Let be the sequence of events . Let the maximum number of realizations of Poisson process for a defector to play with a cooperator with positive probability in all configurations i.e.
[TABLE]
Because there is a finite number of photographs the existence of is insured.
Then there is such that for each configuration:
[TABLE]
For each configuration and all we have:
[TABLE]
Moreover by the Strong Markov property, we have:
[TABLE]
∎
Now that the lemma is proved let us use it to prove the theorem.
[TABLE]
The hypothesis of the theorem is:
[TABLE]
We have: for all
[TABLE]
Let we get:
[TABLE]
But and then we have:
[TABLE]
Applying Borel Cantelli’s Lemma we have: almost surely eventually all cooperators will be dead.
∎
4.2 Coexistence ad vitam eternam
Before proving Theorem 2.3, let us prove an almost sure divergence lemma.
Lemma 4.4**.**
Let a Markov chain on such that:
- •
* random variable in such that: , , and and such that: *
[TABLE]
- •
* random variable from to with positive expectation such that:*
[TABLE]
Then we have: for a fixed almost surely: ,
[TABLE]
Corollary 4.5**.**
Let a Markov chain of a space with countable dimension. Let be one of its coordinates. Let us suppose
- •
* random variable in such that: , , and and such that: *
[TABLE]
- •
* random variable from to with positive expectation such that:*
[TABLE]
Then we have: for a fixed almost surely: ,
[TABLE]
The proof of the corollary is the same as the proof of the lemma except that the initial condition is for example with and the Strong Markov property are done on . Let us begin the proof of the lemma.
Proof.
To prove that almost surely we firstly prove that is a sub-martingale and use Doob decomposition, then we upper bound the expectation of the quadratic variation of the martingale part of in order to finally use Borel Cantelli’s Lemma with Burkholder-Davis-Gundy inequality.
Let us prove that is a sub-martingale. Let .
[TABLE]
Then using a Doob decomposition we get: for all
[TABLE]
with and a martingale. 2. 2.
Now let us upper bound for all and all ,
- •
Let us prove that such that . First let us rewrite in function of the variation of .
[TABLE]
We get using Strong Markov property:
[TABLE]
Then we get:
[TABLE]
Then using Strong Markov property we have:
[TABLE]
- •
Let us prove that such that and
Using the previous arguments we have: such that for all :
[TABLE]
We have:
[TABLE]
Using Strong Markov property we have for all :
[TABLE]
Finally we have:
[TABLE] 3. 3.
Firstly let us notice that
To finish the divergence of we need Burkhholder-Davis-Gundy inequality [Beiglböck et al., 2015] which says that for there exists such that for every and every martingale we have:
[TABLE]
Then we have:
[TABLE]
which is summable.
Applying Borel Cantelli’s lemma we have almost surely: ,
[TABLE]
∎
Let us now prove Theorem 2.3.
Proof.
The configuration is fixed for the sequence of the proof.
Definition 4.6**.**
For all configuration , for a cooperator we denote the first time where there is a game between and a cooperator.
[TABLE]
Let us denote the sequence of these cumulated stopping times (by the same way we define the cumulative stopping times in the previous proof) with for all cooperators.
The issue here is that we can have extinction of the cooperators. Hence we don’t have a.s.
Hence we have to consider an other system and couple them.
We call the system introduced in Section 2.1 the True System. Let be the wealth of cooperator at time in the True System. 2. 2.
The other system is called the Ghost system. In this system we have that:
- •
cooperators and defectors can play even if they have negative wealth,
- •
cooperators cannot give birth,
- •
at each step of time, there is a decrease of of cooperator wealths,
- •
when defectors give birth they don’t loose wealth (this part is not necessary to make the proof but make the proof easier).
We denote the probability measure associated to the Ghost system . For example probability that the wealth of cooperator is 3 in the Ghost system is denoted .
It is the following system that we couple to . Since the cooperators won’t die in the Ghost system we have a.s. also we have the following upper bounding lemma. The proof is the same as Lemma 4.3, the only modification is to change by
Lemma 4.7**.**
There is and such that for each configuration with at least two cooperators and all we have:
[TABLE]
We denote the expectation using the probability measure . We look at the wealth of one individual then the birth it gives, make its wealth decrease, then the upper bounding of the variation of wealth has to take it into account. Using the same argument as those in the previous theorem, we prove that: such that: with at least two cooperators
[TABLE]
Also for all we have: (because )
[TABLE]
Then using Lemma 4.7 we can apply Corollary 4.5 to get: for a fixed almost surely:
[TABLE]
Then intersecting those events we have for a fixed , almost surely:
[TABLE]
Now let us prove that cannot go under 0 infinitely often. Because we have and because we have for all :
[TABLE]
Hence we get: for all
[TABLE]
which is summable using Lemma 4.7. Then we have with (33)
[TABLE]
We do the same reasoning with the defectors. We have and for all there exists such that there is and such that for each configuration with at least one cooperator and one defector and all we have:
[TABLE]
Then we get for a fixed , almost surely:
[TABLE]
and also we have:
[TABLE]
Let , using (33), (34), (35) and (36) we have for all . A a consequence for fixed, such that:
[TABLE]
Let us denote the set of configuration such that for all and the wealth of every individual with positive wealth in is increased by at least in . Then if in an individual has a wealth of , in configurations of he has a wealth of . As a consequence we have and .
Since in the Ghost system, the evolution of wealth only depends on the photographs we have: for all
[TABLE]
Let such that:
[TABLE]
We denote . We have using Markov property: is equal to:
[TABLE]
Since in cooperators always have positive wealth, on this event the Ghost system is just the True system and we get:
[TABLE]
∎
5 Proof on the Mean Field system
5.1 Proof of Theorem 3.3
In this part we will prove Theorem 3.3
Let us begin the Proof.
Using the Law of Large Numbers we have for all , a.s. for all such that
[TABLE]
Then if we have a.s. and since the increasing are bounded, as a consequence for all there exists such that:
[TABLE]
Since is a Levy process i.e. since the increments of don’t depend on where is, we have:
[TABLE]
Let be such that
[TABLE]
Hence we get using Markov property:
[TABLE]
∎
5.2 Proof of Proposition 3.4
Firstly let us notice that only take values in . Then we get:
[TABLE]
Let us compute for all the variance of .
[TABLE]
Then we have:
[TABLE]
Then using the Chebyshev inequality we have the following concentration inequality: for all
[TABLE]
We now take for all , and obtain (30).
Moreover we notice that
[TABLE]
The minimum of is reached in
[TABLE]
This minimum is Hence for all such that:
[TABLE]
We have:
[TABLE]
Acknowledgment
I would like to thank my advisors for the review Laurent Miclo and Jérôme Renault. Thanks to Xavier Bressaud for initiating me into the world of Research and to giving me this topic.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[Axelrod, 2000] Axelrod, R. M. (2000). On six advances in cooperation theory. Analyse & Kritik , 22(1):130–151.
- 2[Beiglböck et al., 2015] Beiglböck, M., Siorpaes, P., et al. (2015). Pathwise versions of the burkholder–davis–gundy inequality. Bernoulli , 21(1):360–373.
- 3[Boylan, 1992] Boylan, R. T. (1992). Laws of large numbers for dynamical systems with randomly matched individuals. Journal of Economic Theory , 57(2):473 – 504.
- 4[Dorofeenko and Shorish, 2002] Dorofeenko, V. and Shorish, J. (2002). Dynamical modeling of the demographic prisoner’s dilemma. Technical report, Reihe Ökonomie/Economics Series, Institut für Höhere Studien (IHS).
- 5[Ellison, 1994] Ellison, G. (1994). Cooperation in the prisoner’s dilemma with anonymous random matching. The Review of Economic Studies , 61(3):567–588.
- 6[Epstein, 1998] Epstein, J. M. (1998). Zones of cooperation in demographic prisoner’s dilemma. Complexity , 4(2):36–48.
- 7[Ethier and Kurtz, 1986] Ethier, S. N. and Kurtz, T. G. (1986). Markov processes – characterization and convergence . Wiley Series in Probability and Mathematical Statistics: Probability and Mathematical Statistics. John Wiley & Sons Inc., New York.
- 8[Gibaud, 2016] Gibaud, S. (2016). Spatialized evolutionary prisoner’s dilemma: Homogenization and propagation of chaos. ar Xiv preprint ar Xiv:1609.05036 .
