Evolutionary Exploration of the Finitely Repeated Prisoners' Dilemma--The Effect of Out-of-Equilibrium Play
Kristian Lindgren, Vilhelm Verendel

TL;DR
This paper explores how evolutionary dynamics influence strategy stability in the finitely repeated Prisoners' Dilemma, revealing conditions under which cooperation persists or cycles emerge despite the Nash equilibrium predicting defection.
Contribution
It introduces an evolutionary analysis of conditional strategies, including Convincer and Follower types, showing how out-of-equilibrium play affects stability and convergence to equilibrium.
Findings
Stable fixed points align with Nash equilibrium at low mutation rates.
Cycles of strategies can occur in certain parameter regions.
Evolutionary dynamics may prevent convergence to defection, contrary to classical predictions.
Abstract
The finitely repeated Prisoners' Dilemma is a good illustration of the discrepancy between the strategic behaviour suggested by a game-theoretic analysis and the behaviour often observed among human players, where cooperation is maintained through most of the game. A game-theoretic reasoning based on backward induction eliminates strategies step by step until defection from the first round is the only remaining choice, reflecting the Nash equilibrium of the game. We investigate the Nash equilibrium solution for two different sets of strategies in an evolutionary context, using replicator-mutation dynamics. The first set consists of conditional cooperators, up to a certain round, while the second set in addition to these contains two strategy types that react differently on the first round action: The "Convincer" strategies insist with two rounds of initial cooperation, trying to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
