Reinforcement Learning Produces Dominant Strategies for the Iterated Prisoner's Dilemma

Marc Harper; Vincent Knight; Martin Jones; Georgios Koutsovoulos; Nikoleta E. Glynatsi; Owen Campbell

arXiv:1707.06307·cs.GT·February 7, 2018

TL;DR

This paper demonstrates that reinforcement learning techniques can develop highly effective strategies for the Iterated Prisoner's Dilemma, outperforming traditional strategies in tournament settings, including noisy environments.

Contribution

It introduces novel reinforcement learning-based strategies that outperform existing methods in the Iterated Prisoner's Dilemma, validated through extensive tournament results.

Findings

01. Trained strategies outperform all opponents in standard tournaments.
02. Reinforcement learning strategies excel even in noisy conditions.
03. The strategies are effective against a diverse set of over 170 opponents.
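
To make the terms "standard tournament" and "noisy tournament" concrete, here is a minimal sketch of a round-robin Iterated Prisoner's Dilemma tournament in plain Python. It is an illustration only, not the paper's code: the payoff values (T=5, R=3, P=1, S=0), the four toy strategies, and the helper names `play_match` and `round_robin` are all assumptions made for this example. Noise is modeled as an independent chance of flipping each player's action in each round.

```python
import random

# Assumed payoff matrix (T=5, R=3, P=1, S=0), keyed by (my action, their action).
PAYOFFS = {
    ("C", "C"): (3, 3),
    ("C", "D"): (0, 5),
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),
}

# Each strategy takes (own history, opponent history) and returns "C" or "D".
def tit_for_tat(own, opp):
    return opp[-1] if opp else "C"

def always_defect(own, opp):
    return "D"

def always_cooperate(own, opp):
    return "C"

def grudger(own, opp):
    return "D" if "D" in opp else "C"

def flip(action):
    return "D" if action == "C" else "C"

def play_match(s1, s2, turns=200, noise=0.0, rng=None):
    """Play one match; with probability `noise` each action is flipped."""
    if rng is None:
        rng = random.Random(0)
    h1, h2 = [], []
    score1 = score2 = 0
    for _ in range(turns):
        a1, a2 = s1(h1, h2), s2(h2, h1)
        if rng.random() < noise:
            a1 = flip(a1)
        if rng.random() < noise:
            a2 = flip(a2)
        p1, p2 = PAYOFFS[(a1, a2)]
        score1 += p1
        score2 += p2
        # Histories record the actions actually played, including flips.
        h1.append(a1)
        h2.append(a2)
    return score1, score2

def round_robin(strategies, turns=200, noise=0.0, seed=0):
    """Every strategy meets every other once; return (name, total) sorted by score."""
    rng = random.Random(seed)
    names = list(strategies)
    totals = {name: 0 for name in names}
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            sa, sb = play_match(strategies[a], strategies[b], turns, noise, rng)
            totals[a] += sa
            totals[b] += sb
    return sorted(totals.items(), key=lambda kv: kv[1], reverse=True)

players = {
    "tit_for_tat": tit_for_tat,
    "always_defect": always_defect,
    "always_cooperate": always_cooperate,
    "grudger": grudger,
}
standard = round_robin(players, turns=200, noise=0.0)
noisy = round_robin(players, turns=200, noise=0.05)
```

Rankings from a four-player toy population like this one say little about performance against a corpus of over 170 opponents; the sketch only shows the mechanics of scoring and noise.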

Abstract

We present tournament results and several powerful strategies for the Iterated Prisoner's Dilemma created using reinforcement learning techniques (evolutionary and particle swarm algorithms). These strategies are trained to perform well against a corpus of over 170 distinct opponents, including many well-known and classic strategies. All the trained strategies win standard tournaments against the total collection of other opponents. The trained strategies and one particular human-designed strategy are also the top performers in noisy tournaments.
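
As a rough illustration of what training a strategy with an evolutionary algorithm can look like, the sketch below evolves a five-entry lookup table (an opening move plus a response to each possible previous round) against a small fixed set of opponents. Everything here is an assumption made for the example: the payoff values, the lookup-table representation, the three opponents, and the names `table_player`, `fitness`, and `evolve`. The summary above does not describe the paper's strategy representations or training parameters, and the particle swarm variant is not shown.

```python
import random

# Assumed payoff matrix (T=5, R=3, P=1, S=0), keyed by (my action, their action).
PAYOFFS = {
    ("C", "C"): (3, 3),
    ("C", "D"): (0, 5),
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),
}
# Possible outcomes of the previous round as (my action, their action).
STATES = [("C", "C"), ("C", "D"), ("D", "C"), ("D", "D")]

def tit_for_tat(own, opp):
    return opp[-1] if opp else "C"

def always_defect(own, opp):
    return "D"

def always_cooperate(own, opp):
    return "C"

def table_player(genome):
    """Build a strategy from 5 genes: opening move, then one reply per state."""
    def strategy(own, opp):
        if not own:
            return genome[0]
        return genome[1 + STATES.index((own[-1], opp[-1]))]
    return strategy

def score_against(strategy, opponent, turns=50):
    """Total payoff earned by `strategy` in one noise-free match."""
    own, opp, total = [], [], 0
    for _ in range(turns):
        a, b = strategy(own, opp), opponent(opp, own)
        total += PAYOFFS[(a, b)][0]
        own.append(a)
        opp.append(b)
    return total

def fitness(genome, opponents):
    return sum(score_against(table_player(genome), o) for o in opponents)

def evolve(opponents, generations=30, pop_size=20, mutation_rate=0.1, seed=0):
    """Keep the fitter half each generation and refill with mutated copies."""
    rng = random.Random(seed)
    population = [[rng.choice("CD") for _ in range(5)] for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=lambda g: fitness(g, opponents), reverse=True)
        survivors = population[: pop_size // 2]
        children = [
            [rng.choice("CD") if rng.random() < mutation_rate else gene
             for gene in parent]
            for parent in survivors
        ]
        population = survivors + children
    return max(population, key=lambda g: fitness(g, opponents))

opponents = [tit_for_tat, always_defect, always_cooperate]
best = evolve(opponents)
```

Because the fitter half always survives unchanged, the best fitness in the population never decreases from one generation to the next. A genome such as `list("CCDCD")` reproduces Tit For Tat in this encoding, which gives a convenient baseline for comparing the evolved table.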

Code & Models

Repositories

Axelrod-Python/Axelrod
