Foolproof Cooperative Learning

Alexis Jacq; Julien Perolat; Matthieu Geist; Olivier Pietquin

arXiv:1906.09831·cs.GT·October 16, 2020·1 cites

Foolproof Cooperative Learning

Alexis Jacq, Julien Perolat, Matthieu Geist, Olivier Pietquin

PDF

Open Access

TL;DR

This paper introduces Foolproof Cooperative Learning (FCL), an algorithm that achieves cooperative behavior in stochastic and symmetric games, ensuring convergence to a stable equilibrium and robustness against selfish players.

Contribution

The paper extends learning equilibrium concepts to stochastic games and proposes FCL, a novel algorithm that promotes cooperation and resists exploitation in repeated symmetric games.

Findings

01

FCL converges to Tit-for-Tat behavior in symmetric games.

02

FCL is a learning equilibrium in repeated symmetric games.

03

FCL demonstrates robustness to selfish learners.

Abstract

This paper extends the notion of learning equilibrium in game theory from matrix games to stochastic games. We introduce Foolproof Cooperative Learning (FCL), an algorithm that converges to a Tit-for-Tat behavior. It allows cooperative strategies when played against itself while being not exploitable by selfish players. We prove that in repeated symmetric games, this algorithm is a learning equilibrium. We illustrate the behavior of FCL on symmetric matrix and grid games, and its robustness to selfish learners.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGame Theory and Applications · Reinforcement Learning in Robotics · Advanced Bandit Algorithms Research