A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning

Marc Lanctot; Vinicius Zambaldi; Audrunas Gruslys; Angeliki Lazaridou,; Karl Tuyls; Julien Perolat; David Silver; Thore Graepel

arXiv:1711.00832·cs.AI·November 8, 2017·142 cites

A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning

Marc Lanctot, Vinicius Zambaldi, Audrunas Gruslys, Angeliki Lazaridou,, Karl Tuyls, Julien Perolat, David Silver, Thore Graepel

PDF

Open Access 1 Repo

TL;DR

This paper introduces a unified game-theoretic framework for multiagent reinforcement learning that improves policy generalization and scalability, demonstrated in gridworld and poker environments.

Contribution

It proposes a new algorithm combining deep RL, empirical game theory, and meta-strategies, unifying and extending previous MARL methods.

Findings

01

Policies overfit to training partners, reducing generalization.

02

The new algorithm outperforms previous methods in complex environments.

03

Scalable implementation reduces memory requirements.

Abstract

To achieve general intelligence, agents must learn how to interact with others in a shared environment: this is the challenge of multiagent reinforcement learning (MARL). The simplest form is independent reinforcement learning (InRL), where each agent treats its experience as part of its (non-stationary) environment. In this paper, we first observe that policies learned using InRL can overfit to the other agents' policies during training, failing to sufficiently generalize during execution. We introduce a new metric, joint-policy correlation, to quantify this effect. We describe an algorithm for general MARL, based on approximate best responses to mixtures of policies generated using deep reinforcement learning, and empirical game-theoretic analysis to compute meta-strategies for policy selection. The algorithm generalizes previous ones such as InRL, iterated best response, double…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

deepmind/open_spiel
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Evolutionary Algorithms and Applications