Multi-Agent Generative Adversarial Imitation Learning

Jiaming Song; Hongyu Ren; Dorsa Sadigh; Stefano Ermon

arXiv:1807.09936·cs.LG·July 27, 2018·94 cites

Multi-Agent Generative Adversarial Imitation Learning

Jiaming Song, Hongyu Ren, Dorsa Sadigh, Stefano Ermon

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel multi-agent imitation learning framework based on inverse reinforcement learning, along with a practical actor-critic algorithm, enabling imitation of complex multi-agent behaviors in high-dimensional environments.

Contribution

It extends imitation learning to multi-agent Markov games, addressing Nash equilibria and non-stationarity, with a new algorithm demonstrating strong empirical results.

Findings

01

Effective in high-dimensional multi-agent environments

02

Handles both cooperative and competitive scenarios

03

Shows good empirical performance

Abstract

Imitation learning algorithms can be used to learn a policy from expert demonstrations without access to a reward signal. However, most existing approaches are not applicable in multi-agent settings due to the existence of multiple (Nash) equilibria and non-stationary environments. We propose a new framework for multi-agent imitation learning for general Markov games, where we build upon a generalized notion of inverse reinforcement learning. We further introduce a practical multi-agent actor-critic algorithm with good empirical performance. Our method can be used to imitate complex behaviors in high-dimensional environments with multiple cooperative or competing agents.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ermongroup/multiagent-gail
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics