Learning Policy Representations in Multiagent Systems

Aditya Grover; Maruan Al-Shedivat; Jayesh K. Gupta; Yura Burda,; Harrison Edwards

arXiv:1806.06464·cs.MA·August 2, 2018·43 cites

Learning Policy Representations in Multiagent Systems

Aditya Grover, Maruan Al-Shedivat, Jayesh K. Gupta, Yura Burda,, Harrison Edwards

PDF

Open Access

TL;DR

This paper introduces a general unsupervised learning framework for modeling agent behavior in multiagent systems, using minimal interaction data to learn policy representations applicable across diverse environments.

Contribution

It presents a novel representation learning approach for agent modeling that is task-agnostic and does not rely on domain-specific prior knowledge.

Findings

01

Effective in high-dimensional competitive environments

02

Successful in cooperative communication tasks

03

Improves policy optimization in reinforcement learning

Abstract

Modeling agent behavior is central to understanding the emergence of complex phenomena in multiagent systems. Prior work in agent modeling has largely been task-specific and driven by hand-engineering domain-specific prior knowledge. We propose a general learning framework for modeling agent behavior in any multiagent system using only a handful of interaction data. Our framework casts agent modeling as a representation learning problem. Consequently, we construct a novel objective inspired by imitation learning and agent identification and design an algorithm for unsupervised learning of representations of agent policies. We demonstrate empirically the utility of the proposed framework in (i) a challenging high-dimensional competitive environment for continuous control and (ii) a cooperative environment for communication, on supervised predictive tasks, unsupervised clustering, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Topic Modeling · Machine Learning and Data Classification