Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning

Anuj Mahajan; Mikayel Samvelyan; Lei Mao; Viktor Makoviychuk; Animesh; Garg; Jean Kossaifi; Shimon Whiteson; Yuke Zhu; Animashree Anandkumar

arXiv:2106.00136·cs.LG·June 2, 2021·6 cites

Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning

Anuj Mahajan, Mikayel Samvelyan, Lei Mao, Viktor Makoviychuk, Animesh, Garg, Jean Kossaifi, Shimon Whiteson, Yuke Zhu, Animashree Anandkumar

PDF

Open Access 1 Video

TL;DR

Tesseract introduces a tensor-based approach to multi-agent reinforcement learning, effectively modeling agent interactions with low-rank tensor approximations to improve sample efficiency in large action spaces.

Contribution

The paper proposes Tesseract, a novel tensorised formulation of the Bellman equation that enables efficient representation of multi-agent value functions using low-rank tensor approximations.

Findings

01

Tesseract achieves significant sample efficiency improvements.

02

Theoretical PAC analysis supports the method's effectiveness.

03

Empirical results confirm the benefits in various domains.

Abstract

Reinforcement Learning in large action spaces is a challenging problem. Cooperative multi-agent reinforcement learning (MARL) exacerbates matters by imposing various constraints on communication and observability. In this work, we consider the fundamental hurdle affecting both value-based and policy-gradient approaches: an exponential blowup of the action space with the number of agents. For value-based methods, it poses challenges in accurately representing the optimal value function. For policy gradient methods, it makes training the critic difficult and exacerbates the problem of the lagging critic. We show that from a learning theory perspective, both problems can be addressed by accurately representing the associated action-value function with a low-complexity hypothesis class. This requires accurately modelling the agent interactions in a sample efficient way. To this end, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning· slideslive

Taxonomy

TopicsLattice Boltzmann Simulation Studies · Model Reduction and Neural Networks · Tensor decomposition and applications