Information-Bottleneck-Based Behavior Representation Learning for   Multi-agent Reinforcement learning

Yue Jin; Shuangqing Wei; Jian Yuan; Xudong Zhang

arXiv:2109.14188·cs.LG·September 30, 2021

Information-Bottleneck-Based Behavior Representation Learning for Multi-agent Reinforcement learning

Yue Jin, Shuangqing Wei, Jian Yuan, Xudong Zhang

PDF

Open Access

TL;DR

This paper introduces IBORM, a method that uses the information bottleneck principle to learn compact, informative behavior representations in multi-agent reinforcement learning, leading to faster convergence and improved performance.

Contribution

It proposes a novel explicit behavior representation learning approach based on the information bottleneck, enhancing interpretability and efficiency in multi-agent RL.

Findings

01

IBORM achieves the fastest convergence among compared methods.

02

It results in the best policy performance.

03

The approach effectively balances information compression and utility.

Abstract

In multi-agent deep reinforcement learning, extracting sufficient and compact information of other agents is critical to attain efficient convergence and scalability of an algorithm. In canonical frameworks, distilling of such information is often done in an implicit and uninterpretable manner, or explicitly with cost functions not able to reflect the relationship between information compression and utility in representation. In this paper, we present Information-Bottleneck-based Other agents' behavior Representation learning for Multi-agent reinforcement learning (IBORM) to explicitly seek low-dimensional mapping encoder through which a compact and informative representation relevant to other agents' behaviors is established. IBORM leverages the information bottleneck principle to compress observation information, while retaining sufficient information relevant to other agents'…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Evolutionary Algorithms and Applications