Information-Bottleneck-Based Behavior Representation Learning for Multi-agent Reinforcement learning
Yue Jin, Shuangqing Wei, Jian Yuan, Xudong Zhang

TL;DR
This paper introduces IBORM, a method that uses the information bottleneck principle to learn compact, informative behavior representations in multi-agent reinforcement learning, leading to faster convergence and improved performance.
Contribution
It proposes a novel explicit behavior representation learning approach based on the information bottleneck, enhancing interpretability and efficiency in multi-agent RL.
Findings
IBORM achieves the fastest convergence among compared methods.
It results in the best policy performance.
The approach effectively balances information compression and utility.
Abstract
In multi-agent deep reinforcement learning, extracting sufficient and compact information of other agents is critical to attain efficient convergence and scalability of an algorithm. In canonical frameworks, distilling of such information is often done in an implicit and uninterpretable manner, or explicitly with cost functions not able to reflect the relationship between information compression and utility in representation. In this paper, we present Information-Bottleneck-based Other agents' behavior Representation learning for Multi-agent reinforcement learning (IBORM) to explicitly seek low-dimensional mapping encoder through which a compact and informative representation relevant to other agents' behaviors is established. IBORM leverages the information bottleneck principle to compress observation information, while retaining sufficient information relevant to other agents'…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Evolutionary Algorithms and Applications
