Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning
Shunyu Liu, Jie Song, Yihe Zhou, Na Yu, Kaixuan Chen, Zunlei Feng,, Mingli Song

TL;DR
This paper introduces OPT, a novel method for disentangling interaction patterns in multi-agent reinforcement learning, improving interpretability and generalization by filtering noisy interactions and capturing underlying interaction prototypes.
Contribution
OPT is the first approach to explicitly disentangle interaction patterns into prototypes, enhancing interpretability and robustness in multi-agent reinforcement learning.
Findings
OPT outperforms state-of-the-art methods on various benchmarks.
The sparse disagreement mechanism promotes diversity among prototypes.
Maximizing mutual information stabilizes training under partial observability.
Abstract
Deep cooperative multi-agent reinforcement learning has demonstrated its remarkable success over a wide spectrum of complex control tasks. However, recent advances in multi-agent learning mainly focus on value decomposition while leaving entity interactions still intertwined, which easily leads to over-fitting on noisy interactions between entities. In this work, we introduce a novel interactiOn Pattern disenTangling (OPT) method, to disentangle the entity interactions into interaction prototypes, each of which represents an underlying interaction pattern within a subgroup of the entities. OPT facilitates filtering the noisy interactions between irrelevant entities and thus significantly improves generalizability as well as interpretability. Specifically, OPT introduces a sparse disagreement mechanism to encourage sparsity and diversity among discovered interaction prototypes. Then the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
MethodsOPT
