S2RL: Do We Really Need to Perceive All States in Deep Multi-Agent Reinforcement Learning?
Shuang Luo, Yinchuan Li, Jiahui Li, Kun Kuang, Furui Liu, Yunfeng, Shao, Chao Wu

TL;DR
This paper introduces S2RL, a sparse attention-based framework for multi-agent reinforcement learning that selectively focuses on important local states, leading to significant performance improvements in complex environments.
Contribution
The paper proposes a novel sparse attention mechanism for MARL that discards irrelevant local information, enhancing decentralized policy performance and general applicability.
Findings
S2RL significantly outperforms existing methods in StarCraft II tasks.
Sparse attention improves the efficiency of local state perception.
The framework is versatile and can be integrated with various MARL algorithms.
Abstract
Collaborative multi-agent reinforcement learning (MARL) has been widely used in many practical applications, where each agent makes a decision based on its own observation. Most mainstream methods treat each local observation as an entirety when modeling the decentralized local utility functions. However, they ignore the fact that local observation information can be further divided into several entities, and only part of the entities is helpful to model inference. Moreover, the importance of different entities may change over time. To improve the performance of decentralized policies, the attention mechanism is used to capture features of local information. Nevertheless, existing attention models rely on dense fully connected graphs and cannot better perceive important states. To this end, we propose a sparse state based MARL (S2RL) framework, which utilizes a sparse attention…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
