S2RL: Do We Really Need to Perceive All States in Deep Multi-Agent   Reinforcement Learning?

Shuang Luo; Yinchuan Li; Jiahui Li; Kun Kuang; Furui Liu; Yunfeng; Shao; Chao Wu

arXiv:2206.11054·cs.LG·June 23, 2022·1 cites

S2RL: Do We Really Need to Perceive All States in Deep Multi-Agent Reinforcement Learning?

Shuang Luo, Yinchuan Li, Jiahui Li, Kun Kuang, Furui Liu, Yunfeng, Shao, Chao Wu

PDF

Open Access

TL;DR

This paper introduces S2RL, a sparse attention-based framework for multi-agent reinforcement learning that selectively focuses on important local states, leading to significant performance improvements in complex environments.

Contribution

The paper proposes a novel sparse attention mechanism for MARL that discards irrelevant local information, enhancing decentralized policy performance and general applicability.

Findings

01

S2RL significantly outperforms existing methods in StarCraft II tasks.

02

Sparse attention improves the efficiency of local state perception.

03

The framework is versatile and can be integrated with various MARL algorithms.

Abstract

Collaborative multi-agent reinforcement learning (MARL) has been widely used in many practical applications, where each agent makes a decision based on its own observation. Most mainstream methods treat each local observation as an entirety when modeling the decentralized local utility functions. However, they ignore the fact that local observation information can be further divided into several entities, and only part of the entities is helpful to model inference. Moreover, the importance of different entities may change over time. To improve the performance of decentralized policies, the attention mechanism is used to capture features of local information. Nevertheless, existing attention models rely on dense fully connected graphs and cannot better perceive important states. To this end, we propose a sparse state based MARL (S2RL) framework, which utilizes a sparse attention…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics