Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning
Hengyuan Hu, Jakob N Foerster

TL;DR
This paper introduces the Simplified Action Decoder (SAD), a novel deep multi-agent reinforcement learning method that improves cooperation and communication in multi-agent settings like Hanabi, achieving state-of-the-art results.
Contribution
SAD leverages centralized training to enable agents to observe both exploratory and greedy actions of teammates, enhancing multi-agent learning in cooperative environments.
Findings
SAD achieves new state-of-the-art performance on Hanabi for 2-5 players.
The method improves agents' ability to reason over teammates' intentions.
Ablation studies confirm the effectiveness of SAD components.
Abstract
In recent years we have seen fast progress on a number of benchmark problems in AI, with modern methods achieving near or super human performance in Go, Poker and Dota. One common aspect of all of these challenges is that they are by design adversarial or, technically speaking, zero-sum. In contrast to these settings, success in the real world commonly requires humans to collaborate and communicate with others, in settings that are, at least partially, cooperative. In the last year, the card game Hanabi has been established as a new benchmark environment for AI to fill this gap. In particular, Hanabi is interesting to humans since it is entirely focused on theory of mind, i.e., the ability to effectively reason over the intentions, beliefs and point of view of other agents when observing their actions. Learning to be informative when observed by others is an interesting challenge for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Sports Analytics and Performance
