Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning
Yaodong Yang, Jianye Hao, Ben Liao, Kun Shao, Guangyong Chen, Wulong, Liu, Hongyao Tang

TL;DR
Qatten introduces a theoretically grounded, attention-based framework for cooperative multiagent reinforcement learning that improves coordination and performance by explicitly modeling agent contributions.
Contribution
It provides a general formula for multiagent Q-values and implements an attention mechanism to enhance value decomposition without restrictive assumptions.
Findings
Outperforms state-of-the-art MARL methods on StarCraft benchmarks
Provides a theoretical foundation for multiagent value decomposition
Uses attention to model agent contributions explicitly
Abstract
In many real-world tasks, multiple agents must learn to coordinate with each other given their private observations and limited communication ability. Deep multiagent reinforcement learning (Deep-MARL) algorithms have shown superior performance in such challenging settings. One representative class of work is multiagent value decomposition, which decomposes the global shared multiagent Q-value into individual Q-values to guide individuals' behaviors, i.e. VDN imposing an additive formation and QMIX adopting a monotonic assumption using an implicit mixing method. However, most of the previous efforts impose certain assumptions between and and lack theoretical groundings. Besides, they do not explicitly consider the agent-level impact of individuals to the whole system when transforming individual s into . In this paper, we theoretically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Mobile Crowdsensing and Crowdsourcing
