Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning

Yaodong Yang; Jianye Hao; Ben Liao; Kun Shao; Guangyong Chen; Wulong Liu; Hongyao Tang

arXiv:2002.03939·cs.MA·June 11, 2020·108 cites

TL;DR

Qatten introduces a theoretically grounded, attention-based framework for cooperative multiagent reinforcement learning that improves coordination and performance by explicitly modeling agent contributions.

Contribution

It provides a general formula for multiagent Q-values and implements an attention mechanism to enhance value decomposition without restrictive assumptions.

Findings

01 Outperforms state-of-the-art MARL methods on StarCraft benchmarks

02 Provides a theoretical foundation for multiagent value decomposition

03 Uses attention to model agent contributions explicitly

Abstract

In many real-world tasks, multiple agents must learn to coordinate with each other given their private observations and limited communication ability. Deep multiagent reinforcement learning (Deep-MARL) algorithms have shown superior performance in such challenging settings. One representative class of work is multiagent value decomposition, which decomposes the global shared multiagent Q-value $Q_{tot}$ into individual Q-values $Q^{i}$ to guide individuals' behaviors, e.g. VDN imposes an additive form and QMIX adopts a monotonic assumption using an implicit mixing method. However, most of the previous efforts impose certain assumptions on the relationship between $Q_{tot}$ and $Q^{i}$ and lack theoretical grounding. Besides, they do not explicitly consider the agent-level impact of individuals on the whole system when transforming the individual $Q^{i}$s into $Q_{tot}$. In this paper, we theoretically…
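
To make the decomposition idea concrete, here is a minimal sketch and not the paper's implementation. It contrasts VDN's additive form, where $Q_{tot}$ is the plain sum of the $Q^{i}$, with a weighted sum whose non-negative weights come from a softmax over per-agent scores. The function names (`vdn_mix`, `attention_mix`), the `scores` input, and the single-head softmax weighting are illustrative assumptions. In a real model the scores would be produced from the state and agent features by a learned network.

```python
import math


def vdn_mix(agent_qs):
    """VDN-style additive mixing: Q_tot is the sum of the individual Q^i."""
    return sum(agent_qs)


def softmax(scores):
    """Numerically stable softmax; returns non-negative weights summing to 1."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_mix(agent_qs, scores):
    """Hypothetical attention-style mixing (an assumption, not Qatten itself):
    Q_tot is a weighted sum of the Q^i with softmax weights. Because the
    weights are non-negative, Q_tot never decreases when any Q^i increases."""
    weights = softmax(scores)
    return sum(w * q for w, q in zip(weights, agent_qs))


agent_qs = [1.0, 2.0, 3.0]                           # individual Q-values Q^i
q_vdn = vdn_mix(agent_qs)                            # additive Q_tot
q_att = attention_mix(agent_qs, [0.0, 0.0, 0.0])     # equal scores: uniform weights
q_att_up = attention_mix([1.0, 2.0, 4.0], [0.0, 0.0, 0.0])  # raise one Q^i
```

With equal scores the weighted mix reduces to the mean of the individual Q-values, and raising any single $Q^{i}$ raises the mixed value, which is the monotonic relationship the abstract attributes to QMIX.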

Taxonomy

Topics: Reinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Mobile Crowdsensing and Crowdsourcing