UNMAS: Multi-Agent Reinforcement Learning for Unshaped Cooperative   Scenarios

Jiajun Chai; Weifan Li; Yuanheng Zhu; Dongbin Zhao; Zhe Ma; Kewu Sun,; Jishiyu Ding

arXiv:2203.14477·cs.MA·March 29, 2022

UNMAS: Multi-Agent Reinforcement Learning for Unshaped Cooperative Scenarios

Jiajun Chai, Weifan Li, Yuanheng Zhu, Dongbin Zhao, Zhe Ma, Kewu Sun,, Jishiyu Ding

PDF

TL;DR

This paper introduces UNMAS, a novel multi-agent reinforcement learning method that adapts to changing numbers of agents and action sets, demonstrating superior performance in complex StarCraft II scenarios.

Contribution

UNMAS is the first method to effectively handle unshaped multi-agent scenarios with dynamic agent counts and action sets, using self-weighting mixing networks and dual-stream individual action-value networks.

Findings

01

UNMAS outperforms state-of-the-art MARL algorithms in StarCraft II scenarios.

02

UNMAS achieves the highest winning rates, especially in complex scenarios.

03

Agents learn effective cooperative behaviors where others fail.

Abstract

Multi-agent reinforcement learning methods such as VDN, QMIX, and QTRAN that adopt centralized training with decentralized execution (CTDE) framework have shown promising results in cooperation and competition. However, in some multi-agent scenarios, the number of agents and the size of action set actually vary over time. We call these unshaped scenarios, and the methods mentioned above fail in performing satisfyingly. In this paper, we propose a new method called Unshaped Networks for Multi-Agent Systems (UNMAS) that adapts to the number and size changes in multi-agent systems. We propose the self-weighting mixing network to factorize the joint action-value. Its adaption to the change in agent number is attributed to the nonlinear mapping from each-agent Q value to the joint action-value with individual weights. Besides, in order to address the change in action set, each agent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.