MANSA: Learning Fast and Slow in Multi-Agent Systems

David Mguni; Haojun Chen; Taher Jafferjee; Jianhong Wang; Long Fei,; Xidong Feng; Stephen McAleer; Feifei Tong; Jun Wang; Yaodong Yang

arXiv:2302.05910·cs.MA·June 6, 2023

MANSA: Learning Fast and Slow in Multi-Agent Systems

David Mguni, Haojun Chen, Taher Jafferjee, Jianhong Wang, Long Fei,, Xidong Feng, Stephen McAleer, Feifei Tong, Jun Wang, Yaodong Yang

PDF

Open Access

TL;DR

MANSA is a novel multi-agent reinforcement learning framework that selectively employs centralized learning only when necessary, significantly reducing computational costs while maintaining or improving coordination performance.

Contribution

Introduces MANSA, a plug-and-play framework that uses an additional agent with switching controls to optimize when centralized learning is employed during training.

Findings

01

MANSA reduces CL calls by 40% in SMAC.

02

Achieves superior performance in LBF and SMAC.

03

Maintains convergence and coordination quality.

Abstract

In multi-agent reinforcement learning (MARL), independent learning (IL) often shows remarkable performance and easily scales with the number of agents. Yet, using IL can be inefficient and runs the risk of failing to successfully train, particularly in scenarios that require agents to coordinate their actions. Using centralised learning (CL) enables MARL agents to quickly learn how to coordinate their behaviour but employing CL everywhere is often prohibitively expensive in real-world applications. Besides, using CL in value-based methods often needs strong representational constraints (e.g. individual-global-max condition) that can lead to poor performance if violated. In this paper, we introduce a novel plug & play IL framework named Multi-Agent Network Selection Algorithm (MANSA) which selectively employs CL only at states that require coordination. At its core, MANSA has an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Memory and Neural Computing · Evolutionary Algorithms and Applications