Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning   with Goal Imagination

Liangzhou Wang; Kaiwen Zhu; Fengming Zhu; Xinghu Yao; Shujie Zhang,; Deheng Ye; Haobo Fu; Qiang Fu; Wei Yang

arXiv:2403.03172·cs.AI·March 6, 2024·1 cites

Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination

Liangzhou Wang, Kaiwen Zhu, Fengming Zhu, Xinghu Yao, Shujie Zhang,, Deheng Ye, Haobo Fu, Qiang Fu, Wei Yang

PDF

Open Access

TL;DR

This paper introduces MAGI, a model-based framework that enables multi-agent systems to explicitly reach consensus through goal imagination, improving coordination, sample efficiency, and performance in cooperative reinforcement learning tasks.

Contribution

The paper proposes a novel goal imagination framework that models future state distributions to explicitly coordinate agents in multi-agent reinforcement learning.

Findings

01

MAGI improves sample efficiency in multi-agent environments.

02

MAGI outperforms existing methods in cooperative tasks.

03

The approach effectively guides agents to high-value future states.

Abstract

Reaching consensus is key to multi-agent coordination. To accomplish a cooperative task, agents need to coherently select optimal joint actions to maximize the team reward. However, current cooperative multi-agent reinforcement learning (MARL) methods usually do not explicitly take consensus into consideration, which may cause miscoordination problem. In this paper, we propose a model-based consensus mechanism to explicitly coordinate multiple agents. The proposed Multi-agent Goal Imagination (MAGI) framework guides agents to reach consensus with an Imagined common goal. The common goal is an achievable state with high value, which is obtained by sampling from the distribution of future states. We directly model this distribution with a self-supervised generative model, thus alleviating the "curse of dimensinality" problem induced by multi-agent multi-step policy rollout commonly used…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics