CPIG: Leveraging Consistency Policy with Intention Guidance for   Multi-agent Exploration

Yuqian Fu; Yuanheng Zhu; Haoran Li; Zijie Zhao; Jiajun Chai; Dongbin; Zhao

arXiv:2411.03603·cs.MA·December 9, 2024

CPIG: Leveraging Consistency Policy with Intention Guidance for Multi-agent Exploration

Yuqian Fu, Yuanheng Zhu, Haoran Li, Zijie Zhao, Jiajun Chai, Dongbin, Zhao

PDF

Open Access

TL;DR

This paper introduces CPIG, a novel multi-agent reinforcement learning approach that combines a multimodal consistency policy with intention sharing to improve exploration and cooperation, especially in sparse-reward environments.

Contribution

The paper proposes a new method, CPIG, integrating a multimodal consistency policy and intention sharing to enhance exploration and cooperation in multi-agent RL.

Findings

01

Significantly outperforms SOTA in sparse-reward environments by 20%.

02

Achieves comparable performance to baselines in dense-reward settings.

03

Effectively promotes exploration and cooperation among agents.

Abstract

Efficient exploration is crucial in cooperative multi-agent reinforcement learning (MARL), especially in sparse-reward settings. However, due to the reliance on the unimodal policy, existing methods are prone to falling into the local optima, hindering the effective exploration of better policies. Furthermore, in sparse-reward settings, each agent tends to receive a scarce reward, which poses significant challenges to inter-agent cooperation. This not only increases the difficulty of policy learning but also degrades the overall performance of multi-agent tasks. To address these issues, we propose a Consistency Policy with Intention Guidance (CPIG), with two primary components: (a) introducing a multimodal policy to enhance the agent's exploration capability, and (b) sharing the intention among agents to foster agent cooperation. For component (a), CPIG incorporates a Consistency model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed and Parallel Computing Systems · Multi-Agent Systems and Negotiation