Cooperative Reward Shaping for Multi-Agent Pathfinding

Zhenyu Song; Ronghao Zheng; Senlin Zhang; Meiqin Liu

arXiv:2407.10403·cs.AI·July 18, 2024

Cooperative Reward Shaping for Multi-Agent Pathfinding

Zhenyu Song, Ronghao Zheng, Senlin Zhang, Meiqin Liu

PDF

Open Access

TL;DR

This paper introduces a reward shaping technique based on Independent Q-Learning to enhance cooperation among agents in multi-agent pathfinding, improving efficiency especially in large-scale scenarios.

Contribution

It proposes a novel reward shaping method that incorporates agent interactions to promote cooperation in distributed multi-agent pathfinding using MARL.

Findings

01

Outperforms state-of-the-art planners in large-scale scenarios.

02

Facilitates active cooperation among agents through reward shaping.

03

Maintains competitive performance in smaller scenarios.

Abstract

The primary objective of Multi-Agent Pathfinding (MAPF) is to plan efficient and conflict-free paths for all agents. Traditional multi-agent path planning algorithms struggle to achieve efficient distributed path planning for multiple agents. In contrast, Multi-Agent Reinforcement Learning (MARL) has been demonstrated as an effective approach to achieve this objective. By modeling the MAPF problem as a MARL problem, agents can achieve efficient path planning and collision avoidance through distributed strategies under partial observation. However, MARL strategies often lack cooperation among agents due to the absence of global information, which subsequently leads to reduced MAPF efficiency. To address this challenge, this letter introduces a unique reward shaping technique based on Independent Q-Learning (IQL). The aim of this method is to evaluate the influence of one agent on its…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies · Multi-Agent Systems and Negotiation · Digital Rights Management and Security

MethodsQ-Learning