Cooperative Reward Shaping for Multi-Agent Pathfinding
Zhenyu Song, Ronghao Zheng, Senlin Zhang, Meiqin Liu

TL;DR
This paper introduces a reward shaping technique based on Independent Q-Learning to enhance cooperation among agents in multi-agent pathfinding, improving efficiency especially in large-scale scenarios.
Contribution
It proposes a novel reward shaping method that incorporates agent interactions to promote cooperation in distributed multi-agent pathfinding using MARL.
Findings
Outperforms state-of-the-art planners in large-scale scenarios.
Facilitates active cooperation among agents through reward shaping.
Maintains competitive performance in smaller scenarios.
Abstract
The primary objective of Multi-Agent Pathfinding (MAPF) is to plan efficient and conflict-free paths for all agents. Traditional multi-agent path planning algorithms struggle to achieve efficient distributed path planning for multiple agents. In contrast, Multi-Agent Reinforcement Learning (MARL) has been demonstrated as an effective approach to achieve this objective. By modeling the MAPF problem as a MARL problem, agents can achieve efficient path planning and collision avoidance through distributed strategies under partial observation. However, MARL strategies often lack cooperation among agents due to the absence of global information, which subsequently leads to reduced MAPF efficiency. To address this challenge, this letter introduces a unique reward shaping technique based on Independent Q-Learning (IQL). The aim of this method is to evaluate the influence of one agent on its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Multi-Agent Systems and Negotiation · Digital Rights Management and Security
MethodsQ-Learning
