TL;DR
SACHA introduces a novel multi-agent reinforcement learning approach with heuristic-based attention mechanisms to improve cooperation and scalability in partially observable multi-agent pathfinding, outperforming existing methods.
Contribution
The paper proposes SACHA, a multi-agent actor-critic method with heuristic attention and an agent-centered critic, enhancing cooperation and generalization in MAPF tasks.
Findings
SACHA outperforms state-of-the-art methods in success rate and solution quality.
Heuristic-based attention improves agent cooperation in congested environments.
Agent-centered critic enhances scalability and generalization across different MAPF scenarios.
Abstract
Multi-Agent Path Finding (MAPF) is a crucial component for many large-scale robotic systems, where agents must plan their collision-free paths to their given goal positions. Recently, multi-agent reinforcement learning has been introduced to solve the partially observable variant of MAPF by learning a decentralized single-agent policy in a centralized fashion based on each agent's partial observation. However, existing learning-based methods are ineffective in achieving complex multi-agent cooperation, especially in congested environments, due to the non-stationarity of this setting. To tackle this challenge, we propose a multi-agent actor-critic method called Soft Actor-Critic with Heuristic-Based Attention (SACHA), which employs novel heuristic-based attention mechanisms for both the actors and critics to encourage cooperation among agents. SACHA learns a neural network for each agent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
