Improving Learnt Local MAPF Policies with Heuristic Search

Rishi Veerapaneni; Qian Wang; Kevin Ren; Arthur Jakobsson; Jiaoyang; Li; Maxim Likhachev

arXiv:2403.20300·cs.MA·April 1, 2024·1 cites

Improving Learnt Local MAPF Policies with Heuristic Search

Rishi Veerapaneni, Qian Wang, Kevin Ren, Arthur Jakobsson, Jiaoyang, Li, Maxim Likhachev

PDF

Open Access

TL;DR

This paper enhances machine learning-based local policies for multi-agent pathfinding by integrating heuristic search, significantly improving success rates and scalability, including in high congestion scenarios with up to 20% agent density.

Contribution

It introduces a novel method to combine heuristic search with learned local policies, enabling better deadlock resolution and full horizon planning in MAPF.

Findings

01

Improved success rates of ML policies with heuristic search.

02

Scalability demonstrated in high congestion scenarios.

03

First ML-based MAPF approach to handle 20% agent density.

Abstract

Multi-agent path finding (MAPF) is the problem of finding collision-free paths for a team of agents to reach their goal locations. State-of-the-art classical MAPF solvers typically employ heuristic search to find solutions for hundreds of agents but are typically centralized and can struggle to scale when run with short timeouts. Machine learning (ML) approaches that learn policies for each agent are appealing as these could enable decentralized systems and scale well while maintaining good solution quality. Current ML approaches to MAPF have proposed methods that have started to scratch the surface of this potential. However, state-of-the-art ML approaches produce "local" policies that only plan for a single timestep and have poor success rates and scalability. Our main idea is that we can improve a ML local policy by using heuristic search methods on the output probability…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsConstraint Satisfaction and Optimization · AI-based Problem Solving and Planning