Advancing Learnable Multi-Agent Pathfinding Solvers with Active Fine-Tuning

Anton Andreychuk; Konstantin Yakovlev; Aleksandr Panov; Alexey Skrynnik

arXiv:2506.23793·cs.AI·July 1, 2025

Advancing Learnable Multi-Agent Pathfinding Solvers with Active Fine-Tuning

Anton Andreychuk, Konstantin Yakovlev, Aleksandr Panov, Alexey Skrynnik

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper introduces MAPF-GPT-DDG, a fine-tuned learning-based multi-agent pathfinding solver that significantly improves solution quality and scalability, handling up to one million agents in complex environments.

Contribution

It presents a novel fine-tuning method with delta-data generation that enhances a pre-trained MAPF model's performance and scalability beyond existing solutions.

Findings

01

Outperforms all existing learning-based MAPF solvers.

02

Handles environments with up to 1 million agents.

03

Achieves significant improvements in solution quality.

Abstract

Multi-agent pathfinding (MAPF) is a common abstraction of multi-robot trajectory planning problems, where multiple homogeneous robots simultaneously move in the shared environment. While solving MAPF optimally has been proven to be NP-hard, scalable, and efficient, solvers are vital for real-world applications like logistics, search-and-rescue, etc. To this end, decentralized suboptimal MAPF solvers that leverage machine learning have come on stage. Building on the success of the recently introduced MAPF-GPT, a pure imitation learning solver, we introduce MAPF-GPT-DDG. This novel approach effectively fine-tunes the pre-trained MAPF model using centralized expert data. Leveraging a novel delta-data generation mechanism, MAPF-GPT-DDG accelerates training while significantly improving performance at test time. Our experiments demonstrate that MAPF-GPT-DDG surpasses all existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Cognitive-AI-Systems/MAPF-GPT
pytorch

Datasets

aandreychuk/MAPF-GPT-DDG
dataset· 303 dl
303 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Reinforcement Learning in Robotics · AI-based Problem Solving and Planning