Learning to Communicate Locally for Large-Scale Multi-Agent Pathfinding

Valeriy Vyaltsev; Alsu Sagirova; Anton Andreychuk; Oleg Bulichev; Yuri Kuratov; Konstantin Yakovlev; Aleksandr Panov; Alexey Skrynnik

arXiv:2605.07637·cs.AI·May 13, 2026

Learning to Communicate Locally for Large-Scale Multi-Agent Pathfinding

Valeriy Vyaltsev, Alsu Sagirova, Anton Andreychuk, Oleg Bulichev, Yuri Kuratov, Konstantin Yakovlev, Aleksandr Panov, Alexey Skrynnik

PDF

1 Datasets 1 Video

TL;DR

This paper introduces LC-MAPF, a learnable communication module for multi-agent pathfinding that improves cooperation and scalability in decentralized, machine learning-based solutions.

Contribution

The paper proposes a generalizable pre-trained model with multi-round communication for enhanced multi-agent cooperation in MAPF.

Findings

01

Outperforms existing learning-based MAPF solvers in diverse metrics.

02

Maintains scalability despite incorporating communication.

03

Effective in unseen test scenarios.

Abstract

Multi-agent pathfinding (MAPF) is a widely used abstraction for multi-robot trajectory planning problems, where multiple homogeneous agents move simultaneously within a shared environment. Although solving MAPF optimally is NP-hard, scalable and efficient solvers are critical for real-world applications such as logistics and search-and-rescue. To this end, the research community has proposed various decentralized suboptimal MAPF solvers that leverage machine learning. Such methods frame MAPF (from a single agent perspective) as a Dec-POMDP where at each time step an agent has to decide an action based on the local observation and typically solve the problem via reinforcement learning or imitation learning. We follow the same approach but additionally introduce a learnable communication module tailored to enhance cooperation between agents via efficient feature sharing. We present the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

aoiandroid/papers
dataset· 28 dl
28 dl

Videos

Learning to Communicate Locally for Large-Scale Multi-Agent Pathfinding· underline