Learn to Follow: Decentralized Lifelong Multi-agent Pathfinding via Planning and Learning
Alexey Skrynnik, Anton Andreychuk, Maria Nesterova, Konstantin, Yakovlev, Aleksandr Panov

TL;DR
This paper introduces a decentralized approach combining planning and reinforcement learning to solve lifelong multi-agent pathfinding problems efficiently, outperforming existing methods in throughput and speed.
Contribution
It presents a novel decentralized MAPF method integrating heuristic search planning with reinforcement learning for collision avoidance, effective in lifelong scenarios.
Findings
Outperforms state-of-the-art solvers in throughput
Generalizes well to unseen maps
Significantly faster than search-based solvers
Abstract
Multi-agent Pathfinding (MAPF) problem generally asks to find a set of conflict-free paths for a set of agents confined to a graph and is typically solved in a centralized fashion. Conversely, in this work, we investigate the decentralized MAPF setting, when the central controller that posses all the information on the agents' locations and goals is absent and the agents have to sequientially decide the actions on their own without having access to a full state of the environment. We focus on the practically important lifelong variant of MAPF, which involves continuously assigning new goals to the agents upon arrival to the previous ones. To address this complex problem, we propose a method that integrates two complementary approaches: planning with heuristic search and reinforcement learning through policy optimization. Planning is utilized to construct and re-plan individual paths.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsRobotic Path Planning Algorithms
MethodsFocus
