Learn to Follow: Decentralized Lifelong Multi-agent Pathfinding via   Planning and Learning

Alexey Skrynnik; Anton Andreychuk; Maria Nesterova; Konstantin; Yakovlev; Aleksandr Panov

arXiv:2310.01207·cs.AI·October 3, 2023·2 cites

Learn to Follow: Decentralized Lifelong Multi-agent Pathfinding via Planning and Learning

Alexey Skrynnik, Anton Andreychuk, Maria Nesterova, Konstantin, Yakovlev, Aleksandr Panov

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a decentralized approach combining planning and reinforcement learning to solve lifelong multi-agent pathfinding problems efficiently, outperforming existing methods in throughput and speed.

Contribution

It presents a novel decentralized MAPF method integrating heuristic search planning with reinforcement learning for collision avoidance, effective in lifelong scenarios.

Findings

01

Outperforms state-of-the-art solvers in throughput

02

Generalizes well to unseen maps

03

Significantly faster than search-based solvers

Abstract

Multi-agent Pathfinding (MAPF) problem generally asks to find a set of conflict-free paths for a set of agents confined to a graph and is typically solved in a centralized fashion. Conversely, in this work, we investigate the decentralized MAPF setting, when the central controller that posses all the information on the agents' locations and goals is absent and the agents have to sequientially decide the actions on their own without having access to a full state of the environment. We focus on the practically important lifelong variant of MAPF, which involves continuously assigning new goals to the agents upon arrival to the previous ones. To address this complex problem, we propose a method that integrates two complementary approaches: planning with heuristic search and reinforcement learning through policy optimization. Planning is utilized to construct and re-plan individual paths.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

airi-institute/learn-to-follow
none

Videos

Learn to Follow: Decentralized Lifelong Multi-Agent Pathfinding via Planning and Learning· underline

Taxonomy

TopicsRobotic Path Planning Algorithms

MethodsFocus