Work Smarter Not Harder: Simple Imitation Learning with CS-PIBT Outperforms Large Scale Imitation Learning for MAPF
Rishi Veerapaneni, Arthur Jakobsson, Kevin Ren, Samuel Kim, Jiaoyang, Li, Maxim Likhachev

TL;DR
This paper demonstrates that simple imitation learning combined with a collision post-processing method (CS-PIBT) significantly outperforms existing large-scale ML approaches in multi-agent pathfinding, emphasizing the importance of collision resolution techniques.
Contribution
The study shows that integrating a collision post-processing step with simple imitation learning yields superior MAPF performance, challenging the focus on complex architectures in prior ML MAPF research.
Findings
Simple imitation learning with CS-PIBT outperforms complex ML MAPF policies.
Using collision shields like CS-PIBT enables rapid training and better scalability.
Future ML MAPF methods should incorporate collision resolution and focus on longer planning horizons.
Abstract
Multi-Agent Path Finding (MAPF) is the problem of effectively finding efficient collision-free paths for a group of agents in a shared workspace. The MAPF community has largely focused on developing high-performance heuristic search methods. Recently, several works have applied various machine learning (ML) techniques to solve MAPF, usually involving sophisticated architectures, reinforcement learning techniques, and set-ups, but none using large amounts of high-quality supervised data. Our initial objective in this work was to show how simple large scale imitation learning of high-quality heuristic search methods can lead to state-of-the-art ML MAPF performance. However, we find that, at least with our model architecture, simple large scale (700k examples with hundreds of agents per example) imitation learning does \textit{not} produce impressive results. Instead, we find that by using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Human Motion and Animation · Hand Gesture Recognition Systems
