Work Smarter Not Harder: Simple Imitation Learning with CS-PIBT   Outperforms Large Scale Imitation Learning for MAPF

Rishi Veerapaneni; Arthur Jakobsson; Kevin Ren; Samuel Kim; Jiaoyang; Li; Maxim Likhachev

arXiv:2409.14491·cs.MA·September 24, 2024

Work Smarter Not Harder: Simple Imitation Learning with CS-PIBT Outperforms Large Scale Imitation Learning for MAPF

Rishi Veerapaneni, Arthur Jakobsson, Kevin Ren, Samuel Kim, Jiaoyang, Li, Maxim Likhachev

PDF

Open Access

TL;DR

This paper demonstrates that simple imitation learning combined with a collision post-processing method (CS-PIBT) significantly outperforms existing large-scale ML approaches in multi-agent pathfinding, emphasizing the importance of collision resolution techniques.

Contribution

The study shows that integrating a collision post-processing step with simple imitation learning yields superior MAPF performance, challenging the focus on complex architectures in prior ML MAPF research.

Findings

01

Simple imitation learning with CS-PIBT outperforms complex ML MAPF policies.

02

Using collision shields like CS-PIBT enables rapid training and better scalability.

03

Future ML MAPF methods should incorporate collision resolution and focus on longer planning horizons.

Abstract

Multi-Agent Path Finding (MAPF) is the problem of effectively finding efficient collision-free paths for a group of agents in a shared workspace. The MAPF community has largely focused on developing high-performance heuristic search methods. Recently, several works have applied various machine learning (ML) techniques to solve MAPF, usually involving sophisticated architectures, reinforcement learning techniques, and set-ups, but none using large amounts of high-quality supervised data. Our initial objective in this work was to show how simple large scale imitation learning of high-quality heuristic search methods can lead to state-of-the-art ML MAPF performance. However, we find that, at least with our model architecture, simple large scale (700k examples with hundreds of agents per example) imitation learning does \textit{not} produce impressive results. Instead, we find that by using…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Human Motion and Animation · Hand Gesture Recognition Systems