TL;DR
This paper introduces Neural Tree Expansion, a novel multi-robot planning method that combines centralized expert guidance with decentralized real-time decision-making, achieving superior performance and coordination in complex environments.
Contribution
It adapts AlphaZero's approach to multi-robot settings with partial information and continuous actions, integrating expert demonstrations and neural networks for efficient online planning.
Findings
Outperforms larger resource MCTS in solution quality
Demonstrates effective robot coordination in simulations
Enables real-time planning at 20Hz on aerial hardware
Abstract
We present a self-improving, Neural Tree Expansion (NTE) method for multi-robot online planning in non-cooperative environments, where each robot attempts to maximize its cumulative reward while interacting with other self-interested robots. Our algorithm adapts the centralized, perfect information, discrete-action space method from AlphaZero to a decentralized, partial information, continuous action space setting for multi-robot applications. Our method has three interacting components: (i) a centralized, perfect-information "expert" Monte Carlo Tree Search (MCTS) with large computation resources that provides expert demonstrations, (ii) a decentralized, partial-information "learner" MCTS with small computation resources that runs in real-time and provides self-play examples, and (iii) policy & value neural networks that are trained with the expert demonstrations and bias both the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
