Neural Tree Expansion for Multi-Robot Planning in Non-Cooperative Environments

Benjamin Riviere; Wolfgang Hoenig; Matthew Anderson; Soon-Jo Chung

arXiv:2104.09705·cs.RO·July 12, 2021

TL;DR

This paper introduces Neural Tree Expansion, a multi-robot planning method in which a centralized expert search supplies demonstrations and a decentralized search makes decisions in real time. It outperforms an MCTS given larger computational resources in solution quality and shows coordinated robot behavior in simulation.

Contribution

It adapts AlphaZero's approach to multi-robot settings with partial information and continuous actions, integrating expert demonstrations and neural networks for efficient online planning.

Findings

01 Outperforms MCTS that is given larger computational resources, as measured by solution quality

02 Demonstrates effective robot coordination in simulations

03 Enables real-time planning at 20 Hz on aerial hardware

Abstract

We present a self-improving, Neural Tree Expansion (NTE) method for multi-robot online planning in non-cooperative environments, where each robot attempts to maximize its cumulative reward while interacting with other self-interested robots. Our algorithm adapts the centralized, perfect information, discrete-action space method from AlphaZero to a decentralized, partial information, continuous action space setting for multi-robot applications. Our method has three interacting components: (i) a centralized, perfect-information "expert" Monte Carlo Tree Search (MCTS) with large computation resources that provides expert demonstrations, (ii) a decentralized, partial-information "learner" MCTS with small computation resources that runs in real-time and provides self-play examples, and (iii) policy & value neural networks that are trained with the expert demonstrations and bias both the…
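
The abstract is cut off before it says exactly what the networks bias, so the following is only a minimal sketch of the general idea, not the authors' implementation. It shows one common way a policy network can bias node expansion in a tree search over a continuous action space, with a value network scoring new leaves in place of a rollout (as in AlphaZero). Everything here is an assumption made for illustration: the single-agent 1-D toy problem, the hypothetical `policy_stub` and `value_stub` functions standing in for trained networks, the mixing probability `beta`, and the fixed child limit `max_children`.

```python
import math
import random


def policy_stub(state):
    """Hypothetical stand-in for a trained policy network: (mean, std) of an action."""
    # Assumption: the goal is at 0.0 on a line, so steer toward it.
    return (-0.5 if state > 0 else 0.5), 0.1


def value_stub(state):
    """Hypothetical stand-in for a trained value network: closer to 0.0 scores higher."""
    return 1.0 / (1.0 + abs(state))


class Node:
    def __init__(self, state, parent=None, action=None):
        self.state, self.parent, self.action = state, parent, action
        self.children, self.visits, self.total = [], 0, 0.0


def expand(node, rng, beta=0.5):
    """Add one child; its action comes from the policy with probability beta, else uniform."""
    if rng.random() < beta:
        mean, std = policy_stub(node.state)
        action = rng.gauss(mean, std)
    else:
        action = rng.uniform(-1.0, 1.0)
    action = max(-1.0, min(1.0, action))  # clip to the toy action bounds
    child = Node(node.state + action, node, action)
    node.children.append(child)
    return child


def select(node, c=1.4):
    """Pick the child with the highest UCB1 score."""
    return max(
        node.children,
        key=lambda ch: ch.total / ch.visits
        + c * math.sqrt(math.log(node.visits) / ch.visits),
    )


def search(root_state, iterations=200, max_children=5, seed=0):
    rng = random.Random(seed)
    root = Node(root_state)
    for _ in range(iterations):
        node = root
        # Descend while the node already holds its full set of sampled actions.
        while len(node.children) >= max_children:
            node = select(node)
        leaf = expand(node, rng)
        value = value_stub(leaf.state)  # value estimate replaces a rollout
        # Backpropagate the estimate to the root.
        while leaf is not None:
            leaf.visits += 1
            leaf.total += value
            leaf = leaf.parent
    # Return the most-visited root action.
    return max(root.children, key=lambda ch: ch.visits).action
```

Calling `search(2.0)` returns a single action in [-1, 1] for a robot starting at position 2.0. Setting `beta=0` in `expand` recovers purely uniform action sampling, which is a simple way to compare guided and unguided expansion in this toy setting.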

Code & Models

Repositories

bpriviere/decision_making
PyTorch, official
