Decentralized Non-communicating Multiagent Collision Avoidance with Deep   Reinforcement Learning

Yu Fan Chen; Miao Liu; Michael Everett; and Jonathan P. How

arXiv:1609.07845·cs.MA·September 29, 2016

Decentralized Non-communicating Multiagent Collision Avoidance with Deep Reinforcement Learning

Yu Fan Chen, Miao Liu, Michael Everett, and Jonathan P. How

PDF

TL;DR

This paper introduces a decentralized deep reinforcement learning-based collision avoidance method for multiagent systems that predicts interaction patterns offline, enabling real-time, collision-free path planning even in non-communicating scenarios.

Contribution

It presents a novel value network approach that encodes time-to-goal estimates considering neighboring agents, improving path efficiency and computational feasibility.

Findings

01

Over 26% improvement in path quality over ORCA

02

Efficient real-time collision avoidance queries

03

Effective handling of uncertainty in agent motion

Abstract

Finding feasible, collision-free paths for multiagent systems can be challenging, particularly in non-communicating scenarios where each agent's intent (e.g. goal) is unobservable to the others. In particular, finding time efficient paths often requires anticipating interaction with neighboring agents, the process of which can be computationally prohibitive. This work presents a decentralized multiagent collision avoidance algorithm based on a novel application of deep reinforcement learning, which effectively offloads the online computation (for predicting interaction patterns) to an offline learning procedure. Specifically, the proposed approach develops a value network that encodes the estimated time to the goal given an agent's joint configuration (positions and velocities) with its neighbors. Use of the value network not only admits efficient (i.e., real-time implementable) queries…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.