Motion Planning Among Dynamic, Decision-Making Agents with Deep Reinforcement Learning

Michael Everett; Yu Fan Chen; Jonathan P. How

arXiv:1805.01956 · cs.RO · May 8, 2018

TL;DR

This paper presents a deep reinforcement learning algorithm for robot navigation among dynamic agents that adapts to varying numbers of agents and does not assume specific agent behaviors, improving safety and efficiency.

Contribution

The paper introduces an LSTM-based strategy that lets the algorithm use observations of an arbitrary number of other agents, rather than a fixed-size observation, and extends the authors' previous approach to dynamic agents that are not assumed to follow any particular behavior rules.
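The idea of folding a variable number of agent observations into one fixed-size vector can be sketched as follows. This is a minimal illustration in pure Python, not the authors' implementation: the class `TinyLSTMCell`, the function `encode_agents`, the five-element per-agent state (relative position, velocity, radius), the random untrained weights, and the farthest-first ordering are all assumptions made for the example.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class TinyLSTMCell:
    """Hypothetical minimal LSTM cell with random, untrained weights."""

    def __init__(self, input_size, hidden_size, seed=0):
        rng = random.Random(seed)
        self.hidden_size = hidden_size
        n = input_size + hidden_size
        # One weight matrix and bias per gate:
        # input (i), forget (f), output (o), candidate (g).
        self.W = {
            k: [[rng.uniform(-0.5, 0.5) for _ in range(n)]
                for _ in range(hidden_size)]
            for k in "ifog"
        }
        self.b = {k: [0.0] * hidden_size for k in "ifog"}

    def _affine(self, gate, z):
        return [
            sum(w * v for w, v in zip(row, z)) + b
            for row, b in zip(self.W[gate], self.b[gate])
        ]

    def step(self, x, h, c):
        z = list(x) + list(h)  # concatenate input and previous hidden state
        i = [sigmoid(v) for v in self._affine("i", z)]
        f = [sigmoid(v) for v in self._affine("f", z)]
        o = [sigmoid(v) for v in self._affine("o", z)]
        g = [math.tanh(v) for v in self._affine("g", z)]
        c_new = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
        h_new = [oj * math.tanh(cj) for oj, cj in zip(o, c_new)]
        return h_new, c_new


def encode_agents(cell, agent_states):
    """Fold any number of per-agent vectors into one fixed-size vector.

    Each state is assumed to be (rel_x, rel_y, vel_x, vel_y, radius).
    Assumption: agents are fed farthest first, so the closest agent is
    processed last and is the most recent input to the final hidden state.
    """
    ordered = sorted(
        agent_states, key=lambda s: math.hypot(s[0], s[1]), reverse=True
    )
    h = [0.0] * cell.hidden_size
    c = [0.0] * cell.hidden_size
    for x in ordered:
        h, c = cell.step(x, h, c)
    return h  # length is hidden_size regardless of how many agents were seen


cell = TinyLSTMCell(input_size=5, hidden_size=4)
two_agents = [(1.0, 0.5, 0.0, -0.3, 0.3), (3.0, -2.0, 0.5, 0.0, 0.4)]
five_agents = two_agents + [
    (0.5, 0.5, 0.1, 0.1, 0.3),
    (-4.0, 1.0, 0.0, 0.2, 0.5),
    (2.0, 2.0, -0.2, -0.2, 0.3),
]
print(len(encode_agents(cell, two_agents)), len(encode_agents(cell, five_agents)))
```

Both calls return a vector of the same length, so a downstream policy network with a fixed input size could consume it whether two or five agents are nearby. In a trained system the weights would be learned jointly with the policy rather than drawn at random.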

Findings

01 Outperforms previous approaches as the number of agents increases

02 Successfully navigates a robotic vehicle at human walking speed without 3D Lidar

03 Demonstrates robustness in simulation and real-world tests

Abstract

Robots that navigate among pedestrians use collision avoidance algorithms to enable safe and efficient operation. Recent works present deep reinforcement learning as a framework to model the complex interactions and cooperation. However, they are implemented using key assumptions about other agents' behavior that deviate from reality as the number of agents in the environment increases. This work extends our previous approach to develop an algorithm that learns collision avoidance among a variety of types of dynamic agents without assuming they follow any particular behavior rules. This work also introduces a strategy using LSTM that enables the algorithm to use observations of an arbitrary number of other agents, instead of previous methods that have a fixed observation size. The proposed algorithm outperforms our previous approach in simulation as the number of agents increases, and…

Taxonomy

Methods: Sigmoid Activation · Tanh Activation · Long Short-Term Memory