Collision Avoidance in Pedestrian-Rich Environments with Deep Reinforcement Learning

Michael Everett; Yu Fan Chen; Jonathan P. How

arXiv:1910.11689·cs.RO·January 26, 2021

TL;DR

This paper introduces a deep reinforcement learning approach with LSTM to enable robots to avoid collisions in environments with many heterogeneous, non-communicating pedestrians and robots, outperforming previous methods and generalizing to various applications.

Contribution

It develops a scalable RL algorithm using LSTM to handle arbitrary numbers of agents without assuming specific behaviors, advancing collision avoidance in complex environments.
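To make the idea concrete, here is a minimal sketch of how an LSTM can turn observations of any number of nearby agents into one fixed-size vector that a policy network could consume. It is not the authors' implementation: the class `TinyLSTM`, the function `encode_other_agents`, the four-element `[x, y, vx, vy]` state layout, the hidden size, and the random untrained weights are all assumptions made for illustration. Feeding agents farthest-first, so that the closest agent is processed last, is also an assumption of this sketch.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


class TinyLSTM:
    """Minimal LSTM cell with random, untrained weights (illustration only)."""

    def __init__(self, input_dim, hidden_dim, seed=0):
        rng = np.random.default_rng(seed)
        # One stacked weight matrix for the input, forget, candidate and output gates.
        self.W = rng.normal(0.0, 0.1, size=(4 * hidden_dim, input_dim + hidden_dim))
        self.b = np.zeros(4 * hidden_dim)
        self.hidden_dim = hidden_dim

    def step(self, x, h, c):
        z = self.W @ np.concatenate([x, h]) + self.b
        i, f, g, o = np.split(z, 4)
        c = sigmoid(f) * c + sigmoid(i) * np.tanh(g)
        h = sigmoid(o) * np.tanh(c)
        return h, c

    def encode(self, agent_states):
        # Consume one agent state per step; the final hidden state is the summary.
        h = np.zeros(self.hidden_dim)
        c = np.zeros(self.hidden_dim)
        for x in agent_states:
            h, c = self.step(x, h, c)
        return h


def encode_other_agents(lstm, ego_xy, others):
    """others: array of shape (n_agents, 4) with hypothetical [x, y, vx, vy] rows."""
    others = np.asarray(others, dtype=float)
    dists = np.linalg.norm(others[:, :2] - ego_xy, axis=1)
    order = np.argsort(-dists)  # farthest first, closest last (assumed ordering)
    return lstm.encode(others[order])


lstm = TinyLSTM(input_dim=4, hidden_dim=8)
ego = np.array([0.0, 0.0])
few = encode_other_agents(lstm, ego, np.random.default_rng(1).normal(size=(2, 4)))
many = encode_other_agents(lstm, ego, np.random.default_rng(2).normal(size=(7, 4)))
print(few.shape, many.shape)  # both are (8,), regardless of the number of agents
```

The point of the design is that the downstream network always sees a vector of the same length, whether two or seven agents are observed, so no fixed cap on the number of agents has to be built into the input layer.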

Findings

01. Outperforms classical and previous deep RL algorithms in collision avoidance.

02. Scales effectively with the number of agents, reducing collisions and time to goal.

03. Generalizes to formation control, multirotor fleets, and autonomous vehicles.

Abstract

Collision avoidance algorithms are essential for safe and efficient robot operation among pedestrians. This work proposes using deep reinforcement learning (RL) as a framework to model the complex interactions and cooperation with nearby, decision-making agents, such as pedestrians and other robots. Existing RL-based works assume homogeneity of agent properties, use specific motion models over short timescales, or lack a principled method to handle a large, possibly varying number of agents. Therefore, this work develops an algorithm that learns collision avoidance among a variety of heterogeneous, non-communicating, dynamic agents without assuming they follow any particular behavior rules. It extends our previous work by introducing a strategy using Long Short-Term Memory (LSTM) that enables the algorithm to use observations of an arbitrary number of other agents, instead of a small,…

Taxonomy

Methods: SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Sigmoid Activation · Tanh Activation · Long Short-Term Memory