TL;DR
This paper introduces a deep reinforcement learning approach with LSTM to enable robots to avoid collisions in environments with many heterogeneous, non-communicating pedestrians and robots, outperforming previous methods and generalizing to various applications.
Contribution
It develops a scalable RL algorithm using LSTM to handle arbitrary numbers of agents without assuming specific behaviors, advancing collision avoidance in complex environments.
Findings
Outperforms classical and previous deep RL algorithms in collision avoidance.
Scales effectively with the number of agents, reducing collisions and time to goal.
Generalizes to formation control, multirotor fleets, and autonomous vehicles.
Abstract
Collision avoidance algorithms are essential for safe and efficient robot operation among pedestrians. This work proposes using deep reinforcement (RL) learning as a framework to model the complex interactions and cooperation with nearby, decision-making agents, such as pedestrians and other robots. Existing RL-based works assume homogeneity of agent properties, use specific motion models over short timescales, or lack a principled method to handle a large, possibly varying number of agents. Therefore, this work develops an algorithm that learns collision avoidance among a variety of heterogeneous, non-communicating, dynamic agents without assuming they follow any particular behavior rules. It extends our previous work by introducing a strategy using Long Short-Term Memory (LSTM) that enables the algorithm to use observations of an arbitrary number of other agents, instead of a small,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Sigmoid Activation · Tanh Activation · Long Short-Term Memory
