Disentangling Uncertainty for Safe Social Navigation using Deep Reinforcement Learning
Daniel Fl\"ogel, Marcos G\'omez Villafa\~ne, Joshua Ransiek, and S\"oren Hohmann

TL;DR
This paper presents a novel deep reinforcement learning framework for safe social robot navigation that estimates different types of uncertainty to improve decision-making and collision avoidance in pedestrian environments.
Contribution
It introduces a method combining aleatoric, epistemic, and predictive uncertainty estimation into DRL for social navigation, using ODV and dropout with PPO.
Findings
Improved training performance with ODV and dropout.
MC-dropout better detects perturbations and correlates with uncertainty types.
Enhanced navigation safety with conservative behaviors in uncertain situations.
Abstract
Autonomous mobile robots are increasingly used in pedestrian-rich environments where safe navigation and appropriate human interaction are crucial. While Deep Reinforcement Learning (DRL) enables socially integrated robot behavior, challenges persist in novel or perturbed scenarios to indicate when and why the policy is uncertain. Unknown uncertainty in decision-making can lead to collisions or human discomfort and is one reason why safe and risk-aware navigation is still an open problem. This work introduces a novel approach that integrates aleatoric, epistemic, and predictive uncertainty estimation into a DRL navigation framework for policy distribution uncertainty estimates. We, therefore, incorporate Observation-Dependent Variance (ODV) and dropout into the Proximal Policy Optimization (PPO) algorithm. For different types of perturbations, we compare the ability of deep ensembles…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications
MethodsEntropy Regularization · Proximal Policy Optimization · Dropout · Deep Ensembles
