Disentangling Uncertainty for Safe Social Navigation using Deep Reinforcement Learning

Daniel Fl\"ogel; Marcos G\'omez Villafa\~ne; Joshua Ransiek; and S\"oren Hohmann

arXiv:2409.10655·cs.RO·July 10, 2025

Disentangling Uncertainty for Safe Social Navigation using Deep Reinforcement Learning

Daniel Fl\"ogel, Marcos G\'omez Villafa\~ne, Joshua Ransiek, and S\"oren Hohmann

PDF

Open Access

TL;DR

This paper presents a novel deep reinforcement learning framework for safe social robot navigation that estimates different types of uncertainty to improve decision-making and collision avoidance in pedestrian environments.

Contribution

It introduces a method combining aleatoric, epistemic, and predictive uncertainty estimation into DRL for social navigation, using ODV and dropout with PPO.

Findings

01

Improved training performance with ODV and dropout.

02

MC-dropout better detects perturbations and correlates with uncertainty types.

03

Enhanced navigation safety with conservative behaviors in uncertain situations.

Abstract

Autonomous mobile robots are increasingly used in pedestrian-rich environments where safe navigation and appropriate human interaction are crucial. While Deep Reinforcement Learning (DRL) enables socially integrated robot behavior, challenges persist in novel or perturbed scenarios to indicate when and why the policy is uncertain. Unknown uncertainty in decision-making can lead to collisions or human discomfort and is one reason why safe and risk-aware navigation is still an open problem. This work introduces a novel approach that integrates aleatoric, epistemic, and predictive uncertainty estimation into a DRL navigation framework for policy distribution uncertainty estimates. We, therefore, incorporate Observation-Dependent Variance (ODV) and dropout into the Proximal Policy Optimization (PPO) algorithm. For different types of perturbations, we compare the ability of deep ensembles…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications

MethodsEntropy Regularization · Proximal Policy Optimization · Dropout · Deep Ensembles