Safe Reinforcement Learning with Model Uncertainty Estimates

Bj\"orn L\"utjens; Michael Everett; Jonathan P. How

arXiv:1810.08700·cs.RO·March 4, 2019

Safe Reinforcement Learning with Model Uncertainty Estimates

Bj\"orn L\"utjens, Michael Everett, Jonathan P. How

PDF

TL;DR

This paper introduces a safe reinforcement learning approach that uses computationally efficient model uncertainty estimates to improve robustness and safety in autonomous navigation around pedestrians, especially in unseen scenarios.

Contribution

It integrates MC-Dropout and Bootstrapping for uncertainty estimation into reinforcement learning, enhancing safety and robustness in autonomous systems.

Findings

01

More robust to novel observations

02

Takes safer actions around unseen pedestrians

03

Outperforms uncertainty-unaware baseline

Abstract

Many current autonomous systems are being designed with a strong reliance on black box predictions from deep neural networks (DNNs). However, DNNs tend to be overconfident in predictions on unseen data and can give unpredictable results for far-from-distribution test data. The importance of predictions that are robust to this distributional shift is evident for safety-critical applications, such as collision avoidance around pedestrians. Measures of model uncertainty can be used to identify unseen data, but the state-of-the-art extraction methods such as Bayesian neural networks are mostly intractable to compute. This paper uses MC-Dropout and Bootstrapping to give computationally tractable and parallelizable uncertainty estimates. The methods are embedded in a Safe Reinforcement Learning framework to form uncertainty-aware navigation around pedestrians. The result is a collision…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.