# Uncertainty-Based Out-of-Distribution Detection in Deep Reinforcement   Learning

**Authors:** Andreas Sedlmeier, Thomas Gabor, Thomy Phan, Lenz Belzner, Claudia, Linnhoff-Popien

arXiv: 1901.02219 · 2019-01-09

## TL;DR

This paper investigates how uncertainty estimation techniques, especially bootstrap-based methods, can effectively detect out-of-distribution samples in deep reinforcement learning, highlighting their reliability over dropout-based approaches.

## Contribution

It analyzes the effectiveness of Bayesian inference and ensembling methods for OOD detection in deep RL, emphasizing bootstrap approaches' superior reliability.

## Key findings

- Bootstrap-based methods produce more reliable uncertainty estimates.
- Uncertainty estimation effectively differentiates in- from out-of-distribution samples.
- Dropout-based methods are less reliable for OOD detection in deep RL.

## Abstract

We consider the problem of detecting out-of-distribution (OOD) samples in deep reinforcement learning. In a value based reinforcement learning setting, we propose to use uncertainty estimation techniques directly on the agent's value estimating neural network to detect OOD samples. The focus of our work lies in analyzing the suitability of approximate Bayesian inference methods and related ensembling techniques that generate uncertainty estimates. Although prior work has shown that dropout-based variational inference techniques and bootstrap-based approaches can be used to model epistemic uncertainty, the suitability for detecting OOD samples in deep reinforcement learning remains an open question. Our results show that uncertainty estimation can be used to differentiate in- from out-of-distribution samples. Over the complete training process of the reinforcement learning agents, bootstrap-based approaches tend to produce more reliable epistemic uncertainty estimates, when compared to dropout-based approaches.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1901.02219/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1901.02219/full.md

## References

26 references — full list in the complete paper: https://tomesphere.com/paper/1901.02219/full.md

---
Source: https://tomesphere.com/paper/1901.02219