Automatic Risk Adaptation in Distributional Reinforcement Learning

Frederik Schubert; Theresa Eimer; Bodo Rosenhahn; Marius Lindauer

arXiv:2106.06317·cs.LG·December 7, 2021·1 cites

Automatic Risk Adaptation in Distributional Reinforcement Learning

Frederik Schubert, Theresa Eimer, Bodo Rosenhahn, Marius Lindauer

PDF

Open Access

TL;DR

This paper introduces ARA, a method for dynamically adapting risk levels in distributional RL using RND error, leading to significantly fewer failures and better generalization in various environments.

Contribution

The paper proposes a novel automatic risk adaptation technique that dynamically adjusts risk levels in distributional RL based on environment feedback.

Findings

01

Reduced failure rates by up to 7 times

02

Improved generalization performance by up to 14%

03

Effective in both known and unknown environments

Abstract

The use of Reinforcement Learning (RL) agents in practical applications requires the consideration of suboptimal outcomes, depending on the familiarity of the agent with its environment. This is especially important in safety-critical environments, where errors can lead to high costs or damage. In distributional RL, the risk-sensitivity can be controlled via different distortion measures of the estimated return distribution. However, these distortion functions require an estimate of the risk level, which is difficult to obtain and depends on the current state. In this work, we demonstrate the suboptimality of a static risk level estimation and propose a method to dynamically select risk levels at each environment step. Our method ARA (Automatic Risk Adaptation) estimates the appropriate risk level in both known and unknown environments using a Random Network Distillation error. We show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Multi-Objective Optimization Algorithms · Bayesian Modeling and Causal Inference