Automatic Risk Adaptation in Distributional Reinforcement Learning
Frederik Schubert, Theresa Eimer, Bodo Rosenhahn, Marius Lindauer

TL;DR
This paper introduces ARA, a method for dynamically adapting risk levels in distributional RL using RND error, leading to significantly fewer failures and better generalization in various environments.
Contribution
The paper proposes a novel automatic risk adaptation technique that dynamically adjusts risk levels in distributional RL based on environment feedback.
Findings
Reduced failure rates by up to 7 times
Improved generalization performance by up to 14%
Effective in both known and unknown environments
Abstract
The use of Reinforcement Learning (RL) agents in practical applications requires the consideration of suboptimal outcomes, depending on the familiarity of the agent with its environment. This is especially important in safety-critical environments, where errors can lead to high costs or damage. In distributional RL, the risk-sensitivity can be controlled via different distortion measures of the estimated return distribution. However, these distortion functions require an estimate of the risk level, which is difficult to obtain and depends on the current state. In this work, we demonstrate the suboptimality of a static risk level estimation and propose a method to dynamically select risk levels at each environment step. Our method ARA (Automatic Risk Adaptation) estimates the appropriate risk level in both known and unknown environments using a Random Network Distillation error. We show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Multi-Objective Optimization Algorithms · Bayesian Modeling and Causal Inference
