Risk Sensitive Model-Based Reinforcement Learning using Uncertainty   Guided Planning

Stefan Radic Webster; Peter Flach

arXiv:2111.04972·cs.LG·November 10, 2021

Risk Sensitive Model-Based Reinforcement Learning using Uncertainty Guided Planning

Stefan Radic Webster, Peter Flach

PDF

Open Access

TL;DR

This paper introduces a risk-sensitive model-based reinforcement learning method that uses uncertainty-guided planning with bootstrap ensembles to promote safe actions in high-risk environments, balancing risk and reward.

Contribution

It presents a novel uncertainty-guided cross-entropy planning approach that penalizes high-variance predictions, enhancing safety without explicit constraints.

Findings

01

Agent identifies uncertain regions during planning

02

Actions are guided to low-uncertainty areas

03

Trade-off observed between risk and reward

Abstract

Identifying uncertainty and taking mitigating actions is crucial for safe and trustworthy reinforcement learning agents, especially when deployed in high-risk environments. In this paper, risk sensitivity is promoted in a model-based reinforcement learning algorithm by exploiting the ability of a bootstrap ensemble of dynamics models to estimate environment epistemic uncertainty. We propose uncertainty guided cross-entropy method planning, which penalises action sequences that result in high variance state predictions during model rollouts, guiding the agent to known areas of the state space with low uncertainty. Experiments display the ability for the agent to identify uncertain regions of the state space during planning and to take actions that maintain the agent within high confidence areas, without the requirement of explicit constraints. The result is a reduction in the performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning