Risk-Sensitive Policy with Distributional Reinforcement Learning

Thibaut Th\'eate; Damien Ernst

arXiv:2212.14743·cs.LG·January 2, 2023

Risk-Sensitive Policy with Distributional Reinforcement Learning

Thibaut Th\'eate, Damien Ernst

PDF

Open Access 1 Repo

TL;DR

This paper introduces a risk-sensitive reinforcement learning approach using distributional RL to incorporate risk considerations into decision policies, balancing expected return and risk in a practical, interpretable manner.

Contribution

It proposes a novel risk-based utility function derived from the return distribution, enabling risk-sensitive policies with minimal modifications to existing distributional RL algorithms.

Findings

01

Effective risk-sensitive policies learned with distributional RL

02

Balances risk minimization and return maximization

03

Provides interpretable decision-making framework

Abstract

Classical reinforcement learning (RL) techniques are generally concerned with the design of decision-making policies driven by the maximisation of the expected outcome. Nevertheless, this approach does not take into consideration the potential risk associated with the actions taken, which may be critical in certain applications. To address that issue, the present research work introduces a novel methodology based on distributional RL to derive sequential decision-making policies that are sensitive to the risk, the latter being modelled by the tail of the return probability distribution. The core idea is to replace the $Q$ function generally standing at the core of learning schemes in RL by another function taking into account both the expected return and the risk. Named the risk-based utility function $U$ , it can be extracted from the random return distribution $Z$ naturally learnt by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

thibauttheate/risk-sensitive-policy-with-distributional-reinforcement-learning
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSupply Chain and Inventory Management · Auction Theory and Applications · Reinforcement Learning in Robotics