Risk-Sensitive Reinforcement Learning with Exponential Criteria

Erfaun Noorani; Christos Mavridis; John Baras

arXiv:2212.09010·eess.SY·December 2, 2024·1 cites

Risk-Sensitive Reinforcement Learning with Exponential Criteria

Erfaun Noorani, Christos Mavridis, John Baras

PDF

Open Access

TL;DR

This paper introduces a risk-sensitive reinforcement learning framework using exponential criteria, proposing novel algorithms that enhance robustness and sample efficiency, validated through simulated experiments.

Contribution

It formulates a new risk-sensitive RL problem with exponential criteria and develops a model-free Actor-Critic algorithm based on a multiplicative Bellman equation.

Findings

01

Improves robustness against environment perturbations

02

Enhances sample efficiency over traditional methods

03

Demonstrates effectiveness through simulated experiments

Abstract

While reinforcement learning has shown experimental success in a number of applications, it is known to be sensitive to noise and perturbations in the parameters of the system, leading to high variance in the total reward amongst different episodes in slightly different environments. To introduce robustness, as well as sample efficiency, risk-sensitive reinforcement learning methods are being thoroughly studied. In this work, we provide a definition of robust reinforcement learning policies and formulate a risk-sensitive reinforcement learning problem to approximate them, by solving an optimization problem with respect to a modified objective based on exponential criteria. In particular, we study a model-free risk-sensitive variation of the widely-used Monte Carlo Policy Gradient algorithm and introduce a novel risk-sensitive online Actor-Critic algorithm based on solving a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Energy, Environment, and Transportation Policies · Supply Chain and Inventory Management