Risk-Sensitive Reinforcement Learning with Exponential Criteria
Erfaun Noorani, Christos Mavridis, John Baras

TL;DR
This paper introduces a risk-sensitive reinforcement learning framework using exponential criteria, proposing novel algorithms that enhance robustness and sample efficiency, validated through simulated experiments.
Contribution
It formulates a new risk-sensitive RL problem with exponential criteria and develops a model-free Actor-Critic algorithm based on a multiplicative Bellman equation.
Findings
Improves robustness against environment perturbations
Enhances sample efficiency over traditional methods
Demonstrates effectiveness through simulated experiments
Abstract
While reinforcement learning has shown experimental success in a number of applications, it is known to be sensitive to noise and perturbations in the parameters of the system, leading to high variance in the total reward amongst different episodes in slightly different environments. To introduce robustness, as well as sample efficiency, risk-sensitive reinforcement learning methods are being thoroughly studied. In this work, we provide a definition of robust reinforcement learning policies and formulate a risk-sensitive reinforcement learning problem to approximate them, by solving an optimization problem with respect to a modified objective based on exponential criteria. In particular, we study a model-free risk-sensitive variation of the widely-used Monte Carlo Policy Gradient algorithm and introduce a novel risk-sensitive online Actor-Critic algorithm based on solving a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Energy, Environment, and Transportation Policies · Supply Chain and Inventory Management
