Robust Risk-Sensitive Reinforcement Learning with Conditional Value-at-Risk

Xinyi Ni; Lifeng Lai

arXiv:2405.01718 · cs.LG · May 6, 2024

Open Access

TL;DR

This paper extends risk-sensitive reinforcement learning to robust settings using CVaR within RMDPs, introducing new ambiguity sets and algorithms to handle decision-dependent uncertainties.

Contribution

It introduces NCVaR, a new risk measure for state-action-dependent ambiguity sets, and develops value iteration algorithms for robust CVaR optimization.

Findings

01. Validated approach through simulation experiments.

02. Established connection between robustness and risk sensitivity.

03. Proposed algorithms effectively handle decision-dependent uncertainties.

Abstract

Robust Markov Decision Processes (RMDPs) have received significant research interest, offering an alternative to standard Markov Decision Processes (MDPs), which often assume fixed transition probabilities. RMDPs relax this assumption by optimizing for the worst-case scenarios within ambiguity sets. Earlier studies on RMDPs have largely centered on risk-neutral reinforcement learning (RL), with the goal of minimizing expected total discounted costs; in this paper, we analyze the robustness of CVaR-based risk-sensitive RL under RMDPs. First, we consider predetermined ambiguity sets. Based on the coherency of CVaR, we establish a connection between robustness and risk sensitivity, so that techniques in risk-sensitive RL can be adopted to solve the proposed problem. Furthermore, motivated by the existence of decision-dependent uncertainty in real-world problems, we study problems with…
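To make the link between robustness and risk sensitivity concrete, here is a minimal Python sketch for a single discrete cost distribution. It is not the paper's algorithm. It relies on the standard fact that CVaR, being a coherent risk measure, has a dual form as a worst-case expectation over reweighted distributions. The function names `cvar_primal` and `cvar_dual`, the toy numbers, and the convention that `alpha` is the tail probability of the highest costs are all assumptions made for illustration.

```python
def cvar_primal(costs, probs, alpha):
    """Rockafellar-Uryasev form: min over t of t + E[(X - t)^+] / alpha.

    The objective is piecewise linear and convex in t, so for a discrete
    distribution it is enough to evaluate it at the atoms.
    """
    def objective(t):
        tail = sum(p * max(x - t, 0.0) for x, p in zip(costs, probs))
        return t + tail / alpha

    return min(objective(t) for t in costs)


def cvar_dual(costs, probs, alpha):
    """Dual (robust) form: worst-case expected cost over distributions q
    with q_i <= p_i / alpha and sum(q) == 1.

    The maximizer is greedy: load as much mass as allowed onto the
    highest costs first.
    """
    remaining = 1.0
    total = 0.0
    for x, p in sorted(zip(costs, probs), reverse=True):
        q = min(p / alpha, remaining)
        total += q * x
        remaining -= q
        if remaining <= 1e-12:
            break
    return total


# Hypothetical toy distribution over three cost outcomes.
costs = [1.0, 2.0, 10.0]
probs = [0.5, 0.4, 0.1]
alpha = 0.2  # assumed convention: average over the worst 20% of outcomes

print(cvar_primal(costs, probs, alpha), cvar_dual(costs, probs, alpha))
```

The two values agree (6 for this toy example, up to floating-point rounding): evaluating a risk-sensitive objective is the same as taking a worst case over a particular ambiguity set of reweighted distributions. With `alpha = 1` both functions reduce to the ordinary expectation.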

Taxonomy

Topics: Reinforcement Learning in Robotics