Robustness to Modeling Errors in Risk-Sensitive Markov Decision Problems   with Markov Risk Measures

Shiping Shao; Abhishek Gupta; William B. Haskell

arXiv:2209.12937·math.OC·September 28, 2022

Robustness to Modeling Errors in Risk-Sensitive Markov Decision Problems with Markov Risk Measures

Shiping Shao, Abhishek Gupta, William B. Haskell

PDF

Open Access

TL;DR

This paper investigates the robustness of risk-sensitive Markov decision processes to modeling errors, providing conditions under which small parameter perturbations minimally impact optimal policies and value functions.

Contribution

It introduces sufficient conditions ensuring robustness of risk-sensitive MDPs to model perturbations, extending understanding of decision-making under model uncertainty.

Findings

01

Small model perturbations cause limited changes in optimal policies.

02

Robustness conditions are applicable to data-driven and preference-uncertain systems.

03

Implications for systems with changing noise distributions are discussed.

Abstract

We consider risk-sensitive Markov decision processes (MDPs), where the MDP model is influenced by a parameter which takes values in a compact metric space. We identify sufficient conditions under which small perturbations in the model parameters lead to small changes in the optimal value function and optimal policy. We further establish the robustness of the risk-sensitive optimal policies to modeling errors. Implications of the results for data-driven decision-making, decision-making with preference uncertainty, and systems with changing noise distributions are discussed.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Portfolio Optimization