Bounding the Difference between the Values of Robust and Non-Robust   Markov Decision Problems

Ariel Neufeld; Julian Sester

arXiv:2308.05520·math.OC·April 30, 2025·J. Appl. Probab.

Bounding the Difference between the Values of Robust and Non-Robust Markov Decision Problems

Ariel Neufeld, Julian Sester

PDF

Open Access

TL;DR

This paper establishes a dimension-free upper bound on the difference in value functions between distributionally robust and non-robust Markov decision processes, based on Wasserstein-ball ambiguity sets.

Contribution

It provides a novel, linear, and dimension-free upper bound for the value difference in robust MDPs with Wasserstein ambiguity sets.

Findings

01

Upper bound is dimension-free

02

Difference depends linearly on Wasserstein radius

03

Applicable to distributionally robust MDPs

Abstract

In this note we provide an upper bound for the difference between the value function of a distributionally robust Markov decision problem and the value function of a non-robust Markov decision problem, where the ambiguity set of probability kernels of the distributionally robust Markov decision process is described by a Wasserstein-ball around some reference kernel whereas the non-robust Markov decision process behaves according to a fixed probability kernel contained in the ambiguity set. Our derived upper bound for the difference between the value functions is dimension-free and depends linearly on the radius of the Wasserstein-ball.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Portfolio Optimization · Probabilistic and Robust Engineering Design · Risk and Safety Analysis