Efficient Algorithms for Robust Markov Decision Processes with $s$-Rectangular Ambiguity Sets
Chin Pang Ho, Marek Petrik, Wolfram Wiesemann

TL;DR
This paper introduces efficient algorithms for solving robust Markov decision processes with $s$-rectangular ambiguity sets, significantly improving computational speed while maintaining solution quality, applicable to various ambiguity measures.
Contribution
The paper develops a unified, fast solution framework for $s$-rectangular robust MDPs with different ambiguity sets, outperforming existing solvers in speed.
Findings
Algorithms solve robust MDPs several orders faster than commercial solvers.
Performance scales favorably on synthetic and benchmark instances.
Solution times are often only logarithmically slower than classical MDPs.
Abstract
Robust Markov decision processes (MDPs) have attracted significant interest due to their ability to protect MDPs from poor out-of-sample performance in the presence of ambiguity. In contrast to classical MDPs, which account for stochasticity by modeling the dynamics through a stochastic process with a known transition kernel, a robust MDP additionally accounts for ambiguity by optimizing against the most adverse transition kernel from an ambiguity set constructed via historical data. In this paper, we develop a unified solution framework for a broad class of robust MDPs with -rectangular ambiguity sets, where the most adverse transition probabilities are considered independently for each state. Using our algorithms, we show that -rectangular robust MDPs with - and -norm as well as -divergence ambiguity sets can be solved several orders of magnitude faster than with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Target Tracking and Data Fusion in Sensor Networks · Simulation Techniques and Applications
