Dual Formulation for Non-Rectangular Lp Robust Markov Decision Processes
Navdeep Kumar, Adarsh Gupta, Maxence Mohamed Elfatihi, Giorgia, Ramponi, Kfir Yehuda Levy, Shie Mannor

TL;DR
This paper introduces a dual formulation for non-rectangular Lp robust Markov decision processes, enabling efficient policy evaluation and revealing adversarial strategies, thus advancing robust decision-making under complex uncertainties.
Contribution
It develops a novel dual formulation for non-rectangular Lp RMDPs, facilitating the first robust policy evaluation algorithms for these complex models.
Findings
Our approach outperforms brute-force methods in empirical tests.
The structural decomposition simplifies analysis of non-rectangular uncertainty sets.
The dual formulation provides new insights into adversarial strategies in RMDPs.
Abstract
We study robust Markov decision processes (RMDPs) with non-rectangular uncertainty sets, which capture interdependencies across states unlike traditional rectangular models. While non-rectangular robust policy evaluation is generally NP-hard, even in approximation, we identify a powerful class of -bounded uncertainty sets that avoid these complexity barriers due to their structural simplicity. We further show that this class can be decomposed into infinitely many \texttt{sa}-rectangular -bounded sets and leverage its structural properties to derive a novel dual formulation for RMDPs. This formulation provides key insights into the adversary's strategy and enables the development of the first robust policy evaluation algorithms for non-rectangular RMDPs. Empirical results demonstrate that our approach significantly outperforms brute-force methods, establishing a promising…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSimulation Techniques and Applications · Fault Detection and Control Systems · Software Reliability and Analysis Research
