Time-Constrained Robust MDPs
Adil Zouitine, David Bertoin, Pierre Clavier, Matthieu Geist, Emmanuel, Rachelson

TL;DR
This paper introduces a novel time-constrained robust MDP framework that models correlated, time-dependent disturbances, leading to more realistic and less conservative policies in robust reinforcement learning.
Contribution
It proposes a new TC-RMDP formulation that relaxes rectangularity assumptions and develops three algorithms evaluated on continuous control benchmarks.
Findings
Algorithms outperform traditional methods in time-constrained environments.
The approach balances performance and robustness effectively.
It extends the analytical framework for robust RL beyond conventional assumptions.
Abstract
Robust reinforcement learning is essential for deploying reinforcement learning algorithms in real-world scenarios where environmental uncertainty predominates. Traditional robust reinforcement learning often depends on rectangularity assumptions, where adverse probability measures of outcome states are assumed to be independent across different states and actions. This assumption, rarely fulfilled in practice, leads to overly conservative policies. To address this problem, we introduce a new time-constrained robust MDP (TC-RMDP) formulation that considers multifactorial, correlated, and time-dependent disturbances, thus more accurately reflecting real-world dynamics. This formulation goes beyond the conventional rectangularity paradigm, offering new perspectives and expanding the analytical framework for robust RL. We propose three distinct algorithms, each using varying levels of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsFormal Methods in Verification · Advanced Control Systems Optimization · Robotic Path Planning Algorithms
