Time-Constrained Robust MDPs

Adil Zouitine; David Bertoin; Pierre Clavier; Matthieu Geist; Emmanuel; Rachelson

arXiv:2406.08395·cs.LG·June 13, 2024·1 cites

Time-Constrained Robust MDPs

Adil Zouitine, David Bertoin, Pierre Clavier, Matthieu Geist, Emmanuel, Rachelson

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel time-constrained robust MDP framework that models correlated, time-dependent disturbances, leading to more realistic and less conservative policies in robust reinforcement learning.

Contribution

It proposes a new TC-RMDP formulation that relaxes rectangularity assumptions and develops three algorithms evaluated on continuous control benchmarks.

Findings

01

Algorithms outperform traditional methods in time-constrained environments.

02

The approach balances performance and robustness effectively.

03

It extends the analytical framework for robust RL beyond conventional assumptions.

Abstract

Robust reinforcement learning is essential for deploying reinforcement learning algorithms in real-world scenarios where environmental uncertainty predominates. Traditional robust reinforcement learning often depends on rectangularity assumptions, where adverse probability measures of outcome states are assumed to be independent across different states and actions. This assumption, rarely fulfilled in practice, leads to overly conservative policies. To address this problem, we introduce a new time-constrained robust MDP (TC-RMDP) formulation that considers multifactorial, correlated, and time-dependent disturbances, thus more accurately reflecting real-world dynamics. This formulation goes beyond the conventional rectangularity paradigm, offering new perspectives and expanding the analytical framework for robust RL. We propose three distinct algorithms, each using varying levels of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Time-Constrained Robust MDPs· slideslive

Taxonomy

TopicsFormal Methods in Verification · Advanced Control Systems Optimization · Robotic Path Planning Algorithms