Distributionally Robust Constrained Reinforcement Learning under Strong Duality
Zhengfei Zhang, Kishan Panaganti, Laixi Shi, Yanan Sui, Adam Wierman,, Yisong Yue

TL;DR
This paper introduces a novel algorithmic framework for distributionally robust constrained reinforcement learning (DRC-RL) that guarantees convergence and handles environmental uncertainties and constraints simultaneously.
Contribution
We develop the first end-to-end convergent algorithm for DRC-RL using strong duality, revealing structural challenges that hinder existing iterative methods.
Findings
First provably convergent algorithm for DRC-RL
Structural insights into distributional robustness and constraints
Experimental validation on a car racing benchmark
Abstract
We study the problem of Distributionally Robust Constrained RL (DRC-RL), where the goal is to maximize the expected reward subject to environmental distribution shifts and constraints. This setting captures situations where training and testing environments differ, and policies must satisfy constraints motivated by safety or limited budgets. Despite significant progress toward algorithm design for the separate problems of distributionally robust RL and constrained RL, there do not yet exist algorithms with end-to-end convergence guarantees for DRC-RL. We develop an algorithmic framework based on strong duality that enables the first efficient and provable solution in a class of environmental uncertainties. Further, our framework exposes an inherent structure of DRC-RL that arises from the combination of distributional robustness and constraints, which prevents a popular class of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Dynamic Programming Control · Machine Learning and ELM · Reinforcement Learning in Robotics
