Distributionally Robust Constrained Reinforcement Learning under Strong   Duality

Zhengfei Zhang; Kishan Panaganti; Laixi Shi; Yanan Sui; Adam Wierman,; Yisong Yue

arXiv:2406.15788·cs.LG·June 25, 2024

Distributionally Robust Constrained Reinforcement Learning under Strong Duality

Zhengfei Zhang, Kishan Panaganti, Laixi Shi, Yanan Sui, Adam Wierman,, Yisong Yue

PDF

Open Access

TL;DR

This paper introduces a novel algorithmic framework for distributionally robust constrained reinforcement learning (DRC-RL) that guarantees convergence and handles environmental uncertainties and constraints simultaneously.

Contribution

We develop the first end-to-end convergent algorithm for DRC-RL using strong duality, revealing structural challenges that hinder existing iterative methods.

Findings

01

First provably convergent algorithm for DRC-RL

02

Structural insights into distributional robustness and constraints

03

Experimental validation on a car racing benchmark

Abstract

We study the problem of Distributionally Robust Constrained RL (DRC-RL), where the goal is to maximize the expected reward subject to environmental distribution shifts and constraints. This setting captures situations where training and testing environments differ, and policies must satisfy constraints motivated by safety or limited budgets. Despite significant progress toward algorithm design for the separate problems of distributionally robust RL and constrained RL, there do not yet exist algorithms with end-to-end convergence guarantees for DRC-RL. We develop an algorithmic framework based on strong duality that enables the first efficient and provable solution in a class of environmental uncertainties. Further, our framework exposes an inherent structure of DRC-RL that arises from the combination of distributional robustness and constraints, which prevents a popular class of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control · Machine Learning and ELM · Reinforcement Learning in Robotics