Safe Reinforcement Learning with Dual Robustness

Zeyang Li; Chuxiong Hu; Yunan Wang; Yujie Yang; Shengbo Eben Li

arXiv:2309.06835·cs.LG·September 14, 2023

Safe Reinforcement Learning with Dual Robustness

Zeyang Li, Chuxiong Hu, Yunan Wang, Yujie Yang, Shengbo Eben Li

PDF

Open Access

TL;DR

This paper introduces a unified framework and algorithm for safe and robust reinforcement learning, effectively handling adversarial disturbances while ensuring safety and high performance.

Contribution

It proposes a dual policy iteration scheme within a constrained zero-sum Markov game framework, unifying safe RL and robust RL with proven convergence and a practical deep RL algorithm.

Findings

01

DRAC achieves high performance under various adversarial scenarios.

02

It outperforms baseline methods significantly in safety-critical benchmarks.

03

The convergence of the proposed iteration scheme is theoretically established.

Abstract

Reinforcement learning (RL) agents are vulnerable to adversarial disturbances, which can deteriorate task performance or compromise safety specifications. Existing methods either address safety requirements under the assumption of no adversary (e.g., safe RL) or only focus on robustness against performance adversaries (e.g., robust RL). Learning one policy that is both safe and robust remains a challenging open problem. The difficulty is how to tackle two intertwined aspects in the worst cases: feasibility and optimality. Optimality is only valid inside a feasible region, while identification of maximal feasible region must rely on learning the optimal policy. To address this issue, we propose a systematic framework to unify safe RL and robust RL, including problem formulation, iteration scheme, convergence analysis and practical algorithm design. This unification is built upon…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Safety Systems Engineering in Autonomy · Occupational Health and Safety Research

MethodsFocus