Robust Safe Reinforcement Learning under Adversarial Disturbances

Zeyang Li; Chuxiong Hu; Shengbo Eben Li; Jia Cheng; Yunan Wang

arXiv:2310.07207 · cs.LG · October 12, 2023

TL;DR

This paper introduces a robust safe reinforcement learning framework that ensures safety under worst-case external disturbances by combining Hamilton-Jacobi reachability analysis with a policy iteration scheme, improving safety guarantees in control tasks.

Contribution

The paper proposes a policy iteration algorithm for computing the maximal robust invariant set and integrates it into a constrained RL method to maintain safety under adversarial disturbances.

Findings

01. Achieves zero safety constraint violations under worst-case disturbances.

02. Maintains high reward performance comparable to baseline methods.

03. Ensures safety even without adversarial disturbances.

Abstract

Safety is a primary concern when applying reinforcement learning to real-world control tasks, especially in the presence of external disturbances. However, existing safe reinforcement learning algorithms rarely account for external disturbances, limiting their applicability and robustness in practice. To address this challenge, this paper proposes a robust safe reinforcement learning framework that tackles worst-case disturbances. First, this paper presents a policy iteration scheme to solve for the robust invariant set, i.e., a subset of the safe set, where persistent safety is only possible for states within. The key idea is to establish a two-player zero-sum game by leveraging the safety value function in Hamilton-Jacobi reachability analysis, in which the protagonist (i.e., control inputs) aims to maintain safety and the adversary (i.e., external disturbances) tries to break down…
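The zero-sum game on the safety value function can be illustrated on a toy problem. The sketch below is not the paper's algorithm: it uses plain tabular value iteration instead of the proposed policy iteration scheme, and everything in it is a hypothetical choice made for illustration (the discrete double-integrator grid, the constraint function `h`, the control and disturbance sets, and the sign convention that a non-negative value means safe). It iterates V(x) = min(h(x), max_u min_d V(f(x, u, d))) to a fixed point and reads off the robust invariant set as the states with V(x) >= 0.

```python
import itertools

# Hypothetical toy system: a discrete double integrator on a small grid.
P_MAX, V_MAX = 10, 3
POSITIONS = range(0, P_MAX + 1)
VELOCITIES = range(-V_MAX, V_MAX + 1)
CONTROLS = (-2, -1, 0, 1, 2)   # protagonist (control input)
DISTURBANCES = (-1, 0, 1)      # adversary (external disturbance)


def h(p, v):
    """Constraint margin (assumed convention): h >= 0 means the state is safe.

    Positions 1..9 are safe; positions 0 and 10 are the walls.
    """
    return min(p - 1, (P_MAX - 1) - p)


def step(p, v, u, d):
    """One transition; position and velocity are clipped to the grid."""
    p_next = min(max(p + v, 0), P_MAX)
    v_next = min(max(v + u + d, -V_MAX), V_MAX)
    return p_next, v_next


def safety_value_iteration():
    """Iterate V <- min(h, max_u min_d V(next state)) until nothing changes."""
    states = list(itertools.product(POSITIONS, VELOCITIES))
    V = {s: h(*s) for s in states}  # start from the constraint margin
    while True:
        V_new = {}
        for (p, v) in states:
            # The control maximizes the worst case chosen by the disturbance.
            best = max(
                min(V[step(p, v, u, d)] for d in DISTURBANCES)
                for u in CONTROLS
            )
            V_new[(p, v)] = min(h(p, v), best)
        if V_new == V:
            return V
        V = V_new


V = safety_value_iteration()
safe_set = {s for s in V if h(*s) >= 0}
robust_invariant_set = {s for s, value in V.items() if value >= 0}
```

On this toy grid the robust invariant set is a strict subset of the safe set. For example, the state (9, 1) satisfies the constraint, but its next position is the wall regardless of the control, so it is excluded, while (5, 0) can be kept safe against every disturbance sequence.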

Taxonomy

Topics: Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning