Implicit Safe Set Algorithm for Provably Safe Reinforcement Learning

Weiye Zhao; Feihan Li; Changliu Liu

arXiv:2405.02754·cs.RO·August 15, 2025

TL;DR

This paper introduces a model-free safe control algorithm for reinforcement learning that guarantees safety during training by querying only a black-box dynamics function. It addresses a key obstacle to deploying DRL in real-world, safety-critical applications.

Contribution

The paper proposes the implicit safe set algorithm, a novel method that ensures provable safety for DRL agents without requiring explicit system models, using only black-box queries.
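To make the idea of a safeguard built from black-box queries concrete, here is a minimal Python sketch. It is not the authors' implementation. The toy dynamics `black_box_step`, the safety index `safety_index`, the decrease margin `eta`, the acceptance rule "next index at most max(current index minus eta, 0)", and the plain grid search over candidate actions are all assumptions chosen for illustration. The safeguard only ever calls the dynamics function and never inspects its internals.

```python
import numpy as np

def black_box_step(state, action, dt=0.1):
    """Hypothetical black-box dynamics: a 1-D point mass (position, velocity).

    The safeguard below only calls this function; it never uses its equations.
    """
    pos, vel = state
    new_vel = vel + action * dt
    new_pos = pos + new_vel * dt
    return np.array([new_pos, new_vel])

def safety_index(state, wall=1.0, k=0.5):
    """Hypothetical safety index: values <= 0 are treated as safe.

    It grows as the point approaches the wall or moves toward it faster.
    """
    pos, vel = state
    return (pos - wall) + k * vel

def safeguard(state, ref_action, candidates, eta=0.1):
    """Return the candidate closest to ref_action whose queried next state
    satisfies the assumed acceptance rule; fall back to the least unsafe one."""
    bound = max(safety_index(state) - eta, 0.0)

    def phi_next(action):
        # One black-box query per candidate action.
        return safety_index(black_box_step(state, action))

    # Keep the policy's action when it already passes the check.
    if phi_next(ref_action) <= bound:
        return float(ref_action)

    scores = np.array([phi_next(a) for a in candidates])
    feasible = candidates[scores <= bound]
    if feasible.size > 0:
        return float(feasible[np.argmin(np.abs(feasible - ref_action))])
    # No candidate passes: pick the one with the smallest next safety index.
    return float(candidates[np.argmin(scores)])

candidates = np.linspace(-5.0, 5.0, 21)
unsafe_state = np.array([0.9, 1.0])   # close to the wall and moving toward it
print(safeguard(unsafe_state, ref_action=2.0, candidates=candidates))
```

In this toy setting the policy's acceleration toward the wall is rejected and replaced by the mildest braking action on the grid that passes the check. A real system would need a safety index and a search procedure suited to its own state and action spaces.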

Findings

1. Guarantees finite-time convergence to the safe set.
2. Achieves zero safety violations on the Safety Gym benchmark.
3. Scales efficiently to high-dimensional systems.

Abstract

Deep reinforcement learning (DRL) has demonstrated remarkable performance in many continuous control tasks. However, a significant obstacle to the real-world application of DRL is the lack of safety guarantees. Although DRL agents can satisfy system safety in expectation through reward shaping, designing agents to consistently meet hard constraints (e.g., safety specifications) at every time step remains a formidable challenge. In contrast, existing work in the field of safe control provides guarantees on persistent satisfaction of hard safety constraints. However, these methods require explicit analytical system dynamics models to synthesize safe control, which are typically inaccessible in DRL settings. In this paper, we present a model-free safe control algorithm, the implicit safe set algorithm, for synthesizing safeguards for DRL agents that ensure provable safety throughout…

Taxonomy

Topics: Elevator Systems and Control · Reinforcement Learning in Robotics · Fuzzy Logic and Control Systems

Methods: Sparse Evolutionary Training