Joint Synthesis of Safety Certificate and Safe Control Policy using Constrained Reinforcement Learning
Haitong Ma, Changliu Liu, Shengbo Eben Li, Sifa Zheng, Jianyu Chen

TL;DR
This paper introduces a novel constrained reinforcement learning method that jointly synthesizes safety certificates and safe control policies without prior knowledge, ensuring provably safe policies in safety-critical systems.
Contribution
It proposes a new joint synthesis approach that optimizes safety certificates and control policies simultaneously using CRL, without relying on prior models or perfect certificates.
Findings
Learns provably safe policies with no constraint violations
Jointly optimizes safety certificates and control policies
Validated on multiple safety-critical benchmarks
Abstract
Safety is the major consideration in controlling complex dynamical systems using reinforcement learning (RL), where the safety certificate can provide provable safety guarantee. A valid safety certificate is an energy function indicating that safe states are with low energy, and there exists a corresponding safe control policy that allows the energy function to always dissipate. The safety certificate and the safe control policy are closely related to each other and both challenging to synthesize. Therefore, existing learning-based studies treat either of them as prior knowledge to learn the other, which limits their applicability with general unknown dynamics. This paper proposes a novel approach that simultaneously synthesizes the energy-function-based safety certificate and learns the safe control policy with CRL. We do not rely on prior knowledge about either an available…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCardiac electrophysiology and arrhythmias · Muscle activation and electromyography studies
