Reachability Constrained Reinforcement Learning
Dongjie Yu, Haitong Ma, Shengbo Eben Li, Jianyu Chen

TL;DR
This paper introduces Reachability CRL (RCRL), a novel method that uses reachability analysis to define and guarantee safety constraints in reinforcement learning, ensuring persistent safety and optimal feasible sets.
Contribution
The paper proposes a new RCRL approach that leverages reachability analysis to accurately characterize feasible sets and guarantees convergence to a local optimum with safety constraints.
Findings
RCRL accurately learns feasible sets and policies.
RCRL outperforms existing CRL and safe control methods.
Empirical results confirm constraint satisfaction and policy effectiveness.
Abstract
Constrained reinforcement learning (CRL) has gained significant interest recently, since safety constraints satisfaction is critical for real-world problems. However, existing CRL methods constraining discounted cumulative costs generally lack rigorous definition and guarantee of safety. In contrast, in the safe control research, safety is defined as persistently satisfying certain state constraints. Such persistent safety is possible only on a subset of the state space, called feasible set, where an optimal largest feasible set exists for a given environment. Recent studies incorporate feasible sets into CRL with energy-based methods such as control barrier function (CBF), safety index (SI), and leverage prior conservative estimations of feasible sets, which harms the performance of the learned policy. To deal with this problem, this paper proposes the reachability CRL (RCRL) method by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Energy Efficiency and Management · Software Reliability and Analysis Research
