FISAR: Forward Invariant Safe Reinforcement Learning with a Deep Neural Network-Based Optimize
Chuangchuang Sun, Dong-Ki Kim, Jonathan P. How

TL;DR
This paper introduces FISAR, a novel deep neural network-based optimizer for safe reinforcement learning that guarantees forward invariance of safety constraints, ensuring constraint violations decrease monotonically in safety-critical environments.
Contribution
It proposes the first DNN-based optimizer for constrained optimization with forward invariance guarantees, addressing the limitations of classic algorithms in safety-critical RL tasks.
Findings
The optimizer effectively reduces constraint violations in experiments.
It maximizes cumulative reward while maintaining safety constraints.
Validated on numerical optimization and navigation tasks.
Abstract
This paper investigates reinforcement learning with constraints, which are indispensable in safety-critical environments. To drive the constraint violation monotonically decrease, we take the constraints as Lyapunov functions and impose new linear constraints on the policy parameters' updating dynamics. As a result, the original safety set can be forward-invariant. However, because the new guaranteed-feasible constraints are imposed on the updating dynamics instead of the original policy parameters, classic optimization algorithms are no longer applicable. To address this, we propose to learn a generic deep neural network (DNN)-based optimizer to optimize the objective while satisfying the linear constraints. The constraint-satisfaction is achieved via projection onto a polytope formulated by multiple linear inequality constraints, which can be solved analytically with our newly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Reinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety
