FISAR: Forward Invariant Safe Reinforcement Learning with a Deep Neural Network-Based Optimizer

Chuangchuang Sun; Dong-Ki Kim; Jonathan P. How

arXiv:2006.11419·cs.LG·May 7, 2021

Open Access

TL;DR

This paper introduces FISAR, a deep neural network-based optimizer for safe reinforcement learning. It makes the safety set forward-invariant, so that constraint violations decrease monotonically during policy updates in safety-critical environments.

Contribution

It proposes the first DNN-based optimizer for constrained optimization with forward invariance guarantees, addressing the limitations of classic algorithms in safety-critical RL tasks.

Findings

01. The optimizer reduces constraint violations in experiments.

02. It maximizes cumulative reward while maintaining the safety constraints.

03. It is validated on numerical optimization and navigation tasks.

Abstract

This paper investigates reinforcement learning with constraints, which are indispensable in safety-critical environments. To drive the constraint violation to decrease monotonically, we treat the constraints as Lyapunov functions and impose new linear constraints on the updating dynamics of the policy parameters. As a result, the original safety set can be made forward-invariant. However, because the new guaranteed-feasible constraints are imposed on the updating dynamics instead of the original policy parameters, classic optimization algorithms are no longer applicable. To address this, we propose to learn a generic deep neural network (DNN)-based optimizer to optimize the objective while satisfying the linear constraints. Constraint satisfaction is achieved via projection onto a polytope formulated by multiple linear inequality constraints, which can be solved analytically with our newly…
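The projection idea in the abstract can be illustrated with a minimal sketch for the simplest case of a single constraint. This is not the paper's method for multiple inequalities, whose details are cut off above. It assumes a constraint function C(θ) ≤ 0, a first-order condition ∇C(θ)·Δθ ≤ −α·C(θ) on the update Δθ, and the standard closed-form Euclidean projection onto one halfspace. The names `project_onto_halfspace`, `safe_step`, and `alpha` are hypothetical and chosen only for illustration.

```python
def project_onto_halfspace(step, normal, offset):
    """Euclidean projection of `step` onto the halfspace {x : normal . x <= offset}."""
    dot = sum(n * s for n, s in zip(normal, step))
    violation = dot - offset
    if violation <= 0.0:
        # The proposed step already satisfies the linear constraint.
        return list(step)
    norm_sq = sum(n * n for n in normal)
    if norm_sq == 0.0:
        raise ValueError("zero normal vector: the halfspace is empty")
    scale = violation / norm_sq
    # Move the step back along the normal until it lies on the boundary.
    return [s - scale * n for s, n in zip(step, normal)]


def safe_step(theta, proposed_step, constraint, constraint_grad, alpha=0.5):
    """Project a proposed update so that grad C(theta) . step <= -alpha * C(theta).

    Assumption (illustrative only): to first order this gives
    C(theta + step) <= (1 - alpha) * C(theta), so a violation C > 0 shrinks
    and a safe point C <= 0 stays safe.
    """
    c = constraint(theta)
    g = constraint_grad(theta)
    return project_onto_halfspace(proposed_step, g, -alpha * c)


# Toy example with a linear constraint C(theta) = theta[0] + theta[1] - 1,
# for which the first-order condition is exact.
constraint = lambda t: t[0] + t[1] - 1.0
constraint_grad = lambda t: [1.0, 1.0]

theta = [1.0, 1.0]            # C(theta) = 1.0, i.e. currently violated
proposed = [0.5, 0.0]         # an update that would increase the violation
step = safe_step(theta, proposed, constraint, constraint_grad, alpha=0.5)
new_theta = [t + s for t, s in zip(theta, step)]
print(step, constraint(new_theta))  # [0.0, -0.5] 0.5
```

In this toy case the violation drops from 1.0 to 0.5, which matches the factor (1 − α) with α = 0.5. In the setting the abstract describes, the proposed step would come from the learned optimizer and the projection would be onto a polytope defined by several such inequalities.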

Taxonomy

Topics: Robotic Path Planning Algorithms · Reinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety