Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey

Milan Ganai; Sicun Gao; Sylvia Herbert

arXiv:2407.09645·eess.SY·August 23, 2024

Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey

Milan Ganai, Sicun Gao, Sylvia Herbert

PDF

Open Access

TL;DR

This survey reviews recent advances in Hamilton-Jacobi reachability methods integrated with reinforcement learning, highlighting scalable techniques for safety verification and policy improvement in high-dimensional systems.

Contribution

It provides a comprehensive overview of recent methods that enable scalable HJ reachability analysis within reinforcement learning frameworks for complex systems.

Findings

01

Recent methods improve scalability of HJ reachability analysis

02

HJ reachability enhances safety guarantees in RL policies

03

Applications include dynamic obstacle avoidance and vision-based control

Abstract

Recent literature has proposed approaches that learn control policies with high performance while maintaining safety guarantees. Synthesizing Hamilton-Jacobi (HJ) reachable sets has become an effective tool for verifying safety and supervising the training of reinforcement learning-based control policies for complex, high-dimensional systems. Previously, HJ reachability was restricted to verifying low-dimensional dynamical systems primarily because the computational complexity of the dynamic programming approach it relied on grows exponentially with the number of system states. In recent years, a litany of proposed methods addresses this limitation by computing the reachability value function simultaneously with learning control policies to scale HJ reachability analysis while still maintaining a reliable estimate of the true reachable set. These HJ reachability approximations are used…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Supply Chain and Inventory Management