Certifying Hamilton-Jacobi Reachability Learned via Reinforcement Learning

Prashant Solanki; Isabelle El-Hajj; Jasper J. van Beers; Erik-Jan van Kampen; Coen C. de Visser

arXiv:2602.16475·eess.SY·February 19, 2026

Certifying Hamilton-Jacobi Reachability Learned via Reinforcement Learning

Prashant Solanki, Isabelle El-Hajj, Jasper J. van Beers, Erik-Jan van Kampen, Coen C. de Visser

PDF

Open Access

TL;DR

This paper introduces a framework to certify Hamilton-Jacobi reachability learned via reinforcement learning by converting value errors into precise safety enclosures, ensuring correctness and safety guarantees.

Contribution

It develops a novel certification method linking RL value errors to HJ PDE offsets, enabling formal safety guarantees for learned value functions.

Findings

01

Successfully certifies a learned value function for a double-integrator system.

02

Provides formal inner and outer safety enclosures using SMT.

03

Demonstrates compatibility with deep reinforcement learning.

Abstract

We present a framework to \emph{certify} Hamilton--Jacobi (HJ) reachability learned by reinforcement learning (RL). Building on a discounted initial time \emph{travel-cost} formulation that makes small-step RL value iteration provably equivalent to a forward Hamilton--Jacobi (HJ) equation with damping, we convert certified learning errors into calibrated inner/outer enclosures of strict backward reachable tube. The core device is an additive-offset identity: if $W_{λ}$ solves the discounted travel-cost Hamilton--Jacobi--Bellman (HJB) equation, then $W_{ε} := W_{λ} + ε$ solves the same PDE with a constant offset $λ ε$ . This means that a uniform value error is \emph{exactly} equal to a constant HJB offset. We establish this uniform value error via two routes: (A) a Bellman operator-residual bound, and (B) a HJB PDE-slack bound. Our framework…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Formal Methods in Verification · Reinforcement Learning in Robotics