Verified Safe Reinforcement Learning for Neural Network Dynamic Models

Junlin Wu; Huan Zhang; Yevgeniy Vorobeychik

arXiv:2405.15994·cs.LG·November 19, 2024·1 cites

Verified Safe Reinforcement Learning for Neural Network Dynamic Models

Junlin Wu, Huan Zhang, Yevgeniy Vorobeychik

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper presents a new method for training neural network controllers that are both high-performing and formally verified for safety over extended horizons in nonlinear dynamical systems.

Contribution

It introduces a curriculum learning scheme, incremental verification, and multiple controllers to improve verified safety in neural network control policies.

Findings

01

Controllers achieved safety over horizons much longer than previous methods.

02

High reward performance maintained alongside verified safety.

03

Demonstrated effectiveness on five diverse safe control problems.

Abstract

Learning reliably safe autonomous control is one of the core problems in trustworthy autonomy. However, training a controller that can be formally verified to be safe remains a major challenge. We introduce a novel approach for learning verified safe control policies in nonlinear neural dynamical systems while maximizing overall performance. Our approach aims to achieve safety in the sense of finite-horizon reachability proofs, and is comprised of three key parts. The first is a novel curriculum learning scheme that iteratively increases the verified safe horizon. The second leverages the iterative nature of gradient-based learning to leverage incremental verification, reusing information from prior verification runs. Finally, we learn multiple verified initial-state-dependent controllers, an idea that is especially valuable for more complex domains where learning a single universal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jlwu002/vsrl
pytorchOfficial

Videos

Verified Safe Reinforcement Learning for Neural Network Dynamic Models· slideslive

Taxonomy

TopicsNeural Networks and Applications