Safe Deep Model-Based Reinforcement Learning with Lyapunov Functions

Harry Zhang

arXiv:2405.16184·eess.SY·May 28, 2024

Safe Deep Model-Based Reinforcement Learning with Lyapunov Functions

Harry Zhang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel model-based reinforcement learning framework that incorporates Lyapunov functions to ensure safety and stability during training and policy execution, with theoretical guarantees and demonstrated effectiveness.

Contribution

It presents a new stability-augmented RL framework that learns Lyapunov functions and enforces safety constraints during policy learning, a significant advancement over prior methods.

Findings

01

Successfully enforces safety constraints during training

02

Provides mathematically provable stability guarantees

03

Demonstrates effectiveness through simulated experiments

Abstract

Model-based Reinforcement Learning (MBRL) has shown many desirable properties for intelligent control tasks. However, satisfying safety and stability constraints during training and rollout remains an open question. We propose a new Model-based RL framework to enable efficient policy learning with unknown dynamics based on learning model predictive control (LMPC) framework with mathematically provable guarantees of stability. We introduce and explore a novel method for adding safety constraints for model-based RL during training and policy learning. The new stability-augmented framework consists of a neural-network-based learner that learns to construct a Lyapunov function, and a model-based RL agent to consistently complete the tasks while satisfying user-specified constraints given only sub-optimal demonstrations and sparse-cost feedback. We demonstrate the capability of the proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

harryzhangOG/salved
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics