A reinforcement learning method with closed-loop stability guarantee

Pavel Osinenko; Lukas Beckenbach; Thomas G\"ohrt; Stefan Streif

arXiv:2006.14034·math.OC·June 26, 2020

A reinforcement learning method with closed-loop stability guarantee

Pavel Osinenko, Lukas Beckenbach, Thomas G\"ohrt, Stefan Streif

PDF

2 Repos

TL;DR

This paper introduces a reinforcement learning method that guarantees semi-global stability of control systems by integrating a control Lyapunov function into the RL scheme, ensuring system stability during learning.

Contribution

It develops an online RL scheme that guarantees practical stability using a control Lyapunov function within a Lyapunov-like constraint framework.

Findings

01

Ensures closed-loop stability with RL in control systems.

02

Optimizes cost function while maintaining stability.

03

Validated on a non-holonomic integrator case study.

Abstract

Reinforcement learning (RL) in the context of control systems offers wide possibilities of controller adaptation. Given an infinite-horizon cost function, the so-called critic of RL approximates it with a neural net and sends this information to the controller (called "actor"). However, the issue of closed-loop stability under an RL-method is still not fully addressed. Since the critic delivers merely an approximation to the value function of the corresponding infinite-horizon problem, no guarantee can be given in general as to whether the actor's actions stabilize the system. Different approaches to this issue exist. The current work offers a particular one, which, starting with a (not necessarily smooth) control Lyapunov function (CLF), derives an online RL-scheme in such a way that practical semi-global stability property of the closed-loop can be established. The approach logically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.