Barrier Certified Safety Learning Control: When Sum-of-Square   Programming Meets Reinforcement Learning

Hejun Huang; Zhenglong Li; Dongkun Han

arXiv:2206.07915·eess.SY·July 1, 2022

Barrier Certified Safety Learning Control: When Sum-of-Square Programming Meets Reinforcement Learning

Hejun Huang, Zhenglong Li, Dongkun Han

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel safety learning control method combining sum-of-squares programming with reinforcement learning to guarantee safety in control systems, demonstrated on an inverted pendulum model.

Contribution

It proposes a new algorithm that uses sum-of-squares programming to ensure safety in reinforcement learning, addressing safety guarantees in control systems.

Findings

01

Sum-of-squares programming effectively maintains safety in reinforcement learning.

02

The proposed method outperforms quadratic programming-based approaches.

03

Demonstrated success on an inverted pendulum model.

Abstract

Safety guarantee is essential in many engineering implementations. Reinforcement learning provides a useful way to strengthen safety. However, reinforcement learning algorithms cannot completely guarantee safety over realistic operations. To address this issue, this work adopts control barrier functions over reinforcement learning, and proposes a compensated algorithm to completely maintain safety. Specifically, a sum-of-squares programming has been exploited to search for the optimal controller, and tune the learning hyperparameters simultaneously. Thus, the control actions are pledged to be always within the safe region. The effectiveness of proposed method is demonstrated via an inverted pendulum model. Compared to quadratic programming based reinforcement learning methods, our sum-of-squares programming based reinforcement learning has shown its superiority.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wogwan/ccta2022_saferl
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Reliability and Analysis Research · Fault Detection and Control Systems · Safety Systems Engineering in Autonomy