Stability-certified reinforcement learning: A control-theoretic   perspective

Ming Jin; Javad Lavaei

arXiv:1810.11505·cs.SY·October 30, 2018·23 cites

Stability-certified reinforcement learning: A control-theoretic perspective

Ming Jin, Javad Lavaei

PDF

Open Access

TL;DR

This paper presents a control-theoretic approach to certify the stability of reinforcement learning policies when integrated with nonlinear dynamical systems, ensuring robust stability through input-output gradient regulation and semidefinite programming.

Contribution

It introduces a novel method to certify stability of RL policies using gradient regulation and semidefinite programming, with empirical validation on control tasks.

Findings

01

The method certifies a large set of stabilizing controllers.

02

Reinforcement learning agents perform well within the stability-certified region.

03

Agents exhibit stable long-term learning behaviors.

Abstract

We investigate the important problem of certifying stability of reinforcement learning policies when interconnected with nonlinear dynamical systems. We show that by regulating the input-output gradients of policies, strong guarantees of robust stability can be obtained based on a proposed semidefinite programming feasibility problem. The method is able to certify a large set of stabilizing controllers by exploiting problem-specific structures; furthermore, we analyze and establish its (non)conservatism. Empirical evaluations on two decentralized control tasks, namely multi-flight formation and power system frequency regulation, demonstrate that the reinforcement learning agents can have high performance within the stability-certified parameter space, and also exhibit stable learning behaviors in the long run.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Distributed Control Multi-Agent Systems