Learning Provably Stabilizing Neural Controllers for Discrete-Time   Stochastic Systems

Matin Ansaripour; Krishnendu Chatterjee; Thomas A. Henzinger; Mathias; Lechner; {\DJ}or{\dj}e \v{Z}ikeli\'c

arXiv:2210.05304·cs.LG·July 31, 2023·1 cites

Learning Provably Stabilizing Neural Controllers for Discrete-Time Stochastic Systems

Matin Ansaripour, Krishnendu Chatterjee, Thomas A. Henzinger, Mathias, Lechner, {\DJ}or{\dj}e \v{Z}ikeli\'c

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel method for learning neural control policies that provably stabilize discrete-time stochastic systems with probability one, using stabilizing ranking supermartingales (sRSMs) for certification.

Contribution

The work presents a new concept of stabilizing ranking supermartingales (sRSMs) and a learning procedure to jointly learn control policies and sRSMs as neural networks for stability guarantees.

Findings

01

Successfully learned provably stabilizing policies in experiments

02

Demonstrated formal certification of stability with neural networks

03

Adapted the method for verifying stability under fixed policies

Abstract

We consider the problem of learning control policies in discrete-time stochastic systems which guarantee that the system stabilizes within some specified stabilization region with probability~ $1$ . Our approach is based on the novel notion of stabilizing ranking supermartingales (sRSMs) that we introduce in this work. Our sRSMs overcome the limitation of methods proposed in previous works whose applicability is restricted to systems in which the stabilizing region cannot be left once entered under any control policy. We present a learning procedure that learns a control policy together with an sRSM that formally certifies probability~ $1$ stability, both learned as neural networks. We show that this procedure can also be adapted to formally verifying that, under a given Lipschitz continuous control policy, the stochastic system stabilizes within some stabilizing region with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mlech26l/neural_martingales
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics