System stabilization with policy optimization on unstable latent   manifolds

Steffen W. R. Werner; Benjamin Peherstorfer

arXiv:2407.06418·math.OC·November 1, 2024

System stabilization with policy optimization on unstable latent manifolds

Steffen W. R. Werner, Benjamin Peherstorfer

PDF

Open Access

TL;DR

This paper presents a reinforcement learning method that stabilizes unstable dynamical systems by focusing on minimal latent manifolds, enabling effective training from limited data samples.

Contribution

It introduces a novel approach that uses minimal unstable latent manifolds for policy optimization, reducing data requirements and improving stability in complex systems.

Findings

01

Successfully stabilizes complex physical systems from few data samples.

02

Outperforms methods operating in state space or on generic latent manifolds.

03

Demonstrates effectiveness with limited data in unstable dynamics scenarios.

Abstract

Stability is a basic requirement when studying the behavior of dynamical systems. However, stabilizing dynamical systems via reinforcement learning is challenging because only little data can be collected over short time horizons before instabilities are triggered and data become meaningless. This work introduces a reinforcement learning approach that is formulated over latent manifolds of unstable dynamics so that stabilizing policies can be trained from few data samples. The unstable manifolds are minimal in the sense that they contain the lowest dimensional dynamics that are necessary for learning policies that guarantee stabilization. This is in stark contrast to generic latent manifolds that aim to approximate all -- stable and unstable -- system dynamics and thus are higher dimensional and often require higher amounts of data. Experiments demonstrate that the proposed approach…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Simulation Techniques and Applications · Advanced Control Systems Optimization