System stabilization with policy optimization on unstable latent manifolds
Steffen W. R. Werner, Benjamin Peherstorfer

TL;DR
This paper presents a reinforcement learning method that stabilizes unstable dynamical systems by focusing on minimal latent manifolds, enabling effective training from limited data samples.
Contribution
It introduces a novel approach that uses minimal unstable latent manifolds for policy optimization, reducing data requirements and improving stability in complex systems.
Findings
Successfully stabilizes complex physical systems from few data samples.
Outperforms methods operating in state space or on generic latent manifolds.
Demonstrates effectiveness with limited data in unstable dynamics scenarios.
Abstract
Stability is a basic requirement when studying the behavior of dynamical systems. However, stabilizing dynamical systems via reinforcement learning is challenging because only little data can be collected over short time horizons before instabilities are triggered and data become meaningless. This work introduces a reinforcement learning approach that is formulated over latent manifolds of unstable dynamics so that stabilizing policies can be trained from few data samples. The unstable manifolds are minimal in the sense that they contain the lowest dimensional dynamics that are necessary for learning policies that guarantee stabilization. This is in stark contrast to generic latent manifolds that aim to approximate all -- stable and unstable -- system dynamics and thus are higher dimensional and often require higher amounts of data. Experiments demonstrate that the proposed approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Simulation Techniques and Applications · Advanced Control Systems Optimization
