Critic as Lyapunov function (CALF): a model-free, stability-ensuring   agent

Pavel Osinenko; Grigory Yaremenko; Roman Zashchitin; Anton Bolychev,; Sinan Ibrahim; Dmitrii Dobriborsci

arXiv:2409.09869·cs.RO·September 17, 2024

Critic as Lyapunov function (CALF): a model-free, stability-ensuring agent

Pavel Osinenko, Grigory Yaremenko, Roman Zashchitin, Anton Bolychev,, Sinan Ibrahim, Dmitrii Dobriborsci

PDF

Open Access

TL;DR

CALF is a novel model-free reinforcement learning agent that guarantees environment stability during learning, significantly improving performance in a mobile robot simulation compared to traditional methods.

Contribution

The paper introduces CALF, a new reinforcement learning agent that ensures online stability without relying on models, bridging classical control and RL.

Findings

01

CALF outperforms SARSA-m in stabilizing the environment.

02

CALF improves nominal stabilizer performance.

03

Demonstrated success in mobile robot simulation.

Abstract

This work presents and showcases a novel reinforcement learning agent called Critic As Lyapunov Function (CALF) which is model-free and ensures online environment, in other words, dynamical system stabilization. Online means that in each learning episode, the said environment is stabilized. This, as demonstrated in a case study with a mobile robot simulator, greatly improves the overall learning performance. The base actor-critic scheme of CALF is analogous to SARSA. The latter did not show any success in reaching the target in our studies. However, a modified version thereof, called SARSA-m here, did succeed in some learning scenarios. Still, CALF greatly outperformed the said approach. CALF was also demonstrated to improve a nominal stabilizer provided to it. In summary, the presented agent may be considered a viable approach to fusing classical control with reinforcement learning.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsControl and Stability of Dynamical Systems

MethodsBalanced Selection