Lyapunov Constrained Soft Actor-Critic (LC-SAC) using Koopman Operator Theory for Quadrotor Trajectory Tracking

Dhruv S. Kushwaha; Zoleikha A. Biron

arXiv:2602.04132·eess.SY·March 19, 2026

Lyapunov Constrained Soft Actor-Critic (LC-SAC) using Koopman Operator Theory for Quadrotor Trajectory Tracking

Dhruv S. Kushwaha, Zoleikha A. Biron

PDF

Open Access

TL;DR

This paper introduces a Lyapunov-constrained RL algorithm using Koopman operator theory to ensure stability in quadrotor trajectory tracking, demonstrating improved convergence and stability guarantees over standard methods.

Contribution

It proposes a novel LC-SAC algorithm that incorporates Koopman-based Lyapunov functions into RL for stability guarantees in nonlinear systems.

Findings

01

Enhanced training convergence in quadrotor trajectory tracking

02

Decaying violations of Lyapunov stability criterion

03

Improved stability guarantees over vanilla SAC

Abstract

Reinforcement Learning (RL) has achieved remarkable success in solving complex sequential decision-making problems. However, its application to safety-critical physical systems remains constrained by the lack of stability guarantees. Standard RL algorithms prioritize reward maximization, often yielding policies that may induce oscillations or unbounded state divergence. There has been significant work in incorporating Lyapunov-based stability guarantees in RL algorithms with key challenges being selecting a candidate Lyapunov function, computational complexity by using excessive function approximators and conservative policies by incorporating stability criterion in the learning process. In this work we propose a novel Lyapunov-constrained Soft Actor-Critic (LC-SAC) algorithm using Koopman operator theory. We propose use of extended dynamic mode decomposition (EDMD) to produce a linear…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control · Reinforcement Learning in Robotics · Model Reduction and Neural Networks