Efficient Reinforcement Learning using Linear Koopman Dynamics for Nonlinear Robotic Systems

Wenjian Hao; Yuxuan Fang; Zehui Lu; Shaoshuai Mou

arXiv:2604.19980·cs.RO·April 23, 2026

Efficient Reinforcement Learning using Linear Koopman Dynamics for Nonlinear Robotic Systems

Wenjian Hao, Yuxuan Fang, Zehui Lu, Shaoshuai Mou

PDF

TL;DR

This paper introduces a model-based reinforcement learning framework that leverages Koopman operator theory to learn linear dynamics for nonlinear robotic systems, improving sample efficiency and control performance.

Contribution

It develops a novel online policy gradient method using learned Koopman-based models, reducing computational cost and rollout errors in nonlinear control tasks.

Findings

01

Enhanced sample efficiency over model-free RL methods.

02

Superior control performance compared to other model-based RL approaches.

03

Control results comparable to classical methods with known dynamics.

Abstract

This paper presents a model-based reinforcement learning (RL) framework for optimal closed-loop control of nonlinear robotic systems. The proposed approach learns linear lifted dynamics through Koopman operator theory and integrates the resulting model into an actor-critic architecture for policy optimization, where the policy represents a parameterized closed-loop controller. To reduce computational cost and mitigate model rollout errors, policy gradients are estimated using one-step predictions of the learned dynamics rather than multi-step propagation. This leads to an online mini-batch policy gradient framework that enables policy improvement from streamed interaction data. The proposed framework is evaluated on several simulated nonlinear control benchmarks and two real-world hardware platforms, including a Kinova Gen3 robotic arm and a Unitree Go1 quadruped. Experimental results…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.