Efficient Reinforcement Learning using Linear Koopman Dynamics for Nonlinear Robotic Systems
Wenjian Hao, Yuxuan Fang, Zehui Lu, Shaoshuai Mou

TL;DR
This paper introduces a model-based reinforcement learning framework that leverages Koopman operator theory to learn linear dynamics for nonlinear robotic systems, improving sample efficiency and control performance.
Contribution
It develops a novel online policy gradient method using learned Koopman-based models, reducing computational cost and rollout errors in nonlinear control tasks.
Findings
Enhanced sample efficiency over model-free RL methods.
Superior control performance compared to other model-based RL approaches.
Control results comparable to classical methods with known dynamics.
Abstract
This paper presents a model-based reinforcement learning (RL) framework for optimal closed-loop control of nonlinear robotic systems. The proposed approach learns linear lifted dynamics through Koopman operator theory and integrates the resulting model into an actor-critic architecture for policy optimization, where the policy represents a parameterized closed-loop controller. To reduce computational cost and mitigate model rollout errors, policy gradients are estimated using one-step predictions of the learned dynamics rather than multi-step propagation. This leads to an online mini-batch policy gradient framework that enables policy improvement from streamed interaction data. The proposed framework is evaluated on several simulated nonlinear control benchmarks and two real-world hardware platforms, including a Kinova Gen3 robotic arm and a Unitree Go1 quadruped. Experimental results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
