Dyna-Style Reinforcement Learning Modeling and Control of Non-linear Dynamics

Karim Abdelsalam; Zeyad Gamal; Ayman El-Badawy

arXiv:2512.21081·eess.SY·December 25, 2025

Dyna-Style Reinforcement Learning Modeling and Control of Non-linear Dynamics

Karim Abdelsalam, Zeyad Gamal, Ayman El-Badawy

PDF

Open Access

TL;DR

This paper introduces a hybrid reinforcement learning framework combining SINDy and TD3 to efficiently control nonlinear systems, demonstrated on a bi-rotor system with improved accuracy and robustness.

Contribution

It presents a novel integration of data-driven system identification with reinforcement learning, enhancing sample efficiency and control performance for nonlinear dynamics.

Findings

01

Achieves superior control accuracy over traditional RL methods.

02

Demonstrates robustness in stabilization and trajectory tracking.

03

Reduces sample complexity through synthetic rollouts.

Abstract

Controlling systems with complex, nonlinear dynamics poses a significant challenge, particularly in achieving efficient and robust control. In this paper, we propose a Dyna-Style Reinforcement Learning control framework that integrates Sparse Identification of Nonlinear Dynamics (SINDy) with Twin Delayed Deep Deterministic Policy Gradient (TD3) reinforcement learning. SINDy is used to identify a data-driven model of the system, capturing its key dynamics without requiring an explicit physical model. This identified model is used to generate synthetic rollouts that are periodically injected into the reinforcement learning replay buffer during training on the real environment, enabling efficient policy learning with limited data available. By leveraging this hybrid approach, we mitigate the sample inefficiency of traditional model-free reinforcement learning methods while ensuring…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control · Reinforcement Learning in Robotics · Neural Networks and Reservoir Computing