Dyna-Style Reinforcement Learning Modeling and Control of Non-linear Dynamics
Karim Abdelsalam, Zeyad Gamal, Ayman El-Badawy

TL;DR
This paper introduces a hybrid reinforcement learning framework combining SINDy and TD3 to efficiently control nonlinear systems, demonstrated on a bi-rotor system with improved accuracy and robustness.
Contribution
It presents a novel integration of data-driven system identification with reinforcement learning, enhancing sample efficiency and control performance for nonlinear dynamics.
Findings
Achieves superior control accuracy over traditional RL methods.
Demonstrates robustness in stabilization and trajectory tracking.
Reduces sample complexity through synthetic rollouts.
Abstract
Controlling systems with complex, nonlinear dynamics poses a significant challenge, particularly in achieving efficient and robust control. In this paper, we propose a Dyna-Style Reinforcement Learning control framework that integrates Sparse Identification of Nonlinear Dynamics (SINDy) with Twin Delayed Deep Deterministic Policy Gradient (TD3) reinforcement learning. SINDy is used to identify a data-driven model of the system, capturing its key dynamics without requiring an explicit physical model. This identified model is used to generate synthetic rollouts that are periodically injected into the reinforcement learning replay buffer during training on the real environment, enabling efficient policy learning with limited data available. By leveraging this hybrid approach, we mitigate the sample inefficiency of traditional model-free reinforcement learning methods while ensuring…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Dynamic Programming Control · Reinforcement Learning in Robotics · Neural Networks and Reservoir Computing
