Model-Based Data-Efficient and Robust Reinforcement Learning

Ludvig Svedlund; Constantin Cronrath; Jonas Fredriksson; and Bengt Lennartson

arXiv:2602.00630·eess.SY·February 3, 2026

Model-Based Data-Efficient and Robust Reinforcement Learning

Ludvig Svedlund, Constantin Cronrath, Jonas Fredriksson, and Bengt Lennartson

PDF

Open Access

TL;DR

This paper introduces a model-based reinforcement learning method that enhances data efficiency and robustness by combining system dynamics modeling with a two-level control optimization, outperforming traditional approaches in energy savings.

Contribution

It presents a novel two-level control framework that integrates system dynamics learning with optimization, significantly improving data efficiency and robustness over existing methods.

Findings

01

Reduces energy consumption more effectively than existing RL methods.

02

Achieves over 100-fold reduction in evaluated time steps.

03

Demonstrates robustness against load disturbances and model errors.

Abstract

A data-efficient learning-based control design method is proposed in this paper. It is based on learning a system dynamics model that is then leveraged in a two-level procedure. On the higher level, a simple but powerful optimization procedure is performed such that, for example, energy consumption in a vehicle can be reduced when hard state and action constraints are also introduced. Load disturbances and model errors are compensated for by a feedback controller on the lower level. In that regard, we briefly examine the robustness of both model-free and model-based learning approaches, and it is shown that the model-free approach greatly suffers from the inclusion of unmodeled dynamics. In evaluating the proposed method, it is assumed that a path is given, while the velocity and acceleration can be modified such that energy is saved, while still keeping speed limits and completion…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control · Reinforcement Learning in Robotics · Traffic control and management