Model-free Reinforcement Learning for Model-based Control: Towards Safe, Interpretable and Sample-efficient Agents

Thomas Banker; Ali Mesbah

arXiv:2507.13491·cs.LG·July 21, 2025

Model-free Reinforcement Learning for Model-based Control: Towards Safe, Interpretable and Sample-efficient Agents

Thomas Banker, Ali Mesbah

PDF

Open Access

TL;DR

This paper explores combining model-free and model-based reinforcement learning to develop agents that are safer, more interpretable, and more sample-efficient, addressing limitations of neural network-based approaches.

Contribution

It introduces a framework for integrating model-based control with model-free RL, highlighting benefits, challenges, and learning strategies for safer, interpretable agents.

Findings

01

Model-based agents can encode prior knowledge to improve safety and interpretability.

02

Combining model-based and model-free RL enhances sample efficiency.

03

Different learning approaches like Bayesian optimization and policy search are analyzed.

Abstract

Training sophisticated agents for optimal decision-making under uncertainty has been key to the rapid development of modern autonomous systems across fields. Notably, model-free reinforcement learning (RL) has enabled decision-making agents to improve their performance directly through system interactions, with minimal prior knowledge about the system. Yet, model-free RL has generally relied on agents equipped with deep neural network function approximators, appealing to the networks' expressivity to capture the agent's policy and value function for complex systems. However, neural networks amplify the issues of sample inefficiency, unsafe learning, and limited interpretability in model-free RL. To this end, this work introduces model-based agents as a compelling alternative for control policy approximation, leveraging adaptable models of system dynamics, cost, and constraints for safe…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)