Model-based Reinforcement Learning: A Survey

Thomas M. Moerland; Joost Broekens; Aske Plaat; Catholijn M. Jonker

arXiv:2006.16712·cs.LG·April 1, 2022·82 cites

Model-based Reinforcement Learning: A Survey

Thomas M. Moerland, Joost Broekens, Aske Plaat, Catholijn M. Jonker

PDF

Open Access

TL;DR

This survey comprehensively reviews model-based reinforcement learning, covering dynamics model learning challenges, planning integration strategies, and potential benefits, providing a broad conceptual overview of combining planning and learning in MDP optimization.

Contribution

It systematically categorizes approaches to dynamics modeling and planning integration in model-based RL, highlighting challenges and potential benefits, and connects related RL fields.

Findings

01

Detailed categorization of dynamics model learning approaches

02

Analysis of planning and learning integration strategies

03

Discussion of implicit model-based RL and its advantages

Abstract

Sequential decision making, commonly formalized as Markov Decision Process (MDP) optimization, is a important challenge in artificial intelligence. Two key approaches to this problem are reinforcement learning (RL) and planning. This paper presents a survey of the integration of both fields, better known as model-based reinforcement learning. Model-based RL has two main steps. First, we systematically cover approaches to dynamics model learning, including challenges like dealing with stochasticity, uncertainty, partial observability, and temporal abstraction. Second, we present a systematic categorization of planning-learning integration, including aspects like: where to start planning, what budgets to allocate to planning and real data collection, how to plan, and how to integrate planning in the learning and acting loop. After these two sections, we also discuss implicit model-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComplex Systems and Decision Making · Reinforcement Learning in Robotics · Simulation Techniques and Applications