A Survey on Model-based Reinforcement Learning

Fan-Ming Luo; Tian Xu; Hang Lai; Xiong-Hui Chen; Weinan Zhang; Yang Yu

arXiv:2206.09328·cs.LG·June 22, 2022·26 cites

A Survey on Model-based Reinforcement Learning

Fan-Ming Luo, Tian Xu, Hang Lai, Xiong-Hui Chen, Weinan Zhang, Yang Yu

PDF

Open Access

TL;DR

This survey reviews recent progress in deep model-based reinforcement learning, emphasizing the importance of understanding model discrepancies, and discusses its applications, challenges, and future prospects in real-world tasks.

Contribution

It provides a comprehensive overview of recent advances in deep MBRL, analyzing model discrepancies, and exploring its applications across various RL paradigms.

Findings

01

Analysis of generalization error between learned and real environment models

02

Discussion on discrepancy-guided algorithm design for better model learning

03

Evaluation of MBRL's potential in real-world applications

Abstract

Reinforcement learning (RL) solves sequential decision-making problems via a trial-and-error process interacting with the environment. While RL achieves outstanding success in playing complex video games that allow huge trial-and-error, making errors is always undesired in the real world. To improve the sample efficiency and thus reduce the errors, model-based reinforcement learning (MBRL) is believed to be a promising direction, which builds environment models in which the trial-and-errors can take place without real costs. In this survey, we take a review of MBRL with a focus on the recent progress in deep RL. For non-tabular environments, there is always a generalization error between the learned environment model and the real environment. As such, it is of great importance to analyze the discrepancy between policy training in the environment model and that in the real environment,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques