A Survey on Model-based Reinforcement Learning
Fan-Ming Luo, Tian Xu, Hang Lai, Xiong-Hui Chen, Weinan Zhang, Yang Yu

TL;DR
This survey reviews recent progress in deep model-based reinforcement learning, emphasizing the importance of understanding model discrepancies, and discusses its applications, challenges, and future prospects in real-world tasks.
Contribution
It provides a comprehensive overview of recent advances in deep MBRL, analyzing model discrepancies, and exploring its applications across various RL paradigms.
Findings
Analysis of generalization error between learned and real environment models
Discussion on discrepancy-guided algorithm design for better model learning
Evaluation of MBRL's potential in real-world applications
Abstract
Reinforcement learning (RL) solves sequential decision-making problems via a trial-and-error process interacting with the environment. While RL achieves outstanding success in playing complex video games that allow huge trial-and-error, making errors is always undesired in the real world. To improve the sample efficiency and thus reduce the errors, model-based reinforcement learning (MBRL) is believed to be a promising direction, which builds environment models in which the trial-and-errors can take place without real costs. In this survey, we take a review of MBRL with a focus on the recent progress in deep RL. For non-tabular environments, there is always a generalization error between the learned environment model and the real environment. As such, it is of great importance to analyze the discrepancy between policy training in the environment model and that in the real environment,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques
