Benchmarking Model-Based Reinforcement Learning

Tingwu Wang; Xuchan Bao; Ignasi Clavera; Jerrick Hoang; Yeming Wen,; Eric Langlois; Shunshi Zhang; Guodong Zhang; Pieter Abbeel; Jimmy Ba

arXiv:1907.02057·cs.LG·July 4, 2019·242 cites

Benchmarking Model-Based Reinforcement Learning

Tingwu Wang, Xuchan Bao, Ignasi Clavera, Jerrick Hoang, Yeming Wen,, Eric Langlois, Shunshi Zhang, Guodong Zhang, Pieter Abbeel, Jimmy Ba

PDF

Open Access 2 Repos

TL;DR

This paper provides a comprehensive benchmarking suite for model-based reinforcement learning, comparing various algorithms across standardized environments to identify key challenges and facilitate future research.

Contribution

It introduces a unified benchmarking framework with multiple environments and evaluates existing MBRL algorithms to standardize performance assessment.

Findings

01

Identified three key research challenges: dynamics bottleneck, planning horizon dilemma, early-termination dilemma.

02

Provided a comprehensive comparison of MBRL algorithms under unified settings.

03

Open-sourced the benchmark suite for community use.

Abstract

Model-based reinforcement learning (MBRL) is widely seen as having the potential to be significantly more sample efficient than model-free RL. However, research in model-based RL has not been very standardized. It is fairly common for authors to experiment with self-designed environments, and there are several separate lines of research, which are sometimes closed-sourced or not reproducible. Accordingly, it is an open question how these various existing MBRL algorithms perform relative to each other. To facilitate research in MBRL, in this paper we gather a wide collection of MBRL algorithms and propose over 18 benchmarking environments specially designed for MBRL. We benchmark these algorithms with unified problem settings, including noisy environments. Beyond cataloguing performance, we explore and unify the underlying algorithmic differences across MBRL algorithms. We characterize…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Elevator Systems and Control · Scheduling and Optimization Algorithms