Calibrated Model-Based Deep Reinforcement Learning

Ali Malik; Volodymyr Kuleshov; Jiaming Song; Danny Nemer; Harlan; Seymour; Stefano Ermon

arXiv:1906.08312·cs.LG·June 21, 2019·23 cites

Calibrated Model-Based Deep Reinforcement Learning

Ali Malik, Volodymyr Kuleshov, Jiaming Song, Danny Nemer, Harlan, Seymour, Stefano Ermon

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that calibrating predictive uncertainties in deep model-based reinforcement learning enhances planning, exploration, and sample efficiency, leading to state-of-the-art results on the HalfCheetah task with fewer samples.

Contribution

It introduces a simple method to augment any model-based RL agent with calibrated models, significantly improving performance and sample efficiency.

Findings

01

Calibrated models improve planning and exploration.

02

Achieved state-of-the-art results on HalfCheetah with 50% fewer samples.

03

Calibration adds minimal computational overhead.

Abstract

Estimates of predictive uncertainty are important for accurate model-based planning and reinforcement learning. However, predictive uncertainties---especially ones derived from modern deep learning systems---can be inaccurate and impose a bottleneck on performance. This paper explores which uncertainties are needed for model-based reinforcement learning and argues that good uncertainties must be calibrated, i.e. their probabilities should match empirical frequencies of predicted events. We describe a simple way to augment any model-based reinforcement learning agent with a calibrated model and show that doing so consistently improves planning, sample complexity, and exploration. On the \textsc{HalfCheetah} MuJoCo task, our system achieves state-of-the-art performance using 50\% fewer samples than the current leading approach. Our findings suggest that calibration can improve the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ermongroup/CalibratedModelBasedRL
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Software Engineering Research · Evolutionary Algorithms and Applications