Model-Based Value Estimation for Efficient Model-Free Reinforcement   Learning

Vladimir Feinberg; Alvin Wan; Ion Stoica; Michael I. Jordan; Joseph E.; Gonzalez; Sergey Levine

arXiv:1803.00101·cs.LG·March 2, 2018·168 cites

Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning

Vladimir Feinberg, Alvin Wan, Ion Stoica, Michael I. Jordan, Joseph E., Gonzalez, Sergey Levine

PDF

Open Access

TL;DR

This paper introduces model-based value expansion, a method that leverages learned dynamics models with controlled uncertainty to improve value estimation and reduce sample complexity in reinforcement learning.

Contribution

It presents a novel approach that allows wider use of learned dynamics models in model-free RL by controlling for uncertainty through fixed-depth imagination.

Findings

01

Improved value estimation in continuous control tasks.

02

Reduced sample complexity in reinforcement learning.

03

Enhanced utilization of learned dynamics models.

Abstract

Recent model-free reinforcement learning algorithms have proposed incorporating learned dynamics models as a source of additional data with the intention of reducing sample complexity. Such methods hold the promise of incorporating imagined data coupled with a notion of model uncertainty to accelerate the learning of continuous control tasks. Unfortunately, they rely on heuristics that limit usage of the dynamics model. We present model-based value expansion, which controls for uncertainty in the model by only allowing imagination to fixed depth. By enabling wider use of learned dynamics models within a model-free reinforcement learning algorithm, we improve value estimation, which, in turn, reduces the sample complexity of learning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Traffic control and management · Autonomous Vehicle Technology and Safety