MBMF: Model-Based Priors for Model-Free Reinforcement Learning

Somil Bansal; Roberto Calandra; Kurtland Chua; Sergey Levine; Claire; Tomlin

arXiv:1709.03153·cs.LG·October 19, 2017·20 cites

MBMF: Model-Based Priors for Model-Free Reinforcement Learning

Somil Bansal, Roberto Calandra, Kurtland Chua, Sergey Levine, Claire, Tomlin

PDF

Open Access

TL;DR

This paper introduces MBMF, a hybrid reinforcement learning method that combines model-based priors with model-free optimization to improve data efficiency and robustness, outperforming traditional approaches.

Contribution

The paper presents a novel approach that integrates probabilistic dynamics models as priors into model-free reinforcement learning, effectively bridging the two paradigms.

Findings

01

Outperforms purely model-based methods

02

Outperforms purely model-free methods

03

Better data efficiency and robustness

Abstract

Reinforcement Learning is divided in two main paradigms: model-free and model-based. Each of these two paradigms has strengths and limitations, and has been successfully applied to real world domains that are appropriate to its corresponding strengths. In this paper, we present a new approach aimed at bridging the gap between these two paradigms. We aim to take the best of the two paradigms and combine them in an approach that is at the same time data-efficient and cost-savvy. We do so by learning a probabilistic dynamics model and leveraging it as a prior for the intertwined model-free optimization. As a result, our approach can exploit the generality and structure of the dynamics model, but is also capable of ignoring its inevitable inaccuracies, by directly incorporating the evidence provided by the direct observation of the cost. Preliminary results demonstrate that our approach…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Multi-Objective Optimization Algorithms · Gaussian Processes and Bayesian Inference