Variational Inference MPC for Bayesian Model-based Reinforcement Learning

Masashi Okada; Tadahiro Taniguchi

arXiv:1907.04202·cs.LG·October 8, 2019·29 cites

TL;DR

This paper introduces a Bayesian variational inference approach to model-based reinforcement learning, enhancing uncertainty modeling and improving performance in robotics tasks.

Contribution

It presents a novel variational inference MPC framework and a new probabilistic action ensemble method, PaETS, for better uncertainty handling in MBRL.

Findings

01 Improved asymptotic performance over PETS in locomotion tasks

02 Handles multimodal uncertainties in dynamics and trajectories

03 Reformulates stochastic methods like CEM in a Bayesian framework

Abstract

In recent studies on model-based reinforcement learning (MBRL), incorporating uncertainty in forward dynamics is a state-of-the-art strategy to enhance learning performance, making MBRL competitive with cutting-edge model-free methods, especially in simulated robotics tasks. Probabilistic ensembles with trajectory sampling (PETS) is a leading type of MBRL, which applies Bayesian inference to dynamics modeling and uses model predictive control (MPC) with stochastic optimization via the cross-entropy method (CEM). In this paper, we propose a novel extension to uncertainty-aware MBRL. Our main contributions are twofold: first, we introduce a variational inference MPC, which reformulates various stochastic methods, including CEM, in a Bayesian fashion. Second, we propose a novel instance of the framework, called probabilistic action ensembles with trajectory sampling (PaETS). As a result,…
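
For readers unfamiliar with the CEM planner that the abstract refers to, the following is a minimal sketch of plain cross-entropy-method optimization over an open-loop action sequence, as used inside an MPC loop. It is not the paper's variational inference MPC or PaETS, and it is not taken from the PETS code. The function names (`cem_plan`, `toy_cost`), the hyperparameters, and the one-dimensional integrator dynamics are all hypothetical and chosen only for illustration.

```python
import numpy as np


def cem_plan(cost_fn, horizon, action_dim,
             n_samples=200, n_elites=20, n_iters=10, seed=0):
    """Plain CEM over action sequences (illustrative sketch, assumed settings)."""
    rng = np.random.default_rng(seed)
    # Diagonal Gaussian over the whole action sequence.
    mean = np.zeros((horizon, action_dim))
    std = np.ones((horizon, action_dim))
    for _ in range(n_iters):
        # Sample candidate action sequences from the current Gaussian.
        noise = rng.standard_normal((n_samples, horizon, action_dim))
        samples = mean + std * noise
        costs = np.array([cost_fn(seq) for seq in samples])
        # Keep the lowest-cost sequences and refit the Gaussian to them.
        elites = samples[np.argsort(costs)[:n_elites]]
        mean = elites.mean(axis=0)
        std = elites.std(axis=0) + 1e-6  # small floor avoids a zero-width sampler
    return mean


def toy_cost(actions, x0=0.0, target=1.0):
    """Hypothetical 1-D integrator x_{t+1} = x_t + a_t with a quadratic cost."""
    x, cost = x0, 0.0
    for a in actions[:, 0]:
        x = x + a
        cost += (x - target) ** 2 + 0.01 * a ** 2
    return cost


plan = cem_plan(toy_cost, horizon=5, action_dim=1)
first_action = plan[0, 0]  # in MPC, only this action is executed before replanning
```

In an MPC loop, `cem_plan` would be called again at every time step from the newly observed state. In a PETS-style setup, `cost_fn` would average the predicted cost over trajectories sampled from a learned probabilistic dynamics ensemble rather than use a known model as this toy example does.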

Taxonomy

Topics: Reinforcement Learning in Robotics · Robotic Locomotion and Control · Prosthetics and Rehabilitation Robotics