Synthesizing Neural Network Controllers with Probabilistic Model based   Reinforcement Learning

Juan Camilo Gamboa Higuera; David Meger; Gregory Dudek

arXiv:1803.02291·cs.RO·August 2, 2018

Synthesizing Neural Network Controllers with Probabilistic Model based Reinforcement Learning

Juan Camilo Gamboa Higuera, David Meger, Gregory Dudek

PDF

3 Repos

TL;DR

This paper introduces a data-efficient, model-based reinforcement learning algorithm that uses neural network dynamics with calibrated uncertainty for rapid controller learning in robotics, outperforming existing methods in complex tasks.

Contribution

The paper proposes a novel neural network dynamics model with variational dropout and techniques to improve convergence, enhancing data efficiency and scalability in controller synthesis.

Findings

01

Competitive data-efficiency with PILCO

02

Successful learning of complex neural network controllers

03

Effective control of a six-legged underwater robot

Abstract

We present an algorithm for rapidly learning controllers for robotics systems. The algorithm follows the model-based reinforcement learning paradigm, and improves upon existing algorithms; namely Probabilistic learning in Control (PILCO) and a sample-based version of PILCO with neural network dynamics (Deep-PILCO). We propose training a neural network dynamics model using variational dropout with truncated Log-Normal noise. This allows us to obtain a dynamics model with calibrated uncertainty, which can be used to simulate controller executions via rollouts. We also describe set of techniques, inspired by viewing PILCO as a recurrent neural network model, that are crucial to improve the convergence of the method. We test our method on a variety of benchmark tasks, demonstrating data-efficiency that is competitive with PILCO, while being able to optimize complex neural network…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDropout