Approximate Inference and Stochastic Optimal Control

Konrad Rawlik; Marc Toussaint; Sethu Vijayakumar

arXiv:1009.3958 · cs.LG · September 22, 2010 · 20 cites

TL;DR

This paper introduces a new approach to stochastic optimal control by reformulating it as an approximate inference problem, leading to novel iterative solution methods and model-free reinforcement learning algorithms.

Contribution

It presents a theoretical reformulation of stochastic control as approximate inference and develops new practical, model-free RL methods based on this insight.

Findings

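01. A new class of iterative solutions to the stochastic optimal control problem, based on a relaxation of the exact dual formulation.

02. Model-free, off-policy reinforcement learning algorithms derived from these solutions.

03. The methods apply to both discrete and continuous problems.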

Abstract

We propose a novel reformulation of the stochastic optimal control problem as an approximate inference problem, demonstrating that such an interpretation leads to new practical methods for the original problem. In particular, we characterise a novel class of iterative solutions to the stochastic optimal control problem based on a natural relaxation of the exact dual formulation. These theoretical insights are applied to the Reinforcement Learning problem, where they lead to new model-free, off-policy methods for discrete and continuous problems.
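The abstract does not spell out the iteration itself. As a rough illustration of the general idea only, and not the paper's algorithm, the sketch below runs KL-regularised control on a toy three-state chain: each pass solves a control problem penalised by the KL divergence to a reference policy, and the result becomes the reference for the next pass. The MDP, the function name `kl_regularised_policy`, the discount factor, and the iteration counts are all assumptions made for this example.

```python
import numpy as np

def kl_regularised_policy(P, C, prior, gamma=0.9, sweeps=200):
    """Soft value iteration for cost C plus a KL(pi || prior) penalty (illustrative)."""
    log_prior = np.log(np.clip(prior, 1e-300, None))  # clip avoids log(0)
    V = np.zeros(C.shape[0])
    for _ in range(sweeps):
        Q = C + gamma * (P @ V)                # expected cost-to-go, shape (S, A)
        logits = log_prior - Q
        m = logits.max(axis=1, keepdims=True)  # stabilised log-sum-exp
        V = -(m[:, 0] + np.log(np.exp(logits - m).sum(axis=1)))
    logits = log_prior - (C + gamma * (P @ V))
    logits -= logits.max(axis=1, keepdims=True)
    pi = np.exp(logits)
    return pi / pi.sum(axis=1, keepdims=True)  # pi(a|s) proportional to prior * exp(-Q)

# Hypothetical chain: states 0, 1, 2; actions 0 = left, 1 = right; state 2 is the goal.
n_states, n_actions = 3, 2
P = np.zeros((n_states, n_actions, n_states))
for s in (0, 1):
    left, right = max(s - 1, 0), s + 1
    P[s, 0, left] += 0.9    # "left" succeeds with probability 0.9
    P[s, 0, right] += 0.1   # and slips the other way otherwise
    P[s, 1, right] += 0.9
    P[s, 1, left] += 0.1
P[2, :, 2] = 1.0            # the goal is absorbing
C = np.array([[1.0, 1.0], [1.0, 1.0], [0.0, 0.0]])  # unit cost until the goal

# Outer loop: the previous policy serves as the reference for the next pass.
pi = np.full((n_states, n_actions), 1.0 / n_actions)
for _ in range(10):
    pi = kl_regularised_policy(P, C, pi)

print(np.round(pi, 3))
```

On this toy problem the policy concentrates on moving right in states 0 and 1 as the passes accumulate, while the goal state stays uniform because both actions cost the same there. The sketch assumes a known transition model; the model-free, off-policy methods the abstract refers to would instead estimate the corresponding quantities from sampled transitions.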

Taxonomy

Topics: Reinforcement Learning in Robotics · Advanced Control Systems Optimization · Control Systems and Identification