Stochastic Optimal Control as Approximate Input Inference

Joe Watson; Hany Abdulsamad; Jan Peters

arXiv:1910.03003·cs.LG·April 23, 2020·5 cites

Stochastic Optimal Control as Approximate Input Inference

Joe Watson, Hany Abdulsamad, Jan Peters

PDF

Open Access 1 Repo

TL;DR

This paper presents a probabilistic framework for stochastic optimal control by formulating it as an input inference problem, enabling uncertainty quantification and principled regularization.

Contribution

It introduces a novel probabilistic approach using Expectation Maximization and message passing to infer optimal control inputs, unifying control and inference perspectives.

Findings

01

Incorporates uncertainty quantification into control inference.

02

Derives maximum entropy LQG control law for linearized systems.

03

Provides a detailed derivation and comparison with existing methods.

Abstract

Optimal control of stochastic nonlinear dynamical systems is a major challenge in the domain of robot learning. Given the intractability of the global control problem, state-of-the-art algorithms focus on approximate sequential optimization techniques, that heavily rely on heuristics for regularization in order to achieve stable convergence. By building upon the duality between inference and control, we develop the view of Optimal Control as Input Estimation, devising a probabilistic stochastic optimal control formulation that iteratively infers the optimal input distributions by minimizing an upper bound of the control cost. Inference is performed through Expectation Maximization and message passing on a probabilistic graphical model of the dynamical system, and time-varying linear Gaussian feedback controllers are extracted from the joint state-action distribution. This perspective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

JoeMWatson/input-inference-for-control
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Advanced Bandit Algorithms Research · Reinforcement Learning in Robotics