Stochastic Optimal Control as Approximate Input Inference
Joe Watson, Hany Abdulsamad, Jan Peters

TL;DR
This paper presents a probabilistic framework for stochastic optimal control by formulating it as an input inference problem, enabling uncertainty quantification and principled regularization.
Contribution
It introduces a novel probabilistic approach using Expectation Maximization and message passing to infer optimal control inputs, unifying control and inference perspectives.
Findings
Incorporates uncertainty quantification into control inference.
Derives maximum entropy LQG control law for linearized systems.
Provides a detailed derivation and comparison with existing methods.
Abstract
Optimal control of stochastic nonlinear dynamical systems is a major challenge in the domain of robot learning. Given the intractability of the global control problem, state-of-the-art algorithms focus on approximate sequential optimization techniques, that heavily rely on heuristics for regularization in order to achieve stable convergence. By building upon the duality between inference and control, we develop the view of Optimal Control as Input Estimation, devising a probabilistic stochastic optimal control formulation that iteratively infers the optimal input distributions by minimizing an upper bound of the control cost. Inference is performed through Expectation Maximization and message passing on a probabilistic graphical model of the dynamical system, and time-varying linear Gaussian feedback controllers are extracted from the joint state-action distribution. This perspective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Advanced Bandit Algorithms Research · Reinforcement Learning in Robotics
