Stochastic Optimal Control for Multivariable Dynamical Systems Using Expectation Maximization
Prakash Mallick, Zhiyong Chen

TL;DR
This paper introduces SOC-EM, an iterative trajectory optimization method for stochastic control of dynamical systems with measurement noise, combining reinforcement learning and maximum likelihood to improve cost reduction.
Contribution
It reformulates stochastic control as a reinforcement learning problem and proposes SOC-EM, a novel iterative optimization approach with theoretical and empirical performance improvements.
Findings
SOC-EM reduces cumulative cost-to-go effectively.
Theoretical proof of control parameter estimate uniqueness.
Analysis of control covariance balances exploration and exploitation.
Abstract
Trajectory optimization is a fundamental stochastic optimal control problem. This paper deals with a trajectory optimization approach for dynamical systems subject to measurement noise that can be fitted into linear time-varying stochastic models. Exact/complete solutions to these kind of control problems have been deemed analytically intractable in literature because they come under the category of Partially Observable Markov Decision Processes (POMDPs). Therefore, effective solutions with reasonable approximations are widely sought for. We propose a reformulation of stochastic control in a reinforcement learning setting. This type of formulation assimilates the benefits of conventional optimal control procedure, with the advantages of maximum likelihood approaches. Finally, an iterative trajectory optimization paradigm called as Stochastic Optimal Control - Expectation Maximization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Vehicle emissions and performance · Energy, Environment, and Transportation Policies
