Information-Theoretic Stochastic Optimal Control via Incremental Sampling-based Algorithms
Oktay Arslan, Evangelos Theodorou, Panagiotis Tsiotras

TL;DR
This paper introduces a novel approach combining incremental sampling algorithms with information-theoretic stochastic optimal control, enabling efficient trajectory sampling for solving complex nonlinear stochastic control problems.
Contribution
It presents a new method linking RRT algorithms with path integral control, improving trajectory sampling in stochastic optimal control.
Findings
Effective sampling of low-cost trajectories demonstrated
Enhanced stability in stochastic control computations
Successful application to multiple nonlinear systems
Abstract
This paper considers optimal control of dynamical systems which are represented by nonlinear stochastic differential equations. It is well-known that the optimal control policy for this problem can be obtained as a function of a value function that satisfies a nonlinear partial differential equation, namely, the Hamilton-Jacobi-Bellman equation. This nonlinear PDE must be solved backwards in time, and this computation is intractable for large scale systems. Under certain assumptions, and after applying a logarithmic transformation, an alternative characterization of the optimal policy can be given in terms of a path integral. Path Integral (PI) based control methods have recently been shown to provide elegant solutions to a broad class of stochastic optimal control problems. One of the implementation challenges with this formalism is the computation of the expectation of a cost…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Control Systems Optimization · Gaussian Processes and Bayesian Inference
