Information-Theoretic Stochastic Optimal Control via Incremental   Sampling-based Algorithms

Oktay Arslan; Evangelos Theodorou; Panagiotis Tsiotras

arXiv:1405.7392·cs.RO·May 30, 2014

Information-Theoretic Stochastic Optimal Control via Incremental Sampling-based Algorithms

Oktay Arslan, Evangelos Theodorou, Panagiotis Tsiotras

PDF

Open Access

TL;DR

This paper introduces a novel approach combining incremental sampling algorithms with information-theoretic stochastic optimal control, enabling efficient trajectory sampling for solving complex nonlinear stochastic control problems.

Contribution

It presents a new method linking RRT algorithms with path integral control, improving trajectory sampling in stochastic optimal control.

Findings

01

Effective sampling of low-cost trajectories demonstrated

02

Enhanced stability in stochastic control computations

03

Successful application to multiple nonlinear systems

Abstract

This paper considers optimal control of dynamical systems which are represented by nonlinear stochastic differential equations. It is well-known that the optimal control policy for this problem can be obtained as a function of a value function that satisfies a nonlinear partial differential equation, namely, the Hamilton-Jacobi-Bellman equation. This nonlinear PDE must be solved backwards in time, and this computation is intractable for large scale systems. Under certain assumptions, and after applying a logarithmic transformation, an alternative characterization of the optimal policy can be given in terms of a path integral. Path Integral (PI) based control methods have recently been shown to provide elegant solutions to a broad class of stochastic optimal control problems. One of the implementation challenges with this formalism is the computation of the expectation of a cost…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Control Systems Optimization · Gaussian Processes and Bayesian Inference