Path Integral Control by Reproducing Kernel Hilbert Space Embedding

Konrad Rawlik; Marc Toussaint; Sethu Vijayakumar

arXiv:1208.2523·cs.LG·August 14, 2012·20 cites

Path Integral Control by Reproducing Kernel Hilbert Space Embedding

Konrad Rawlik, Marc Toussaint, Sethu Vijayakumar

PDF

Open Access

TL;DR

This paper introduces a kernel-based, model-free method for solving stochastic optimal control problems in the path integral framework, improving sample efficiency and reusability across tasks.

Contribution

It embeds path integral control problems into reproducing kernel Hilbert spaces, enabling non-parametric, sample-efficient solutions that separate invariant and task-specific components.

Findings

01

Enhanced sample efficiency over previous methods

02

Allows re-use of samples across different control tasks

03

Demonstrates effectiveness through numerical examples

Abstract

We present an embedding of stochastic optimal control problems, of the so called path integral form, into reproducing kernel Hilbert spaces. Using consistent, sample based estimates of the embedding leads to a model free, non-parametric approach for calculation of an approximate solution to the control problem. This formulation admits a decomposition of the problem into an invariant and task dependent component. Consequently, we make much more efficient use of the sample data compared to previous sample based approaches in this domain, e.g., by allowing sample re-use across tasks. Numerical examples on test problems, which illustrate the sample efficiency, are provided.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Probabilistic and Robust Engineering Design · Gaussian Processes and Bayesian Inference