Data-Driven Stochastic Optimal Control Using Kernel Gradients

Adam J. Thorpe; Jake A. Gonzales; Meeko M. K. Oishi

arXiv:2209.09205·math.OC·September 20, 2022

Data-Driven Stochastic Optimal Control Using Kernel Gradients

Adam J. Thorpe, Jake A. Gonzales, Meeko M. K. Oishi

PDF

Open Access

TL;DR

This paper introduces a novel data-driven stochastic optimal control method leveraging kernel embeddings to efficiently approximate and optimize control policies using gradient descent in a reproducing kernel Hilbert space.

Contribution

It presents a gradient-based approach using kernel embeddings for stochastic control, enabling optimization over continuous control spaces with observed data.

Findings

01

Effective in linear regulation problems

02

Applicable to nonlinear target tracking

03

Demonstrates efficiency of kernel gradient methods

Abstract

We present an empirical, gradient-based method for solving data-driven stochastic optimal control problems using the theory of kernel embeddings of distributions. By embedding the integral operator of a stochastic kernel in a reproducing kernel Hilbert space, we can compute an empirical approximation of stochastic optimal control problems, which can then be solved efficiently using the properties of the RKHS. Existing approaches typically rely upon finite control spaces or optimize over policies with finite support to enable optimization. In contrast, our approach uses kernel-based gradients computed using observed data to approximate the cost surface of the optimal control problem, which can then be optimized using gradient descent. We apply our technique to the area of data-driven stochastic optimal control, and demonstrate our proposed approach on a linear regulation problem for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Gaussian Processes and Bayesian Inference · Stochastic Gradient Optimization Techniques