Data-Driven Stochastic Optimal Control in Reproducing Kernel Hilbert Spaces

Nicolas Hoischen; Petar Bevanda; Stefan Sosnowski; Sandra Hirche; Boris Houska

arXiv:2407.16407·math.OC·November 3, 2025

Data-Driven Stochastic Optimal Control in Reproducing Kernel Hilbert Spaces

Nicolas Hoischen, Petar Bevanda, Stefan Sosnowski, Sandra Hirche, Boris Houska

PDF

Open Access

TL;DR

This paper introduces a data-driven method for nonlinear stochastic optimal control using RKHS embeddings and operator regression, enabling scalable solutions for complex control tasks without explicit system models.

Contribution

It presents a novel RKHS-based operator learning framework that integrates with Hamilton-Jacobi-Bellman recursions for scalable nonlinear stochastic control.

Findings

01

Successfully applied to autonomous underwater vehicle depth regulation

02

Handles unknown nonlinear dynamics and costs using only control penalties

03

Scales linearly with state dimensionality in complex control tasks

Abstract

This paper proposes a fully data-driven approach for optimal control of nonlinear control-affine systems represented by a stochastic diffusion. The focus is on the scenario where both the nonlinear dynamics and stage cost functions are unknown, while only a control penalty function and constraints are provided. To this end, we embed state probability densities into a reproducing kernel Hilbert space (RKHS) to leverage recent advances in operator regression, thereby identifying Markov transition operators associated with controlled diffusion processes. This operator learning approach integrates naturally with convex operator-theoretic Hamilton-Jacobi-Bellman recursions that scale linearly with state dimensionality, effectively solving a wide range of nonlinear optimal control problems. Numerical results demonstrate its ability to address diverse nonlinear control tasks, including the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Reinforcement Learning in Robotics · Stochastic Gradient Optimization Techniques

MethodsFocus · Diffusion