Recovery of the optimal control value function in reproducing kernel Hilbert spaces from verification conditions
Tobias Ehring, Behzad Azmi, Bernard Haasdonk

TL;DR
This paper introduces an RKHS-based framework for recovering the optimal value function in nonlinear control problems from verification conditions, ensuring convergence and linking to policy iteration.
Contribution
It develops a novel abstract recovery method in RKHS for approximating the value function from verification conditions, with proven convergence and practical implementation via policy iteration.
Findings
RKHS approximants converge to the true value function as collocation points become dense.
The method guarantees global convergence for analytic value functions.
Numerical experiments demonstrate the approach's effectiveness.
Abstract
Approximating the optimal value function for infinite-horizon, nonlinear, autonomous optimal control problems is both challenging and essential for synthesizing real-time optimal feedback. We develop an abstract optimal recovery framework in reproducing kernel Hilbert spaces (RKHS) for reconstructing unknown target functions from mixed equality and inequality functional constraints. Within this framework, the approximation of is cast as a collocation-type problem derived from verification conditions for optimality -- most prominently, the Hamilton-Jacobi-Bellman (HJB) equation -- that uniquely characterizes . As the set of collocation points becomes dense in the ambient domain , we establish convergence of the RKHS approximants to : globally on in the RKHS norm when is analytic, and locally (in a neighborhood of the origin) in the RKHS norm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Dynamic Programming Control · Model Reduction and Neural Networks · Optimization and Variational Analysis
