Rates of Convergence in Certain Native Spaces of Approximations used in Reinforcement Learning

Ali Bouland; Shengyuan Niu; Sai Tej Paruchuri; Andrew Kurdila; John Burns; Eugenio Schuster

arXiv:2309.07383·eess.SY·November 20, 2023·2 cites

TL;DR

This paper derives explicit convergence rates and error bounds for value function approximations in native spaces (reproducing kernel Hilbert spaces) used in reinforcement learning, clarifying how approximation quality in policy iteration depends on the choice of finite-dimensional approximant space.

Contribution

It introduces new error bounds, geometric in nature, for value function and controller approximations in native spaces, refining classical convergence results in reinforcement learning.

Findings

01. Explicit error bounds in terms of power functions

02. Geometric convergence rates established

03. Refinement of classical approximation results

Abstract

This paper studies convergence rates for some value function approximations that arise in a collection of reproducing kernel Hilbert spaces (RKHS) $H(Ω)$. By casting an optimal control problem in a specific class of native spaces, strong rates of convergence are derived for the operator equation that enables offline approximations that appear in policy iteration. Explicit upper bounds on the error in value function and controller approximations are derived in terms of the power function $P_{H,N}$ for the space of finite-dimensional approximants $H_{N}$ in the native space $H(Ω)$. These bounds are geometric in nature and refine some well-known, now classical results concerning convergence of approximations of value functions.
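
For readers unfamiliar with the power function named in the abstract, the following is a sketch of its standard definition in kernel approximation theory, not a formula taken from the paper itself. The kernel $K$, the centers $\xi_1, \dots, \xi_N$, and the orthogonal projection $\Pi_N$ onto $H_N$ are notation assumed here for illustration, and the Gram matrix $\mathbb{K}_N$ is assumed invertible (for example, a strictly positive definite kernel with distinct centers).

```latex
% Assumed setup: H_N = span{ K(., xi_1), ..., K(., xi_N) } inside H(Omega),
% Pi_N is the H(Omega)-orthogonal projection onto H_N.
\[
  k_N(x) = \bigl[ K(x,\xi_1), \dots, K(x,\xi_N) \bigr]^{T},
  \qquad
  \mathbb{K}_N = \bigl[ K(\xi_i,\xi_j) \bigr]_{i,j=1}^{N}
\]
% Power function: the norm of the part of K(., x) that H_N cannot represent.
\[
  P_{H,N}(x)
  = \bigl\| K(\cdot,x) - \Pi_N K(\cdot,x) \bigr\|_{H}
  = \sqrt{ K(x,x) - k_N(x)^{T}\, \mathbb{K}_N^{-1}\, k_N(x) }
\]
% Pointwise error bound for any f in H(Omega), via the reproducing property
% and the Cauchy-Schwarz inequality.
\[
  \bigl| f(x) - (\Pi_N f)(x) \bigr|
  \le P_{H,N}(x)\, \| f - \Pi_N f \|_{H}
  \le P_{H,N}(x)\, \| f \|_{H}
\]
```

Under these assumptions, $P_{H,N}$ vanishes at each center $\xi_i$, so a bound stated in terms of the power function ties the approximation error to how well the centers cover $Ω$.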

Taxonomy

Topics: Model Reduction and Neural Networks · Stability and Controllability of Differential Equations · Stability and Control of Uncertain Systems