Diffusive limit approximation of pure jump optimal ergodic control   problems

Marc Abeille; Bruno Bouchard (CEREMADE); Lorenzo Croissant (CEREMADE)

arXiv:2209.15284·math.OC·October 3, 2022

Diffusive limit approximation of pure jump optimal ergodic control problems

Marc Abeille, Bruno Bouchard (CEREMADE), Lorenzo Croissant (CEREMADE)

PDF

Open Access

TL;DR

This paper investigates the diffusive limit of pure jump ergodic control problems, providing error bounds, correction methods, and numerical strategies to improve computational efficiency in reinforcement learning contexts.

Contribution

It extends existing results to ergodic problems, introduces a first order error correction, and quantifies numerical errors from finite difference schemes.

Findings

01

Error governed by Hessian Hölder continuity

02

Constructed a first order error correction term

03

Quantified numerical error from finite difference schemes

Abstract

Motivated by the design of fast reinforcement learning algorithms, we study the diffusive limit of a class of pure jump ergodic stochastic control problems. We show that, whenever the intensity of jumps is large enough, the approximation error is governed by the H{\"o}lder continuity of the Hessian matrix of the solution to the limit ergodic partial differential equation. This extends to this context the results of [1] obtained for finite horizon problems. We also explain how to construct a first order error correction term under appropriate smoothness assumptions. Finally, we quantify the error induced by the use of the Markov control policy constructed from the numerical finite difference scheme associated to the limit diffusive problem, this seems to be new in the literature and of its own interest. This approach permits to reduce very significantly the numerical resolution cost.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAge of Information Optimization · Stochastic processes and financial applications · Traffic control and management