Approximate Q-Learning for Controlled Diffusion Processes and its Near   Optimality

Erhan Bayraktar; Ali Devran Kara

arXiv:2203.07499·math.OC·March 10, 2023

Approximate Q-Learning for Controlled Diffusion Processes and its Near Optimality

Erhan Bayraktar, Ali Devran Kara

PDF

Open Access

TL;DR

This paper introduces a Q-learning algorithm for continuous-time stochastic control problems, providing convergence guarantees, error bounds, and complexity analysis for discretized state and control spaces.

Contribution

It develops a novel Q-learning method for continuous-time control, with theoretical error bounds and complexity analysis based on discretization parameters.

Findings

01

Convergence to the optimality equation of a finite MDP.

02

Explicit upper bounds on approximation and performance loss.

03

Time complexity bounds as functions of discretization parameters.

Abstract

We study a Q learning algorithm for continuous time stochastic control problems. The proposed algorithm uses the sampled state process by discretizing the state and control action spaces under piece-wise constant control processes. We show that the algorithm converges to the optimality equation of a finite Markov decision process (MDP). Using this MDP model, we provide an upper bound for the approximation error for the optimal value function of the continuous time control problem. Furthermore, we present provable upper-bounds for the performance loss of the learned control process compared to the optimal admissible control process of the original problem. The provided error upper-bounds are functions of the time and space discretization parameters, and they reveal the effect of different levels of the approximation: (i) approximation of the continuous time control problem by an MDP,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFault Detection and Control Systems · Reinforcement Learning in Robotics · Machine Learning and ELM