Discrete-time Approximation of Stochastic Optimal Control with Partial   Observation

Yunzhang Li; Xiaolu Tan; Shanjian Tang

arXiv:2302.03329·math.OC·May 22, 2023·SIAM J. Control. Optim.

Discrete-time Approximation of Stochastic Optimal Control with Partial Observation

Yunzhang Li, Xiaolu Tan, Shanjian Tang

PDF

Open Access

TL;DR

This paper develops a convergent discrete-time approximation method for stochastic optimal control problems with partial observation, enabling practical numerical algorithms and potential integration with machine learning techniques.

Contribution

It introduces a new discrete-time approximation framework with proven convergence for partially observed stochastic control problems, facilitating numerical solutions.

Findings

01

Established convergence of discrete-time control approximations

02

Provided a practical numerical algorithm with convergence guarantees

03

Demonstrated the approach with linear quadratic numerical experiments

Abstract

We consider a class of stochastic optimal control problems with partial observation, and study their approximation by discrete-time control problems. We establish a convergence result by using weak convergence technique of Kushner and Dupuis [Numerical Methods for Stochastic Control Problems in Continuous Time (2001), Springer-Verlag, New York], together with the notion of relaxed control rule introduced by El Karoui, Huu Nguyen and Jeanblanc-Picqu\'e [SIAM J. Control Optim., 26 (1988) 1025-1061]. In particular, with a well chosen discrete-time control system, we obtain a first implementable numerical algorithm (with convergence) for the partially observed control problem. Moreover, our discrete-time approximation result would open the door to study convergence of more general numerical approximation methods, such as machine learning based methods. Finally, we illustrate our convergence…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic processes and financial applications