Method of Successive Approximations for Stochastic Optimal Control:   Contractivity and Convergence

Safouane Taoufik; Badr Missaoui

arXiv:2405.07048·math.OC·May 14, 2024

Method of Successive Approximations for Stochastic Optimal Control: Contractivity and Convergence

Safouane Taoufik, Badr Missaoui

PDF

Open Access

TL;DR

This paper analyzes the convergence of the Method of Successive Approximations for stochastic optimal control problems, proving contractivity and stability under specific conditions on the system's coefficients.

Contribution

It provides a rigorous proof of the contractivity and convergence of MSA for a class of stochastic systems with Lipschitz continuous coefficients.

Findings

01

Proves stability of the state process under control variations

02

Establishes stability of the adjoint process

03

Demonstrates contractivity and convergence of MSA

Abstract

The Method of Successive Approximations (MSA) is a fixed-point iterative method used to solve stochastic optimal control problems. It is an indirect method based on the conditions derived from the Stochastic Maximum Principle (SMP), an extension of the Pontryagin Maximum Principle (PMP) to stochastic control problems. In this study, we investigate the contractivity and the convergence of MSA for a specific and interesting class of stochastic dynamical systems (when the drift coefficient is one-sided-Lipschitz with a negative constant and the diffusion coefficient is Lipschitz continuous). Our analysis unfolds in three key steps: firstly, we prove the stability of the state process with respect to the control process. Secondly, we establish the stability of the adjoint process. Finally, we present rigorous evidence to prove the contractivity and then the convergence of MSA. This study…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAerospace Engineering and Control Systems · Stochastic processes and financial applications · Material Science and Thermodynamics