Mirror descent for constrained stochastic control problems

Deven Sethi; David \v{S}i\v{s}ka

arXiv:2506.02564·math.OC·June 4, 2025

Mirror descent for constrained stochastic control problems

Deven Sethi, David \v{S}i\v{s}ka

PDF

Open Access

TL;DR

This paper develops continuous-time mirror descent methods for constrained stochastic control problems, demonstrating linear and exponential convergence under convexity conditions, and addresses key analytical challenges with PDE and Sobolev space techniques.

Contribution

It introduces a novel mirror descent framework for stochastic control with convex action spaces and provides convergence analysis under convexity assumptions.

Findings

01

Mirror descent converges linearly when the Hamiltonian is uniformly convex.

02

Exponential convergence occurs if the Hamiltonian is strongly convex relative to a Bregman divergence.

03

The paper overcomes analytical challenges using PDE estimates and the performance difference lemma.

Abstract

Mirror descent is a well established tool for solving convex optimization problems with convex constraints. This article introduces continuous-time mirror descent dynamics for approximating optimal Markov controls for stochastic control problems with the action space being bounded and convex. We show that if the Hamiltonian is uniformly convex in its action variable then mirror descent converges linearly while if it is uniformly strongly convex relative to an appropriate Bregman divergence, then the mirror flow converges exponentially. The two fundamental difficulties that must be overcome to prove such results are: first, the inherent lack of convexity of the map from Markov controls to the corresponding value function. Second, maintaining sufficient regularity of the value function and the Markov controls along the mirror descent updates. The first issue is handled using the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAerospace Engineering and Control Systems · Optimization and Search Problems · Advanced Control Systems Optimization