Efficient Online Mirror Descent Stochastic Approximation for Multi-Stage Stochastic Programming

Junhui Zhang; Patrick Jaillet

arXiv:2506.15392·math.OC·March 2, 2026

Efficient Online Mirror Descent Stochastic Approximation for Multi-Stage Stochastic Programming

Junhui Zhang, Patrick Jaillet

PDF

Open Access

TL;DR

This paper introduces an efficient online mirror descent stochastic approximation method for multi-stage stochastic programming, achieving linear gradient complexity in the number of stages, significantly improving over previous algorithms.

Contribution

The paper proposes a novel MDSA algorithm for multi-stage stochastic programming with linear gradient complexity, leveraging decomposability and stochastic conditional gradients.

Findings

01

Achieves gradient complexity linear in the number of stages T.

02

Provides an online implementation suitable for high-dimensional problems.

03

Improves exponentially over existing algorithms in efficiency.

Abstract

We study the unconstrained and the minimax saddle point variants of the convex multi-stage stochastic programming problem, where consecutive decisions are coupled through the objective functions, rather than through the constraints. We approach the problems from the infinite-dimensional policy perspective, but consider an online setting where only the policies corresponding to the actual realization of the underlying stochastic process is needed. This leads to a trackable formulation, where the dimension of the output is linear in the number of stages $T$ . We propose hypothetical Mirror Descent Stochastic Approximation (MDSA) for the infinite dimensional policies using stochastic conditional gradients. By taking advantage of the decomposability of the updates across stages and realizations of the underlying stochastic process, we show that the proposed MDSA algorithms admit efficient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Portfolio Optimization · Stochastic Gradient Optimization Techniques