Batch Sample-wise Stochastic Optimal Control via Stochastic Maximum Principle

Hui Sun; Feng Bao

arXiv:2505.02688·math.OC·June 17, 2025

Batch Sample-wise Stochastic Optimal Control via Stochastic Maximum Principle

Hui Sun, Feng Bao

PDF

Open Access 1 Repo

TL;DR

This paper introduces an improved stochastic optimal control method using sample-wise stochastic maximum principle, higher order schemes, and a damped contraction algorithm, enhancing convergence and accuracy in control estimation.

Contribution

It proposes a novel batch sample-wise approach with higher order schemes and a damped contraction algorithm for stochastic optimal control, improving convergence rates and control accuracy.

Findings

01

Enhanced convergence rate from (rac{N}{K} + rac{1}{N}) to (rac{1}{K} + rac{1}{N^2})

02

Numerical validation of first order convergence rate

03

Potential for efficient control in practical problems with neural networks

Abstract

In this work, we study the stochastic optimal control problem (SOC) mainly from the probabilistic view point, i.e. via the Stochastic Maximum principle (SMP) \cite{Peng4}. We adopt the sample-wise backpropagation scheme proposed in \cite{Hui1} to solve the SOC problem under the strong convexity assumption. Importantly, in the Stochastic Gradient Descent (SGD) procedure, we use batch samples with higher order scheme in the forward SDE to improve the convergence rate in \cite{Hui1} from $\sim O (\frac{N}{K} + \frac{1}{N})$ to $\sim O (\frac{1}{K} + \frac{1}{N ^{2}})$ and note that the main source of uncertainty originates from the scheme for the simulation of $Z$ term in the BSDE. In the meantime, we note the SGD procedure uses only the necessary condition of the SMP, while the batch simulation of the approximating solution of BSDEs allows one to obtain a more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

huisun317/batch_smp
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization