Batch Sample-wise Stochastic Optimal Control via Stochastic Maximum Principle
Hui Sun, Feng Bao

TL;DR
This paper introduces an improved stochastic optimal control method using sample-wise stochastic maximum principle, higher order schemes, and a damped contraction algorithm, enhancing convergence and accuracy in control estimation.
Contribution
It proposes a novel batch sample-wise approach with higher order schemes and a damped contraction algorithm for stochastic optimal control, improving convergence rates and control accuracy.
Findings
Enhanced convergence rate from (rac{N}{K} + rac{1}{N}) to (rac{1}{K} + rac{1}{N^2})
Numerical validation of first order convergence rate
Potential for efficient control in practical problems with neural networks
Abstract
In this work, we study the stochastic optimal control problem (SOC) mainly from the probabilistic view point, i.e. via the Stochastic Maximum principle (SMP) \cite{Peng4}. We adopt the sample-wise backpropagation scheme proposed in \cite{Hui1} to solve the SOC problem under the strong convexity assumption. Importantly, in the Stochastic Gradient Descent (SGD) procedure, we use batch samples with higher order scheme in the forward SDE to improve the convergence rate in \cite{Hui1} from to and note that the main source of uncertainty originates from the scheme for the simulation of term in the BSDE. In the meantime, we note the SGD procedure uses only the necessary condition of the SMP, while the batch simulation of the approximating solution of BSDEs allows one to obtain a more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization
