Continuous-time stochastic gradient descent for optimizing over the stationary distribution of stochastic differential equations
Ziheng Wang, Justin Sirignano

TL;DR
This paper introduces a novel continuous-time stochastic gradient descent method for optimizing the stationary distribution of SDE models, with rigorous convergence analysis and applications in finance, control, and neural networks.
Contribution
It develops a new online gradient algorithm for SDEs, proves its convergence for linear models, and analyzes parameter fluctuations using Poisson PDEs.
Findings
Convergence proven for linear SDE models.
Numerical results demonstrate effectiveness on nonlinear examples.
Applicable to finance, control, and neural network training.
Abstract
We develop a new continuous-time stochastic gradient descent method for optimizing over the stationary distribution of stochastic differential equation (SDE) models. The algorithm continuously updates the SDE model's parameters using an estimate for the gradient of the stationary distribution. The gradient estimate is simultaneously updated using forward propagation of the SDE state derivatives, asymptotically converging to the direction of steepest descent. We rigorously prove convergence of the online forward propagation algorithm for linear SDE models (i.e., the multi-dimensional Ornstein-Uhlenbeck process) and present its numerical results for nonlinear examples. The proof requires analysis of the fluctuations of the parameter evolution around the direction of steepest descent. Bounds on the fluctuations are challenging to obtain due to the online nature of the algorithm (e.g., the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · Stochastic Gradient Optimization Techniques · Model Reduction and Neural Networks
