Robust and Scalable SDE Learning: A Functional Perspective
Scott Cameron, Tyron Cameron, Arnu Pretorius, Stephen Roberts

TL;DR
This paper introduces a novel importance-sampling estimator for learning stochastic differential equations that reduces variance, eliminates the need for sequential integrators, and leverages parallel computing for efficiency.
Contribution
It proposes a new importance-sampling based method for SDE learning that is computationally efficient and highly parallelizable, improving over existing integrator-based approaches.
Findings
Lower-variance gradient estimates compared to traditional methods
Elimination of sequential SDE integrators
Enhanced scalability through parallelization
Abstract
Stochastic differential equations provide a rich class of flexible generative models, capable of describing a wide range of spatio-temporal processes. A host of recent work looks to learn data-representing SDEs, using neural networks and other flexible function approximators. Despite these advances, learning remains computationally expensive due to the sequential nature of SDE integrators. In this work, we propose an importance-sampling estimator for probabilities of observations of SDEs for the purposes of learning. Crucially, the approach we suggest does not rely on such integrators. The proposed method produces lower-variance gradient estimates compared to algorithms based on SDE integrators and has the added advantage of being embarrassingly parallelizable. This facilitates the effective use of large-scale parallel hardware for massive decreases in computation time.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsModel Reduction and Neural Networks · Gaussian Processes and Bayesian Inference · Hydrology and Drought Analysis
