Fast and Correct Gradient-Based Optimisation for Probabilistic Programming via Smoothing
Basim Khajwal, C.-H. Luke Ong, Dominik Wagner

TL;DR
This paper introduces a smoothing technique for probabilistic programming that ensures the correctness of gradient-based optimization, leading to faster convergence and reduced variance in variational inference.
Contribution
It develops a theoretical framework with type systems for smoothing in probabilistic programming, enabling correct and efficient gradient-based optimization.
Findings
Comparable convergence speed to existing methods
Simpler and faster implementation
Significant reduction in work-normalised variance
Abstract
We study the foundations of variational inference, which frames posterior inference as an optimisation problem, for probabilistic programming. The dominant approach for optimisation in practice is stochastic gradient descent. In particular, a variant using the so-called reparameterisation gradient estimator exhibits fast convergence in a traditional statistics setting. Unfortunately, discontinuities, which are readily expressible in programming languages, can compromise the correctness of this approach. We consider a simple (higher-order, probabilistic) programming language with conditionals, and we endow our language with both a measurable and a smoothed (approximate) value semantics. We present type systems which establish technical pre-conditions. Thus we can prove stochastic gradient descent with the reparameterisation gradient estimator to be correct when applied to the smoothed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Explainable Artificial Intelligence (XAI) · Single-cell and spatial transcriptomics
