Malliavin Calculus as Stochastic Backpropogation
Kevin D. Oden

TL;DR
This paper unifies pathwise and score-function gradient estimators using Malliavin calculus, introduces a hybrid estimator with variance reduction, and demonstrates its effectiveness in variational autoencoders and synthetic problems.
Contribution
It establishes a theoretical connection between gradient estimators and proposes a variance-aware hybrid estimator with convergence guarantees.
Findings
9% variance reduction on CIFAR-10 VAEs
up to 35% variance reduction on synthetic problems
identifies challenges in non-stationary optimization landscapes
Abstract
We establish a rigorous connection between pathwise (reparameterization) and score-function (Malliavin) gradient estimators by showing that both arise from the Malliavin integration-by-parts identity. Building on this equivalence, we introduce a unified and variance-aware hybrid estimator that adaptively combines pathwise and Malliavin gradients using their empirical covariance structure. The resulting formulation provides a principled understanding of stochastic backpropagation and achieves minimum variance among all unbiased linear combinations, with closed-form finite-sample convergence bounds. We demonstrate 9% variance reduction on VAEs (CIFAR-10) and up to 35% on strongly-coupled synthetic problems. Exploratory policy gradient experiments reveal that non-stationary optimization landscapes present challenges for the hybrid approach, highlighting important directions for future…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Generative Adversarial Networks and Image Synthesis · Reinforcement Learning in Robotics
