Malliavin Calculus as Stochastic Backpropogation

Kevin D. Oden

arXiv:2602.17013·cs.LG·February 20, 2026

Malliavin Calculus as Stochastic Backpropogation

Kevin D. Oden

PDF

Open Access

TL;DR

This paper unifies pathwise and score-function gradient estimators using Malliavin calculus, introduces a hybrid estimator with variance reduction, and demonstrates its effectiveness in variational autoencoders and synthetic problems.

Contribution

It establishes a theoretical connection between gradient estimators and proposes a variance-aware hybrid estimator with convergence guarantees.

Findings

01

9% variance reduction on CIFAR-10 VAEs

02

up to 35% variance reduction on synthetic problems

03

identifies challenges in non-stationary optimization landscapes

Abstract

We establish a rigorous connection between pathwise (reparameterization) and score-function (Malliavin) gradient estimators by showing that both arise from the Malliavin integration-by-parts identity. Building on this equivalence, we introduce a unified and variance-aware hybrid estimator that adaptively combines pathwise and Malliavin gradients using their empirical covariance structure. The resulting formulation provides a principled understanding of stochastic backpropagation and achieves minimum variance among all unbiased linear combinations, with closed-form finite-sample convergence bounds. We demonstrate 9% variance reduction on VAEs (CIFAR-10) and up to 35% on strongly-coupled synthetic problems. Exploratory policy gradient experiments reveal that non-stationary optimization landscapes present challenges for the hybrid approach, highlighting important directions for future…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Generative Adversarial Networks and Image Synthesis · Reinforcement Learning in Robotics