Time-Reversed BSDEs for Accurate Gradient Estimation in Diffusion Models

Yuhang Mei; Amirhossein Taghvaei

arXiv:2603.20455·math.OC·March 24, 2026

Time-Reversed BSDEs for Accurate Gradient Estimation in Diffusion Models

Yuhang Mei, Amirhossein Taghvaei

PDF

Open Access

TL;DR

This paper introduces a time-reversed BSDE approach for more stable and accurate gradient estimation in diffusion models, addressing limitations of existing adjoint matching methods.

Contribution

It proposes a novel time-reversed BSDE estimator that produces an adapted adjoint process, improving gradient stability and reducing variance in diffusion model optimization.

Findings

01

Enhanced gradient stability demonstrated in toy experiments

02

Lower variance in gradient estimates compared to adjoint matching

03

Competitive performance in fine-tuning diffusion models

Abstract

There is a growing literature adopting a stochastic optimal control (SOC) perspective to fine-tune diffusion models and related generative policies. A prominent class of methods, known as iterative diffusion optimization, solves the SOC problem by simulating the diffusion process, evaluating a loss function, and applying stochastic optimization algorithms, with adjoint matching emerging as a state-of-the-art approach. However, the adjoint process used in these methods is not adapted to the forward diffusion filtration, which can lead to unstable or high-variance gradient estimates. In this paper, we revisit gradient estimation in diffusion models through the lens of backward stochastic differential equations (BSDEs). We propose an alternative estimator based on a time-reversed BSDE formulation introduced in our prior work, which produces an adjoint process adapted to the underlying…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Stochastic processes and financial applications · Model Reduction and Neural Networks