Inference-Time Alignment for Diffusion Models via Variationally Stable Doob's Matching

Jinyuan Chang; Chenguang Duan; Yuling Jiao; Yi Xu; Jerry Zhijian Yang

arXiv:2601.06514·stat.ML·February 5, 2026

Inference-Time Alignment for Diffusion Models via Variationally Stable Doob's Matching

Jinyuan Chang, Chenguang Duan, Yuling Jiao, Yi Xu, Jerry Zhijian Yang

PDF

Open Access

TL;DR

This paper introduces a new inference-time guidance method for diffusion models using Doob's $h$-transform, providing provable convergence guarantees and robustness to high-dimensional data.

Contribution

It proposes variationally stable Doob's matching, a novel guidance estimation framework with theoretical convergence guarantees and adaptability to low-dimensional structures.

Findings

01

Provides non-asymptotic convergence rates for guidance estimation.

02

Proves convergence guarantees for generated distributions in Wasserstein distance.

03

Demonstrates robustness to high-dimensional data under low-dimensional assumptions.

Abstract

Inference-time alignment for diffusion models aims to adapt a pre-trained reference diffusion model toward a target distribution without retraining the reference score network, thereby preserving the generative capacity of the reference model while enforcing desired properties at the inference time. A central mechanism for achieving such alignment is guidance, which modifies the sampling dynamics through an additional drift term. In this work, we introduce variationally stable Doob's matching, a novel framework for provable guidance estimation grounded in Doob's $h$ -transform. Our approach formulates guidance as the gradient of logarithm of an underlying Doob's $h$ -function and employs gradient-regularized regression to simultaneously estimate both the $h$ -function and its gradient, resulting in a consistent estimator of the guidance. Theoretically, we establish non-asymptotic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods