Inference-Time Alignment for Diffusion Models via Variationally Stable Doob's Matching
Jinyuan Chang, Chenguang Duan, Yuling Jiao, Yi Xu, Jerry Zhijian Yang

TL;DR
This paper introduces a new inference-time guidance method for diffusion models using Doob's $h$-transform, providing provable convergence guarantees and robustness to high-dimensional data.
Contribution
It proposes variationally stable Doob's matching, a novel guidance estimation framework with theoretical convergence guarantees and adaptability to low-dimensional structures.
Findings
Provides non-asymptotic convergence rates for guidance estimation.
Proves convergence guarantees for generated distributions in Wasserstein distance.
Demonstrates robustness to high-dimensional data under low-dimensional assumptions.
Abstract
Inference-time alignment for diffusion models aims to adapt a pre-trained reference diffusion model toward a target distribution without retraining the reference score network, thereby preserving the generative capacity of the reference model while enforcing desired properties at the inference time. A central mechanism for achieving such alignment is guidance, which modifies the sampling dynamics through an additional drift term. In this work, we introduce variationally stable Doob's matching, a novel framework for provable guidance estimation grounded in Doob's -transform. Our approach formulates guidance as the gradient of logarithm of an underlying Doob's -function and employs gradient-regularized regression to simultaneously estimate both the -function and its gradient, resulting in a consistent estimator of the guidance. Theoretically, we establish non-asymptotic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods
