Training-Free Adaptation of Diffusion Models via Doob's $h$-Transform
Qijie Zhu, Zeqi Ye, Han Liu, Zhaoran Wang, Minshuo Chen

TL;DR
This paper introduces DOIT, a training-free, efficient method for adapting pre-trained diffusion models to high-reward distributions using Doob's $h$-transform, with theoretical guarantees and empirical success on RL benchmarks.
Contribution
We propose a novel, training-free adaptation technique for diffusion models that leverages Doob's $h$-transform, enabling efficient, non-differentiable reward optimization with theoretical guarantees.
Findings
Outperforms state-of-the-art baselines on D4RL benchmarks.
Provides theoretical convergence guarantees for the adaptation process.
Maintains sampling efficiency while achieving high-reward distribution alignment.
Abstract
Adaptation methods have been a workhorse for unlocking the transformative power of pre-trained diffusion models in diverse applications. Existing approaches often abstract adaptation objectives as a reward function and steer diffusion models to generate high-reward samples. However, these approaches can incur high computational overhead due to additional training, or rely on stringent assumptions on the reward such as differentiability. Moreover, despite their empirical success, theoretical justification and guarantees are seldom established. In this paper, we propose DOIT (Doob-Oriented Inference-time Transformation), a training-free and computationally efficient adaptation method that applies to generic, non-differentiable rewards. The key framework underlying our method is a measure transport formulation that seeks to transport the pre-trained generative distribution to a high-reward…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Advanced Neuroimaging Techniques and Applications
