Training-free Diffusion Model Alignment with Sampling Demons
Po-Hung Yeh, Kuang-Huei Lee, Jun-Cheng Chen

TL;DR
This paper introduces Demon, a novel inference-time, training-free method for aligning diffusion models with user preferences by controlling noise distribution during denoising, validated through experiments with non-differentiable rewards.
Contribution
Demon is the first inference-time, backpropagation-free approach for diffusion model alignment, enabling preference tuning without retraining or differentiable reward functions.
Findings
Significantly improves aesthetics scores in text-to-image generation.
Works with non-differentiable reward sources like VLM APIs and human judgments.
Easily integrates with existing diffusion models without additional training.
Abstract
Aligning diffusion models with user preferences has been a key challenge. Existing methods for aligning diffusion models either require retraining or are limited to differentiable reward functions. To address these limitations, we propose a stochastic optimization approach, dubbed Demon, to guide the denoising process at inference time without backpropagation through reward functions or model retraining. Our approach works by controlling noise distribution in denoising steps to concentrate density on regions corresponding to high rewards through stochastic optimization. We provide comprehensive theoretical and empirical evidence to support and validate our approach, including experiments that use non-differentiable sources of rewards such as Visual-Language Model (VLM) APIs and human judgements. To the best of our knowledge, the proposed approach is the first inference-time,…
Peer Reviews
Decision·ICLR 2025 Poster
- very clearly written and polished writing - clear motivation - compares to prior methods systematically, and shows consistent improvements
- It's hard to gauge the significance/improvements in some of the results. For example, Appendix G.2 the numbers are all tightly concentrated around 0.2, so being unfamiliar with the metric, it's hard to know whether the differences are statistically significant or interesting. - It would be nice to see more experiments with more realistic/extreme use cases of alignment. Most of the variations explored in the paper seem sort of minor. For example, make the model (not) generate NSFW material (or
Overall, this paper is thoroughly written, with comprehensive theoretical and empirical analysis. The implementation details are carefully listed, and various qualitative and quantitative metrics are displayed.
In the “Alignment with Preferences of VLMs” experiment, the role of the VLM doesn’t seem to exhibit a clear preference for a specific style, etc. I understand that the goal is to demonstrate that your method can optimize for a particular non-differentiable preference, such as a VLM API. However, based on the results presented, I’m not convinced that this experiment design effectively shows that your method truly optimizes for a specific preference.
1. The concept of analyzing noise quality is interesting 2. The proposed method is a backpropagation-free preference alignment approach that operates during inference, allowing it to be applied flexibly across various conditions.
1. The comparison includes only a single baseline method, and quantitative results for BoN are missing in Table 3. 2. The evaluation is limited to a small subset of the dataset from DDPO [1]. Expanding the evaluation to include larger datasets, such as the full set from DDPO [1] and the Human Preference Dataset (HPDv2) [2], is recommended. 3. In Table 2, there remains a large performance gap between the 1-step CM and the 6-step ODE. It would be beneficial to consider additional state-of-the-art
Code & Models
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis
MethodsDiffusion · Demon
