Inference-Time Alignment of Diffusion Models with Direct Noise Optimization
Zhiwei Tang, Jiangweizhi Peng, Jiasheng Tang, Mingyi Hong, Fan Wang,, Tsung-Hui Chang

TL;DR
This paper introduces Direct Noise Optimization (DNO), a novel inference-time method for aligning diffusion models with specific reward functions, improving sample quality according to desired objectives without retraining.
Contribution
The paper presents DNO, a tuning-free, inference-time alignment method for diffusion models, with theoretical analysis, variants for non-differentiable rewards, and solutions to reward hacking issues.
Findings
DNO achieves state-of-the-art reward scores in experiments.
DNO operates efficiently at inference-time without retraining.
Effective regularization mitigates reward hacking problems.
Abstract
In this work, we focus on the alignment problem of diffusion models with a continuous reward function, which represents specific objectives for downstream tasks, such as increasing darkness or improving the aesthetics of images. The central goal of the alignment problem is to adjust the distribution learned by diffusion models such that the generated samples maximize the target reward function. We propose a novel alignment approach, named Direct Noise Optimization (DNO), that optimizes the injected noise during the sampling process of diffusion models. By design, DNO operates at inference-time, and thus is tuning-free and prompt-agnostic, with the alignment occurring in an online fashion during generation. We rigorously study the theoretical properties of DNO and also propose variants to deal with non-differentiable reward functions. Furthermore, we identify that naive implementation of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Advanced Mathematical Modeling in Engineering
MethodsFocus · Diffusion
