Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generation of Diffusion Models

Fu-Yun Wang; Yunhao Shui; Jingtan Piao; Keqiang Sun; Hongsheng Li

arXiv:2505.11245·cs.CV·May 19, 2025

Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generation of Diffusion Models

Fu-Yun Wang, Yunhao Shui, Jingtan Piao, Keqiang Sun, Hongsheng Li

PDF

Open Access 1 Repo 3 Reviews

TL;DR

This paper introduces Diffusion-NPO, a method that trains a model to recognize negative preferences, improving the alignment of diffusion model outputs with human preferences without extensive retraining.

Contribution

It proposes a simple, effective approach to handle negative preferences in diffusion models, enhancing preference alignment without new datasets or training strategies.

Findings

01

Improves alignment of diffusion models with human preferences.

02

Seamlessly integrates with existing diffusion models like SD1.5 and SDXL.

03

Enhances avoidance of undesirable outputs.

Abstract

Diffusion models have made substantial advances in image generation, yet models trained on large, unfiltered datasets often yield outputs misaligned with human preferences. Numerous methods have been proposed to fine-tune pre-trained diffusion models, achieving notable improvements in aligning generated outputs with human preferences. However, we argue that existing preference alignment methods neglect the critical role of handling unconditional/negative-conditional outputs, leading to a diminished capacity to avoid generating undesirable outcomes. This oversight limits the efficacy of classifier-free guidance~(CFG), which relies on the contrast between conditional generation and unconditional/negative-conditional generation to optimize output quality. In response, we propose a straightforward but versatile effective approach that involves training a model specifically attuned to…

Peer Reviews

Decision·ICLR 2025 Poster

Reviewer 01Rating 8Confidence 3

Strengths

- Simplicity and generality. The method is simple, intuitive, and can augment a variety of existing preference alignment approaches without additional training data and learning objectives. - Strong results. The method achieves superior qualitative and quantitative results in comparison to baselines not using NPO. - Efficiency. The method obtain stronger results without losing inference-time efficiency.

Weaknesses

I did not find major weaknesses of the paper. Disclaimer: While I find the approach interesting and reasonable, I am not an expert on preference optimization of diffusion models, so I am not able to comment on the novelty of the method and the selection of baselines / benchmarks.

Reviewer 02Rating 6Confidence 4

Strengths

This paper introduces a novel technique called Negative Preference Optimization (NPO) to improve the alignment of diffusion models with human preferences. Unlike traditional methods that focus solely on desirable features, NPO addresses the problem of undesirable outputs by training the model to recognize and avoid them. This innovative approach is both creative and practical, as it leverages existing preference data by simply reversing image pair rankings. The paper demonstrates that this simpl

Weaknesses

1. While the paper's introduction of Negative Preference Optimization (NPO) is innovative, the simple reversal of preference pair rankings may oversimplify the complexity of human aesthetics. Negative preferences are not always straightforward opposites of positive preferences, and undesirable features can be subtle or context-dependent. 2. Metrics-driven approach lacking user perspective 3. The paper validates NPO primarily on general-purpose datasets and models like Stable Diffusion and Drea

Reviewer 03Rating 8Confidence 4

Strengths

- The main idea is very intuitive, simple to implement, and highly effective. - The method is compared against several baseline alignment methods on multiple image and video diffusion models. The proposed algorithm is shown to be an improvement using various quality metrics and human user studies. - The proposed technique is quite general, and can be used alongside any alignment algorithm.

Weaknesses

The paper doesn’t introduce a new alignment technique; instead, it builds on existing alignment algorithms to train the negative-aligned model. This has both pros and cons: on the plus side, it can work alongside any alignment method, but on the downside, its quality is limited by the performance of the alignment algorithm used.

Code & Models

Repositories

g-u-n/diffusion-npo
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications

MethodsDiffusion