Towards General Preference Alignment: Diffusion Models at Nash Equilibrium

Jiaming Hu; Jiamu Bai; Haoyu Wang; Debarghya Mukherjee; Ioannis Ch. Paschalidis

arXiv:2605.04494·cs.LG·May 7, 2026

Towards General Preference Alignment: Diffusion Models at Nash Equilibrium

Jiaming Hu, Jiamu Bai, Haoyu Wang, Debarghya Mukherjee, Ioannis Ch. Paschalidis

PDF

TL;DR

This paper introduces Diffusion Nash Preference Optimization (Diff.-NPO), a game-theoretic approach for aligning diffusion models with human preferences, outperforming existing methods in text-to-image generation tasks.

Contribution

It proposes a novel game-theoretic framework for diffusion alignment that does not rely on reward models or the Bradley--Terry assumption, improving preference alignment.

Findings

01

Diff.-NPO outperforms existing preference-based methods in text-to-image tasks.

02

The framework encourages self-play to improve alignment.

03

Empirical results demonstrate better metrics compared to prior approaches.

Abstract

Reinforcement learning from human feedback (RLHF) has been popular for aligning text-to-image (T2I) diffusion models with human preferences. As a mainstream branch of RLHF, Direct Preference Optimization (DPO) offers a computationally efficient alternative that avoids explicit reward modeling and has been widely adopted in diffusion alignment. However, existing preference-based methods for diffusion alignment still rely on reward-induced preference signals and typically assume that human preferences can be adequately modeled by the Bradley--Terry (BT) model, which may fail to capture the full complexity of human preferences. In this paper, we formulate diffusion alignment from a game-theoretic perspective. We propose Diffusion Nash Preference Optimization (Diff.-NPO), an intuitive general preference framework for diffusion alignment. Diff.-NPO encourages the current policy to play…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.