Beyond Binary Preference: Aligning Diffusion Models to Fine-grained Criteria by Decoupling Attributes

Chenye Meng; Zejian Li; Zhongni Liu; Yize Li; Changle Xie; Kaixin Jia; Ling Yang; Huanghuang Deng; Shiying Ding; Shengyuan Zhang; Jiayi Li; Lingyun Sun

arXiv:2601.04300·cs.CV·January 9, 2026

Beyond Binary Preference: Aligning Diffusion Models to Fine-grained Criteria by Decoupling Attributes

Chenye Meng, Zejian Li, Zhongni Liu, Yize Li, Changle Xie, Kaixin Jia, Ling Yang, Huanghuang Deng, Shiying Ding, Shengyuan Zhang, Jiayi Li, Lingyun Sun

PDF

Open Access

TL;DR

This paper introduces a hierarchical, fine-grained evaluation framework and a two-stage alignment method for diffusion models, enabling them to better match complex human expertise and detailed criteria in image generation.

Contribution

It proposes a novel hierarchical criteria and a two-stage alignment framework, including Complex Preference Optimization, for aligning diffusion models with fine-grained, non-binary human preferences.

Findings

01

CPO improves image quality and alignment with expert criteria

02

Hierarchical criteria enable more nuanced model evaluation

03

Framework applicable to complex, real-world image generation tasks

Abstract

Post-training alignment of diffusion models relies on simplified signals, such as scalar rewards or binary preferences. This limits alignment with complex human expertise, which is hierarchical and fine-grained. To address this, we first construct a hierarchical, fine-grained evaluation criteria with domain experts, which decomposes image quality into multiple positive and negative attributes organized in a tree structure. Building on this, we propose a two-stage alignment framework. First, we inject domain knowledge to an auxiliary diffusion model via Supervised Fine-Tuning. Second, we introduce Complex Preference Optimization (CPO) that extends DPO to align the target diffusion to our non-binary, hierarchical criteria. Specifically, we reformulate the alignment problem to simultaneously maximize the probability of positive attributes while minimizing the probability of negative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Aesthetic Perception and Analysis