PC-Diffusion: Aligning Diffusion Models with Human Preferences via Preference Classifier
Shaomeng Wang, He Wang, Xiaolu Wei, Longquan Dai, Jinhui Tang

TL;DR
PC-Diffusion introduces a lightweight preference classifier to align diffusion models with human preferences, reducing training costs and improving stability without fine-tuning the entire model or relying on reference models.
Contribution
It proposes a novel, decoupled preference alignment framework using a trainable classifier, addressing limitations of existing DPO methods in diffusion models.
Findings
Achieves preference alignment comparable to DPO
Reduces training costs significantly
Enables stable, preference-guided generation
Abstract
Diffusion models have achieved remarkable success in conditional image generation, yet their outputs often remain misaligned with human preferences. To address this, recent work has applied Direct Preference Optimization (DPO) to diffusion models, yielding significant improvements.~However, DPO-like methods exhibit two key limitations: 1) High computational cost,due to the entire model fine-tuning; 2) Sensitivity to reference model quality}, due to its tendency to introduce instability and bias. To overcome these limitations, we propose a novel framework for human preference alignment in diffusion models (PC-Diffusion), using a lightweight, trainable Preference Classifier that directly models the relative preference between samples. By restricting preference learning to this classifier, PC-Diffusion decouples preference alignment from the generative model, eliminating the need for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)
