Aligning Diffusion Models by Optimizing Human Utility

Shufan Li; Konstantinos Kallidromitis; Akash Gokul; Yusuke Kato; Kazuki Kozuka

arXiv:2404.04465 · cs.CV · October 15, 2024 · 1 citation



TL;DR

Diffusion-KTO is a new method for aligning text-to-image diffusion models by maximizing expected human utility using simple binary feedback, improving alignment without complex reward models.

Contribution

It introduces an alignment approach that uses per-image binary feedback signals, avoiding the need for costly pairwise preference data or reward model training.

Findings

01

Outperforms supervised fine-tuning and Diffusion-DPO in both human judgment and automatic metrics such as PickScore and ImageReward.

02

Requires only per-image binary feedback signals (e.g. likes or dislikes) for alignment.

03

Enhances the applicability of aligning diffusion models with human preferences.

Abstract

We present Diffusion-KTO, a novel approach for aligning text-to-image diffusion models by formulating the alignment objective as the maximization of expected human utility. Since this objective applies to each generation independently, Diffusion-KTO does not require collecting costly pairwise preference data nor training a complex reward model. Instead, our objective requires simple per-image binary feedback signals, e.g. likes or dislikes, which are abundantly available. After fine-tuning using Diffusion-KTO, text-to-image diffusion models exhibit superior performance compared to existing techniques, including supervised fine-tuning and Diffusion-DPO, both in terms of human judgment and automatic evaluation metrics such as PickScore and ImageReward. Overall, Diffusion-KTO unlocks the potential of leveraging readily available per-image binary signals and broadens the applicability of…
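
The idea of scoring each generation independently from a binary label can be illustrated with a short sketch. This is not the authors' implementation: the function name `binary_feedback_utility_loss`, the sigmoid utility, the `beta` and `baseline` parameters, and the use of a denoising-error gap against a frozen reference model as the implicit reward are all assumptions made for illustration. The only point it shows is that each sample contributes its own term, so no pairing of images is needed.

```python
import numpy as np

def sigmoid(z):
    """Logistic function, used here as an assumed utility shape."""
    return 1.0 / (1.0 + np.exp(-z))

def binary_feedback_utility_loss(err_model, err_ref, liked, beta=1.0, baseline=0.0):
    """Negative mean utility over independently labeled samples.

    err_model, err_ref: per-sample denoising errors of the fine-tuned and
        frozen reference models (hypothetical inputs for this sketch).
    liked: per-sample binary feedback (True = like, False = dislike).
    """
    err_model = np.asarray(err_model, dtype=float)
    err_ref = np.asarray(err_ref, dtype=float)
    # +1 for liked samples, -1 for disliked samples.
    sign = np.where(np.asarray(liked, dtype=bool), 1.0, -1.0)
    # Assumed implicit reward: positive when the model denoises the sample
    # better than the reference does.
    implicit_reward = -beta * (err_model - err_ref)
    # Utility rises with reward for liked samples and falls for disliked ones.
    utility = sigmoid(sign * (implicit_reward - baseline))
    # Minimizing this loss maximizes the expected utility.
    return float(-utility.mean())

# When the model matches the reference, every sample has utility 0.5.
print(binary_feedback_utility_loss([1.0, 2.0], [1.0, 2.0], [True, False]))  # -0.5
```

Under these assumptions, lowering the denoising error on a liked image decreases the loss, while lowering it on a disliked image increases the loss, and each term depends on a single image and its label.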


Code & Models

Repositories

Models

🤗 jacklishufan/diffusion-kto · model · ♡ 1

Videos

Aligning Diffusion Models by Optimizing Human Utility · SlidesLive

Taxonomy

Topics: Multimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis · Cell Image Analysis Techniques

Methods: Diffusion