Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via   Lightweight Value Optimization

Xingqi Wang; Xiaoyuan Yi; Xing Xie; Jia Jia

arXiv:2410.12700·cs.CV·October 17, 2024

Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization

Xingqi Wang, Xiaoyuan Yi, Xing Xie, Jia Jia

PDF

1 Repo

TL;DR

This paper introduces LiVO, a lightweight method for aligning text-to-image models with human values by optimizing a plug-and-play value encoder, reducing harmful outputs and improving ethical content generation.

Contribution

LiVO is a novel, lightweight approach that aligns T2I models with human values without extensive model fine-tuning, using a preference optimization loss and a large preference dataset.

Findings

01

LiVO significantly reduces harmful and biased outputs.

02

LiVO achieves faster convergence compared to baselines.

03

LiVO effectively balances image quality and ethical alignment.

Abstract

Recent advancements in diffusion models trained on large-scale data have enabled the generation of indistinguishable human-level images, yet they often produce harmful content misaligned with human values, e.g., social bias, and offensive content. Despite extensive research on Large Language Models (LLMs), the challenge of Text-to-Image (T2I) model alignment remains largely unexplored. Addressing this problem, we propose LiVO (Lightweight Value Optimization), a novel lightweight method for aligning T2I models with human values. LiVO only optimizes a plug-and-play value encoder to integrate a specified value principle with the input prompt, allowing the control of generated images over both semantics and values. Specifically, we design a diffusion model-tailored preference optimization loss, which theoretically approximates the Bradley-Terry model used in LLM alignment but provides a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

achernarwang/LiVO
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDiffusion