Bias Begets Bias: The Impact of Biased Embeddings on Diffusion Models

Sahil Kuchlous; Marvin Li; Jeffrey G. Wang

arXiv:2409.09569·cs.LG·September 17, 2024

Bias Begets Bias: The Impact of Biased Embeddings on Diffusion Models

Sahil Kuchlous, Marvin Li, Jeffrey G. Wang

PDF

Open Access

TL;DR

This paper investigates how biased embedding spaces influence the fairness and evaluation of diffusion models in Text-to-Image systems, proposing fairness criteria, analyzing bias impacts, and suggesting mitigation strategies.

Contribution

It introduces statistical fairness criteria based on internal representations, demonstrates the necessity of unbiased embeddings for balanced diffusion models, and develops a framework for bias mitigation in evaluation.

Findings

01

Biased embeddings lead to unfair image generation and lower alignment scores.

02

Unbiased embedding spaces are necessary for fair and diverse diffusion outputs.

03

Proposed bias mitigation methods improve fairness in diffusion model evaluations.

Abstract

With the growing adoption of Text-to-Image (TTI) systems, the social biases of these models have come under increased scrutiny. Herein we conduct a systematic investigation of one such source of bias for diffusion models: embedding spaces. First, because traditional classifier-based fairness definitions require true labels not present in generative modeling, we propose statistical group fairness criteria based on a model's internal representation of the world. Using these definitions, we demonstrate theoretically and empirically that an unbiased text embedding space for input prompts is a necessary condition for representationally balanced diffusion models, meaning the distribution of generated images satisfy diversity requirements with respect to protected attributes. Next, we investigate the impact of biased embeddings on evaluating the alignment between generated images and prompts,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIncome, Poverty, and Inequality · Advanced Causal Inference Techniques

MethodsDiffusion · Contrastive Language-Image Pre-training