Bias Analysis in Unconditional Image Generative Models

Xiaofeng Zhang; Michelle Lin; Simon Lacoste-Julien; Aaron Courville; Yash Goyal

arXiv:2506.09106·cs.CV·June 12, 2025

Bias Analysis in Unconditional Image Generative Models

Xiaofeng Zhang, Michelle Lin, Simon Lacoste-Julien, Aaron Courville, Yash Goyal

PDF

Open Access 3 Reviews

TL;DR

This paper investigates bias in unconditional image generative models, revealing small attribute shifts influenced by classifier sensitivity and emphasizing the need for better evaluation practices to address social complexity.

Contribution

It provides an empirical analysis of bias shifts in generative models and highlights the impact of classifier decision boundaries on bias measurement.

Findings

01

Attribute shifts are generally small in unconditional models.

02

Classifier decision boundaries significantly influence bias detection.

03

Bias evaluation is sensitive to attribute spectrum and labeling practices.

Abstract

The widespread adoption of generative AI models has raised growing concerns about representational harm and potential discriminatory outcomes. Yet, despite growing literature on this topic, the mechanisms by which bias emerges - especially in unconditional generation - remain disentangled. We define the bias of an attribute as the difference between the probability of its presence in the observed distribution and its expected proportion in an ideal reference distribution. In our analysis, we train a set of unconditional image generative models and adopt a commonly used bias evaluation framework to study bias shift between training and generated distributions. Our experiments reveal that the detected attribute shifts are small. We find that the attribute shifts are sensitive to the attribute classifier used to label generated images in the evaluation framework, particularly when its…

Peer Reviews

Decision·Submitted to ICLR 2025

Reviewer 01Rating 3Confidence 3

Strengths

1. Problem Definition: The paper focuses specifically on studying the inductive bias of generative models themselves, avoiding other factors such as dataset bias and prompts, providing a novel perspective for analyzing bias sources. 2. Methodology: Proposes a standardized bias evaluation framework that uses the same classifier for all label predictions, ensuring consistency in evaluation. 3. Writing: The paper is well-structured and explains complex concepts in an understandable way.

Weaknesses

1. Bias Definition: The paper's definition of bias, which only measures differences in attribute occurrence probabilities, may be overly simplistic. Bias typically encompasses more complex dimensions including social prejudices and systemic discrimination. Oversimplified Metrics: The Average Bias Shift (ABS) metric may be too reductive as it: 2. ABS doesn't consider correlations between attributes and ignores the varying social impact weights of different attributes 3. The 0.01 threshold for s

Reviewer 02Rating 6Confidence 4

Strengths

1. This work presents a bias evaluation framework for unconditional image generative models. 2. The authors proposed two taxonomies for categorizing bias shifts for different attributes. 3. The authors experimented with different sizes of diffusion models to observe how bias shift is happening.

Weaknesses

1. As this paper presents a bias evaluation framework for image dataset, it needs to be compared with other evaluation framework, i.e. compare with [1]. How is the presented framework differ with the [1]? 2. Limitations of this evaluation framework should be discussed in the paper. #### References: [1] Wang, Angelina, et al. "REVISE: A tool for measuring and mitigating bias in visual datasets." _International Journal of Computer Vision_ 130.7 (2022): 1790-1810.

Reviewer 03Rating 5Confidence 4

Strengths

In general, the analysis of the bias shift of different generative models is interesting, and the pipeline's high-level idea seems to be sound. The subjective/non-subjective study is also interesting. The paper includes a vast amount of empirical results for the analysis.

Weaknesses

1. Some of the discussion in the method section (sec. 3) seems to be redundant or not tied to the paper. For example, what is the purpose of introducing $P^{ideal}$? Although it is canceled in the final equation, I don't think it is necessary to introduce such a term because, intuitively, $|{P^{gen} - P^{val}}|$ itself is sufficient to measure the bias shift. Introducing extra and probably unnecessary assumptions may overcomplicate the method and lead to confusion. Also, Section 3.1 also introdu

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Psychology of Moral and Emotional Judgment · Explainable Artificial Intelligence (XAI)