Examining Pathological Bias in a Generative Adversarial Network   Discriminator: A Case Study on a StyleGAN3 Model

Alvin Grissom II; Ryan F. Lei; Matt Gusdorff; Jeova Farias Sales Rocha; Neto; Bailey Lin; Ryan Trotter

arXiv:2402.09786·cs.CV·August 29, 2024·1 cites

Examining Pathological Bias in a Generative Adversarial Network Discriminator: A Case Study on a StyleGAN3 Model

Alvin Grissom II, Ryan F. Lei, Matt Gusdorff, Jeova Farias Sales Rocha, Neto, Bailey Lin, Ryan Trotter

PDF

Open Access

TL;DR

This paper uncovers internal biases in a StyleGAN3 discriminator that are not explained by training data, revealing systematic stratification affecting various demographic categories.

Contribution

It identifies and analyzes internal color and luminance biases in a pre-trained GAN discriminator, highlighting issues beyond training data biases.

Findings

01

Discriminator exhibits internal color and luminance biases.

02

Biases are not explained by training data.

03

Scores are stratified by demographic categories.

Abstract

Generative adversarial networks (GANs) generate photorealistic faces that are often indistinguishable by humans from real faces. While biases in machine learning models are often assumed to be due to biases in training data, we find pathological internal color and luminance biases in the discriminator of a pre-trained StyleGAN3-r model that are not explicable by the training data. We also find that the discriminator systematically stratifies scores by both image- and face-level qualities and that this disproportionately affects images across gender, race, and other categories. We examine axes common in research on stereotyping in social psychology.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning