Do Existing Testing Tools Really Uncover Gender Bias in Text-to-Image Models?
Yunbo Lyu, Zhou Yang, Yuqing Niu, Jing Jiang, David Lo

TL;DR
This study evaluates existing gender bias detectors in text-to-image models by comparing their outputs with manual annotations, revealing significant discrepancies and limitations, especially with low-quality images, and proposing improvements.
Contribution
It provides a comprehensive validation of gender bias detectors against manually labeled data, highlighting their inaccuracies and proposing an enhanced detection method.
Findings
All three T2I models show a bias towards male images.
Detectors often overestimate bias, sometimes by up to 26.95%.
Low-quality images hinder accurate bias detection.
Abstract
Text-to-Image (T2I) models have recently gained significant attention due to their ability to generate high-quality images and are consequently used in a wide range of applications. However, there are concerns about the gender bias of these models. Previous studies have shown that T2I models can perpetuate or even amplify gender stereotypes when provided with neutral text prompts. Researchers have proposed automated gender bias uncovering detectors for T2I models, but a crucial gap exists: no existing work comprehensively compares the various detectors and understands how the gender bias detected by them deviates from the actual situation. This study addresses this gap by validating previous gender bias detectors using a manually labeled dataset and comparing how the bias identified by various detectors deviates from the actual bias in T2I models, as verified by manual confirmation. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods
MethodsSoftmax · Attention Is All You Need · Diffusion
