Vision-Language Models display a strong gender bias
Aiswarya Konavoor, Raj Abhijit Dandekar, Rajat Dandekar, Sreedath Panat

TL;DR
This paper investigates gender bias in vision-language models by analyzing how these models associate face images with occupation and activity descriptions, revealing subtle gender stereotypes encoded in their representations.
Contribution
It introduces a novel framework for measuring and visualizing gender bias in contrastive vision-language models using a comprehensive dataset and statistical analysis methods.
Findings
Models show significant gender-linked associations with certain occupations.
Gender bias varies across different categories of activities and roles.
The framework provides a robust way to quantify and interpret bias in VLMs.
Abstract
Vision-language models (VLM) align images and text in a shared representation space that is useful for retrieval and zero-shot transfer. Yet, this alignment can encode and amplify social stereotypes in subtle ways that are not obvious from standard accuracy metrics. In this study, we test whether the contrastive vision-language encoder exhibits gender-linked associations when it places embeddings of face images near embeddings of short phrases that describe occupations and activities. We assemble a dataset of 220 face photographs split by perceived binary gender and a set of 150 unique statements distributed across six categories covering emotional labor, cognitive labor, domestic labor, technical labor, professional roles, and physical labor. We compute unit-norm image embeddings for every face and unit-norm text embeddings for every statement, then define a statement-level association…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
