Freeze and Reveal: Exposing Modality Bias in Vision-Language Models

Vivek Hruday Kavuri; Vysishtya Karanam; Venkata Jahnavi Venkamsetty; Kriti Madumadukala; Lakshmipathi Balaji Darur; Ponnurangam Kumaraguru

arXiv:2508.07432·cs.CV·August 12, 2025

Freeze and Reveal: Exposing Modality Bias in Vision-Language Models

Vivek Hruday Kavuri, Vysishtya Karanam, Venkata Jahnavi Venkamsetty, Kriti Madumadukala, Lakshmipathi Balaji Darur, Ponnurangam Kumaraguru

PDF

Open Access

TL;DR

This paper investigates the sources of gender bias in vision-language models, proposing novel metrics and methods to reduce bias efficiently by targeting vision or text components, thereby improving fairness and accuracy.

Contribution

It introduces a new metric, Degree of Stereotypicality, and a debiasing method, DAUDoS, to effectively mitigate bias with minimal data and computational resources.

Findings

01

CDA reduces gender bias gap by 6%

02

DAUDoS reduces bias gap by 3% using less data

03

Vision encoder (CLIP) is more biased than text encoder (PaliGemma2)

Abstract

Vision Language Models achieve impressive multi-modal performance but often inherit gender biases from their training data. This bias might be coming from both the vision and text modalities. In this work, we dissect the contributions of vision and text backbones to these biases by applying targeted debiasing using Counterfactual Data Augmentation and Task Vector methods. Inspired by data-efficient approaches in hate-speech classification, we introduce a novel metric, Degree of Stereotypicality and a corresponding debiasing method, Data Augmentation Using Degree of Stereotypicality - DAUDoS, to reduce bias with minimal computational cost. We curate a gender annotated dataset and evaluate all methods on VisoGender benchmark to quantify improvements and identify dominant source of bias. Our results show that CDA reduces the gender gap by 6% and DAUDoS by 3% but using only one-third of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Hate Speech and Cyberbullying Detection · Topic Modeling