Joint Vision-Language Social Bias Removal for CLIP

Haoyu Zhang; Yangyang Guo; Mohan Kankanhalli

arXiv:2411.12785·cs.CV·November 21, 2024

Joint Vision-Language Social Bias Removal for CLIP

Haoyu Zhang, Yangyang Guo, Mohan Kankanhalli

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel framework for reducing social biases in CLIP by aligning and removing biases from both image and text modalities, while preserving vision-language alignment, and proposes a comprehensive evaluation protocol.

Contribution

It presents a new multi-modal bias mitigation method that maintains alignment and introduces an evaluation protocol for bias removal and model generalization.

Findings

01

Effective bias reduction in both image and text embeddings

02

Preserved vision-language alignment after debiasing

03

Enhanced model fairness and robustness

Abstract

Vision-Language (V-L) pre-trained models such as CLIP show prominent capabilities in various downstream tasks. Despite this promise, V-L models are notoriously limited by their inherent social biases. A typical demonstration is that V-L models often produce biased predictions against specific groups of people, significantly undermining their real-world applicability. Existing approaches endeavor to mitigate the social bias problem in V-L models by removing biased attribute information from model embeddings. However, after our revisiting of these methods, we find that their bias removal is frequently accompanied by greatly compromised V-L alignment capabilities. We then reveal that this performance degradation stems from the unbalanced debiasing in image and text embeddings. To address this issue, we propose a novel V-L debiasing framework to align image and text biases followed by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

haoyusimon/VL_Debiasing
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInterpreting and Communication in Healthcare · Natural Language Processing Techniques · Text Readability and Simplification

MethodsALIGN · Contrastive Language-Image Pre-training