Joint Vision-Language Social Bias Removal for CLIP
Haoyu Zhang, Yangyang Guo, Mohan Kankanhalli

TL;DR
This paper introduces a novel framework for reducing social biases in CLIP by aligning and removing biases from both image and text modalities, while preserving vision-language alignment, and proposes a comprehensive evaluation protocol.
Contribution
It presents a new multi-modal bias mitigation method that maintains alignment and introduces an evaluation protocol for bias removal and model generalization.
Findings
Effective bias reduction in both image and text embeddings
Preserved vision-language alignment after debiasing
Enhanced model fairness and robustness
Abstract
Vision-Language (V-L) pre-trained models such as CLIP show prominent capabilities in various downstream tasks. Despite this promise, V-L models are notoriously limited by their inherent social biases. A typical demonstration is that V-L models often produce biased predictions against specific groups of people, significantly undermining their real-world applicability. Existing approaches endeavor to mitigate the social bias problem in V-L models by removing biased attribute information from model embeddings. However, after our revisiting of these methods, we find that their bias removal is frequently accompanied by greatly compromised V-L alignment capabilities. We then reveal that this performance degradation stems from the unbalanced debiasing in image and text embeddings. To address this issue, we propose a novel V-L debiasing framework to align image and text biases followed by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInterpreting and Communication in Healthcare · Natural Language Processing Techniques · Text Readability and Simplification
MethodsALIGN · Contrastive Language-Image Pre-training
