Intrinsic Bias is Predicted by Pretraining Data and Correlates with Downstream Performance in Vision-Language Encoders

Kshitish Ghate; Isaac Slaughter; Kyra Wilson; Mona Diab; Aylin Caliskan

arXiv:2502.07957·cs.AI·June 11, 2025

Intrinsic Bias is Predicted by Pretraining Data and Correlates with Downstream Performance in Vision-Language Encoders

Kshitish Ghate, Isaac Slaughter, Kyra Wilson, Mona Diab, Aylin Caliskan

PDF

Open Access 1 Repo 1 Video

TL;DR

This study comprehensively analyzes how pretraining data influences intrinsic social biases in CLIP vision-language models and finds that dataset choice significantly impacts bias and correlates with downstream performance.

Contribution

It provides the largest analysis to date linking upstream pretraining factors, especially dataset selection, to intrinsic biases and downstream performance in CLIP models.

Findings

01

Pretraining dataset choice is the most significant predictor of bias.

02

Datasets with filtering for performance tend to increase bias.

03

Intrinsic bias correlates strongly with downstream task performance.

Abstract

While recent work has found that vision-language models trained under the Contrastive Language Image Pre-training (CLIP) framework contain intrinsic social biases, the extent to which different upstream pre-training features of the framework relate to these biases, and hence how intrinsic bias and downstream performance are connected has been unclear. In this work, we present the largest comprehensive analysis to-date of how the upstream pre-training factors and downstream performance of CLIP models relate to their intrinsic biases. Studying 131 unique CLIP models, trained on 26 datasets, using 55 architectures, and in a variety of sizes, we evaluate bias in each model using 26 well-established unimodal and cross-modal principled Embedding Association Tests. We find that the choice of pre-training dataset is the most significant upstream predictor of bias, whereas architectural…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kshitishghate/CLIP_bias
pytorchOfficial

Videos

Intrinsic Bias is Predicted by Pretraining Data and Correlates with Downstream Performance in Vision-Language Encoders· underline

Taxonomy

TopicsMachine Learning and Data Classification · Multimodal Machine Learning Applications · Advanced Neural Network Applications

MethodsContrastive Language-Image Pre-training