Intrinsic Bias is Predicted by Pretraining Data and Correlates with Downstream Performance in Vision-Language Encoders
Kshitish Ghate, Isaac Slaughter, Kyra Wilson, Mona Diab, Aylin Caliskan

TL;DR
This study comprehensively analyzes how pretraining data influences intrinsic social biases in CLIP vision-language models and finds that dataset choice significantly impacts bias and correlates with downstream performance.
Contribution
It provides the largest analysis to date linking upstream pretraining factors, especially dataset selection, to intrinsic biases and downstream performance in CLIP models.
Findings
Pretraining dataset choice is the most significant predictor of bias.
Datasets with filtering for performance tend to increase bias.
Intrinsic bias correlates strongly with downstream task performance.
Abstract
While recent work has found that vision-language models trained under the Contrastive Language Image Pre-training (CLIP) framework contain intrinsic social biases, the extent to which different upstream pre-training features of the framework relate to these biases, and hence how intrinsic bias and downstream performance are connected has been unclear. In this work, we present the largest comprehensive analysis to-date of how the upstream pre-training factors and downstream performance of CLIP models relate to their intrinsic biases. Studying 131 unique CLIP models, trained on 26 datasets, using 55 architectures, and in a variety of sizes, we evaluate bias in each model using 26 well-established unimodal and cross-modal principled Embedding Association Tests. We find that the choice of pre-training dataset is the most significant upstream predictor of bias, whereas architectural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Multimodal Machine Learning Applications · Advanced Neural Network Applications
MethodsContrastive Language-Image Pre-training
