Variation of Gender Biases in Visual Recognition Models Before and After Finetuning
Jaspreet Ranjit, Tianlu Wang, Baishakhi Ray, Vicente Ordonez

TL;DR
This paper presents a framework to analyze how societal biases in large-scale visual recognition models evolve through fine-tuning, revealing that pretrained biases often persist and can transfer or intensify after adaptation.
Contribution
The study introduces a novel framework for measuring bias changes before and after fine-tuning, highlighting the impact of training data and objectives on bias transfer and amplification.
Findings
Supervised models on large datasets tend to retain pretraining biases.
Finetuning on larger datasets can introduce new biases.
Biases can transfer and be amplified during finetuning.
Abstract
We introduce a framework to measure how biases change before and after fine-tuning a large scale visual recognition model for a downstream task. Deep learning models trained on increasing amounts of data are known to encode societal biases. Many computer vision systems today rely on models typically pretrained on large scale datasets. While bias mitigation techniques have been developed for tuning models for downstream tasks, it is currently unclear what are the effects of biases already encoded in a pretrained model. Our framework incorporates sets of canonical images representing individual and pairs of concepts to highlight changes in biases for an array of off-the-shelf pretrained models across model sizes, dataset sizes, and training objectives. Through our analyses, we find that (1) supervised models trained on datasets such as ImageNet-21k are more likely to retain their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
