They're All Doctors: Synthesizing Diverse Counterfactuals to Mitigate Associative Bias
Salma Abdel Magid, Jui-Hsien Wang, Kushal Kafle, Hanspeter Pfister

TL;DR
This paper introduces a method to generate synthetic counterfactual images to fine-tune CLIP, reducing biases related to human appearance and improving fairness in image retrieval without sacrificing performance.
Contribution
The authors propose a novel framework for creating diverse synthetic datasets to mitigate associative bias in CLIP, enhancing fairness while maintaining accuracy.
Findings
CLIP trained on synthetic counterfactuals reduces bias metrics by 40-66%.
The fine-tuned model retains compatibility with original CLIP.
The approach supports adjustable fairness-performance trade-offs.
Abstract
Vision Language Models (VLMs) such as CLIP are powerful models; however they can exhibit unwanted biases, making them less safe when deployed directly in applications such as text-to-image, text-to-video retrievals, reverse search, or classification tasks. In this work, we propose a novel framework to generate synthetic counterfactual images to create a diverse and balanced dataset that can be used to fine-tune CLIP. Given a set of diverse synthetic base images from text-to-image models, we leverage off-the-shelf segmentation and inpainting models to place humans with diverse visual appearances in context. We show that CLIP trained on such datasets learns to disentangle the human appearance from the context of an image, i.e., what makes a doctor is not correlated to the person's visual appearance, like skin color or body type, but to the context, such as background, the attire they are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDeception detection and forensic psychology · Decision-Making and Behavioral Economics
MethodsSparse Evolutionary Training · Inpainting · Balanced Selection · Contrastive Language-Image Pre-training
