They're All Doctors: Synthesizing Diverse Counterfactuals to Mitigate   Associative Bias

Salma Abdel Magid; Jui-Hsien Wang; Kushal Kafle; Hanspeter Pfister

arXiv:2406.11331·cs.CV·June 18, 2024

They're All Doctors: Synthesizing Diverse Counterfactuals to Mitigate Associative Bias

Salma Abdel Magid, Jui-Hsien Wang, Kushal Kafle, Hanspeter Pfister

PDF

Open Access

TL;DR

This paper introduces a method to generate synthetic counterfactual images to fine-tune CLIP, reducing biases related to human appearance and improving fairness in image retrieval without sacrificing performance.

Contribution

The authors propose a novel framework for creating diverse synthetic datasets to mitigate associative bias in CLIP, enhancing fairness while maintaining accuracy.

Findings

01

CLIP trained on synthetic counterfactuals reduces bias metrics by 40-66%.

02

The fine-tuned model retains compatibility with original CLIP.

03

The approach supports adjustable fairness-performance trade-offs.

Abstract

Vision Language Models (VLMs) such as CLIP are powerful models; however they can exhibit unwanted biases, making them less safe when deployed directly in applications such as text-to-image, text-to-video retrievals, reverse search, or classification tasks. In this work, we propose a novel framework to generate synthetic counterfactual images to create a diverse and balanced dataset that can be used to fine-tune CLIP. Given a set of diverse synthetic base images from text-to-image models, we leverage off-the-shelf segmentation and inpainting models to place humans with diverse visual appearances in context. We show that CLIP trained on such datasets learns to disentangle the human appearance from the context of an image, i.e., what makes a doctor is not correlated to the person's visual appearance, like skin color or body type, but to the context, such as background, the attire they are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDeception detection and forensic psychology · Decision-Making and Behavioral Economics

MethodsSparse Evolutionary Training · Inpainting · Balanced Selection · Contrastive Language-Image Pre-training