BendVLM: Test-Time Debiasing of Vision-Language Embeddings

Walter Gerych; Haoran Zhang; Kimia Hamidieh; Eileen Pan; Maanas Sharma; Thomas Hartvigsen; Marzyeh Ghassemi

arXiv:2411.04420·cs.CV·November 8, 2024

TL;DR

BendVLM introduces a nonlinear, input-specific debiasing method for vision-language model embeddings that avoids fine-tuning and is suitable for online, open-set applications.

Contribution

The paper presents BendVLM, a fine-tuning-free approach that tailors the debiasing of VLM embeddings to each input, rather than assuming that a single linear direction explains the bias for all inputs, as existing fine-tuning-free methods typically do.

Findings

01 Effective reduction of societal biases in VLM embeddings.

02 Applicable to online, open-set tasks without prior input knowledge.

03 Outperforms linear debiasing methods in flexibility and effectiveness.

Abstract

Vision-language model (VLM) embeddings have been shown to encode biases present in their training data, such as societal biases that prescribe negative characteristics to members of various racial and gender identities. VLMs are being quickly adopted for a variety of tasks ranging from few-shot classification to text-guided image generation, making debiasing VLM embeddings crucial. Debiasing approaches that fine-tune the VLM often suffer from catastrophic forgetting. On the other hand, fine-tuning-free methods typically utilize a "one-size-fits-all" approach that assumes that correlation with the spurious attribute can be explained using a single linear direction across all possible inputs. In this work, we propose Bend-VLM, a nonlinear, fine-tuning-free approach for VLM embedding debiasing that tailors the debiasing operation to each unique input. This allows for a more flexible…
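
To make the contrast in the abstract concrete, the following is a minimal sketch of the "one-size-fits-all" linear baseline it describes: a single bias direction is estimated once and projected out of every embedding. This is not BendVLM's method, which the abstract says is nonlinear and input-specific. The toy vectors and the names `project_out`, `male_prompt`, `female_prompt`, and `bias_direction` are hypothetical and chosen only for illustration.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def normalize(v):
    """Scale a vector to unit length (assumes a nonzero vector)."""
    norm = math.sqrt(dot(v, v))
    return [a / norm for a in v]


def project_out(embedding, direction):
    """Remove the component of `embedding` along `direction`, then re-normalize.

    This is the generic orthogonal-projection baseline: the same direction
    is removed from every input, regardless of what the input is.
    """
    d = normalize(direction)
    coeff = dot(embedding, d)
    residual = [e - coeff * di for e, di in zip(embedding, d)]
    return normalize(residual)


# Hypothetical toy "text embeddings" for two attribute prompts.
male_prompt = normalize([0.9, 0.1, 0.2])
female_prompt = normalize([-0.9, 0.1, 0.2])

# A single global bias direction, estimated once (assumption for illustration).
bias_direction = [m - f for m, f in zip(male_prompt, female_prompt)]

# Hypothetical query embedding to debias.
query = normalize([0.5, 0.7, 0.1])
debiased = project_out(query, bias_direction)

# After projection, the query is equally similar to both attribute prompts.
sim_male = dot(debiased, male_prompt)
sim_female = dot(debiased, female_prompt)
```

Because the same `bias_direction` is applied to every query, this baseline cannot adapt when the spurious correlation differs from one input to the next, which is the limitation the abstract attributes to linear fine-tuning-free methods.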

Code & Models

Repositories

waltergerych/bend_vlm (PyTorch, official)

Taxonomy

Topics: Multimodal Machine Learning Applications · Natural Language Processing Techniques · Topic Modeling

Methods: Sparse Evolutionary Training