Feature and Label Embedding Spaces Matter in Addressing Image Classifier   Bias

William Thong; Cees G. M. Snoek

arXiv:2110.14336·cs.CV·October 28, 2021·6 cites

Feature and Label Embedding Spaces Matter in Addressing Image Classifier Bias

William Thong, Cees G. M. Snoek

PDF

Open Access 1 Repo

TL;DR

This paper investigates the impact of feature and label embedding spaces on image classifier bias, proposing methods to identify and mitigate bias directions to improve fairness without sacrificing accuracy.

Contribution

It introduces a novel approach to detect bias directions in feature space and mitigates bias by using label embedding spaces with projection heads, enhancing fairness in image classification.

Findings

01

Bias directions in feature space can be identified via class prototypes.

02

Mitigating bias in label embedding spaces improves fairness.

03

The proposed methods preserve classification performance while reducing bias.

Abstract

This paper strives to address image classifier bias, with a focus on both feature and label embedding spaces. Previous works have shown that spurious correlations from protected attributes, such as age, gender, or skin tone, can cause adverse decisions. To balance potential harms, there is a growing need to identify and mitigate image classifier bias. First, we identify in the feature space a bias direction. We compute class prototypes of each protected attribute value for every class, and reveal an existing subspace that captures the maximum variance of the bias. Second, we mitigate biases by mapping image inputs to label embedding spaces. Each value of the protected attribute has its projection head where classes are embedded through a latent vector representation rather than a common one-hot encoding. Once trained, we further reduce in the feature space the bias effect by removing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

twuilliam/bias-classifiers
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning