Feature and Label Embedding Spaces Matter in Addressing Image Classifier Bias
William Thong, Cees G. M. Snoek

TL;DR
This paper investigates the impact of feature and label embedding spaces on image classifier bias, proposing methods to identify and mitigate bias directions to improve fairness without sacrificing accuracy.
Contribution
It introduces a novel approach to detect bias directions in feature space and mitigates bias by using label embedding spaces with projection heads, enhancing fairness in image classification.
Findings
Bias directions in feature space can be identified via class prototypes.
Mitigating bias in label embedding spaces improves fairness.
The proposed methods preserve classification performance while reducing bias.
Abstract
This paper strives to address image classifier bias, with a focus on both feature and label embedding spaces. Previous works have shown that spurious correlations from protected attributes, such as age, gender, or skin tone, can cause adverse decisions. To balance potential harms, there is a growing need to identify and mitigate image classifier bias. First, we identify in the feature space a bias direction. We compute class prototypes of each protected attribute value for every class, and reveal an existing subspace that captures the maximum variance of the bias. Second, we mitigate biases by mapping image inputs to label embedding spaces. Each value of the protected attribute has its projection head where classes are embedded through a latent vector representation rather than a common one-hot encoding. Once trained, we further reduce in the feature space the bias effect by removing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
