Sebra: Debiasing Through Self-Guided Bias Ranking

Adarsh Kappiyath; Abhra Chaudhuri; Ajay Jaiswal; Ziquan Liu; Yunpeng; Li; Xiatian Zhu; Lu Yin

arXiv:2501.18277·cs.LG·January 31, 2025

Sebra: Debiasing Through Self-Guided Bias Ranking

Adarsh Kappiyath, Abhra Chaudhuri, Ajay Jaiswal, Ziquan Liu, Yunpeng, Li, Xiatian Zhu, Lu Yin

PDF

Open Access 1 Repo

TL;DR

Sebra introduces an automatic, self-guided bias ranking method that mitigates spurious correlations in data by leveraging the difficulty of learning samples, improving debiasing performance without human supervision.

Contribution

The paper proposes a novel self-guided bias ranking framework that dynamically steers ERM training to learn attributes in order of increasing spuriosity, enabling unsupervised bias mitigation.

Findings

01

Outperforms previous unsupervised debiasing methods on multiple benchmarks.

02

Effectively ranks data points by spuriosity without human supervision.

03

Enhances bias mitigation in complex datasets like ImageNet-1K.

Abstract

Ranking samples by fine-grained estimates of spuriosity (the degree to which spurious cues are present) has recently been shown to significantly benefit bias mitigation, over the traditional binary biased-\textit{vs}-unbiased partitioning of train sets. However, this spuriosity ranking comes with the requirement of human supervision. In this paper, we propose a debiasing framework based on our novel \ul{Se}lf-Guided \ul{B}ias \ul{Ra}nking (\emph{Sebra}), that mitigates biases (spurious correlations) via an automatic ranking of data points by spuriosity within their respective classes. Sebra leverages a key local symmetry in Empirical Risk Minimization (ERM) training -- the ease of learning a sample via ERM inversely correlates with its spuriousity; the fewer spurious correlations a sample exhibits, the harder it is to learn, and vice versa. However, globally across iterations, ERM tends…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kadarsh22/Sebra
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMobile Crowdsensing and Crowdsourcing · Ethics and Social Impacts of AI · Domain Adaptation and Few-Shot Learning

MethodsContrastive Learning