Sebra: Debiasing Through Self-Guided Bias Ranking
Adarsh Kappiyath, Abhra Chaudhuri, Ajay Jaiswal, Ziquan Liu, Yunpeng, Li, Xiatian Zhu, Lu Yin

TL;DR
Sebra introduces an automatic, self-guided bias ranking method that mitigates spurious correlations in data by leveraging the difficulty of learning samples, improving debiasing performance without human supervision.
Contribution
The paper proposes a novel self-guided bias ranking framework that dynamically steers ERM training to learn attributes in order of increasing spuriosity, enabling unsupervised bias mitigation.
Findings
Outperforms previous unsupervised debiasing methods on multiple benchmarks.
Effectively ranks data points by spuriosity without human supervision.
Enhances bias mitigation in complex datasets like ImageNet-1K.
Abstract
Ranking samples by fine-grained estimates of spuriosity (the degree to which spurious cues are present) has recently been shown to significantly benefit bias mitigation, over the traditional binary biased-\textit{vs}-unbiased partitioning of train sets. However, this spuriosity ranking comes with the requirement of human supervision. In this paper, we propose a debiasing framework based on our novel \ul{Se}lf-Guided \ul{B}ias \ul{Ra}nking (\emph{Sebra}), that mitigates biases (spurious correlations) via an automatic ranking of data points by spuriosity within their respective classes. Sebra leverages a key local symmetry in Empirical Risk Minimization (ERM) training -- the ease of learning a sample via ERM inversely correlates with its spuriousity; the fewer spurious correlations a sample exhibits, the harder it is to learn, and vice versa. However, globally across iterations, ERM tends…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Ethics and Social Impacts of AI · Domain Adaptation and Few-Shot Learning
MethodsContrastive Learning
