Bias Mimicking: A Simple Sampling Approach for Bias Mitigation
Maan Qraitem, Kate Saenko, Bryan A. Plummer

TL;DR
Bias Mimicking introduces a class-conditioned sampling method that mitigates dataset bias in visual recognition by mimicking bias distributions across classes, improving underrepresented group accuracy without overfitting.
Contribution
The paper proposes a novel class-conditioned sampling technique called Bias Mimicking that addresses shortcomings of traditional sampling methods without requiring complex modifications.
Findings
Improves underrepresented group accuracy by 3% across four benchmarks.
Maintains or improves performance compared to non-sampling methods.
Ensures exposure to entire distribution per epoch without sample repetition.
Abstract
Prior work has shown that Visual Recognition datasets frequently underrepresent bias groups (\eg Female) within class labels (\eg Programmers). This dataset bias can lead to models that learn spurious correlations between class labels and bias groups such as age, gender, or race. Most recent methods that address this problem require significant architectural changes or additional loss functions requiring more hyper-parameter tuning. Alternatively, data sampling baselines from the class imbalance literature (\eg Undersampling, Upweighting), which can often be implemented in a single line of code and often have no hyperparameters, offer a cheaper and more efficient solution. However, these methods suffer from significant shortcomings. For example, Undersampling drops a significant part of the input distribution per epoch while Oversampling repeats samples, causing overfitting. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Machine Learning and Data Classification · Anomaly Detection Techniques and Applications
MethodsNetwork On Network
