On how to avoid exacerbating spurious correlations when models are overparameterized
Tina Behnia, Ke Wang, Christos Thrampoulidis

TL;DR
This paper investigates how overparameterized models can avoid worsening spurious correlations in imbalanced datasets, proposing a loss modification that promotes fairness towards minorities with theoretical guarantees.
Contribution
It introduces a new theoretical framework with non-asymptotic bounds for loss modifications that improve minority fairness in overparameterized models dealing with data imbalance.
Findings
VS-loss promotes fairness towards minorities with proper tuning.
Weighted CE and LA-loss can fail under strong spurious correlations.
Theoretical bounds apply to general models and extreme imbalance scenarios.
Abstract
Overparameterized models fail to generalize well in the presence of data imbalance even when combined with traditional techniques for mitigating imbalances. This paper focuses on imbalanced classification datasets, in which a small subset of the population -- a minority -- may contain features that correlate spuriously with the class label. For a parametric family of cross-entropy loss modifications and a representative Gaussian mixture model, we derive non-asymptotic generalization bounds on the worst-group error that shed light on the role of different hyper-parameters. Specifically, we prove that, when appropriately tuned, the recently proposed VS-loss learns a model that is fair towards minorities even when spurious features are strong. On the other hand, alternative heuristics, such as the weighted CE and the LA-loss, can fail dramatically. Compared to previous works, our bounds…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Spectroscopy and Chemometric Analyses · Advanced Statistical Process Monitoring
