Trained Models Tell Us How to Make Them Robust to Spurious Correlation without Group Annotation
Mahdi Ghaznavi, Hesam Asadollahzadeh, Fahimeh Hosseini Noohdani,, Soroush Vafaie Tabar, Hosein Hasani, Taha Akbari Alvanagh, Mohammad Hossein, Rohban, Mahdieh Soleymani Baghshah

TL;DR
This paper introduces EVaLS, a method that improves model robustness to spurious correlations without requiring group annotations, by leveraging loss-based sampling and environment inference for validation and hyperparameter tuning.
Contribution
EVaLS is a novel approach that enhances robustness to spurious correlations without relying on group labels for validation or training, using loss-based sampling and environment inference.
Findings
EVaLS achieves near-optimal worst group accuracy.
It eliminates the need for group annotations in validation.
The method is fast, simple, and effective.
Abstract
Classifiers trained with Empirical Risk Minimization (ERM) tend to rely on attributes that have high spurious correlation with the target. This can degrade the performance on underrepresented (or 'minority') groups that lack these attributes, posing significant challenges for both out-of-distribution generalization and fairness objectives. Many studies aim to enhance robustness to spurious correlation, but they sometimes depend on group annotations for training. Additionally, a common limitation in previous research is the reliance on group-annotated validation datasets for model selection. This constrains their applicability in situations where the nature of the spurious correlation is not known, or when group labels for certain spurious attributes are not available. To enhance model robustness with minimal group annotation assumptions, we propose Environment-based Validation and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Data Analysis with R · Data Mining Algorithms and Applications
