Improving group robustness under noisy labels using predictive uncertainty
Dongpin Oh, Dae Lee, Jeunghyun Byun, and Bonggun Shin

TL;DR
This paper introduces a novel entropy-based framework that leverages predictive uncertainty to identify spurious-cue-free samples, improving group robustness in noisy label settings.
Contribution
The paper proposes the END framework, which uses predictive uncertainty to identify and oversample SCF samples, enhancing robustness against noisy labels and spurious correlations.
Findings
END outperforms baseline methods on real-world benchmarks.
Predictive uncertainty effectively identifies SCF samples.
Theoretical analysis links high-uncertainty to spurious-cue-free samples.
Abstract
The standard empirical risk minimization (ERM) can underperform on certain minority groups (i.e., waterbirds in lands or landbirds in water) due to the spurious correlation between the input and its label. Several studies have improved the worst-group accuracy by focusing on the high-loss samples. The hypothesis behind this is that such high-loss samples are \textit{spurious-cue-free} (SCF) samples. However, these approaches can be problematic since the high-loss samples may also be samples with noisy labels in the real-world scenarios. To resolve this issue, we utilize the predictive uncertainty of a model to improve the worst-group accuracy under noisy labels. To motivate this, we theoretically show that the high-uncertainty samples are the SCF samples in the binary classification problem. This theoretical result implies that the predictive uncertainty is an adequate indicator to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications
