Improving group robustness under noisy labels using predictive   uncertainty

Dongpin Oh; Dae Lee; Jeunghyun Byun; and Bonggun Shin

arXiv:2212.07026·cs.LG·December 15, 2022·1 cites

Improving group robustness under noisy labels using predictive uncertainty

Dongpin Oh, Dae Lee, Jeunghyun Byun, and Bonggun Shin

PDF

Open Access

TL;DR

This paper introduces a novel entropy-based framework that leverages predictive uncertainty to identify spurious-cue-free samples, improving group robustness in noisy label settings.

Contribution

The paper proposes the END framework, which uses predictive uncertainty to identify and oversample SCF samples, enhancing robustness against noisy labels and spurious correlations.

Findings

01

END outperforms baseline methods on real-world benchmarks.

02

Predictive uncertainty effectively identifies SCF samples.

03

Theoretical analysis links high-uncertainty to spurious-cue-free samples.

Abstract

The standard empirical risk minimization (ERM) can underperform on certain minority groups (i.e., waterbirds in lands or landbirds in water) due to the spurious correlation between the input and its label. Several studies have improved the worst-group accuracy by focusing on the high-loss samples. The hypothesis behind this is that such high-loss samples are \textit{spurious-cue-free} (SCF) samples. However, these approaches can be problematic since the high-loss samples may also be samples with noisy labels in the real-world scenarios. To resolve this issue, we utilize the predictive uncertainty of a model to improve the worst-group accuracy under noisy labels. To motivate this, we theoretically show that the high-uncertainty samples are the SCF samples in the binary classification problem. This theoretical result implies that the predictive uncertainty is an adequate indicator to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications