Denoising after Entropy-based Debiasing A Robust Training Method for   Dataset Bias with Noisy Labels

Sumyeong Ahn; Se-Young Yun

arXiv:2212.01189·cs.LG·December 5, 2022

Denoising after Entropy-based Debiasing A Robust Training Method for Dataset Bias with Noisy Labels

Sumyeong Ahn, Se-Young Yun

PDF

Open Access 1 Video

TL;DR

This paper introduces DENEB, a novel training method that combines entropy-based debiasing with a post-process denoising step to improve model robustness against dataset bias and noisy labels.

Contribution

The proposed DENEB method effectively integrates debiasing and denoising stages, addressing limitations of previous approaches affected by noisy labels and improving generalization.

Findings

01

Outperforms existing debiasing methods on multiple benchmarks.

02

Effectively handles noisy labels during training.

03

Improves model generalization and robustness.

Abstract

Improperly constructed datasets can result in inaccurate inferences. For instance, models trained on biased datasets perform poorly in terms of generalization (i.e., dataset bias). Recent debiasing techniques have successfully achieved generalization performance by underestimating easy-to-learn samples (i.e., bias-aligned samples) and highlighting difficult-to-learn samples (i.e., bias-conflicting samples). However, these techniques may fail owing to noisy labels, because the trained model recognizes noisy labels as difficult-to-learn and thus highlights them. In this study, we find that earlier approaches that used the provided labels to quantify difficulty could be affected by the small proportion of noisy labels. Furthermore, we find that running denoising algorithms before debiasing is ineffective because denoising algorithms reduce the impact of difficult-to-learn samples,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Denoising after Entropy-based Debiasing A Robust Training Method for Dataset Bias with Noisy Labels· underline

Taxonomy

TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications · Explainable Artificial Intelligence (XAI)

Methodsfail