Denoising after Entropy-based Debiasing A Robust Training Method for Dataset Bias with Noisy Labels
Sumyeong Ahn, Se-Young Yun

TL;DR
This paper introduces DENEB, a novel training method that combines entropy-based debiasing with a post-process denoising step to improve model robustness against dataset bias and noisy labels.
Contribution
The proposed DENEB method effectively integrates debiasing and denoising stages, addressing limitations of previous approaches affected by noisy labels and improving generalization.
Findings
Outperforms existing debiasing methods on multiple benchmarks.
Effectively handles noisy labels during training.
Improves model generalization and robustness.
Abstract
Improperly constructed datasets can result in inaccurate inferences. For instance, models trained on biased datasets perform poorly in terms of generalization (i.e., dataset bias). Recent debiasing techniques have successfully achieved generalization performance by underestimating easy-to-learn samples (i.e., bias-aligned samples) and highlighting difficult-to-learn samples (i.e., bias-conflicting samples). However, these techniques may fail owing to noisy labels, because the trained model recognizes noisy labels as difficult-to-learn and thus highlights them. In this study, we find that earlier approaches that used the provided labels to quantify difficulty could be affected by the small proportion of noisy labels. Furthermore, we find that running denoising algorithms before debiasing is ineffective because denoising algorithms reduce the impact of difficult-to-learn samples,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications · Explainable Artificial Intelligence (XAI)
Methodsfail
