Learning with Feature-Dependent Label Noise: A Progressive Approach
Yikai Zhang, Songzhu Zheng, Pengxiang Wu, Mayank Goswami, Chao Chen

TL;DR
This paper introduces a progressive label correction method for feature-dependent label noise, providing theoretical guarantees and demonstrating superior robustness and accuracy over existing approaches in real-world noisy datasets.
Contribution
The paper proposes a novel iterative label correction algorithm tailored for feature-dependent noise, with proven convergence guarantees and improved empirical performance.
Findings
Outperforms state-of-the-art baselines in noisy datasets
Proves convergence to the Bayes classifier under various noise patterns
Demonstrates robustness across different noise levels and types
Abstract
Label noise is frequently observed in real-world large-scale datasets. The noise is introduced due to a variety of reasons; it is heterogeneous and feature-dependent. Most existing approaches to handling noisy labels fall into two categories: they either assume an ideal feature-independent noise, or remain heuristic without theoretical guarantees. In this paper, we propose to target a new family of feature-dependent label noise, which is much more general than commonly used i.i.d. label noise and encompasses a broad spectrum of noise patterns. Focusing on this general noise family, we propose a progressive label correction algorithm that iteratively corrects labels and refines the model. We provide theoretical guarantees showing that for a wide variety of (unknown) noise patterns, a classifier trained with this strategy converges to be consistent with the Bayes classifier. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Text and Document Classification Technologies · Imbalanced Data Classification Techniques
