Learning with Feature-Dependent Label Noise: A Progressive Approach

Yikai Zhang; Songzhu Zheng; Pengxiang Wu; Mayank Goswami; Chao Chen

arXiv:2103.07756·cs.LG·March 30, 2021·32 cites

Learning with Feature-Dependent Label Noise: A Progressive Approach

Yikai Zhang, Songzhu Zheng, Pengxiang Wu, Mayank Goswami, Chao Chen

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a progressive label correction method for feature-dependent label noise, providing theoretical guarantees and demonstrating superior robustness and accuracy over existing approaches in real-world noisy datasets.

Contribution

The paper proposes a novel iterative label correction algorithm tailored for feature-dependent noise, with proven convergence guarantees and improved empirical performance.

Findings

01

Outperforms state-of-the-art baselines in noisy datasets

02

Proves convergence to the Bayes classifier under various noise patterns

03

Demonstrates robustness across different noise levels and types

Abstract

Label noise is frequently observed in real-world large-scale datasets. The noise is introduced due to a variety of reasons; it is heterogeneous and feature-dependent. Most existing approaches to handling noisy labels fall into two categories: they either assume an ideal feature-independent noise, or remain heuristic without theoretical guarantees. In this paper, we propose to target a new family of feature-dependent label noise, which is much more general than commonly used i.i.d. label noise and encompasses a broad spectrum of noise patterns. Focusing on this general noise family, we propose a progressive label correction algorithm that iteratively corrects labels and refines the model. We provide theoretical guarantees showing that for a wide variety of (unknown) noise patterns, a classifier trained with this strategy converges to be consistent with the Bayes classifier. In…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pxiangwu/PLC
pytorchOfficial

Videos

Learning with Feature-Dependent Label Noise: A Progressive Approach· slideslive

Taxonomy

TopicsMachine Learning and Data Classification · Text and Document Classification Technologies · Imbalanced Data Classification Techniques