Class Prototype-based Cleaner for Label Noise Learning
Jingjia Huang, Yuanqi Chen, Jiashi Feng, Xinglong Wu

TL;DR
This paper introduces CPC, a class prototype-based label noise cleaner that considers class-specific loss distribution heterogeneity, improving semi-supervised noisy-label learning performance.
Contribution
The paper proposes CPC, a novel class-aware label noise cleaner that models class-specific loss distributions and enhances noise separation in semi-supervised learning.
Findings
CPC outperforms existing methods on CIFAR-10, CIFAR-100, Clothing1M, and WebVision.
CPC effectively models class-specific loss heterogeneity.
Theoretical justification via EM framework supports CPC's effectiveness.
Abstract
Semi-supervised learning based methods are current SOTA solutions to the noisy-label learning problem, which rely on learning an unsupervised label cleaner first to divide the training samples into a labeled set for clean data and an unlabeled set for noise data. Typically, the cleaner is obtained via fitting a mixture model to the distribution of per-sample training losses. However, the modeling procedure is \emph{class agnostic} and assumes the loss distributions of clean and noise samples are the same across different classes. Unfortunately, in practice, such an assumption does not always hold due to the varying learning difficulty of different classes, thus leading to sub-optimal label noise partition criteria. In this work, we reveal this long-ignored problem and propose a simple yet effective solution, named \textbf{C}lass \textbf{P}rototype-based label noise \textbf{C}leaner…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Music and Audio Processing · Text and Document Classification Technologies
