Classification with Noisy Labels by Importance Reweighting
Tongliang Liu, Dacheng Tao

TL;DR
This paper introduces an importance reweighting approach for classification tasks with noisy labels, ensuring consistency and providing a method to estimate the noise rate, validated by experiments on synthetic and real data.
Contribution
It demonstrates that any surrogate loss can be used with importance reweighting for noisy labels and offers a practical way to estimate the noise rate from data.
Findings
Importance reweighting guarantees consistency with noisy labels.
The noise rate can be effectively estimated from data.
Experimental results confirm the method's efficiency.
Abstract
In this paper, we study a classification problem in which sample labels are randomly corrupted. In this scenario, there is an unobservable sample with noise-free labels. However, before being observed, the true labels are independently flipped with a probability , and the random label noise can be class-conditional. Here, we address two fundamental problems raised by this scenario. The first is how to best use the abundant surrogate loss functions designed for the traditional classification problem when there is label noise. We prove that any surrogate loss function can be used for classification with noisy labels by using importance reweighting, with consistency assurance that the label noise does not ultimately hinder the search for the optimal classifier of the noise-free sample. The other is the open problem of how to obtain the noise rate . We show that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
