Robust Training under Label Noise by Over-parameterization
Sheng Liu, Zhihui Zhu, Qing Qu, Chong You

TL;DR
This paper introduces a method for robustly training over-parameterized deep networks in the presence of label noise by modeling noise as a sparse component and leveraging implicit regularization, achieving state-of-the-art results.
Contribution
It proposes a novel approach that models label noise as a sparse over-parameterization term, enabling separation of noise from clean data in over-parameterized networks.
Findings
Achieves state-of-the-art accuracy under label noise.
Theoretical proof of noise separation in simplified linear models.
Effective on real datasets with corrupted labels.
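The noise-separation finding for simplified linear models can be illustrated with a small numerical sketch. It assumes a linear model y = Xθ + s in which the sparse corruption is written as s = u ⊙ u − v ⊙ v and all parameters are trained by plain gradient descent from a small initialization. This parameterization, the function name `fit_linear_sop`, the step size, the initialization scale, and the toy data are illustrative assumptions, not details taken from this summary, and the sketch makes no claim of exact recovery.

```python
import numpy as np

def fit_linear_sop(X, y, lr=0.01, alpha=1e-3, steps=2000):
    """Gradient descent on 0.5 * ||X @ theta + u*u - v*v - y||^2.

    Hypothetical helper: u and v over-parameterize a per-sample noise term,
    and the small initialization `alpha` is assumed to bias it toward sparsity.
    """
    n, d = X.shape
    theta = np.zeros(d)
    u = np.full(n, alpha)
    v = np.full(n, alpha)
    losses = []
    for _ in range(steps):
        s = u * u - v * v                  # current noise estimate
        r = X @ theta + s - y              # residual
        losses.append(0.5 * float(r @ r))
        g_theta = X.T @ r                  # gradient w.r.t. theta
        g_u = 2.0 * u * r                  # gradient w.r.t. u
        g_v = -2.0 * v * r                 # gradient w.r.t. v
        theta -= lr * g_theta
        u -= lr * g_u
        v -= lr * g_v
    return theta, u * u - v * v, losses

# Toy data (assumed): 20 samples, 3 features, 2 corrupted targets.
rng = np.random.default_rng(0)
X = rng.standard_normal((20, 3))
theta_true = np.array([1.0, -2.0, 0.5])
s_true = np.zeros(20)
s_true[[3, 11]] = [3.0, -3.0]
y = X @ theta_true + s_true

theta_hat, s_hat, losses = fit_linear_sop(X, y)
```

Comparing `s_hat` with `s_true` and `theta_hat` with `theta_true` shows how much of the corruption the extra term absorbs on this toy problem. The result depends on the step size, the initialization scale, and the number of steps.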
Abstract
Recently, over-parameterized deep networks, with increasingly more network parameters than training samples, have dominated the performance of modern machine learning. However, when the training data is corrupted, it is well known that over-parameterized networks tend to overfit and do not generalize. In this work, we propose a principled approach for robust training of over-parameterized deep networks in classification tasks where a proportion of training labels are corrupted. The main idea is simple: label noise is sparse and incoherent with the network learned from clean data, so we model the noise and learn to separate it from the data. Specifically, we model the label noise via another sparse over-parameterization term, and exploit implicit algorithmic regularization to recover and separate the underlying corruptions. Remarkably, when trained using such a simple…
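A minimal sketch of how a per-sample sparse noise term could enter a classification loss is given below. It assumes the noise for each sample is parameterized as s = u ⊙ u ⊙ y − v ⊙ v ⊙ (1 − y), where y is the one-hot (possibly corrupted) label, and that s is added to the softmax output before the cross-entropy is taken. The exact form, the clipping and renormalization step, and the names `softmax` and `sop_loss` are assumptions made for illustration; the abstract above does not specify them.

```python
import numpy as np

def softmax(z):
    """Row-wise softmax with the usual max-subtraction for stability."""
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def sop_loss(logits, y_onehot, u, v, eps=1e-8):
    """Cross-entropy on predictions shifted by a per-sample noise term.

    Assumed form: s = u*u*y - v*v*(1 - y), so s is non-negative at the
    given label and non-positive at every other class.
    """
    s = u * u * y_onehot - v * v * (1.0 - y_onehot)
    p = softmax(logits) + s
    p = np.clip(p, eps, None)               # keep entries positive (assumption)
    p = p / p.sum(axis=1, keepdims=True)    # renormalize rows to sum to one
    loss = -(y_onehot * np.log(p)).sum(axis=1).mean()
    return loss, s

# Toy batch (assumed): 2 samples, 3 classes.
logits = np.array([[2.0, 0.0, -1.0],
                   [0.5, 0.5, 0.0]])
y_onehot = np.array([[0.0, 1.0, 0.0],
                     [1.0, 0.0, 0.0]])

zeros = np.zeros_like(logits)
loss_plain, _ = sop_loss(logits, y_onehot, zeros, zeros)

u = np.full_like(logits, 0.5)
v = np.full_like(logits, 0.3)
loss_sop, s = sop_loss(logits, y_onehot, u, v)
```

With u and v at zero the function reduces to ordinary cross-entropy. In training, u and v would be learned alongside the network weights so that the noise term, rather than the network, can account for labels the network disagrees with.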
Taxonomy
Topics: Machine Learning and Data Classification · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
