Robust Training under Label Noise by Over-parameterization

Sheng Liu; Zhihui Zhu; Qing Qu; Chong You

arXiv:2202.14026·cs.LG·August 4, 2022·29 cites

Robust Training under Label Noise by Over-parameterization

Sheng Liu, Zhihui Zhu, Qing Qu, Chong You

PDF

Open Access 1 Repo

TL;DR

This paper introduces a method for robustly training over-parameterized deep networks in the presence of label noise by modeling noise as a sparse component and leveraging implicit regularization, achieving state-of-the-art results.

Contribution

It proposes a novel approach that models label noise as a sparse over-parameterization term, enabling separation of noise from clean data in over-parameterized networks.

Findings

01

Achieves state-of-the-art accuracy under label noise.

02

Theoretical proof of noise separation in simplified linear models.

03

Effective in real datasets with corrupted labels.

Abstract

Recently, over-parameterized deep networks, with increasingly more network parameters than training samples, have dominated the performances of modern machine learning. However, when the training data is corrupted, it has been well-known that over-parameterized networks tend to overfit and do not generalize. In this work, we propose a principled approach for robust training of over-parameterized deep networks in classification tasks where a proportion of training labels are corrupted. The main idea is yet very simple: label noise is sparse and incoherent with the network learned from clean data, so we model the noise and learn to separate it from the data. Specifically, we model the label noise via another sparse over-parameterization term, and exploit implicit algorithmic regularizations to recover and separate the underlying corruptions. Remarkably, when trained using such a simple…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shengliu66/sop
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning