Compressing Features for Learning with Noisy Labels

Yingyi Chen; Shell Xu Hu; Xi Shen; Chunrong Ai; Johan A.K. Suykens

arXiv:2206.13140·cs.LG·June 28, 2022

Compressing Features for Learning with Noisy Labels

Yingyi Chen, Shell Xu Hu, Xi Shen, Chunrong Ai, Johan A.K. Suykens

PDF

Open Access 1 Repo

TL;DR

This paper introduces a feature compression approach using Dropout and Nested Dropout to improve learning with noisy labels, combined with Co-teaching, supported by theoretical analysis and experiments on real-world noisy datasets.

Contribution

It proposes a novel regularization method based on feature compression to combat overfitting in noisy label learning, with theoretical insights and empirical validation.

Findings

01

Compression regularization reduces overfitting to noisy labels.

02

The method achieves comparable or better results than state-of-the-art on benchmark datasets.

03

Theoretical analysis explains the effectiveness of feature compression in noisy label scenarios.

Abstract

Supervised learning can be viewed as distilling relevant information from input data into feature representations. This process becomes difficult when supervision is noisy as the distilled information might not be relevant. In fact, recent research shows that networks can easily overfit all labels including those that are corrupted, and hence can hardly generalize to clean datasets. In this paper, we focus on the problem of learning with noisy labels and introduce compression inductive bias to network architectures to alleviate this over-fitting problem. More precisely, we revisit one classical regularization named Dropout and its variant Nested Dropout. Dropout can serve as a compression constraint for its feature dropping mechanism, while Nested Dropout further learns ordered feature representations w.r.t. feature importance. Moreover, the trained models with compression…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yingyichen-cyy/Nested-Co-teaching
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification

MethodsDropout