DisturbLabel: Regularizing CNN on the Loss Layer

Lingxi Xie; Jingdong Wang; Zhen Wei; Meng Wang; Qi Tian

arXiv:1605.00055·cs.CV·May 3, 2016·1 cites

DisturbLabel: Regularizing CNN on the Loss Layer

Lingxi Xie, Jingdong Wang, Zhen Wei, Meng Wang, Qi Tian

PDF

Open Access 2 Repos

TL;DR

DisturbLabel is a simple regularization technique that introduces label noise during training to prevent overfitting in CNNs, working effectively alongside Dropout and improving recognition accuracy.

Contribution

This paper introduces DisturbLabel, the first method to add noise at the loss layer, providing a novel regularization approach that enhances CNN generalization.

Findings

01

DisturbLabel prevents overfitting effectively.

02

It complements Dropout for better regularization.

03

Achieves competitive results on image datasets.

Abstract

During a long period of time we are combating over-fitting in the CNN training process with model regularization, including weight decay, model averaging, data augmentation, etc. In this paper, we present DisturbLabel, an extremely simple algorithm which randomly replaces a part of labels as incorrect values in each iteration. Although it seems weird to intentionally generate incorrect training labels, we show that DisturbLabel prevents the network training from over-fitting by implicitly averaging over exponentially many networks which are trained with different label sets. To the best of our knowledge, DisturbLabel serves as the first work which adds noises on the loss layer. Meanwhile, DisturbLabel cooperates well with Dropout to provide complementary regularization functions. Experiments demonstrate competitive recognition results on several popular image recognition datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Human Pose and Action Recognition · Neural Networks and Applications

MethodsDropout