Improved Regularization of Convolutional Neural Networks with Cutout
Terrance DeVries, Graham W. Taylor

TL;DR
This paper introduces 'Cutout', a simple regularization technique that involves masking out random square regions of input images during training, which improves CNN robustness and achieves state-of-the-art results on multiple datasets.
Contribution
The paper proposes a novel, easy-to-implement regularization method called Cutout that enhances CNN performance when combined with existing techniques.
Findings
Achieved new state-of-the-art test errors on CIFAR-10, CIFAR-100, and SVHN datasets.
Cutout improves model robustness and generalization.
Complementary to existing data augmentation and regularization methods.
Abstract
Convolutional neural networks are capable of learning powerful representational spaces, which are necessary for tackling complex learning tasks. However, due to the model capacity required to capture such representations, they are often susceptible to overfitting and therefore require proper regularization in order to generalize well. In this paper, we show that the simple regularization technique of randomly masking out square regions of input during training, which we call cutout, can be used to improve the robustness and overall performance of convolutional neural networks. Not only is this method extremely easy to implement, but we also demonstrate that it can be used in conjunction with existing forms of data augmentation and other regularizers to further improve model performance. We evaluate this method by applying it to current state-of-the-art architectures on the CIFAR-10,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Neural Networks and Applications · Human Pose and Action Recognition
MethodsAverage Pooling · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling
