ShuffleBlock: Shuffle to Regularize Deep Convolutional Neural Networks
Sudhakar Kumawat, Gagan Kanojia, and Shanmuganathan Raman

TL;DR
ShuffleBlock introduces a novel regularization method for deep CNNs by shuffling small patches between channels, which improves generalization and outperforms many existing regularization techniques.
Contribution
The paper proposes ShuffleBlock, a new regularization technique involving patch shuffling between channels, enhancing CNN performance on image classification tasks.
Findings
ShuffleBlock improves CNN accuracy on CIFAR and ImageNet datasets.
Random patch shuffling acts as structured noise, aiding regularization.
ShuffleBlock outperforms many existing regularization methods.
Abstract
Deep neural networks have enormous representational power which leads them to overfit on most datasets. Thus, regularizing them is important in order to reduce overfitting and enhance their generalization capabilities. Recently, channel shuffle operation has been introduced for mixing channels in group convolutions in resource efficient networks in order to reduce memory and computations. This paper studies the operation of channel shuffle as a regularization technique in deep convolutional networks. We show that while random shuffling of channels during training drastically reduce their performance, however, randomly shuffling small patches between channels significantly improves their performance. The patches to be shuffled are picked from the same spatial locations in the feature maps such that a patch, when transferred from one channel to another, acts as structured noise for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition
MethodsChannel Shuffle
