4-Connected Shift Residual Networks
Andrew Brown, Pascal Mettes, Marcel Worring

TL;DR
This paper explores the use of 4-connected shift operations in residual networks, demonstrating that they can match or surpass traditional convolutions in accuracy while significantly reducing computational costs.
Contribution
It introduces a 4-connected shift residual network that achieves high accuracy with fewer parameters and FLOPs, outperforming 8-connected shifts and traditional ResNet structures.
Findings
4-connected shifts match 8-connected shifts in accuracy on ImageNet.
4-connected shifts outperform 8-connected shifts when applied to all point-wise convolutions.
The proposed shift-based network achieves higher accuracy with over 40% fewer parameters and FLOPs.
Abstract
The shift operation was recently introduced as an alternative to spatial convolutions. The operation moves subsets of activations horizontally and/or vertically. Spatial convolutions are then replaced with shift operations followed by point-wise convolutions, significantly reducing computational costs. In this work, we investigate how shifts should best be applied to high accuracy CNNs. We apply shifts of two different neighbourhood groups to ResNet on ImageNet: the originally introduced 8-connected (8C) neighbourhood shift and the less well studied 4-connected (4C) neighbourhood shift. We find that when replacing ResNet's spatial convolutions with shifts, both shift neighbourhoods give equal ImageNet accuracy, showing the sufficiency of small neighbourhoods for large images. Interestingly, when incorporating shifts to all point-wise convolutions in residual networks, 4-connected shifts…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
MethodsAverage Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling · Residual Connection
