Wider or Deeper: Revisiting the ResNet Model for Visual Recognition
Zifeng Wu, Chunhua Shen, and Anton van den Hengel

TL;DR
This paper revisits residual network architectures, proposing a shallower model that outperforms deeper ResNets on image classification and segmentation tasks, offering improved efficiency and transferability.
Contribution
The authors introduce a new, shallower residual network architecture that surpasses deeper models like ResNet-200 in performance and efficiency across multiple visual recognition tasks.
Findings
Shallower residual networks outperform deeper ResNets on ImageNet classification.
The proposed architecture achieves state-of-the-art results in semantic segmentation.
The new model is more memory-efficient and sometimes faster to train.
Abstract
The trend towards increasingly deep neural networks has been driven by a general observation that increasing depth increases the performance of a network. Recently, however, evidence has been amassing that simply increasing depth may not be the best way to increase performance, particularly given other limitations. Investigations into deep residual networks have also suggested that they may not in fact be operating as a single deep network, but rather as an ensemble of many relatively shallow networks. We examine these issues, and in doing so arrive at a new interpretation of the unravelled view of deep residual networks which explains some of the behaviours that have been observed experimentally. As a result, we are able to derive a new, shallower, architecture of residual networks which significantly outperforms much deeper models such as ResNet-200 on the ImageNet classification…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
MethodsAverage Pooling · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling
