Wider or Deeper: Revisiting the ResNet Model for Visual Recognition

Zifeng Wu; Chunhua Shen; and Anton van den Hengel

arXiv:1611.10080·cs.CV·December 1, 2016·230 cites

Wider or Deeper: Revisiting the ResNet Model for Visual Recognition

Zifeng Wu, Chunhua Shen, and Anton van den Hengel

PDF

Open Access 3 Repos

TL;DR

This paper revisits residual network architectures, proposing a shallower model that outperforms deeper ResNets on image classification and segmentation tasks, offering improved efficiency and transferability.

Contribution

The authors introduce a new, shallower residual network architecture that surpasses deeper models like ResNet-200 in performance and efficiency across multiple visual recognition tasks.

Findings

01

Shallower residual networks outperform deeper ResNets on ImageNet classification.

02

The proposed architecture achieves state-of-the-art results in semantic segmentation.

03

The new model is more memory-efficient and sometimes faster to train.

Abstract

The trend towards increasingly deep neural networks has been driven by a general observation that increasing depth increases the performance of a network. Recently, however, evidence has been amassing that simply increasing depth may not be the best way to increase performance, particularly given other limitations. Investigations into deep residual networks have also suggested that they may not in fact be operating as a single deep network, but rather as an ensemble of many relatively shallow networks. We examine these issues, and in doing so arrive at a new interpretation of the unravelled view of deep residual networks which explains some of the behaviours that have been observed experimentally. As a result, we are able to derive a new, shallower, architecture of residual networks which significantly outperforms much deeper models such as ResNet-200 on the ImageNet classification…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications

MethodsAverage Pooling · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling