Squeeze-and-Excitation Networks
Jie Hu, Li Shen, Samuel Albanie, Gang Sun, Enhua Wu

TL;DR
Squeeze-and-Excitation Networks introduce SE blocks that recalibrate channel-wise features in CNNs, significantly improving performance with minimal additional computation, and achieved top results in ImageNet 2017 classification.
Contribution
The paper proposes the SE block, a novel architectural unit for adaptively recalibrating channel features in CNNs, enhancing their representational power and performance.
Findings
SE blocks improve CNN accuracy across multiple datasets.
SE-enhanced models outperform state-of-the-art CNNs with slight computational overhead.
SENet achieved first place in ILSVRC 2017 classification challenge.
Abstract
The central building block of convolutional neural networks (CNNs) is the convolution operator, which enables networks to construct informative features by fusing both spatial and channel-wise information within local receptive fields at each layer. A broad range of prior research has investigated the spatial component of this relationship, seeking to strengthen the representational power of a CNN by enhancing the quality of spatial encodings throughout its feature hierarchy. In this work, we focus instead on the channel relationship and propose a novel architectural unit, which we term the "Squeeze-and-Excitation" (SE) block, that adaptively recalibrates channel-wise feature responses by explicitly modelling interdependencies between channels. We show that these blocks can be stacked together to form SENet architectures that generalise extremely effectively across different datasets.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗kadirnar/timm_model_listmodel· ♡ 1♡ 1
- 🤗timm/seresnet33ts.ra2_in1kmodel· 110 dl110 dl
- 🤗timm/seresnext26ts.ch_in1kmodel· 92 dl92 dl
- 🤗timm/senet154.gluon_in1kmodel· 182 dl· ♡ 1182 dl♡ 1
- 🤗timm/seresnet50.a1_in1kmodel· 3.4k dl3.4k dl
- 🤗timm/seresnet50.a2_in1kmodel· 66 dl66 dl
- 🤗timm/seresnet50.a3_in1kmodel· 69 dl69 dl
- 🤗timm/seresnet50.ra2_in1kmodel· 115 dl115 dl
- 🤗timm/seresnet152d.ra2_in1kmodel· 122 dl122 dl
- 🤗timm/seresnext26d_32x4d.bt_in1kmodel· 1.3k dl1.3k dl
Videos
W&B Paper Reading Group: Squeeze-and-Excitation Networks· youtube
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Neural Networks and Applications
MethodsDispute^Resolution^Expedia--How do I file a dispute with Expedia? · Average Pooling · Global Average Pooling · Max Pooling · Softmax · Kaiming Initialization · Step Decay · SGD with Momentum · Random Horizontal Flip · Random Resized Crop
