Learning Spatially-Adaptive Squeeze-Excitation Networks for Image   Synthesis and Image Recognition

Jianghao Shen; Tianfu Wu

arXiv:2112.14804·cs.CV·October 4, 2022

Learning Spatially-Adaptive Squeeze-Excitation Networks for Image Synthesis and Image Recognition

Jianghao Shen, Tianfu Wu

PDF

Open Access 1 Repo

TL;DR

This paper introduces spatially-adaptive squeeze-excitation modules that enhance data specificity in lightweight networks, improving performance in image synthesis and recognition while maintaining efficiency and model compactness.

Contribution

It proposes novel spatially-adaptive squeeze-excitation modules as convolutional alternatives to multi-head self-attention, applicable to both image synthesis and recognition tasks.

Findings

01

SASE outperforms prior arts in low-shot and one-shot image synthesis.

02

SASE achieves higher accuracy than vanilla ResNets on ImageNet-1000.

03

SASE matches or surpasses MHSA-based models with smaller size.

Abstract

Learning light-weight yet expressive deep networks in both image synthesis and image recognition remains a challenging problem. Inspired by a more recent observation that it is the data-specificity that makes the multi-head self-attention (MHSA) in the Transformer model so powerful, this paper proposes to extend the widely adopted light-weight Squeeze-Excitation (SE) module to be spatially-adaptive to reinforce its data specificity, as a convolutional alternative of the MHSA, while retaining the efficiency of SE and the inductive basis of convolution. It presents two designs of spatially-adaptive squeeze-excitation (SASE) modules for image synthesis and image recognition respectively. For image synthesis tasks, the proposed SASE is tested in both low-shot and one-shot learning tasks. It shows better performance than prior arts. For image recognition tasks, the proposed SASE is used as a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

odegeasslbc/FastGAN-pytorch
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Memory and Neural Computing · CCD and CMOS Imaging Sensors

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Label Smoothing · Adam · Dense Connections · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Dropout · Layer Normalization