SASE: A Searching Architecture for Squeeze and Excitation Operations
Hanming Wang, Yunlong Li, Zijun Wu, Huifen Wang, Yuan Zhang

TL;DR
This paper introduces SASE, a neural architecture search method that automatically designs near-optimal attention modules for deep neural networks, improving performance across visual tasks by exploring a novel, subdivided search space.
Contribution
SASE is the first to subdivide the attention search space and search for architectures beyond existing modules, automating the design of effective squeeze-and-excitation attention blocks.
Findings
SASE-attention modules outperform state-of-the-art modules on ResNet-50/101.
The search space includes novel operations not previously used in attention mechanisms.
Extensive experiments validate the effectiveness of SASE in various visual tasks.
Abstract
In the past few years, channel-wise and spatial-wise attention blocks have been widely adopted as supplementary modules in deep neural networks, enhancing network representational abilities while introducing low complexity. Most attention modules follow a squeeze-and-excitation paradigm. However, to design such attention modules, requires a substantial amount of experiments and computational resources. Neural Architecture Search (NAS), meanwhile, is able to automate the design of neural networks and spares the numerous experiments required for an optimal architecture. This motivates us to design a search architecture that can automatically find near-optimal attention modules through NAS. We propose SASE, a Searching Architecture for Squeeze and Excitation operations, to form a plug-and-play attention block by searching within certain search space. The search space is separated into 4…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Software Engineering Research · AI-based Problem Solving and Planning
MethodsSoftmax · Attention Is All You Need
