GhostNetV2: Enhance Cheap Operation with Long-Range Attention
Yehui Tang, Kai Han, Jianyuan Guo, Chang Xu, Chao Xu, Yunhe Wang

TL;DR
GhostNetV2 introduces a hardware-efficient long-range attention mechanism to enhance lightweight CNNs, improving global feature capture without sacrificing speed, and achieves superior accuracy on ImageNet.
Contribution
The paper proposes DFC attention, a fast, hardware-friendly long-range attention mechanism, and integrates it into GhostNetV2 to improve global feature aggregation in mobile neural networks.
Findings
Achieves 75.3% top-1 accuracy on ImageNet with 167M FLOPs.
Outperforms GhostNetV1 with similar computational cost.
Demonstrates the effectiveness of long-range attention in lightweight CNNs.
Abstract
Light-weight convolutional neural networks (CNNs) are specially designed for applications on mobile devices with faster inference speed. The convolutional operation can only capture local information in a window region, which prevents performance from being further improved. Introducing self-attention into convolution can capture global information well, but it will largely encumber the actual speed. In this paper, we propose a hardware-friendly attention mechanism (dubbed DFC attention) and then present a new GhostNetV2 architecture for mobile applications. The proposed DFC attention is constructed based on fully-connected layers, which can not only execute fast on common hardware but also capture the dependence between long-range pixels. We further revisit the expressiveness bottleneck in previous GhostNet and propose to enhance expanded features produced by cheap operations with DFC…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Brain Tumor Detection and Classification
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Sigmoid Activation · Depthwise Convolution · Average Pooling · Pointwise Convolution · Global Average Pooling · Depthwise Separable Convolution · Squeeze-and-Excitation Block · Batch Normalization · Softmax
