Channel Pruning Guided by Spatial and Channel Attention for DNNs in Intelligent Edge Computing
Mengran Liu, Weiwei Fang, Xiaodong Ma, Wenyuan Xu, Naixue, Xiong, Yi Ding

TL;DR
This paper introduces a novel attention-guided channel pruning method for deep neural networks, improving model compression efficiency and accuracy on edge devices by focusing on the most informative features.
Contribution
The paper proposes a new attention module combining spatial and channel attention, and a pruning method guided by this module, enhancing model compression without accuracy loss.
Findings
SCA achieves superior inference accuracy with minimal resource overhead.
CPSCA outperforms existing pruning methods at the same pruning ratios.
Experimental results on benchmark datasets validate the effectiveness of the proposed approach.
Abstract
Deep Neural Networks (DNNs) have achieved remarkable success in many computer vision tasks recently, but the huge number of parameters and the high computation overhead hinder their deployments on resource-constrained edge devices. It is worth noting that channel pruning is an effective approach for compressing DNN models. A critical challenge is to determine which channels are to be removed, so that the model accuracy will not be negatively affected. In this paper, we first propose Spatial and Channel Attention (SCA), a new attention module combining both spatial and channel attention that respectively focuses on "where" and "what" are the most informative parts. Guided by the scale values generated by SCA for measuring channel importance, we further propose a new channel pruning approach called Channel Pruning guided by Spatial and Channel Attention (CPSCA). Experimental results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
MethodsPruning
