Pruning by Active Attention Manipulation
Zahra Babaiee, Lucas Liebenwein, Ramin Hasani, Daniela Rus, Radu Grosu

TL;DR
This paper introduces PAAM, a novel training method that dynamically learns filter importance during training, enabling effective filter pruning in CNNs without pre-training or layer-specific tuning.
Contribution
PAAM proposes a new attention-based, joint-layer training approach for filter pruning that accounts for inter-layer dependencies and does not require pre-trained models or hyperparameter tuning.
Findings
Outperforms state-of-the-art structured pruning methods on CIFAR-10 and ImageNet.
Achieves significant parameter reduction with accuracy gains.
Can train and prune networks from scratch in a single stage.
Abstract
Filter pruning of a CNN is typically achieved by applying discrete masks on the CNN's filter weights or activation maps, post-training. Here, we present a new filter-importance-scoring concept named pruning by active attention manipulation (PAAM), that sparsifies the CNN's set of filters through a particular attention mechanism, during-training. PAAM learns analog filter scores from the filter weights by optimizing a cost function regularized by an additive term in the scores. As the filters are not independent, we use attention to dynamically learn their correlations. Moreover, by training the pruning scores of all layers simultaneously, PAAM can account for layer inter-dependencies, which is essential to finding a performant sparse sub-network. PAAM can also train and generate a pruned network from scratch in a straightforward, one-stage training process without requiring a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM
MethodsPruning
