Iterative Activation-based Structured Pruning
Kaiqi Zhao, Animesh Jain, Ming Zhao

TL;DR
This paper introduces activation-based iterative structured pruning methods, IAP and AIAP, which significantly improve model compression efficiency while maintaining accuracy, addressing hardware efficiency issues of unstructured pruning.
Contribution
The paper proposes novel activation-based iterative pruning techniques, IAP and AIAP, that outperform traditional weight-based methods in structured pruning for deep learning models.
Findings
IAP and AIAP achieve higher compression ratios with minimal accuracy loss.
Activation-based pruning methods outperform weight-based ILP in structured pruning.
Significant model size reduction on LeNet-5 and ResNet-50 with minimal accuracy impact.
Abstract
Deploying complex deep learning models on edge devices is challenging because they have substantial compute and memory resource requirements, whereas edge devices' resource budget is limited. To solve this problem, extensive pruning techniques have been proposed for compressing networks. Recent advances based on the Lottery Ticket Hypothesis (LTH) show that iterative model pruning tends to produce smaller and more accurate models. However, LTH research focuses on unstructured pruning, which is hardware-inefficient and difficult to accelerate on hardware platforms. In this paper, we investigate iterative pruning in the context of structured pruning because structurally pruned models map well on commodity hardware. We find that directly applying a structured weight-based pruning technique iteratively, called iterative L1-norm based pruning (ILP), does not produce accurate pruned models.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning
MethodsPruning
