Iterative Activation-based Structured Pruning

Kaiqi Zhao; Animesh Jain; Ming Zhao

arXiv:2201.09881·cs.LG·January 26, 2022

Iterative Activation-based Structured Pruning

Kaiqi Zhao, Animesh Jain, Ming Zhao

PDF

Open Access

TL;DR

This paper introduces activation-based iterative structured pruning methods, IAP and AIAP, which significantly improve model compression efficiency while maintaining accuracy, addressing hardware efficiency issues of unstructured pruning.

Contribution

The paper proposes novel activation-based iterative pruning techniques, IAP and AIAP, that outperform traditional weight-based methods in structured pruning for deep learning models.

Findings

01

IAP and AIAP achieve higher compression ratios with minimal accuracy loss.

02

Activation-based pruning methods outperform weight-based ILP in structured pruning.

03

Significant model size reduction on LeNet-5 and ResNet-50 with minimal accuracy impact.

Abstract

Deploying complex deep learning models on edge devices is challenging because they have substantial compute and memory resource requirements, whereas edge devices' resource budget is limited. To solve this problem, extensive pruning techniques have been proposed for compressing networks. Recent advances based on the Lottery Ticket Hypothesis (LTH) show that iterative model pruning tends to produce smaller and more accurate models. However, LTH research focuses on unstructured pruning, which is hardware-inefficient and difficult to accelerate on hardware platforms. In this paper, we investigate iterative pruning in the context of structured pruning because structurally pruned models map well on commodity hardware. We find that directly applying a structured weight-based pruning technique iteratively, called iterative L1-norm based pruning (ILP), does not produce accurate pruned models.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning

MethodsPruning