ResNet Can Be Pruned 60x: Introducing Network Purification and Unused   Path Removal (P-RM) after Weight Pruning

Xiaolong Ma; Geng Yuan; Sheng Lin; Zhengang Li; Hao Sun; Yanzhi Wang

arXiv:1905.00136·cs.LG·May 3, 2019·6 cites

ResNet Can Be Pruned 60x: Introducing Network Purification and Unused Path Removal (P-RM) after Weight Pruning

Xiaolong Ma, Geng Yuan, Sheng Lin, Zhengang Li, Hao Sun, Yanzhi Wang

PDF

Open Access

TL;DR

This paper introduces a novel post-processing framework called Network Purification and Unused Path Removal (P-RM) that significantly enhances structured weight pruning in DNNs, achieving up to 232x compression on LeNet-5 and 60x on ResNet-18.

Contribution

It presents the first combined structured pruning method using ADMM and introduces post-processing algorithms to remove unused weights, improving pruning efficiency and model compression.

Findings

01

Achieved 232x compression on LeNet-5.

02

Achieved 60x compression on ResNet-18 CIFAR-10.

03

Achieved over 5x compression on AlexNet.

Abstract

The state-of-art DNN structures involve high computation and great demand for memory storage which pose intensive challenge on DNN framework resources. To mitigate the challenges, weight pruning techniques has been studied. However, high accuracy solution for extreme structured pruning that combines different types of structured sparsity still waiting for unraveling due to the extremely reduced weights in DNN networks. In this paper, we propose a DNN framework which combines two different types of structured weight pruning (filter and column prune) by incorporating alternating direction method of multipliers (ADMM) algorithm for better prune performance. We are the first to find non-optimality of ADMM process and unused weights in a structured pruned model, and further design an optimization framework which contains the first proposed Network Purification and Unused Path Removal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Advanced Memory and Neural Computing

MethodsPruning · 1x1 Convolution · Convolution · Local Response Normalization · Grouped Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · Dense Connections · Max Pooling · Softmax