Layer Pruning for Accelerating Very Deep Neural Networks

Weiwei Zhang; Changsheng Chen; Xuechun Wu; Jialin Gao; Di Bao; Jiwei Li; Xi Zhou

arXiv:1910.12727 · cs.LG · October 29, 2019 · 1 citation

Open Access

TL;DR

This paper introduces an adaptive layer and channel pruning technique for very deep neural networks that reduces the parameter count by half without sacrificing accuracy, and in some cases improves it.

Contribution

The paper presents an adaptive pruning method that learns what proportion of channels and layers to remove, shrinking the model while maintaining or improving accuracy.

Findings

01 Reduces model parameters by 50%
02 Maintains or improves baseline accuracy
03 Learns the proportion of layers and channels to prune rather than fixing it in advance

Abstract

In this paper, we propose an adaptive pruning method that can remove both channels and layers. The proportion of layers and channels to remove is learned adaptively. The proposed method can remove half of the parameters while keeping accuracy at or above the baseline.
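The abstract does not spell out the pruning mechanism, so the following is only a minimal sketch of one common way to prune both channels and layers: each layer and each output channel carries a learned gate value, and anything whose gate falls below a threshold is removed. The gate fields, the threshold of 0.5, the plain chain of 3x3 convolutions, and the assumption that a removed layer sits on a skip connection (so its input passes through unchanged) are all hypothetical and are not taken from the paper. The sketch only shows how the pruned proportion falls out of the learned gates and how the parameter count changes.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class GatedConv:
    """A conv layer described only by its (hypothetical) learned gates."""
    layer_gate: float           # hypothetical gate for the whole layer
    channel_gates: List[float]  # hypothetical gate per output channel


def pruned_widths(layers: List[GatedConv], threshold: float = 0.5) -> List[int]:
    """Output-channel counts of the layers that survive pruning.

    A layer is dropped when its layer gate is below the threshold or when
    none of its channels survive. Assumption: a dropped layer is bypassed
    by a skip connection, so the next layer reads the previous output.
    """
    widths = []
    for layer in layers:
        if layer.layer_gate < threshold:
            continue  # whole layer removed
        kept = sum(1 for g in layer.channel_gates if g >= threshold)
        if kept > 0:
            widths.append(kept)
    return widths


def param_count(widths: List[int], in_channels: int, kernel_size: int = 3) -> int:
    """Weight count (biases ignored) of a plain chain of k x k convolutions."""
    total = 0
    prev = in_channels
    for out in widths:
        total += prev * out * kernel_size * kernel_size
        prev = out
    return total


# Toy network: three layers with four output channels each, RGB input.
layers = [
    GatedConv(layer_gate=0.9, channel_gates=[0.9, 0.8, 0.1, 0.7]),
    GatedConv(layer_gate=0.2, channel_gates=[0.9, 0.9, 0.9, 0.9]),
    GatedConv(layer_gate=0.8, channel_gates=[0.6, 0.3, 0.9, 0.2]),
]

original = param_count([len(l.channel_gates) for l in layers], in_channels=3)
pruned = param_count(pruned_widths(layers), in_channels=3)
print(original, pruned)  # prints: 396 135
```

In this toy case the second layer is removed entirely and the other two lose one and two channels, so the fraction removed is a consequence of the gate values rather than a preset ratio. How the gates would be trained is outside the scope of this sketch.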

Taxonomy

Topics: Neural Networks and Applications · Neural Networks and Reservoir Computing · Speech and Audio Processing

Methods: Pruning