Revisiting hard thresholding for DNN pruning

Konstantinos Pitas; Mike Davies; Pierre Vandergheynst

arXiv:1905.08793·cs.LG·May 23, 2019·1 cites

Revisiting hard thresholding for DNN pruning

Konstantinos Pitas, Mike Davies, Pierre Vandergheynst

PDF

Open Access

TL;DR

This paper compares hard thresholding and smart pruning for DNNs, showing hard thresholding remains most efficient overall, while proposing a faster smart pruning method with minimal accuracy loss and analyzing the theoretical effects of pruning.

Contribution

It introduces a novel fast smart pruning algorithm based on difference of convex functions optimization and provides theoretical insights into the impact of hard thresholding on DNN accuracy.

Findings

01

Hard thresholding with retraining is most efficient overall.

02

The proposed smart pruning method is significantly faster and maintains low accuracy degradation.

03

Accuracy loss increases with depth from the pruned layer and relates to data manifold dimensionality.

Abstract

The most common method for DNN pruning is hard thresholding of network weights, followed by retraining to recover any lost accuracy. Recently developed smart pruning algorithms use the DNN response over the training set for a variety of cost functions to determine redundant network weights, leading to less accuracy degradation and possibly less retraining time. For experiments on the total pruning time (pruning time + retraining time) we show that hard thresholding followed by retraining remains the most efficient way of reducing the number of network parameters. However smart pruning algorithms still have advantages when retraining is not possible. In this context we propose a novel smart pruning algorithm based on difference of convex functions optimisation and show that it is often orders of magnitude faster than competing approaches while achieving the lowest classification accuracy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning

MethodsPruning