DBP: Discrimination Based Block-Level Pruning for Deep Model   Acceleration

Wenxiao Wang; Shuai Zhao; Minghao Chen; Jinming Hu; Deng Cai; Haifeng; Liu

arXiv:1912.10178·cs.CV·December 24, 2019·21 cites

DBP: Discrimination Based Block-Level Pruning for Deep Model Acceleration

Wenxiao Wang, Shuai Zhao, Minghao Chen, Jinming Hu, Deng Cai, Haifeng, Liu

PDF

Open Access

TL;DR

This paper introduces DBP, a block-level pruning method that removes redundant layer blocks based on feature discrimination, achieving higher acceleration ratios and better accuracy than traditional filter-level pruning methods.

Contribution

The paper proposes a novel block-level pruning approach that considers layer dependencies, outperforming filter-level pruning in both acceleration and accuracy.

Findings

01

DBP achieves higher acceleration ratios than filter-level pruning.

02

DBP maintains or improves accuracy compared to state-of-the-art methods.

03

Extensive experiments validate the effectiveness of DBP.

Abstract

Neural network pruning is one of the most popular methods of accelerating the inference of deep convolutional neural networks (CNNs). The dominant pruning methods, filter-level pruning methods, evaluate their performance through the reduction ratio of computations and deem that a higher reduction ratio of computations is equivalent to a higher acceleration ratio in terms of inference time. However, we argue that they are not equivalent if parallel computing is considered. Given that filter-level pruning only prunes filters in layers and computations in a layer usually run in parallel, most computations reduced by filter-level pruning usually run in parallel with the un-reduced ones. Thus, the acceleration ratio of filter-level pruning is limited. To get a higher acceleration ratio, it is better to prune redundant layers because computations of different layers cannot run in parallel. In…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning

MethodsPruning