PDP: Parameter-free Differentiable Pruning is All You Need

Minsik Cho; Saurabh Adya; Devang Naik

arXiv:2305.11203·cs.LG·November 21, 2023·2 cites

PDP: Parameter-free Differentiable Pruning is All You Need

Minsik Cho, Saurabh Adya, Devang Naik

PDF

Open Access

TL;DR

PDP introduces a simple, efficient, and universal train-time pruning method that achieves state-of-the-art results in model compression and accuracy across vision and language tasks without complex optimization.

Contribution

It proposes a parameter-free differentiable pruning scheme that is easy to implement, effective, and applicable to various architectures and pruning constraints.

Findings

01

Achieves 68.2% top-1 accuracy at 86.6% sparsity on MobileNet-v1.

02

Yields over 83.1% accuracy on NLP tasks with 90% sparsity.

03

Improves structured pruning results, e.g., ResNet18 and ResNet50.

Abstract

DNN pruning is a popular way to reduce the size of a model, improve the inference latency, and minimize the power consumption on DNN accelerators. However, existing approaches might be too complex, expensive or ineffective to apply to a variety of vision/language tasks, DNN architectures and to honor structured pruning constraints. In this paper, we propose an efficient yet effective train-time pruning scheme, Parameter-free Differentiable Pruning (PDP), which offers state-of-the-art qualities in model size, accuracy, and training cost. PDP uses a dynamic function of weights during training to generate soft pruning masks for the weights in a parameter-free manner for a given pruning target. While differentiable, the simplicity and efficiency of PDP make it universal enough to deliver state-of-the-art random/structured/channel pruning results on various vision and natural language tasks.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Adam · Weight Decay · Residual Connection · Pruning · Dense Connections · Dropout