PDP: Parameter-free Differentiable Pruning is All You Need
Minsik Cho, Saurabh Adya, Devang Naik

TL;DR
PDP introduces a simple, efficient, and universal train-time pruning method that achieves state-of-the-art results in model compression and accuracy across vision and language tasks without complex optimization.
Contribution
It proposes a parameter-free differentiable pruning scheme that is easy to implement, effective, and applicable to various architectures and pruning constraints.
Findings
Achieves 68.2% top-1 accuracy at 86.6% sparsity on MobileNet-v1.
Yields over 83.1% accuracy on NLP tasks with 90% sparsity.
Improves structured pruning results, e.g., ResNet18 and ResNet50.
Abstract
DNN pruning is a popular way to reduce the size of a model, improve the inference latency, and minimize the power consumption on DNN accelerators. However, existing approaches might be too complex, expensive or ineffective to apply to a variety of vision/language tasks, DNN architectures and to honor structured pruning constraints. In this paper, we propose an efficient yet effective train-time pruning scheme, Parameter-free Differentiable Pruning (PDP), which offers state-of-the-art qualities in model size, accuracy, and training cost. PDP uses a dynamic function of weights during training to generate soft pruning masks for the weights in a parameter-free manner for a given pruning target. While differentiable, the simplicity and efficiency of PDP make it universal enough to deliver state-of-the-art random/structured/channel pruning results on various vision and natural language tasks.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Adam · Weight Decay · Residual Connection · Pruning · Dense Connections · Dropout
