A Survey on Deep Neural Network Pruning-Taxonomy, Comparison, Analysis, and Recommendations
Hongrong Cheng, Miao Zhang, Javen Qinfeng Shi

TL;DR
This survey comprehensively reviews deep neural network pruning techniques, categorizing methods, comparing different settings, and exploring emerging topics to guide future research and practical deployment in resource-constrained environments.
Contribution
It provides an up-to-date taxonomy, comparative analysis, and recommendations for neural network pruning, including a curated dataset and evaluation collection for future research.
Findings
Pruning methods vary across speedup, timing, technique, and fusion categories.
Emerging topics include pruning for large language and multimodal models.
The survey offers practical guidelines and a resource repository for researchers.
Abstract
Modern deep neural networks, particularly recent large language models, come with massive model sizes that require significant computational and storage resources. To enable the deployment of modern models on resource-constrained environments and accelerate inference time, researchers have increasingly explored pruning techniques as a popular research direction in neural network compression. However, there is a dearth of up-to-date comprehensive review papers on pruning. To address this issue, in this survey, we provide a comprehensive review of existing research works on deep neural network pruning in a taxonomy of 1) universal/specific speedup, 2) when to prune, 3) how to prune, and 4) fusion of pruning and other compression techniques. We then provide a thorough comparative analysis of eight pairs of contrast settings for pruning and explore emerging topics, including pruning for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
MethodsPruning
