A Survey on Deep Neural Network Compression: Challenges, Overview, and Solutions
Rahul Mishra, Hari Prabhat Gupta, and Tanima Dutta

TL;DR
This paper provides a comprehensive survey of deep neural network compression techniques, categorizing methods like pruning, sparse representation, quantization, and knowledge distillation, highlighting challenges and future directions for resource-constrained IoT applications.
Contribution
It offers a detailed overview and classification of existing DNN compression methods, along with analysis of challenges and future research directions.
Findings
Five main categories of DNN compression techniques identified.
Challenges vary across different compression methods.
Future research directions outlined for each category.
Abstract
Deep Neural Network (DNN) has gained unprecedented performance due to its automated feature extraction capability. This high order performance leads to significant incorporation of DNN models in different Internet of Things (IoT) applications in the past decade. However, the colossal requirement of computation, energy, and storage of DNN models make their deployment prohibitive on resource constraint IoT devices. Therefore, several compression techniques were proposed in recent years for reducing the storage and computation requirements of the DNN model. These techniques on DNN compression have utilized a different perspective for compressing DNN with minimal accuracy compromise. It encourages us to make a comprehensive overview of the DNN compression techniques. In this paper, we present a comprehensive review of existing literature on compressing DNN model that reduces both storage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Brain Tumor Detection and Classification
