A novel channel pruning method for deep neural network compression
Yiming Hu, Siyang Sun, Jianquan Li, Xingang Wang, Qingyi Gu

TL;DR
This paper introduces a genetic algorithm-based channel pruning method to compress deep CNNs, reducing memory and computation for deployment on resource-limited devices while maintaining accuracy.
Contribution
It proposes a novel channel pruning approach using genetic algorithms combined with layer sensitivity and knowledge distillation, improving efficiency and effectiveness over existing methods.
Findings
Achieves 8x parameter reduction on CIFAR-10 with better accuracy.
Outperforms state-of-the-art pruning methods on CIFAR-100 and ImageNet.
Reduces FLOPs by 3x while maintaining model performance.
Abstract
In recent years, deep neural networks have achieved great success in the field of computer vision. However, it is still a big challenge to deploy these deep models on resource-constrained embedded devices such as mobile robots, smart phones and so on. Therefore, network compression for such platforms is a reasonable solution to reduce memory consumption and computation complexity. In this paper, a novel channel pruning method based on genetic algorithm is proposed to compress very deep Convolution Neural Networks (CNNs). Firstly, a pre-trained CNN model is pruned layer by layer according to the sensitivity of each layer. After that, the pruned model is fine-tuned based on knowledge distillation framework. These two improvements significantly decrease the model redundancy with less accuracy drop. Channel selection is a combinatorial optimization problem that has exponential solution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Advanced Neural Network Applications · Speech and Audio Processing
MethodsPruning · Average Pooling · Global Average Pooling · 1x1 Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Bottleneck Residual Block · Max Pooling · Kaiming Initialization · Residual Connection
