A novel channel pruning method for deep neural network compression

Yiming Hu; Siyang Sun; Jianquan Li; Xingang Wang; Qingyi Gu

arXiv:1805.11394·cs.CV·May 30, 2018·66 cites

A novel channel pruning method for deep neural network compression

Yiming Hu, Siyang Sun, Jianquan Li, Xingang Wang, Qingyi Gu

PDF

Open Access

TL;DR

This paper introduces a genetic algorithm-based channel pruning method to compress deep CNNs, reducing memory and computation for deployment on resource-limited devices while maintaining accuracy.

Contribution

It proposes a novel channel pruning approach using genetic algorithms combined with layer sensitivity and knowledge distillation, improving efficiency and effectiveness over existing methods.

Findings

01

Achieves 8x parameter reduction on CIFAR-10 with better accuracy.

02

Outperforms state-of-the-art pruning methods on CIFAR-100 and ImageNet.

03

Reduces FLOPs by 3x while maintaining model performance.

Abstract

In recent years, deep neural networks have achieved great success in the field of computer vision. However, it is still a big challenge to deploy these deep models on resource-constrained embedded devices such as mobile robots, smart phones and so on. Therefore, network compression for such platforms is a reasonable solution to reduce memory consumption and computation complexity. In this paper, a novel channel pruning method based on genetic algorithm is proposed to compress very deep Convolution Neural Networks (CNNs). Firstly, a pre-trained CNN model is pruned layer by layer according to the sensitivity of each layer. After that, the pruned model is fine-tuned based on knowledge distillation framework. These two improvements significantly decrease the model redundancy with less accuracy drop. Channel selection is a combinatorial optimization problem that has exponential solution…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Advanced Neural Network Applications · Speech and Audio Processing

MethodsPruning · Average Pooling · Global Average Pooling · 1x1 Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Bottleneck Residual Block · Max Pooling · Kaiming Initialization · Residual Connection