Conditional Automated Channel Pruning for Deep Neural Networks

Yixin Liu; Yong Guo; Zichang Liu; Haohua Liu; Jingjie Zhang; Zejun; Chen; Jing Liu; Jian Chen

arXiv:2009.09724·cs.CV·July 21, 2021

Conditional Automated Channel Pruning for Deep Neural Networks

Yixin Liu, Yong Guo, Zichang Liu, Haohua Liu, Jingjie Zhang, Zejun, Chen, Jing Liu, Jian Chen

PDF

TL;DR

This paper introduces a Conditional Automated Channel Pruning method that efficiently produces deep neural network models with various compression rates in a single process, outperforming existing methods.

Contribution

The paper presents a novel conditional model that enables multi-rate channel pruning in one step, improving efficiency and performance over traditional fixed-rate methods.

Findings

01

Models with different compression rates outperform existing methods.

02

Single pruning process achieves multiple compression rates effectively.

03

Proposed method reduces computational cost compared to multiple separate pruning runs.

Abstract

Model compression aims to reduce the redundancy of deep networks to obtain compact models. Recently, channel pruning has become one of the predominant compression methods to deploy deep models on resource-constrained devices. Most channel pruning methods often use a fixed compression rate for all the layers of the model, which, however, may not be optimal. To address this issue, given a target compression rate for the whole model, one can search for the optimal compression rate for each layer. Nevertheless, these methods perform channel pruning for a specific target compression rate. When we consider multiple compression rates, they have to repeat the channel pruning process multiple times, which is very inefficient yet unnecessary. To address this issue, we propose a Conditional Automated Channel Pruning(CACP) method to obtain the compressed models with different compression rates…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsPruning