Channel Pruning In Quantization-aware Training: An Adaptive   Projection-gradient Descent-shrinkage-splitting Method

Zhijian Li; Jack Xin

arXiv:2204.04375·cs.LG·April 12, 2022

Channel Pruning In Quantization-aware Training: An Adaptive Projection-gradient Descent-shrinkage-splitting Method

Zhijian Li, Jack Xin

PDF

Open Access

TL;DR

This paper introduces APGDSSM, a novel method combining adaptive projection, gradient descent, and shrinkage techniques to enable efficient channel pruning within quantization-aware training, achieving extreme compression.

Contribution

It presents a new integrated approach that simultaneously optimizes weights for quantization and sparsity, using innovative penalties and splitting techniques for improved compression.

Findings

01

Effective channel pruning during QAT demonstrated

02

Achieves significant model compression without accuracy loss

03

Stabilizes training with a new transformed l1 penalty

Abstract

We propose an adaptive projection-gradient descent-shrinkage-splitting method (APGDSSM) to integrate penalty based channel pruning into quantization-aware training (QAT). APGDSSM concurrently searches weights in both the quantized subspace and the sparse subspace. APGDSSM uses shrinkage operator and a splitting technique to create sparse weights, as well as the Group Lasso penalty to push the weight sparsity into channel sparsity. In addition, we propose a novel complementary transformed l1 penalty to stabilize the training for extreme compression.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Signal Denoising Methods · Advanced Image Processing Techniques · Seismic Imaging and Inversion Techniques

MethodsPruning