Improving Network Slimming with Nonconvex Regularization
Kevin Bui, Fredrick Park, Shuai Zhang, Yingyong Qi, Jack Xin

TL;DR
This paper enhances network slimming for CNNs by replacing the traditional $ ext{l}_1$ regularization with nonconvex penalties like $ ext{l}_p$, MCP, and SCAD, achieving better compression and accuracy on standard datasets.
Contribution
It introduces nonconvex regularization techniques into network slimming, improving CNN compression and accuracy over traditional $ ext{l}_1$ methods.
Findings
T$ ext{l}_1$ preserves accuracy similar to $ ext{l}_1$ regularization.
$ ext{l}_{1/2}$ and $ ext{l}_{3/4}$ yield better compressed models with similar accuracy.
MCP and SCAD provide more accurate models after retraining with comparable compression.
Abstract
Convolutional neural networks (CNNs) have developed to become powerful models for various computer vision tasks ranging from object detection to semantic segmentation. However, most of the state-of-the-art CNNs cannot be deployed directly on edge devices such as smartphones and drones, which need low latency under limited power and memory bandwidth. One popular, straightforward approach to compressing CNNs is network slimming, which imposes regularization on the channel-associated scaling factors via the batch normalization layers during training. Network slimming thereby identifies insignificant channels that can be pruned for inference. In this paper, we propose replacing the penalty with an alternative nonconvex, sparsity-inducing penalty in order to yield a more compressed and/or accurate CNN architecture. We investigate , transformed …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Sparse and Compressive Sensing Techniques · Domain Adaptation and Few-Shot Learning
MethodsVisual Geometry Group 19 Layer CNN · Residual Connection · Bottleneck Residual Block · Residual Block · *Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · Global Average Pooling · Dense Block · Kaiming Initialization · Average Pooling
