$\ell_0$ Regularized Structured Sparsity Convolutional Neural Networks
Kevin Bui, Fredrick Park, Shuai Zhang, Yingyong Qi, Jack, Xin

TL;DR
This paper introduces a novel $\,\ell_0$ regularized structured sparsity method for CNNs that effectively reduces model size while maintaining high accuracy, using a new sparse group lasso variant and an iterative optimization approach.
Contribution
It proposes a new $\,\ell_0$ regularized structured sparsity technique for CNNs, combining $\,\ell_0$ and $\,\ell_{2,1}$ norms, with an algorithm for efficient training and sparsification.
Findings
Achieves high test accuracy with significant sparsity on LeNet-5 and Wide-ResNet.
Outperforms existing sparsification methods in accuracy and compression.
Effective on MNIST and CIFAR datasets.
Abstract
Deepening and widening convolutional neural networks (CNNs) significantly increases the number of trainable weight parameters by adding more convolutional layers and feature maps per layer, respectively. By imposing inter- and intra-group sparsity onto the weights of the layers during the training process, a compressed network can be obtained with accuracy comparable to a dense one. In this paper, we propose a new variant of sparse group lasso that blends the norm onto the individual weight parameters and the norm onto the output channels of a layer. To address the non-differentiability of the norm, we apply variable splitting resulting in an algorithm that consists of executing stochastic gradient descent followed by hard thresholding for each iteration. Numerical experiments are demonstrated on LeNet-5 and wide-residual-networks for MNIST and CIFAR…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Imaging and Analysis · Non-Destructive Testing Techniques
MethodsTest
