Towards Generalized Entropic Sparsification for Convolutional Neural   Networks

Tin Barisin; Illia Horenko

arXiv:2404.04734·cs.CV·April 9, 2024·1 cites

Towards Generalized Entropic Sparsification for Convolutional Neural Networks

Tin Barisin, Illia Horenko

PDF

Open Access

TL;DR

This paper presents a scalable, data-driven pruning method for CNNs based on entropic relaxation, effectively reducing network size with minimal accuracy loss across multiple benchmarks.

Contribution

Introduces a novel entropic relaxation-based pruning technique for CNNs that is computationally scalable and effective in achieving high sparsity with minimal accuracy loss.

Findings

01

Achieved 55-84% sparsity on MNIST with 0.1-0.5% accuracy loss

02

Achieved 73-89% sparsity on CIFAR-10 with 0.1-0.5% accuracy loss

03

Validated method on multiple architectures and datasets

Abstract

Convolutional neural networks (CNNs) are reported to be overparametrized. The search for optimal (minimal) and sufficient architecture is an NP-hard problem as the hyperparameter space for possible network configurations is vast. Here, we introduce a layer-by-layer data-driven pruning method based on the mathematical idea aiming at a computationally-scalable entropic relaxation of the pruning problem. The sparse subnetwork is found from the pre-trained (full) CNN using the network entropy minimization as a sparsity constraint. This allows deploying a numerically scalable algorithm with a sublinear scaling cost. The method is validated on several benchmarks (architectures): (i) MNIST (LeNet) with sparsity 55%-84% and loss in accuracy 0.1%-0.5%, and (ii) CIFAR-10 (VGG-16, ResNet18) with sparsity 73-89% and loss in accuracy 0.1%-0.5%.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks

MethodsPruning