Deep Neural Networks pruning via the Structured Perspective Regularization
Matteo Cacciola, Antonio Frangioni, Xinlin Li, Andrea Lodi

TL;DR
This paper introduces a novel structured pruning method for deep neural networks using a regularization technique derived from a mixed-integer programming formulation, leading to efficient model compression with competitive results.
Contribution
It proposes the Structured Perspective Regularization, a new regularization approach inspired by operational research, for effective structured pruning of neural networks.
Findings
Achieves competitive performance on ResNet architectures.
Effective in pruning models for CIFAR-10, CIFAR-100, and ImageNet.
Outperforms some existing structured pruning methods.
Abstract
In Machine Learning, Artificial Neural Networks (ANNs) are a very powerful tool, broadly used in many applications. Often, the selected (deep) architectures include many layers, and therefore a large amount of parameters, which makes training, storage and inference expensive. This motivated a stream of research about compressing the original networks into smaller ones without excessively sacrificing performances. Among the many proposed compression approaches, one of the most popular is \emph{pruning}, whereby entire elements of the ANN (links, nodes, channels, \ldots) and the corresponding weights are deleted. Since the nature of the problem is inherently combinatorial (what elements to prune and what not), we propose a new pruning method based on Operational Research tools. We start from a natural Mixed-Integer-Programming model for the problem, and we use the Perspective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Machine Learning and Algorithms
MethodsPruning · Test · *Communicated@Fast*How Do I Communicate to Expedia? · Average Pooling · Convolution · Max Pooling · Batch Normalization · Kaiming Initialization · Residual Connection · Global Average Pooling
