Deep Neural Networks pruning via the Structured Perspective   Regularization

Matteo Cacciola; Antonio Frangioni; Xinlin Li; Andrea Lodi

arXiv:2206.14056·cs.LG·June 29, 2022·1 cites

Deep Neural Networks pruning via the Structured Perspective Regularization

Matteo Cacciola, Antonio Frangioni, Xinlin Li, Andrea Lodi

PDF

Open Access

TL;DR

This paper introduces a novel structured pruning method for deep neural networks using a regularization technique derived from a mixed-integer programming formulation, leading to efficient model compression with competitive results.

Contribution

It proposes the Structured Perspective Regularization, a new regularization approach inspired by operational research, for effective structured pruning of neural networks.

Findings

01

Achieves competitive performance on ResNet architectures.

02

Effective in pruning models for CIFAR-10, CIFAR-100, and ImageNet.

03

Outperforms some existing structured pruning methods.

Abstract

In Machine Learning, Artificial Neural Networks (ANNs) are a very powerful tool, broadly used in many applications. Often, the selected (deep) architectures include many layers, and therefore a large amount of parameters, which makes training, storage and inference expensive. This motivated a stream of research about compressing the original networks into smaller ones without excessively sacrificing performances. Among the many proposed compression approaches, one of the most popular is \emph{pruning}, whereby entire elements of the ANN (links, nodes, channels, \ldots) and the corresponding weights are deleted. Since the nature of the problem is inherently combinatorial (what elements to prune and what not), we propose a new pruning method based on Operational Research tools. We start from a natural Mixed-Integer-Programming model for the problem, and we use the Perspective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Machine Learning and Algorithms

MethodsPruning · Test · *Communicated@Fast*How Do I Communicate to Expedia? · Average Pooling · Convolution · Max Pooling · Batch Normalization · Kaiming Initialization · Residual Connection · Global Average Pooling