Learning Sparse Neural Networks via Sensitivity-Driven Regularization

Enzo Tartaglione; Skjalg Leps{\o}y; Attilio Fiandrotti; Gianluca; Francini

arXiv:1810.11764·cs.LG·October 30, 2018·6 cites

Learning Sparse Neural Networks via Sensitivity-Driven Regularization

Enzo Tartaglione, Skjalg Leps{\o}y, Attilio Fiandrotti, Gianluca, Francini

PDF

Open Access

TL;DR

This paper introduces a sensitivity-driven regularization method for training sparse neural networks, effectively reducing parameters while maintaining accuracy, and outperforming recent sparsification techniques in both sparsity and error rates.

Contribution

The paper proposes a novel regularization approach based on output sensitivity to selectively prune neural network weights, achieving higher sparsity with comparable or better accuracy.

Findings

01

Method surpasses recent techniques in sparsity and error rates.

02

Achieves up to twice the sparsity at the same error rate.

03

Effectively prunes parameters with minimal impact on performance.

Abstract

The ever-increasing number of parameters in deep neural networks poses challenges for memory-limited applications. Regularize-and-prune methods aim at meeting these challenges by sparsifying the network weights. In this context we quantify the output sensitivity to the parameters (i.e. their relevance to the network output) and introduce a regularization term that gradually lowers the absolute value of parameters with low sensitivity. Thus, a very large fraction of the parameters approach zero and are eventually set to zero by simple thresholding. Our method surpasses most of the recent techniques both in terms of sparsity and error rates. In some cases, the method reaches twice the sparsity obtained by other techniques at equal error rates.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Model Reduction and Neural Networks