Importance Estimation with Random Gradient for Neural Network Pruning

Suman Sapkota; Binod Bhattarai

arXiv:2310.20203·cs.LG·November 1, 2023·1 cites

Importance Estimation with Random Gradient for Neural Network Pruning

Suman Sapkota, Binod Bhattarai

PDF

Open Access

TL;DR

This paper introduces a novel importance estimation technique for neural network pruning that uses random gradients and normalization to improve efficiency without requiring labeled data, outperforming existing methods.

Contribution

It proposes a new importance estimation method using random gradients and normalization, enhancing pruning performance and complementing existing approaches.

Findings

01

Outperforms previous importance estimation methods on ResNet and VGG.

02

Effective without labeled examples due to random gradient propagation.

03

Improves existing methods when combined with them.

Abstract

Global Neuron Importance Estimation is used to prune neural networks for efficiency reasons. To determine the global importance of each neuron or convolutional kernel, most of the existing methods either use activation or gradient information or both, which demands abundant labelled examples. In this work, we use heuristics to derive importance estimation similar to Taylor First Order (TaylorFO) approximation based methods. We name our methods TaylorFO-abs and TaylorFO-sq. We propose two additional methods to improve these importance estimation methods. Firstly, we propagate random gradients from the last layer of a network, thus avoiding the need for labelled examples. Secondly, we normalize the gradient magnitude of the last layer output before propagating, which allows all examples to contribute similarly to the importance score. Our methods with additional techniques perform better…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Softmax · Batch Normalization · Average Pooling · Max Pooling · Dropout · 1x1 Convolution · Dense Connections · Residual Block · Global Average Pooling