Elimination-compensation pruning for fully-connected neural networks
Enrico Ballini, Luca Muscarnera, Alessio Fumagalli, Anna Scotti, Francesco Regazzoni

TL;DR
This paper introduces a novel pruning method for fully-connected neural networks that compensates for weight removal by perturbing adjacent biases, improving sparsity without sacrificing performance.
Contribution
The work proposes a new pruning technique that considers bias perturbation for weight importance, enhancing network compression while maintaining accuracy.
Findings
The method outperforms popular pruning strategies in various scenarios.
Analytical expressions enable efficient importance computation.
Numerical experiments validate the approach's effectiveness.
Abstract
The unmatched ability of Deep Neural Networks in capturing complex patterns in large and noisy datasets is often associated with their large hypothesis space, and consequently to the vast amount of parameters that characterize model architectures. Pruning techniques affirmed themselves as valid tools to extract sparse representations of neural networks parameters, carefully balancing between compression and preservation of information. However, a fundamental assumption behind pruning is that expendable weights should have small impact on the error of the network, while highly important weights should tend to have a larger influence on the inference. We argue that this idea could be generalized; what if a weight is not simply removed but also compensated with a perturbation of the adjacent bias, which does not contribute to the network sparsity? Our work introduces a novel pruning method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques
