Pruning Everything, Everywhere, All at Once
Gustavo Henrique do Nascimento, Ian Pons, Anna Helena Reali Costa, Artur Jordao

TL;DR
This paper introduces a novel pruning method that simultaneously prunes neurons and layers by selecting the most representative subnetwork, leading to highly efficient models with minimal accuracy loss and enhanced robustness.
Contribution
The authors propose a new approach for joint pruning of neurons and layers using representation similarity, outperforming existing methods in efficiency and robustness.
Findings
Achieved up to 86.37% FLOPs reduction on ResNet56
Outperformed state-of-the-art pruning techniques in accuracy retention
Reduced carbon emissions by up to 83.31% with pruned models
Abstract
Deep learning stands as the modern paradigm for solving cognitive tasks. However, as the problem complexity increases, models grow deeper and computationally prohibitive, hindering advancements in real-world and resource-constrained applications. Extensive studies reveal that pruning structures in these models efficiently reduces model complexity and improves computational efficiency. Successful strategies in this sphere include removing neurons (i.e., filters, heads) or layers, but not both together. Therefore, simultaneously pruning different structures remains an open problem. To fill this gap and leverage the benefits of eliminating neurons and layers at once, we propose a new method capable of pruning different structures within a model as follows. Given two candidate subnetworks (pruned models), one from layer pruning and the other from neuron pruning, our method decides which to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications
MethodsPruning
