Advanced deep architecture pruning using single filter performance
Yarden Tzach, Yuval Meir, Ronit D. Gross, Ofek Tevet, Ella Koresh, Ido Kanter

TL;DR
This paper introduces a novel pruning method based on single filter performance that significantly reduces neural network complexity while maintaining accuracy, demonstrated on VGG-11 and EfficientNet-B0 architectures.
Contribution
It presents a new pruning approach using applied filter cluster connections (AFCC) inspired by statistical mechanics, outperforming existing methods in high pruning scenarios.
Findings
High pruning efficiency on VGG-11 and EfficientNet-B0
Maintains accuracy despite significant parameter reduction
Effective pruning of fully connected layers
Abstract
Pruning the parameters and structure of neural networks reduces the computational complexity, energy consumption, and latency during inference. Recently, a novel underlying mechanism for successful deep learning (DL) was presented based on a method that quantitatively measures the single filter performance in each layer of a DL architecture, and a new comprehensive mechanism of how deep learning works was presented. This statistical mechanics inspired viewpoint enables to reveal the macroscopic behavior of the entire network from the microscopic performance of each filter and their cooperative behavior. Herein, we demonstrate how this understanding paves the path to high quenched dilution of the convolutional layers of deep architectures without affecting their overall accuracy using applied filter cluster connections (AFCC). AFCC is exemplified on VGG-11 and EfficientNet-B0…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAntenna Design and Optimization
MethodsPruning
