Neural expressiveness for beyond importance model compression
Angelos-Christos Maroudis, Sotirios Xydis

TL;DR
This paper introduces 'Expressiveness', a novel neuron importance criterion based on activation overlap, enabling more effective, data-agnostic model pruning that improves compression ratios with minimal performance loss.
Contribution
The paper proposes a new expressiveness criterion for pruning neural networks, which is independent of training state and can be combined with importance-based methods for enhanced compression.
Findings
Achieved up to 10x parameter compression with 1% performance loss.
Improved pruning efficiency over existing importance-based methods.
Reduced YOLOv8 MACs by 46.1% with a 3% increase in mAP.
Abstract
Neural Network Pruning has been established as driving force in the exploration of memory and energy efficient solutions with high throughput both during training and at test time. In this paper, we introduce a novel criterion for model compression, named "Expressiveness". Unlike existing pruning methods that rely on the inherent "Importance" of neurons' and filters' weights, ``Expressiveness" emphasizes a neuron's or group of neurons ability to redistribute informational resources effectively, based on the overlap of activations. This characteristic is strongly correlated to a network's initialization state, establishing criterion autonomy from the learning state stateless and thus setting a new fundamental basis for the expansion of compression strategies in regards to the "When to Prune" question. We show that expressiveness is effectively approximated with arbitrary data or limited…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Explainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications
