TL;DR
This paper introduces LAASP, a loss-aware, automatic structured pruning method that integrates pruning and training, automatically determines pruning rates, and improves neural network efficiency with minimal accuracy loss.
Contribution
The proposed LAASP method automatically selects pruning criteria and layer-specific pruning rates during training, streamlining the pruning process and enhancing model compression and acceleration.
Findings
Significantly improves accuracy on CIFAR-10 with 52% FLOPs reduction.
Reduces FLOPs by over 42% on ImageNet with minimal accuracy drop.
Outperforms state-of-the-art pruning methods on benchmark datasets.
Abstract
Structured pruning is a well-established technique for compressing neural networks, making it suitable for deployment in resource-limited edge devices. This paper presents an efficient Loss-Aware Automatic Selection of Structured Pruning Criteria (LAASP) for slimming and accelerating deep neural networks. The majority of pruning methodologies employ a sequential process consisting of three stages: 1) training, 2) pruning, and 3) fine-tuning, whereas the proposed pruning technique adopts a pruning-while-training approach that eliminates the first stage and integrates the second and third stages into a single cycle. The automatic selection of magnitude or similarity-based filter pruning criteria from a specified pool of criteria and the specific pruning layer at each pruning iteration is guided by the network's overall loss on a small subset of the training data. To mitigate the abrupt…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPruning
