Loss-Aware Automatic Selection of Structured Pruning Criteria for Deep Neural Network Acceleration

Deepak Ghimire; Kilho Lee; and Seong-heum Kim

arXiv:2506.20152·cs.CV·June 26, 2025

Loss-Aware Automatic Selection of Structured Pruning Criteria for Deep Neural Network Acceleration

Deepak Ghimire, Kilho Lee, and Seong-heum Kim

PDF

2 Repos

TL;DR

This paper introduces LAASP, a loss-aware, automatic structured pruning method that integrates pruning and training, automatically determines pruning rates, and improves neural network efficiency with minimal accuracy loss.

Contribution

The proposed LAASP method automatically selects pruning criteria and layer-specific pruning rates during training, streamlining the pruning process and enhancing model compression and acceleration.

Findings

01

Significantly improves accuracy on CIFAR-10 with 52% FLOPs reduction.

02

Reduces FLOPs by over 42% on ImageNet with minimal accuracy drop.

03

Outperforms state-of-the-art pruning methods on benchmark datasets.

Abstract

Structured pruning is a well-established technique for compressing neural networks, making it suitable for deployment in resource-limited edge devices. This paper presents an efficient Loss-Aware Automatic Selection of Structured Pruning Criteria (LAASP) for slimming and accelerating deep neural networks. The majority of pruning methodologies employ a sequential process consisting of three stages: 1) training, 2) pruning, and 3) fine-tuning, whereas the proposed pruning technique adopts a pruning-while-training approach that eliminates the first stage and integrates the second and third stages into a single cycle. The automatic selection of magnitude or similarity-based filter pruning criteria from a specified pool of criteria and the specific pruning layer at each pruning iteration is guided by the network's overall loss on a small subset of the training data. To mitigate the abrupt…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsPruning