A Unified Framework for Soft Threshold Pruning
Yanqi Chen, Zhengyu Ma, Wei Fang, Xiawu Zheng, Zhaofei Yu, Yonghong, Tian

TL;DR
This paper introduces a unified theoretical framework for soft threshold pruning using ISTA, leading to an optimal scheduler and state-of-the-art results across various neural network models and datasets.
Contribution
It reformulates soft threshold pruning as an implicit optimization problem, deriving an optimal threshold scheduler and unifying previous methods under a common theoretical perspective.
Findings
Achieved state-of-the-art pruning performance on ImageNet with ResNet-50 and MobileNet-V1.
Unified various pruning strategies under a single ISTA-based framework.
Demonstrated the framework's applicability to different neural network types, including SNNs.
Abstract
Soft threshold pruning is among the cutting-edge pruning methods with state-of-the-art performance. However, previous methods either perform aimless searching on the threshold scheduler or simply set the threshold trainable, lacking theoretical explanation from a unified perspective. In this work, we reformulate soft threshold pruning as an implicit optimization problem solved using the Iterative Shrinkage-Thresholding Algorithm (ISTA), a classic method from the fields of sparse recovery and compressed sensing. Under this theoretical framework, all threshold tuning strategies proposed in previous studies of soft threshold pruning are concluded as different styles of tuning -regularization term. We further derive an optimal threshold scheduler through an in-depth study of threshold scheduling based on our framework. This scheduler keeps -regularization coefficient stable,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Underwater Acoustics Research · Underwater Vehicles and Communication Systems
MethodsPruning · Stochastic Gradient Descent
