Weight Concentration Regularization for Improving Pruning Robustness Under High Sparsity
Vincent-Daniel Yun, Junhyuk Jo, Sunwoo Lee

TL;DR
This paper introduces a Weight Concentration Regularizer (WCR) that enhances pruning robustness by concentrating weight energy on a few parameters, improving model performance under high sparsity.
Contribution
The paper proposes WCR, a novel regularizer that amplifies key weights during training, leading to better pruning robustness and compatibility with existing optimizers.
Findings
WCR improves pruning robustness across tasks and architectures.
WCR is compatible with existing pruning-robust optimizers.
WCR demonstrates consistent performance gains in various experiments.
Abstract
Deep neural networks achieve outstanding performance across vision and language tasks, yet their large parameter counts limit deployment in resource-constrained settings. One-shot pruning reduces model size without retraining, but models trained with standard objectives often suffer substantial accuracy drops under aggressive sparsity. Prior work mitigates this drop along two directions: regularizers such as and DeepHoyer that shape the weight distribution during training, and pruning-robust optimizers such as SAM, CrAM, and SSAM that flatten the loss landscape. However, existing regularizers either shrink all weights uniformly () or induce scale-invariant sparsity (DeepHoyer), without concentrating weight energy onto a small set of informative parameters. We propose a Weight Concentration Regularizer (WCR), a training-time regularizer that amplifies the magnitude…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition
