Weight Concentration Regularization for Improving Pruning Robustness Under High Sparsity

Vincent-Daniel Yun; Junhyuk Jo; Sunwoo Lee

arXiv:2511.14282·cs.LG·May 18, 2026

Weight Concentration Regularization for Improving Pruning Robustness Under High Sparsity

Vincent-Daniel Yun, Junhyuk Jo, Sunwoo Lee

PDF

TL;DR

This paper introduces a Weight Concentration Regularizer (WCR) that enhances pruning robustness by concentrating weight energy on a few parameters, improving model performance under high sparsity.

Contribution

The paper proposes WCR, a novel regularizer that amplifies key weights during training, leading to better pruning robustness and compatibility with existing optimizers.

Findings

01

WCR improves pruning robustness across tasks and architectures.

02

WCR is compatible with existing pruning-robust optimizers.

03

WCR demonstrates consistent performance gains in various experiments.

Abstract

Deep neural networks achieve outstanding performance across vision and language tasks, yet their large parameter counts limit deployment in resource-constrained settings. One-shot pruning reduces model size without retraining, but models trained with standard objectives often suffer substantial accuracy drops under aggressive sparsity. Prior work mitigates this drop along two directions: regularizers such as $ℓ_{1}$ and DeepHoyer that shape the weight distribution during training, and pruning-robust optimizers such as SAM, CrAM, and S $^{2}$ SAM that flatten the loss landscape. However, existing regularizers either shrink all weights uniformly ( $ℓ_{1}$ ) or induce scale-invariant sparsity (DeepHoyer), without concentrating weight energy onto a small set of informative parameters. We propose a Weight Concentration Regularizer (WCR), a training-time regularizer that amplifies the magnitude…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition