No Free Prune: Information-Theoretic Barriers to Pruning at Initialization
Tanishq Kumar, Kevin Luo, Mark Sellke

TL;DR
This paper provides a theoretical explanation for the difficulty of pruning neural networks at initialization, highlighting the role of effective parameter count and mutual information, and shows that training influences the capacity of sparse models.
Contribution
It introduces the concept of effective parameter count $p_{eff}$ and explains the limitations of pruning at initialization through information-theoretic barriers.
Findings
Pruning at initialization is limited by information-theoretic barriers.
Training increases mutual information, affecting model capacity.
Experiments confirm information gained during training impacts sparsity and capacity.
Abstract
The existence of "lottery tickets" arXiv:1803.03635 at or near initialization raises the tantalizing question of whether large models are necessary in deep learning, or whether sparse networks can be quickly identified and trained without ever training the dense models that contain them. However, efforts to find these sparse subnetworks without training the dense model ("pruning at initialization") have been broadly unsuccessful arXiv:2009.08576. We put forward a theoretical explanation for this, based on the model's effective parameter count, , given by the sum of the number of non-zero weights in the final network and the mutual information between the sparsity mask and the data. We show the Law of Robustness of arXiv:2105.12806 extends to sparse networks with the usual parameter count replaced by , meaning a sparse neural network which robustly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Explainable Artificial Intelligence (XAI) · Machine Learning in Materials Science
MethodsPruning
