Winning the Lottery with Continuous Sparsification
Pedro Savarese, Hugo Silva, Michael Maire

TL;DR
This paper introduces Continuous Sparsification, a novel method for pruning neural networks and finding lottery tickets, outperforming existing techniques in efficiency and accuracy across multiple models and datasets.
Contribution
It develops a new approximation of $ ext{l}_0$ regularization for pruning and lottery ticket search, surpassing state-of-the-art methods in performance and speed.
Findings
Outperforms existing pruning and ticket search methods
Achieves higher accuracy on CIFAR-10 and ImageNet
Enables fast parallel lottery ticket discovery
Abstract
The search for efficient, sparse deep neural network models is most prominently performed by pruning: training a dense, overparameterized network and removing parameters, usually via following a manually-crafted heuristic. Additionally, the recent Lottery Ticket Hypothesis conjectures that, for a typically-sized neural network, it is possible to find small sub-networks which, when trained from scratch on a comparable budget, match the performance of the original dense counterpart. We revisit fundamental aspects of pruning algorithms, pointing out missing ingredients in previous approaches, and develop a method, Continuous Sparsification, which searches for sparse networks based on a novel approximation of an intractable regularization. We compare against dominant heuristic-based methods on pruning as well as ticket search -- finding sparse subnetworks that can be successfully…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
MethodsPruning
