Automatic Sparse Connectivity Learning for Neural Networks
Zhimin Tang, Linkai Luo, Bike Xie, Yiyu Zhu, Rujie Zhao, Lvqing Bi,, Chao Lu

TL;DR
This paper introduces Sparse Connectivity Learning (SCL), an automatic pruning method for neural networks that learns sparse connectivity by optimizing binary masks, leading to improved efficiency and performance.
Contribution
The paper proposes a novel automatic pruning approach using a trainable binary mask with STE, eliminating the need for manual pruning criteria and hyper-parameter tuning.
Findings
SCL outperforms state-of-the-art pruning methods in accuracy and sparsity.
SCL reduces FLOPs significantly while maintaining or improving performance.
The method is effective across various neural network architectures.
Abstract
Since sparse neural networks usually contain many zero weights, these unnecessary network connections can potentially be eliminated without degrading network performance. Therefore, well-designed sparse neural networks have the potential to significantly reduce FLOPs and computational resources. In this work, we propose a new automatic pruning method - Sparse Connectivity Learning (SCL). Specifically, a weight is re-parameterized as an element-wise multiplication of a trainable weight variable and a binary mask. Thus, network connectivity is fully described by the binary mask, which is modulated by a unit step function. We theoretically prove the fundamental principle of using a straight-through estimator (STE) for network pruning. This principle is that the proxy gradients of STE should be positive, ensuring that mask variables converge at their minima. After finding Leaky ReLU,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and ELM · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
MethodsPruning · HuMan(Expedia)||How do I get a human at Expedia?
