Interspace Pruning: Using Adaptive Filter Representations to Improve Training of Sparse CNNs
Paul Wimmer, Jens Mehnert, Alexandru Paul Condurache

TL;DR
This paper introduces interspace pruning (IP), a novel method that enhances sparse CNN training by dynamically representing filters with adaptive bases, outperforming traditional unstructured pruning especially at high sparsity levels.
Contribution
The paper proposes interspace pruning, a new approach that uses adaptive filter bases to improve the effectiveness of unstructured pruning in CNNs.
Findings
IP outperforms standard unstructured pruning across all tested methods.
IP significantly improves performance on challenging datasets like ImageNet.
IP enhances trainability and generalization in sparse CNN training.
Abstract
Unstructured pruning is well suited to reduce the memory footprint of convolutional neural networks (CNNs), both at training and inference time. CNNs contain parameters arranged in filters. Standard unstructured pruning (SP) reduces the memory footprint of CNNs by setting filter elements to zero, thereby specifying a fixed subspace that constrains the filter. Especially if pruning is applied before or during training, this induces a strong bias. To overcome this, we introduce interspace pruning (IP), a general tool to improve existing pruning methods. It uses filters represented in a dynamic interspace by linear combinations of an underlying adaptive filter basis (FB). For IP, FB coefficients are set to zero while un-pruned coefficients and FBs are trained jointly. In this work, we provide mathematical evidence for IP's superior performance and demonstrate that IP…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Human Pose and Action Recognition · Domain Adaptation and Few-Shot Learning
MethodsPruning
