CAP: Correlation-Aware Pruning for Highly-Accurate Sparse Vision Models
Denis Kuznedelev, Eldar Kurtic, Elias Frantar, Dan Alistarh

TL;DR
This paper introduces CAP, a correlation-aware unstructured pruning method that significantly improves the compressibility of state-of-the-art vision models, enabling high sparsity levels with minimal accuracy loss.
Contribution
The paper presents a theoretically-justified pruning framework that accurately handles complex weight correlations and includes an efficient finetuning process, advancing model compression techniques.
Findings
Pruned vision models to over 75% sparsity with ≤1% accuracy drop.
Achieved practical speedups of 1.5 to 2.4x without accuracy loss.
Extended pruning to large self-supervised vision models with negligible accuracy impact.
Abstract
Driven by significant improvements in architectural design and training pipelines, computer vision has recently experienced dramatic progress in terms of accuracy on classic benchmarks such as ImageNet. These highly-accurate models are challenging to deploy, as they appear harder to compress using standard techniques such as pruning. We address this issue by introducing the Correlation Aware Pruner (CAP), a new unstructured pruning framework which significantly pushes the compressibility limits for state-of-the-art architectures. Our method is based on two technical advancements: a new theoretically-justified pruner, which can handle complex weight correlations accurately and efficiently during the pruning process itself, and an efficient finetuning procedure for post-compression recovery. We validate our approach via extensive experiments on several modern vision models such as Vision…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Advanced Image and Video Retrieval Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Pruning · Attentive Walk-Aggregating Graph Neural Network · Batch Normalization · Depthwise Convolution · Dense Connections · Local Patch Interaction · Linear Layer · Multi-Head Attention
