Data-Free Dynamic Compression of CNNs for Tractable Efficiency
Lukas Meiner, Jens Mehnert, Alexandru Paul Condurache

TL;DR
HASTE is a data-free, plug-and-play method that uses locality-sensitive hashing to compress CNN channels, significantly reducing FLOPs with minimal accuracy loss on benchmarks like CIFAR-10 and ImageNet.
Contribution
The paper introduces HASTE, a novel data-free convolution module that instantly reduces inference costs by detecting and compressing redundant channels using hashing, without training or fine-tuning.
Findings
Achieves 46.72% FLOPs reduction on ResNet34 with 1.25% accuracy loss.
Operates without training data or fine-tuning, enabling quick deployment.
Effective on CIFAR-10 and ImageNet benchmarks.
Abstract
To reduce the computational cost of convolutional neural networks (CNNs) on resource-constrained devices, structured pruning approaches have shown promise in lowering floating-point operations (FLOPs) without substantial drops in accuracy. However, most methods require fine-tuning or specific training procedures to achieve a reasonable trade-off between retained accuracy and reduction in FLOPs, adding computational overhead and requiring training data to be available. To this end, we propose HASTE (Hashing for Tractable Efficiency), a data-free, plug-and-play convolution module that instantly reduces a network's test-time inference cost without training or fine-tuning. Our approach utilizes locality-sensitive hashing (LSH) to detect redundancies in the channel dimension of latent feature maps, compressing similar channels to reduce input and filter depth simultaneously, resulting in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Image Enhancement Techniques
MethodsConvolution · Pruning
