CompressNAS : A Fast and Efficient Technique for Model Compression using Decomposition
Sudhakar Sah, Nikhil Chabbra, Matthieu Durnerin

TL;DR
CompressNAS introduces a global search framework for optimal tensor rank selection in CNN compression, significantly reducing model size with minimal accuracy loss, suitable for deployment on resource-constrained devices.
Contribution
The paper presents CompressNAS, a novel global search approach for tensor rank selection in model compression, improving efficiency and accuracy trade-offs over existing local methods.
Findings
Compresses ResNet-18 by 8x with less than 4% accuracy drop on ImageNet.
Achieves 2x compression of YOLOv5s without accuracy loss on COCO.
Develops STResNet, a new family of efficient compressed models.
Abstract
Deep Convolutional Neural Networks (CNNs) are increasingly difficult to deploy on microcontrollers (MCUs) and lightweight NPUs (Neural Processing Units) due to their growing size and compute demands. Low-rank tensor decomposition, such as Tucker factorization, is a promising way to reduce parameters and operations with reasonable accuracy loss. However, existing approaches select ranks locally and often ignore global trade-offs between compression and accuracy. We introduce CompressNAS, a MicroNAS-inspired framework that treats rank selection as a global search problem. CompressNAS employs a fast accuracy estimator to evaluate candidate decompositions, enabling efficient yet exhaustive rank exploration under memory and accuracy constraints. In ImageNet, CompressNAS compresses ResNet-18 by 8x with less than 4% accuracy drop; on COCO, we achieve 2x compression of YOLOv5s without any…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Sparse and Compressive Sensing Techniques · Advanced Neural Network Applications
