Unified Framework for Pre-trained Neural Network Compression via Decomposition and Optimized Rank Selection
Ali Aghababaei-Harandi, Massih-Reza Amini

TL;DR
This paper introduces a unified framework for neural network compression that combines tensor decomposition with automatic rank selection, enabling efficient model size reduction while preserving accuracy.
Contribution
It proposes an integrated method for decomposition and rank optimization with automatic search, reducing computational costs and eliminating the need for extra training data.
Findings
Effective model compression with maintained accuracy
Automatic rank selection improves efficiency
Validated on multiple benchmarks
Abstract
Despite their high accuracy, complex neural networks demand significant computational resources, posing challenges for deployment on resource constrained devices such as mobile phones and embedded systems. Compression algorithms have been developed to address these challenges by reducing model size and computational demands while maintaining accuracy. Among these approaches, factorization methods based on tensor decomposition are theoretically sound and effective. However, they face difficulties in selecting the appropriate rank for decomposition. This paper tackles this issue by presenting a unified framework that simultaneously applies decomposition and rank selection, employing a composite compression loss within defined rank constraints. Our method includes an automatic rank search in a continuous space, efficiently identifying optimal rank configurations for the pre-trained model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Image and Signal Denoising Methods · Blind Source Separation Techniques
