Unified Framework for Pre-trained Neural Network Compression via Decomposition and Optimized Rank Selection

Ali Aghababaei-Harandi; Massih-Reza Amini

arXiv:2409.03555·cs.LG·September 23, 2025

Unified Framework for Pre-trained Neural Network Compression via Decomposition and Optimized Rank Selection

Ali Aghababaei-Harandi, Massih-Reza Amini

PDF

Open Access

TL;DR

This paper introduces a unified framework for neural network compression that combines tensor decomposition with automatic rank selection, enabling efficient model size reduction while preserving accuracy.

Contribution

It proposes an integrated method for decomposition and rank optimization with automatic search, reducing computational costs and eliminating the need for extra training data.

Findings

01

Effective model compression with maintained accuracy

02

Automatic rank selection improves efficiency

03

Validated on multiple benchmarks

Abstract

Despite their high accuracy, complex neural networks demand significant computational resources, posing challenges for deployment on resource constrained devices such as mobile phones and embedded systems. Compression algorithms have been developed to address these challenges by reducing model size and computational demands while maintaining accuracy. Among these approaches, factorization methods based on tensor decomposition are theoretically sound and effective. However, they face difficulties in selecting the appropriate rank for decomposition. This paper tackles this issue by presenting a unified framework that simultaneously applies decomposition and rank selection, employing a composite compression loss within defined rank constraints. Our method includes an automatic rank search in a continuous space, efficiently identifying optimal rank configurations for the pre-trained model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Image and Signal Denoising Methods · Blind Source Separation Techniques