Theoretical Guarantees for Low-Rank Compression of Deep Neural Networks

Shihao Zhang; Rayan Saab

arXiv:2502.02766 · cs.LG · February 18, 2026

Open Access

TL;DR

This paper develops an analytical framework for data-driven post-training low-rank compression of deep neural networks. Its recovery theorems take a step toward explaining why such methods outperform data-agnostic approaches and toward guarantees that accuracy is maintained after compression.

Contribution

It proves three recovery theorems under progressively weaker assumptions about the approximate low-rank structure of activations, offering the first theoretical guarantees for data-driven low-rank neural network compression.

Findings

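01. Proves three recovery theorems for low-rank approximation, each under weaker assumptions on the activations than the last
02. Takes a step toward explaining why data-driven methods outperform data-agnostic approaches
03. Provides theoretical guarantees for maintaining accuracy after compression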

Abstract

Deep neural networks have achieved state-of-the-art performance across numerous applications, but their high memory and computational demands present significant challenges, particularly in resource-constrained environments. Model compression techniques, such as low-rank approximation, offer a promising solution by reducing the size and complexity of these networks while only minimally sacrificing accuracy. In this paper, we develop an analytical framework for data-driven post-training low-rank compression. We prove three recovery theorems under progressively weaker assumptions about the approximate low-rank structure of activations, modeling deviations via noise. Our results represent a step toward explaining why data-driven low-rank compression methods outperform data-agnostic approaches and towards theoretically grounded compression algorithms that reduce inference costs while…
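The contrast between data-agnostic and data-driven low-rank compression described in the abstract can be illustrated with a small NumPy sketch. This is not the authors' algorithm. It is a generic illustration that assumes a single linear layer with weights `W`, synthetic activations `X` built as a rank-`k` signal plus small Gaussian noise, and a Frobenius-norm error on the layer output. All names and sizes (`compress_data_agnostic`, `compress_data_driven`, `n`, `d`, `m`, `k`, `r`) are hypothetical.

```python
import numpy as np


def truncated_svd(M, r):
    """Return the top-r singular triplets of M."""
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    return U[:, :r], s[:r], Vt[:r, :]


def compress_data_agnostic(W, r):
    """Best rank-r approximation of W itself; ignores the data."""
    U, s, Vt = truncated_svd(W, r)
    return (U * s) @ Vt


def compress_data_driven(X, W, r):
    """Rank-r W_hat minimizing ||X W - X W_hat||_F.

    Projecting W onto the top-r right singular vectors of X W makes
    X W_hat equal to the best rank-r approximation of X W.
    """
    _, _, Vt = truncated_svd(X @ W, r)
    return W @ Vt.T @ Vt


# Synthetic setup (an assumption for illustration only):
# activations are approximately low rank, i.e. rank-k signal plus noise.
rng = np.random.default_rng(0)
n, d, m, k, r = 200, 64, 48, 8, 8
X = rng.standard_normal((n, k)) @ rng.standard_normal((k, d))
X += 0.01 * rng.standard_normal((n, d))
W = rng.standard_normal((d, m))


def rel_err(W_hat):
    """Relative error of the compressed layer's output on X."""
    return np.linalg.norm(X @ W - X @ W_hat) / np.linalg.norm(X @ W)


W_agnostic = compress_data_agnostic(W, r)
W_driven = compress_data_driven(X, W, r)
err_agnostic = rel_err(W_agnostic)
err_driven = rel_err(W_driven)

print(f"data-agnostic relative output error: {err_agnostic:.4f}")
print(f"data-driven relative output error:   {err_driven:.4f}")
```

In this toy setting the data-driven error can never exceed the data-agnostic one, because the data-driven choice is optimal for the output error among all matrices of rank at most `r`. How large the gap is depends on how close the activations are to low rank, which is the kind of structure the paper's assumptions describe.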

Taxonomy

Topics: Neural Networks and Applications · Anomaly Detection Techniques and Applications