Predicting LLM Compression Degradation from Spectral Statistics
Mingxue Xu

TL;DR
This paper introduces a predictive metric based on spectral statistics that accurately estimates the performance degradation of large language models after low-rank compression, enabling efficient compression decisions.
Contribution
It identifies stable rank and information density as key predictors of compression-induced degradation and validates a new predictive metric with high correlation to actual performance loss.
Findings
The interaction term γ · ρ̄_s predicts accuracy degradation with high correlation.
Stable rank and information density dominate performance degradation.
Theoretical links connect spectral predictors to SVD truncation bounds.
Abstract
Matrix-level low-rank compression is a promising way to reduce the cost of large language models, but running compression and evaluating the resulting models on language tasks can be prohibitively expensive. Can compression-induced degradation be predicted before committing to this compute? We systematically analyze the Qwen3 and Gemma3 model families across four representative low-rank compression methods: vanilla SVD, two ASVD variants, and SVD-LLM. We find that stable rank and information density, measured in bits per parameter, dominate performance degradation. The interaction term , defined as compression ratio times stable rank, is a robust predictor of accuracy degradation, achieving leave-one-out cross-validation Pearson correlations of for attention layers and for MLP layers. We provide theoretical intuition for why this predictor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
