Beyond Variance: Knowledge-Aware LLM Compression via Fisher-Aligned Subspace Diagnostics
Ibne Farabi Shihab, Sanjeda Akter, Anuj Sharma

TL;DR
This paper introduces Fisher-Aligned Subspace Compression (FASC), a knowledge-aware method for LLM activation compression that preserves factual knowledge better than traditional variance-based methods, enabling significant model size reduction without losing accuracy.
Contribution
FASC is a novel compression framework that uses Fisher Information to identify knowledge-critical dimensions, improving factual knowledge retention during model compression.
Findings
FASC preserves 6-8% more accuracy on knowledge benchmarks at 50% compression.
FASC enables a 7B model to match the factual recall of a 13B uncompressed model.
The Dependence Violation Score ( {ho}) effectively indicates where factual knowledge is stored in models.
Abstract
Post-training activation compression is essential for deploying Large Language Models (LLMs) on resource-constrained hardware. However, standard methods like Singular Value Decomposition (SVD) are gradient-blind: they preserve high-variance dimensions regardless of their impact on factual knowledge preservation. We introduce Fisher-Aligned Subspace Compression (FASC), a knowledge-aware compression framework that selects subspaces by directly modeling activation-gradient coupling, minimizing a second-order surrogate of the loss function. FASC leverages the Fisher Information Matrix to identify dimensions critical for factual knowledge, which often reside in low-variance but high-gradient-sensitivity subspaces. We propose the Dependence Violation Score (\r{ho}) as a general-purpose diagnostic metric that quantifies activation-gradient coupling, revealing where factual knowledge is stored…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications
