Beyond Variance: Knowledge-Aware LLM Compression via Fisher-Aligned Subspace Diagnostics

Ibne Farabi Shihab; Sanjeda Akter; Anuj Sharma

arXiv:2601.07197·cs.LG·January 13, 2026

Beyond Variance: Knowledge-Aware LLM Compression via Fisher-Aligned Subspace Diagnostics

Ibne Farabi Shihab, Sanjeda Akter, Anuj Sharma

PDF

Open Access

TL;DR

This paper introduces Fisher-Aligned Subspace Compression (FASC), a knowledge-aware method for LLM activation compression that preserves factual knowledge better than traditional variance-based methods, enabling significant model size reduction without losing accuracy.

Contribution

FASC is a novel compression framework that uses Fisher Information to identify knowledge-critical dimensions, improving factual knowledge retention during model compression.

Findings

01

FASC preserves 6-8% more accuracy on knowledge benchmarks at 50% compression.

02

FASC enables a 7B model to match the factual recall of a 13B uncompressed model.

03

The Dependence Violation Score ( {ho}) effectively indicates where factual knowledge is stored in models.

Abstract

Post-training activation compression is essential for deploying Large Language Models (LLMs) on resource-constrained hardware. However, standard methods like Singular Value Decomposition (SVD) are gradient-blind: they preserve high-variance dimensions regardless of their impact on factual knowledge preservation. We introduce Fisher-Aligned Subspace Compression (FASC), a knowledge-aware compression framework that selects subspaces by directly modeling activation-gradient coupling, minimizing a second-order surrogate of the loss function. FASC leverages the Fisher Information Matrix to identify dimensions critical for factual knowledge, which often reside in low-variance but high-gradient-sensitivity subspaces. We propose the Dependence Violation Score (\r{ho}) as a general-purpose diagnostic metric that quantifies activation-gradient coupling, revealing where factual knowledge is stored…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications