Capability-Guided Compression: Toward Interpretability-Aware Budget Allocation for Large Language Models
Rishaank Gupta

TL;DR
This paper introduces Capability-Guided Compression (CGC), a novel framework that allocates compression budgets based on model component capabilities, improving interpretability and addressing phase transition issues in large language model compression.
Contribution
It proposes a capability density measure derived from autoencoder features, providing a new pre-compression predictor for component phase transitions and enabling interpretability-aware compression.
Findings
Capability density is independent of importance scores.
Components with higher capability density reach phase transitions at lower compression ratios.
Theoretical proof links capability density to structural redundancy and phase transition points.
Abstract
Large language model compression has made substantial progress through pruning, quantization, and low-rank decomposition, yet a fundamental limitation persists across all existing methods: compression budgets are allocated without any representation of what individual model components functionally encode. We term this the capability-blind compression problem and argue it is a root cause of two well-documented failures -- the insensitivity of perplexity-based evaluation to reasoning capability loss, and the abrupt phase transitions in model performance recently characterized by Ma et al. (2026). We propose Capability-Guided Compression (CGC), a framework that addresses this by using Sparse Autoencoder (SAE)-derived capability density maps to allocate differential compression budgets across transformer components. Capability density is a formally defined scalar measure combining the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Generative Adversarial Networks and Image Synthesis
