Capability-Guided Compression: Toward Interpretability-Aware Budget Allocation for Large Language Models

Rishaank Gupta

arXiv:2603.16440·cs.LG·March 18, 2026

Capability-Guided Compression: Toward Interpretability-Aware Budget Allocation for Large Language Models

Rishaank Gupta

PDF

Open Access

TL;DR

This paper introduces Capability-Guided Compression (CGC), a novel framework that allocates compression budgets based on model component capabilities, improving interpretability and addressing phase transition issues in large language model compression.

Contribution

It proposes a capability density measure derived from autoencoder features, providing a new pre-compression predictor for component phase transitions and enabling interpretability-aware compression.

Findings

01

Capability density is independent of importance scores.

02

Components with higher capability density reach phase transitions at lower compression ratios.

03

Theoretical proof links capability density to structural redundancy and phase transition points.

Abstract

Large language model compression has made substantial progress through pruning, quantization, and low-rank decomposition, yet a fundamental limitation persists across all existing methods: compression budgets are allocated without any representation of what individual model components functionally encode. We term this the capability-blind compression problem and argue it is a root cause of two well-documented failures -- the insensitivity of perplexity-based evaluation to reasoning capability loss, and the abrupt phase transitions in model performance recently characterized by Ma et al. (2026). We propose Capability-Guided Compression (CGC), a framework that addresses this by using Sparse Autoencoder (SAE)-derived capability density maps to allocate differential compression budgets across transformer components. Capability density is a formally defined scalar measure combining the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Generative Adversarial Networks and Image Synthesis