Unlocking the Address Book: Dissecting the Sparse Semantic Structure of LLM Key-Value Caches via Sparse Autoencoders

Qingsen Ma; Dianyun Wang; Jiaming Lyu; Yaoye Wang; Lechen Ning; Sujie Zhu; Zhenbo Xu; Liuyu Xiang; Huining Li; Huijia Wu; and Zhaofeng He

arXiv:2512.10547·cs.LG·December 12, 2025

Unlocking the Address Book: Dissecting the Sparse Semantic Structure of LLM Key-Value Caches via Sparse Autoencoders

Qingsen Ma, Dianyun Wang, Jiaming Lyu, Yaoye Wang, Lechen Ning, Sujie Zhu, Zhenbo Xu, Liuyu Xiang, Huining Li, Huijia Wu, and Zhaofeng He

PDF

Open Access

TL;DR

This paper introduces STA-Attention, a framework using Top-K Sparse Autoencoders to interpret and optimize the Key-Value caches in large language models, revealing semantic structures and improving efficiency.

Contribution

It proposes a novel Top-K SAE method for decomposing KV caches into interpretable semantic components, uncovering key-value asymmetry and enabling selective preservation of important information.

Findings

01

Semantic reconstructions preserve model perplexity and performance.

02

Key vectors are sparse routers dominated by a Semantic Elbow.

03

Value vectors carry dense content requiring larger budgets.

Abstract

The Key-Value (KV) cache is the primary memory bottleneck in long-context Large Language Models, yet it is typically treated as an opaque numerical tensor. In this work, we propose \textbf{STA-Attention}, a framework that utilizes Top-K Sparse Autoencoders (SAEs) to decompose the KV cache into interpretable ``semantic atoms.'' Unlike standard $L_{1}$ -regularized SAEs, our Top-K approach eliminates shrinkage bias, preserving the precise dot-product geometry required for attention. Our analysis uncovers a fundamental \textbf{Key-Value Asymmetry}: while Key vectors serve as highly sparse routers dominated by a ``Semantic Elbow,'' deep Value vectors carry dense content payloads requiring a larger budget. Based on this structure, we introduce a Dual-Budget Strategy that selectively preserves the most informative semantic components while filtering representational noise. Experiments on Yi-6B,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Big Data and Digital Economy · Generative Adversarial Networks and Image Synthesis