Attn-GS: Attention-Guided Context Compression for Efficient Personalized LLMs
Shenglai Zeng, Tianqi Zheng, Chuan Tian, Dante Everaert, Yau-Shian Wang, Yupin Huang, Michael J. Morais, Rohit Patki, Jinjin Tian, Xinnan Dai, Kai Guo, Monica Xiao Cheng, Hui Liu

TL;DR
This paper introduces Attn-GS, a novel attention-guided framework that leverages LLMs' attention patterns to effectively compress user profiles, enabling efficient personalization with minimal token usage and high performance.
Contribution
It proposes a new method utilizing attention feedback for context compression, improving personalization efficiency without sacrificing accuracy.
Findings
Attn-GS reduces token usage by 50 times.
It outperforms baseline methods across multiple tasks.
Performance approaches that of using full context.
Abstract
Personalizing large language models (LLMs) to individual users requires incorporating extensive interaction histories and profiles, but input token constraints make this impractical due to high inference latency and API costs. Existing approaches rely on heuristic methods such as selecting recent interactions or prompting summarization models to compress user profiles. However, these methods treat context as a monolithic whole and fail to consider how LLMs internally process and prioritize different profile components. We investigate whether LLMs' attention patterns can effectively identify important personalization signals for intelligent context compression. Through preliminary studies on representative personalization tasks, we discover that (a) LLMs' attention patterns naturally reveal important signals, and (b) fine-tuning enhances LLMs' ability to distinguish between relevant and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Recommender Systems and Techniques · Artificial Intelligence in Healthcare and Education
