Representation Interventions Enable Lifelong Knowledge Memory Control in LLMs
Xuyuan Liu, Shengyu Chen, Xinshuai Dong, Yanchi Liu, Xujiang Zhao, Haoyu Wang, Yujun Yan, Haifeng Chen, Zhengzhang Chen

TL;DR
RILKE is a scalable method that enables precise, lifelong knowledge updates in large language models by intervening in their representation space, minimizing interference and maintaining utility.
Contribution
The paper introduces RILKE, a novel representation-based intervention technique for lifelong knowledge control in LLMs, with modules learned for localized, interference-free updates.
Findings
RILKE achieves high success in knowledge editing across large models.
It maintains model utility while enabling efficient, localized updates.
RILKE scales effectively to large benchmarks with modest memory overhead.
Abstract
Large language models (LLMs) often produce incorrect or outdated content after being employed. Efficient and accurate knowledge updates without costly retraining are a major challenge. This problem is particularly challenging in lifelong settings, where complex, unstructured knowledge must coexist without interference. We introduce RILKE (Representation Intervention for Lifelong KnowledgE Control), a robust and scalable method that treats knowledge control as interventions within the model's representation space. Leveraging representation-space expressiveness, we identify two key properties enabling RILKE to achieve fine-grained control over complex, unstructured knowledge while maintaining general utility with frozen base weights. During training, RILKE learns paraphrase-robust and edit-localized modules that limit each update to a low-dimensional subspace to minimize cross-edit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
