Balancing Knowledge Updates: Toward Unified Modular Editing in LLMs
Jiahao Liu, Zijian Wang, Kuo Zhao, Dong Hu

TL;DR
This paper introduces IntAttn-Edit, a novel method for knowledge editing in LLMs that jointly updates MLP and attention modules based on their contribution, leading to improved effectiveness and preservation of factual knowledge.
Contribution
It reveals the significant role of attention modules in knowledge storage and proposes a balanced updating strategy for both modules, enhancing editing performance.
Findings
IntAttn-Edit outperforms prior methods on standard benchmarks.
Balancing updates improves knowledge preservation and generalization.
Attention modules are crucial for factual knowledge in LLMs.
Abstract
Knowledge editing has emerged as an efficient approach for updating factual knowledge in large language models (LLMs). It typically locates knowledge storage modules and then modifies their parameters. However, most existing methods focus on the weights of multilayer perceptron (MLP) modules, which are often identified as the main repositories of factual information. Other components, such as attention (Attn) modules, are often ignored during editing. This imbalance can leave residual outdated knowledge and limit editing effectiveness. We perform comprehensive knowledge localization experiments on advanced LLMs and find that Attn modules play a substantial role in factual knowledge storage and retrieval, especially in earlier layers. Based on these insights, we propose IntAttn-Edit, a method that extends the associative memory paradigm to jointly update both MLP and Attn modules. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Natural Language Processing Techniques
