Robust and Scalable Model Editing for Large Language Models
Yingfa Chen, Zhengyan Zhang, Xu Han, Chaojun Xiao, Zhiyuan Liu, Chen, Chen, Kuai Li, Tao Yang, Maosong Sun

TL;DR
This paper introduces EREN, a method for large language model editing that enhances robustness and scalability by leveraging instruction-finetuned models' ability to prioritize contextual knowledge, supported by a new challenging dataset.
Contribution
The paper presents EREN, a novel approach for scalable, robust model editing that outperforms existing methods and can handle multiple edits and irrelevant contexts effectively.
Findings
EREN outperforms state-of-the-art methods significantly.
It can integrate multiple knowledge edits.
It responds correctly to irrelevant or similar-sounding inputs.
Abstract
Large language models (LLMs) can make predictions using parametric knowledge--knowledge encoded in the model weights--or contextual knowledge--knowledge presented in the context. In many scenarios, a desirable behavior is that LLMs give precedence to contextual knowledge when it conflicts with the parametric knowledge, and fall back to using their parametric knowledge when the context is irrelevant. This enables updating and correcting the model's knowledge by in-context editing instead of retraining. Previous works have shown that LLMs are inclined to ignore contextual knowledge and fail to reliably fall back to parametric knowledge when presented with irrelevant context. In this work, we discover that, with proper prompting methods, instruction-finetuned LLMs can be highly controllable by contextual knowledge and robust to irrelevant context. Utilizing this feature, we propose EREN…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Model-Driven Software Engineering Techniques
