Context-Robust Knowledge Editing for Language Models
Haewon Park, Gyubin Choi, Minjun Kim, Yohan Jo

TL;DR
This paper introduces CHED, a benchmark for evaluating context robustness in knowledge editing of language models, and proposes CoRE, a method that enhances editing success amidst preceding contexts while maintaining model capabilities.
Contribution
The paper develops CHED for assessing context robustness in KE and proposes CoRE to improve editing success in context-rich scenarios, addressing a key limitation of existing methods.
Findings
KE methods often fail with preceding contexts
CoRE improves editing success in context scenarios
Analysis reveals how context influences editing effectiveness
Abstract
Knowledge editing (KE) methods offer an efficient way to modify knowledge in large language models. Current KE evaluations typically assess editing success by considering only the edited knowledge without any preceding contexts. In real-world applications, however, preceding contexts often trigger the retrieval of the original knowledge and undermine the intended edit. To address this issue, we develop CHED -- a benchmark designed to evaluate the context robustness of KE methods. Evaluations on CHED show that they often fail when preceding contexts are present. To mitigate this shortcoming, we introduce CoRE, a KE method designed to strengthen context robustness by minimizing context-sensitive variance in hidden states of the model for edited knowledge. This method not only improves the editing success rate in situations where a preceding context is present but also preserves the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Artificial Intelligence in Healthcare and Education
