Are We Evaluating the Edit Locality of LLM Model Editing Properly?
Wei Liu, Haomei Xu, Hongkai Liu, Zhiying Deng, Ruixuan Li, Heng Huang, Yee Whye Teh, Wee Sun Lee

TL;DR
This paper critically examines current evaluation methods for assessing the specificity of model edits in large language models, identifies their shortcomings, and proposes a new, more sensitive evaluation protocol that better correlates with regularizer strength.
Contribution
It introduces a novel evaluation protocol for model editing specificity that addresses conceptual and empirical limitations of existing metrics.
Findings
Existing metrics are weakly correlated with regularizer strength.
Current metrics lack sensitivity to distinguish different methods.
The proposed protocol improves sensitivity and correlation with regularizer strength.
Abstract
Model editing has recently emerged as a popular paradigm for efficiently updating knowledge in LLMs. A central desideratum of updating knowledge is to balance editing efficacy, i.e., the successful injection of target knowledge, and specificity (also known as edit locality), i.e., the preservation of existing non-target knowledge. However, we find that existing specificity evaluation protocols are inadequate for this purpose. We systematically elaborated on the three fundamental issues it faces. Beyond the conceptual issues, we further empirically demonstrate that existing specificity metrics are weakly correlated with the strength of specificity regularizers. We also find that current metrics lack sufficient sensitivity, rendering them ineffective at distinguishing the specificity performance of different methods. Finally, we propose a constructive evaluation protocol. Under this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Model-Driven Software Engineering Techniques · Topic Modeling
