Locate-then-edit for Multi-hop Factual Recall under Knowledge Editing
Zhuoran Zhang, Yongxiang Li, Zijian Kan, Keyuan Cheng, Lijie Hu, and, Di Wang

TL;DR
This paper introduces IFMET, a new locate-then-edit method that effectively edits both shallow and deep layers of LLMs to improve multi-hop factual recall, addressing limitations of previous approaches.
Contribution
The paper presents IFMET, a novel locate-then-edit knowledge editing approach that targets both shallow and deep layers for better multi-hop fact recall in LLMs.
Findings
IFMET significantly improves multi-hop factual recall performance.
Deep layer knowledge retrieval is crucial for multi-hop tasks.
Current methods mainly focus on shallow layers, limiting multi-hop editing.
Abstract
The locate-then-edit paradigm has shown significant promise for knowledge editing (KE) in Large Language Models (LLMs). While previous methods perform well on single-hop fact recall tasks, they consistently struggle with multi-hop factual recall tasks involving newly edited knowledge. In this paper, leveraging tools in mechanistic interpretability, we first identify that in multi-hop tasks, LLMs tend to retrieve knowledge with implicit subject information from deeper MLP layers, unlike single-hop tasks, which rely on shallow layers. This distinction explains the poor performance of current methods in multi-hop queries, as they primarily focus on editing shallow layers with single-hop edit prompts, leaving deeper layers unchanged. To address this, we propose IFMET, a novel locate-then-edit KE approach designed to edit both shallow and deep MLP layers. Beyond single-hop editing prompts,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital and Cyber Forensics · Privacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning
MethodsFocus
