Locate-then-edit for Multi-hop Factual Recall under Knowledge Editing

Zhuoran Zhang; Yongxiang Li; Zijian Kan; Keyuan Cheng; Lijie Hu; and; Di Wang

arXiv:2410.06331·cs.CL·February 4, 2025

Locate-then-edit for Multi-hop Factual Recall under Knowledge Editing

Zhuoran Zhang, Yongxiang Li, Zijian Kan, Keyuan Cheng, Lijie Hu, and, Di Wang

PDF

Open Access

TL;DR

This paper introduces IFMET, a new locate-then-edit method that effectively edits both shallow and deep layers of LLMs to improve multi-hop factual recall, addressing limitations of previous approaches.

Contribution

The paper presents IFMET, a novel locate-then-edit knowledge editing approach that targets both shallow and deep layers for better multi-hop fact recall in LLMs.

Findings

01

IFMET significantly improves multi-hop factual recall performance.

02

Deep layer knowledge retrieval is crucial for multi-hop tasks.

03

Current methods mainly focus on shallow layers, limiting multi-hop editing.

Abstract

The locate-then-edit paradigm has shown significant promise for knowledge editing (KE) in Large Language Models (LLMs). While previous methods perform well on single-hop fact recall tasks, they consistently struggle with multi-hop factual recall tasks involving newly edited knowledge. In this paper, leveraging tools in mechanistic interpretability, we first identify that in multi-hop tasks, LLMs tend to retrieve knowledge with implicit subject information from deeper MLP layers, unlike single-hop tasks, which rely on shallow layers. This distinction explains the poor performance of current methods in multi-hop queries, as they primarily focus on editing shallow layers with single-hop edit prompts, leaving deeper layers unchanged. To address this, we propose IFMET, a novel locate-then-edit KE approach designed to edit both shallow and deep MLP layers. Beyond single-hop editing prompts,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital and Cyber Forensics · Privacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning

MethodsFocus