DAFNet: Dynamic Auxiliary Fusion for Sequential Model Editing in Large Language Models
Taolin Zhang, Qizhou Chen, Dongyang Li, Chengyu Wang, Xiaofeng He,, Longtao Huang, Hui Xue, Jun Huang

TL;DR
This paper introduces DAFNet, a novel model for sequentially editing large language models to correct factual errors continuously, using dynamic fusion techniques to prevent forgetting and improve editing accuracy.
Contribution
The paper proposes DAFNet with a new semantic fusion mechanism and introduces DAFSet, a dataset designed for robust sequential model editing in large language models.
Findings
DAFNet outperforms existing baselines in single-turn and sequential editing tasks.
The use of DAFSet enhances the performance of auxiliary network-based editing methods.
DAFNet effectively prevents catastrophic forgetting during multiple knowledge updates.
Abstract
Recently, while large language models (LLMs) have demonstrated impressive results, they still suffer from hallucination, i.e., the generation of false information. Model editing is the task of fixing factual mistakes in LLMs; yet, most previous works treat it as a one-time task, paying little attention to ever-emerging mistakes generated by LLMs. We address the task of sequential model editing (SME) that aims to rectify mistakes continuously. A Dynamic Auxiliary Fusion Network (DAFNet) is designed to enhance the semantic interaction among the factual knowledge within the entire sequence, preventing catastrophic forgetting during the editing process of multiple knowledge triples. Specifically, (1) for semantic fusion within a relation triple, we aggregate the intra-editing attention flow into auto-regressive self-attention with token-level granularity in LLMs. We further leverage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies
