Decoding by Contrasting Knowledge: Enhancing LLMs' Confidence on Edited Facts
Baolong Bi, Shenghua Liu, Lingrui Mei, Yiwei Wang, Pengliang Ji, Xueqi, Cheng

TL;DR
This paper introduces DeCK, a novel method that improves knowledge editing in large language models by contrasting logits to enhance confidence in edited facts, especially addressing stubborn knowledge that resists change.
Contribution
DeCK is a new approach that contrasts logits from edited and unedited knowledge to better update LLMs, significantly improving confidence in edited facts.
Findings
DeCK improves LLaMA3-8B-instruct performance on MQuAKE by up to 219%.
Contrastive decoding enhances confidence in edited knowledge.
Addresses stubborn knowledge that resists editing.
Abstract
The knowledge within large language models (LLMs) may become outdated quickly. While in-context editing (ICE) is currently the most effective method for knowledge editing (KE), it is constrained by the black-box modeling of LLMs and thus lacks interpretability. Our work aims to elucidate the superior performance of ICE on the KE by analyzing the impacts of in-context new knowledge on token-wise distributions. We observe that despite a significant boost in logits of the new knowledge, the performance of is still hindered by stubborn knowledge. Stubborn knowledge refers to as facts that have gained excessive confidence during pretraining, making it hard to edit effectively. To address this issue and further enhance the performance of ICE, we propose a novel approach termed coding by ontrasting nowledge (DeCK). DeCK derives the distribution of the next…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLaw, AI, and Intellectual Property · Library Science and Information Systems · Digital Rights Management and Security
