Decoding by Contrasting Knowledge: Enhancing LLMs' Confidence on Edited   Facts

Baolong Bi; Shenghua Liu; Lingrui Mei; Yiwei Wang; Pengliang Ji; Xueqi; Cheng

arXiv:2405.11613·cs.CL·May 22, 2024·1 cites

Decoding by Contrasting Knowledge: Enhancing LLMs' Confidence on Edited Facts

Baolong Bi, Shenghua Liu, Lingrui Mei, Yiwei Wang, Pengliang Ji, Xueqi, Cheng

PDF

Open Access 1 Repo

TL;DR

This paper introduces DeCK, a novel method that improves knowledge editing in large language models by contrasting logits to enhance confidence in edited facts, especially addressing stubborn knowledge that resists change.

Contribution

DeCK is a new approach that contrasts logits from edited and unedited knowledge to better update LLMs, significantly improving confidence in edited facts.

Findings

01

DeCK improves LLaMA3-8B-instruct performance on MQuAKE by up to 219%.

02

Contrastive decoding enhances confidence in edited knowledge.

03

Addresses stubborn knowledge that resists editing.

Abstract

The knowledge within large language models (LLMs) may become outdated quickly. While in-context editing (ICE) is currently the most effective method for knowledge editing (KE), it is constrained by the black-box modeling of LLMs and thus lacks interpretability. Our work aims to elucidate the superior performance of ICE on the KE by analyzing the impacts of in-context new knowledge on token-wise distributions. We observe that despite a significant boost in logits of the new knowledge, the performance of is still hindered by stubborn knowledge. Stubborn knowledge refers to as facts that have gained excessive confidence during pretraining, making it hard to edit effectively. To address this issue and further enhance the performance of ICE, we propose a novel approach termed $De$ coding by $C$ ontrasting $K$ nowledge (DeCK). DeCK derives the distribution of the next…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

byronbbl/deck
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLaw, AI, and Intellectual Property · Library Science and Information Systems · Digital Rights Management and Security