Adaptive Token Biaser: Knowledge Editing via Biasing Key Entities

Baolong Bi; Shenghua Liu; Yiwei Wang; Lingrui Mei; Hongcheng Gao,; Yilong Xu; Xueqi Cheng

arXiv:2406.12468·cs.CL·June 19, 2024

Adaptive Token Biaser: Knowledge Editing via Biasing Key Entities

Baolong Bi, Shenghua Liu, Yiwei Wang, Lingrui Mei, Hongcheng Gao,, Yilong Xu, Xueqi Cheng

PDF

Open Access

TL;DR

The paper introduces ATBias, a decoding technique that improves knowledge editing in large language models by biasing tokens related to key entities, significantly enhancing performance with minimal latency increase.

Contribution

ATBias is a novel decoding method that selectively biases tokens related to key entities, improving in-context editing efficiency and effectiveness in large language models.

Findings

01

Achieves up to 32.3% improvement over state-of-the-art ICE methods.

02

Reduces latency by half compared to existing techniques.

03

Widely applicable to various LLMs with negligible additional cost.

Abstract

The parametric knowledge memorized by large language models (LLMs) becomes outdated quickly. In-context editing (ICE) is currently the most effective method for updating the knowledge of LLMs. Recent advancements involve enhancing ICE by modifying the decoding strategy, obviating the need for altering internal model structures or adjusting external prompts. However, this enhancement operates across the entire sequence generation, encompassing a plethora of non-critical tokens. In this work, we introduce $A$ daptive $T$ oken $Bias$ er ( $ATBias$ ), a new decoding technique designed to enhance ICE. It focuses on the tokens that are mostly related to knowledge during decoding, biasing their logits by matching key entities related to new and parametric knowledge. Experimental results show that ATBias significantly enhances ICE performance, achieving up to a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Topic Modeling · Reinforcement Learning in Robotics