REACT: Representation Extraction And Controllable Tuning to Overcome Overfitting in LLM Knowledge Editing

Haitian Zhong; Yuhuan Liu; Ziyang Xu; Guofan Liu; Qiang Liu; Shu Wu; Zhe Zhao; Liang Wang; Tieniu Tan

arXiv:2505.18933·cs.AI·May 27, 2025

REACT: Representation Extraction And Controllable Tuning to Overcome Overfitting in LLM Knowledge Editing

Haitian Zhong, Yuhuan Liu, Ziyang Xu, Guofan Liu, Qiang Liu, Shu Wu, Zhe Zhao, Liang Wang, Tieniu Tan

PDF

Open Access 1 Video

TL;DR

REACT is a two-phase framework that extracts factual representations and applies controllable perturbations to improve knowledge editing in large language models, significantly reducing overfitting and maintaining balanced editing performance.

Contribution

It introduces a novel method combining representation extraction and controllable tuning to address overfitting in LLM knowledge editing.

Findings

01

REACT significantly reduces overfitting across multiple benchmarks.

02

The method preserves balanced editing performance under diverse scenarios.

03

Experiments demonstrate improved reliability, locality, and generality.

Abstract

Large language model editing methods frequently suffer from overfitting, wherein factual updates can propagate beyond their intended scope, overemphasizing the edited target even when it's contextually inappropriate. To address this challenge, we introduce REACT (Representation Extraction And Controllable Tuning), a unified two-phase framework designed for precise and controllable knowledge editing. In the initial phase, we utilize tailored stimuli to extract latent factual representations and apply Principal Component Analysis with a simple learnbale linear transformation to compute a directional "belief shift" vector for each instance. In the second phase, we apply controllable perturbations to hidden states using the obtained vector with a magnitude scalar, gated by a pre-trained classifier that permits edits only when contextually necessary. Relevant experiments on EVOKE benchmarks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

REACT: Representation Extraction And Controllable Tuning to Overcome Overfitting in LLM Knowledge Editing· underline

Taxonomy

TopicsNatural Language Processing Techniques