UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Language Models

Xiaojie Gu; Ziying Huang; Jia-Chen Gu; Kai Zhang

arXiv:2505.14679·cs.CL·March 11, 2026

UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Language Models

Xiaojie Gu, Ziying Huang, Jia-Chen Gu, Kai Zhang

PDF

1 Repo 1 Datasets 4 Reviews

TL;DR

UltraEdit is a novel, efficient, and scalable method for lifelong editing of large language models that requires no additional training or memory, enabling fast updates with minimal resource use.

Contribution

UltraEdit introduces a training-, subject-, and memory-free approach for scalable lifelong model editing, significantly improving speed and resource efficiency over previous methods.

Findings

01

UltraEdit achieves over 7x faster editing speeds than state-of-the-art methods.

02

UltraEdit requires 4x less VRAM, enabling editing of 7B models on consumer GPUs.

03

Supports up to 2 million edits while maintaining high accuracy.

Abstract

Lifelong learning enables large language models (LLMs) to adapt to evolving information by continually updating their internal knowledge. An ideal system should support efficient, wide-ranging updates while preserving existing capabilities and ensuring reliable deployment. Model editing stands out as a promising solution for this goal, offering a focused and efficient way to revise a model's internal knowledge. Although recent paradigms have made notable progress, they often struggle to meet the demands of practical lifelong adaptation at scale. To bridge this gap, we propose UltraEdit, a training-, subject-, and memory-free approach that is well-suited for ultra-scalable, real-world lifelong model editing. UltraEdit fundamentally differs from traditional paradigms by computing parameter shifts in one step using only a hidden state and its gradient, making the approach simple yet…

Peer Reviews

Decision·ICLR 2026 Conference Withdrawn Submission

Reviewer 01Rating 4Confidence 4

Strengths

1. The paper is well organized and clearly written, making the method easy to follow and reproduce. 2. ULTRAEDIT employs a one-step parameter shift that requires neither iterative updates nor retraining. It appears to be a simple yet effective approach with a clever and well-motivated formulation. 3. The approach seems computationally efficient and memory-friendly, enabling large-scale and lifelong edits while maintaining model stability. 4. Experimental results are comprehensive, covering mu

Weaknesses

1. While the paper highlights the contribution of the normalization mechanism to feature stability, it lacks deeper theoretical or analysis to substantiate this claim. A more detailed theoretical or analysis could further improve the work. 2. Although efficient overall, the per-module caching of features (H, V) and the required matrix solve can raise peak memory and compute costs as batch size, feature dimensionality, or the number of editable modules grows.

Reviewer 02Rating 8Confidence 3

Strengths

The proposed lifelong normalization strategy is a simple yet elegant contribution that addresses the stability bottleneck in lifelong model editing. It adapts running statistics across editing turns, mitigating edit drift and catastrophic forgetting. It also removes the dependency on hypernetwork training, subject localization, or external memory, making it feasible for real-world continual knowledge updates on consumer GPUs.

Weaknesses

While the method is intuitive and experimental results are convincing, there seems no enough theoretical analysis explaining why lifelong normalization ensures stability.

Reviewer 03Rating 2Confidence 4

Strengths

1. The paper is well written and easy to follow, and the method is described in a modular way. 2. ULTRAEDITBENCH is quite large and could be a useful community resource for assessing editing methods under large-scale conditions. 3. The evaluations cover multiple datasets and model families, with ablations that probe the contribution of the normalization component.

Weaknesses

1. Limited explanation for the method. Authors claim that "by calibrating the mean and variance, it mitigates the overwriting of previously acquired knowledge", but they provide neither a theoretical justification nor targeted tests to substantiate this mechanism beyond ablations. 2. Mechanistic explanation is also underdeveloped. The insight behind concatenating hidden states and gradients into a unified feature remains unclear. Moreover, why can normalization stabilize the learning dynamics a

Reviewer 04Rating 2Confidence 5

Strengths

* The paper tackles an important and timely problem in scalable and continual model editing. The proposed framework enables LLMs to adapt to new information continuously, making lifelong knowledge updating both efficient and practical. * ULTRAEDITBENCH, containing over 2M editing samples, represents the largest benchmark in this area and provides a solid foundation for evaluation at unprecedented scale. * Extensive experiments across diverse LLMs demonstrate that ULTRAEDIT is both effective and

Weaknesses

* **Unreliable evaluation foundation**. Although the authors claim to use LLM-as-judge as a complementary metric, nearly all experiments in the main text rely on **teacher-forcing–based evaluation**, which is known to **overestimate editing performance by feeding the gold answer tokens during decoding**. The key difference between the previous evaluation and the more rigorous WILD framework is not whether an LLM is used as a judge, but whether the evaluation depends on teacher forcing. Prior stu

Code & Models

Repositories

xiaojiegu/ultraedit
pytorchOfficial

Datasets

XiaojieGu/UltraEditBench
dataset· 28 dl
28 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.