EAMET: Robust Massive Model Editing via Embedding Alignment Optimization

Yanbo Dai; Zhenlan Ji; Zongjie Li; Shuai Wang

arXiv:2505.11876·cs.CL·February 25, 2026

EAMET: Robust Massive Model Editing via Embedding Alignment Optimization

Yanbo Dai, Zhenlan Ji, Zongjie Li, Shuai Wang

PDF

Open Access 1 Repo 3 Reviews

TL;DR

EAMET introduces a novel embedding alignment approach to improve the robustness and effectiveness of large language model editing, especially at scale and in complex scenarios.

Contribution

The paper presents EAMET, a new method that aligns embeddings to enhance large language model editing reliability and scalability.

Findings

01

Achieves about 90% editing efficacy on 10,000 facts.

02

Outperforms existing editing methods across multiple models and datasets.

03

Demonstrates robustness in context-rich and multi-fact editing scenarios.

Abstract

Model editing techniques are essential for efficiently updating knowledge in large language models (LLMs). However, the effectiveness of existing approaches degrades in massive editing scenarios, particularly when evaluated with practical metrics. Their robustness is also limited in context-rich settings or when editing multiple facts of the same subject simultaneously. We attribute these failures to the embedding misalignment among knowledge items, which undermines editing reliability at scale. To address this, we propose EAMET (Embedding Alignment Model Editing in Transformers), which addresses this issue by aligning the space of key and residual embeddings. Extensive experiments across six LLMs and three datasets demonstrate that EAMET consistently outperforms existing methods, achieving about 90\% editing efficacy when editing 10k facts. Codes and datasets are publicly available at…

Peer Reviews

Decision·ICLR 2026 Poster

Reviewer 01Rating 6Confidence 2

Strengths

+ This paper clearly identifies and theoretically characterizes embedding misalignment as a core scalability bottleneck for existing massive edits approaches. + The proposed EAMET is architecturally compatible with MEMIT-style pipelines. It introduces alignment-based optimization of the derived residual embeddings, which makes sense to address the embedding misalignment problem. + The experiments with 10k+ edits or long prefixes seem to demonstrate the effectiveness of the proposed method.

Weaknesses

- It seems that the per-fact residual optimization and alignment steps may be expensive at very large scales. I'm curious about the detailed runtime, memory, and scalability trade-offs with the MEMIT-style baselines. - The optimization of the alignment seems to be sequential. I also wonder if the optimization order can have a difference to the editing results.

Reviewer 02Rating 6Confidence 4

Strengths

1. The proposed EAMET framework brings an innovative solution to improve the effectiveness of model editing, particularly in massive editing scenarios. By aligning key and residual embeddings, it overcomes limitations seen in traditional methods, making it a valuable contribution to the field. 2. The paper includes comprehensive experiments on multiple datasets and models, demonstrating the effectiveness of the proposed method in real-world scenarios. The experimental design is solid, and the r

Weaknesses

1. The current empirical analysis (starting at line 200) would benefit from a deeper investigation into how the success rate of editing varies across different categories of knowledge, particularly considering their varying degrees of representation inconsistency. This would provide stronger evidence for the challenges addressed by the proposed method. 2. The paper does not provide a detailed complexity analysis of the Key Embedding Preparation step, which involves calculating a large number o

Reviewer 03Rating 4Confidence 4

Strengths

1. The problem is interesting where massive editing fails. 2. It introduces a stricter, more practical success metric tied to generation. 3. The results show the effectiveness of their method.

Weaknesses

1. The primary experiments appropriately focus on large-batch editing, but the performance under single-edit or small-batch scenarios (e.g., editing only one or a few facts) remains unexplored. It would be valuable to examine whether the proposed alignment mechanism still provides benefits in these simpler settings. 2. The use of KL divergence over similarity-based softmax distributions to measure embedding misalignment is unconventional. The paper should clarify whether similar formulations hav

Code & Models

Repositories

ybdai7/NAMET-massive-editing
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Model-Driven Software Engineering Techniques