EAMET: Robust Massive Model Editing via Embedding Alignment Optimization
Yanbo Dai, Zhenlan Ji, Zongjie Li, Shuai Wang

TL;DR
EAMET introduces a novel embedding alignment approach to improve the robustness and effectiveness of large language model editing, especially at scale and in complex scenarios.
Contribution
The paper presents EAMET, a new method that aligns embeddings to enhance large language model editing reliability and scalability.
Findings
Achieves about 90% editing efficacy on 10,000 facts.
Outperforms existing editing methods across multiple models and datasets.
Demonstrates robustness in context-rich and multi-fact editing scenarios.
Abstract
Model editing techniques are essential for efficiently updating knowledge in large language models (LLMs). However, the effectiveness of existing approaches degrades in massive editing scenarios, particularly when evaluated with practical metrics. Their robustness is also limited in context-rich settings or when editing multiple facts of the same subject simultaneously. We attribute these failures to the embedding misalignment among knowledge items, which undermines editing reliability at scale. To address this, we propose EAMET (Embedding Alignment Model Editing in Transformers), which addresses this issue by aligning the space of key and residual embeddings. Extensive experiments across six LLMs and three datasets demonstrate that EAMET consistently outperforms existing methods, achieving about 90\% editing efficacy when editing 10k facts. Codes and datasets are publicly available at…
Peer Reviews
Decision·ICLR 2026 Poster
+ This paper clearly identifies and theoretically characterizes embedding misalignment as a core scalability bottleneck for existing massive edits approaches. + The proposed EAMET is architecturally compatible with MEMIT-style pipelines. It introduces alignment-based optimization of the derived residual embeddings, which makes sense to address the embedding misalignment problem. + The experiments with 10k+ edits or long prefixes seem to demonstrate the effectiveness of the proposed method.
- It seems that the per-fact residual optimization and alignment steps may be expensive at very large scales. I'm curious about the detailed runtime, memory, and scalability trade-offs with the MEMIT-style baselines. - The optimization of the alignment seems to be sequential. I also wonder if the optimization order can have a difference to the editing results.
1. The proposed EAMET framework brings an innovative solution to improve the effectiveness of model editing, particularly in massive editing scenarios. By aligning key and residual embeddings, it overcomes limitations seen in traditional methods, making it a valuable contribution to the field. 2. The paper includes comprehensive experiments on multiple datasets and models, demonstrating the effectiveness of the proposed method in real-world scenarios. The experimental design is solid, and the r
1. The current empirical analysis (starting at line 200) would benefit from a deeper investigation into how the success rate of editing varies across different categories of knowledge, particularly considering their varying degrees of representation inconsistency. This would provide stronger evidence for the challenges addressed by the proposed method. 2. The paper does not provide a detailed complexity analysis of the Key Embedding Preparation step, which involves calculating a large number o
1. The problem is interesting where massive editing fails. 2. It introduces a stricter, more practical success metric tied to generation. 3. The results show the effectiveness of their method.
1. The primary experiments appropriately focus on large-batch editing, but the performance under single-edit or small-batch scenarios (e.g., editing only one or a few facts) remains unexplored. It would be valuable to examine whether the proposed alignment mechanism still provides benefits in these simpler settings. 2. The use of KL divergence over similarity-based softmax distributions to measure embedding misalignment is unconventional. The paper should clarify whether similar formulations hav
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Model-Driven Software Engineering Techniques
