Merging Methods for Multilingual Knowledge Editing for Large Language Models: An Empirical Odyssey

Kunil Lee; Ki-Young Shin; Jong-Hyeok Lee; Young-Joo Suh

arXiv:2605.13919·cs.CL·May 15, 2026

Merging Methods for Multilingual Knowledge Editing for Large Language Models: An Empirical Odyssey

Kunil Lee, Ki-Young Shin, Jong-Hyeok Lee, Young-Joo Suh

PDF

TL;DR

This paper evaluates vector merging techniques for multilingual knowledge editing in large language models, analyzing their effectiveness, limitations, and factors influencing performance across multiple languages.

Contribution

It systematically compares merging strategies and introduces insights into their practical strengths and limitations for multilingual knowledge editing.

Findings

01

Vector summation with shared covariance is most reliable.

02

TSVM improves some performance but has limited effect on multilingual interference.

03

Performance depends on weight scale and rank ratio, with larger scale and lower rank often better.

Abstract

Multilingual knowledge editing (MKE) remains challenging because language-specific edits interfere with one another, even when locate-then-edit methods work well in monolingual settings. This paper focuses on three issues: the effectiveness of vector merging methods for MKE, the extent to which Task Singular Vectors for Merging (TSVM) can reduce multilingual interference, and the influence of the weight scaling factor and rank compression ratio on performance. We evaluate six merging variants with two popular backbone large language models, two base knowledge editing methods, and 12 languages on the MzsRE benchmark under a large-scale batch-editing setting. Our results show that vector summation with shared covariance is the most reliable overall strategy, whereas simple summation without shared covariance performs poorly. TSVM improves performance in some settings, but its ability to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.