Evaluating Cross-Lingual Unlearning in Multilingual Language Models

Tyler Lizzo; Larry Heck

arXiv:2601.06675·cs.CL·January 13, 2026

Evaluating Cross-Lingual Unlearning in Multilingual Language Models

Tyler Lizzo, Larry Heck

PDF

Open Access

TL;DR

This paper evaluates cross-lingual unlearning in multilingual language models, revealing challenges in removing facts across languages and proposing subspace projection as an effective solution for multilingual forgetting.

Contribution

It provides the first comprehensive evaluation of cross-lingual unlearning and demonstrates the effectiveness of subspace-projection methods in multilingual models.

Findings

01

Most unlearning algorithms fail to remove facts outside the training language.

02

Subspace-projection outperforms other methods in cross-lingual forgetting.

03

Removing shared interlingua subspaces harms all languages.

Abstract

We present the first comprehensive evaluation of cross-lingual unlearning in multilingual LLMs. Using translated TOFU benchmarks in seven language/script variants, we test major unlearning algorithms and show that most fail to remove facts outside the training language, even when utility remains high. However, subspace-projection consistently outperforms the other methods, achieving strong cross-lingual forgetting with minimal degradation. Analysis of learned task subspaces reveals a shared interlingua structure: removing this shared subspace harms all languages, while removing language-specific components selectively affects one. These results demonstrate that multilingual forgetting depends on geometry in weight space, motivating subspace-based approaches for future unlearning systems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Natural Language Processing Techniques