Cross-Lingual Unlearning of Selective Knowledge in Multilingual Language   Models

Minseok Choi; Kyunghyun Min; Jaegul Choo

arXiv:2406.12354·cs.CL·October 4, 2024

Cross-Lingual Unlearning of Selective Knowledge in Multilingual Language Models

Minseok Choi, Kyunghyun Min, Jaegul Choo

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel method for selectively unlearning sensitive information across multiple languages in multilingual language models, addressing privacy concerns while preserving model performance.

Contribution

It proposes an adaptive unlearning scheme that effectively erases knowledge in specific languages without degrading overall multilingual model performance.

Findings

01

Effective unlearning across languages demonstrated

02

Outperforms existing unlearning baselines

03

Maintains model performance after unlearning

Abstract

Pretrained language models memorize vast amounts of information, including private and copyrighted data, raising significant safety concerns. Retraining these models after excluding sensitive data is prohibitively expensive, making machine unlearning a viable, cost-effective alternative. Previous research has focused on machine unlearning for monolingual models, but we find that unlearning in one language does not necessarily transfer to others. This vulnerability makes models susceptible to low-resource language attacks, where sensitive information remains accessible in less dominant languages. This paper presents a pioneering approach to machine unlearning for multilingual language models, selectively erasing information across different languages while maintaining overall performance. Specifically, our method employs an adaptive unlearning scheme that assigns language-dependent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

brightjade/multilingual-unlearning
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling