Breaking Boundaries: Investigating the Effects of Model Editing on   Cross-linguistic Performance

Somnath Banerjee; Avik Halder; Rajarshi Mandal; Sayan Layek; Ian; Soboroff; Rima Hazra; Animesh Mukherjee

arXiv:2406.11139·cs.CL·March 19, 2025

Breaking Boundaries: Investigating the Effects of Model Editing on Cross-linguistic Performance

Somnath Banerjee, Avik Halder, Rajarshi Mandal, Sayan Layek, Ian, Soboroff, Rima Hazra, Animesh Mukherjee

PDF

Open Access

TL;DR

This paper evaluates how different model editing techniques affect multilingual language models' cross-linguistic performance, highlighting challenges and potential solutions for linguistic equity in NLP.

Contribution

It introduces new strategies like ELFI and ELFO to test and improve multilingual models' cross-lingual consistency and fairness.

Findings

01

Significant discrepancies in cross-lingual performance of models.

02

Strategies like ELFI and ELFO reveal model robustness and limitations.

03

Potential for models to better address linguistic diversity.

Abstract

The integration of pretrained language models (PLMs) like BERT and GPT has revolutionized NLP, particularly for English, but it has also created linguistic imbalances. This paper strategically identifies the need for linguistic equity by examining several knowledge editing techniques in multilingual contexts. We evaluate the performance of models such as Mistral, TowerInstruct, OpenHathi, Tamil-Llama, and Kan-Llama across languages including English, German, French, Italian, Spanish, Hindi, Tamil, and Kannada. Our research identifies significant discrepancies in normal and merged models concerning cross-lingual consistency. We employ strategies like 'each language for itself' (ELFI) and 'each language for others' (ELFO) to stress-test these models. Our findings demonstrate the potential for LLMs to overcome linguistic barriers, laying the groundwork for future research in achieving…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems

MethodsAttention Is All You Need · Cosine Annealing · Byte Pair Encoding · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Dropout · Adam · Linear Warmup With Cosine Annealing · Linear Layer