Rethinking Cross-lingual Alignment: Balancing Transfer and Cultural Erasure in Multilingual LLMs

HyoJung Han; Sweta Agrawal; Eleftheria Briakou

arXiv:2510.26024·cs.CL·October 31, 2025

Rethinking Cross-lingual Alignment: Balancing Transfer and Cultural Erasure in Multilingual LLMs

HyoJung Han, Sweta Agrawal, Eleftheria Briakou

PDF

3 Reviews

TL;DR

This paper examines the trade-off between factual transfer and cultural localization in multilingual LLMs, proposing a new method to better balance these objectives by steering internal representations at different model layers.

Contribution

It introduces the transfer-localization plane framework and Surgical Steering, a novel inference-time technique to disentangle and balance transfer and cultural localization.

Findings

01

CLA approaches improve factual transfer but reduce cultural localization

02

Universal transfer and cultural knowledge are steerable at different model layers

03

Surgical Steering improves the balance between transfer and localization

Abstract

Cross-lingual alignment (CLA) aims to align multilingual representations, enabling Large Language Models (LLMs) to seamlessly transfer knowledge across languages. While intuitive, we hypothesize, this pursuit of representational convergence can inadvertently cause "cultural erasure", the functional loss of providing culturally-situated responses that should diverge based on the query language. In this work, we systematically analyze this trade-off by introducing a holistic evaluation framework, the transfer-localization plane, which quantifies both desirable knowledge transfer and undesirable cultural erasure. Using this framework, we re-evaluate recent CLA approaches and find that they consistently improve factual transfer at the direct cost of cultural localization across all six languages studied. Our investigation into the internal representations of these models reveals a key…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 2Confidence 4

Strengths

1. The presentation is clear. 2. The authors provide some valuable, negative results for CLA.

Weaknesses

1. While the negative results are valuable, the overall contribution of the paper is not sufficient. The authors re-run existing frameworks on existing datasets, and the findings are not new. For example, the cultural bias is discussed in GLOBAL_MMLU, which the authors consider and use. 2. The work is not complete. The method, which relies on surgery of the model, is not easy to reproduce on other models. The authors only conduct experiments on one model, making the generality unclear. Is there

Reviewer 02Rating 6Confidence 4

Strengths

1. This paper addresses a compelling topic—the trade-off between knowledge transfer and cultural localization—pioneering research in this area. By exploring this trade-off, multilingual alignment can achieve better semantic consistency across languages while preserving culturally relevant differences. 2. The paper offers deep insights into cross-lingual alignment through the transfer-localization plane, demonstrating that current multilingual methods enhance transfer performance at the cost of c

Weaknesses

1. The insights and contributions of this paper are impressive. However, the evaluation is limited to a single model and one culturally specific dataset. It would be important to demonstrate the method’s generalizability across other models and datasets. 2. The proposed Surgical Steering appears to be an adaptation of existing English Steering methods, which raises questions about presenting it as a wholly novel approach for cross-lingual alignment. Nevertheless, the authors provide a valuable i

Reviewer 03Rating 4Confidence 4

Strengths

- The study is well motivated - The evaluation results of the proposed steering method show that localization can be achieved without negatively affecting global mmlu performance - The paper makes a good contribution by providing a unified view on knowledge transfer and cultural knowledge erasure

Weaknesses

- It's unclear how multilingual instruction-tuning, midalign, and steering are encouraging localization, as they map target languages to English representation space - The proposed SUR-steering method lacks innovation as it applies existing steering to existing cross-lingual alignment methods. While the insights are interesting, performance differences are low for the best performing methods (MidAlign, CLO) - The underlying technical mechanisms negatively affecting cultural knowledge are not dis

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.