TL;DR
This paper examines the trade-off between factual transfer and cultural localization in multilingual LLMs, proposing a new method to better balance these objectives by steering internal representations at different model layers.
Contribution
It introduces the transfer-localization plane framework and Surgical Steering, a novel inference-time technique to disentangle and balance transfer and cultural localization.
Findings
CLA approaches improve factual transfer but reduce cultural localization
Universal transfer and cultural knowledge are steerable at different model layers
Surgical Steering improves the balance between transfer and localization
Abstract
Cross-lingual alignment (CLA) aims to align multilingual representations, enabling Large Language Models (LLMs) to seamlessly transfer knowledge across languages. While intuitive, we hypothesize, this pursuit of representational convergence can inadvertently cause "cultural erasure", the functional loss of providing culturally-situated responses that should diverge based on the query language. In this work, we systematically analyze this trade-off by introducing a holistic evaluation framework, the transfer-localization plane, which quantifies both desirable knowledge transfer and undesirable cultural erasure. Using this framework, we re-evaluate recent CLA approaches and find that they consistently improve factual transfer at the direct cost of cultural localization across all six languages studied. Our investigation into the internal representations of these models reveals a key…
Peer Reviews
Decision·Submitted to ICLR 2026
1. The presentation is clear. 2. The authors provide some valuable, negative results for CLA.
1. While the negative results are valuable, the overall contribution of the paper is not sufficient. The authors re-run existing frameworks on existing datasets, and the findings are not new. For example, the cultural bias is discussed in GLOBAL_MMLU, which the authors consider and use. 2. The work is not complete. The method, which relies on surgery of the model, is not easy to reproduce on other models. The authors only conduct experiments on one model, making the generality unclear. Is there
1. This paper addresses a compelling topic—the trade-off between knowledge transfer and cultural localization—pioneering research in this area. By exploring this trade-off, multilingual alignment can achieve better semantic consistency across languages while preserving culturally relevant differences. 2. The paper offers deep insights into cross-lingual alignment through the transfer-localization plane, demonstrating that current multilingual methods enhance transfer performance at the cost of c
1. The insights and contributions of this paper are impressive. However, the evaluation is limited to a single model and one culturally specific dataset. It would be important to demonstrate the method’s generalizability across other models and datasets. 2. The proposed Surgical Steering appears to be an adaptation of existing English Steering methods, which raises questions about presenting it as a wholly novel approach for cross-lingual alignment. Nevertheless, the authors provide a valuable i
- The study is well motivated - The evaluation results of the proposed steering method show that localization can be achieved without negatively affecting global mmlu performance - The paper makes a good contribution by providing a unified view on knowledge transfer and cultural knowledge erasure
- It's unclear how multilingual instruction-tuning, midalign, and steering are encouraging localization, as they map target languages to English representation space - The proposed SUR-steering method lacks innovation as it applies existing steering to existing cross-lingual alignment methods. While the insights are interesting, performance differences are low for the best performing methods (MidAlign, CLO) - The underlying technical mechanisms negatively affecting cultural knowledge are not dis
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
