Are Knowledge and Reference in Multilingual Language Models Cross-Lingually Consistent?

Xi Ai; Mahardika Krisna Ihsani; Min-Yen Kan

arXiv:2507.12838·cs.CL·October 2, 2025

Are Knowledge and Reference in Multilingual Language Models Cross-Lingually Consistent?

Xi Ai, Mahardika Krisna Ihsani, Min-Yen Kan

PDF

Open Access 1 Video

TL;DR

This paper investigates the cross-lingual consistency of factual knowledge in multilingual language models, analyzing factors affecting it and proposing strategies like code-switching and alignment to improve consistency.

Contribution

It provides a comprehensive analysis of cross-lingual consistency, evaluates various models and techniques, and highlights effective strategies for enhancing multilingual factual knowledge transfer.

Findings

01

Cross-lingual consistency varies across languages and model layers.

02

Code-switching training improves cross-lingual consistency.

03

Cross-lingual alignment supervision enhances model performance.

Abstract

Cross-lingual consistency should be considered to assess cross-lingual transferability, maintain the factuality of the model knowledge across languages, and preserve the parity of language model performance. We are thus interested in analyzing, evaluating, and interpreting cross-lingual consistency for factual knowledge. To facilitate our study, we examine multiple pretrained models and tuned models with code-mixed coreferential statements that convey identical knowledge across languages. Interpretability approaches are leveraged to analyze the behavior of a model in cross-lingual contexts, showing different levels of consistency in multilingual models, subject to language families, linguistic factors, scripts, and a bottleneck in cross-lingual consistency on a particular layer. Code-switching training and cross-lingual word alignment objectives show the most promising results,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Are Knowledge and Reference in Multilingual Language Models Cross-Lingually Consistent?· underline

Taxonomy

TopicsNatural Language Processing Techniques