Language Models' Factuality Depends on the Language of Inquiry
Tushar Aggarwal, Kumar Tanmay, Ayush Agrawal, Kumar Ayush, Hamid, Palangi, Paul Pu Liang

TL;DR
This paper introduces a benchmark and metrics to evaluate how well multilingual language models recall facts and transfer knowledge across languages, revealing significant weaknesses in cross-lingual generalization.
Contribution
The paper presents a new benchmark with 10,000 facts across 13 languages and three metrics to systematically assess factual recall and knowledge transferability in multilingual LMs.
Findings
State-of-the-art LMs struggle with cross-lingual knowledge transfer.
Models show inconsistent factual recall depending on the language used.
Fundamental weaknesses in multilingual factual knowledge transfer are identified.
Abstract
Multilingual language models (LMs) are expected to recall factual knowledge consistently across languages, yet they often fail to transfer knowledge between languages even when they possess the correct information in one of the languages. For example, we find that an LM may correctly identify Rashed Al Shashai as being from Saudi Arabia when asked in Arabic, but consistently fails to do so when asked in English or Swahili. To systematically investigate this limitation, we introduce a benchmark of 10,000 country-related facts across 13 languages and propose three novel metrics: Factual Recall Score, Knowledge Transferability Score, and Cross-Lingual Factual Knowledge Transferability Score-to quantify factual recall and knowledge transferability in LMs across different languages. Our results reveal fundamental weaknesses in today's state-of-the-art LMs, particularly in cross-lingual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
