Evaluating Contextually Mediated Factual Recall in Multilingual Large Language Models

Yihong Liu; Bingyu Xiong; Hinrich Schütze

arXiv:2601.12555·cs.CL·January 21, 2026

TL;DR

This paper investigates how multilingual large language models retrieve factual knowledge when the information is embedded in naturalistic contexts rather than explicit queries, revealing that contextual cues often impair recall performance.

Contribution

The paper introduces an evaluation framework for contextually mediated factual recall across languages and analyzes how model size and name type (synthetic versus real) affect retrieval accuracy.

Findings

01 Contextual mediation degrades factual recall performance.

02 Larger models are more robust to contextual effects.

03 Performance varies significantly across different relations.

Abstract

Large language models (LLMs) can recall a wide range of factual knowledge across languages. However, existing factual recall evaluations primarily assess fact retrieval in isolation, where the queried entity is explicitly named and the fact is requested directly. In natural language use, facts are often accessed through context, where the relevant entity is introduced only indirectly. In this work, we study contextually mediated factual recall, asking whether LLMs can reliably retrieve factual knowledge when the target entity is embedded in a naturalistic context rather than queried explicitly, across languages. We construct controlled prompts that preserve the underlying fact while introducing referential mediation through contextual sentences. To disentangle contextual effects from name-specific associations, we further compare performance using synthetic names and real names across…
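
The contrast the abstract draws between an explicit query and a contextually mediated one can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the `Fact` class, the "lives in" context sentence, the query template, and the example names are hypothetical and are not taken from the paper's prompt set.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fact:
    """A single fact; field names are hypothetical, not the paper's schema."""
    subject: str   # entity the fact is about, e.g. "France"
    template: str  # query template with a {subject} slot
    answer: str    # expected completion, e.g. "Paris"


def direct_prompt(fact: Fact) -> str:
    """Explicit query: the entity is named directly in the question."""
    return fact.template.format(subject=fact.subject)


def mediated_prompt(fact: Fact, name: str) -> str:
    """Mediated query: the entity appears only in a context sentence,
    and the question refers to it indirectly through a person name.
    The 'lives in' wording is an illustrative assumption."""
    context = f"{name} lives in {fact.subject}."
    mention = f"the country where {name} lives"
    return f"{context} {fact.template.format(subject=mention)}"


fact = Fact(subject="France", template="The capital of {subject} is", answer="Paris")

print(direct_prompt(fact))
print(mediated_prompt(fact, "Marie"))   # a real-sounding name
print(mediated_prompt(fact, "Zorvek"))  # a made-up, synthetic-style name
```

Both prompt forms target the same answer, so any accuracy gap between them would reflect the added referential step rather than a change in the underlying fact. Swapping the name argument between real and synthetic values mirrors, in spirit only, the comparison the abstract describes.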

Taxonomy

Topics: Topic Modeling · Advanced Graph Neural Networks · Information Retrieval and Search Behavior