FIBER: A Multilingual Evaluation Resource for Factual Inference Bias
Evren Ayberk Munis, Deniz Y{\i}lmaz, Arianna Muti,\c{C}a\u{g}r{\i} Toraman

TL;DR
FIBER is a multilingual benchmark designed to evaluate factual inference bias in large language models across different languages and entity settings, revealing language-dependent biases and performance variations.
Contribution
This paper introduces FIBER, a novel multilingual benchmark for assessing factual knowledge and inference biases in language models across multiple languages and entity types.
Findings
Language of prompt influences model output bias, especially for country-associated entities.
Models perform worse on multi-entity questions than single-entity ones.
Larger models outperform smaller ones and English prompts yield higher accuracy.
Abstract
Large language models are widely used across domains, yet there are concerns about their factual reliability and biases. Factual knowledge probing offers a systematic means to evaluate these aspects. Most existing benchmarks focus on single-entity facts and monolingual data. We therefore present FIBER, a multilingual benchmark for evaluating factual knowledge in single- and multi-entity settings. The dataset includes sentence completion, question-answering, and object-count prediction tasks in English, Italian, and Turkish. Using FIBER, we examine whether the prompt language induces inference bias in entity selection and how large language models perform on multi-entity versus single-entity questions. The results indicate that the language of the prompt can influence the model's generated output, particularly for entities associated with the country corresponding to that language.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Artificial Intelligence in Healthcare and Education
