Source framing triggers systematic evaluation bias in Large Language Models

Federico Germani; Giovanni Spitale

arXiv:2505.13488·cs.CL·May 21, 2025

Source framing triggers systematic evaluation bias in Large Language Models

Federico Germani, Giovanni Spitale

PDF

Open Access

TL;DR

This paper investigates how source framing influences the evaluation bias of Large Language Models, revealing that attribution to Chinese sources systematically lowers agreement scores and affects evaluation neutrality.

Contribution

It systematically examines the impact of source framing on LLM evaluation consistency, highlighting biases introduced by attribution to different sources.

Findings

01

High inter- and intra-model agreement in blind evaluations

02

Source attribution to Chinese individuals lowers agreement scores

03

Framing effects significantly impact evaluation fairness

Abstract

Large Language Models (LLMs) are increasingly used not only to generate text but also to evaluate it, raising urgent questions about whether their judgments are consistent, unbiased, and robust to framing effects. In this study, we systematically examine inter- and intra-model agreement across four state-of-the-art LLMs (OpenAI o3-mini, Deepseek Reasoner, xAI Grok 2, and Mistral) tasked with evaluating 4,800 narrative statements on 24 different topics of social, political, and public health relevance, for a total of 192,000 assessments. We manipulate the disclosed source of each statement to assess how attribution to either another LLM or a human author of specified nationality affects evaluation outcomes. We find that, in the blind condition, different LLMs display a remarkably high degree of inter- and intra-model agreement across topics. However, this alignment breaks down when…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling