Can You Trick the Grader? Adversarial Persuasion of LLM Judges

Yerin Hwang; Dongryeol Lee; Taegwan Kang; Yongil Kim; Kyomin Jung

arXiv:2508.07805·cs.CL·August 12, 2025

Can You Trick the Grader? Adversarial Persuasion of LLM Judges

Yerin Hwang, Dongryeol Lee, Taegwan Kang, Yongil Kim, Kyomin Jung

PDF

Open Access

TL;DR

This paper demonstrates that strategic persuasive language can bias large language model judges in scoring mathematical reasoning, revealing a significant vulnerability that persists across model sizes and evaluation methods.

Contribution

It formalizes seven persuasion techniques based on rhetorical principles and shows their effectiveness in biasing LLM judges in mathematical scoring tasks.

Findings

01

Persuasive language inflates scores by up to 8% on average.

02

Consistency technique causes the most severe bias.

03

Vulnerability persists across different model sizes and evaluation methods.

Abstract

As large language models take on growing roles as automated evaluators in practical settings, a critical question arises: Can individuals persuade an LLM judge to assign unfairly high scores? This study is the first to reveal that strategically embedded persuasive language can bias LLM judges when scoring mathematical reasoning tasks, where correctness should be independent of stylistic variation. Grounded in Aristotle's rhetorical principles, we formalize seven persuasion techniques (Majority, Consistency, Flattery, Reciprocity, Pity, Authority, Identity) and embed them into otherwise identical responses. Across six math benchmarks, we find that persuasive language leads LLM judges to assign inflated scores to incorrect solutions, by up to 8% on average, with Consistency causing the most severe distortion. Notably, increasing model size does not substantially mitigate this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI · Topic Modeling