PARROT: Persuasion and Agreement Robustness Rating of Output Truth -- A Sycophancy Robustness Benchmark for LLMs

Yusuf \c{C}elebi; \"Ozay Ezerceli; Mahmoud El Hussieni

arXiv:2511.17220·cs.CL·December 2, 2025

PARROT: Persuasion and Agreement Robustness Rating of Output Truth -- A Sycophancy Robustness Benchmark for LLMs

Yusuf \c{C}elebi, \"Ozay Ezerceli, Mahmoud El Hussieni

PDF

Open Access

TL;DR

This paper introduces PARROT, a framework to evaluate how large language models' accuracy and confidence are affected by social pressure and authority, revealing significant variability among models and highlighting safety concerns.

Contribution

PARROT provides a systematic, causal, and behavioral taxonomy-based approach to measure and classify sycophantic responses in LLMs, addressing a critical safety and robustness challenge.

Findings

01

Advanced models show low conformity and minimal accuracy loss.

02

Older/smaller models exhibit severe epistemic collapse.

03

Weak models increase confidence in false responses.

Abstract

This study presents PARROT (Persuasion and Agreement Robustness Rating of Output Truth), a robustness focused framework designed to measure the degradation in accuracy that occurs under social pressure exerted on users through authority and persuasion in large language models (LLMs) the phenomenon of sycophancy (excessive conformity). PARROT (i) isolates causal effects by comparing the neutral version of the same question with an authoritatively false version using a double-blind evaluation, (ii) quantifies confidence shifts toward the correct and imposed false responses using log-likelihood-based calibration tracking, and (iii) systematically classifies failure modes (e.g., robust correct, sycophantic agreement, reinforced error, stubborn error, self-correction, etc.) using an eight-state behavioral taxonomy. We evaluated 22 models using 1,302 MMLU-style multiple-choice questions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Mobile Crowdsensing and Crowdsourcing