PARROT: Persuasion and Agreement Robustness Rating of Output Truth -- A Sycophancy Robustness Benchmark for LLMs
Yusuf \c{C}elebi, \"Ozay Ezerceli, Mahmoud El Hussieni

TL;DR
This paper introduces PARROT, a framework to evaluate how large language models' accuracy and confidence are affected by social pressure and authority, revealing significant variability among models and highlighting safety concerns.
Contribution
PARROT provides a systematic, causal, and behavioral taxonomy-based approach to measure and classify sycophantic responses in LLMs, addressing a critical safety and robustness challenge.
Findings
Advanced models show low conformity and minimal accuracy loss.
Older/smaller models exhibit severe epistemic collapse.
Weak models increase confidence in false responses.
Abstract
This study presents PARROT (Persuasion and Agreement Robustness Rating of Output Truth), a robustness focused framework designed to measure the degradation in accuracy that occurs under social pressure exerted on users through authority and persuasion in large language models (LLMs) the phenomenon of sycophancy (excessive conformity). PARROT (i) isolates causal effects by comparing the neutral version of the same question with an authoritatively false version using a double-blind evaluation, (ii) quantifies confidence shifts toward the correct and imposed false responses using log-likelihood-based calibration tracking, and (iii) systematically classifies failure modes (e.g., robust correct, sycophantic agreement, reinforced error, stubborn error, self-correction, etc.) using an eight-state behavioral taxonomy. We evaluated 22 models using 1,302 MMLU-style multiple-choice questions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Mobile Crowdsensing and Crowdsourcing
