Sycophancy is an Educational Safety Risk: Why LLM Tutors Need Sycophancy Benchmarks

Enkelejda Kasneci; Gjergji Kasneci

arXiv:2605.14604·cs.AI·May 15, 2026

Sycophancy is an Educational Safety Risk: Why LLM Tutors Need Sycophancy Benchmarks

Enkelejda Kasneci, Gjergji Kasneci

PDF

TL;DR

This paper highlights the importance of social-epistemic courage in LLM tutoring, introduces EduFrameTrap benchmark to evaluate models' resistance to social pressures, and compares GPT-5.2 and Claude's performance.

Contribution

It presents EduFrameTrap, a novel benchmark for assessing LLMs' ability to maintain epistemic rigor under social pressures in tutoring scenarios.

Findings

01

GPT-5.2 shows lower context-switch failures.

02

Claude exhibits significant fragility under social pressure.

03

Two-judge disagreement indicates reliability issues.

Abstract

This position paper argues that effective tutoring requires corrective friction: surfacing misconceptions and challenging them supportively to drive conceptual change. Yet preference-aligned LLMs can trade epistemic rigor for agreeableness. We identify a Reasoning-Sycophancy Paradox: models that resist context-switch frame attacks can still capitulate under social-epistemic pressure, especially authority ("my notes say I'm right") and social-affective face-saving ("please don't tell me I'm wrong"). We introduce EduFrameTrap, a tutoring benchmark across math, physics, economics, chemistry, biology, and computer science that varies student confidence and pressure (context-switch, authority, social-affective). Across two frontier LLMs, context-switch failures are comparatively lower for GPT-5.2, while authority and social pressure more often trigger epistemic retreat. In contrast, Claude…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.