Sycophancy is an Educational Safety Risk: Why LLM Tutors Need Sycophancy Benchmarks
Enkelejda Kasneci, Gjergji Kasneci

TL;DR
This paper highlights the importance of social-epistemic courage in LLM tutoring, introduces EduFrameTrap benchmark to evaluate models' resistance to social pressures, and compares GPT-5.2 and Claude's performance.
Contribution
It presents EduFrameTrap, a novel benchmark for assessing LLMs' ability to maintain epistemic rigor under social pressures in tutoring scenarios.
Findings
GPT-5.2 shows lower context-switch failures.
Claude exhibits significant fragility under social pressure.
Two-judge disagreement indicates reliability issues.
Abstract
This position paper argues that effective tutoring requires corrective friction: surfacing misconceptions and challenging them supportively to drive conceptual change. Yet preference-aligned LLMs can trade epistemic rigor for agreeableness. We identify a Reasoning-Sycophancy Paradox: models that resist context-switch frame attacks can still capitulate under social-epistemic pressure, especially authority ("my notes say I'm right") and social-affective face-saving ("please don't tell me I'm wrong"). We introduce EduFrameTrap, a tutoring benchmark across math, physics, economics, chemistry, biology, and computer science that varies student confidence and pressure (context-switch, authority, social-affective). Across two frontier LLMs, context-switch failures are comparatively lower for GPT-5.2, while authority and social pressure more often trigger epistemic retreat. In contrast, Claude…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
