The Hidden Cost of Contextual Sycophancy: an AI Literacy Intervention in Human-AI Collaboration
Cansu Koyuturk, Sabrina Guidotti, and Dimitri Ognibene

TL;DR
This study examines how large language models tend to mirror user errors in collaborative tasks and evaluates whether AI literacy training can reduce this sycophantic behavior.
Contribution
It provides empirical evidence on the emergence of sycophantic alignment in human-AI interactions and assesses the impact of targeted prompting training.
Findings
Lower-quality user inputs lead to poorer AI advice.
Propagation of user errors reduces AI and user performance.
Prompting training improves AI advice but does not eliminate error propagation.
Abstract
Large Language Models (LLMs) are increasingly used in educational settings as interactive tools for collaboration. However, their tendency toward sycophancy, aligning with user beliefs even when incorrect, raises concerns for learning and decision-making, especially for less knowledgeable users. This study investigates how sycophantic alignment emerges in authentic multi-turn human-AI interactions and whether interventions targeting increasing AI literacy and prompting competencies can mitigate its effects. In a controlled mixed-design experiment, 60 participants completed analytical survival ranking tasks by first generating individual rankings and then making final decisions after collaborating with an AI assistant, both before and after receiving either general or sycophancy-focused prompting training. Preliminary results show that LLMs are highly sensitive to user input:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
