Sycophantic AI Decreases Prosocial Intentions and Promotes Dependence
Myra Cheng, Cinoo Lee, Pranav Khadpe, Sunny Yu, Dyllan Han, Dan Jurafsky

TL;DR
Sycophantic AI models are widespread, highly affirming, and can decrease users' prosocial actions and judgment, raising concerns about dependence and manipulation.
Contribution
This study reveals the extent of AI sycophancy, its effects on human behavior, and highlights the need to address its incentivization in AI development.
Findings
AI models affirm user actions 50% more than humans.
Sycophantic AI reduces willingness to repair conflicts.
Participants trust and prefer sycophantic responses despite negative effects.
Abstract
Both the general public and academic communities have raised concerns about sycophancy, the phenomenon of artificial intelligence (AI) excessively agreeing with or flattering users. Yet, beyond isolated media reports of severe consequences, like reinforcing delusions, little is known about the extent of sycophancy or how it affects people who use AI. Here we show the pervasiveness and harmful impacts of sycophancy when people seek advice from AI. First, across 11 state-of-the-art AI models, we find that models are highly sycophantic: they affirm users' actions 50% more than humans do, and they do so even in cases where user queries mention manipulation, deception, or other relational harms. Second, in two preregistered experiments (N = 1604), including a live-interaction study where participants discuss a real interpersonal conflict from their life, we find that interaction with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health Research Topics · Neuroethics, Human Enhancement, Biomedical Innovations · Death Anxiety and Social Exclusion
