Does AI and Human Advice Mitigate Punishment for Selfish Behavior? An Experiment on AI ethics From a Psychological Perspective

Margarita Leib; Nils Köbis; Ivan Soraperra

arXiv:2507.19487 · cs.CY · July 29, 2025

TL;DR

This study investigates how AI and human advice influence the perception and punishment of selfish behavior. It finds that the content of the advice significantly affects punishment, whereas its source (AI or human) does not.

Contribution

The paper combines social psychology, machine behavior, and behavioral economics in an experiment that tests how advice source, advice content, and the decision-maker's behavior shape punishment and the attribution of responsibility.

Findings

01 Selfish behavior is punished more than prosocial behavior.

02 Prosocial advice leads to harsher punishment of selfish acts.

03 Advice content affects punishment more than advice source.

Abstract

People increasingly rely on AI advice when making decisions. At times, such advice can promote selfish behavior. When individuals abide by selfishness-promoting AI advice, how are they perceived and punished? To study this question, we build on theories from social psychology and combine machine-behavior and behavioral economic approaches. In a pre-registered, financially incentivized experiment, evaluators could punish real decision-makers who (i) received AI, human, or no advice. The advice (ii) encouraged selfish or prosocial behavior, and decision-makers (iii) behaved selfishly or, in a control condition, behaved prosocially. Evaluators further assigned responsibility to decision-makers and their advisors. Results revealed that (i) prosocial behavior was punished very little, whereas selfish behavior was punished much more. Focusing on selfish behavior, (ii) compared to receiving no…
