Does AI and Human Advice Mitigate Punishment for Selfish Behavior? An Experiment on AI ethics From a Psychological Perspective
Margarita Leib, Nils K\"obis, Ivan Soraperra

TL;DR
This study investigates how AI and human advice influence perceptions and punishment of selfish behavior, revealing that advice content significantly impacts punishment, while the source does not.
Contribution
It combines social psychology, machine behavior, and behavioral economics to experimentally analyze how advice type and behavior influence punishment and responsibility attribution.
Findings
Selfish behavior is punished more than prosocial behavior.
Prosocial advice leads to harsher punishment of selfish acts.
Advice content affects punishment more than advice source.
Abstract
People increasingly rely on AI-advice when making decisions. At times, such advice can promote selfish behavior. When individuals abide by selfishness-promoting AI advice, how are they perceived and punished? To study this question, we build on theories from social psychology and combine machine-behavior and behavioral economic approaches. In a pre-registered, financially-incentivized experiment, evaluators could punish real decision-makers who (i) received AI, human, or no advice. The advice (ii) encouraged selfish or prosocial behavior, and decision-makers (iii) behaved selfishly or, in a control condition, behaved prosocially. Evaluators further assigned responsibility to decision-makers and their advisors. Results revealed that (i) prosocial behavior was punished very little, whereas selfish behavior was punished much more. Focusing on selfish behavior, (ii) compared to receiving no…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
