TL;DR
This paper investigates whether large language models' reasoning skills translate into social intelligence in cooperative scenarios, revealing that current models tend to favor individual gain over group cooperation, mirroring human behaviors.
Contribution
It demonstrates that existing reasoning-enhanced language models often reduce cooperation in social dilemmas, highlighting the need for architectures that better integrate social intelligence.
Findings
Reasoning models decrease cooperation and norm enforcement.
Groups with more reasoning agents have lower collective gains.
Models mirror human patterns of spontaneous giving and calculated greed.
Abstract
Large language models demonstrate strong problem-solving abilities through reasoning techniques such as chain-of-thought prompting and reflection. However, it remains unclear whether these reasoning capabilities extend to a form of social intelligence: making effective decisions in cooperative contexts. We examine this question using economic games that simulate social dilemmas. First, we apply chain-of-thought and reflection prompting to GPT-4o in a Public Goods Game. We then evaluate multiple off-the-shelf models across six cooperation and punishment games, comparing those with and without explicit reasoning mechanisms. We find that reasoning models consistently reduce cooperation and norm enforcement, favoring individual rationality. In repeated interactions, groups with more reasoning agents exhibit lower collective gains. These behaviors mirror human patterns of "spontaneous giving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
