Assessing Large Language Models' ability to predict how humans balance self-interest and the interest of others
Valerio Capraro, Roberto Di Paolo, Veronica Pizziol

TL;DR
This study evaluates GPT-4's ability to predict human decision-making in social dilemmas, revealing its strengths in capturing behavioral patterns but also its biases in estimating self-interest and altruism.
Contribution
It is the first to compare advanced chatbots' predictions of human social behavior across diverse experiments, highlighting GPT-4's unique capabilities and biases.
Findings
GPT-4 correctly identifies behavioral classes
GPT-4 underestimates self-interest and inequity-aversion
GPT-4 overestimates altruistic behavior
Abstract
Generative artificial intelligence (AI) holds enormous potential to revolutionize decision-making processes, from everyday to high-stake scenarios. By leveraging generative AI, humans can benefit from data-driven insights and predictions, enhancing their ability to make informed decisions that consider a wide array of factors and potential outcomes. However, as many decisions carry social implications, for AI to be a reliable assistant for decision-making it is crucial that it is able to capture the balance between self-interest and the interest of others. We investigate the ability of three of the most advanced chatbots to predict dictator game decisions across 108 experiments with human participants from 12 countries. We find that only GPT-4 (not Bard nor Bing) correctly captures qualitative behavioral patterns, identifying three major classes of behavior: self-interested,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInnovation, Sustainability, Human-Machine Systems
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Softmax · Dense Connections · Dropout · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Residual Connection
