Effects of personality steering on cooperative behavior in Large Language Model agents
Mizuki Sakai, Mizuki Yokoyama, Wakaba Tateishi, Genki Ichinose

TL;DR
This study investigates how personality traits influence cooperation in large language model agents, revealing that agreeableness promotes cooperation but can also increase vulnerability, with effects varying across model generations.
Contribution
It provides the first systematic analysis of personality steering effects on LLM cooperation using the Big Five framework and repeated Prisoner's Dilemma games.
Findings
Agreeableness strongly promotes cooperation.
Personality steering increases cooperation but raises exploitation risk.
Later models show more selective cooperation.
Abstract
Large language models (LLMs) are increasingly used as autonomous agents in strategic and social interactions. Although recent studies suggest that assigning personality traits to LLMs can influence their behavior, how personality steering affects cooperation under controlled conditions remains unclear. In this study, we examine the effects of personality steering on cooperative behavior in LLM agents using repeated Prisoner's Dilemma games. Based on the Big Five framework, we first measure basic personality scores of three models, GPT-3.5-turbo, GPT-4o, and GPT-5, using the Big Five Inventory. We then compare behavior under baseline and personality-informed conditions, and further analyze the effects of independently manipulating each personality dimension to extreme values. Our results show that agreeableness is the dominant factor promoting cooperation across all models, while other…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage and cultural evolution · Evolutionary Game Theory and Cooperation · Multi-Agent Systems and Negotiation
