How Far Can LLMs Emulate Human Behavior?: A Strategic Analysis via the Buy-and-Sell Negotiation Game
Mingyu Jeon, Jaeyoung Suh, Suwan Cho, Dohyeon Kim

TL;DR
This paper introduces a negotiation-based evaluation framework to assess how well large language models can imitate human social behaviors and strategic decision-making in realistic scenarios.
Contribution
It proposes a novel methodology using buy-and-sell negotiations to measure LLMs' social, emotional, and strategic capabilities beyond traditional knowledge benchmarks.
Findings
Models with higher benchmark scores perform better in negotiations.
Performance drops in social/emotional contexts for some models.
Cunning traits lead to more successful negotiation outcomes.
Abstract
With the rapid advancement of Large Language Models (LLMs), recent studies have drawn attention to their potential for handling not only simple question-answer tasks but also more complex conversational abilities and performing human-like behavioral imitations. In particular, there is considerable interest in how accurately LLMs can reproduce real human emotions and behaviors, as well as whether such reproductions can function effectively in real-world scenarios. However, existing benchmarks focus primarily on knowledge-based assessment and thus fall short of sufficiently reflecting social interactions and strategic dialogue capabilities. To address these limitations, this work proposes a methodology to quantitatively evaluate the human emotional and behavioral imitation and strategic decision-making capabilities of LLMs by employing a Buy and Sell negotiation simulation. Specifically,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in Service Interactions · Topic Modeling · Artificial Intelligence in Law
