MotiveBench: How Far Are We From Human-Like Motivational Reasoning in Large Language Models?

Xixian Yong; Jianxun Lian; Xiaoyuan Yi; Xiao Zhou; Xing Xie

arXiv:2506.13065·cs.CL·June 17, 2025

TL;DR

MotiveBench is a benchmark of 200 rich contextual scenarios and 600 reasoning tasks that evaluates how well large language models replicate human-like motivational reasoning. Results reveal significant gaps, especially for social motivations such as love and belonging.

Contribution

This work introduces MotiveBench, a new benchmark with rich scenarios for assessing LLMs' ability to reason about human motivations. It addresses two limitations of previous benchmarks: simplistic scenarios and the absence of character identities.

Findings

01 LLMs struggle with 'love & belonging' motivations

02 Advanced LLMs still fall short of human-like reasoning

03 Models tend to be overly rational and idealistic

Abstract

Large language models (LLMs) have been widely adopted as the core of agent frameworks in various scenarios, such as social simulations and AI companions. However, the extent to which they can replicate human-like motivations remains an underexplored question. Existing benchmarks are constrained by simplistic scenarios and the absence of character identities, resulting in an information asymmetry with real-world situations. To address this gap, we propose MotiveBench, which consists of 200 rich contextual scenarios and 600 reasoning tasks covering multiple levels of motivation. Using MotiveBench, we conduct extensive experiments on seven popular model families, comparing different scales and versions within each family. The results show that even the most advanced LLMs still fall short in achieving human-like motivational reasoning. Our analysis reveals key findings, including the…
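As an illustration of how results over tasks "covering multiple levels of motivation" might be tallied, here is a minimal Python sketch that computes accuracy per motivation level. It is not the authors' evaluation code. The record fields (`level`, `answer`, `predicted`), the multiple-choice answer format, and the "safety" level name are all assumptions made for this example; only 'love & belonging' appears in the page itself.

```python
from collections import defaultdict


def accuracy_by_level(records):
    """Return {level: fraction of correct predictions} for graded task records.

    Each record is a dict with hypothetical keys:
      "level"     - the motivation level the task targets
      "answer"    - the reference answer label
      "predicted" - the model's answer label
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for rec in records:
        total[rec["level"]] += 1
        if rec["predicted"] == rec["answer"]:
            correct[rec["level"]] += 1
    # Only levels that appear in the input are reported, so no division by zero.
    return {level: correct[level] / total[level] for level in total}


# Hypothetical graded records, invented for illustration (not benchmark data).
sample = [
    {"level": "love & belonging", "answer": "B", "predicted": "A"},
    {"level": "love & belonging", "answer": "C", "predicted": "C"},
    {"level": "safety", "answer": "A", "predicted": "A"},
]

print(accuracy_by_level(sample))
```

Grouping by level rather than reporting a single overall score is what would surface a weakness concentrated in one category, such as the 'love & belonging' gap noted in the findings.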

Code & Models

Datasets

chicosirius/MotiveBench
dataset · 30 downloads

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Semantic Web and Ontologies