InMind: Evaluating LLMs in Capturing and Applying Individual Human Reasoning Styles

Zizhen Li; Chuanhao Li; Yibin Wang; Qi Chen; Diping Song; Yukang Feng; Jianwen Sun; Jiaxin Ai; Fanrui Zhang; Mingzhu Sun; Kaipeng Zhang

arXiv:2508.16072 · cs.AI · September 23, 2025

TL;DR

InMind is a new evaluation framework that assesses whether large language models can understand and adapt to individual human reasoning styles in social deduction games, revealing current models' limitations in personalized reasoning.

Contribution

The paper introduces InMind, a cognitively grounded framework for evaluating LLMs' ability to capture and apply personalized reasoning styles in social deduction contexts.

Findings

01  GPT-4o relies on lexical cues and struggles with temporal reasoning.

02  Reasoning-enhanced models like DeepSeek-R1 show early signs of style-sensitive reasoning.

03  Current LLMs have limited capacity for individualized, adaptive reasoning.

Abstract

LLMs have shown strong performance on human-centric reasoning tasks. While previous evaluations have explored whether LLMs can infer intentions or detect deception, they often overlook the individualized reasoning styles that influence how people interpret and act in social contexts. Social deduction games (SDGs), in which different players may adopt diverse but contextually valid reasoning strategies under identical conditions, provide a natural testbed for evaluating such styles. To address this gap, we introduce InMind, a cognitively grounded evaluation framework designed to assess whether LLMs can capture and apply personalized reasoning styles in SDGs. InMind enhances structured gameplay data with round-level strategy traces and post-game reflections, collected under both Observer and Participant modes. It supports four cognitively motivated tasks that jointly evaluate…

Taxonomy

Topics: Artificial Intelligence in Law · Natural Language Processing Techniques · Semantic Web and Ontologies