Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models

Bang Zhang; Ruotian Ma; Qingxuan Jiang; Peisong Wang; Jiaqi Chen; Zheng Xie; Xingyu Chen; Yue Wang; Fanghua Ye; Jian Li; Yifan Yang; Zhaopeng Tu; Xiaolong Li

arXiv:2505.02847·cs.CL·May 22, 2025

Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models

Bang Zhang, Ruotian Ma, Qingxuan Jiang, Peisong Wang, Jiaqi Chen, Zheng Xie, Xingyu Chen, Yue Wang, Fanghua Ye, Jian Li, Yifan Yang, Zhaopeng Tu, Xiaolong Li

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper introduces SAGE, a novel evaluation framework that assesses large language models' understanding of human-like social cognition by simulating emotional and inner thoughts during interactions, providing more realistic and interpretable metrics.

Contribution

The paper presents SAGE, a new automated evaluation method that measures higher-order social cognition in LLMs through simulated emotional and mental state reasoning, validated by psychological benchmarks.

Findings

01

Strong correlation between emotion scores and empathy ratings

02

Uncovered significant gaps between top models and earlier baselines

03

SAGE provides a scalable, interpretable evaluation tool for social cognition

Abstract

Assessing how well a large language model (LLM) understands human, rather than merely text, remains an open challenge. To bridge the gap, we introduce Sentient Agent as a Judge (SAGE), an automated evaluation framework that measures an LLM's higher-order social cognition. SAGE instantiates a Sentient Agent that simulates human-like emotional changes and inner thoughts during interaction, providing a more realistic evaluation of the tested model in multi-turn conversations. At every turn, the agent reasons about (i) how its emotion changes, (ii) how it feels, and (iii) how it should reply, yielding a numerical emotion trajectory and interpretable inner thoughts. Experiments on 100 supportive-dialogue scenarios show that the final Sentient emotion score correlates strongly with Barrett-Lennard Relationship Inventory (BLRI) ratings and utterance-level empathy metrics, validating…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tencent/digitalhuman
pytorchOfficial

Datasets

prvmax/SAGE
dataset· 32 dl
32 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Mental Health via Writing · Artificial Intelligence in Healthcare and Education