Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models

Chani Jung; Dongkwan Kim; Jiho Jin; Jiseon Kim; Yeon Seonwoo; Yejin Choi; Alice Oh; Hyunwoo Kim

arXiv:2407.06004 · cs.CL · November 8, 2024

TL;DR

This paper investigates the precursors to theory of mind in large language models, introducing new datasets and a method that improve models' understanding of beliefs, especially in false belief situations.

Contribution

The paper introduces two datasets for evaluating perception and perception-to-belief inference in LLMs and proposes PercepToM, a method that enhances ToM performance in these models.

Findings

01 LLMs generally perform well in perception inference.

02 LLMs are limited in perception-to-belief inference, especially in inhibitory control.

03 PercepToM significantly improves false belief understanding.
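
To make the distinction between the two precursory inferences concrete, here is a minimal Python sketch on a Sally-Anne style story. It is not the authors' implementation, nor the format of Percept-ToMi or Percept-FANToM: the `Event` class, the `perceivers` annotation, the story, and the helper names are all hypothetical and chosen only for illustration. Perception inference corresponds to knowing who perceives each event; perception-to-belief inference corresponds to deriving a character's belief from only the events that character perceived, while suppressing later events the character missed.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Event:
    """One story event with a hypothetical perception annotation."""
    text: str
    perceivers: frozenset            # characters who perceive this event
    obj: Optional[str] = None        # object whose location the event sets, if any
    location: Optional[str] = None   # new location of that object, if any


def perceived_events(events: List[Event], character: str) -> List[Event]:
    """Perception step: keep only the events the character perceived."""
    return [e for e in events if character in e.perceivers]


def believed_location(events: List[Event], character: str, obj: str) -> Optional[str]:
    """Perception-to-belief step: the last location of obj that the character perceived."""
    location = None
    for e in perceived_events(events, character):
        if e.obj == obj and e.location is not None:
            location = e.location
    return location


def true_location(events: List[Event], obj: str) -> Optional[str]:
    """Ground truth: the last location of obj over all events."""
    location = None
    for e in events:
        if e.obj == obj and e.location is not None:
            location = e.location
    return location


# Toy story (illustrative only, not taken from any dataset).
STORY = [
    Event("Sally puts the ball in the basket.",
          frozenset({"Sally", "Anne"}), obj="ball", location="basket"),
    Event("Sally leaves the room.", frozenset({"Sally", "Anne"})),
    Event("Anne moves the ball to the box.",
          frozenset({"Anne"}), obj="ball", location="box"),
]
```

In this toy story, `believed_location(STORY, "Sally", "ball")` returns `"basket"` while `true_location(STORY, "ball")` returns `"box"`. That mismatch is the false-belief case: answering correctly for Sally requires ignoring the final event, which she did not perceive.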

Abstract

While humans naturally develop theory of mind (ToM), the capability to understand other people's mental states and beliefs, state-of-the-art large language models (LLMs) underperform on simple ToM benchmarks. We posit that we can extend our understanding of LLMs' ToM abilities by evaluating key human ToM precursors (perception inference and perception-to-belief inference) in LLMs. We introduce two datasets, Percept-ToMi and Percept-FANToM, to evaluate these precursory inferences for ToM in LLMs by annotating characters' perceptions on ToMi and FANToM, respectively. Our evaluation of eight state-of-the-art LLMs reveals that the models generally perform well in perception inference while exhibiting limited capability in perception-to-belief inference (e.g., lack of inhibitory control). Based on these results, we present PercepToM, a novel ToM method leveraging LLMs' strong perception…

Code & Models

Repositories

chanijung/PercepToM (PyTorch, official)

Videos

Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models · Underline

Taxonomy

Topics: Topic Modeling