Large Language Models as Theory of Mind Aware Generative Agents with Counterfactual Reflection
Bo Yang, Jiaxian Guo, Yusuke Iwasawa, Yutaka Matsuo

TL;DR
This paper introduces ToM-agent, a novel framework enabling large language models to simulate theory of mind in open-domain conversations, improving understanding of mental states and enhancing social behavior modeling.
Contribution
The study presents a new paradigm for LLM-based agents to dynamically infer and reflect on mental states, incorporating counterfactual reflection to improve social interaction capabilities.
Findings
ToM-agent effectively models beliefs, desires, and intentions in conversations.
Counterfactual reflection enhances the accuracy of mental state inference.
The approach improves performance in first- and second-order theory of mind tasks.
Abstract
Recent studies have increasingly demonstrated that large language models (LLMs) possess significant theory of mind (ToM) capabilities, showing the potential for simulating the tracking of mental states in generative agents. In this study, we propose a novel paradigm called ToM-agent, designed to empower LLMs-based generative agents to simulate ToM in open-domain conversational interactions. ToM-agent disentangles the confidence from mental states, facilitating the emulation of an agent's perception of its counterpart's mental states, such as beliefs, desires, and intentions (BDIs). Using past conversation history and verbal reflections, ToM-Agent can dynamically adjust counterparts' inferred BDIs, along with related confidence levels. We further put forth a counterfactual intervention method that reflects on the gap between the predicted responses of counterparts and their real…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
