Learning Dynamic Belief Graphs for Theory-of-mind Reasoning
Ruxiao Chen, Xilei Zhao, Thomas J. Cova, Frank A. Drews, Susu Xu

TL;DR
This paper presents a structured belief graph model for enhancing Theory-of-Mind reasoning in LLMs, enabling dynamic belief inference and decision-making in uncertain, high-stakes environments.
Contribution
It introduces a novel probabilistic graphical model approach that captures evolving mental states and belief dependencies over time, improving reasoning in dynamic contexts.
Findings
Significantly improves action prediction accuracy in disaster datasets.
Recovers interpretable belief trajectories aligned with human reasoning.
Enhances LLMs with dynamic ToM capabilities in high-uncertainty scenarios.
Abstract
Theory of Mind (ToM) reasoning with Large Language Models (LLMs) requires inferring how people's implicit, evolving beliefs shape what they seek and how they act under uncertainty -- especially in high-stakes settings such as disaster response, emergency medicine, and human-in-the-loop autonomy. Prior approaches either prompt LLMs directly or use latent-state models that treat beliefs as static and independent, often producing incoherent mental models over time and weak reasoning in dynamic contexts. We introduce a structured cognitive trajectory model for LLM-based ToM that represents mental state as a dynamic belief graph, jointly inferring latent beliefs, learning their time-varying dependencies, and linking belief evolution to information seeking and decisions. Our model contributes (i) a novel projection from textualized probabilistic statements to consistent probabilistic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI) · Child and Animal Learning Development
