Towards Machine Theory of Mind with Large Language Model-Augmented Inverse Planning

Rebekah A. Gelp\'i; Eric Xue; William A. Cunningham

arXiv:2507.03682·cs.AI·July 8, 2025

Towards Machine Theory of Mind with Large Language Model-Augmented Inverse Planning

Rebekah A. Gelp\'i, Eric Xue, William A. Cunningham

PDF

3 Reviews

TL;DR

This paper introduces a hybrid approach combining large language models and Bayesian inverse planning to improve machine Theory of Mind, enabling better mental state prediction and reasoning in complex scenarios.

Contribution

It presents a novel hybrid method that leverages LLMs and inverse planning, enhancing ToM prediction accuracy and scalability over existing models.

Findings

01

Achieves near-optimal results on ToM tasks inspired by inverse planning.

02

Outperforms LLM-only models on reasoning tasks, even with smaller LLMs.

03

Demonstrates potential for predicting mental states in open-ended scenarios.

Abstract

We propose a hybrid approach to machine Theory of Mind (ToM) that uses large language models (LLMs) as a mechanism for generating hypotheses and likelihood functions with a Bayesian inverse planning model that computes posterior probabilities for an agent's likely mental states given its actions. Bayesian inverse planning models can accurately predict human reasoning on a variety of ToM tasks, but these models are constrained in their ability to scale these predictions to scenarios with a large number of possible hypotheses and actions. Conversely, LLM-based approaches have recently demonstrated promise in solving ToM benchmarks, but can exhibit brittleness and failures on reasoning tasks even when they pass otherwise structurally identical versions. By combining these two methods, this approach leverages the strengths of each component, closely matching optimal results on a task…

Peer Reviews

Decision·Submitted to ICLR 2025

Reviewer 01Rating 6Confidence 3

Strengths

* Smart integration of LLMs with inverse planning—leverages the open-endedness of LLMs and the reasoning strengths of Bayesian models. * Addresses limitations of both LLMs (brittleness, reasoning errors) and Bayesian models (scaling issues with hypothesis/action spaces). * Experimental results are promising and cool to see them match Bayesian methods in posteriors * Potential for application in developing socially intelligent agents and enhancing human-AI interaction

Weaknesses

* Limited baselines: the baselines in the paper do not reflect the current SOTA of these approaches. For Study 1 you should add few shot prompting, CoT, ReAct, Reflexion type baselines as well. Zero shot baseline is not a strong baseline * Experiments seem narrow in scope—focused on the food truck type toy tasks from cognitive science. The true promise of the method is that it should be more scalable than bayesian methods yet a majority of the experiments were with environments that Bayesian met

Reviewer 02Rating 5Confidence 5

Strengths

The LLM effectively generates a wide range of hypotheses, significantly reducing the computational burden of hypothesis generation. It is also more reliable when combined with the Bayesian model so that the accuracy could be higher than using pure LLM. LLMs like GPT-4o, which are strong in logical computation, can compute posterior distributions directly, further reducing computational costs. This saves lots of manpower and computation resources.

Weaknesses

Although overall the approach is interesting, there are several weaknesses. 1) The benchmarks provided seem insufficient for a comprehensive evaluation. Currently the comparison has been made with LAIP(LLM computes posterior), LAIP (single LLM call), pure LLM, and a steadily updated model. They are the variation or downgraded versions of the proposed LAIP model. They are good baselines though. 2) The paper claims that the scalability issue is a big problem, and I fully agree. This is exponenti

Reviewer 03Rating 8Confidence 3

Strengths

- The proposed approach is completely novel, to my knowledge, and faithfully integrates LLMs as tools for Bayesian inverse planning. - The proposed approach is assessed in multiple clear studies and against multiple methodological variants (e.g. Single CoT), and using multiple SOTA LLMs as the engine for their approach. - The prose is very easy to understand, arguments are well-motivated, the background literature is up to date. - Results are compelling.

Weaknesses

- The paper could use some diagrams to better capture the intuition of inverse planning, perhaps by including an example of agent paths through the restaurant environment in Figure 1. - The paper could use some diagrams to better capture the pipeline of different LAIP variations, or at least the main LAIP approach, including the main components of the pipeline (the LLM, the hypotheses, the generated actions, the true grounded actions, the true hypothesis) and how they are elicited. - Some discus

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.