Beyond Words: Evaluating and Bridging Epistemic Divergence in User-Agent Interaction via Theory of Mind

Minyuan Ruan; Ziyue Wang; Kaiming Liu; Yunghwei Lai; Peng Li; Yang Liu

arXiv:2602.13832 · cs.CL · February 17, 2026

Open Access

TL;DR

This paper introduces a benchmark and a dataset for evaluating and improving how Large Language Models use Theory of Mind to detect and resolve epistemic divergence in user interactions.

Contribution

The paper formalizes Theory of Mind (ToM) for LLMs as a mechanism for detecting and resolving epistemic divergence, and proposes a benchmark and a dataset for assessing and improving models' understanding of user beliefs in real-world tasks.

Findings

01. Models struggle to identify cognitive gaps affecting task success.

02. Training on belief tracking data improves reasoning about user mental states.

03. Enhanced ToM understanding leads to better downstream task performance.

Abstract

Large Language Models (LLMs) have developed rapidly and are widely applied to both general-purpose and professional tasks to assist human users. However, they still struggle to comprehend and respond to users' true needs when intentions and instructions are imprecisely conveyed, leading to a divergence between subjective user beliefs and true environment states. Resolving this epistemic divergence requires Theory of Mind (ToM), yet existing ToM evaluations for LLMs primarily focus on isolated belief inference, overlooking its functional utility in real-world interaction. To this end, we formalize ToM for LLMs as a mechanism for epistemic divergence detection and resolution, and propose a benchmark, \benchname, to assess how models reconcile user beliefs and profiles in practice. Results across 11 leading models reveal a significant limitation in identifying underlying cognitive gaps…

Taxonomy

Topics: Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI) · Social Robot Interaction and HRI