Enhancing Conversational Agents with Theory of Mind: Aligning Beliefs, Desires, and Intentions for Human-Like Interaction
Mehdi Jafari, Devin Yuncheng Hua, Hao Xue, Flora Salim

TL;DR
This paper explores how integrating Theory of Mind into Large Language Models can improve human-like interaction by aligning beliefs, desires, and intentions, showing significant enhancements in response consistency and quality.
Contribution
It demonstrates that explicit manipulation of ToM components in open source LLMs enhances their conversational alignment and response quality.
Findings
Incorporating ToM improves response win rates to 67% and 63%.
Explicit ToM manipulation enhances alignment in LLaMA models.
ToM-driven strategies can significantly improve LLM-based conversational agents.
Abstract
Natural language interaction with agentic Artificial Intelligence (AI), driven by Large Language Models (LLMs), is expected to remain a dominant paradigm in the near future. While humans instinctively align their communication with mental states -- an ability known as Theory of Mind (ToM), current LLM powered systems exhibit significant limitations in this regard. This study examines the extent to which open source language models (LLaMA) can capture and preserve ToM related information and how effectively it contributes to consistent ToM reasoning in generated responses. We further investigate whether explicit manipulation of ToM related components, such as beliefs, desires, and intentions, can enhance response alignment. Experiments on two LLaMA 3 variants demonstrate that incorporating ToM informed alignment improves response quality, achieving win rates of 67 and 63 percent for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in Service Interactions · Social Robot Interaction and HRI
MethodsALIGN · LLaMA
