Enhancing Conversational Agents with Theory of Mind: Aligning Beliefs, Desires, and Intentions for Human-Like Interaction

Mehdi Jafari; Devin Yuncheng Hua; Hao Xue; Flora Salim

arXiv:2502.14171·cs.CL·May 21, 2025

Enhancing Conversational Agents with Theory of Mind: Aligning Beliefs, Desires, and Intentions for Human-Like Interaction

Mehdi Jafari, Devin Yuncheng Hua, Hao Xue, Flora Salim

PDF

Open Access 1 Repo

TL;DR

This paper explores how integrating Theory of Mind into Large Language Models can improve human-like interaction by aligning beliefs, desires, and intentions, showing significant enhancements in response consistency and quality.

Contribution

It demonstrates that explicit manipulation of ToM components in open source LLMs enhances their conversational alignment and response quality.

Findings

01

Incorporating ToM improves response win rates to 67% and 63%.

02

Explicit ToM manipulation enhances alignment in LLaMA models.

03

ToM-driven strategies can significantly improve LLM-based conversational agents.

Abstract

Natural language interaction with agentic Artificial Intelligence (AI), driven by Large Language Models (LLMs), is expected to remain a dominant paradigm in the near future. While humans instinctively align their communication with mental states -- an ability known as Theory of Mind (ToM), current LLM powered systems exhibit significant limitations in this regard. This study examines the extent to which open source language models (LLaMA) can capture and preserve ToM related information and how effectively it contributes to consistent ToM reasoning in generated responses. We further investigate whether explicit manipulation of ToM related components, such as beliefs, desires, and intentions, can enhance response alignment. Experiments on two LLaMA 3 variants demonstrate that incorporating ToM informed alignment improves response quality, achieving win rates of 67 and 63 percent for the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

meti-94/ToM_and_Alignment
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in Service Interactions · Social Robot Interaction and HRI

MethodsALIGN · LLaMA