MindDial: Belief Dynamics Tracking with Theory-of-Mind Modeling for   Situated Neural Dialogue Generation

Shuwen Qiu; Mingdian Liu; Hengli Li; Song-Chun Zhu; Zilong Zheng

arXiv:2306.15253·cs.CL·May 27, 2024·1 cites

MindDial: Belief Dynamics Tracking with Theory-of-Mind Modeling for Situated Neural Dialogue Generation

Shuwen Qiu, Mingdian Liu, Hengli Li, Song-Chun Zhu, Zilong Zheng

PDF

Open Access

TL;DR

MindDial introduces a theory-of-mind framework for dialogue generation that tracks beliefs and improves alignment and negotiation in conversational AI.

Contribution

The paper presents MindDial, a novel belief-tracking framework with an explicit mind module for better situated dialogue generation.

Findings

01

Models with mind modeling achieve higher task success in alignment and negotiation.

02

The three-level belief design enhances information aggregation and task outcomes.

03

Applicable to both prompting and fine-tuning models.

Abstract

Humans talk in daily conversations while aligning and negotiating the expressed meanings or common ground. Despite the impressive conversational abilities of the large generative language models, they do not consider the individual differences in contextual understanding in a shared situated environment. In this work, we propose MindDial, a novel conversational framework that can generate situated free-form responses with theory-of-mind modeling. We introduce an explicit mind module that can track the speaker's belief and the speaker's prediction of the listener's belief. Then the next response is generated to resolve the belief difference and take task-related action. Our framework is applied to both prompting and fine-tuning-based models, and is evaluated across scenarios involving both common ground alignment and negotiation. Experiments show that models with mind modeling can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Multimodal Machine Learning Applications

MethodsALIGN