$\texttt{DIAMONDs}$: A Dataset for $\mathbb{D}$ynamic $\mathbb{I}$nformation $\mathbb{A}$nd $\mathbb{M}$ental modeling $\mathbb{O}$f $\mathbb{N}$umeric $\mathbb{D}$iscussions
Sayontan Ghosh, Mahnaz Koupaee, Yash Kumar Lal, Pegah Alipoormolabashi, Mohammad Saqib Hasan, Jun Seok Kang, Niranjan Balasubramanian

TL;DR
This paper introduces DIAMONDs, a new dataset designed to evaluate Theory of Mind in multiparty conversations, focusing on dynamic information tracking and numerical reasoning in goal-oriented discussions.
Contribution
The paper presents a scalable methodology for creating high-quality conversational QA datasets and introduces DIAMONDs, enabling precise evaluation of ToM capabilities in complex, real-world conversations.
Findings
State-of-the-art models struggle with participant-centric reasoning.
Models have difficulty handling false beliefs and distractors.
Limited ability to identify insufficient information scenarios.
Abstract
Understanding multiparty conversations demands robust Theory of Mind (ToM) capabilities, including the ability to track dynamic information, manage knowledge asymmetries, and distinguish relevant information across extended exchanges. To advance ToM evaluation in such settings, we present a carefully designed scalable methodology for generating high-quality benchmark conversation-question pairs with these characteristics. Using this methodology, we create , a new conversational QA dataset covering common business, financial or other group interactions. In these goal-oriented conversations, participants often have to track certain numerical quantities (say ) of interest that can be derived from other variable quantities (like , etc.), whose values also change over the course of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Embodied and Extended Cognition
