DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning

Hengli Li; Song-Chun Zhu; Zilong Zheng

arXiv:2306.09030·cs.CL·June 21, 2023·1 cites

DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning

Hengli Li, Song-Chun Zhu, Zilong Zheng

PDF

Open Access 1 Datasets 1 Video

TL;DR

DiPlomat introduces a new dataset and tasks for evaluating machine pragmatic reasoning in situated conversations, revealing current models' limitations in understanding implicit meanings and context.

Contribution

The paper presents a novel dataset, DiPlomat, and two benchmark tasks for pragmatic reasoning, highlighting the challenges faced by state-of-the-art models in this domain.

Findings

01

Large language models perform poorly on pragmatic reasoning tasks.

02

Context comprehension is crucial for effective human-machine interaction.

03

Current models struggle with applying pragmatic reasoning to implicit meanings.

Abstract

Pragmatic reasoning plays a pivotal role in deciphering implicit meanings that frequently arise in real-life conversations and is essential for the development of communicative social agents. In this paper, we introduce a novel challenge, DiPlomat, aiming at benchmarking machines' capabilities on pragmatic reasoning and situated conversational understanding. Compared with previous works that treat different figurative expressions (e.g. metaphor, sarcasm) as individual tasks, DiPlomat provides a cohesive framework towards general pragmatic understanding. Our dataset is created through the utilization of Amazon Mechanical Turk ( AMT ), resulting in a total of 4, 177 multi-turn dialogues. In conjunction with the dataset, we propose two tasks, Pragmatic Identification and Reasoning (PIR) and Conversational Question Answering (CQA). Experimental results with state-of-the-art (SOTA) neural…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

henry12348/DiPlomat
dataset· 227 dl
227 dl

Videos

Diplomat: A Dialogue Dataset for Situated PragMATic Reasoning· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications