Pragmatically Appropriate Diversity for Dialogue Evaluation

Katherine Stasaski; Marti A. Hearst

arXiv:2304.02812·cs.CL·April 7, 2023·5 cites

Pragmatically Appropriate Diversity for Dialogue Evaluation

Katherine Stasaski, Marti A. Hearst

PDF

Open Access

TL;DR

This paper introduces Pragmatically Appropriate Diversity, a new concept linking speech acts to response diversity in dialogue, supported by human studies and proposing improved evaluation methods for dialogue systems.

Contribution

It defines Pragmatically Appropriate Diversity, demonstrates its relevance through human-created datasets, and proposes a novel human evaluation task for dialogue response diversity.

Findings

01

Speech acts signal the potential for response diversity

02

Human judgments align with Pragmatically Appropriate Diversity

03

Diversity expectations should vary by speech act

Abstract

Linguistic pragmatics state that a conversation's underlying speech acts can constrain the type of response which is appropriate at each turn in the conversation. When generating dialogue responses, neural dialogue agents struggle to produce diverse responses. Currently, dialogue diversity is assessed using automatic metrics, but the underlying speech acts do not inform these metrics. To remedy this, we propose the notion of Pragmatically Appropriate Diversity, defined as the extent to which a conversation creates and constrains the creation of multiple diverse responses. Using a human-created multi-response dataset, we find significant support for the hypothesis that speech acts provide a signal for the diversity of the set of next responses. Building on this result, we propose a new human evaluation task where creative writers predict the extent to which conversations inspire the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsALIGN