Semantic Diversity in Dialogue with Natural Language Inference
Katherine Stasaski, Marti A. Hearst

TL;DR
This paper introduces a new NLI-based metric for measuring semantic diversity in dialogue responses and proposes a method to enhance diversity, significantly improving the variety of generated conversational replies.
Contribution
It presents a novel NLI-based diversity metric and a generation method that substantially increases response diversity in neural dialogue systems.
Findings
NLI Diversity correlates with semantic diversity.
Contradiction relation is more effective than neutral for measuring diversity.
Diversity Threshold Generation increases diversity by 137%.
Abstract
Generating diverse, interesting responses to chitchat conversations is a problem for neural conversational agents. This paper makes two substantial contributions to improving diversity in dialogue generation. First, we propose a novel metric which uses Natural Language Inference (NLI) to measure the semantic diversity of a set of model responses for a conversation. We evaluate this metric using an established framework (Tevet and Berant, 2021) and find strong evidence indicating NLI Diversity is correlated with semantic diversity. Specifically, we show that the contradiction relation is more useful than the neutral relation for measuring this diversity and that incorporating the NLI model's confidence achieves state-of-the-art results. Second, we demonstrate how to iteratively improve the semantic diversity of a sampled set of responses via a new generation procedure called Diversity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Explainable Artificial Intelligence (XAI)
