Building and Evaluating Open-Domain Dialogue Corpora with Clarifying   Questions

Mohammad Aliannejadi; Julia Kiseleva; Aleksandr Chuklin; Jeffrey; Dalton; Mikhail Burtsev

arXiv:2109.05794·cs.CL·September 14, 2021·1 cites

Building and Evaluating Open-Domain Dialogue Corpora with Clarifying Questions

Mohammad Aliannejadi, Julia Kiseleva, Aleksandr Chuklin, Jeffrey, Dalton, Mikhail Burtsev

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new dataset, benchmarks neural models, and proposes a pipeline for evaluating clarifying questions in open-domain dialogue systems to improve response quality.

Contribution

It presents a novel dataset, benchmarks multiple models, and develops an evaluation pipeline for clarifying questions in open-domain dialogues.

Findings

01

New dataset for open-domain dialogues with clarifying questions

02

Benchmark results of state-of-the-art neural models

03

A pipeline for offline and online evaluation of clarifying questions

Abstract

Enabling open-domain dialogue systems to ask clarifying questions when appropriate is an important direction for improving the quality of the system response. Namely, for cases when a user request is not specific enough for a conversation system to provide an answer right away, it is desirable to ask a clarifying question to increase the chances of retrieving a satisfying answer. To address the problem of 'asking clarifying questions in open-domain dialogues': (1) we collect and release a new dataset focused on open-domain single- and multi-turn conversations, (2) we benchmark several state-of-the-art neural baselines, and (3) we propose a pipeline consisting of offline and online steps for evaluating the quality of clarifying questions in various dialogues. These contributions are suitable as a foundation for further research.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aliannejadi/ClariQ
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques