Interactive Evaluation of Dialog Track at DSTC9

Shikib Mehri; Yulan Feng; Carla Gordon; Seyed Hossein Alavi; David Traum; Maxine Eskenazi

arXiv:2207.14403 · cs.CL · August 1, 2022 · 5 cites

Open Access

TL;DR

This paper discusses the Interactive Evaluation of Dialog Track at DSTC9, focusing on developing and assessing dialog systems in interactive, real-user settings to improve open-domain response quality.

Contribution

It introduces a new interactive evaluation track with two sub-tasks, emphasizing real-user interaction and extending beyond static datasets for dialog system assessment.

Findings

01. Development of models capable of engaging in real-user interactions
02. Insights into evaluation strategies for open-domain dialog systems
03. Challenges faced in transitioning from static to interactive evaluation

Abstract

The ultimate goal of dialog research is to develop systems that can be effectively used in interactive settings by real users. To this end, we introduced the Interactive Evaluation of Dialog Track at the 9th Dialog System Technology Challenge. This track consisted of two sub-tasks. The first sub-task involved building knowledge-grounded response generation models. The second sub-task aimed to extend dialog models beyond static datasets by assessing them in an interactive setting with real users. Our track challenges participants to develop strong response generation models and explore strategies that extend them to back-and-forth interactions with real users. The progression from static corpora to interactive evaluation introduces unique challenges and facilitates a more thorough assessment of open-domain dialog systems. This paper provides an overview of the track, including the…

Taxonomy

Topics: Topic Modeling · Speech and dialogue systems · AI in Service Interactions