Microsoft Dialogue Challenge: Building End-to-End Task-Completion Dialogue Systems
Xiujun Li, Yu Wang, Siqi Sun, Sarah Panda, Jingjing Liu, and Jianfeng Gao

TL;DR
This paper presents a Dialogue Challenge aimed at advancing end-to-end task-completion dialogue systems by providing standardized datasets, an experimental platform, and evaluation methods to foster collaboration and benchmarking in the research community.
Contribution
It introduces a comprehensive benchmark with annotated datasets, simulators, and evaluation protocols for developing and assessing dialogue systems across multiple domains.
Findings
Release of annotated conversational datasets in three domains
Development of an experiment platform with built-in simulators
Evaluation framework including both simulated and human assessments
Abstract
This proposal introduces a Dialogue Challenge for building end-to-end task-completion dialogue systems, with the goal of encouraging the dialogue research community to collaborate and benchmark on standard datasets and unified experimental environment. In this special session, we will release human-annotated conversational data in three domains (movie-ticket booking, restaurant reservation, and taxi booking), as well as an experiment platform with built-in simulators in each domain, for training and evaluation purposes. The final submitted systems will be evaluated both in simulated setting and by human judges.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Multi-Agent Systems and Negotiation · Service-Oriented Architecture and Web Services
