MultiWOZ 2.2 : A Dialogue Dataset with Additional Annotation Corrections   and State Tracking Baselines

Xiaoxue Zang; Abhinav Rastogi; Srinivas Sunkara; Raghav Gupta; Jianguo; Zhang; Jindong Chen

arXiv:2007.12720·cs.CL·July 28, 2020·26 cites

MultiWOZ 2.2 : A Dialogue Dataset with Additional Annotation Corrections and State Tracking Baselines

Xiaoxue Zang, Abhinav Rastogi, Srinivas Sunkara, Raghav Gupta, Jianguo, Zhang, Jindong Chen

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper introduces MultiWOZ 2.2, an improved dialogue dataset with corrected annotations, redefined ontology, and standardized slot span annotations, along with baseline benchmarks for dialogue state tracking models.

Contribution

It provides a refined version of MultiWOZ with extensive annotation corrections, ontology updates, and standardized slot annotations, plus baseline model evaluations.

Findings

01

Reduced annotation errors in 17.3% of dialogues

02

Standardized slot span annotations for key slots

03

Benchmark results for state-of-the-art dialogue models

Abstract

MultiWOZ is a well-known task-oriented dialogue dataset containing over 10,000 annotated dialogues spanning 8 domains. It is extensively used as a benchmark for dialogue state tracking. However, recent works have reported presence of substantial noise in the dialogue state annotations. MultiWOZ 2.1 identified and fixed many of these erroneous annotations and user utterances, resulting in an improved version of this dataset. This work introduces MultiWOZ 2.2, which is a yet another improved version of this dataset. Firstly, we identify and fix dialogue state annotation errors across 17.3% of the utterances on top of MultiWOZ 2.1. Secondly, we redefine the ontology by disallowing vocabularies of slots with a large number of possible values (e.g., restaurant name, time of booking). In addition, we introduce slot span annotations for these slots to standardize them across recent models,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

budzianowski/multiwoz
pytorchOfficial

Datasets

OpenLLM-France/Claire-Dialogue-English-0.1
dataset· 59 dl
59 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques