A Few-Shot Semantic Parser for Wizard-of-Oz Dialogues with the Precise   ThingTalk Representation

Giovanni Campagna; Sina J. Semnani; Ryan Kearns; Lucas Jun Koba Sato,; Silei Xu; Monica S. Lam

arXiv:2009.07968·cs.CL·April 11, 2022·1 cites

A Few-Shot Semantic Parser for Wizard-of-Oz Dialogues with the Precise ThingTalk Representation

Giovanni Campagna, Sina J. Semnani, Ryan Kearns, Lucas Jun Koba Sato,, Silei Xu, Monica S. Lam

PDF

Open Access 1 Repo

TL;DR

This paper introduces a sample-efficient method for building precise semantic parsers for Wizard-of-Oz dialogues using an extended ThingTalk representation, achieving high accuracy with limited annotated data.

Contribution

It extends the ThingTalk language for better dialogue state representation and proposes a combined few-shot and synthesized data training strategy for semantic parsing.

Findings

01

ThingTalk captures 98% of test turns

02

Simulator emulates 85% of validation set

03

Semantic parser achieves 79% accuracy

Abstract

Previous attempts to build effective semantic parsers for Wizard-of-Oz (WOZ) conversations suffer from the difficulty in acquiring a high-quality, manually annotated training set. Approaches based only on dialogue synthesis are insufficient, as dialogues generated from state-machine based models are poor approximations of real-life conversations. Furthermore, previously proposed dialogue state representations are ambiguous and lack the precision necessary for building an effective agent. This paper proposes a new dialogue representation and a sample-efficient methodology that can predict precise dialogue states in WOZ conversations. We extended the ThingTalk representation to capture all information an agent needs to respond properly. Our training strategy is sample-efficient: we combine (1) fewshot data sparsely sampling the full dialogue space and (2) synthesized data covering a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

stanford-oval/schema2qa
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems