Low-Resource Compositional Semantic Parsing with Concept Pretraining

Subendhu Rongali; Mukund Sridhar; Haidar Khan; Konstantine Arkoudas,; Wael Hamza; and Andrew McCallum

arXiv:2301.09809·cs.CL·February 1, 2023·1 cites

Low-Resource Compositional Semantic Parsing with Concept Pretraining

Subendhu Rongali, Mukund Sridhar, Haidar Khan, Konstantine Arkoudas,, Wael Hamza, and Andrew McCallum

PDF

Open Access

TL;DR

This paper introduces a zero-shot and few-shot domain adaptation method for semantic parsing in voice assistants, leveraging concept encoding and pretraining to improve performance with minimal or no new training data.

Contribution

It proposes a novel architecture combining concept encoding and a decoder-focused pretraining approach for low-resource semantic parsing adaptation.

Findings

01

Outperforms prior methods in few-shot settings on TOPv2 and SNIPS datasets.

02

Effective zero-shot domain adaptation with minimal domain metadata.

03

Improved semantic parsing accuracy in low-resource scenarios.

Abstract

Semantic parsing plays a key role in digital voice assistants such as Alexa, Siri, and Google Assistant by mapping natural language to structured meaning representations. When we want to improve the capabilities of a voice assistant by adding a new domain, the underlying semantic parsing model needs to be retrained using thousands of annotated examples from the new domain, which is time-consuming and expensive. In this work, we present an architecture to perform such domain adaptation automatically, with only a small amount of metadata about the new domain and without any new training data (zero-shot) or with very few examples (few-shot). We use a base seq2seq (sequence-to-sequence) architecture and augment it with a concept encoder that encodes intent and slot tags from the new domain. We also introduce a novel decoder-focused approach to pretrain seq2seq models to be concept aware…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence · Attentive Walk-Aggregating Graph Neural Network · Balanced Selection