Hierarchical Neural Data Synthesis for Semantic Parsing

Wei Yang; Peng Xu; Yanshuai Cao

arXiv:2112.02212·cs.CL·December 7, 2021

Hierarchical Neural Data Synthesis for Semantic Parsing

Wei Yang, Peng Xu, Yanshuai Cao

PDF

Open Access

TL;DR

This paper introduces a neural data augmentation method for semantic parsing that eliminates the need for rule-based grammar engineering, enabling zero-shot domain adaptation and achieving state-of-the-art results on a text-to-SQL benchmark.

Contribution

A purely neural data synthesis approach for semantic parsing that improves accuracy and supports zero-shot domain adaptation without handcrafted rules.

Findings

01

Achieves 77.2% accuracy on Spider benchmark

02

Outperforms rule-based data augmentation methods

03

Enables zero-shot domain transfer in semantic parsing

Abstract

Semantic parsing datasets are expensive to collect. Moreover, even the questions pertinent to a given domain, which are the input of a semantic parsing system, might not be readily available, especially in cross-domain semantic parsing. This makes data augmentation even more challenging. Existing methods to synthesize new data use hand-crafted or induced rules, requiring substantial engineering effort and linguistic expertise to achieve good coverage and precision, which limits the scalability. In this work, we propose a purely neural approach of data augmentation for semantic parsing that completely removes the need for grammar engineering while achieving higher semantic parsing accuracy. Furthermore, our method can synthesize in the zero-shot setting, where only a new domain schema is available without any input-output examples of the new domain. On the Spider cross-domain text-to-SQL…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications