Hierarchical Neural Data Synthesis for Semantic Parsing
Wei Yang, Peng Xu, Yanshuai Cao

TL;DR
This paper introduces a neural data augmentation method for semantic parsing that eliminates the need for rule-based grammar engineering, enabling zero-shot domain adaptation and achieving state-of-the-art results on a text-to-SQL benchmark.
Contribution
A purely neural data synthesis approach for semantic parsing that improves accuracy and supports zero-shot domain adaptation without handcrafted rules.
Findings
Achieves 77.2% accuracy on Spider benchmark
Outperforms rule-based data augmentation methods
Enables zero-shot domain transfer in semantic parsing
Abstract
Semantic parsing datasets are expensive to collect. Moreover, even the questions pertinent to a given domain, which are the input of a semantic parsing system, might not be readily available, especially in cross-domain semantic parsing. This makes data augmentation even more challenging. Existing methods to synthesize new data use hand-crafted or induced rules, requiring substantial engineering effort and linguistic expertise to achieve good coverage and precision, which limits the scalability. In this work, we propose a purely neural approach of data augmentation for semantic parsing that completely removes the need for grammar engineering while achieving higher semantic parsing accuracy. Furthermore, our method can synthesize in the zero-shot setting, where only a new domain schema is available without any input-output examples of the new domain. On the Spider cross-domain text-to-SQL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
