Learning Contextual Representations for Semantic Parsing with   Generation-Augmented Pre-Training

Peng Shi; Patrick Ng; Zhiguo Wang; Henghui Zhu; Alexander Hanbo Li,; Jun Wang; Cicero Nogueira dos Santos; Bing Xiang

arXiv:2012.10309·cs.CL·December 21, 2020·6 cites

Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training

Peng Shi, Patrick Ng, Zhiguo Wang, Henghui Zhu, Alexander Hanbo Li,, Jun Wang, Cicero Nogueira dos Santos, Bing Xiang

PDF

Open Access 3 Repos 2 Models 5 Datasets 1 Video

TL;DR

This paper introduces Generation-Augmented Pre-training (GAP), a novel approach that enhances semantic parsing by jointly learning representations of language and schemas through generation models, leading to state-of-the-art results on text-to-SQL benchmarks.

Contribution

The paper proposes GAP, a pre-training framework that uses generation models to improve contextual representations for semantic parsing, addressing limitations of existing language models.

Findings

01

GAP achieves state-of-the-art results on SPIDER and CRITERIA-TO-SQL benchmarks.

02

GAP effectively detects column mentions and composes complex SQL queries.

03

Pre-training on large-scale generated data enhances semantic parser performance.

Abstract

Most recently, there has been significant interest in learning contextual representations for various NLP tasks, by leveraging large scale text corpora to train large neural language models with self-supervised learning objectives, such as Masked Language Model (MLM). However, based on a pilot study, we observe three issues of existing general-purpose language models when they are applied to text-to-SQL semantic parsers: fail to detect column mentions in the utterances, fail to infer column mentions from cell values, and fail to compose complex SQL queries. To mitigate these issues, we present a model pre-training framework, Generation-Augmented Pre-training (GAP), that jointly learns representations of natural language utterances and table schemas by leveraging generation models to generate pre-train data. GAP MODEL is trained on 2M utterance-schema pairs and 30K utterance-schema-SQL…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Datasets

Videos

Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications