Self-supervised Text-to-SQL Learning with Header Alignment Training

Donggyu Kim; Seanie Lee

arXiv:2103.06402·cs.CL·March 12, 2021·1 cites

Self-supervised Text-to-SQL Learning with Header Alignment Training

Donggyu Kim, Seanie Lee

PDF

Open Access

TL;DR

This paper introduces a self-supervised learning framework for Text-to-SQL tasks that leverages header-column alignment from unlabeled data to improve SQL query prediction, especially with limited labeled data.

Contribution

It proposes a novel self-supervised training method utilizing table structure for header-column alignment, enhancing supervised Text-to-SQL models without external corpora.

Findings

01

Significant performance improvements on existing BERT-based models.

02

Effective training with scarce labeled data.

03

No need for large external datasets.

Abstract

Since we can leverage a large amount of unlabeled data without any human supervision to train a model and transfer the knowledge to target tasks, self-supervised learning is a de-facto component for the recent success of deep learning in various fields. However, in many cases, there is a discrepancy between a self-supervised learning objective and a task-specific objective. In order to tackle such discrepancy in Text-to-SQL task, we propose a novel self-supervised learning framework. We utilize the task-specific properties of Text-to-SQL task and the underlying structures of table contents to train the models to learn useful knowledge of the \textit{header-column} alignment task from unlabeled table data. We are able to transfer the knowledge to the supervised Text-to-SQL training with annotated samples, so that the model can leverage the knowledge to better perform the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Web Data Mining and Analysis

MethodsLinear Layer · Residual Connection · Refunds@Expedia|||How do I get a full refund from Expedia? · Linear Warmup With Linear Decay · Weight Decay · Multi-Head Attention · Dense Connections · Softmax · Layer Normalization · Attention Dropout