SeaD: End-to-end Text-to-SQL Generation with Schema-aware Denoising
Kuan Xu, Yongbo Wang, Yongliang Wang, Zujie Wen, Yang Dong

TL;DR
This paper introduces SeaD, a schema-aware denoising approach that enhances transformer-based seq-to-seq models for text-to-SQL tasks, achieving state-of-the-art results on WikiSQL.
Contribution
It proposes a novel schema-aware denoising training method and an improved decoding strategy to better model structural data in text-to-SQL generation.
Findings
Improves schema linking and grammar correctness.
Achieves new state-of-the-art on WikiSQL.
Shows that vanilla seq-to-seq models are more capable than previously thought.
Abstract
In text-to-SQL task, seq-to-seq models often lead to sub-optimal performance due to limitations in their architecture. In this paper, we present a simple yet effective approach that adapts transformer-based seq-to-seq model to robust text-to-SQL generation. Instead of inducing constraint to decoder or reformat the task as slot-filling, we propose to train seq-to-seq model with Schema aware Denoising (SeaD), which consists of two denoising objectives that train model to either recover input or predict output from two novel erosion and shuffle noises. These denoising objectives acts as the auxiliary tasks for better modeling the structural data in S2S generation. In addition, we improve and propose a clause-sensitive execution guided (EG) decoding strategy to overcome the limitation of EG decoding for generative model. The experiments show that the proposed method improves the performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Parallel Computing and Optimization Techniques · Distributed and Parallel Computing Systems
MethodsAttentive Walk-Aggregating Graph Neural Network
