STAR: SQL Guided Pre-Training for Context-dependent Text-to-SQL Parsing
Zefeng Cai, Xiangyu Li, Binyuan Hui, Min Yang, Bowen Li, Binhua Li,, Zheng Cao, Weijie Li, Fei Huang, Luo Si, Yongbin Li

TL;DR
This paper introduces STAR, a SQL guided pre-training framework that leverages context-dependent interactions and novel objectives to improve text-to-SQL parsing, achieving state-of-the-art results on benchmark datasets.
Contribution
STAR is the first pre-training framework to incorporate schema state tracking and utterance dependency tracking for context-dependent text-to-SQL parsing.
Findings
Achieves new state-of-the-art on SParC and CoSQL benchmarks.
Outperforms previous pre-training methods significantly.
Provides a large-scale corpus and code for future research.
Abstract
In this paper, we propose a novel SQL guided pre-training framework STAR for context-dependent text-to-SQL parsing, which leverages contextual information to enrich natural language (NL) utterance and table schema representations for text-to-SQL conversations. Concretely, we propose two novel pre-training objectives which respectively explore the context-dependent interactions of NL utterances and SQL queries within each text-to-SQL conversation: (i) schema state tracking (SST) objective that tracks and explores the schema states of context-dependent SQL queries in the form of schema-states by predicting and updating the value of each schema slot during interaction; (ii) utterance dependency tracking (UDT) objective that employs weighted contrastive learning to pull together two semantically similar NL utterances and push away the representations of semantically dissimilar NL utterances…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
MethodsContrastive Learning
