SADGA: Structure-Aware Dual Graph Aggregation Network for Text-to-SQL
Ruichu Cai, Jinjie Yuan, Boyan Xu, Zhifeng Hao

TL;DR
SADGA introduces a structure-aware dual graph aggregation network that enhances cross-domain Text-to-SQL translation by unifying encoding and improving question-schema mapping, achieving competitive results on the Spider benchmark.
Contribution
The paper proposes a novel structure-aware dual graph aggregation network that unifies encoding of questions and schemas and improves cross-domain generalization in Text-to-SQL.
Findings
Achieved 3rd place on the Spider benchmark.
Demonstrated improved generalization to unseen schemas.
Validated effectiveness through empirical studies.
Abstract
The Text-to-SQL task, aiming to translate the natural language of the questions into SQL queries, has drawn much attention recently. One of the most challenging problems of Text-to-SQL is how to generalize the trained model to the unseen database schemas, also known as the cross-domain Text-to-SQL task. The key lies in the generalizability of (i) the encoding method to model the question and the database schema and (ii) the question-schema linking method to learn the mapping between words in the question and tables/columns in the database schema. Focusing on the above two key issues, we propose a Structure-Aware Dual Graph Aggregation Network (SADGA) for cross-domain Text-to-SQL. In SADGA, we adopt the graph structure to provide a unified encoding model for both the natural language question and database schema. Based on the proposed unified modeling, we further devise a structure-aware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
