Feature Representation Learning for NL2SQL Generation Based on Coupling and Decoupling
Chenduo Hao, Xu Zhang, Chuanbao Gao, Deyu Zhou

TL;DR
This paper introduces the CFCDC model for NL2SQL that explicitly models correlations between SQL clauses and sub-tasks, improving accuracy by decoupling and coupling feature representations.
Contribution
The paper proposes a novel CFCDC model that explicitly decouples and couples feature representations for better NL2SQL parsing, addressing overlooked correlation features.
Findings
Significant improvements in logic precision on WikiSQL
Enhanced execution accuracy over baseline models
Effective modeling of clause correlations
Abstract
The NL2SQL task involves parsing natural language statements into SQL queries. While most state-of-the-art methods treat NL2SQL as a slot-filling task and use feature representation learning techniques, they overlook explicit correlation features between the SELECT and WHERE clauses and implicit correlation features between sub-tasks within a single clause. To address this issue, we propose the Clause Feature Correlation Decoupling and Coupling (CFCDC) model, which uses a feature representation decoupling method to separate the SELECT and WHERE clauses at the parameter level. Next, we introduce a multi-task learning architecture to decouple implicit correlation feature representation between different SQL tasks in a specific clause. Moreover, we present an improved feature representation coupling module to integrate the decoupled tasks in the SELECT and WHERE clauses and predict the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Online Learning and Analytics
