MCI-SQL: Text-to-SQL with Metadata-Complete Context and Intermediate Correction
Qin Wang, Youhuan Li, Suixi Lin, Zhuo Tang, Kenli Li, Peng Peng, Quanqing Xu, Chuanhui Yang

TL;DR
MCI-SQL introduces a novel framework for text-to-SQL translation that enhances accuracy through metadata-complete context assignment and intermediate correction, outperforming existing methods on the BIRD benchmark.
Contribution
The paper proposes a new framework with metadata-complete context and intermediate correction mechanisms to improve text-to-SQL translation accuracy.
Findings
Achieves 76.41% execution accuracy on BIRD test set.
Outperforms state-of-the-art methods by over 8 percentage points on BIRD-clear.
Manually corrected 412 samples in BIRD dataset, creating BIRD-clear.
Abstract
Text-to-SQL aims to translate natural language queries into SQL statements. Existing methods typically follow a pipeline of pre-processing, schema linking, candidate SQL generation, SQL alignment, and target SQL selection. However, these methods face significant challenges. First, they often struggle with column filtering during schema linking due to difficulties in comprehending raw metadata. Also, the candidate SQL generation process often suffers from reasoning errors, which limits accuracy improvements. To address these limitations, we propose a framework, called MCI-SQL, to efficiently and precisely generate SQL queries. Specifically, we assign metadata-complete contexts to each column, which significantly improves the accuracy of column filtering for schema linking. Also, for candidate SQL generation, we propose an intermediate correction mechanism that validates SQL queries and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Advanced Database Systems and Queries · Semantic Web and Ontologies
