DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with   Self-Correction

Mohammadreza Pourreza; Davood Rafiei

arXiv:2304.11015·cs.CL·November 6, 2023·58 cites

DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with Self-Correction

Mohammadreza Pourreza, Davood Rafiei

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces DIN-SQL, a decomposed in-context learning approach for text-to-SQL tasks that significantly improves LLM performance by breaking down queries into sub-tasks, achieving state-of-the-art results on multiple benchmarks.

Contribution

The paper proposes a novel decomposed in-context learning method for text-to-SQL that enhances LLM reasoning and surpasses existing fine-tuned models in accuracy.

Findings

01

Improves LLM performance by roughly 10% on Spider dataset

02

Achieves new SOTA accuracy of 85.3% on Spider test set

03

Sets a new SOTA of 55.9% on BIRD benchmark

Abstract

There is currently a significant gap between the performance of fine-tuned models and prompting approaches using Large Language Models (LLMs) on the challenging task of text-to-SQL, as evaluated on datasets such as Spider. To improve the performance of LLMs in the reasoning process, we study how decomposing the task into smaller sub-tasks can be effective. In particular, we show that breaking down the generation problem into sub-problems and feeding the solutions of those sub-problems into LLMs can be an effective approach for significantly improving their performance. Our experiments with three LLMs show that this approach consistently improves their simple few-shot performance by roughly 10%, pushing the accuracy of LLMs towards SOTA or surpassing it. On the holdout test set of Spider, the SOTA, in terms of execution accuracy, was 79.9 and the new SOTA at the time of this writing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mohammadrezapourreza/few-shot-nl2sql-with-prompting
noneOfficial

Videos

DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with Self-Correction· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management