Analyzing the Effectiveness of Large Language Models on Text-to-SQL   Synthesis

Richard Roberson; Gowtham Kaki; Ashutosh Trivedi

arXiv:2401.12379·cs.AI·January 24, 2024·2 cites

Analyzing the Effectiveness of Large Language Models on Text-to-SQL Synthesis

Richard Roberson, Gowtham Kaki, Ashutosh Trivedi

PDF

Open Access 1 Repo

TL;DR

This paper evaluates the effectiveness of Large Language Models in Text-to-SQL synthesis, demonstrating significant accuracy improvements through fine-tuning and error correction, and categorizing common query errors to guide future enhancements.

Contribution

It introduces a combined approach of fine-tuning and error correction for LLMs in Text-to-SQL tasks, providing detailed error analysis to identify key areas for improvement.

Findings

01

Fine-tuning WizardLM-15B achieved 61% accuracy.

02

Using GPT-3.5 and GPT-4 turbo increased accuracy to 82.1%.

03

Most errors involve column selection, grouping, and query structure issues.

Abstract

This study investigates various approaches to using Large Language Models (LLMs) for Text-to-SQL program synthesis, focusing on the outcomes and insights derived. Employing the popular Text-to-SQL dataset, spider, the goal was to input a natural language question along with the database schema and output the correct SQL SELECT query. The initial approach was to fine-tune a local and open-source model to generate the SELECT query. After QLoRa fine-tuning WizardLM's WizardCoder-15B model on the spider dataset, the execution accuracy for generated queries rose to a high of 61%. With the second approach, using the fine-tuned gpt-3.5-turbo-16k (Few-shot) + gpt-4-turbo (Zero-shot error correction), the execution accuracy reached a high of 82.1%. Of all the incorrect queries, most can be categorized into a seven different categories of what went wrong: selecting the wrong columns or wrong…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cuplv/text-to-sql-wizardcoder
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Topic Modeling · Natural Language Processing Techniques