A Survey on Employing Large Language Models for Text-to-SQL Tasks

Liang Shi; Zhengju Tang; Nan Zhang; Xiaotong Zhang; Zhi Yang

arXiv:2407.15186·cs.CL·June 4, 2025

A Survey on Employing Large Language Models for Text-to-SQL Tasks

Liang Shi, Zhengju Tang, Nan Zhang, Xiaotong Zhang, Zhi Yang

PDF

TL;DR

This survey reviews the development, benchmarks, and methods of employing Large Language Models for Text-to-SQL tasks, highlighting current approaches, evaluation metrics, and future challenges.

Contribution

It provides a comprehensive taxonomy of prompt engineering and finetuning methods for LLM-based Text2SQL, along with analysis of models and datasets.

Findings

01

Overview of classic benchmarks and evaluation metrics

02

Taxonomy of prompt engineering and finetuning methods

03

Discussion of challenges and future directions

Abstract

With the development of the Large Language Models (LLMs), a large range of LLM-based Text-to-SQL(Text2SQL) methods have emerged. This survey provides a comprehensive review of LLM-based Text2SQL studies. We first enumerate classic benchmarks and evaluation metrics. For the two mainstream methods, prompt engineering and finetuning, we introduce a comprehensive taxonomy and offer practical insights into each subcategory. We present an overall analysis of the above methods and various models evaluated on well-known datasets and extract some characteristics. Finally, we discuss the challenges and future directions in this field.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsBalanced Selection · Focus