TL;DR
XiYan-SQL is a multi-generator ensemble framework for Text-to-SQL. It filters the schema, generates diverse SQL candidates, and selects the best query, reporting state-of-the-art results on BIRD and Spider.
Contribution
It introduces a multi-generator ensemble with a schema filter and candidate selection, enhancing SQL generation quality and robustness over previous methods.
Findings
Achieves 75.63% on the BIRD benchmark, setting a new state of the art.
Attains 89.65% accuracy on Spider test set, surpassing previous methods.
Demonstrates robustness and effectiveness through comprehensive analysis.
Abstract
To leverage the advantages of LLMs in addressing challenges in the Text-to-SQL task, we present XiYan-SQL, an innovative framework that effectively generates and utilizes multiple SQL candidates. It consists of three components: 1) a Schema Filter module that filters the schema and obtains multiple relevant schemas; 2) a multi-generator ensemble approach that generates multiple high-quality and diverse SQL queries; 3) a selection model with a candidate reorganization strategy that obtains the optimal SQL query. Specifically, for the multi-generator ensemble, we employ a multi-task fine-tuning strategy to enhance the capabilities of SQL generation models for the intrinsic alignment between SQL and text, and construct multiple generation models with distinct generation styles by fine-tuning across different SQL formats. The experimental results and comprehensive analysis demonstrate the effectiveness…
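The three-stage flow described in the abstract (filter the schema, generate several candidates, select one) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the keyword-based `schema_filter`, the hard-coded lambda "generators" standing in for fine-tuned models, and the majority vote over execution results, which replaces the paper's trained selection model and candidate reorganization strategy. None of these names come from the XiYan-SQL code.

```python
import sqlite3
from collections import defaultdict


def schema_filter(question, schema):
    """Toy filter (assumption): keep tables whose name or a column name occurs in the question."""
    q = question.lower()
    kept = {t: cols for t, cols in schema.items()
            if t in q or any(c in q for c in cols)}
    # Return a narrow view and the full schema, so generators see several variants.
    return [kept or schema, schema]


def group_by_result(conn, candidates):
    """Execute each candidate and group the ones that return identical rows."""
    groups = defaultdict(list)
    for sql in candidates:
        try:
            rows = conn.execute(sql).fetchall()
        except sqlite3.Error:
            continue  # drop candidates that fail to execute
        groups[tuple(sorted(rows, key=repr))].append(sql)
    return groups


def select_best(groups):
    """Stand-in selector (assumption): majority vote over execution results."""
    if not groups:
        return None
    return max(groups.values(), key=len)[0]


def pipeline(conn, question, schema, generators):
    """Filter the schema, collect one candidate per (schema view, generator), select one."""
    candidates = [gen(question, view)
                  for view in schema_filter(question, schema)
                  for gen in generators]
    return select_best(group_by_result(conn, candidates))


# Hypothetical generators: fixed outputs in place of fine-tuned SQL models.
generators = [
    lambda q, s: "SELECT COUNT(*) FROM singer WHERE age > 30",
    lambda q, s: "SELECT COUNT(singer_id) FROM singer WHERE age > 30",
    lambda q, s: "SELECT COUNT(*) FROM singer WHERE agee > 30",  # invalid column, dropped
    lambda q, s: "SELECT COUNT(*) FROM singer",                  # runs, but minority answer
]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE singer (singer_id INTEGER, name TEXT, age INTEGER)")
conn.executemany("INSERT INTO singer VALUES (?, ?, ?)",
                 [(1, "A", 25), (2, "B", 35), (3, "C", 41)])

schema = {"singer": ["singer_id", "name", "age"], "concert": ["concert_id", "year"]}
question = "How many singers are older than 30?"

best = pipeline(conn, question, schema, generators)
print(best)
```

In this toy run, two generators agree on the same execution result, one fails to execute, and one returns a different count, so the vote keeps a query from the agreeing pair. A learned selector, as the abstract describes, would not need to rely on agreement alone.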
Code & Models
- 🤗 XGenerationLab/XiYanSQL-QwenCoder-3B-2502 (model) · 25 dl · ♡ 5
- 🤗 XGenerationLab/XiYanSQL-QwenCoder-7B-2502 (model) · 15 dl · ♡ 2
- 🤗 XGenerationLab/XiYanSQL-QwenCoder-14B-2502 (model) · 11 dl · ♡ 3
- 🤗 XGenerationLab/XiYanSQL-QwenCoder-32B-2412 (model) · 57 dl · ♡ 16
- 🤗 XGenerationLab/XiYanSQL-QwenCoder-32B-2504 (model) · 99 dl · ♡ 17
- 🤗 XGenerationLab/XiYanSQL-QwenCoder-14B-2504 (model) · 328 dl · ♡ 2
- 🤗 XGenerationLab/XiYanSQL-QwenCoder-7B-2504 (model) · 2.5k dl · ♡ 10
- 🤗 XGenerationLab/XiYanSQL-QwenCoder-3B-2504 (model) · 1.1k dl · ♡ 8
