SQL-Trail: Multi-Turn Reinforcement Learning with Interleaved Feedback for Text-to-SQL

Harper Hua; Zhen Han; Zhengyuan Shen; Jeremy Lee; Patrick Guan; Qi Zhu; Sullam Jeoung; Yueyan Chen; Yunfei Bai; Shuai Wang; Vassilis Ioannidis; Huzefa Rangwala

arXiv:2601.17699 · cs.AI · January 27, 2026

TL;DR

SQL-Trail introduces a multi-turn reinforcement learning framework for Text-to-SQL that iteratively refines queries through interaction and feedback, significantly improving accuracy and efficiency over traditional single-pass models.

Contribution

It presents a novel multi-turn RL approach with adaptive interaction and a composite reward, advancing state-of-the-art performance in Text-to-SQL tasks.

Findings

01. Sets new state-of-the-art results on benchmarks.

02. Achieves up to 18x higher data efficiency.

03. Outperforms larger proprietary models by 5% on average.

Abstract

While large language models (LLMs) have substantially improved Text-to-SQL generation, a pronounced gap remains between AI systems and human experts on challenging benchmarks such as BIRD-SQL. We argue this gap stems largely from the prevailing single-pass paradigm, which lacks the iterative reasoning, schema exploration, and error-correction behaviors that humans naturally employ. To address this limitation, we introduce SQL-Trail, a multi-turn reinforcement learning (RL) agentic framework for Text-to-SQL. Rather than producing a query in one shot, SQL-Trail interacts with the database environment and uses execution feedback to iteratively refine its predictions. Our approach centers on two key ideas: (i) an adaptive turn-budget allocation mechanism that scales the agent's interaction depth to match question difficulty, and (ii) a composite reward panel that jointly incentivizes SQL…
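The interaction pattern the abstract describes (propose a query, execute it against the database, feed the result back, and try again within a turn budget) can be illustrated with a minimal sketch. This is not the authors' implementation: `refine_loop`, `execute_sql`, and `scripted_policy` are hypothetical names, the scripted policy stands in for the learned model, and stopping at the first query that executes without error is a simplifying assumption. How SQL-Trail chooses the turn budget and computes its reward is not specified in the text above, so `max_turns` is simply passed in.

```python
import sqlite3
from typing import Callable, List, Optional, Tuple

# One turn of history: (SQL that was tried, feedback from executing it).
Turn = Tuple[str, str]


def execute_sql(conn: sqlite3.Connection, sql: str) -> Tuple[bool, str]:
    """Run a query and return (succeeded, feedback text for the next turn)."""
    try:
        rows = conn.execute(sql).fetchall()
        return True, repr(rows)
    except sqlite3.Error as exc:
        return False, f"error: {exc}"


def refine_loop(
    conn: sqlite3.Connection,
    question: str,
    propose: Callable[[str, List[Turn]], str],
    max_turns: int,
) -> Tuple[Optional[str], List[Turn]]:
    """Propose, execute, and refine a query for at most max_turns turns.

    Assumption: the loop stops at the first query that executes cleanly.
    A trained agent could keep exploring the schema before answering.
    """
    history: List[Turn] = []
    for _ in range(max_turns):
        sql = propose(question, history)
        ok, feedback = execute_sql(conn, sql)
        history.append((sql, feedback))
        if ok:
            return sql, history
    return None, history


def scripted_policy(question: str, history: List[Turn]) -> str:
    """Hypothetical stand-in for the model: wrong table first, then a fix."""
    if not history:
        return "SELECT name FROM user"  # table does not exist
    return "SELECT name FROM users ORDER BY name"


# Toy in-memory database for the demonstration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (name TEXT)")
conn.executemany("INSERT INTO users VALUES (?)", [("ada",), ("bob",)])

final_sql, history = refine_loop(
    conn, "List all user names.", scripted_policy, max_turns=3
)
for turn_sql, turn_feedback in history:
    print(turn_sql, "->", turn_feedback)
```

In this toy run the first turn fails because the table name is wrong, the error message is appended to the history, and the second turn uses it to issue a corrected query. A single-pass system would have returned the failing query.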

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Scientific Computing and Data Management