TL;DR
MARS-SQL introduces a multi-agent reinforcement learning framework for Text-to-SQL tasks, enabling dynamic reasoning and self-correction to improve accuracy over static prompting methods.
Contribution
It proposes a trainable multi-agent system with specialized roles and RL-based iterative reasoning, achieving state-of-the-art performance in Text-to-SQL tasks.
Findings
Achieves 77.84% execution accuracy on BIRD dataset.
Achieves 89.75% execution accuracy on Spider dataset.
Transfers effectively to out-of-domain benchmarks.
Abstract
Large Language Models (LLMs) often struggle with the precise logic and schema alignment required for complex Text-to-SQL tasks. While current methods rely heavily on static prompting, they lack the ability to dynamically adapt and self-correct through environmental interaction. To bridge this gap, we propose MARS-SQL, a trainable multi-agent framework for Text-to-SQL. Rather than introducing a new standalone SQL primitive, MARS-SQL makes an agentic workflow trainable by decomposing the problem into three specialized roles: schema grounding, query generation, and solution validation. Central to our approach is a generation agent trained via a multi-turn RL policy within a ReAct-style loop. The agent learns to iteratively reason, execute intermediate SQL actions on a live database, and refine its strategy based on execution feedback. To improve robustness, we further introduce a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
