AGRO-SQL: Agentic Group-Relative Optimization with High-Fidelity Data Synthesis
Cehua Yang, Dongyu Xiao, Junming Lin, Yuyang Song, Hanxu Yan, Shawn Guo, Wei Zhang, Jian Yang, Mingjie Tang, Bryan Dai

TL;DR
This paper introduces AGRO-SQL, a comprehensive framework combining high-fidelity data synthesis and an agentic reinforcement learning approach to significantly improve Text-to-SQL systems in complex reasoning tasks.
Contribution
It presents a novel dual-centric framework that integrates high-quality data generation with an agentic RL method, advancing reasoning capabilities in Text-to-SQL models.
Findings
Achieves state-of-the-art results on BIRD and Spider benchmarks.
Demonstrates the effectiveness of high-fidelity data synthesis.
Shows improved reasoning in complex SQL queries.
Abstract
The advancement of Text-to-SQL systems is currently hindered by the scarcity of high-quality training data and the limited reasoning capabilities of models in complex scenarios. In this paper, we propose a holistic framework that addresses these issues through a dual-centric approach. From a Data-Centric perspective, we construct an iterative data factory that synthesizes RL-ready data characterized by high correctness and precise semantic-logic alignment, ensured by strict verification. From a Model-Centric perspective, we introduce a novel Agentic Reinforcement Learning framework. This framework employs a Diversity-Aware Cold Start stage to initialize a robust policy, followed by Group Relative Policy Optimization (GRPO) to refine the agent's reasoning via environmental feedback. Extensive experiments on BIRD and Spider benchmarks demonstrate that our synergistic approach achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Constraint Satisfaction and Optimization · Cloud Computing and Resource Management
