Beyond Query-Level Comparison: Fine-Grained Reinforcement Learning for Text-to-SQL with Automated Interpretable Critiques

Guifeng Wang; Yuanfeng Song; Meng Yang; Tao Zhu; Xiaoming Yin; Xing Chen

arXiv:2511.22258 · cs.CL · December 1, 2025

TL;DR

This paper introduces RuCo-C, a generative judge model that uses automated, interpretable critiques and query-specific evaluation rubrics to give text-to-SQL reinforcement learning finer-grained feedback than coarse binary rewards.

Contribution

It presents a new generative judge model that provides fine-grained, interpretable feedback and a progressive exploration strategy for RL training in text-to-SQL tasks.

Findings

1. RuCo-C outperforms existing evaluation methods in text-to-SQL.
2. The framework achieves significant performance improvements.
3. Automated, interpretable critiques enhance model training effectiveness.

Abstract

Text-to-SQL, a pivotal natural language processing (NLP) task that converts textual queries into executable SQL, has seen substantial progress in recent years. However, the evaluation and reward mechanisms used to train and assess text-to-SQL models remain a critical bottleneck. Current approaches rely heavily on manually annotated gold SQL queries, which are costly to produce and impractical for large-scale evaluation. More importantly, most reinforcement learning (RL) methods in text-to-SQL use only the final binary execution outcome as the reward signal, a coarse-grained form of supervision that overlooks the detailed structural and semantic errors a rubric-based evaluation would expose. To address these challenges, we propose RuCo-C, a novel generative judge model for fine-grained, query-specific automatic evaluation using interpretable critiques without human intervention. Our framework…
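
To make the contrast in the abstract concrete, the sketch below compares a binary execution reward with a rubric-style score on a toy SQLite database. It is an illustration only, not RuCo-C's actual reward: the function names, the schema, the rubric items, and their equal weights are all assumptions, and the rubric verdicts are hard-coded where a judge model would generate them.

```python
import sqlite3


def execution_reward(db, predicted_sql, gold_sql):
    """Binary reward: 1.0 if both queries return the same rows, else 0.0."""
    try:
        pred_rows = db.execute(predicted_sql).fetchall()
    except sqlite3.Error:
        return 0.0  # SQL that fails to execute earns no reward
    gold_rows = db.execute(gold_sql).fetchall()
    # Compare as unordered multisets of rows.
    same = sorted(pred_rows, key=repr) == sorted(gold_rows, key=repr)
    return 1.0 if same else 0.0


def rubric_reward(rubric_results):
    """Weighted fraction of rubric items passed.

    rubric_results is a list of (weight, passed) pairs. This format is a
    hypothetical stand-in for the output of a judge model.
    """
    total = sum(weight for weight, _ in rubric_results)
    if total == 0:
        return 0.0
    return sum(weight for weight, passed in rubric_results if passed) / total


# Toy database (hypothetical schema).
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE employees (name TEXT, dept TEXT, salary INTEGER)")
db.executemany(
    "INSERT INTO employees VALUES (?, ?, ?)",
    [("Ada", "eng", 120), ("Bob", "eng", 90), ("Cy", "ops", 150)],
)

gold_sql = "SELECT name FROM employees WHERE dept = 'eng' AND salary > 100"
# The prediction is mostly right but omits the salary filter.
predicted_sql = "SELECT name FROM employees WHERE dept = 'eng'"

# Hard-coded rubric verdicts for the prediction (assumed, equal weights):
# correct table, correct column, department filter, salary filter.
rubric_results = [(1.0, True), (1.0, True), (1.0, True), (1.0, False)]

binary = execution_reward(db, predicted_sql, gold_sql)
fine_grained = rubric_reward(rubric_results)
print(binary, fine_grained)
```

Under the binary reward the near-miss query scores the same as a query that does not run at all, while the rubric score credits the three parts it got right and isolates the missing filter.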

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Advanced Database Systems and Queries