FINER-SQL: Boosting Small Language Models for Text-to-SQL

Thanh Dat Hoang; Thanh Trung Huynh; Matthias Weidlich; Thanh Tam Nguyen; Tong Chen; Hongzhi Yin; Quoc Viet Hung Nguyen

arXiv:2605.03465·cs.DB·May 6, 2026

FINER-SQL: Boosting Small Language Models for Text-to-SQL

Thanh Dat Hoang, Thanh Trung Huynh, Matthias Weidlich, Thanh Tam Nguyen, Tong Chen, Hongzhi Yin, Quoc Viet Hung Nguyen

PDF

1 Repo

TL;DR

FINER-SQL introduces a reinforcement learning framework that enhances small language models for Text-to-SQL tasks by using dense, interpretable rewards, achieving high accuracy with lower computational costs.

Contribution

It proposes a scalable, critic-free reinforcement learning method with fine-grained feedback, improving small language models' reasoning and instruction following in Text-to-SQL.

Findings

01

Achieves up to 85% execution accuracy on Spider with a 3B model.

02

Reduces inference latency to 5.57 seconds per sample.

03

Matches larger LLMs' performance at a fraction of the cost.

Abstract

Large language models have driven major advances in Text-to-SQL generation. However, they suffer from high computational cost, long latency, and data privacy concerns, which make them impractical for many real-world applications. A natural alternative is to use small language models (SLMs), which enable efficient and private on-premise deployment. Yet, SLMs often struggle with weak reasoning and poor instruction following. Conventional reinforcement learning methods based on sparse binary rewards (0/1) provide little learning signal when the generated SQLs are incorrect, leading to unstable or collapsed training. To overcome these issues, we propose FINER-SQL, a scalable and reusable reinforcement learning framework that enhances SLMs through fine-grained execution feedback. Built on group relative policy optimization, FINER-SQL replaces sparse supervision with dense and interpretable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

thanhdath/finer-sql
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.