Schema-R1: A reasoning training approach for schema linking in Text-to-SQL Task
Wuzhenghong Wen, Su Pan, yuwei Sun

TL;DR
Schema-R1 introduces a reinforcement learning-based training approach for schema linking in Text-to-SQL tasks, significantly improving reasoning ability and accuracy over traditional fine-tuning methods.
Contribution
The paper presents a novel reinforcement learning framework for schema linking, addressing the limitations of rote-learning approaches and enhancing reasoning capabilities.
Findings
Achieves 10% improvement in filter accuracy.
Effectively enhances reasoning ability in schema linking.
Uses high-quality reasoning samples and rule-based RL training.
Abstract
Schema linking is a critical step in Text-to-SQL task, aiming to accurately predict the table names and column names required for the SQL query based on the given question. However, current fine-tuning approaches for schema linking models employ a rote-learning paradigm, excessively optimizing for ground truth schema linking outcomes while compromising reasoning ability. This limitation arises because of the difficulty in acquiring a high-quality reasoning sample for downstream tasks. To address this, we propose Schema-R1, a reasoning schema linking model trained using reinforcement learning. Specifically, Schema-R1 consists of three key steps: constructing small batches of high-quality reasoning samples, supervised fine-tuning for cold-start initialization, and rule-based reinforcement learning training. The final results demonstrate that our method effectively enhances the reasoning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Advanced Database Systems and Queries
