CA-SQL: Complexity-Aware Inference Time Reasoning for Text-to-SQL via Exploration and Compute Budget Allocation

James Petullo; Nianwen Xue

arXiv:2605.08057·cs.CL·May 11, 2026

CA-SQL: Complexity-Aware Inference Time Reasoning for Text-to-SQL via Exploration and Compute Budget Allocation

James Petullo, Nianwen Xue

PDF

TL;DR

CA-SQL introduces a complexity-aware, exploration-driven approach for Text-to-SQL inference, dynamically allocating resources based on task difficulty to improve performance on challenging benchmarks.

Contribution

It proposes a novel pipeline that uses task difficulty estimation, evolutionary-inspired prompt seeding, and a voting mechanism to enhance LLM-based Text-to-SQL reasoning.

Findings

01

Achieves 51.72% score on challenging BIRD benchmark tasks.

02

Outperforms other in-context learning methods with smaller models.

03

Attains 61.06% execution accuracy and 68.77% Soft F1 score on BIRD dataset.

Abstract

While recent advancements in inference-time learning have improved LLM reasoning on Text-to-SQL tasks, current solutions still struggle to perform well on the most challenging tasks in the Bird-Bench (BIRD) benchmark. This is due to inadequate solution space exploration, which is necessary to uncover promising candidate queries that can be further refined to produce the correct output. To address this challenge, we introduce CA-SQL, a novel Text-to-SQL pipeline that utilizes the estimated difficulty of a task to dynamically scale the breadth of the exploration for generating solution candidates. In addition, we use a custom prompt seeding method, based on principles of evolutionary search, to further elicit exploratory behavior from the base LLM and a novel voting method to select the best candidate solution at the end of the search. Experiments demonstrate that our solution achieves a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.