Cost Trade-offs of Reasoning and Non-Reasoning Large Language Models in Text-to-SQL
Saurabh Deochake, Debajyoti Mukhopadhyay

TL;DR
This paper analyzes the cost and efficiency trade-offs between reasoning and non-reasoning Large Language Models in Text-to-SQL tasks, revealing that reasoning models are more cost-efficient and identifying patterns that cause cost inefficiencies.
Contribution
It provides a comprehensive cost analysis of reasoning versus non-reasoning LLMs in Text-to-SQL, with deployment guidelines to reduce financial risks.
Findings
Reasoning models process 44.5% fewer bytes than non-reasoning models.
Execution time correlates weakly with query cost ($r=0.16).
Non-reasoning models show up to 3.4× cost variance and produce large outliers.
Abstract
While Text-to-SQL systems achieve high accuracy, existing efficiency metrics like the Valid Efficiency Score prioritize execution time, a metric we show is fundamentally decoupled from consumption-based cloud billing. This paper evaluates cloud query execution cost trade-offs between reasoning and non-reasoning Large Language Models by performing 180 Text-to-SQL query executions across six LLMs on Google BigQuery using the 230 GB StackOverflow dataset. Our analysis reveals that reasoning models process 44.5% fewer bytes than non-reasoning counterparts while maintaining equivalent correctness at 96.7% to 100%, and that execution time correlates weakly with query cost at , indicating that speed optimization does not imply cost efficiency. Non-reasoning models also exhibit extreme cost variance of up to 3.4, producing outliers exceeding 36 GB per query, over 20 the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Cloud Computing and Resource Management · Natural Language Processing Techniques
