PExA: Parallel Exploration Agent for Complex Text-to-SQL
Tanmay Parekh, Ella Hofmann-Coyle, Shuyi Wang, Sachith Sri Ram Kothur, Srivas Prasad, Yunmo Chen

TL;DR
PExA introduces a parallel exploration framework for text-to-SQL tasks that improves accuracy by using test case coverage to ground SQL generation, balancing performance and latency.
Contribution
The paper presents a novel parallel exploration approach for text-to-SQL that leverages test case coverage to enhance accuracy and efficiency.
Findings
Achieved 70.2% execution accuracy on Spider 2.0 benchmark.
Reformulated text-to-SQL as a test coverage problem with parallel execution.
Outperformed previous state-of-the-art methods in accuracy.
Abstract
LLM-based agents for text-to-SQL often struggle with latency-performance trade-off, where performance improvements come at the cost of latency or vice versa. We reformulate text-to-SQL generation within the lens of software test coverage where the original query is prepared with a suite of test cases with simpler, atomic SQLs that are executed in parallel and together ensure semantic coverage of the original query. After iterating on test case coverage, the final SQL is generated only when enough information is gathered, leveraging the explored test case SQLs to ground the final generation. We validated our framework on a state-of-the-art benchmark for text-to-SQL, Spider 2.0, achieving a new state-of-the-art with 70.2% execution accuracy.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
