Bertrand-DR: Improving Text-to-SQL using a Discriminative Re-ranker
Amol Kelkar, Rohan Relan, Vaishali Bhardwaj, Saurabh Vaichal, Chandra, Khatri, Peter Relan

TL;DR
This paper introduces Bertrand-DR, a discriminative re-ranker that enhances generative text-to-SQL models by selecting the best query from beam outputs, significantly improving performance on complex queries.
Contribution
The paper presents a novel schema-agnostic BERT-based re-ranker that boosts generative text-to-SQL models by effectively selecting optimal queries from candidate lists.
Findings
Achieved top 4 score on the Spider leaderboard.
Improved performance when the correct query is in the candidate list.
Effective combination of generative models and re-rankers enhances accuracy.
Abstract
To access data stored in relational databases, users need to understand the database schema and write a query using a query language such as SQL. To simplify this task, text-to-SQL models attempt to translate a user's natural language question to corresponding SQL query. Recently, several generative text-to-SQL models have been developed. We propose a novel discriminative re-ranker to improve the performance of generative text-to-SQL models by extracting the best SQL query from the beam output predicted by the text-to-SQL generator, resulting in improved performance in the cases where the best query was in the candidate list, but not at the top of the list. We build the re-ranker as a schema agnostic BERT fine-tuned classifier. We analyze relative strengths of the text-to-SQL and re-ranker models across different query hardness levels, and suggest how to combine the two models for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
MethodsLinear Layer · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Adam · WordPiece · Softmax
