TL;DR
This paper introduces a data-aware candidate selection method for NL2SQL translation that leverages separating instances and provenance, demonstrating significant improvements over baselines with minimal candidates.
Contribution
The authors present a novel candidate selection approach using separating instances and provenance, outperforming traditional methods in low-candidate scenarios.
Findings
Our method significantly outperforms baselines with only two or three candidates.
It shows strong performance without relying on a consistency score.
The approach is effective on a subset of BIRD-DEV.
Abstract
We propose a data-aware candidate selection method for NL2SQL translation based on separating instances and provenance. We implement this approach and evaluate it against three natural baselines on a subset of BIRD-DEV. Experiments show that our method significantly outperforms baselines when only two or three candidates are given and no consistency score is available. The code of our prototype can be found at https://github.com/staskikotx/SISelection
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
