DeTriever: Decoder-representation-based Retriever for Improving NL2SQL In-Context Learning
Yuxi Feng, Raymond Li, Zhenan Fan, Giuseppe Carenini, Mohammadreza, Pourreza, Weiwei Zhang, Yong Zhang

TL;DR
DeTriever is a novel retrieval framework that enhances NL2SQL in-context learning by learning weighted combinations of LLM hidden states, leading to significant performance improvements over existing methods.
Contribution
It introduces a new demonstration retrieval method that better captures semantic information and estimates example benefits, addressing limitations of prior retrieval approaches.
Findings
Outperforms state-of-the-art baselines on NL2SQL benchmarks
Significantly improves one-shot NL2SQL task performance
Uses a proxy score based on output query similarities
Abstract
While in-context Learning (ICL) has proven to be an effective technique to improve the performance of Large Language Models (LLMs) in a variety of complex tasks, notably in translating natural language questions into Structured Query Language (NL2SQL), the question of how to select the most beneficial demonstration examples remains an open research problem. While prior works often adapted off-the-shelf encoders to retrieve examples dynamically, an inherent discrepancy exists in the representational capacities between the external retrievers and the LLMs. Further, optimizing the selection of examples is a non-trivial task, since there are no straightforward methods to assess the relative benefits of examples without performing pairwise inference. To address these shortcomings, we propose DeTriever, a novel demonstration retrieval framework that learns a weighted combination of LLM hidden…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies
