TARGET: Benchmarking Table Retrieval for Generative Tasks
Xingyu Ji, Parker Glenn, Aditya G. Parameswaran, Madelon Hulsebos

TL;DR
TARGET is a benchmark designed to evaluate how effectively different retrieval methods can identify relevant tables for generative tasks involving structured data, highlighting the superiority of dense embedding retrievers over traditional methods.
Contribution
The paper introduces TARGET, a new benchmark for assessing table retrieval methods in generative tasks, and provides a comprehensive analysis of different retrievers' performance.
Findings
Dense embedding retrievers outperform BM25 baseline.
Retrieval performance varies significantly across datasets and tasks.
Retrievers are sensitive to metadata quality, such as missing table titles.
Abstract
The data landscape is rich with structured data, often of high value to organizations, driving important applications in data analysis and machine learning. Recent progress in representation learning and generative models for such data has led to the development of natural language interfaces to structured data, including those leveraging text-to-SQL. Contextualizing interactions, either through conversational interfaces or agentic components, in structured data through retrieval-augmented generation can provide substantial benefits in the form of freshness, accuracy, and comprehensiveness of answers. The key question is: how do we retrieve the right table(s) for the analytical query or task at hand? To this end, we introduce TARGET: a benchmark for evaluating TAble Retrieval for GEnerative Tasks. With TARGET we analyze the retrieval performance of different retrievers in isolation, as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Information Retrieval and Search Behavior · Data Quality and Management
