TL;DR
CRAFT introduces a zero-shot cascaded retrieval method for open-domain table question answering, combining sparse filtering with enriched semantic representations to outperform existing models without retraining.
Contribution
It presents a scalable, training-free retrieval approach that enhances semantic matching for tabular QA, bridging the gap between fine-tuned models and lightweight systems.
Findings
Outperforms state-of-the-art retrievers on NQ-Tables dataset
Achieves strong zero-shot performance on OTT-QA benchmark
Enriches table representations with generated titles and summaries
Abstract
Open-Domain Table Question Answering (TQA) involves retrieving relevant tables from a large corpus to answer natural language queries. Traditional dense retrieval models such as DTR and DPR incur high computational costs for large-scale retrieval tasks and require retraining or fine-tuning on new datasets, limiting their adaptability to evolving domains and knowledge. We propose CRAFT, a zero-shot cascaded retrieval approach that first uses a sparse retrieval model to filter a subset of candidate tables before applying more computationally expensive dense models as re-rankers. To improve retrieval quality, we enrich table representations with descriptive titles and summaries generated by Gemini Flash 1.5, enabling richer semantic matching between queries and tabular structures. Our method outperforms state-of-the-art sparse, dense, and hybrid retrievers on the NQ-Tables dataset. It…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
