Table Retrieval May Not Necessitate Table-specific Model Design
Zhiruo Wang, Zhengbao Jiang, Eric Nyberg, Graham Neubig

TL;DR
This paper investigates whether specialized table models are necessary for table retrieval tasks, finding that simple text-based models perform comparably or better than complex, table-specific models, with structure often playing a negligible role.
Contribution
The study demonstrates that generic text-based retrieval models can effectively handle table retrieval, challenging the need for complex, table-specific model designs.
Findings
Text-based DPR performs well without table-specific design.
Table structure plays a negligible role in over 70% of cases.
Adding explicit table structure encoding does not significantly improve performance.
Abstract
Tables are an important form of structured data for both human and machine readers alike, providing answers to questions that cannot, or cannot easily, be found in texts. Recent work has designed special models and training paradigms for table-related tasks such as table-based question answering and table retrieval. Though effective, they add complexity in both modeling and data acquisition compared to generic text solutions and obscure which elements are truly beneficial. In this work, we focus on the task of table retrieval, and ask: "is table-specific model design necessary for table retrieval, or can a simpler text-based model be effectively used to achieve a similar result?" First, we perform an analysis on a table-based portion of the Natural Questions dataset (NQ-table), and find that structure plays a negligible role in more than 70% of the cases. Based on this, we experiment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · Text and Document Classification Technologies
