TL;DR
This paper introduces GTR, a graph-based framework for natural language table retrieval that effectively handles complex table layouts and improves generalization across datasets using multi-granular graph representations and self-supervised pre-training.
Contribution
The paper proposes a novel graph-based approach with multi-granular representations and self-supervised learning to improve table retrieval accuracy and generalization beyond existing methods.
Findings
Significant improvements over state-of-the-art on benchmark datasets.
Enhanced cross-dataset generalization capabilities.
Better handling of complex table structures and diverse queries.
Abstract
The task of natural language table retrieval (NLTR) seeks to retrieve semantically relevant tables based on natural language queries. Existing learning systems for this task often treat tables as plain text based on the assumption that tables are structured as dataframes. However, tables can have complex layouts which indicate diverse dependencies between subtable structures, such as nested headers. As a result, queries may refer to different spans of relevant content that is distributed across these structures. Moreover, such systems fail to generalize to novel scenarios beyond those seen in the training set. Prior methods are still distant from a generalizable solution to the NLTR problem, as they fall short in handling complex table layouts or queries over multiple granularities. To address these issues, we propose Graph-based Table Retrieval (GTR), a generalizable NLTR framework…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Adam · Layer Normalization · Label Smoothing · Byte Pair Encoding · Residual Connection
