TableFormer: Robust Transformer Modeling for Table-Text Encoding
Jingfeng Yang, Aditya Gupta, Shyam Upadhyay, Luheng He, Rahul Goel,, Shachi Paul

TL;DR
TableFormer is a novel transformer-based model that robustly encodes table-text data by incorporating structural biases, making it invariant to row and column order changes and improving reasoning performance on multiple datasets.
Contribution
The paper introduces TableFormer, a structurally aware transformer that is invariant to table row and column order, enhancing robustness and understanding in table-text encoding tasks.
Findings
Outperforms baselines on SQA, WTQ, and TabFact datasets.
Achieves state-of-the-art on SQA with 6% improvement under order perturbations.
Maintains performance despite row and column order changes.
Abstract
Understanding tables is an important aspect of natural language understanding. Existing models for table understanding require linearization of the table structure, where row or column order is encoded as an unwanted bias. Such spurious biases make the model vulnerable to row and column order perturbations. Additionally, prior work has not thoroughly modeled the table structures or table-text alignments, hindering the table-text understanding ability. In this work, we propose a robust and structurally aware table-text encoding architecture TableFormer, where tabular structural biases are incorporated completely through learnable attention biases. TableFormer is (1) strictly invariant to row and column orders, and, (2) could understand tables better due to its tabular inductive biases. Our evaluations showed that TableFormer outperforms strong baselines in all settings on SQA, WTQ and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · Natural Language Processing Techniques
MethodsAttentive Walk-Aggregating Graph Neural Network
