TURL: Table Understanding through Representation Learning
Xiang Deng, Huan Sun, Alyssa Lees, You Wu, Cong Yu

TL;DR
TURL introduces a pre-training and fine-tuning framework for relational Web tables, leveraging a structure-aware Transformer and a novel Masked Entity Recovery objective to improve performance across multiple table understanding tasks.
Contribution
The paper presents a universal pre-trained model for table understanding that requires minimal task-specific adjustments, advancing beyond heavily-engineered prior methods.
Findings
Outperforms existing methods on 6 table understanding tasks
Generalizes well across diverse table-related tasks
Uses a novel Masked Entity Recovery pre-training objective
Abstract
Relational tables on the Web store a vast amount of knowledge. Owing to the wealth of such tables, there has been tremendous progress on a variety of tasks in the area of table understanding. However, existing work generally relies on heavily-engineered task-specific features and model architectures. In this paper, we present TURL, a novel framework that introduces the pre-training/fine-tuning paradigm to relational Web tables. During pre-training, our framework learns deep contextualized representations on relational tables in an unsupervised manner. Its universal model design with pre-trained representations can be applied to a wide range of tasks with minimal task-specific fine-tuning. Specifically, we propose a structure-aware Transformer encoder to model the row-column structure of relational tables, and present a new Masked Entity Recovery (MER) objective for pre-training to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Topic Modeling · Machine Learning in Healthcare
MethodsLinear Layer · TURL: Table Understanding through Representation Learning · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Label Smoothing · Multi-Head Attention · Adam · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout
