TAPAS at SemEval-2021 Task 9: Reasoning over tables with intermediate pre-training
Thomas M\"uller, Julian Martin Eisenschlos, Syrine Krichene

TL;DR
This paper introduces an enhanced TAPAS model with intermediate pre-training for table reasoning tasks, improving statement verification accuracy by generating artificial neutral examples and leveraging intermediate data and large datasets.
Contribution
It presents a novel approach combining intermediate pre-training, artificial neutral example generation, and large dataset utilization to improve table-based statement verification.
Findings
Artificial neutral examples improve model training.
Intermediate pre-training enhances performance over basic MASKLM.
Achieved 68.03 test F1 score, surpassing baseline.
Abstract
We present the TAPAS contribution to the Shared Task on Statement Verification and Evidence Finding with Tables (SemEval 2021 Task 9, Wang et al. (2021)). SEM TAB FACT Task A is a classification task of recognizing if a statement is entailed, neutral or refuted by the content of a given table. We adopt the binary TAPAS model of Eisenschlos et al. (2020) to this task. We learn two binary classification models: A first model to predict if a statement is neutral or non-neutral and a second one to predict if it is entailed or refuted. As the shared task training set contains only entailed or refuted examples, we generate artificial neutral examples to train the first model. Both models are pre-trained using a MASKLM objective, intermediate counter-factual and synthetic data (Eisenschlos et al., 2020) and TABFACT (Chen et al., 2020), a large table entailment dataset. We find that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTAPAS
