TabFact: A Large-scale Dataset for Table-based Fact Verification
Wenhu Chen, Hongmin Wang, Jianshu Chen, Yunkai Zhang, Hong Wang,, Shiyang Li, Xiyou Zhou, William Yang Wang

TL;DR
This paper introduces TabFact, a large-scale dataset for fact verification using semi-structured table evidence, and proposes models to address the reasoning challenges involved.
Contribution
It creates a new dataset for table-based fact verification and develops two models, Table-BERT and LPA, to handle linguistic and symbolic reasoning tasks.
Findings
Models achieve comparable accuracy but are still below human performance.
The dataset enables exploration of reasoning over semi-structured data.
Analysis highlights future research directions.
Abstract
The problem of verifying whether a textual hypothesis holds based on the given evidence, also known as fact verification, plays an important role in the study of natural language understanding and semantic representation. However, existing studies are mainly restricted to dealing with unstructured evidence (e.g., natural language sentences and documents, news, etc), while verification under structured evidence, such as tables, graphs, and databases, remains under-explored. This paper specifically aims to study the fact verification given semi-structured data as evidence. To this end, we construct a large-scale dataset called TabFact with 16k Wikipedia tables as the evidence for 118k human-annotated natural language statements, which are labeled as either ENTAILED or REFUTED. TabFact is challenging since it involves both soft linguistic reasoning and hard symbolic reasoning. To address…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Topic Modeling · Time Series Analysis and Forecasting
