FEVEROUS: Fact Extraction and VERification Over Unstructured and Structured information
Rami Aly, Zhijiang Guo, Michael Schlichtkrull, James Thorne, Andreas, Vlachos, Christos Christodoulopoulos, Oana Cocarascu, Arpit Mittal

TL;DR
FEVEROUS introduces a comprehensive dataset and benchmark for fact verification that incorporates both unstructured text and structured table data from Wikipedia, addressing limitations of previous benchmarks focused solely on text.
Contribution
The paper presents a novel dataset and benchmark combining textual and tabular evidence for fact verification, along with bias mitigation efforts and a baseline model.
Findings
Dataset contains 87,026 verified claims with evidence from text and tables.
Baseline model achieves 18% accuracy in predicting correct evidence and verdict.
Efforts to reduce dataset biases to prevent models from exploiting superficial cues.
Abstract
Fact verification has attracted a lot of attention in the machine learning and natural language processing communities, as it is one of the key methods for detecting misinformation. Existing large-scale benchmarks for this task have focused mostly on textual sources, i.e. unstructured information, and thus ignored the wealth of information available in structured formats, such as tables. In this paper we introduce a novel dataset and benchmark, Fact Extraction and VERification Over Unstructured and Structured information (FEVEROUS), which consists of 87,026 verified claims. Each claim is annotated with evidence in the form of sentences and/or cells from tables in Wikipedia, as well as a label indicating whether this evidence supports, refutes, or does not provide enough information to reach a verdict. Furthermore, we detail our efforts to track and minimize the biases present in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
