FEVEROUS: Fact Extraction and VERification Over Unstructured and   Structured information

Rami Aly; Zhijiang Guo; Michael Schlichtkrull; James Thorne; Andreas; Vlachos; Christos Christodoulopoulos; Oana Cocarascu; Arpit Mittal

arXiv:2106.05707·cs.CL·October 13, 2021·52 cites

FEVEROUS: Fact Extraction and VERification Over Unstructured and Structured information

Rami Aly, Zhijiang Guo, Michael Schlichtkrull, James Thorne, Andreas, Vlachos, Christos Christodoulopoulos, Oana Cocarascu, Arpit Mittal

PDF

Open Access 1 Repo 1 Datasets

TL;DR

FEVEROUS introduces a comprehensive dataset and benchmark for fact verification that incorporates both unstructured text and structured table data from Wikipedia, addressing limitations of previous benchmarks focused solely on text.

Contribution

The paper presents a novel dataset and benchmark combining textual and tabular evidence for fact verification, along with bias mitigation efforts and a baseline model.

Findings

01

Dataset contains 87,026 verified claims with evidence from text and tables.

02

Baseline model achieves 18% accuracy in predicting correct evidence and verdict.

03

Efforts to reduce dataset biases to prevent models from exploiting superficial cues.

Abstract

Fact verification has attracted a lot of attention in the machine learning and natural language processing communities, as it is one of the key methods for detecting misinformation. Existing large-scale benchmarks for this task have focused mostly on textual sources, i.e. unstructured information, and thus ignored the wealth of information available in structured formats, such as tables. In this paper we introduce a novel dataset and benchmark, Fact Extraction and VERification Over Unstructured and Structured information (FEVEROUS), which consists of 87,026 verified claims. Each claim is annotated with evidence in the form of sentences and/or cells from tables in Wikipedia, as well as a label indicating whether this evidence supports, refutes, or does not provide enough information to reach a verdict. Furthermore, we detail our efforts to track and minimize the biases present in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Raldir/FEVEROUS
tfOfficial

Datasets

fever/feverous
dataset· 124 dl
124 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification