FEVER: a large-scale dataset for Fact Extraction and VERification

James Thorne; Andreas Vlachos; Christos Christodoulopoulos; Arpit; Mittal

arXiv:1803.05355·cs.CL·December 19, 2018

FEVER: a large-scale dataset for Fact Extraction and VERification

James Thorne, Andreas Vlachos, Christos Christodoulopoulos, Arpit, Mittal

PDF

5 Repos 3 Models 3 Datasets

TL;DR

FEVER introduces a large-scale, challenging dataset for fact verification from Wikipedia-derived claims, aiming to advance research in automated claim verification and evidence retrieval.

Contribution

The paper presents FEVER, a new extensive dataset for fact verification, including annotated evidence, and evaluates baseline pipeline approaches to highlight its difficulty.

Findings

01

Best claim verification accuracy with evidence: 31.87%

02

Claim labeling accuracy without evidence: 50.91%

03

Dataset challenges current verification models

Abstract

In this paper we introduce a new publicly available dataset for verification against textual sources, FEVER: Fact Extraction and VERification. It consists of 185,445 claims generated by altering sentences extracted from Wikipedia and subsequently verified without knowledge of the sentence they were derived from. The claims are classified as Supported, Refuted or NotEnoughInfo by annotators achieving 0.6841 in Fleiss $κ$ . For the first two classes, the annotators also recorded the sentence(s) forming the necessary evidence for their judgment. To characterize the challenge of the dataset presented, we develop a pipeline approach and compare it to suitably designed oracles. The best accuracy we achieve on labeling a claim accompanied by the correct evidence is 31.87%, while if we ignore the evidence we achieve 50.91%. Thus we believe that FEVER is a challenging testbed that will help…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.