BERT for Evidence Retrieval and Claim Verification

Amir Soleimani; Christof Monz; Marcel Worring

arXiv:1910.02655·cs.CL·October 8, 2019

BERT for Evidence Retrieval and Claim Verification

Amir Soleimani, Christof Monz, Marcel Worring

PDF

2 Repos

TL;DR

This paper explores using BERT models for evidence retrieval and claim verification in fact-checking, achieving state-of-the-art recall and high leaderboard scores in the FEVER challenge.

Contribution

The paper introduces a two-BERT model pipeline for evidence retrieval and claim verification, with novel training strategies including pointwise and pairwise loss functions and hard negative mining.

Findings

01

Achieved 87.1% recall for top five evidence retrieval

02

Scored 69.7 FEVER score, second place in leaderboard

03

Demonstrated effectiveness of BERT in fact verification tasks

Abstract

Motivated by the promising performance of pre-trained language models, we investigate BERT in an evidence retrieval and claim verification pipeline for the FEVER fact extraction and verification challenge. To this end, we propose to use two BERT models, one for retrieving potential evidence sentences supporting or rejecting claims, and another for verifying claims based on the predicted evidence sets. To train the BERT retrieval system, we use pointwise and pairwise loss functions, and examine the effect of hard negative mining. A second BERT model is trained to classify the samples as supported, refuted, and not enough information. Our system achieves a new state of the art recall of 87.1 for retrieving top five sentences out of the FEVER documents consisting of 50K Wikipedia pages, and scores second in the official leaderboard with the FEVER score of 69.7.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Layer · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Adam · WordPiece · Softmax