Give Me More Details: Improving Fact-Checking with Latent Retrieval

Xuming Hu; Junzhe Chen; Zhijiang Guo; Philip S. Yu

arXiv:2305.16128·cs.CL·January 30, 2024·2 cites

Give Me More Details: Improving Fact-Checking with Latent Retrieval

Xuming Hu, Junzhe Chen, Zhijiang Guo, Philip S. Yu

PDF

Open Access

TL;DR

This paper introduces a latent variable model that leverages full source documents for fact-checking, improving evidence extraction and claim verification beyond traditional snippet-based methods.

Contribution

It proposes a novel latent retrieval approach that jointly extracts evidence and verifies claims using full text documents, along with new multilingual and monolingual datasets.

Findings

01

Including source documents enhances evidence retrieval.

02

The model outperforms existing methods in claim verification.

03

Significant improvements are achieved across various settings.

Abstract

Evidence plays a crucial role in automated fact-checking. When verifying real-world claims, existing fact-checking systems either assume the evidence sentences are given or use the search snippets returned by the search engine. Such methods ignore the challenges of collecting evidence and may not provide sufficient information to verify real-world claims. Aiming at building a better fact-checking system, we propose to incorporate full text from source documents as evidence and introduce two enriched datasets. The first one is a multilingual dataset, while the second one is monolingual (English). We further develop a latent variable model to jointly extract evidence sentences from documents and perform claim verification. Experiments indicate that including source documents can provide sufficient contextual clues even when gold evidence sentences are not annotated. The proposed system is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Software Engineering Research · Data Quality and Management