Scalable Fact-checking with Human-in-the-Loop

Jing Yang; Didier Vega-Oliveros; Tais Seibt; Anderson Rocha

arXiv:2109.10992·cs.CL·October 8, 2024

Scalable Fact-checking with Human-in-the-Loop

Jing Yang, Didier Vega-Oliveros, Tais Seibt, Anderson Rocha

PDF

Open Access 1 Repo

TL;DR

This paper presents a scalable fact-checking approach that groups and summarizes social media posts to reduce redundancy and accelerate verification, combining automated clustering with human evaluation.

Contribution

It introduces a method to organize and summarize large volumes of social media data for fact-checking, integrating semantic graph clustering and claim summarization.

Findings

01

Reduced 28,818 messages to 700 summaries

02

Achieved high ROUGE scores for summaries

03

Demonstrated potential to speed up fact-checking processes

Abstract

Researchers have been investigating automated solutions for fact-checking in a variety of fronts. However, current approaches often overlook the fact that the amount of information released every day is escalating, and a large amount of them overlap. Intending to accelerate fact-checking, we bridge this gap by grouping similar messages and summarizing them into aggregated claims. Specifically, we first clean a set of social media posts (e.g., tweets) and build a graph of all posts based on their semantics; Then, we perform two clustering methods to group the messages for further claim summarization. We evaluate the summaries both quantitatively with ROUGE scores and qualitatively with human evaluation. We also generate a graph of summaries to verify that there is no significant overlap among them. The results reduced 28,818 original messages to 700 summary claims, showing the potential…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jingyng/scalable-fact-checking
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Topic Modeling · Complex Network Analysis Techniques