CrowdChecked: Detecting Previously Fact-Checked Claims in Social Media

Momchil Hardalov; Anton Chernyavskiy; Ivan Koychev; Dmitry Ilvovsky,; Preslav Nakov

arXiv:2210.04447·cs.CL·October 11, 2022·5 cites

CrowdChecked: Detecting Previously Fact-Checked Claims in Social Media

Momchil Hardalov, Anton Chernyavskiy, Ivan Koychev, Dmitry Ilvovsky,, Preslav Nakov

PDF

Open Access 1 Repo

TL;DR

This paper introduces CrowdChecked, a system that detects social media claims previously fact-checked by leveraging crowd-sourced data, using a large-scale dataset and an end-to-end learning framework to improve fact-checking accuracy.

Contribution

It presents a novel approach to utilize crowd-sourced social media claims and a large dataset for training fact-checking models, addressing data scarcity issues.

Findings

01

Achieved a two-point improvement over the state of the art on CLEF'21 dataset.

02

Created a large-scale dataset of 330,000 tweets with fact-checking articles.

03

Developed an end-to-end framework using modified self-adaptive training for noisy data.

Abstract

While there has been substantial progress in developing systems to automate fact-checking, they still lack credibility in the eyes of the users. Thus, an interesting approach has emerged: to perform automatic fact-checking by verifying whether an input claim has been previously fact-checked by professional fact-checkers and to return back an article that explains their decision. This is a sensible approach as people trust manual fact-checking, and as many claims are repeated multiple times. Yet, a major issue when building such systems is the small number of known tweet--verifying article pairs available for training. Here, we aim to bridge this gap by making use of crowd fact-checking, i.e., mining claims in social media for which users have responded with a link to a fact-checking article. In particular, we mine a large-scale collection of 330,000 tweets paired with a corresponding…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mhardalov/crowdchecked-claims
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Misinformation and Its Impacts · Software Engineering Research

MethodsTest