TL;DR
This paper introduces a new dataset and model for verifying the credibility of evidence used in automated fact-checking, aiming to improve the detection of unreliable or leaked information and enhance misinformation detection systems.
Contribution
The paper presents the CREDULE dataset and EVVER-Net model for verifying evidence credibility, addressing the challenge of unreliable and leaked sources in automated fact-checking.
Findings
EVVER-Net achieves up to 94.4% accuracy in detecting unreliable evidence.
The CREDULE dataset contains 91,632 articles classified into Credible, Unreliable, and Leaked.
Widely-used fact-checking datasets contain significant amounts of unreliable evidence.
Abstract
Automated fact-checking (AFC) is garnering increasing attention by researchers aiming to help fact-checkers combat the increasing spread of misinformation online. While many existing AFC methods incorporate external information from the Web to help examine the veracity of claims, they often overlook the importance of verifying the source and quality of collected "evidence". One overlooked challenge involves the reliance on "leaked evidence", information gathered directly from fact-checking websites and used to train AFC systems, resulting in an unrealistic setting for early misinformation detection. Similarly, the inclusion of information from unreliable sources can undermine the effectiveness of AFC systems. To address these challenges, we present a comprehensive approach to evidence verification and filtering. We create the "CREDible, Unreliable or LEaked" (CREDULE) dataset, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
