FineFake: A Knowledge-Enriched Dataset for Fine-Grained Multi-Domain   Fake News Detection

Ziyi Zhou; Xiaoming Zhang; Litian Zhang; Jiacheng Liu; Senzhang Wang,; Zheng Liu; Xi Zhang; Chaozhuo Li; Philip S. Yu

arXiv:2404.01336·cs.CL·October 16, 2024·1 cites

FineFake: A Knowledge-Enriched Dataset for Fine-Grained Multi-Domain Fake News Detection

Ziyi Zhou, Xiaoming Zhang, Litian Zhang, Jiacheng Liu, Senzhang Wang,, Zheng Liu, Xi Zhang, Chaozhuo Li, Philip S. Yu

PDF

Open Access 1 Repo

TL;DR

FineFake is a comprehensive, multi-domain, knowledge-enriched dataset for fake news detection that includes fine-grained annotations, multi-modal content, and social context, addressing limitations of existing single-domain benchmarks.

Contribution

The paper introduces FineFake, a novel multi-domain benchmark with fine-grained annotations and knowledge enrichment, enabling more realistic fake news detection research.

Findings

01

Knowledge-enhanced domain adaptation improves detection accuracy.

02

FineFake covers diverse topics and platforms, reflecting real-world scenarios.

03

Extensive experiments establish reliable benchmarks for future research.

Abstract

Existing benchmarks for fake news detection have significantly contributed to the advancement of models in assessing the authenticity of news content. However, these benchmarks typically focus solely on news pertaining to a single semantic topic or originating from a single platform, thereby failing to capture the diversity of multi-domain news in real scenarios. In order to understand fake news across various domains, the external knowledge and fine-grained annotations are indispensable to provide precise evidence and uncover the diverse underlying strategies for fabrication, which are also ignored by existing benchmarks. To address this gap, we introduce a novel multi-domain knowledge-enhanced benchmark with fine-grained annotations, named \textbf{FineFake}. FineFake encompasses 16,909 data samples spanning six semantic topics and eight platforms. Each news item is enriched with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

accuser907/finefake
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Spam and Phishing Detection · Advanced Malware Detection Techniques

MethodsFocus