How Good Are SOTA Fake News Detectors

Matthew Iceland

arXiv:2308.02727·cs.CL·August 8, 2023·6 cites

How Good Are SOTA Fake News Detectors

Matthew Iceland

PDF

Open Access

TL;DR

This paper evaluates the robustness of traditional and state-of-the-art fake news detectors, revealing that traditional models often generalize better to unseen data than large language models, with the best choice depending on the task.

Contribution

It provides a comparative analysis of traditional and deep learning models' robustness in fake news detection, highlighting their generalization capabilities.

Findings

01

Traditional models generalize better to out-of-distribution data.

02

Large language models perform well on in-distribution data but less so on unseen data.

03

Model choice depends on specific fake news detection tasks.

Abstract

Automatic fake news detection with machine learning can prevent the dissemination of false statements before they gain many views. Several datasets labeling statements as legitimate or false have been created since the 2016 United States presidential election for the prospect of training machine learning models. We evaluate the robustness of both traditional and deep state-of-the-art models to gauge how well they may perform in the real world. We find that traditional models tend to generalize better to data outside the distribution it was trained on compared to more recently-developed large language models, though the best model to use may depend on the specific task at hand.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Hate Speech and Cyberbullying Detection · Spam and Phishing Detection