SoK: Machine Learning for Misinformation Detection

Madelyne Xiao; Jonathan Mayer

arXiv:2308.12215 · cs.LG · January 28, 2025 · 1 citation

TL;DR

This paper critically examines the application of machine learning to misinformation detection. It highlights common methodological flaws and the limited real-world effectiveness of current methods, and it proposes improved evaluation practices and directions for future research.

Contribution

The paper surveys 248 well-cited papers on automated misinformation detection, identifies prevalent errors in dataset and method design, and demonstrates the limited efficacy of current methods through replication studies.

Findings

01. Detection tasks often do not reflect the challenges that online services actually face.

02. Datasets and evaluations are frequently non-representative of real-world contexts.

03. The current state of the art has limited effectiveness in identifying human misinformation.

Abstract

We examine the disconnect between scholarship and practice in applying machine learning to trust and safety problems, using misinformation detection as a case study. We survey literature on automated detection of misinformation across a corpus of 248 well-cited papers in the field. We then examine subsets of papers for data and code availability, design missteps, reproducibility, and generalizability. Our paper corpus includes published work in security, natural language processing, and computational social science. Across these disparate disciplines, we identify common errors in dataset and method design. In general, detection tasks are often meaningfully distinct from the challenges that online services actually face. Datasets and model evaluation are often non-representative of real-world contexts, and evaluation frequently is not independent of model training. We demonstrate the…
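One of the errors named in the abstract is evaluation that is not independent of model training. As a rough illustration only, and not the authors' procedure, the sketch below estimates how much of a test set also appears in the training data after light text normalization. The function names `normalize` and `train_test_overlap` and the example strings are hypothetical, chosen for this sketch.

```python
import re


def normalize(text: str) -> str:
    """Lowercase, drop punctuation, and collapse whitespace so that
    trivially different copies of the same post compare equal."""
    text = re.sub(r"[^\w\s]", "", text.lower())
    return " ".join(text.split())


def train_test_overlap(train_texts, test_texts):
    """Return the fraction of test examples whose normalized text
    also occurs in the training data (0.0 for an empty test set)."""
    seen = {normalize(t) for t in train_texts}
    if not test_texts:
        return 0.0
    leaked = sum(1 for t in test_texts if normalize(t) in seen)
    return leaked / len(test_texts)


# Hypothetical toy data: the first test item duplicates a training item
# up to case and punctuation.
train = ["Vaccines cause X!", "The election was held on Tuesday."]
test = ["vaccines cause x", "A new bridge opened downtown."]
overlap = train_test_overlap(train, test)
print(f"Share of test items also seen in training: {overlap:.2f}")
```

A nonzero overlap suggests that reported test accuracy may partly reflect memorization rather than generalization. Exact-match checks like this one catch only the simplest form of leakage; shared sources, authors, or time periods between training and test data would need separate checks.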

Code & Models

Repositories

citp/sok_misinformation (tf, Official)

Taxonomy

Topics: Network Security and Intrusion Detection · Adversarial Robustness in Machine Learning · Misinformation and Its Impacts