SoK: Machine Learning for Misinformation Detection
Madelyne Xiao, Jonathan Mayer

TL;DR
This paper critically examines the application of machine learning to misinformation detection, highlighting common methodological flaws, limited real-world effectiveness, and proposing improved evaluation practices and future research directions.
Contribution
It provides a comprehensive survey of existing literature, identifies prevalent errors, and demonstrates the limited efficacy of current methods through replication studies.
Findings
Detection methods often do not reflect real-world challenges.
Datasets and evaluations are frequently non-representative.
Current state-of-the-art has limited effectiveness in identifying human misinformation.
Abstract
We examine the disconnect between scholarship and practice in applying machine learning to trust and safety problems, using misinformation detection as a case study. We survey literature on automated detection of misinformation across a corpus of 248 well-cited papers in the field. We then examine subsets of papers for data and code availability, design missteps, reproducibility, and generalizability. Our paper corpus includes published work in security, natural language processing, and computational social science. Across these disparate disciplines, we identify common errors in dataset and method design. In general, detection tasks are often meaningfully distinct from the challenges that online services actually face. Datasets and model evaluation are often non-representative of real-world contexts, and evaluation frequently is not independent of model training. We demonstrate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Adversarial Robustness in Machine Learning · Misinformation and Its Impacts
