Hidden Biases in Unreliable News Detection Datasets
Xiang Zhou, Heba Elfardy, Christos Christodoulopoulos, Thomas Butler,, Mohit Bansal

TL;DR
This paper critically examines existing unreliable news detection datasets, revealing biases and confounding factors that hinder generalization, and offers practical guidelines for creating more robust datasets and evaluation methods.
Contribution
It identifies selection bias and source overlap as key issues in current datasets and proposes strategies for more reliable dataset creation and evaluation.
Findings
Significant accuracy drop (>10%) in clean data splits.
Models often exploit source overlap rather than true content cues.
Recommendations for bias detection and non-overlapping data splits.
Abstract
Automatic unreliable news detection is a research problem with great potential impact. Recently, several papers have shown promising results on large-scale news datasets with models that only use the article itself without resorting to any fact-checking mechanism or retrieving any supporting evidence. In this work, we take a closer look at these datasets. While they all provide valuable resources for future research, we observe a number of problems that may lead to results that do not generalize in more realistic settings. Specifically, we show that selection bias during data collection leads to undesired artifacts in the datasets. In addition, while most systems train and predict at the level of individual articles, overlapping article sources in the training and evaluation data can provide a strong confounding factor that models can exploit. In the presence of this confounding factor,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Misinformation and Its Impacts · Spam and Phishing Detection
