Fake News Detection via Wisdom of Synthetic & Representative Crowds

Fran\c{c}ois t'Serstevens; Roberto Cerina; Giulia Piccillo

arXiv:2408.03154·cs.CY·August 7, 2024

Fake News Detection via Wisdom of Synthetic & Representative Crowds

Fran\c{c}ois t'Serstevens, Roberto Cerina, Giulia Piccillo

PDF

Open Access

TL;DR

This paper introduces a novel methodology for detecting fake news on social media by combining synthetic and representative crowdsourcing, hierarchical Bayesian modeling, and multilevel regression to produce democratically legitimate veracity assessments.

Contribution

It presents an end-to-end approach that leverages synthetic and real crowds, hierarchical Bayesian models, and multilevel regression to improve fake news detection and representation.

Findings

01

Fake news sharing is generally rare, with probabilities between 7% and 14%.

02

Democrats share less fake news than Republicans, with a reduction of about 57%.

03

Women tend to share less fake news than men, with a decrease of approximately 30%.

Abstract

Social media companies have struggled to provide a democratically legitimate definition of "Fake News". Reliance on expert judgment has attracted criticism due to a general trust deficit and political polarisation. Approaches reliant on the ``wisdom of the crowds'' are a cost-effective, transparent and inclusive alternative. This paper provides a novel end-to-end methodology to detect fake news on X via "wisdom of the synthetic & representative crowds". We deploy an online survey on the Lucid platform to gather veracity assessments for a number of pandemic-related tweets from crowd-workers. Borrowing from the MrP literature, we train a Hierarchical Bayesian model to predict the veracity of each tweet from the perspective of different personae from the population of interest. We then weight the predicted veracity assessments according to a representative stratification frame, such that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Spam and Phishing Detection