Doing data science with platforms crumbs: an investigation into fakes views on YouTube
Maria Castaldo, Paolo Frasca, Tommaso Venturini, Floriana Gargiulo

TL;DR
This study investigates the prevalence and impact of fake views on YouTube, revealing that most are corrected late and can artificially inflate video popularity, with significant implications for online information quality.
Contribution
The paper provides the first large-scale analysis of fake view removal on YouTube, highlighting the extent, timing, and potential effects of fake views on content visibility.
Findings
Most fake views are corrected late in video lifespan
Fake views significantly influence final view counts
YouTube lacks transparency on fake view data
Abstract
This paper contributes to the ongoing discussions on the scholarly access to social media data, discussing a case where this access is barred despite its value for understanding and countering online disinformation and despite the absence of privacy or copyright issues. Our study concerns YouTube's engagement metrics and, more specifically, the way in which the platform removes "fake views" (i.e., views considered as artificial or illegitimate by the platform). Working with one and a half year of data extracted from a thousand French YouTube channels, we show the massive extent of this phenomenon, which concerns the large majority of the channels and more than half the videos in our corpus. Our analysis indicates that most fakes news are corrected relatively late in the life of the videos and that the final view counts of the videos are not independent from the fake views they received.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Spam and Phishing Detection · Hate Speech and Cyberbullying Detection
