Reap the Wild Wind: Detecting Media Storms in Large-Scale News Corpora
Dror K. Markus, Effi Levi, Tamir Sheafer, and Shaul R. Shenhav

TL;DR
This paper presents an iterative human-in-the-loop method for detecting media storms in large news corpora, enabling systematic empirical analysis of these attention surges.
Contribution
It introduces a novel, scalable approach combining textual signals and expert validation to identify media storms in large-scale news datasets.
Findings
Successfully detects media storms in different time periods
Provides a new dataset for media storm research
Enables empirical analysis of media storm characteristics
Abstract
Media Storms, dramatic outbursts of attention to a story, are central components of media dynamics and the attention landscape. Despite their significance, there has been little systematic and empirical research on this concept due to issues of measurement and operationalization. We introduce an iterative human-in-the-loop method to identify media storms in a large-scale corpus of news articles. The text is first transformed into signals of dispersion based on several textual characteristics. In each iteration, we apply unsupervised anomaly detection to these signals; each anomaly is then validated by an expert to confirm the presence of a storm, and those results are then used to tune the anomaly detection in the next iteration. We demonstrate the applicability of this method in two scenarios: first, supplementing an initial list of media storms within a specific time frame; and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMisinformation and Its Impacts · Complex Network Analysis Techniques · Anomaly Detection Techniques and Applications
