Analysis and Forecasting of Trending Topics in Online Media Streams
Tim Althoff, Damian Borth, J\"orn Hees, Andreas Dengel

TL;DR
This paper conducts a comprehensive analysis of trending topics across Twitter, Google, and Wikipedia over a year and introduces a novel real-time forecasting method for trending topic life cycles using nearest neighbor techniques.
Contribution
It provides the first cross-platform analysis of trending topics and proposes a new automated forecasting approach based on semantic similarity for predicting their evolution.
Findings
Forecasts are 9-48k views closer to actual data than baselines.
Achieves 45-19% mean average percentage error for up to 14-day forecasts.
Highlights differences in content emphasis across media streams.
Abstract
Among the vast information available on the web, social media streams capture what people currently pay attention to and how they feel about certain topics. Awareness of such trending topics plays a crucial role in multimedia systems such as trend aware recommendation and automatic vocabulary selection for video concept detection systems. Correctly utilizing trending topics requires a better understanding of their various characteristics in different social media streams. To this end, we present the first comprehensive study across three major online and social media streams, Twitter, Google, and Wikipedia, covering thousands of trending topics during an observation period of an entire year. Our results indicate that depending on one's requirements one does not necessarily have to turn to Twitter for information about current events and that some media streams strongly emphasize…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Advanced Text Analysis Techniques · Data-Driven Disease Surveillance
