SURGE: An Event-Centric Social Media Sentiment Time Series Benchmark with Interaction Structure
Chen Su, Pengsen Cheng, Yuanhe Tian, Yan Song

TL;DR
SURGE is a comprehensive social media benchmark dataset with event-level time series, text, and interaction data, enabling advanced forecasting and analysis of social dynamics during public events.
Contribution
It introduces a large, multi-event dataset with aligned textual and interaction data, along with benchmark protocols for evaluating social media forecasting models.
Findings
Naive models perform well due to local persistence.
Limited transferability of text-augmented models to social media data.
Reply-dense periods increase forecasting difficulty.
Abstract
Public events on social media generate large volumes of discussion whose collective dynamics carry direct value for opinion forecasting and crisis response. Capturing how these dynamics evolve across an event's lifecycle requires organizing fragmented posts into event-level time series. Existing datasets cover only a small number of events within a single category, and typically discard the interaction structure between posts when constructing time series, which restricts both transfer across event types and controlled study of how interactions shape the resulting collective dynamics. We present SURGE, a multi-event social media benchmark that pairs event-level time series with aligned text and interaction structure linking posts within an event. SURGE is built through an automated pipeline that produces calendar-aligned time series at three temporal granularities, covering 67 events…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
