Reddit's Globalization over Twenty Years: Inferring Community Time Zone from Activity Timestamps
Franco Della Negra, Mattia Samory, Matteo Cinelli

TL;DR
This paper introduces simple, scalable methods to infer the time zones of online communities from activity timestamps, achieving high accuracy without user data, and applies them to analyze Reddit's geographic evolution over twenty years.
Contribution
It presents a lightweight heuristic, alongside time-domain and frequency-domain methods, for inferring community time zones solely from activity patterns, validated across diverse communities and platforms.
Findings
The best-performing method achieves sub-30-minute accuracy on Reddit.
Fewer than a thousand comments suffice to reach peak performance.
The heuristic recovers the correct time zone within one hour on average.
Abstract
Online communities are a global phenomenon, but assessing their actual geographical spread requires accurate and scalable measurement. We propose and evaluate methods that infer the time zone of online communities solely from their temporal activity patterns, requiring nothing beyond hourly activity counts. Grounding our approach in the well-established finding that posting rhythms encode circadian structure, we compare time-domain and frequency-domain methods against a parsimonious heuristic: that activity reaches its minimum around 4 a.m. local time. On Reddit, we show that the best-performing method is accurate to a sub-30-minute resolution, and that fewer than a thousand comments are sufficient to reach peak performance. Similarly, our heuristic almost matches the accuracy of more complex methods, recovering the correct time zone within a one-hour margin on average. This simple…
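The 4 a.m. heuristic described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `infer_utc_offset`, the circular smoothing step, and the `window` parameter are assumptions introduced here for illustration. The only idea taken from the abstract is that activity reaches its minimum around 4 a.m. local time. The input is assumed to be 24 activity counts indexed by UTC hour.

```python
from typing import Sequence

# Assumed local hour of minimum activity, taken from the heuristic in the abstract.
TROUGH_LOCAL_HOUR = 4


def infer_utc_offset(hourly_counts_utc: Sequence[float], window: int = 3) -> int:
    """Estimate a community's UTC offset (in whole hours) from 24 hourly counts.

    hourly_counts_utc[h] is the activity observed during UTC hour h.
    `window` is an odd-sized circular smoothing window (an illustrative
    choice, not something specified by the paper).
    """
    if len(hourly_counts_utc) != 24:
        raise ValueError("expected exactly 24 hourly counts")
    half = window // 2
    # Circular moving sum, so a single noisy hour does not decide the trough.
    smoothed = [
        sum(hourly_counts_utc[(h + d) % 24] for d in range(-half, half + 1))
        for h in range(24)
    ]
    # UTC hour at which (smoothed) activity is lowest.
    trough_utc = min(range(24), key=smoothed.__getitem__)
    # local = UTC + offset, and the trough is assumed to fall at 4 a.m. local,
    # so offset = 4 - trough_utc, wrapped into the range [-12, 12).
    return (TROUGH_LOCAL_HOUR - trough_utc + 12) % 24 - 12


if __name__ == "__main__":
    # Synthetic profile whose activity is lowest at 09:00 UTC.
    profile = [min(abs(h - 9), 24 - abs(h - 9)) for h in range(24)]
    print(infer_utc_offset(profile))  # prints -5
```

With whole-hour bins, a sketch like this can only resolve offsets to the nearest hour; the sub-30-minute accuracy reported for the best-performing method would require something finer than this illustration.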
