Exploring limits to prediction in complex social systems
Travis Martin, Jake M. Hofman, Amit Sharma, Ashton Anderson, Duncan J., Watts

TL;DR
This paper investigates the fundamental limits of predicting success in complex social systems, combining theoretical modeling and empirical analysis on Twitter data, revealing inherent unpredictability due to system complexity and data limitations.
Contribution
It introduces a stylized model of success prediction, quantifies bounds on predictability, and empirically demonstrates these limits using Twitter cascade data.
Findings
Models explain less than half of cascade size variance.
Predictive performance is bounded below deterministic accuracy.
System heterogeneity and uncertainty significantly restrict predictability.
Abstract
How predictable is success in complex social systems? In spite of a recent profusion of prediction studies that exploit online social and information network data, this question remains unanswered, in part because it has not been adequately specified. In this paper we attempt to clarify the question by presenting a simple stylized model of success that attributes prediction error to one of two generic sources: insufficiency of available data and/or models on the one hand; and inherent unpredictability of complex social systems on the other. We then use this model to motivate an illustrative empirical study of information cascade size prediction on Twitter. Despite an unprecedented volume of information about users, content, and past performance, our best performing models can explain less than half of the variance in cascade sizes. In turn, this result suggests that even with unlimited…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
