A Statistical Framework for Detecting Emergent Narratives in Longitudinal Text Corpora
Cynthia Medeiros, John Quigley, Matthew Revie

TL;DR
This paper introduces a statistical framework using LDA to detect emerging narratives in long-term text data, exemplified by economic publications, by identifying sustained increases in topic prominence.
Contribution
It presents a novel method for identifying narrative emergence through modeling topic trajectories as latent variables over time.
Findings
Topics related to Nobel-winning contributions show sustained prevalence increases.
The framework correlates topic prominence with citation activity and recognition.
It provides a statistically grounded approach for thematic change analysis.
Abstract
Narratives about economic events and policies are widely recognised as influential drivers of economic and business behaviour. Yet the statistical identification of narrative emergence remains underdeveloped. Narratives evolve gradually, exhibit subtle shifts in content, and may exert influence disproportionate to their observable frequency, making it difficult to determine when observed changes reflect genuine structural shifts rather than routine variation in language use. We propose a statistical framework for detecting narrative emergence in longitudinal text corpora using Latent Dirichlet Allocation (LDA). We define emergence as a sustained increase in a topic's relative prominence over time and articulate a statistical framework for interpreting such trajectories, recognising that topic proportions are latent, model-estimated quantities. We illustrate the approach using a corpus…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Language and cultural evolution · scientometrics and bibliometrics research
