Persistent Homology of Topic Networks for the Prediction of Reader Curiosity

Manuel D. S. Hopp; Vincent Labatut (LIA); Arthur Amalvy (LIA); Richard Dufour (LS2N - \'equipe TALN); Hannah Stone; Hayley Jach; Kou Murayama

arXiv:2506.11095·cs.CL·September 12, 2025

Persistent Homology of Topic Networks for the Prediction of Reader Curiosity

Manuel D. S. Hopp, Vincent Labatut (LIA), Arthur Amalvy (LIA), Richard Dufour (LS2N - \'equipe TALN), Hannah Stone, Hayley Jach, Kou Murayama

PDF

1 Video

TL;DR

This paper introduces a novel method combining persistent homology and topic modeling to analyze semantic information gaps in texts, effectively predicting reader curiosity levels and advancing understanding of engagement in NLP.

Contribution

It presents a new framework that models reader curiosity through topological analysis of semantic networks, integrating persistent homology with topic modeling for the first time.

Findings

01

Topological features significantly improve curiosity prediction accuracy.

02

The pipeline explains 73% of variance in curiosity ratings.

03

Semantic network topology correlates with reader engagement.

Abstract

Reader curiosity, the drive to seek information, is crucial for textual engagement, yet remains relatively underexplored in NLP. Building on Loewenstein's Information Gap Theory, we introduce a framework that models reader curiosity by quantifying semantic information gaps within a text's semantic structure. Our approach leverages BERTopic-inspired topic modeling and persistent homology to analyze the evolving topology (connected components, cycles, voids) of a dynamic semantic network derived from text segments, treating these features as proxies for information gaps. To empirically evaluate this pipeline, we collect reader curiosity ratings from participants (n = 49) as they read S. Collins's ''The Hunger Games'' novel. We then use the topological features from our pipeline as independent variables to predict these ratings, and experimentally show that they significantly improve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Persistent Homology of Topic Networks for the Prediction of Reader Curiosity· underline