Jointly Dynamic Topic Model for Recognition of Lead-lag Relationship in Two Text Corpora
Yandi Zhu, Xiaoling Lu, Jingya Hong, and Feifei Wang

TL;DR
This paper introduces a joint dynamic topic model to identify lead-lag relationships between two text corpora, improving topic understanding and correlation detection across multiple sources.
Contribution
It proposes a novel joint dynamic topic model with an embedding extension to recognize lead-lag relationships between two corpora, enhancing multi-source topic analysis.
Findings
Successfully identified lead-lag relationships in synthetic data.
Effectively discovered specific and shared topics in real-world corpora.
Improved topic modeling quality through relationship recognition.
Abstract
Topic evolution modeling has received significant attentions in recent decades. Although various topic evolution models have been proposed, most studies focus on the single document corpus. However in practice, we can easily access data from multiple sources and also observe relationships between them. Then it is of great interest to recognize the relationship between multiple text corpora and further utilize this relationship to improve topic modeling. In this work, we focus on a special type of relationship between two text corpora, which we define as the "lead-lag relationship". This relationship characterizes the phenomenon that one text corpus would influence the topics to be discussed in the other text corpus in the future. To discover the lead-lag relationship, we propose a jointly dynamic topic model and also develop an embedding extension to address the modeling problem of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Computational and Text Analysis Methods · Advanced Text Analysis Techniques
