On Large-Scale Dynamic Topic Modeling with Nonnegative CP Tensor Decomposition
Miju Ahn, Nicole Eikmeier, Jamie Haddock, Lara Kassab, Alona, Kryshchenko, Kathryn Leonard, Deanna Needell, R. W. M. A. Madushani, Elena, Sizikova, Chuntian Wang

TL;DR
This paper introduces a novel application of nonnegative CP tensor decomposition for large-scale dynamic topic modeling, effectively capturing temporal information and outperforming traditional NMF-based methods.
Contribution
It is the first to apply NNCPD to dynamic topic modeling, offering a more effective way to analyze temporal data by directly decomposing data tensors.
Findings
NNCPD significantly outperforms NMF-based methods on synthetic data.
Application to real data demonstrates improved topic evolution tracking.
The approach preserves temporal information more effectively.
Abstract
There is currently an unprecedented demand for large-scale temporal data analysis due to the explosive growth of data. Dynamic topic modeling has been widely used in social and data sciences with the goal of learning latent topics that emerge, evolve, and fade over time. Previous work on dynamic topic modeling primarily employ the method of nonnegative matrix factorization (NMF), where slices of the data tensor are each factorized into the product of lower-dimensional nonnegative matrices. With this approach, however, information contained in the temporal dimension of the data is often neglected or underutilized. To overcome this issue, we propose instead adopting the method of nonnegative CANDECOMP/PARAPAC (CP) tensor decomposition (NNCPD), where the data tensor is directly decomposed into a minimal sum of outer products of nonnegative vectors, thereby preserving the temporal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Edcuational Technology Systems · Computational and Text Analysis Methods
