Sparseness-constrained Nonnegative Tensor Factorization for Detecting Topics at Different Time Scales
Lara Kassab, Alona Kryshchenko, Hanbaek Lyu, Denali Molitor, Deanna, Needell, Elizaveta Rebrova, Jiahong Yuan

TL;DR
This paper introduces sparseness-constrained nonnegative tensor factorization methods that effectively detect and differentiate between long-lasting and short-term topics in temporal data, with improved control and efficiency.
Contribution
It proposes a novel sparseness-constrained NCPD approach and its online version for better detection of topics at different time scales, along with quantitative measures for topic length.
Findings
S-NCPD can automatically discover topics of varying persistence.
The online variant reduces reconstruction error faster.
Methods effectively identify short and long-term topics in real-world data.
Abstract
Temporal data (such as news articles or Twitter feeds) often consists of a mixture of long-lasting trends and popular but short-lasting topics of interest. A truly successful topic modeling strategy should be able to detect both types of topics and clearly locate them in time. In this paper, we first show that nonnegative CANDECOMP/PARAFAC decomposition (NCPD) is able to discover topics of variable persistence automatically. Then, we propose sparseness-constrained NCPD (S-NCPD) and its online variant in order to actively control the length of the learned topics effectively and efficiently. Further, we propose quantitative ways to measure the topic length and demonstrate the ability of S-NCPD (as well as its online variant) to discover short and long-lasting temporal topics in a controlled manner in semi-synthetic and real-world data including news headlines. We also demonstrate that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Computational Physics and Python Applications
