Topic Diffusion Discovery based on Sparseness-constrained Non-negative Matrix Factorization

Yihuang Kang; Keng-Pei Lin; I-Ling Cheng

arXiv:1807.04386·cs.IR·July 13, 2018

TL;DR

This paper introduces a novel method combining sparseness-constrained Non-negative Matrix Factorization and Jensen-Shannon divergence to discover and visualize the evolution and diffusion of research topics in large text datasets.

Contribution

It presents a technique for identifying and visualizing topic diffusion and evolution in scholarly literature, using sparseness-constrained Non-negative Matrix Factorization to extract topics and generalized Jensen-Shannon divergence to compare them.
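To make the factorization step more concrete, here is a minimal sketch, not the authors' implementation. It assumes a non-negative term-document matrix `V` and factors it as `V ≈ W @ H`, with `W` read as term-topic weights and `H` as topic-document weights. The function names `hoyer_sparseness` and `sparse_nmf` and the parameter `l1_penalty` are hypothetical. The sparseness measure is the one commonly attributed to Hoyer; the solver is a swapped-in simplification that uses multiplicative updates with an L1 penalty on `H` rather than an explicit sparseness-projection step.

```python
import numpy as np


def hoyer_sparseness(x):
    """Sparseness in [0, 1]: 0 for a uniform vector, 1 for a one-hot vector."""
    x = np.abs(np.asarray(x, dtype=float))
    n = x.size
    l1 = x.sum()
    l2 = np.sqrt((x ** 2).sum())
    if n < 2 or l2 == 0:
        return 0.0
    return (np.sqrt(n) - l1 / l2) / (np.sqrt(n) - 1)


def sparse_nmf(V, k, l1_penalty=0.1, n_iter=200, seed=0, eps=1e-9):
    """Factor V (terms x documents) into W (terms x k) and H (k x documents).

    Assumption: sparsity is encouraged with an L1 penalty on H, which is a
    simplification and not the constraint scheme described in the paper.
    """
    V = np.asarray(V, dtype=float)
    rng = np.random.default_rng(seed)
    m, n = V.shape
    W = rng.random((m, k)) + eps
    H = rng.random((k, n)) + eps
    for _ in range(n_iter):
        # Multiplicative updates keep both factors non-negative.
        H *= (W.T @ V) / (W.T @ W @ H + l1_penalty + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
        # Fix the scale of each topic so the penalty on H is meaningful;
        # rescaling H by the same factor leaves the product W @ H unchanged.
        norms = np.linalg.norm(W, axis=0) + eps
        W /= norms
        H *= norms[:, None]
    return W, H
```

Calling `hoyer_sparseness` on each row of `H` (or each column of `W`) gives a quick check of how concentrated each topic is, and raising `l1_penalty` should push those values up at some cost in reconstruction accuracy.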

Findings

01. Extracts prominent topics from large datasets

02. Visualizes term-topic relationships and evolution

03. Helps identify emerging research topics

Abstract

Due to the recent explosion of text data, researchers have been overwhelmed by the ever-increasing volume of articles produced by different research communities. Various scholarly search websites, citation recommendation engines, and research databases have been created to simplify text search tasks. However, it is still difficult for researchers to identify potential research topics without intensively reviewing a tremendous number of articles published by journals, conferences, meetings, and workshops. In this paper, we consider a novel topic diffusion discovery technique that incorporates sparseness-constrained Non-negative Matrix Factorization with generalized Jensen-Shannon divergence to help understand term-topic evolutions and identify topic diffusions. Our experimental results show that this approach can extract more prominent topics from large article databases,…
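The generalized Jensen-Shannon divergence mentioned in the abstract can be written, for distributions P_1, ..., P_n with weights π_1, ..., π_n that sum to 1, as H(Σ π_i P_i) − Σ π_i H(P_i), where H is Shannon entropy. The sketch below computes that quantity with NumPy. How the paper applies it is not spelled out in this summary, so treating each row as the term distribution of one topic in one time window is an assumption, and the names `entropy` and `generalized_jsd` are hypothetical.

```python
import numpy as np


def entropy(p):
    """Shannon entropy in bits; zero-probability entries contribute nothing."""
    p = np.asarray(p, dtype=float)
    nz = p > 0
    return float(-np.sum(p[nz] * np.log2(p[nz])))


def generalized_jsd(dists, weights=None):
    """Generalized Jensen-Shannon divergence of several distributions.

    dists:   2-D array-like, one non-negative row per distribution
             (assumed here: one topic's term weights per time window).
    weights: optional mixing weights summing to 1; uniform if omitted.
    """
    dists = np.asarray(dists, dtype=float)
    # Normalize each row so it is a proper probability distribution.
    dists = dists / dists.sum(axis=1, keepdims=True)
    n = dists.shape[0]
    if weights is None:
        weights = np.full(n, 1.0 / n)
    weights = np.asarray(weights, dtype=float)
    mixture = weights @ dists
    mean_entropy = sum(w * entropy(d) for w, d in zip(weights, dists))
    return entropy(mixture) - mean_entropy
```

With uniform weights the result lies between 0 (all distributions identical) and log2(n) bits (all distributions have disjoint support), so a larger value across time windows would indicate that a topic's term distribution has shifted more.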

Taxonomy

Topics: Advanced Text Analysis Techniques · Topic Modeling · Text and Document Classification Technologies