On a Guided Nonnegative Matrix Factorization

Joshua Vendrow; Jamie Haddock; Elizaveta Rebrova; Deanna Needell

arXiv:2010.11365·cs.LG·February 8, 2021

TL;DR

This paper introduces Guided NMF, a semi-supervised topic modeling approach that incorporates user-provided seed words to improve the quality of learned topics, demonstrating competitive results with minimal supervision.

Contribution

The paper proposes Guided NMF, a novel semi-supervised extension of NMF that uses seed words to guide topic discovery, addressing issues of redundant or less meaningful topics.

Findings

01. Guided NMF outperforms traditional unsupervised NMF in topic coherence.

02. The method requires minimal supervision to achieve competitive results.

03. Experimental results validate the effectiveness of seed word guidance in topic modeling.

Abstract

Fully unsupervised topic models have found fantastic success in document clustering and classification. However, these models often suffer from the tendency to learn less-than-meaningful or even redundant topics when the data is biased towards a set of features. For this reason, we propose an approach based upon the nonnegative matrix factorization (NMF) model, deemed Guided NMF, that incorporates user-designed seed word supervision. Our experimental results demonstrate the promise of this model and illustrate that it is competitive with other methods of this ilk with only very little supervision information.
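
To make the idea of seed word supervision concrete, here is a minimal sketch, not the authors' implementation from jvendrow/GuidedNMF. This page does not state the objective, so the sketch assumes a semi-supervised NMF style loss, ||X - WH||_F^2 + lam * ||Y - WB||_F^2, where X is a word-by-document matrix, W holds word-by-topic weights, H holds topic-by-document weights, Y is a word-by-seed-topic indicator matrix built from the user's seed words, and B maps learned topics to seed topics. The function names, the toy data, and the standard multiplicative updates are all assumptions chosen for illustration.

```python
import numpy as np


def guided_nmf_loss(X, Y, W, H, B, lam):
    """Assumed objective: data fit plus lam-weighted seed-word fit."""
    return (np.linalg.norm(X - W @ H) ** 2
            + lam * np.linalg.norm(Y - W @ B) ** 2)


def guided_nmf_sketch(X, Y, n_topics, lam=1.0, n_iter=200, seed=0, eps=1e-10):
    """Hypothetical helper: multiplicative updates for the assumed objective.

    X: (n_words, n_docs) nonnegative word-by-document matrix.
    Y: (n_words, n_seed_topics) nonnegative seed-word matrix.
    Returns nonnegative factors W, H, B.
    """
    rng = np.random.default_rng(seed)
    n_words, n_docs = X.shape
    n_seed_topics = Y.shape[1]
    W = rng.random((n_words, n_topics))
    H = rng.random((n_topics, n_docs))
    B = rng.random((n_topics, n_seed_topics))
    for _ in range(n_iter):
        # W is shared by both terms, so both terms enter its update.
        W *= (X @ H.T + lam * Y @ B.T) / (W @ (H @ H.T + lam * B @ B.T) + eps)
        # H only appears in the data-fit term.
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        # B only appears in the seed-word term.
        B *= (W.T @ Y) / (W.T @ W @ B + eps)
    return W, H, B


# Toy corpus (made up): rows are words, columns are documents.
vocab = ["goal", "team", "match", "vote", "party", "law"]
X = np.array([
    [3, 2, 0, 0],
    [2, 3, 0, 1],
    [1, 2, 0, 0],
    [0, 0, 3, 2],
    [0, 1, 2, 3],
    [0, 0, 1, 2],
], dtype=float)

# Seed topic 0 is seeded with "goal", seed topic 1 with "vote".
Y = np.zeros((len(vocab), 2))
Y[vocab.index("goal"), 0] = 1.0
Y[vocab.index("vote"), 1] = 1.0

W, H, B = guided_nmf_sketch(X, Y, n_topics=2, lam=1.0)
for k in range(W.shape[1]):
    top_words = [vocab[i] for i in np.argsort(W[:, k])[::-1][:3]]
    print(f"topic {k}: {top_words}")
```

In this sketch, a larger `lam` pulls the learned topics harder toward the seed words, while `lam=0` leaves W and H to be fit as in plain unsupervised NMF.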

Code & Models

Repositories

jvendrow/GuidedNMF
Official