Evaluating topic coherence measures

Frank Rosner; Alexander Hinneburg; Michael R\"oder; Martin Nettling,; Andreas Both

arXiv:1403.6397·cs.LG·March 26, 2014·54 cites

Evaluating topic coherence measures

Frank Rosner, Alexander Hinneburg, Michael R\"oder, Martin Nettling,, Andreas Both

PDF

Open Access

TL;DR

This paper assesses various topic coherence measures, including novel ones from scientific philosophy that evaluate complex word subsets, to improve the interpretability of topic models.

Contribution

It introduces and applies coherence measures from scientific philosophy that score complex word subsets, expanding beyond pairwise word evaluations.

Findings

01

New coherence measures effectively distinguish better topics

02

Complex subset scoring improves topic interpretability

03

First application of philosophical coherence measures to topic modeling

Abstract

Topic models extract representative word sets - called topics - from word counts in documents without requiring any semantic annotations. Topics are not guaranteed to be well interpretable, therefore, coherence measures have been proposed to distinguish between good and bad topics. Studies of topic coherence so far are limited to measures that score pairs of individual words. For the first time, we include coherence measures from scientific philosophy that score pairs of more complex word subsets and apply them to topic scoring.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Text Analysis Techniques · Natural Language Processing Techniques