Topic Modelling: Going Beyond Token Outputs

Lowri Williams; Eirini Anthi; Laura Arman; Pete Burnap

arXiv:2401.12990·cs.CL·April 26, 2024·1 cites

Topic Modelling: Going Beyond Token Outputs

Lowri Williams, Eirini Anthi, Laura Arman, Pete Burnap

PDF

Open Access

TL;DR

This paper introduces a novel method to enhance the interpretability of topic models by extracting and mapping keywords directly from textual data, eliminating reliance on external sources and improving human understanding.

Contribution

It presents a new approach that extends traditional topic model outputs using only the data itself, improving interpretability without external dependencies.

Findings

01

Higher quality and usefulness of extended topics

02

Increased efficiency in annotation tasks

03

Better interpretability compared to traditional methods

Abstract

Topic modelling is a text mining technique for identifying salient themes from a number of documents. The output is commonly a set of topics consisting of isolated tokens that often co-occur in such documents. Manual effort is often associated with interpreting a topic's description from such tokens. However, from a human's perspective, such outputs may not adequately provide enough information to infer the meaning of the topics; thus, their interpretability is often inaccurately understood. Although several studies have attempted to automatically extend topic descriptions as a means of enhancing the interpretation of topic models, they rely on external language sources that may become unavailable, must be kept up-to-date to generate relevant results, and present privacy issues when training on or processing data. This paper presents a novel approach towards extending the output of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational and Text Analysis Methods · Advanced Text Analysis Techniques · Recommender Systems and Techniques

MethodsSparse Evolutionary Training