Automatic Generation of Topic Labels

Areej Alokaili; Nikolaos Aletras; Mark Stevenson

arXiv:2006.00127·cs.IR·June 2, 2020

Automatic Generation of Topic Labels

Areej Alokaili, Nikolaos Aletras, Mark Stevenson

PDF

1 Repo

TL;DR

This paper introduces a neural sequence-to-sequence model for automatic topic label generation, overcoming limitations of extractive methods by producing more flexible and human-like labels, trained on a large synthetic dataset.

Contribution

It presents a novel neural approach for topic labeling that generates descriptive labels without relying on a restricted candidate set, trained on a new synthetic dataset.

Findings

01

The neural model outperforms extractive methods in human evaluations.

02

Generated labels are more descriptive and relevant according to human ratings.

03

The approach demonstrates effectiveness across diverse topics.

Abstract

Topic modelling is a popular unsupervised method for identifying the underlying themes in document collections that has many applications in information retrieval. A topic is usually represented by a list of terms ranked by their probability but, since these can be difficult to interpret, various approaches have been developed to assign descriptive labels to topics. Previous work on the automatic assignment of labels to topics has relied on a two-stage approach: (1) candidate labels are retrieved from a large pool (e.g. Wikipedia article titles); and then (2) re-ranked based on their semantic similarity to the topic terms. However, these extractive approaches can only assign candidate labels from a restricted set that may not include any suitable ones. This paper proposes using a sequence-to-sequence neural-based approach to generate labels that does not suffer from this limitation. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

areejokaili/topic_labelling
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.