Comprehensive Evaluation of Large Language Models for Topic Modeling

Tomoki Doi; Masaru Isonuma; Hitomi Yanaka

arXiv:2406.00697 · cs.CL · June 26, 2024 · 2 cites

Open Access

TL;DR

This paper quantitatively evaluates large language models for topic modeling, assessing topic quality, hallucination tendencies, and controllability. It finds that LLMs produce coherent and diverse topics with few hallucinations, but may attend to only parts of the documents and offer limited control over topic categories via prompts.

Contribution

The paper provides a quantitative analysis of LLMs for topic modeling from multiple perspectives, complementing prior evaluations, which were mainly qualitative.

Findings

1. LLMs produce coherent and diverse topics with few hallucinations.

2. They tend to take shortcuts by focusing on parts of documents.

3. Controllability of topics via prompts is limited.

Abstract

Recent work utilizes Large Language Models (LLMs) for topic modeling, generating comprehensible topic labels for given documents. However, their performance has mainly been evaluated qualitatively, and there remains room for quantitative investigation of their capabilities. In this paper, we quantitatively evaluate LLMs from multiple perspectives: the quality of topics, the impact of LLM-specific concerns, such as hallucination and shortcuts for limited documents, and LLMs' controllability of topic categories via prompts. Our findings show that LLMs can identify coherent and diverse topics with few hallucinations but may take shortcuts by focusing only on parts of documents. We also found that their controllability is limited.

Taxonomy

Topics: Topic Modeling · Computational and Text Analysis Methods · Advanced Text Analysis Techniques