ZeroDL: Zero-shot Distribution Learning for Text Clustering via Large Language Models

Hwiyeol Jo; Hyunwoo Lee; Kang Min Yoo; Taiwoo Park

arXiv:2406.13342·cs.CL·June 10, 2025

ZeroDL: Zero-shot Distribution Learning for Text Clustering via Large Language Models

Hwiyeol Jo, Hyunwoo Lee, Kang Min Yoo, Taiwoo Park

PDF

Open Access

TL;DR

This paper introduces ZeroDL, a zero-shot distribution learning approach that leverages large language models for effective text clustering by aggregating open-ended inference results and utilizing meta-information.

Contribution

The paper proposes a novel zero-shot method for text clustering that enhances LLM capabilities through dataset-wide inference and meta-information aggregation.

Findings

01

Improved clustering performance on multiple datasets

02

Effective use of LLM-generated class labels

03

Demonstrated understanding of tasks via data analysis

Abstract

The advancements in large language models (LLMs) have brought significant progress in NLP tasks. However, if a task cannot be fully described in prompts, the models could fail to carry out the task. In this paper, we propose a simple yet effective method to contextualize a task toward a LLM. The method utilizes (1) open-ended zero-shot inference from the entire dataset, (2) aggregate the inference results, and (3) finally incorporate the aggregated meta-information for the actual task. We show the effectiveness in text clustering tasks, empowering LLMs to perform text-to-text-based clustering and leading to improvements on several datasets. Furthermore, we explore the generated class labels for clustering, showing how the LLM understands the task through data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech Recognition and Synthesis