LLM-MemCluster: Empowering Large Language Models with Dynamic Memory for Text Clustering

Yuanjie Zhu; Liangwei Yang; Ke Xu; Weizhi Zhang; Zihe Song; Jindong Wang; Philip S. Yu

arXiv:2511.15424 · cs.CL · April 8, 2026

TL;DR

LLM-MemCluster is an end-to-end framework that equips large language models with a dynamic memory and a dual-prompt strategy, so they can cluster text without relying on external modules.

Contribution

LLM-MemCluster reframes clustering as an LLM-native task: iterative refinement and automatic determination of the number of clusters both happen within a single unified framework.

Findings

01. Outperforms strong baselines on benchmark datasets

02. Eliminates the need for external modules or complex pipelines

03. Provides an interpretable and tuning-free clustering approach

Abstract

Large Language Models (LLMs) are reshaping unsupervised learning by offering an unprecedented ability to perform text clustering based on their deep semantic understanding. However, their direct application is fundamentally limited by a lack of stateful memory for iterative refinement and the difficulty of managing cluster granularity. As a result, existing methods often rely on complex pipelines with external modules, sacrificing a truly end-to-end approach. We introduce LLM-MemCluster, a novel framework that reconceptualizes clustering as a fully LLM-native task. It leverages a Dynamic Memory to instill state awareness and a Dual-Prompt Strategy to enable the model to reason about and determine the number of clusters. Evaluated on several benchmark datasets, our tuning-free framework significantly and consistently outperforms strong baselines. LLM-MemCluster presents an effective,…
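To make the idea of state-aware clustering concrete, here is a minimal Python sketch of a clustering loop that carries a memory of cluster labels between LLM calls and uses two different prompts. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not specify what the memory stores or what the two prompts say, so `EXPLORE_PROMPT`, `REFINE_PROMPT`, `cluster_with_memory`, the two-pass structure, and the stand-in model `toy_llm` are all hypothetical.

```python
from typing import Callable, Dict, List

# Hypothetical prompt templates; the wording is illustrative, not from the paper.
EXPLORE_PROMPT = (
    "Known cluster labels: {labels}\n"
    "Text: {text}\n"
    "Answer with one known label, or a new label if none fits."
)
REFINE_PROMPT = (
    "Known cluster labels: {labels}\n"
    "Text: {text}\n"
    "Answer with the single best known label."
)


def cluster_with_memory(
    texts: List[str], llm: Callable[[str], str]
) -> Dict[str, List[str]]:
    """Cluster texts with an LLM callable that sees the labels found so far."""
    # Assumed form of the memory: label -> member texts, carried across calls.
    memory: Dict[str, List[str]] = {}

    # Pass 1: each call sees the labels created so far and may add a new one,
    # so the number of clusters is not fixed in advance.
    for text in texts:
        prompt = EXPLORE_PROMPT.format(
            labels=", ".join(memory) or "(none)", text=text
        )
        memory.setdefault(llm(prompt).strip(), []).append(text)

    # Pass 2: re-ask every text against the full label set from pass 1.
    # The reply is trusted as-is; a real system would validate it.
    labels = ", ".join(memory)
    refined: Dict[str, List[str]] = {}
    for text in texts:
        label = llm(REFINE_PROMPT.format(labels=labels, text=text)).strip()
        refined.setdefault(label, []).append(text)
    return refined


def toy_llm(prompt: str) -> str:
    """Deterministic stand-in for a real model; reads only the 'Text:' line."""
    text = next(
        line for line in prompt.splitlines() if line.startswith("Text: ")
    ).lower()
    if "goal" in text or "match" in text:
        return "sports"
    if "stock" in text or "bank" in text:
        return "finance"
    return "other"
```

Calling `cluster_with_memory(texts, toy_llm)` returns a dictionary mapping each label to its member texts. Swapping `toy_llm` for a function that calls a real model leaves the loop unchanged, since the only state shared between calls is the label list written into each prompt.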
