CLAG: Adaptive Memory Organization via Agent-Driven Clustering for Small Language Model Agents

Taeyun Roh; Wonjune Jang; Junha Jung; Jaewoo Kang

arXiv:2603.15421 · cs.CL · April 21, 2026

TL;DR

CLAG is a memory organization framework for small language model agents that uses clustering to improve knowledge retention, reduce interference, and enhance answer quality.

Contribution

The paper introduces an agent-driven clustering memory system in which an SLM agent organizes stored memories into clusters and retrieves from them, improving performance over memory methods that rely on a single global retrieval pool.

Findings

01. CLAG improves answer quality across multiple QA datasets.

02. It reduces cross-topic interference in memory retrieval.

03. CLAG remains lightweight and efficient for small language models.

Abstract

Large language model agents rely heavily on external memory to support knowledge reuse and complex reasoning tasks. Yet most memory systems store experiences in a single global retrieval pool, which can gradually dilute or corrupt stored knowledge. This problem is especially pronounced for small language models (SLMs), which are highly vulnerable to irrelevant context. We introduce CLAG, a CLustering-based AGentic memory framework in which an SLM agent actively organizes memory by clustering. CLAG employs an SLM-driven router to assign incoming memories to semantically coherent clusters and autonomously generates cluster-specific profiles, including topic summaries and descriptive tags, to establish each cluster as a self-contained functional unit. By performing localized evolution within these structured neighborhoods, CLAG effectively reduces cross-topic interference and enhances internal…
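To make the routing idea in the abstract concrete, here is a minimal sketch of cluster-scoped memory. It is not the authors' implementation: CLAG uses an SLM-driven router and generated topic summaries and tags, whereas this toy version assumes a plain token-overlap score as the router and a bag of tokens as the cluster profile. The names `ClusteredMemory`, `Cluster`, `route`, `write`, `retrieve`, and the `threshold` parameter are all hypothetical and chosen only for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Optional


def tokens(text: str) -> set[str]:
    """Lowercased word set; a crude stand-in for semantic understanding."""
    words = (w.strip(".,!?").lower() for w in text.split())
    return {w for w in words if w}


def overlap(query: set[str], target: set[str]) -> float:
    """Fraction of query tokens that also appear in target."""
    return len(query & target) / len(query) if query else 0.0


@dataclass
class Cluster:
    memories: list[str] = field(default_factory=list)
    # Assumption: the "profile" is just the union of member tokens,
    # standing in for the summaries and tags described in the abstract.
    profile: set[str] = field(default_factory=set)

    def add(self, memory: str) -> None:
        self.memories.append(memory)
        self.profile |= tokens(memory)


class ClusteredMemory:
    def __init__(self, threshold: float = 0.3) -> None:
        self.threshold = threshold  # hypothetical routing cutoff
        self.clusters: list[Cluster] = []

    def route(self, text: str) -> Optional[int]:
        """Return the index of the best-matching cluster, or None."""
        q = tokens(text)
        best, best_score = None, 0.0
        for i, cluster in enumerate(self.clusters):
            score = overlap(q, cluster.profile)
            if score > best_score:
                best, best_score = i, score
        return best if best_score >= self.threshold else None

    def write(self, memory: str) -> int:
        """Store a memory in its routed cluster, opening a new one if needed."""
        idx = self.route(memory)
        if idx is None:
            self.clusters.append(Cluster())
            idx = len(self.clusters) - 1
        self.clusters[idx].add(memory)
        return idx

    def retrieve(self, query: str, k: int = 2) -> list[str]:
        """Search only inside the routed cluster, never the global pool."""
        idx = self.route(query)
        if idx is None:
            return []
        q = tokens(query)
        ranked = sorted(
            self.clusters[idx].memories,
            key=lambda m: overlap(q, tokens(m)),
            reverse=True,
        )
        return ranked[:k]


mem = ClusteredMemory(threshold=0.3)
for note in [
    "Alice adopted a puppy named Biscuit",
    "Biscuit the puppy loves fetch",
    "Bob booked flights to Lisbon",
    "Lisbon flights leave Friday",
]:
    mem.write(note)

print(mem.retrieve("What is the puppy named?", k=5))
```

Because retrieval is restricted to the routed cluster, the travel notes cannot surface for the puppy question even when `k` exceeds the cluster size. That is the interference-reduction effect the abstract describes, shown here under the simplifying assumptions above.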
