Anatomy of Agentic Memory: Taxonomy and Empirical Analysis of Evaluation and System Limitations

Dongming Jiang; Yi Li; Songtao Wei; Jinxin Yang; Ayushi Kishore; Alysa Zhao; Dingyi Kang; Xu Hu; Feng Chen; Qiannan Li; Bingzhe Li

arXiv:2602.19320·cs.CL·May 21, 2026

Anatomy of Agentic Memory: Taxonomy and Empirical Analysis of Evaluation and System Limitations

Dongming Jiang, Yi Li, Songtao Wei, Jinxin Yang, Ayushi Kishore, Alysa Zhao, Dingyi Kang, Xu Hu, Feng Chen, Qiannan Li, Bingzhe Li

PDF

1 Repo

TL;DR

This paper provides a structured taxonomy and empirical analysis of agentic memory systems in large language models, highlighting current limitations and proposing directions for improved evaluation and scalability.

Contribution

It introduces a taxonomy of MAG systems, analyzes key empirical limitations, and connects memory structures to performance issues, guiding future research.

Findings

01

Benchmark saturation affects evaluation reliability.

02

Performance varies significantly across backbone models.

03

Memory maintenance incurs latency and throughput costs.

Abstract

Agentic memory systems enable large language model (LLM) agents to maintain state across long interactions, supporting long-horizon reasoning and personalization beyond fixed context windows. Despite rapid architectural development, the empirical foundations of these systems remain fragile: existing benchmarks are often underscaled, evaluation metrics are misaligned with semantic utility, performance varies significantly across backbone models, and system-level costs are frequently overlooked. This survey presents a structured analysis of agentic memory from both architectural and system perspectives. We first introduce a concise taxonomy of MAG systems based on four memory structures. Then, we analyze key pain points limiting current systems, including benchmark saturation effects, metric validity and judge sensitivity, backbone-dependent accuracy, and the latency and throughput…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fredjiang0324/Anatomy-of-Agentic-Memory
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFerroelectric and Negative Capacitance Devices · Multimodal Machine Learning Applications · Reinforcement Learning in Robotics