HyMem: Hybrid Memory Architecture with Dynamic Retrieval Scheduling

Xiaochen Zhao; Kaikai Wang; Xiaowen Zhang; Chen Yao; Aili Wang

arXiv:2602.13933·cs.AI·May 4, 2026

HyMem: Hybrid Memory Architecture with Dynamic Retrieval Scheduling

Xiaochen Zhao, Kaikai Wang, Xiaowen Zhang, Chen Yao, Aili Wang

PDF

TL;DR

HyMem introduces a hybrid memory system with dynamic retrieval scheduling for LLMs, balancing efficiency and effectiveness in long-term memory management, inspired by human cognitive economy principles.

Contribution

The paper presents HyMem, a novel hybrid memory architecture with multi-granular storage and dynamic retrieval, improving long-term memory handling in LLMs.

Findings

01

HyMem outperforms full-context methods on LOCOMO and LongMemEval benchmarks.

02

HyMem reduces computational cost by 92.6%.

03

HyMem achieves a better balance between efficiency and performance.

Abstract

Large language model (LLM) agents demonstrate strong performance in short-text contexts but often underperform in extended dialogues due to inefficient memory management. Existing approaches face a fundamental trade-off between efficiency and effectiveness: memory compression risks losing critical details required for complex reasoning, while retaining raw text introduces unnecessary computational overhead for simple queries. The crux lies in the limitations of monolithic memory representations and static retrieval mechanisms, which fail to emulate the flexible and proactive memory scheduling capabilities observed in humans, thus struggling to adapt to diverse problem scenarios. Inspired by the principle of cognitive economy, we propose HyMem, a hybrid memory architecture that enables dynamic on-demand scheduling through multi-granular memory representations. HyMem adopts a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.