MemMachine: A Ground-Truth-Preserving Memory System for Personalized AI Agents
Shu Wang, Edwin Yu, Oscar Love, Tom Zhang, Tom Wong, Steve Scargall, Charles Fan

TL;DR
MemMachine is an open-source memory system designed for personalized AI agents, integrating various memory types and advanced retrieval techniques to improve long-term memory accuracy and efficiency.
Contribution
It introduces a ground-truth-preserving architecture with contextualized retrieval, outperforming existing methods in accuracy and token efficiency for long-term memory tasks.
Findings
MemMachine achieves 0.9169 accuracy on LoCoMo with gpt4.1-mini.
It reaches 93.0% accuracy on LongMemEvalS with optimized retrieval strategies.
MemMachine reduces input tokens by approximately 80% compared to Mem0.
Abstract
Large Language Model (LLM) agents require persistent memory to maintain personalization, factual continuity, and long-horizon reasoning, yet standard context-window and retrieval-augmented generation (RAG) pipelines degrade over multi-session interactions. We present MemMachine, an open-source memory system that integrates short-term, long-term episodic, and profile memory within a ground-truth-preserving architecture that stores entire conversational episodes and reduces lossy LLM-based extraction. MemMachine uses contextualized retrieval that expands nucleus matches with surrounding context, improving recall when relevant evidence spans multiple dialogue turns. Across benchmarks, MemMachine achieves strong accuracy-efficiency tradeoffs: on LoCoMo it reaches 0.9169 using gpt4.1-mini; on LongMemEvalS (ICLR 2025), a six-dimension ablation yields 93.0 percent accuracy, with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
