TL;DR
This paper presents Omni-SimpleMem, a unified multimodal memory framework for lifelong AI agents, discovered through an autonomous research pipeline that outperforms initial configurations on two benchmarks.
Contribution
The paper introduces an autonomous pipeline that discovers a novel multimodal memory system, surpassing baseline performance and revealing key architectural and bug fixes beyond hyperparameter tuning.
Findings
Achieved state-of-the-art results on LoCoMo and Mem-Gallery benchmarks.
Discovered that bug fixes and architectural changes have greater impact than hyperparameter tuning.
Provided a taxonomy of discovery types and properties of multimodal memory for autonomous research.
Abstract
AI agents increasingly operate over extended time horizons, yet their ability to retain, organize, and recall multimodal experiences remains a critical bottleneck. Building effective lifelong memory requires navigating a vast design space spanning architecture, retrieval strategies, prompt engineering, and data pipelines; this space is too large and interconnected for manual exploration or traditional AutoML to explore effectively. We deploy an autonomous research pipeline to discover Omni-SimpleMem, a unified multimodal memory framework for lifelong AI agents. Starting from a na\"ive baseline (F1=0.117 on LoCoMo), the pipeline autonomously executes experiments across two benchmarks, diagnosing failure modes, proposing architectural modifications, and repairing data pipeline bugs, all without human intervention in the inner loop. The resulting system achieves state-of-the-art…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
