Auto-Dreamer: Learning Offline Memory Consolidation for Language Agents
Chongrui Ye, Yuxiang Liu, Yu Wang, Haofei Yu, Yining Zhao, Ge Liu, Julian McAuley, Jiaxuan You

TL;DR
Auto-Dreamer introduces a novel offline memory consolidation method for language agents, enabling better knowledge abstraction and reuse across sessions, leading to improved performance with less memory usage.
Contribution
It proposes a decoupled offline memory consolidator trained via GRPO, which enhances cross-session knowledge abstraction for language agents.
Findings
Outperforms baselines on ScienceWorld by 7 points with smaller memory.
Maintains superior performance on ALFWorld and WebArena without retraining.
Uses 6-12 times less memory than strongest baselines.
Abstract
Language agents increasingly operate over streams of related tasks, yet existing memory systems struggle to convert accumulated experience into reusable knowledge. Retrieval-augmented and structured memory methods record per-session observations effectively, but often couple acquisition and consolidation into a single online process, leaving the agent without a global view across sessions to discover recurring patterns, abstract shared procedures, or prune redundant entries. Inspired by complementary learning systems theory, we propose Auto-Dreamer, a learned offline consolidator for language-agent memory. Auto-Dreamer decouples fast per-session memory acquisition from slow cross-session consolidation. Given a selected working region of a typed memory bank, the consolidator treats the region as read-only evidence, performs bounded tool-use to inspect entries and provenance-linked source…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
