MMAG: Mixed Memory-Augmented Generation for Large Language Models Applications
Stefano Zeppieri

TL;DR
This paper presents MMAG, a multi-layered memory framework for LLMs inspired by human cognition, enhancing long-term relevance, personalization, and continuity in conversational agents.
Contribution
Introduces the MMAG framework organizing multiple memory types for LLMs, with implementation in the Heero agent demonstrating improved engagement and retention.
Findings
Encrypted long-term memory improves user engagement.
Multi-layered memory enhances conversational coherence.
Framework addresses privacy, storage, and latency challenges.
Abstract
Large Language Models (LLMs) excel at generating coherent text within a single prompt but fall short in sustaining relevance, personalization, and continuity across extended interactions. Human communication, however, relies on multiple forms of memory, from recalling past conversations to adapting to personal traits and situational context. This paper introduces the Mixed Memory-Augmented Generation (MMAG) pattern, a framework that organizes memory for LLM-based agents into five interacting layers: conversational, long-term user, episodic and event-linked, sensory and context-aware, and short-term working memory. Drawing inspiration from cognitive psychology, we map these layers to technical components and outline strategies for coordination, prioritization, and conflict resolution. We demonstrate the approach through its implementation in the Heero conversational agent, where…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · AI in Service Interactions · Topic Modeling
