Beyond Fact Retrieval: Episodic Memory for RAG with Generative Semantic Workspaces
Shreyas Rajesh, Pavan Holur, Chenda Duan, David Chong, Vwani Roychowdhury

TL;DR
This paper introduces the Generative Semantic Workspace, a neuro-inspired memory framework that enhances large language models' ability to reason over long, evolving narratives by building structured, interpretable episodic representations.
Contribution
The paper presents GSW, a novel generative memory framework that constructs structured, interpretable episodic representations, enabling LLMs to track entities and events over extended contexts.
Findings
GSW outperforms existing RAG baselines by up to 20% on EpBench.
GSW reduces query-time tokens by 51%, improving efficiency.
GSW demonstrates human-like episodic memory capabilities in LLMs.
Abstract
Large Language Models (LLMs) face fundamental challenges in long-context reasoning: many documents exceed their finite context windows, while performance on texts that do fit degrades with sequence length, necessitating their augmentation with external memory frameworks. Current solutions, which have evolved from retrieval using semantic embeddings to more sophisticated structured knowledge graphs representations for improved sense-making and associativity, are tailored for fact-based retrieval and fail to build the space-time-anchored narrative representations required for tracking entities through episodic events. To bridge this gap, we propose the \textbf{Generative Semantic Workspace} (GSW), a neuro-inspired generative memory framework that builds structured, interpretable representations of evolving situations, enabling LLMs to reason over evolving roles, actions, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Multimodal Machine Learning Applications
