Cognitively-Inspired Episodic Memory Architectures for Accurate and Efficient Character AI
Rafael Arias Gonzalez, Steve DiPaola

TL;DR
This paper introduces a cognitively-inspired episodic memory system for character AI that balances response depth and efficiency, enabling accurate, resource-efficient dialogue and biographical analysis.
Contribution
The authors develop a novel architecture combining offline data augmentation and parallel retrieval from structured memory, improving response quality and speed for character AI.
Findings
Achieves 0.52s prompt generation latency.
Performs on par with GPT-4 RAG on large models.
Outperforms smaller models like GPT-3.5 and GPT-3.
Abstract
Large language models show promise for embodying historical characters in dialogue systems, but existing approaches face a critical trade-off: simple retrieval-augmented generation produces shallow responses, while multi-stage reflection achieves depth at prohibitive latency. We present an architecture that resolves this tension through offline data augmentation and efficient parallel retrieval from structured episodic memory. Our system transforms biographical data into 1,774 enriched first-person memories with affective-semantic metadata, then employs two-stage retrieval achieving 0.52s prompt generation. Evaluation using LLM-as-judge and RAGAs metrics shows our approach achieves parity with traditional RAG on GPT-4 while significantly outperforming it on smaller models (GPT-3.5, GPT-3), suggesting particular value for resource-constrained deployments. Beyond dialogue, the structured…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Speech and dialogue systems
