Empowering Working Memory for Large Language Model Agents

Jing Guo; Nan Li; Jianchuan Qi; Hang Yang; Ruiqiao Li; Yuzhen Feng; Si; Zhang; Ming Xu

arXiv:2312.17259·cs.CL·May 29, 2024·5 cites

Empowering Working Memory for Large Language Model Agents

Jing Guo, Nan Li, Jianchuan Qi, Hang Yang, Ruiqiao Li, Yuzhen Feng, Si, Zhang, Ming Xu

PDF

Open Access

TL;DR

This paper proposes a novel architecture for large language models that incorporates a working memory system inspired by cognitive psychology to improve memory retention and contextual reasoning across interactions.

Contribution

It introduces a centralized Working Memory Hub and Episodic Buffer to enhance LLMs' memory capabilities, addressing limitations of traditional memory designs.

Findings

01

Enhanced memory retention across episodes

02

Improved contextual reasoning in complex tasks

03

Blueprint for future memory-augmented LLMs

Abstract

Large language models (LLMs) have achieved impressive linguistic capabilities. However, a key limitation persists in their lack of human-like memory faculties. LLMs exhibit constrained memory retention across sequential interactions, hindering complex reasoning. This paper explores the potential of applying cognitive psychology's working memory frameworks, to enhance LLM architecture. The limitations of traditional LLM memory designs are analyzed, including their isolation of distinct dialog episodes and lack of persistent memory links. To address this, an innovative model is proposed incorporating a centralized Working Memory Hub and Episodic Buffer access to retain memories across episodes. This architecture aims to provide greater continuity for nuanced contextual reasoning during intricate tasks and collaborative scenarios. While promising, further research is required into…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques