RecallM: An Adaptable Memory Mechanism with Temporal Understanding for   Large Language Models

Brandon Kynoch; Hugo Latapie; Dwane van der Sluis

arXiv:2307.02738·cs.AI·October 4, 2023

RecallM: An Adaptable Memory Mechanism with Temporal Understanding for Large Language Models

Brandon Kynoch, Hugo Latapie, Dwane van der Sluis

PDF

Open Access 1 Repo

TL;DR

RecallM introduces an adaptable long-term memory system for Large Language Models, enhancing belief updating and temporal understanding, which improves knowledge management and reasoning capabilities beyond existing methods.

Contribution

The paper presents RecallM, a novel memory architecture that significantly improves belief updating and temporal understanding in LLMs, surpassing vector database methods.

Findings

01

RecallM is four times more effective than vector databases for belief updating.

02

RecallM achieves competitive results on question-answering and in-context learning tasks.

03

The architecture enhances long-term knowledge management in LLMs.

Abstract

Large Language Models (LLMs) have made extraordinary progress in the field of Artificial Intelligence and have demonstrated remarkable capabilities across a large variety of tasks and domains. However, as we venture closer to creating Artificial General Intelligence (AGI) systems, we recognize the need to supplement LLMs with long-term memory to overcome the context window limitation and more importantly, to create a foundation for sustained reasoning, cumulative learning and long-term user interaction. In this paper we propose RecallM, a novel architecture for providing LLMs with an adaptable and updatable long-term memory mechanism. Unlike previous methods, the RecallM architecture is particularly effective at belief updating and maintaining a temporal understanding of the knowledge provided to it. We demonstrate through various experiments the effectiveness of this architecture.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cisco-open/DeepVision/tree/main/recallm
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Graph Neural Networks