From RAG to Memory: Non-Parametric Continual Learning for Large Language Models

Bernal Jim\'enez Guti\'errez; Yiheng Shu; Weijian Qi; Sizhe Zhou; Yu Su

arXiv:2502.14802·cs.CL·June 23, 2025·2 cites

From RAG to Memory: Non-Parametric Continual Learning for Large Language Models

Bernal Jim\'enez Guti\'errez, Yiheng Shu, Weijian Qi, Sizhe Zhou, Yu Su

PDF

Open Access 1 Repo 1 Datasets 1 Video

TL;DR

This paper introduces HippoRAG 2, a non-parametric continual learning framework for large language models that significantly improves factual, sense-making, and associative memory performance over existing RAG methods.

Contribution

HippoRAG 2 enhances retrieval-augmented generation with deeper passage integration and better online LLM use, advancing non-parametric continual learning for LLMs.

Findings

01

7% improvement in associative memory tasks

02

Outperforms standard RAG on factual memory tasks

03

Achieves human-like long-term memory capabilities

Abstract

Our ability to continuously acquire, organize, and leverage knowledge is a key feature of human intelligence that AI systems must approximate to unlock their full potential. Given the challenges in continual learning with large language models (LLMs), retrieval-augmented generation (RAG) has become the dominant way to introduce new information. However, its reliance on vector retrieval hinders its ability to mimic the dynamic and interconnected nature of human long-term memory. Recent RAG approaches augment vector embeddings with various structures like knowledge graphs to address some of these gaps, namely sense-making and associativity. However, their performance on more basic factual memory tasks drops considerably below standard RAG. We address this unintended deterioration and propose HippoRAG 2, a framework that outperforms standard RAG comprehensively on factual, sense-making,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

osu-nlp-group/hipporag
noneOfficial

Datasets

osunlp/HippoRAG_2
dataset· 211 dl
211 dl

Videos

From RAG to Memory: Non-Parametric Continual Learning for Large Language Models· slideslive

Taxonomy

TopicsSpeech Recognition and Synthesis

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Weight Decay · Linear Layer · Byte Pair Encoding · WordPiece · Layer Normalization · Residual Connection · Dense Connections · Attention Dropout