MemoRAG: Boosting Long Context Processing with Global Memory-Enhanced   Retrieval Augmentation

Hongjin Qian; Zheng Liu; Peitian Zhang; Kelong Mao; Defu Lian,; Zhicheng Dou; Tiejun Huang

arXiv:2409.05591·cs.CL·April 10, 2025·5 cites

MemoRAG: Boosting Long Context Processing with Global Memory-Enhanced Retrieval Augmentation

Hongjin Qian, Zheng Liu, Peitian Zhang, Kelong Mao, Defu Lian,, Zhicheng Dou, Tiejun Huang

PDF

Open Access 1 Repo

TL;DR

MemoRAG introduces a global memory-augmented retrieval framework that enhances large language models' ability to process long contexts efficiently and effectively, surpassing traditional RAG limitations.

Contribution

It proposes a novel dual-system RAG framework with a global memory module and feedback reinforcement, improving long-context understanding and retrieval capabilities.

Findings

01

Achieves superior performance on long-context tasks.

02

Effectively handles complex and simple scenarios.

03

Enhances retrieval accuracy with memory reinforcement.

Abstract

Processing long contexts presents a significant challenge for large language models (LLMs). While recent advancements allow LLMs to handle much longer contexts than before (e.g., 32K or 128K tokens), it is computationally expensive and can still be insufficient for many applications. Retrieval-Augmented Generation (RAG) is considered a promising strategy to address this problem. However, conventional RAG methods face inherent limitations because of two underlying requirements: 1) explicitly stated queries, and 2) well-structured knowledge. These conditions, however, do not hold in general long-context processing tasks. In this work, we propose MemoRAG, a novel RAG framework empowered by global memory-augmented retrieval. MemoRAG features a dual-system architecture. First, it employs a light but long-range system to create a global memory of the long context. Once a task is presented,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

qhjqhj00/memorag
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Automated Systems · Gaze Tracking and Assistive Technology

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Byte Pair Encoding · Softmax · Layer Normalization · Dropout · Attention Is All You Need · WordPiece · Residual Connection · Attention Dropout · Linear Layer