TL;DR
MiniRAG is a simple, efficient retrieval-augmented generation system that lets small language models perform comparably to larger models with significantly reduced storage needs, making it suitable for resource-constrained environments.
Contribution
MiniRAG introduces a novel graph-based indexing and retrieval approach that improves the performance of lightweight RAG systems without relying on complex semantic understanding.
Findings
MiniRAG achieves performance comparable to LLM-based methods while using only 25% of the storage space.
It demonstrates effective knowledge retrieval using a simple, graph-structured approach.
The system is validated on a new benchmark dataset for lightweight RAG scenarios.
Abstract
The growing demand for efficient and lightweight Retrieval-Augmented Generation (RAG) systems has highlighted significant challenges when deploying Small Language Models (SLMs) in existing RAG frameworks. Current approaches face severe performance degradation due to SLMs' limited semantic understanding and text processing capabilities, creating barriers for widespread adoption in resource-constrained scenarios. To address these fundamental limitations, we present MiniRAG, a novel RAG system designed for extreme simplicity and efficiency. MiniRAG introduces two key technical innovations: (1) a semantic-aware heterogeneous graph indexing mechanism that combines text chunks and named entities in a unified structure, reducing reliance on complex semantic understanding, and (2) a lightweight topology-enhanced retrieval approach that leverages graph structures for efficient knowledge…
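The two ideas named in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the function names (`extract_entities`, `build_index`, `retrieve`), the capitalized-word entity heuristic, and the 1.0 / 0.5 scoring weights are all assumptions chosen only to show the shape of the technique. The index links text chunks and entities in one graph, and retrieval follows graph edges rather than relying on deep semantic understanding of the query.

```python
import re
from collections import defaultdict


def extract_entities(text):
    """Toy stand-in for named-entity extraction (an assumption, not MiniRAG's
    extractor): treat capitalized words of 3+ letters as entities.
    Note that this also picks up sentence-initial words such as "Where"."""
    return {w.lower() for w in re.findall(r"\b[A-Z][a-zA-Z]{2,}\b", text)}


def build_index(chunks):
    """Build a small heterogeneous graph with two node types.

    Returns two adjacency maps:
      entity_to_chunks: entity name -> set of chunk ids mentioning it
      chunk_to_entities: chunk id -> set of entity names it mentions
    """
    entity_to_chunks = defaultdict(set)
    chunk_to_entities = {}
    for cid, text in enumerate(chunks):
        ents = extract_entities(text)
        chunk_to_entities[cid] = ents
        for ent in ents:
            entity_to_chunks[ent].add(cid)
    return entity_to_chunks, chunk_to_entities


def retrieve(query, chunks, entity_to_chunks, chunk_to_entities, top_k=2):
    """Rank chunks using graph topology only.

    A chunk directly linked to a query entity scores 1.0 per link.
    Chunks reachable through a co-occurring entity score 0.5 per path.
    Both weights are hypothetical.
    """
    query_ents = extract_entities(query)
    scores = defaultdict(float)
    for ent in query_ents:
        for cid in entity_to_chunks.get(ent, ()):
            scores[cid] += 1.0  # direct entity-chunk edge
            for neighbor in chunk_to_entities[cid]:
                if neighbor in query_ents:
                    continue
                for other in entity_to_chunks[neighbor]:
                    if other != cid:
                        scores[other] += 0.5  # chunk reached via a shared entity
    # Sort by descending score, breaking ties by chunk id for determinism.
    ranked = sorted(scores, key=lambda c: (-scores[c], c))
    return [chunks[c] for c in ranked[:top_k]]
```

To try it, build the index once over a list of chunk strings with `build_index`, then pass the query and both returned maps to `retrieve`. A chunk that never mentions a query entity can still be returned if it shares an entity with a chunk that does, which is the role the graph structure plays in this sketch.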
Taxonomy
Methods: Layer Normalization · Dense Connections · Linear Warmup With Linear Decay · WordPiece · Attention Dropout · Adam · Residual Connection · Dropout · Softmax
