Zero-RAG: Towards Retrieval-Augmented Generation with Zero Redundant Knowledge

Qi Luo; Xiaonan Li; Junqi Dai; Shuang Cheng; Xipeng Qiu

arXiv:2511.00505·cs.CL·January 26, 2026

Zero-RAG: Towards Retrieval-Augmented Generation with Zero Redundant Knowledge

Qi Luo, Xiaonan Li, Junqi Dai, Shuang Cheng, Xipeng Qiu

PDF

Open Access

TL;DR

Zero-RAG reduces redundant external knowledge in retrieval-augmented generation, pruning the corpus by 30% and speeding up retrieval by 22%, while maintaining performance, thus addressing inefficiencies caused by knowledge redundancy.

Contribution

The paper introduces Zero-RAG, a method that prunes redundant knowledge from the corpus and enhances internal knowledge utilization, improving efficiency without performance loss.

Findings

01

Pruned Wikipedia corpus by 30%.

02

Accelerated retrieval stage by 22%.

03

Maintained RAG performance after pruning.

Abstract

Retrieval-Augmented Generation has shown remarkable results to address Large Language Models' hallucinations, which usually uses a large external corpus to supplement knowledge to LLMs. However, with the development of LLMs, the internal knowledge of LLMs has expanded significantly, thus causing significant knowledge redundancy between the external corpus and LLMs. On the one hand, the indexing cost of dense retrieval is highly related to the corpus size and thus significant redundant knowledge intensifies the dense retrieval's workload. On the other hand, the redundant knowledge in the external corpus is not helpful to LLMs and our exploratory analysis shows that it instead hurts the RAG performance on those questions which the LLM can answer by itself. To address these issues, we propose Zero-RAG to tackle these challenges. Specifically, we first propose the Mastery-Score metric to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Advanced Graph Neural Networks