Zero-RAG: Towards Retrieval-Augmented Generation with Zero Redundant Knowledge
Qi Luo, Xiaonan Li, Junqi Dai, Shuang Cheng, Xipeng Qiu

TL;DR
Zero-RAG reduces redundant external knowledge in retrieval-augmented generation, pruning the corpus by 30% and speeding up retrieval by 22%, while maintaining performance, thus addressing inefficiencies caused by knowledge redundancy.
Contribution
The paper introduces Zero-RAG, a method that prunes redundant knowledge from the corpus and enhances internal knowledge utilization, improving efficiency without performance loss.
Findings
Pruned Wikipedia corpus by 30%.
Accelerated retrieval stage by 22%.
Maintained RAG performance after pruning.
Abstract
Retrieval-Augmented Generation has shown remarkable results to address Large Language Models' hallucinations, which usually uses a large external corpus to supplement knowledge to LLMs. However, with the development of LLMs, the internal knowledge of LLMs has expanded significantly, thus causing significant knowledge redundancy between the external corpus and LLMs. On the one hand, the indexing cost of dense retrieval is highly related to the corpus size and thus significant redundant knowledge intensifies the dense retrieval's workload. On the other hand, the redundant knowledge in the external corpus is not helpful to LLMs and our exploratory analysis shows that it instead hurts the RAG performance on those questions which the LLM can answer by itself. To address these issues, we propose Zero-RAG to tackle these challenges. Specifically, we first propose the Mastery-Score metric to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Advanced Graph Neural Networks
