Knowledge Pyramid Construction for Multi-Level Retrieval-Augmented Generation
Rubing Chen, Xulu Zhang, Jiaxin Wu, Wenqi Fan, Xiao-Yong Wei, Qing Li

TL;DR
This paper introduces a multi-layer knowledge pyramid for retrieval-augmented generation to improve answer precision, employing cross-layer techniques and benchmarks, significantly enhancing GPT-4's performance.
Contribution
It proposes a novel multi-layer knowledge pyramid framework with dynamic updates and filtering, advancing retrieval-augmented generation methods.
Findings
Outperforms 19 state-of-the-art methods in experiments.
Achieves 395% F1 score improvement on GPT-4.
Introduces two domain-specific knowledge retrieval benchmarks.
Abstract
This paper addresses the need for improved precision in existing knowledge-enhanced question-answering frameworks, specifically Retrieval-Augmented Generation (RAG) methods that primarily focus on enhancing recall. We propose a multi-layer knowledge pyramid approach within the RAG framework to achieve a better balance between precision and recall. The knowledge pyramid consists of three layers: Ontologies, Knowledge Graphs (KGs), and chunk-based raw text. We employ cross-layer augmentation techniques for comprehensive knowledge coverage and dynamic updates of the Ontology schema and instances. To ensure compactness, we utilize cross-layer filtering methods for knowledge condensation in KGs. Our approach, named PolyRAG, follows a waterfall model for retrieval, starting from the top of the pyramid and progressing down until a confident answer is obtained. We introduce two benchmarks for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Softmax · Absolute Position Encodings · Dense Connections · Dropout · Linear Layer · Attention Dropout
