Clue-RAG: Towards Accurate and Cost-Efficient Graph-based RAG via Multi-Partite Graph and Query-Driven Iterative Retrieval
Yaodong Su, Yixiang Fang, Yingli Zhou, Quanqing Xu, Chuanhui Yang

TL;DR
Clue-RAG introduces a multi-partite graph and query-driven iterative retrieval to improve accuracy and reduce costs in graph-based retrieval-augmented generation for question answering.
Contribution
It proposes a novel multi-partite graph index and a query-driven iterative retrieval strategy, enhancing relevance and efficiency over existing graph-based RAG methods.
Findings
Achieves up to 99.33% higher accuracy on QA benchmarks.
Reduces indexing costs by 72.58%.
Outperforms baselines even without LLM for indexing.
Abstract
Despite the remarkable progress of Large Language Models (LLMs), their performance in question answering (QA) remains limited by the lack of domain-specific and up-to-date knowledge. Retrieval-Augmented Generation (RAG) addresses this limitation by incorporating external information, often from graph-structured data. However, existing graph-based RAG methods suffer from poor graph quality due to incomplete extraction and insufficient utilization of query information during retrieval. To overcome these limitations, we propose Clue-RAG, a novel approach that introduces (1) a multi-partite graph index incorporates Chunk, knowledge unit, and entity to capture semantic content at multiple levels of granularity, coupled with a hybrid extraction strategy that reduces LLM token usage while still producing accurate and disambiguated knowledge units, and (2) Q-Iter, a query-driven iterative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Graph Theory and Algorithms
