Do We Still Need GraphRAG? Benchmarking RAG and GraphRAG for Agentic Search Systems
Dongzhe Fan, Zheyi Xue, Siyuan Liu, Qiaoyu Tan

TL;DR
This paper benchmarks RAG and GraphRAG within agentic search systems, revealing that agentic search enhances dense RAG performance but GraphRAG remains superior for complex reasoning tasks.
Contribution
It introduces RAGSearch, a comprehensive benchmark for evaluating RAG and GraphRAG in agentic search, standardizing protocols and analyzing their relative strengths.
Findings
Agentic search improves dense RAG performance significantly.
GraphRAG remains more stable and effective for multi-hop reasoning.
Performance gap narrows but does not close entirely between dense RAG and GraphRAG.
Abstract
Retrieval-augmented generation (RAG) and its graph-based extensions (GraphRAG) are effective paradigms for improving large language model (LLM) reasoning by grounding generation in external knowledge. However, most existing RAG and GraphRAG systems operate under static or one-shot retrieval, where a fixed set of documents is provided to the LLM in a single pass. In contrast, recent agentic search systems enable dynamic, multi-round retrieval and sequential decision-making during inference, and have shown strong gains when combined with vanilla RAG by introducing implicit structure through interaction. This progress raises a fundamental question: can agentic search compensate for the absence of explicit graph structure, reducing the need for costly GraphRAG pipelines? To answer this question, we introduce RAGSearch, a unified benchmark that evaluates dense RAG and representative GraphRAG…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
