# Youtu-GraphRAG: Vertically Unified Agents for Graph Retrieval-Augmented Complex Reasoning

**Authors:** Junnan Dong, Siyu An, Yifei Yu, Qian-Wen Zhang, Linhao Luo, Xiao Huang, Yunsheng Wu, Di Yin, Xing Sun

arXiv: 2508.19855 · 2025-09-04

## TL;DR

Youtu-GraphRAG introduces a vertically unified framework for graph retrieval-augmented generation, enhancing complex reasoning in large language models through integrated graph construction, retrieval, and reasoning with improved scalability and domain transferability.

## Contribution

It presents a novel integrated agentic paradigm that unifies graph schema extraction, hierarchical knowledge organization, and query reasoning, outperforming prior isolated approaches.

## Key findings

- Achieves up to 90.71% token cost savings.
- Improves accuracy by 16.62% over baselines.
- Demonstrates robustness across six benchmarks.

## Abstract

Graph retrieval-augmented generation (GraphRAG) has effectively enhanced large language models in complex reasoning by organizing fragmented knowledge into explicitly structured graphs. Prior efforts have been made to improve either graph construction or graph retrieval in isolation, yielding suboptimal performance, especially when domain shifts occur. In this paper, we propose a vertically unified agentic paradigm, Youtu-GraphRAG, to jointly connect the entire framework as an intricate integration. Specifically, (i) a seed graph schema is introduced to bound the automatic extraction agent with targeted entity types, relations and attribute types, also continuously expanded for scalability over unseen domains; (ii) To obtain higher-level knowledge upon the schema, we develop novel dually-perceived community detection, fusing structural topology with subgraph semantics for comprehensive knowledge organization. This naturally yields a hierarchical knowledge tree that supports both top-down filtering and bottom-up reasoning with community summaries; (iii) An agentic retriever is designed to interpret the same graph schema to transform complex queries into tractable and parallel sub-queries. It iteratively performs reflection for more advanced reasoning; (iv) To alleviate the knowledge leaking problem in pre-trained LLM, we propose a tailored anonymous dataset and a novel 'Anonymity Reversion' task that deeply measures the real performance of the GraphRAG frameworks. Extensive experiments across six challenging benchmarks demonstrate the robustness of Youtu-GraphRAG, remarkably moving the Pareto frontier with up to 90.71% saving of token costs and 16.62% higher accuracy over state-of-the-art baselines. The results indicate our adaptability, allowing seamless domain transfer with minimal intervention on schema.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2508.19855/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/2508.19855/full.md

## References

30 references — full list in the complete paper: https://tomesphere.com/paper/2508.19855/full.md

---
Source: https://tomesphere.com/paper/2508.19855