Query-Efficient Agentic Graph Extraction Attacks on GraphRAG Systems

Shuhua Yang; Jiahao Zhang; Yilong Wang; Dongwon Lee; Suhang Wang

arXiv:2601.14662·cs.AI·April 21, 2026

Query-Efficient Agentic Graph Extraction Attacks on GraphRAG Systems

Shuhua Yang, Jiahao Zhang, Yilong Wang, Dongwon Lee, Suhang Wang

PDF

1 Repo 1 Datasets

TL;DR

This paper introduces AGEA, a novel query-efficient black-box attack method that effectively reconstructs hidden knowledge graphs in GraphRAG systems, revealing significant vulnerabilities under limited query budgets.

Contribution

The paper presents AGEA, a new framework combining exploration, external memory, and filtering to efficiently extract knowledge graphs, outperforming prior attacks.

Findings

01

AGEA recovers up to 90% of entities and relationships.

02

AGEA outperforms prior attack baselines under the same query budgets.

03

Modern GraphRAG systems are highly vulnerable to structured extraction attacks.

Abstract

Graph-based retrieval-augmented generation (GraphRAG) systems construct knowledge graphs over document collections to support multi-hop reasoning. While prior work shows that GraphRAG responses may leak retrieved subgraphs, the feasibility of query-efficient reconstruction of the hidden graph structure remains unexplored under realistic query budgets. We study a budget-constrained black-box setting where an adversary adaptively queries the system to steal its latent entity-relation graph. We propose AGEA (Agentic Graph Extraction Attack), a framework that leverages a novelty-guided exploration-exploitation strategy, external graph memory modules, and a two-stage graph extraction pipeline combining lightweight discovery with LLM-based filtering. We evaluate AGEA on medical, agriculture, and literary datasets across Microsoft-GraphRAG and LightRAG systems. Under identical query budgets,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shuashua0608/AGEA
github

Datasets

molmohsen/awesome-ai-agent-papers
dataset· 39 dl
39 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.