Retrieval Pivot Attacks in Hybrid RAG: Measuring and Mitigating Amplified Leakage from Vector Seeds to Graph Expansion
Scott Thornton

TL;DR
This paper identifies a security vulnerability in hybrid RAG systems where vector-to-graph pivoting can leak sensitive data, proposes metrics to measure this risk, and demonstrates that boundary enforcement effectively mitigates the leakage.
Contribution
The paper formalizes Retrieval Pivot Risk (RPR), introduces new metrics for measuring leakage, and provides a practical mitigation strategy through boundary authorization enforcement.
Findings
Hybrid RAG exhibits high pivot risk without defenses.
Enforcing authorization at the graph boundary eliminates leakage.
Natural shared entities can create pivot paths without adversarial injection.
Abstract
Hybrid Retrieval-Augmented Generation (RAG) pipelines combine vector similarity search with knowledge graph expansion for multi-hop reasoning. We show that this composition introduces a distinct security failure mode: a vector-retrieved "seed" chunk can pivot via entity links into sensitive graph neighborhoods, causing cross-tenant data leakage that does not occur in vector-only retrieval. We formalize this risk as Retrieval Pivot Risk (RPR) and introduce companion metrics Leakage@k, Amplification Factor, and Pivot Depth (PD) to quantify leakage magnitude and traversal structure. We present seven Retrieval Pivot Attacks that exploit the vector-to-graph boundary and show that adversarial injection is not required: naturally shared entities create cross-tenant pivot paths organically. Across a synthetic multi-tenant enterprise corpus and the Enron email corpus, the undefended hybrid…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Data Quality and Management · Adversarial Robustness in Machine Learning
