Ex-GraphRAG: Interpretable Evidence Routing for Graph-Augmented LLMs
Yoav Kor Sade, Arvindh Arun, Rishi Puri, Steffen Staab, Maya Bechler-Speicher

TL;DR
Ex-GraphRAG introduces an interpretable encoder for graph-augmented language models that precisely attributes influence to individual nodes, revealing semantic-structural mismatches affecting model performance.
Contribution
It replaces the traditional GNN encoder with M-GNAN, enabling exact attribution of encoder outputs to nodes and feature groups, facilitating faithful auditing.
Findings
Auditable encoder matches black-box performance on STaRK-Prime.
Discovered semantic-structural mismatch in evidence routing.
Removing low-attribution intermediaries degrades multi-hop QA by up to 28%.
Abstract
GraphRAG conditions language models on subgraphs retrieved from knowledge graphs, encoded via message-passing GNNs. Because these encoders entangle node contributions through iterated neighborhood aggregation, there is no closed-form way to determine how much each retrieved entity influenced the encoder's output, and therefore no way to faithfully audit what structural evidence actually reached the model. We introduce Ex-GraphRAG, which replaces the GNN encoder with a Multivariate Graph Neural Additive Network (M-GNAN), an extension of additive graph models to high-dimensional embedding spaces that yields an exact decomposition of the encoder's output across individual nodes and feature groups, without post-hoc approximation. On STaRK-Prime, this auditable encoder matches black-box performance. Using it to audit evidence routing, we uncover a semantic-structural mismatch: the nodes that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
