Privacy-protected Retrieval-Augmented Generation for Knowledge Graph Question Answering

Yunfeng Ning; Mayi Xu; Jintao Wen; Qiankun Pi; Yuanyuan Zhu; Ming Zhong; Jiawei Jiang; Tieyun Qian

arXiv:2508.08785·cs.CL·December 4, 2025

Privacy-protected Retrieval-Augmented Generation for Knowledge Graph Question Answering

Yunfeng Ning, Mayi Xu, Jintao Wen, Qiankun Pi, Yuanyuan Zhu, Ming Zhong, Jiawei Jiang, Tieyun Qian

PDF

Open Access 1 Video

TL;DR

This paper introduces ARoG, a privacy-preserving framework for knowledge graph question answering that anonymizes entities and employs abstraction strategies to enable effective retrieval without exposing sensitive information.

Contribution

The paper proposes a novel ARoG framework with relation-centric and structure-oriented abstractions to protect privacy while enabling knowledge retrieval in KGQA.

Findings

01

ARoG achieves strong performance on three datasets.

02

ARoG demonstrates robustness in privacy protection.

03

The abstraction strategies improve retrieval effectiveness.

Abstract

LLMs often suffer from hallucinations and outdated or incomplete knowledge. RAG is proposed to address these issues by integrating external knowledge like that in KGs into LLMs. However, leveraging private KGs in RAG systems poses significant privacy risks due to the black-box nature of LLMs and potential insecure data transmission, especially when using third-party LLM APIs lacking transparency and control. In this paper, we investigate the privacy-protected RAG scenario for the first time, where entities in KGs are anonymous for LLMs, thus preventing them from accessing entity semantics. Due to the loss of semantics of entities, previous RAG systems cannot retrieve question-relevant knowledge from KGs by matching questions with the meaningless identifiers of anonymous entities. To realize an effective RAG system in this scenario, two key challenges must be addressed: (1) How can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Privacy-protected Retrieval-Augmented Generation for Knowledge Graph Question Answering· underline

Taxonomy

TopicsAdvanced Graph Neural Networks · Topic Modeling · Information Retrieval and Search Behavior