Private Graph Extraction via Feature Explanations
Iyiola E. Olatunji, Mandeep Rathee, Thorben Funke, Megha Khosla

TL;DR
This paper investigates how explanations of graph neural network models can be exploited to reconstruct training graph structures, revealing privacy risks and proposing defenses to mitigate such attacks.
Contribution
It introduces novel graph reconstruction attacks leveraging explanation methods and proposes a randomized response defense to enhance privacy in graph ML explanations.
Findings
Gradient-based explanations reveal the most graph structure.
Privacy leakage increases with explanation utility.
Randomized response mechanism reduces attack success.
Abstract
Privacy and interpretability are two important ingredients for achieving trustworthy machine learning. We study the interplay of these two aspects in graph machine learning through graph reconstruction attacks. The goal of the adversary here is to reconstruct the graph structure of the training data given access to model explanations. Based on the different kinds of auxiliary information available to the adversary, we propose several graph reconstruction attacks. We show that additional knowledge of post-hoc feature explanations substantially increases the success rate of these attacks. Further, we investigate in detail the differences between attack performance with respect to three different classes of explanation methods for graph neural networks: gradient-based, perturbation-based, and surrogate model-based methods. While gradient-based explanations reveal the most in terms of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Graph Neural Networks · Privacy-Preserving Technologies in Data
