Stealing Training Graphs from Graph Neural Networks
Minhua Lin, Enyan Dai, Junjie Xu, Jinyuan Jia, Xiang Zhang, Suhang Wang

TL;DR
This paper demonstrates that trained Graph Neural Networks (GNNs) can leak sensitive training graph data, and introduces a novel method to steal such graphs using a diffusion model and parameter-based selection, highlighting privacy risks.
Contribution
The paper presents the first comprehensive framework for stealing training graphs from trained GNNs, combining diffusion-based graph generation with parameter-driven selection techniques.
Findings
Effective graph stealing demonstrated on real-world datasets.
High-quality training graphs can be reconstructed from GNNs.
Training graph leakage poses significant privacy risks.
Abstract
Graph Neural Networks (GNNs) have shown promising results in modeling graphs in various tasks. The training of GNNs, especially on specialized tasks such as bioinformatics, demands extensive expert annotations, which are expensive and usually contain sensitive information of data providers. The trained GNN models are often shared for deployment in the real world. As neural networks can memorize the training samples, the model parameters of GNNs have a high risk of leaking private training data. Our theoretical analysis shows the strong connections between trained GNN parameters and the training graphs used, confirming the training graph leakage issue. However, explorations into training data leakage from trained GNNs are rather limited. Therefore, we investigate a novel problem of stealing graphs from trained GNNs. To obtain high-quality graphs that resemble the target training set, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks
MethodsDiffusion
