TL;DR
This paper introduces GraphIP-Bench, a comprehensive benchmark for evaluating the difficulty of stealing GNNs and the effectiveness of defenses, revealing that GNN theft is generally easy and defenses often ineffective.
Contribution
It provides the first unified benchmark evaluating GNN extraction attacks and defenses across multiple datasets, models, and tasks using a standardized protocol.
Findings
Stealing GNNs is easy at medium query budgets.
Most defenses do not significantly hinder GNN theft.
Heterophilic graphs are harder to steal, and cross-architecture mismatches reduce extraction success.
Abstract
Graph neural networks (GNNs) deployed as cloud services can be \emph{stolen} through \emph{model-extraction attacks}, which train a surrogate from query responses to reproduce the target's behaviour, and a growing line of ownership defenses tries to prevent or trace such theft. The title of this paper asks two questions: \emph{how hard is it to steal a GNN?}, and \emph{can we stop it?} Prior work cannot answer either, because experiments use inconsistent datasets, threat models, and metrics. We introduce \emph{GraphIP-Bench}, a unified benchmark which evaluates both sides under a single black-box protocol. It integrates twelve extraction attacks, twelve defenses spanning watermarking, output-perturbation, and query-pattern-detection families, ten public graphs covering homophilic, heterophilic, and large-scale regimes, three GNN backbones, and three graph-learning tasks, and it reports…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
