Jointly Attacking Graph Neural Network and its Explanations
Wenqi Fan, Wei Jin, Xiaorui Liu, Han Xu, Xianfeng Tang, Suhang Wang,, Qing Li, Jiliang Tang, Jianping Wang, Charu Aggarwal

TL;DR
This paper introduces GEAttack, a novel adversarial attack framework that simultaneously targets GNN models and their explanation methods, revealing vulnerabilities and raising concerns about the robustness of explainability tools.
Contribution
The paper presents the first attack framework that jointly compromises GNNs and their explanations, validated through extensive experiments on real-world datasets.
Findings
GNNExplainer can detect adversarial perturbations effectively.
GEAttack successfully attacks both GNNs and their explanations.
The method demonstrates robustness vulnerabilities in current explainability techniques.
Abstract
Graph Neural Networks (GNNs) have boosted the performance for many graph-related tasks. Despite the great success, recent studies have shown that GNNs are highly vulnerable to adversarial attacks, where adversaries can mislead the GNNs' prediction by modifying graphs. On the other hand, the explanation of GNNs (GNNExplainer) provides a better understanding of a trained GNN model by generating a small subgraph and features that are most influential for its prediction. In this paper, we first perform empirical studies to validate that GNNExplainer can act as an inspection tool and have the potential to detect the adversarial perturbations for graphs. This finding motivates us to further initiate a new problem investigation: Whether a graph neural network and its explanations can be jointly attacked by modifying graphs with malicious desires? It is challenging to answer this question since…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks
MethodsGraph Neural Network
