E-SAGE: Explainability-based Defense Against Backdoor Attacks on Graph   Neural Networks

Dingqiang Yuan; Xiaohua Xu; Lei Yu; Tongchang Han; Rongchang Li; Meng; Han

arXiv:2406.10655·cs.CR·June 18, 2024

E-SAGE: Explainability-based Defense Against Backdoor Attacks on Graph Neural Networks

Dingqiang Yuan, Xiaohua Xu, Lei Yu, Tongchang Han, Rongchang Li, Meng, Han

PDF

Open Access 1 Repo

TL;DR

E-SAGE is a novel explainability-based defense method that detects and mitigates backdoor attacks on Graph Neural Networks by iteratively pruning edges based on their importance scores, showing strong effectiveness in experiments.

Contribution

This paper introduces E-SAGE, the first explainability-driven approach to defend GNNs against backdoor attacks through adaptive edge pruning.

Findings

01

E-SAGE effectively detects and defends against state-of-the-art backdoor attacks.

02

E-SAGE outperforms existing defenses in various attack scenarios.

03

E-SAGE also shows robustness against adversarial attacks.

Abstract

Graph Neural Networks (GNNs) have recently been widely adopted in multiple domains. Yet, they are notably vulnerable to adversarial and backdoor attacks. In particular, backdoor attacks based on subgraph insertion have been shown to be effective in graph classification tasks while being stealthy, successfully circumventing various existing defense methods. In this paper, we propose E-SAGE, a novel approach to defending GNN backdoor attacks based on explainability. We find that the malicious edges and benign edges have significant differences in the importance scores for explainability evaluation. Accordingly, E-SAGE adaptively applies an iterative edge pruning process on the graph based on the edge scores. Through extensive experiments, we demonstrate the effectiveness of E-SAGE against state-of-the-art graph backdoor attacks in different attack settings. In addition, we investigate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vanadisArya/E-SAGE
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)