XGBD: Explanation-Guided Graph Backdoor Detection
Zihan Guan, Mengnan Du, Ninghao Liu

TL;DR
This paper introduces XGBD, a novel explanation-guided method for detecting backdoor attacks in graph learning models by leveraging topological features and attribution analysis.
Contribution
It proposes a new backdoor detection approach for graph data that uses explanation methods to identify discriminative subgraphs, improving detection effectiveness.
Findings
Effective detection across multiple datasets and attack methods
Utilizes explanation techniques to distinguish backdoor from clean samples
Enhances detection accuracy by incorporating topological information
Abstract
Backdoor attacks pose a significant security risk to graph learning models. Backdoors can be embedded into the target model by inserting backdoor triggers into the training dataset, causing the model to make incorrect predictions when the trigger is present. To counter backdoor attacks, backdoor detection has been proposed. An emerging detection strategy in the vision and NLP domains is based on an intriguing phenomenon: when training models on a mixture of backdoor and clean samples, the loss on backdoor samples drops significantly faster than on clean samples, allowing backdoor samples to be easily detected by selecting samples with the lowest loss values. However, the ignorance of topological feature information on graph data limits its detection effectiveness when applied directly to the graph domain. To this end, we propose an explanation-guided backdoor detection method to take…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Explainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications
