"No Matter What You Do": Purifying GNN Models via Backdoor Unlearning
Jiale Zhang, Chengcheng Zhu, Bosen Rao, Hao Sui, Xiaobing Sun, Bing, Chen, Chunyi Zhou, Shouling Ji

TL;DR
This paper introduces GCleaner, a novel method for defending GNNs against backdoor attacks by unlearning backdoor triggers through trigger recovery and knowledge distillation, significantly reducing attack success rates with minimal performance loss.
Contribution
GCleaner is the first backdoor mitigation approach for GNNs that recovers triggers and unlearns backdoor features, outperforming existing defenses in effectiveness and efficiency.
Findings
Reduces backdoor attack success rate to 10% with minimal performance loss.
Requires only 1% of clean data for effective mitigation.
Outperforms state-of-the-art backdoor defense methods.
Abstract
Recent studies have exposed that GNNs are vulnerable to several adversarial attacks, among which backdoor attack is one of the toughest. Similar to Deep Neural Networks (DNNs), backdoor attacks in GNNs lie in the fact that the attacker modifies a portion of graph data by embedding triggers and enforces the model to learn the trigger feature during the model training process. Despite the massive prior backdoor defense works on DNNs, defending against backdoor attacks in GNNs is largely unexplored, severely hindering the widespread application of GNNs in real-world tasks. To bridge this gap, we present GCleaner, the first backdoor mitigation method on GNNs. GCleaner can mitigate the presence of the backdoor logic within backdoored GNNs by reversing the backdoor learning procedure, aiming to restore the model performance to a level similar to that is directly trained on the original clean…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Graph Neural Networks · Anomaly Detection Techniques and Applications
MethodsKnowledge Distillation
