"No Matter What You Do": Purifying GNN Models via Backdoor Unlearning

Jiale Zhang; Chengcheng Zhu; Bosen Rao; Hao Sui; Xiaobing Sun; Bing; Chen; Chunyi Zhou; Shouling Ji

arXiv:2410.01272·cs.CR·November 14, 2024

"No Matter What You Do": Purifying GNN Models via Backdoor Unlearning

Jiale Zhang, Chengcheng Zhu, Bosen Rao, Hao Sui, Xiaobing Sun, Bing, Chen, Chunyi Zhou, Shouling Ji

PDF

Open Access 1 Repo

TL;DR

This paper introduces GCleaner, a novel method for defending GNNs against backdoor attacks by unlearning backdoor triggers through trigger recovery and knowledge distillation, significantly reducing attack success rates with minimal performance loss.

Contribution

GCleaner is the first backdoor mitigation approach for GNNs that recovers triggers and unlearns backdoor features, outperforming existing defenses in effectiveness and efficiency.

Findings

01

Reduces backdoor attack success rate to 10% with minimal performance loss.

02

Requires only 1% of clean data for effective mitigation.

03

Outperforms state-of-the-art backdoor defense methods.

Abstract

Recent studies have exposed that GNNs are vulnerable to several adversarial attacks, among which backdoor attack is one of the toughest. Similar to Deep Neural Networks (DNNs), backdoor attacks in GNNs lie in the fact that the attacker modifies a portion of graph data by embedding triggers and enforces the model to learn the trigger feature during the model training process. Despite the massive prior backdoor defense works on DNNs, defending against backdoor attacks in GNNs is largely unexplored, severely hindering the widespread application of GNNs in real-world tasks. To bridge this gap, we present GCleaner, the first backdoor mitigation method on GNNs. GCleaner can mitigate the presence of the backdoor logic within backdoored GNNs by reversing the backdoor learning procedure, aiming to restore the model performance to a level similar to that is directly trained on the original clean…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

graph-axis/gcleaner
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Graph Neural Networks · Anomaly Detection Techniques and Applications

MethodsKnowledge Distillation