Identifying Backdoored Graphs in Graph Neural Network Training: An Explanation-Based Approach with Novel Metrics

Jane Downer, Ren Wang, and Binghui Wang

arXiv:2403.18136 · cs.LG · May 12, 2026 · 3 cites

TL;DR

This paper introduces an explanation-based method for detecting backdoor attacks on Graph Neural Networks. It derives seven new metrics from graph-level explanations and evaluates them on benchmark datasets, including against an adaptive attack.

Contribution

The paper presents a detection approach that derives seven metrics from the outputs of GNN explanation mechanisms and uses them to identify backdoored graphs.

Findings

01 High detection performance on multiple benchmark datasets

02 Effective against various attack models

03 Advances security in GNN applications

Abstract

Graph Neural Networks (GNNs) have gained popularity in numerous domains, yet they are vulnerable to backdoor attacks that can compromise their performance and ethical application. Detecting these attacks is crucial for maintaining the reliability and security of GNN classification tasks, but existing methods are often inflexible, relying on single metrics that fail to capture the full range of backdoor behaviors. To address this challenge, we devise a novel detection method that leverages graph-level explanations. By extracting and transforming secondary outputs from GNN explanation mechanisms, we develop seven metrics for effective detection of backdoor attacks on GNNs. Additionally, we develop an adaptive attack to rigorously evaluate our approach. We test our method on multiple benchmark datasets and examine its efficacy…
