Improving Generalizability of Graph Anomaly Detection Models via Data Augmentation
Shuang Zhou, Xiao Huang, Ninghao Liu, Fu-Lai Chung, Long-Kai Huang

TL;DR
This paper introduces AugAN, a data augmentation technique designed to improve the generalization of graph anomaly detection models across unseen graphs, addressing the challenge of limited labels and varying normal backgrounds.
Contribution
It proposes a novel data augmentation method, AugAN, to enhance the generalizability of semi-supervised graph anomaly detection models to unseen graphs.
Findings
AugAN significantly improves detection accuracy on unseen graphs.
Enhanced models show better robustness with limited labeled data.
Experiments confirm the effectiveness of data augmentation in GAD.
Abstract
Graph anomaly detection (GAD) is a vital task since even a few anomalies can pose huge threats to benign users. Recent semi-supervised GAD methods, which can effectively leverage the available labels as prior knowledge, have achieved superior performances than unsupervised methods. In practice, people usually need to identify anomalies on new (sub)graphs to secure their business, but they may lack labels to train an effective detection model. One natural idea is to directly adopt a trained GAD model to the new (sub)graph for testing. However, we find that existing semi-supervised GAD methods suffer from poor generalization issue, i.e., well-trained models could not perform well on an unseen area (i.e., not accessible in training) of the same graph. It may cause great troubles. In this paper, we base on the phenomenon and propose a general and novel research problem of generalized graph…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Advanced Graph Neural Networks · Software System Performance and Reliability
MethodsBalanced Selection
