How to Use Graph Data in the Wild to Help Graph Anomaly Detection?

Yuxuan Cao; Jiarong Xu; Chen Zhao; Jiaan Wang; Carl Yang; Chunping Wang; Yang Yang

arXiv:2506.04190·cs.LG·June 5, 2025

How to Use Graph Data in the Wild to Help Graph Anomaly Detection?

Yuxuan Cao, Jiarong Xu, Chen Zhao, Jiaan Wang, Carl Yang, Chunping Wang, Yang Yang

PDF

TL;DR

This paper introduces Wild-GAD, a framework leveraging diverse external graph data to improve anomaly detection in graph-structured data, addressing challenges like label scarcity and anomaly variability.

Contribution

It proposes a novel approach using external graph data and a unified database to enhance anomaly detection accuracy in various domains.

Findings

01

18% average AUCROC improvement over baselines

02

32% average AUCPR improvement over baselines

03

Effective selection of external data based on representativity and diversity

Abstract

In recent years, graph anomaly detection has found extensive applications in various domains such as social, financial, and communication networks. However, anomalies in graph-structured data present unique challenges, including label scarcity, ill-defined anomalies, and varying anomaly types, making supervised or semi-supervised methods unreliable. Researchers often adopt unsupervised approaches to address these challenges, assuming that anomalies deviate significantly from the normal data distribution. Yet, when the available data is insufficient, capturing the normal distribution accurately and comprehensively becomes difficult. To overcome this limitation, we propose to utilize external graph data (i.e., graph data in the wild) to help anomaly detection tasks. This naturally raises the question: How can we use external data to help graph anomaly detection tasks? To answer this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.