Unsupervised Graph Outlier Detection: Problem Revisit, New Insight, and Superior Method
Yihong Huang, Liping Wang, Fan Zhang, Xuemin Lin

TL;DR
This paper revisits the problem of unsupervised graph outlier detection, identifies issues with existing data injection methods, and introduces a robust variance-based framework that improves detection accuracy and balance.
Contribution
It uncovers data leakage problems in current datasets, proposes a novel variance-based model, and develops VGOD, a framework that enhances outlier detection performance and robustness.
Findings
VGOD outperforms existing methods on real-world datasets.
The variance-based model is more robust across injection settings.
VGOD achieves balanced detection of structural and contextual outliers.
Abstract
A large number of studies on Graph Outlier Detection (GOD) have emerged in recent years due to its wide applications, in which Unsupervised Node Outlier Detection (UNOD) on attributed networks is an important area. UNOD focuses on detecting two kinds of typical outliers in graphs: the structural outlier and the contextual outlier. Most existing works conduct experiments based on datasets with injected outliers. However, we find that the most widely-used outlier injection approach has a serious data leakage issue. By only utilizing such data leakage, a simple approach can achieve state-of-the-art performance in detecting outliers. In addition, we observe that existing algorithms have a performance drop with the mitigated data leakage issue. The other major issue is on balanced detection performance between the two types of outliers, which has not been considered by existing studies. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Network Security and Intrusion Detection · Imbalanced Data Classification Techniques
