Scalable Anomaly Ranking of Attributed Neighborhoods
Bryan Perozzi, Leman Akoglu

TL;DR
This paper introduces 'normality', a new measure combining structure and attributes to identify anomalous neighborhoods in large attributed graphs, outperforming existing methods.
Contribution
It proposes a novel quality measure for attributed neighborhoods that accounts for boundary edges and enables efficient optimization for anomaly detection.
Findings
Normality outperforms conductance, density, OddBall, and SODA in anomaly detection.
The measure efficiently infers shared attribute subspaces.
Experiments demonstrate effectiveness on real-world graphs.
Abstract
Given a graph with node attributes, what neighborhoods are anomalous? To answer this question, one needs a quality score that utilizes both structure and attributes. Popular existing measures either quantify the structure only and ignore the attributes (e.g., conductance), or only consider the connectedness of the nodes inside the neighborhood and ignore the cross-edges at the boundary (e.g., density). In this work we propose normality, a new quality measure for attributed neighborhoods. Normality utilizes structure and attributes together to quantify both internal consistency and external separability. It exhibits two key advantages over other measures: (1) It allows many boundary-edges as long as they can be "exonerated"; i.e., either (i) are expected under a null model, and/or (ii) the boundary nodes do not exhibit the subset of attributes shared by the neighborhood members.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
