CADDeLaG: Framework for distributed anomaly detection in large dense graph sequences
Aniruddha Basak, Kamalika Das, Ole J. Mengshoel

TL;DR
CADDeLaG is a scalable framework that enables efficient anomaly detection in large dense graphs using distributed computing, overcoming memory and computational limitations of traditional spectral methods.
Contribution
It introduces a decomposable algorithm and a distributed solver for commute-time distance, tailored for dense graphs in a scalable Apache Spark environment.
Findings
Successfully detects anomalies in climate and election data graphs.
Demonstrates scalability on synthetic and real datasets.
Outperforms sparsification-based methods in dense graph analysis.
Abstract
Random walk based distance measures for graphs such as commute-time distance are useful in a variety of graph algorithms, such as clustering, anomaly detection, and creating low dimensional embeddings. Since such measures hinge on the spectral decomposition of the graph, the computation becomes a bottleneck for large graphs and do not scale easily to graphs that cannot be loaded in memory. Most existing graph mining libraries for large graphs either resort to sampling or exploit the sparsity structure of such graphs for spectral analysis. However, such methods do not work for dense graphs constructed for studying pairwise relationships among entities in a data set. Examples of such studies include analyzing pairwise locations in gridded climate data for discovering long distance climate phenomena. These graphs representations are fully connected by construction and cannot be sparsified…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Complex Network Analysis Techniques · Anomaly Detection Techniques and Applications
