Copula Quadrant Similarity for Anomaly Scores
Matthew Davidow, David Matteson

TL;DR
This paper introduces a novel similarity measure for anomaly detection scores based on extremal similarity, enabling effective clustering of algorithms for robust ensemble detection.
Contribution
It proposes a new copula-based quadrant similarity measure focusing on extremal scores, facilitating comparison and clustering of diverse anomaly detection methods.
Findings
The method accurately clusters anomaly detection algorithms.
It improves ensemble robustness by selecting similar methods.
The approach outperforms existing dependence measures.
Abstract
Practical anomaly detection requires applying numerous approaches due to the inherent difficulty of unsupervised learning. Direct comparison between complex or opaque anomaly detection algorithms is intractable; we instead propose a framework for associating the scores of multiple methods. Our aim is to answer the question: how should one measure the similarity between anomaly scores generated by different methods? The scoring crux is the extremes, which identify the most anomalous observations. A pair of algorithms are defined here to be similar if they assign their highest scores to roughly the same small fraction of observations. To formalize this, we propose a measure based on extremal similarity in scoring distributions through a novel upper quadrant modeling approach, and contrast it with tail and other dependence measures. We illustrate our method with simulated and real…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Complex Systems and Time Series Analysis · Network Security and Intrusion Detection
